US20230357837A1 - Diagnostic use of cell free dna chromatin immunoprecipitation - Google Patents
Diagnostic use of cell free dna chromatin immunoprecipitation Download PDFInfo
- Publication number
- US20230357837A1 US20230357837A1 US18/217,575 US202318217575A US2023357837A1 US 20230357837 A1 US20230357837 A1 US 20230357837A1 US 202318217575 A US202318217575 A US 202318217575A US 2023357837 A1 US2023357837 A1 US 2023357837A1
- Authority
- US
- United States
- Prior art keywords
- dna
- cfdna
- leukocytes
- cell
- tissue
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 210000004027 cell Anatomy 0.000 title description 280
- 238000002487 chromatin immunoprecipitation Methods 0.000 title description 31
- 238000000034 method Methods 0.000 claims abstract description 152
- 230000001413 cellular effect Effects 0.000 claims abstract description 126
- 230000034994 death Effects 0.000 claims abstract description 18
- 108090000623 proteins and genes Proteins 0.000 claims description 326
- 108020004414 DNA Proteins 0.000 claims description 238
- 102000004169 proteins and genes Human genes 0.000 claims description 196
- 201000010099 disease Diseases 0.000 claims description 99
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 99
- 238000012163 sequencing technique Methods 0.000 claims description 97
- 239000003153 chemical reaction reagent Substances 0.000 claims description 93
- 206010028980 Neoplasm Diseases 0.000 claims description 76
- 108010051779 histone H3 trimethyl Lys4 Proteins 0.000 claims description 52
- 201000011510 cancer Diseases 0.000 claims description 50
- 108010033040 Histones Proteins 0.000 claims description 48
- 230000002103 transcriptional effect Effects 0.000 claims description 40
- 230000035772 mutation Effects 0.000 claims description 39
- 238000013518 transcription Methods 0.000 claims description 38
- 230000035897 transcription Effects 0.000 claims description 38
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 35
- 239000012634 fragment Substances 0.000 claims description 33
- 210000004369 blood Anatomy 0.000 claims description 31
- 239000008280 blood Substances 0.000 claims description 31
- 239000011324 bead Substances 0.000 claims description 29
- 210000001124 body fluid Anatomy 0.000 claims description 25
- 238000004458 analytical method Methods 0.000 claims description 20
- 230000006378 damage Effects 0.000 claims description 17
- 238000011282 treatment Methods 0.000 claims description 16
- 108700028146 Genetic Enhancer Elements Proteins 0.000 claims description 12
- 230000004054 inflammatory process Effects 0.000 claims description 11
- 206010061218 Inflammation Diseases 0.000 claims description 10
- 206010021143 Hypoxia Diseases 0.000 claims description 9
- 206010067125 Liver injury Diseases 0.000 claims description 9
- 230000007954 hypoxia Effects 0.000 claims description 9
- 230000004913 activation Effects 0.000 claims description 8
- 208000023275 Autoimmune disease Diseases 0.000 claims description 7
- 208000031729 Bacteremia Diseases 0.000 claims description 7
- 208000019423 liver disease Diseases 0.000 claims description 7
- 238000007481 next generation sequencing Methods 0.000 claims description 7
- 208000015181 infectious disease Diseases 0.000 claims description 6
- 239000003550 marker Substances 0.000 claims description 6
- 230000000451 tissue damage Effects 0.000 claims description 6
- 231100000827 tissue damage Toxicity 0.000 claims description 6
- 208000020446 Cardiac disease Diseases 0.000 claims description 5
- 208000019622 heart disease Diseases 0.000 claims description 5
- 238000012165 high-throughput sequencing Methods 0.000 claims description 5
- 230000010468 interferon response Effects 0.000 claims description 5
- 230000003211 malignant effect Effects 0.000 claims description 5
- 230000017074 necrotic cell death Effects 0.000 claims description 5
- 230000003185 calcium uptake Effects 0.000 claims description 4
- 230000001351 cycling effect Effects 0.000 claims description 4
- 230000004769 mitochondrial stress Effects 0.000 claims description 4
- 230000004770 neurodegeneration Effects 0.000 claims description 4
- 208000015122 neurodegenerative disease Diseases 0.000 claims description 4
- 230000009758 senescence Effects 0.000 claims description 4
- 230000035882 stress Effects 0.000 claims description 4
- 208000030453 Drug-Related Side Effects and Adverse reaction Diseases 0.000 claims description 3
- 206010003246 arthritis Diseases 0.000 claims description 3
- 230000008993 bowel inflammation Effects 0.000 claims description 3
- 208000029028 brain injury Diseases 0.000 claims description 3
- 206010012601 diabetes mellitus Diseases 0.000 claims description 3
- 208000006454 hepatitis Diseases 0.000 claims description 3
- 208000018191 liver inflammation Diseases 0.000 claims description 3
- 208000014644 Brain disease Diseases 0.000 claims description 2
- 208000018522 Gastrointestinal disease Diseases 0.000 claims description 2
- 208000010643 digestive system disease Diseases 0.000 claims description 2
- 208000018685 gastrointestinal system disease Diseases 0.000 claims description 2
- 230000003584 silencer Effects 0.000 claims description 2
- 238000004590 computer program Methods 0.000 abstract description 12
- 210000000265 leukocyte Anatomy 0.000 description 299
- 210000004185 liver Anatomy 0.000 description 203
- 210000001519 tissue Anatomy 0.000 description 193
- -1 repressors Substances 0.000 description 174
- 210000004698 lymphocyte Anatomy 0.000 description 133
- 210000004556 brain Anatomy 0.000 description 121
- 210000001744 T-lymphocyte Anatomy 0.000 description 103
- 239000000523 sample Substances 0.000 description 94
- 101100495925 Schizosaccharomyces pombe (strain 972 / ATCC 24843) chr3 gene Proteins 0.000 description 74
- 210000003205 muscle Anatomy 0.000 description 67
- 210000000496 pancreas Anatomy 0.000 description 53
- 238000001514 detection method Methods 0.000 description 44
- 210000002826 placenta Anatomy 0.000 description 44
- 230000027455 binding Effects 0.000 description 40
- 230000001079 digestive effect Effects 0.000 description 39
- 230000002496 gastric effect Effects 0.000 description 35
- 210000002381 plasma Anatomy 0.000 description 32
- 210000000440 neutrophil Anatomy 0.000 description 30
- 108700009124 Transcription Initiation Site Proteins 0.000 description 28
- 238000003556 assay Methods 0.000 description 27
- 230000014509 gene expression Effects 0.000 description 26
- 210000002443 helper t lymphocyte Anatomy 0.000 description 26
- 230000030833 cell death Effects 0.000 description 24
- 210000001541 thymus gland Anatomy 0.000 description 24
- 238000003860 storage Methods 0.000 description 23
- 108010077544 Chromatin Proteins 0.000 description 22
- 210000003483 chromatin Anatomy 0.000 description 22
- 102000040945 Transcription factor Human genes 0.000 description 19
- 108091023040 Transcription factor Proteins 0.000 description 19
- 210000004877 mucosa Anatomy 0.000 description 18
- 238000012360 testing method Methods 0.000 description 18
- 230000003321 amplification Effects 0.000 description 17
- 238000003199 nucleic acid amplification method Methods 0.000 description 17
- 230000037361 pathway Effects 0.000 description 17
- 239000007787 solid Substances 0.000 description 17
- 239000003795 chemical substances by application Substances 0.000 description 16
- 210000001616 monocyte Anatomy 0.000 description 16
- 210000002784 stomach Anatomy 0.000 description 15
- 206010000891 acute myocardial infarction Diseases 0.000 description 14
- 238000007069 methylation reaction Methods 0.000 description 14
- 102000039446 nucleic acids Human genes 0.000 description 13
- 108020004707 nucleic acids Proteins 0.000 description 13
- 150000007523 nucleic acids Chemical class 0.000 description 13
- 239000003623 enhancer Substances 0.000 description 12
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 238000012544 monitoring process Methods 0.000 description 12
- 230000008569 process Effects 0.000 description 12
- 230000002068 genetic effect Effects 0.000 description 11
- 230000000670 limiting effect Effects 0.000 description 11
- 238000013146 percutaneous coronary intervention Methods 0.000 description 11
- 239000012212 insulator Substances 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- 238000012545 processing Methods 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- 238000001353 Chip-sequencing Methods 0.000 description 9
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 description 9
- 101000889276 Homo sapiens Cytotoxic T-lymphocyte protein 4 Proteins 0.000 description 9
- 101000971513 Homo sapiens Natural killer cells antigen CD94 Proteins 0.000 description 9
- 102100021462 Natural killer cells antigen CD94 Human genes 0.000 description 9
- 238000009826 distribution Methods 0.000 description 9
- 210000003734 kidney Anatomy 0.000 description 9
- 230000011987 methylation Effects 0.000 description 9
- 230000004640 cellular pathway Effects 0.000 description 8
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 description 8
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 description 8
- 239000012530 fluid Substances 0.000 description 8
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 description 8
- 108010014064 CCCTC-Binding Factor Proteins 0.000 description 7
- 102100024739 E3 ubiquitin-protein ligase UHRF1 Human genes 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 7
- 108090000790 Enzymes Proteins 0.000 description 7
- 101000760417 Homo sapiens E3 ubiquitin-protein ligase UHRF1 Proteins 0.000 description 7
- 108010047956 Nucleosomes Proteins 0.000 description 7
- 108700020796 Oncogene Proteins 0.000 description 7
- 208000006994 Precancerous Conditions Diseases 0.000 description 7
- 102100027671 Transcriptional repressor CTCF Human genes 0.000 description 7
- 239000012190 activator Substances 0.000 description 7
- 239000000427 antigen Substances 0.000 description 7
- 108091007433 antigens Proteins 0.000 description 7
- 102000036639 antigens Human genes 0.000 description 7
- 239000011230 binding agent Substances 0.000 description 7
- 210000000349 chromosome Anatomy 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 210000003743 erythrocyte Anatomy 0.000 description 7
- 210000003754 fetus Anatomy 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 238000010197 meta-analysis Methods 0.000 description 7
- 238000010606 normalization Methods 0.000 description 7
- 210000001623 nucleosome Anatomy 0.000 description 7
- 230000005298 paramagnetic effect Effects 0.000 description 7
- 102100036848 C-C motif chemokine 20 Human genes 0.000 description 6
- 102100026094 C-type lectin domain family 12 member A Human genes 0.000 description 6
- 102100029380 CMRF35-like molecule 2 Human genes 0.000 description 6
- 102100020736 Chromosome-associated kinesin KIF4A Human genes 0.000 description 6
- 230000007067 DNA methylation Effects 0.000 description 6
- 101000713099 Homo sapiens C-C motif chemokine 20 Proteins 0.000 description 6
- 101000912622 Homo sapiens C-type lectin domain family 12 member A Proteins 0.000 description 6
- 101000990046 Homo sapiens CMRF35-like molecule 2 Proteins 0.000 description 6
- 101001139157 Homo sapiens Chromosome-associated kinesin KIF4A Proteins 0.000 description 6
- 101000711455 Homo sapiens Kinetochore protein Spc25 Proteins 0.000 description 6
- 101000585555 Homo sapiens PCNA-associated factor Proteins 0.000 description 6
- 101000945496 Homo sapiens Proliferation marker protein Ki-67 Proteins 0.000 description 6
- 101001074602 Homo sapiens Protein PIMREG Proteins 0.000 description 6
- 101000575639 Homo sapiens Ribonucleoside-diphosphate reductase subunit M2 Proteins 0.000 description 6
- 101000875498 Homo sapiens Serine protease FAM111B Proteins 0.000 description 6
- 101000807354 Homo sapiens Ubiquitin-conjugating enzyme E2 C Proteins 0.000 description 6
- 102100029879 PCNA-associated factor Human genes 0.000 description 6
- 102100034836 Proliferation marker protein Ki-67 Human genes 0.000 description 6
- 102100036258 Protein PIMREG Human genes 0.000 description 6
- 238000003559 RNA-seq method Methods 0.000 description 6
- 102100035992 Serine protease FAM111B Human genes 0.000 description 6
- 102100023776 Signal peptidase complex subunit 2 Human genes 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 102100037256 Ubiquitin-conjugating enzyme E2 C Human genes 0.000 description 6
- 230000003247 decreasing effect Effects 0.000 description 6
- 238000001114 immunoprecipitation Methods 0.000 description 6
- 230000005291 magnetic effect Effects 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 102000004196 processed proteins & peptides Human genes 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 230000004044 response Effects 0.000 description 6
- 102100025248 C-X-C motif chemokine 10 Human genes 0.000 description 5
- 101100279436 Caenorhabditis elegans egg-2 gene Proteins 0.000 description 5
- 102000011682 Centromere Protein A Human genes 0.000 description 5
- 108010076303 Centromere Protein A Proteins 0.000 description 5
- 102100032857 Cyclin-dependent kinase 1 Human genes 0.000 description 5
- 101710106279 Cyclin-dependent kinase 1 Proteins 0.000 description 5
- 102100035185 DNA excision repair protein ERCC-6-like Human genes 0.000 description 5
- 102100030960 DNA replication licensing factor MCM2 Human genes 0.000 description 5
- 102100028773 Endonuclease 8-like 3 Human genes 0.000 description 5
- 101000858088 Homo sapiens C-X-C motif chemokine 10 Proteins 0.000 description 5
- 101000876524 Homo sapiens DNA excision repair protein ERCC-6-like Proteins 0.000 description 5
- 101000583807 Homo sapiens DNA replication licensing factor MCM2 Proteins 0.000 description 5
- 101001018431 Homo sapiens DNA replication licensing factor MCM7 Proteins 0.000 description 5
- 101001123819 Homo sapiens Endonuclease 8-like 3 Proteins 0.000 description 5
- 101000619112 Homo sapiens Proline-rich protein 11 Proteins 0.000 description 5
- 101000583797 Homo sapiens Protein MCM10 homolog Proteins 0.000 description 5
- 102100022566 Proline-rich protein 11 Human genes 0.000 description 5
- 102100030962 Protein MCM10 homolog Human genes 0.000 description 5
- 102100026006 Ribonucleoside-diphosphate reductase subunit M2 Human genes 0.000 description 5
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 5
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 5
- 210000002593 Y chromosome Anatomy 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 230000021615 conjugation Effects 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 238000003745 diagnosis Methods 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 230000034659 glycolysis Effects 0.000 description 5
- 210000002064 heart cell Anatomy 0.000 description 5
- 210000005229 liver cell Anatomy 0.000 description 5
- 208000010125 myocardial infarction Diseases 0.000 description 5
- 210000003924 normoblast Anatomy 0.000 description 5
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 5
- 230000009870 specific binding Effects 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- KIAPWMKFHIKQOZ-UHFFFAOYSA-N 2-[[(4-fluorophenyl)-oxomethyl]amino]benzoic acid methyl ester Chemical compound COC(=O)C1=CC=CC=C1NC(=O)C1=CC=C(F)C=C1 KIAPWMKFHIKQOZ-UHFFFAOYSA-N 0.000 description 4
- 229920000936 Agarose Polymers 0.000 description 4
- 102100036817 Ankyrin-3 Human genes 0.000 description 4
- 102100028117 Annexin A10 Human genes 0.000 description 4
- 102100026032 BRI3-binding protein Human genes 0.000 description 4
- 102100032367 C-C motif chemokine 5 Human genes 0.000 description 4
- 102100025279 C-X-C motif chemokine 11 Human genes 0.000 description 4
- 102100036170 C-X-C motif chemokine 9 Human genes 0.000 description 4
- 102100026197 C-type lectin domain family 2 member D Human genes 0.000 description 4
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 4
- 102100023344 Centromere protein F Human genes 0.000 description 4
- 102100023343 Centromere protein I Human genes 0.000 description 4
- 102100040835 Claudin-18 Human genes 0.000 description 4
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 4
- 206010052360 Colorectal adenocarcinoma Diseases 0.000 description 4
- 238000007400 DNA extraction Methods 0.000 description 4
- 102100021389 DNA replication licensing factor MCM4 Human genes 0.000 description 4
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- 102100029075 Exonuclease 1 Human genes 0.000 description 4
- 102100040133 Free fatty acid receptor 2 Human genes 0.000 description 4
- 102100036255 Glucose-6-phosphatase 2 Human genes 0.000 description 4
- 102000006947 Histones Human genes 0.000 description 4
- 101000928342 Homo sapiens Ankyrin-3 Proteins 0.000 description 4
- 101000768069 Homo sapiens Annexin A10 Proteins 0.000 description 4
- 101000933488 Homo sapiens BRI3-binding protein Proteins 0.000 description 4
- 101000797762 Homo sapiens C-C motif chemokine 5 Proteins 0.000 description 4
- 101000858060 Homo sapiens C-X-C motif chemokine 11 Proteins 0.000 description 4
- 101000947172 Homo sapiens C-X-C motif chemokine 9 Proteins 0.000 description 4
- 101000912615 Homo sapiens C-type lectin domain family 2 member D Proteins 0.000 description 4
- 101000914324 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 5 Proteins 0.000 description 4
- 101000907941 Homo sapiens Centromere protein F Proteins 0.000 description 4
- 101000907944 Homo sapiens Centromere protein I Proteins 0.000 description 4
- 101000749329 Homo sapiens Claudin-18 Proteins 0.000 description 4
- 101000615280 Homo sapiens DNA replication licensing factor MCM4 Proteins 0.000 description 4
- 101000918264 Homo sapiens Exonuclease 1 Proteins 0.000 description 4
- 101000890668 Homo sapiens Free fatty acid receptor 2 Proteins 0.000 description 4
- 101000930907 Homo sapiens Glucose-6-phosphatase 2 Proteins 0.000 description 4
- 101001008953 Homo sapiens Kinesin-like protein KIF11 Proteins 0.000 description 4
- 101001091232 Homo sapiens Kinesin-like protein KIF18B Proteins 0.000 description 4
- 101000598335 Homo sapiens Metalloprotease TIKI1 Proteins 0.000 description 4
- 101001018147 Homo sapiens Mitogen-activated protein kinase kinase kinase 4 Proteins 0.000 description 4
- 101000957259 Homo sapiens Mitotic spindle assembly checkpoint protein MAD2A Proteins 0.000 description 4
- 101000991410 Homo sapiens Nucleolar and spindle-associated protein 1 Proteins 0.000 description 4
- 101000731015 Homo sapiens Peptidoglycan recognition protein 1 Proteins 0.000 description 4
- 101001074631 Homo sapiens Phosphatidylinositol N-acetylglucosaminyltransferase subunit Y Proteins 0.000 description 4
- 101001130243 Homo sapiens RAD51-associated protein 1 Proteins 0.000 description 4
- 101000707664 Homo sapiens Rho GTPase-activating protein 11A Proteins 0.000 description 4
- 101000633677 Homo sapiens Spindle and kinetochore-associated protein 3 Proteins 0.000 description 4
- 101000716149 Homo sapiens T-cell surface glycoprotein CD1b Proteins 0.000 description 4
- 101000663002 Homo sapiens TNFAIP3-interacting protein 3 Proteins 0.000 description 4
- 101000735431 Homo sapiens Terminal nucleotidyltransferase 4A Proteins 0.000 description 4
- 101000866298 Homo sapiens Transcription factor E2F8 Proteins 0.000 description 4
- 101000662056 Homo sapiens Ubiquitin D Proteins 0.000 description 4
- 108060003951 Immunoglobulin Proteins 0.000 description 4
- 102100030704 Interleukin-21 Human genes 0.000 description 4
- 108010041872 Islet Amyloid Polypeptide Proteins 0.000 description 4
- 102000036770 Islet Amyloid Polypeptide Human genes 0.000 description 4
- 102100027629 Kinesin-like protein KIF11 Human genes 0.000 description 4
- 102100034896 Kinesin-like protein KIF18B Human genes 0.000 description 4
- 102100036941 Metalloprotease TIKI1 Human genes 0.000 description 4
- 102100033060 Mitogen-activated protein kinase kinase kinase 4 Human genes 0.000 description 4
- 102100038792 Mitotic spindle assembly checkpoint protein MAD2A Human genes 0.000 description 4
- 102100030991 Nucleolar and spindle-associated protein 1 Human genes 0.000 description 4
- 102000043276 Oncogene Human genes 0.000 description 4
- 102100032341 PCNA-interacting partner Human genes 0.000 description 4
- 101710196737 PCNA-interacting partner Proteins 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- 102100032393 Peptidoglycan recognition protein 1 Human genes 0.000 description 4
- 102100036331 Phosphatidylinositol N-acetylglucosaminyltransferase subunit Y Human genes 0.000 description 4
- 102100031535 RAD51-associated protein 1 Human genes 0.000 description 4
- 102100031354 Rho GTPase-activating protein 11A Human genes 0.000 description 4
- 108091006556 SLC30A8 Proteins 0.000 description 4
- 102100029220 Spindle and kinetochore-associated protein 3 Human genes 0.000 description 4
- 102100035982 T-cell surface glycoprotein CD1b Human genes 0.000 description 4
- 102100037666 TNFAIP3-interacting protein 3 Human genes 0.000 description 4
- 102100034939 Terminal nucleotidyltransferase 4A Human genes 0.000 description 4
- 102100034838 Thymidine kinase, cytosolic Human genes 0.000 description 4
- 102100031555 Transcription factor E2F8 Human genes 0.000 description 4
- 102000004903 Troponin Human genes 0.000 description 4
- 108090001027 Troponin Proteins 0.000 description 4
- 102100037932 Ubiquitin D Human genes 0.000 description 4
- PLOPBXQQPZYQFA-AXPWDRQUSA-N amlintide Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H]1NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CSSC1)[C@@H](C)O)C(C)C)C1=CC=CC=C1 PLOPBXQQPZYQFA-AXPWDRQUSA-N 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 4
- 210000004413 cardiac myocyte Anatomy 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 230000001605 fetal effect Effects 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 210000001035 gastrointestinal tract Anatomy 0.000 description 4
- 231100000234 hepatic damage Toxicity 0.000 description 4
- 102000018358 immunoglobulin Human genes 0.000 description 4
- 108010074108 interleukin-21 Proteins 0.000 description 4
- 230000008818 liver damage Effects 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 230000007170 pathology Effects 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 150000003384 small molecules Chemical class 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 102100023621 4-hydroxyphenylpyruvate dioxygenase-like protein Human genes 0.000 description 3
- 101150075418 ARHGAP15 gene Proteins 0.000 description 3
- 102100022117 Abnormal spindle-like microcephaly-associated protein Human genes 0.000 description 3
- 102100027211 Albumin Human genes 0.000 description 3
- 102100033393 Anillin Human genes 0.000 description 3
- 102100032306 Aurora kinase B Human genes 0.000 description 3
- 102100029648 Beta-arrestin-2 Human genes 0.000 description 3
- 102100036305 C-C chemokine receptor type 8 Human genes 0.000 description 3
- 102100025473 Carcinoembryonic antigen-related cell adhesion molecule 6 Human genes 0.000 description 3
- 102100027047 Cell division control protein 6 homolog Human genes 0.000 description 3
- 102100024485 Cell division cycle-associated protein 7 Human genes 0.000 description 3
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 3
- 102100023444 Centromere protein K Human genes 0.000 description 3
- 102100031219 Centrosomal protein of 55 kDa Human genes 0.000 description 3
- 101710092479 Centrosomal protein of 55 kDa Proteins 0.000 description 3
- 102100036217 Collagen alpha-1(X) chain Human genes 0.000 description 3
- 206010009944 Colon cancer Diseases 0.000 description 3
- 102100025191 Cyclin-A2 Human genes 0.000 description 3
- 102100038254 Cyclin-F Human genes 0.000 description 3
- 102100038387 Cystatin-SN Human genes 0.000 description 3
- 102100028183 Cytohesin-interacting protein Human genes 0.000 description 3
- 102100029910 DNA polymerase epsilon subunit 2 Human genes 0.000 description 3
- 102100025450 DNA replication factor Cdt1 Human genes 0.000 description 3
- 101710096438 DNA-binding protein Proteins 0.000 description 3
- 101001046554 Dictyostelium discoideum Thymidine kinase 1 Proteins 0.000 description 3
- 206010061818 Disease progression Diseases 0.000 description 3
- 101100310856 Drosophila melanogaster spri gene Proteins 0.000 description 3
- 102100025734 Dual specificity protein phosphatase CDC14A Human genes 0.000 description 3
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 3
- 102100021860 Endothelial cell-specific molecule 1 Human genes 0.000 description 3
- 102100031968 Ephrin type-B receptor 2 Human genes 0.000 description 3
- 102100039623 Epithelial splicing regulatory protein 1 Human genes 0.000 description 3
- 239000004593 Epoxy Substances 0.000 description 3
- 101150026630 FOXG1 gene Proteins 0.000 description 3
- 102100040683 Fermitin family homolog 1 Human genes 0.000 description 3
- 102100027634 Fibronectin type 3 and ankyrin repeat domains protein 1 Human genes 0.000 description 3
- 102100020871 Forkhead box protein G1 Human genes 0.000 description 3
- 102100032340 G2/mitotic-specific cyclin-B1 Human genes 0.000 description 3
- 102100024412 GTPase IMAP family member 4 Human genes 0.000 description 3
- 108010054147 Hemoglobins Proteins 0.000 description 3
- 102000001554 Hemoglobins Human genes 0.000 description 3
- 102100034523 Histone H4 Human genes 0.000 description 3
- 101001048445 Homo sapiens 4-hydroxyphenylpyruvate dioxygenase-like protein Proteins 0.000 description 3
- 101000900939 Homo sapiens Abnormal spindle-like microcephaly-associated protein Proteins 0.000 description 3
- 101000732632 Homo sapiens Anillin Proteins 0.000 description 3
- 101000798306 Homo sapiens Aurora kinase B Proteins 0.000 description 3
- 101000716063 Homo sapiens C-C chemokine receptor type 8 Proteins 0.000 description 3
- 101000914326 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 6 Proteins 0.000 description 3
- 101000914465 Homo sapiens Cell division control protein 6 homolog Proteins 0.000 description 3
- 101000980893 Homo sapiens Cell division cycle-associated protein 7 Proteins 0.000 description 3
- 101000907931 Homo sapiens Centromere protein K Proteins 0.000 description 3
- 101000875027 Homo sapiens Collagen alpha-1(X) chain Proteins 0.000 description 3
- 101000934320 Homo sapiens Cyclin-A2 Proteins 0.000 description 3
- 101000884183 Homo sapiens Cyclin-F Proteins 0.000 description 3
- 101000884768 Homo sapiens Cystatin-SN Proteins 0.000 description 3
- 101000916686 Homo sapiens Cytohesin-interacting protein Proteins 0.000 description 3
- 101000864190 Homo sapiens DNA polymerase epsilon subunit 2 Proteins 0.000 description 3
- 101000914265 Homo sapiens DNA replication factor Cdt1 Proteins 0.000 description 3
- 101000932600 Homo sapiens Dual specificity protein phosphatase CDC14A Proteins 0.000 description 3
- 101000897959 Homo sapiens Endothelial cell-specific molecule 1 Proteins 0.000 description 3
- 101000814084 Homo sapiens Epithelial splicing regulatory protein 1 Proteins 0.000 description 3
- 101000892670 Homo sapiens Fermitin family homolog 1 Proteins 0.000 description 3
- 101000937169 Homo sapiens Fibronectin type 3 and ankyrin repeat domains protein 1 Proteins 0.000 description 3
- 101000868643 Homo sapiens G2/mitotic-specific cyclin-B1 Proteins 0.000 description 3
- 101000833375 Homo sapiens GTPase IMAP family member 4 Proteins 0.000 description 3
- 101001067880 Homo sapiens Histone H4 Proteins 0.000 description 3
- 101001078133 Homo sapiens Integrin alpha-2 Proteins 0.000 description 3
- 101000971538 Homo sapiens Killer cell lectin-like receptor subfamily F member 1 Proteins 0.000 description 3
- 101001008949 Homo sapiens Kinesin-like protein KIF14 Proteins 0.000 description 3
- 101000575048 Homo sapiens MTOR-associated protein MEAK7 Proteins 0.000 description 3
- 101000577881 Homo sapiens Macrophage metalloelastase Proteins 0.000 description 3
- 101000593405 Homo sapiens Myb-related protein B Proteins 0.000 description 3
- 101001018553 Homo sapiens MyoD family inhibitor Proteins 0.000 description 3
- 101000938705 Homo sapiens N-acetyltransferase ESCO2 Proteins 0.000 description 3
- 101001109501 Homo sapiens NKG2-D type II integral membrane protein Proteins 0.000 description 3
- 101000886231 Homo sapiens Polypeptide N-acetylgalactosaminyltransferase 6 Proteins 0.000 description 3
- 101000876829 Homo sapiens Protein C-ets-1 Proteins 0.000 description 3
- 101001028905 Homo sapiens Protein FAM177B Proteins 0.000 description 3
- 101000848917 Homo sapiens Protein FAM72D Proteins 0.000 description 3
- 101000962438 Homo sapiens Protein MAL2 Proteins 0.000 description 3
- 101000629807 Homo sapiens RNA-binding protein MEX3A Proteins 0.000 description 3
- 101000994788 Homo sapiens Ras GTPase-activating-like protein IQGAP3 Proteins 0.000 description 3
- 101001094554 Homo sapiens Rhotekin-2 Proteins 0.000 description 3
- 101000707218 Homo sapiens SH2 domain-containing protein 1B Proteins 0.000 description 3
- 101000632529 Homo sapiens Shugoshin 1 Proteins 0.000 description 3
- 101000835928 Homo sapiens Signal-regulatory protein gamma Proteins 0.000 description 3
- 101000822549 Homo sapiens Sterile alpha motif domain-containing protein 3 Proteins 0.000 description 3
- 101000577874 Homo sapiens Stromelysin-2 Proteins 0.000 description 3
- 101000577877 Homo sapiens Stromelysin-3 Proteins 0.000 description 3
- 101000716994 Homo sapiens Suppressor APC domain-containing protein 2 Proteins 0.000 description 3
- 101000798076 Homo sapiens T cell receptor delta constant Proteins 0.000 description 3
- 101000831007 Homo sapiens T-cell immunoreceptor with Ig and ITIM domains Proteins 0.000 description 3
- 101000897407 Homo sapiens T-cell surface glycoprotein CD1e, membrane-associated Proteins 0.000 description 3
- 101000830894 Homo sapiens Targeting protein for Xklp2 Proteins 0.000 description 3
- 101000945477 Homo sapiens Thymidine kinase, cytosolic Proteins 0.000 description 3
- 101000652332 Homo sapiens Transcription factor SOX-1 Proteins 0.000 description 3
- 101000642514 Homo sapiens Transcription factor SOX-4 Proteins 0.000 description 3
- 101000795117 Homo sapiens Triggering receptor expressed on myeloid cells 2 Proteins 0.000 description 3
- 101000851334 Homo sapiens Troponin I, cardiac muscle Proteins 0.000 description 3
- 101000587313 Homo sapiens Tyrosine-protein kinase Srms Proteins 0.000 description 3
- 101000777120 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 44 Proteins 0.000 description 3
- 101000837581 Homo sapiens Ubiquitin-conjugating enzyme E2 T Proteins 0.000 description 3
- 101000900760 Homo sapiens Uncharacterized protein encoded by LINC01551 Proteins 0.000 description 3
- 101000771675 Homo sapiens WD repeat and HMG-box DNA-binding protein 1 Proteins 0.000 description 3
- 102100025305 Integrin alpha-2 Human genes 0.000 description 3
- 108010050904 Interferons Proteins 0.000 description 3
- 102000014150 Interferons Human genes 0.000 description 3
- 102100021458 Killer cell lectin-like receptor subfamily F member 1 Human genes 0.000 description 3
- 102100027631 Kinesin-like protein KIF14 Human genes 0.000 description 3
- 102100024629 Laminin subunit beta-3 Human genes 0.000 description 3
- 102100025533 MTOR-associated protein MEAK7 Human genes 0.000 description 3
- 102100027998 Macrophage metalloelastase Human genes 0.000 description 3
- 102100024299 Maternal embryonic leucine zipper kinase Human genes 0.000 description 3
- 101710154611 Maternal embryonic leucine zipper kinase Proteins 0.000 description 3
- 108091007780 MiR-122 Proteins 0.000 description 3
- 102100034670 Myb-related protein B Human genes 0.000 description 3
- 102100033694 MyoD family inhibitor Human genes 0.000 description 3
- 102100030822 N-acetyltransferase ESCO2 Human genes 0.000 description 3
- 102100022680 NKG2-D type II integral membrane protein Human genes 0.000 description 3
- 206010028851 Necrosis Diseases 0.000 description 3
- 102100039695 Polypeptide N-acetylgalactosaminyltransferase 6 Human genes 0.000 description 3
- 102100035251 Protein C-ets-1 Human genes 0.000 description 3
- 102100037218 Protein FAM177B Human genes 0.000 description 3
- 102100034519 Protein FAM72D Human genes 0.000 description 3
- 102100039191 Protein MAL2 Human genes 0.000 description 3
- 102100026875 RNA-binding protein MEX3A Human genes 0.000 description 3
- 102100034417 Ras GTPase-activating-like protein IQGAP3 Human genes 0.000 description 3
- 102100027660 Rho GTPase-activating protein 15 Human genes 0.000 description 3
- 102100035117 Rhotekin-2 Human genes 0.000 description 3
- 102100031778 SH2 domain-containing protein 1B Human genes 0.000 description 3
- 108091006277 SLC5A1 Proteins 0.000 description 3
- 108091036332 SOX2OT Proteins 0.000 description 3
- 102100028402 Shugoshin 1 Human genes 0.000 description 3
- 102100025795 Signal-regulatory protein gamma Human genes 0.000 description 3
- 102000058090 Sodium-Glucose Transporter 1 Human genes 0.000 description 3
- 102100030677 Sphingosine-1-phosphate phosphatase 2 Human genes 0.000 description 3
- 102100022468 Sterile alpha motif domain-containing protein 3 Human genes 0.000 description 3
- 102100028848 Stromelysin-2 Human genes 0.000 description 3
- 102100028847 Stromelysin-3 Human genes 0.000 description 3
- 102100020923 Suppressor APC domain-containing protein 2 Human genes 0.000 description 3
- 102100032272 T cell receptor delta constant Human genes 0.000 description 3
- 102100024834 T-cell immunoreceptor with Ig and ITIM domains Human genes 0.000 description 3
- 102100021989 T-cell surface glycoprotein CD1e, membrane-associated Human genes 0.000 description 3
- 102100024813 Targeting protein for Xklp2 Human genes 0.000 description 3
- 108091046869 Telomeric non-coding RNA Proteins 0.000 description 3
- 102100030248 Transcription factor SOX-1 Human genes 0.000 description 3
- 102100036693 Transcription factor SOX-4 Human genes 0.000 description 3
- 102100029678 Triggering receptor expressed on myeloid cells 2 Human genes 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- 102100036859 Troponin I, cardiac muscle Human genes 0.000 description 3
- 102100029654 Tyrosine-protein kinase Srms Human genes 0.000 description 3
- 102100031306 Ubiquitin carboxyl-terminal hydrolase 44 Human genes 0.000 description 3
- 102100028705 Ubiquitin-conjugating enzyme E2 T Human genes 0.000 description 3
- 102100022066 Uncharacterized protein encoded by LINC01551 Human genes 0.000 description 3
- 102100029469 WD repeat and HMG-box DNA-binding protein 1 Human genes 0.000 description 3
- ZPCCSZFPOXBNDL-ZSTSFXQOSA-N [(4r,5s,6s,7r,9r,10r,11e,13e,16r)-6-[(2s,3r,4r,5s,6r)-5-[(2s,4r,5s,6s)-4,5-dihydroxy-4,6-dimethyloxan-2-yl]oxy-4-(dimethylamino)-3-hydroxy-6-methyloxan-2-yl]oxy-10-[(2r,5s,6r)-5-(dimethylamino)-6-methyloxan-2-yl]oxy-5-methoxy-9,16-dimethyl-2-oxo-7-(2-oxoe Chemical compound O([C@H]1/C=C/C=C/C[C@@H](C)OC(=O)C[C@H]([C@@H]([C@H]([C@@H](CC=O)C[C@H]1C)O[C@H]1[C@@H]([C@H]([C@H](O[C@@H]2O[C@@H](C)[C@H](O)[C@](C)(O)C2)[C@@H](C)O1)N(C)C)O)OC)OC(C)=O)[C@H]1CC[C@H](N(C)C)[C@@H](C)O1 ZPCCSZFPOXBNDL-ZSTSFXQOSA-N 0.000 description 3
- 230000002583 anti-histone Effects 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- 101150036080 at gene Proteins 0.000 description 3
- 108010032967 beta-Arrestin 2 Proteins 0.000 description 3
- 239000000090 biomarker Substances 0.000 description 3
- 210000000601 blood cell Anatomy 0.000 description 3
- 230000036755 cellular response Effects 0.000 description 3
- 238000012350 deep sequencing Methods 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 230000005750 disease progression Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 208000005017 glioblastoma Diseases 0.000 description 3
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 3
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 3
- 210000003494 hepatocyte Anatomy 0.000 description 3
- 229940079322 interferon Drugs 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 108010028309 kalinin Proteins 0.000 description 3
- 238000010801 machine learning Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 206010061289 metastatic neoplasm Diseases 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 210000004940 nucleus Anatomy 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000011002 quantification Methods 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000001847 surface plasmon resonance imaging Methods 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 102100034517 (E2-independent) E3 ubiquitin-conjugating enzyme FATS Human genes 0.000 description 2
- 102100038837 2-Hydroxyacid oxidase 1 Human genes 0.000 description 2
- 102100034531 AP-1 complex subunit mu-2 Human genes 0.000 description 2
- 102100028781 AP-1 complex subunit sigma-3 Human genes 0.000 description 2
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 2
- 102100020959 Alpha/beta hydrolase domain-containing protein 17C Human genes 0.000 description 2
- 208000024827 Alzheimer disease Diseases 0.000 description 2
- 102100021619 Ankyrin repeat and SOCS box protein 5 Human genes 0.000 description 2
- 102100036525 Anoctamin-4 Human genes 0.000 description 2
- 102100022977 Antithrombin-III Human genes 0.000 description 2
- 102100030970 Apolipoprotein C-III Human genes 0.000 description 2
- 101100001771 Arabidopsis thaliana AOC4 gene Proteins 0.000 description 2
- 102100022718 Atypical chemokine receptor 2 Human genes 0.000 description 2
- 102100022976 B-cell lymphoma/leukemia 11A Human genes 0.000 description 2
- 108700020462 BRCA2 Proteins 0.000 description 2
- 101001027057 Bacillus subtilis (strain 168) Flagellin Proteins 0.000 description 2
- 102100021971 Bcl-2-interacting killer Human genes 0.000 description 2
- 101000944273 Bos taurus Inward rectifier potassium channel 2 Proteins 0.000 description 2
- 101150008921 Brca2 gene Proteins 0.000 description 2
- 102100025399 Breast cancer type 2 susceptibility protein Human genes 0.000 description 2
- 102100028243 Breast carcinoma-amplified sequence 1 Human genes 0.000 description 2
- 102100025430 Butyrophilin-like protein 3 Human genes 0.000 description 2
- 102100025371 Butyrophilin-like protein 8 Human genes 0.000 description 2
- 102100037084 C4b-binding protein alpha chain Human genes 0.000 description 2
- 102100025238 CD302 antigen Human genes 0.000 description 2
- 102100040531 CKLF-like MARVEL transmembrane domain-containing protein 2 Human genes 0.000 description 2
- 102100029382 CMRF35-like molecule 6 Human genes 0.000 description 2
- 102100029400 CMRF35-like molecule 7 Human genes 0.000 description 2
- 102100035350 CUB domain-containing protein 1 Human genes 0.000 description 2
- 108090000835 CX3C Chemokine Receptor 1 Proteins 0.000 description 2
- 102100039196 CX3C chemokine receptor 1 Human genes 0.000 description 2
- 102100025805 Cadherin-1 Human genes 0.000 description 2
- 102100023243 Calcium-activated potassium channel subunit beta-3 Human genes 0.000 description 2
- 102100030004 Calpain-8 Human genes 0.000 description 2
- 102100024942 Calsequestrin-1 Human genes 0.000 description 2
- 102100035023 Carboxypeptidase B2 Human genes 0.000 description 2
- 102100032215 Cathepsin E Human genes 0.000 description 2
- 102100035654 Cathepsin S Human genes 0.000 description 2
- 108091007854 Cdh1/Fizzy-related Proteins 0.000 description 2
- 102100034786 Cell migration-inducing and hyaluronan-binding protein Human genes 0.000 description 2
- 102100035366 Centromere protein M Human genes 0.000 description 2
- 102100025832 Centromere-associated protein E Human genes 0.000 description 2
- 208000031404 Chromosome Aberrations Diseases 0.000 description 2
- 102100023337 Chymotrypsin-like elastase family member 3A Human genes 0.000 description 2
- 102100040484 Claspin Human genes 0.000 description 2
- 102100038447 Claudin-4 Human genes 0.000 description 2
- 102100029058 Coagulation factor XIII B chain Human genes 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 102100033601 Collagen alpha-1(I) chain Human genes 0.000 description 2
- 102100027995 Collagenase 3 Human genes 0.000 description 2
- 102100035325 Complement factor H-related protein 5 Human genes 0.000 description 2
- 102100024340 Contactin-4 Human genes 0.000 description 2
- 102100033283 Creatine kinase U-type, mitochondrial Human genes 0.000 description 2
- 102100034501 Cyclin-dependent kinases regulatory subunit 1 Human genes 0.000 description 2
- 102100028007 Cystatin-SA Human genes 0.000 description 2
- 102100026515 Cytochrome P450 2S1 Human genes 0.000 description 2
- 102100039523 Cytoskeleton-associated protein 2-like Human genes 0.000 description 2
- 102100037810 DEP domain-containing protein 1B Human genes 0.000 description 2
- 102100040263 DNA dC->dU-editing enzyme APOBEC-3A Human genes 0.000 description 2
- 230000005971 DNA damage repair Effects 0.000 description 2
- 102100035925 DNA methyltransferase 1-associated protein 1 Human genes 0.000 description 2
- 230000009946 DNA mutation Effects 0.000 description 2
- 102100029766 DNA polymerase theta Human genes 0.000 description 2
- 102100022286 DNA repair-scaffolding protein Human genes 0.000 description 2
- 102100036218 DNA replication complex GINS protein PSF2 Human genes 0.000 description 2
- 206010012335 Dependence Diseases 0.000 description 2
- 102100034578 Desmoglein-2 Human genes 0.000 description 2
- 101100477411 Dictyostelium discoideum set1 gene Proteins 0.000 description 2
- 102100037980 Disks large-associated protein 5 Human genes 0.000 description 2
- 102100035275 E3 ubiquitin-protein ligase CBL-C Human genes 0.000 description 2
- 102100026245 E3 ubiquitin-protein ligase RNF43 Human genes 0.000 description 2
- 102100029520 E3 ubiquitin-protein ligase TRIM31 Human genes 0.000 description 2
- 108010008802 ELAV-Like Protein 4 Proteins 0.000 description 2
- 102100021665 ELAV-like protein 4 Human genes 0.000 description 2
- 101150097734 EPHB2 gene Proteins 0.000 description 2
- 102100039578 ETS translocation variant 4 Human genes 0.000 description 2
- 102100038285 Endogenous retroviral envelope protein HEMO Human genes 0.000 description 2
- 102100025471 Epiphycan Human genes 0.000 description 2
- 102100030376 Ermin Human genes 0.000 description 2
- 102100031939 Erythropoietin Human genes 0.000 description 2
- 108700039887 Essential Genes Proteins 0.000 description 2
- 108010022894 Euchromatin Proteins 0.000 description 2
- 102100040553 FXYD domain-containing ion transport regulator 3 Human genes 0.000 description 2
- 102100026748 Fatty acid-binding protein, intestinal Human genes 0.000 description 2
- 102100031512 Fc receptor-like protein 3 Human genes 0.000 description 2
- 102100033925 GS homeobox 1 Human genes 0.000 description 2
- 102100028463 Galactose-3-O-sulfotransferase 3 Human genes 0.000 description 2
- 102100036529 General transcription factor 3C polypeptide 1 Human genes 0.000 description 2
- 201000010915 Glioblastoma multiforme Diseases 0.000 description 2
- 102100022626 Glutamate receptor ionotropic, NMDA 2D Human genes 0.000 description 2
- 102100033044 Glutathione peroxidase 2 Human genes 0.000 description 2
- 102100039264 Glycogen [starch] synthase, liver Human genes 0.000 description 2
- 102100032610 Guanine nucleotide-binding protein G(s) subunit alpha isoforms XLas Human genes 0.000 description 2
- 102100031546 HLA class II histocompatibility antigen, DO beta chain Human genes 0.000 description 2
- 108010034791 Heterochromatin Proteins 0.000 description 2
- 102100022130 High mobility group protein B3 Human genes 0.000 description 2
- 102100029009 High mobility group protein HMG-I/HMG-Y Human genes 0.000 description 2
- 102100034535 Histone H3.1 Human genes 0.000 description 2
- 102100031672 Homeobox protein CDX-1 Human genes 0.000 description 2
- 102100030087 Homeobox protein DLX-1 Human genes 0.000 description 2
- 101000710194 Homo sapiens (E2-independent) E3 ubiquitin-conjugating enzyme FATS Proteins 0.000 description 2
- 101001031589 Homo sapiens 2-Hydroxyacid oxidase 1 Proteins 0.000 description 2
- 101000924636 Homo sapiens AP-1 complex subunit mu-2 Proteins 0.000 description 2
- 101000768014 Homo sapiens AP-1 complex subunit sigma-3 Proteins 0.000 description 2
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 2
- 101000783862 Homo sapiens Alpha/beta hydrolase domain-containing protein 17C Proteins 0.000 description 2
- 101000754309 Homo sapiens Ankyrin repeat and SOCS box protein 5 Proteins 0.000 description 2
- 101000928366 Homo sapiens Anoctamin-4 Proteins 0.000 description 2
- 101000757319 Homo sapiens Antithrombin-III Proteins 0.000 description 2
- 101000793223 Homo sapiens Apolipoprotein C-III Proteins 0.000 description 2
- 101000678892 Homo sapiens Atypical chemokine receptor 2 Proteins 0.000 description 2
- 101000903703 Homo sapiens B-cell lymphoma/leukemia 11A Proteins 0.000 description 2
- 101000970576 Homo sapiens Bcl-2-interacting killer Proteins 0.000 description 2
- 101000935635 Homo sapiens Breast carcinoma-amplified sequence 1 Proteins 0.000 description 2
- 101000934741 Homo sapiens Butyrophilin-like protein 3 Proteins 0.000 description 2
- 101000934742 Homo sapiens Butyrophilin-like protein 8 Proteins 0.000 description 2
- 101000740685 Homo sapiens C4b-binding protein alpha chain Proteins 0.000 description 2
- 101000749427 Homo sapiens CKLF-like MARVEL transmembrane domain-containing protein 2 Proteins 0.000 description 2
- 101000990034 Homo sapiens CMRF35-like molecule 6 Proteins 0.000 description 2
- 101000990007 Homo sapiens CMRF35-like molecule 7 Proteins 0.000 description 2
- 101000737742 Homo sapiens CUB domain-containing protein 1 Proteins 0.000 description 2
- 101001049845 Homo sapiens Calcium-activated potassium channel subunit beta-2 Proteins 0.000 description 2
- 101001049846 Homo sapiens Calcium-activated potassium channel subunit beta-3 Proteins 0.000 description 2
- 101000793675 Homo sapiens Calpain-8 Proteins 0.000 description 2
- 101000761381 Homo sapiens Calsequestrin-1 Proteins 0.000 description 2
- 101000855412 Homo sapiens Carbamoyl-phosphate synthase [ammonia], mitochondrial Proteins 0.000 description 2
- 101000946518 Homo sapiens Carboxypeptidase B2 Proteins 0.000 description 2
- 101000869031 Homo sapiens Cathepsin E Proteins 0.000 description 2
- 101000945881 Homo sapiens Cell migration-inducing and hyaluronan-binding protein Proteins 0.000 description 2
- 101000721661 Homo sapiens Cellular tumor antigen p53 Proteins 0.000 description 2
- 101000737696 Homo sapiens Centromere protein M Proteins 0.000 description 2
- 101000914247 Homo sapiens Centromere-associated protein E Proteins 0.000 description 2
- 101000907964 Homo sapiens Chymotrypsin-like elastase family member 3A Proteins 0.000 description 2
- 101000750011 Homo sapiens Claspin Proteins 0.000 description 2
- 101000882890 Homo sapiens Claudin-4 Proteins 0.000 description 2
- 101000918350 Homo sapiens Coagulation factor XIII B chain Proteins 0.000 description 2
- 101000577887 Homo sapiens Collagenase 3 Proteins 0.000 description 2
- 101000878134 Homo sapiens Complement factor H-related protein 5 Proteins 0.000 description 2
- 101000909504 Homo sapiens Contactin-4 Proteins 0.000 description 2
- 101001135413 Homo sapiens Creatine kinase U-type, mitochondrial Proteins 0.000 description 2
- 101000710200 Homo sapiens Cyclin-dependent kinases regulatory subunit 1 Proteins 0.000 description 2
- 101000722958 Homo sapiens Cystatin-SA Proteins 0.000 description 2
- 101000855328 Homo sapiens Cytochrome P450 2S1 Proteins 0.000 description 2
- 101000888538 Homo sapiens Cytoskeleton-associated protein 2-like Proteins 0.000 description 2
- 101000950656 Homo sapiens DEP domain-containing protein 1B Proteins 0.000 description 2
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 description 2
- 101000930289 Homo sapiens DNA methyltransferase 1-associated protein 1 Proteins 0.000 description 2
- 101001094659 Homo sapiens DNA polymerase kappa Proteins 0.000 description 2
- 101000865085 Homo sapiens DNA polymerase theta Proteins 0.000 description 2
- 101000825159 Homo sapiens DNA repair-scaffolding protein Proteins 0.000 description 2
- 101000736065 Homo sapiens DNA replication complex GINS protein PSF2 Proteins 0.000 description 2
- 101000924314 Homo sapiens Desmoglein-2 Proteins 0.000 description 2
- 101000951365 Homo sapiens Disks large-associated protein 5 Proteins 0.000 description 2
- 101000737269 Homo sapiens E3 ubiquitin-protein ligase CBL-C Proteins 0.000 description 2
- 101000692702 Homo sapiens E3 ubiquitin-protein ligase RNF43 Proteins 0.000 description 2
- 101000634974 Homo sapiens E3 ubiquitin-protein ligase TRIM31 Proteins 0.000 description 2
- 101000966913 Homo sapiens ELL-associated factor 2 Proteins 0.000 description 2
- 101000813747 Homo sapiens ETS translocation variant 4 Proteins 0.000 description 2
- 101001033183 Homo sapiens Endogenous retroviral envelope protein HEMO Proteins 0.000 description 2
- 101001056751 Homo sapiens Epiphycan Proteins 0.000 description 2
- 101001063322 Homo sapiens Ermin Proteins 0.000 description 2
- 101000893731 Homo sapiens FXYD domain-containing ion transport regulator 3 Proteins 0.000 description 2
- 101000911337 Homo sapiens Fatty acid-binding protein, intestinal Proteins 0.000 description 2
- 101000846910 Homo sapiens Fc receptor-like protein 3 Proteins 0.000 description 2
- 101001068303 Homo sapiens GS homeobox 1 Proteins 0.000 description 2
- 101000714249 Homo sapiens General transcription factor 3C polypeptide 1 Proteins 0.000 description 2
- 101000972840 Homo sapiens Glutamate receptor ionotropic, NMDA 2D Proteins 0.000 description 2
- 101000871129 Homo sapiens Glutathione peroxidase 2 Proteins 0.000 description 2
- 101001036117 Homo sapiens Glycogen [starch] synthase, liver Proteins 0.000 description 2
- 101001014590 Homo sapiens Guanine nucleotide-binding protein G(s) subunit alpha isoforms XLas Proteins 0.000 description 2
- 101001014594 Homo sapiens Guanine nucleotide-binding protein G(s) subunit alpha isoforms short Proteins 0.000 description 2
- 101000866281 Homo sapiens HLA class II histocompatibility antigen, DO beta chain Proteins 0.000 description 2
- 101001045794 Homo sapiens High mobility group protein B3 Proteins 0.000 description 2
- 101000986380 Homo sapiens High mobility group protein HMG-I/HMG-Y Proteins 0.000 description 2
- 101001067844 Homo sapiens Histone H3.1 Proteins 0.000 description 2
- 101000777808 Homo sapiens Homeobox protein CDX-1 Proteins 0.000 description 2
- 101000864690 Homo sapiens Homeobox protein DLX-1 Proteins 0.000 description 2
- 101001035951 Homo sapiens Hyaluronan-binding protein 2 Proteins 0.000 description 2
- 101001035752 Homo sapiens Hydroxycarboxylic acid receptor 3 Proteins 0.000 description 2
- 101001010610 Homo sapiens Immunoglobulin-like domain-containing receptor 1 Proteins 0.000 description 2
- 101000994365 Homo sapiens Integrin alpha-6 Proteins 0.000 description 2
- 101000609396 Homo sapiens Inter-alpha-trypsin inhibitor heavy chain H2 Proteins 0.000 description 2
- 101001076422 Homo sapiens Interleukin-1 receptor type 2 Proteins 0.000 description 2
- 101001010591 Homo sapiens Interleukin-20 Proteins 0.000 description 2
- 101001013150 Homo sapiens Interstitial collagenase Proteins 0.000 description 2
- 101000998020 Homo sapiens Keratin, type I cytoskeletal 18 Proteins 0.000 description 2
- 101001007008 Homo sapiens Keratin-associated protein 4-1 Proteins 0.000 description 2
- 101001027631 Homo sapiens Kinesin-like protein KIF20B Proteins 0.000 description 2
- 101001027602 Homo sapiens Kinesin-like protein KIF26B Proteins 0.000 description 2
- 101001006776 Homo sapiens Kinesin-like protein KIFC1 Proteins 0.000 description 2
- 101000701585 Homo sapiens Kinetochore protein Spc24 Proteins 0.000 description 2
- 101001023271 Homo sapiens Laminin subunit gamma-2 Proteins 0.000 description 2
- 101001065663 Homo sapiens Lipolysis-stimulated lipoprotein receptor Proteins 0.000 description 2
- 101000972357 Homo sapiens Liver-expressed antimicrobial peptide 2 Proteins 0.000 description 2
- 101001018034 Homo sapiens Lymphocyte antigen 75 Proteins 0.000 description 2
- 101000899339 Homo sapiens Lymphoid-specific helicase Proteins 0.000 description 2
- 101000922402 Homo sapiens Lysosomal membrane ascorbate-dependent ferrireductase CYB561A3 Proteins 0.000 description 2
- 101100400377 Homo sapiens MARVELD2 gene Proteins 0.000 description 2
- 101000990902 Homo sapiens Matrix metalloproteinase-9 Proteins 0.000 description 2
- 101000636209 Homo sapiens Matrix-remodeling-associated protein 5 Proteins 0.000 description 2
- 101001055106 Homo sapiens Metastasis-associated in colon cancer protein 1 Proteins 0.000 description 2
- 101000831175 Homo sapiens Mitochondrial import inner membrane translocase subunit TIM16 Proteins 0.000 description 2
- 101000972276 Homo sapiens Mucin-5B Proteins 0.000 description 2
- 101001030211 Homo sapiens Myc proto-oncogene protein Proteins 0.000 description 2
- 101001128427 Homo sapiens Myeloma-overexpressed gene protein Proteins 0.000 description 2
- 101000958866 Homo sapiens Myogenic factor 6 Proteins 0.000 description 2
- 101000635885 Homo sapiens Myosin light chain 1/3, skeletal muscle isoform Proteins 0.000 description 2
- 101000983292 Homo sapiens N-fatty-acyl-amino acid synthase/hydrolase PM20D1 Proteins 0.000 description 2
- 101000818546 Homo sapiens N-formyl peptide receptor 2 Proteins 0.000 description 2
- 101000594773 Homo sapiens NXPE family member 4 Proteins 0.000 description 2
- 101000995194 Homo sapiens Nebulette Proteins 0.000 description 2
- 101001111320 Homo sapiens Nestin Proteins 0.000 description 2
- 101000969961 Homo sapiens Neurexin-3 Proteins 0.000 description 2
- 101000969963 Homo sapiens Neurexin-3-beta Proteins 0.000 description 2
- 101000581986 Homo sapiens Neurocan core protein Proteins 0.000 description 2
- 101001014610 Homo sapiens Neuroendocrine secretory protein 55 Proteins 0.000 description 2
- 101000604168 Homo sapiens Neuromedin-B Proteins 0.000 description 2
- 101001023729 Homo sapiens Neuropilin and tolloid-like protein 2 Proteins 0.000 description 2
- 101000588303 Homo sapiens Nuclear factor erythroid 2-related factor 3 Proteins 0.000 description 2
- 101000735484 Homo sapiens Paired box protein Pax-9 Proteins 0.000 description 2
- 101000738523 Homo sapiens Pancreas transcription factor 1 subunit alpha Proteins 0.000 description 2
- 101001120056 Homo sapiens Phosphatidylinositol 3-kinase regulatory subunit alpha Proteins 0.000 description 2
- 101000983245 Homo sapiens Phosphatidylinositol 4-kinase type 2-beta Proteins 0.000 description 2
- 101001081953 Homo sapiens Phosphoribosylaminoimidazole carboxylase Proteins 0.000 description 2
- 101000730616 Homo sapiens Pleckstrin homology domain-containing family G member 6 Proteins 0.000 description 2
- 101000583702 Homo sapiens Pleckstrin homology-like domain family A member 2 Proteins 0.000 description 2
- 101001126466 Homo sapiens Pleckstrin-2 Proteins 0.000 description 2
- 101000886179 Homo sapiens Polypeptide N-acetylgalactosaminyltransferase 3 Proteins 0.000 description 2
- 101001009552 Homo sapiens Probable G-protein coupled receptor 34 Proteins 0.000 description 2
- 101000701518 Homo sapiens Probable phospholipid-transporting ATPase IM Proteins 0.000 description 2
- 101000959489 Homo sapiens Protein AF-9 Proteins 0.000 description 2
- 101000797903 Homo sapiens Protein ALEX Proteins 0.000 description 2
- 101000817237 Homo sapiens Protein ECT2 Proteins 0.000 description 2
- 101000848919 Homo sapiens Protein FAM72B Proteins 0.000 description 2
- 101000911400 Homo sapiens Protein FAM83H Proteins 0.000 description 2
- 101000979565 Homo sapiens Protein NLRC5 Proteins 0.000 description 2
- 101000801296 Homo sapiens Protein O-mannosyl-transferase TMTC4 Proteins 0.000 description 2
- 101000814371 Homo sapiens Protein Wnt-10a Proteins 0.000 description 2
- 101000964691 Homo sapiens Protein Z-dependent protease inhibitor Proteins 0.000 description 2
- 101000931682 Homo sapiens Protein furry homolog-like Proteins 0.000 description 2
- 101000880952 Homo sapiens Protein ripply3 Proteins 0.000 description 2
- 101001098560 Homo sapiens Proteinase-activated receptor 2 Proteins 0.000 description 2
- 101000974433 Homo sapiens Putative ADP-ribosylation factor-like protein 5C Proteins 0.000 description 2
- 101000843133 Homo sapiens Putative heat shock protein HSP 90-beta 4 Proteins 0.000 description 2
- 101000788242 Homo sapiens Putative trace amine-associated receptor 3 Proteins 0.000 description 2
- 101000755643 Homo sapiens RIMS-binding protein 2 Proteins 0.000 description 2
- 101000742310 Homo sapiens Rab15 effector protein Proteins 0.000 description 2
- 101001079070 Homo sapiens Ras-related protein Rab-19 Proteins 0.000 description 2
- 101001099871 Homo sapiens Ras-related protein Rab-42 Proteins 0.000 description 2
- 101000635761 Homo sapiens Receptor-transporting protein 3 Proteins 0.000 description 2
- 101001106412 Homo sapiens Regulator of hemoglobinization and erythroid cell expansion protein Proteins 0.000 description 2
- 101001110356 Homo sapiens Relaxin-3 receptor 2 Proteins 0.000 description 2
- 101001096355 Homo sapiens Replication factor C subunit 3 Proteins 0.000 description 2
- 101000580351 Homo sapiens Respirasome Complex Assembly Factor 1 Proteins 0.000 description 2
- 101000729271 Homo sapiens Retinoid isomerohydrolase Proteins 0.000 description 2
- 101000742932 Homo sapiens Retinol dehydrogenase 16 Proteins 0.000 description 2
- 101000756365 Homo sapiens Retinol-binding protein 2 Proteins 0.000 description 2
- 101000733266 Homo sapiens Rho guanine nucleotide exchange factor 35 Proteins 0.000 description 2
- 101001091968 Homo sapiens Rhophilin-2 Proteins 0.000 description 2
- 101000655897 Homo sapiens Serine protease 1 Proteins 0.000 description 2
- 101000829203 Homo sapiens Serine/arginine repetitive matrix protein 4 Proteins 0.000 description 2
- 101000777293 Homo sapiens Serine/threonine-protein kinase Chk1 Proteins 0.000 description 2
- 101001092910 Homo sapiens Serum amyloid P-component Proteins 0.000 description 2
- 101000863900 Homo sapiens Sialic acid-binding Ig-like lectin 5 Proteins 0.000 description 2
- 101000911790 Homo sapiens Sister chromatid cohesion protein DCC1 Proteins 0.000 description 2
- 101000665250 Homo sapiens Small nuclear ribonucleoprotein Sm D2 Proteins 0.000 description 2
- 101000703460 Homo sapiens Sphingosine-1-phosphate phosphatase 2 Proteins 0.000 description 2
- 101000861263 Homo sapiens Steroid 21-hydroxylase Proteins 0.000 description 2
- 101000600912 Homo sapiens Substance-K receptor Proteins 0.000 description 2
- 101000704557 Homo sapiens Sulfiredoxin-1 Proteins 0.000 description 2
- 101000661816 Homo sapiens Suppression of tumorigenicity 18 protein Proteins 0.000 description 2
- 101000661807 Homo sapiens Suppressor of tumorigenicity 14 protein Proteins 0.000 description 2
- 101000630134 Homo sapiens Syncollin Proteins 0.000 description 2
- 101000837401 Homo sapiens T-cell leukemia/lymphoma protein 1A Proteins 0.000 description 2
- 101000763966 Homo sapiens TLC domain-containing protein 1 Proteins 0.000 description 2
- 101000651291 Homo sapiens TRAF-interacting protein with FHA domain-containing protein B Proteins 0.000 description 2
- 101000809797 Homo sapiens Thymidylate synthase Proteins 0.000 description 2
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 description 2
- 101000711846 Homo sapiens Transcription factor SOX-9 Proteins 0.000 description 2
- 101000636213 Homo sapiens Transcriptional activator Myb Proteins 0.000 description 2
- 101000904724 Homo sapiens Transmembrane glycoprotein NMB Proteins 0.000 description 2
- 101000798702 Homo sapiens Transmembrane protease serine 4 Proteins 0.000 description 2
- 101000798548 Homo sapiens Transmembrane protein 238 Proteins 0.000 description 2
- 101000764260 Homo sapiens Troponin T, cardiac muscle Proteins 0.000 description 2
- 101000648505 Homo sapiens Tumor necrosis factor receptor superfamily member 12A Proteins 0.000 description 2
- 101000920026 Homo sapiens Tumor necrosis factor receptor superfamily member EDAR Proteins 0.000 description 2
- 101000607320 Homo sapiens UL16-binding protein 2 Proteins 0.000 description 2
- 101000956263 Homo sapiens Uncharacterized protein C19orf48 Proteins 0.000 description 2
- 101000702706 Homo sapiens Uncharacterized protein encoded by SND1-IT1 Proteins 0.000 description 2
- 101000582993 Homo sapiens Unconventional myosin-Vb Proteins 0.000 description 2
- 101001055377 Homo sapiens Ventricular zone-expressed PH domain-containing protein homolog 1 Proteins 0.000 description 2
- 101000782485 Homo sapiens Zinc finger protein 460 Proteins 0.000 description 2
- 102100039238 Hyaluronan-binding protein 2 Human genes 0.000 description 2
- 102100039356 Hydroxycarboxylic acid receptor 3 Human genes 0.000 description 2
- 102100030713 Immunoglobulin-like domain-containing receptor 1 Human genes 0.000 description 2
- 102100027004 Inhibin beta A chain Human genes 0.000 description 2
- 102100032816 Integrin alpha-6 Human genes 0.000 description 2
- 102100039440 Inter-alpha-trypsin inhibitor heavy chain H2 Human genes 0.000 description 2
- 102000006992 Interferon-alpha Human genes 0.000 description 2
- 108010047761 Interferon-alpha Proteins 0.000 description 2
- 102100026017 Interleukin-1 receptor type 2 Human genes 0.000 description 2
- 102100030692 Interleukin-20 Human genes 0.000 description 2
- 102100033421 Keratin, type I cytoskeletal 18 Human genes 0.000 description 2
- 102100028480 Keratin-associated protein 4-1 Human genes 0.000 description 2
- 102100037691 Kinesin-like protein KIF20B Human genes 0.000 description 2
- 102100037692 Kinesin-like protein KIF26B Human genes 0.000 description 2
- 102100027942 Kinesin-like protein KIFC1 Human genes 0.000 description 2
- 102100030536 Kinetochore protein Spc24 Human genes 0.000 description 2
- 102100035159 Laminin subunit gamma-2 Human genes 0.000 description 2
- 108010017736 Leukocyte Immunoglobulin-like Receptor B1 Proteins 0.000 description 2
- 102100025584 Leukocyte immunoglobulin-like receptor subfamily B member 1 Human genes 0.000 description 2
- 102100032010 Lipolysis-stimulated lipoprotein receptor Human genes 0.000 description 2
- 102100022685 Liver-expressed antimicrobial peptide 2 Human genes 0.000 description 2
- 102100033486 Lymphocyte antigen 75 Human genes 0.000 description 2
- 102100022539 Lymphoid-specific helicase Human genes 0.000 description 2
- 102100031659 Lysosomal membrane ascorbate-dependent ferrireductase CYB561A3 Human genes 0.000 description 2
- 108700008222 MARVEL Domain Containing 2 Proteins 0.000 description 2
- 102000049280 MARVEL Domain Containing 2 Human genes 0.000 description 2
- 102000000380 Matrix Metalloproteinase 1 Human genes 0.000 description 2
- 102100030412 Matrix metalloproteinase-9 Human genes 0.000 description 2
- 102100030776 Matrix-remodeling-associated protein 5 Human genes 0.000 description 2
- 102100026892 Metastasis-associated in colon cancer protein 1 Human genes 0.000 description 2
- 102100024285 Mitochondrial import inner membrane translocase subunit TIM16 Human genes 0.000 description 2
- 102100022494 Mucin-5B Human genes 0.000 description 2
- 101100007955 Mus musculus Ctcf gene Proteins 0.000 description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 2
- 102100031791 Myeloma-overexpressed gene protein Human genes 0.000 description 2
- 102100038379 Myogenic factor 6 Human genes 0.000 description 2
- 102100030740 Myosin light chain 1/3, skeletal muscle isoform Human genes 0.000 description 2
- 102100026873 N-fatty-acyl-amino acid synthase/hydrolase PM20D1 Human genes 0.000 description 2
- 102100021126 N-formyl peptide receptor 2 Human genes 0.000 description 2
- 108010057466 NF-kappa B Proteins 0.000 description 2
- 102000003945 NF-kappa B Human genes 0.000 description 2
- 102100036100 NXPE family member 4 Human genes 0.000 description 2
- 102100034431 Nebulette Human genes 0.000 description 2
- 102100024014 Nestin Human genes 0.000 description 2
- 102100021310 Neurexin-3 Human genes 0.000 description 2
- 102100030466 Neurocan core protein Human genes 0.000 description 2
- 102100035485 Neuropilin and tolloid-like protein 2 Human genes 0.000 description 2
- 102100031700 Nuclear factor erythroid 2-related factor 3 Human genes 0.000 description 2
- 101100001772 Oryza sativa subsp. indica AOC gene Proteins 0.000 description 2
- 108091007056 PEDS1-UBE2V1 Proteins 0.000 description 2
- 102100034901 Paired box protein Pax-9 Human genes 0.000 description 2
- 102100037878 Pancreas transcription factor 1 subunit alpha Human genes 0.000 description 2
- 241000282376 Panthera tigris Species 0.000 description 2
- 208000037273 Pathologic Processes Diseases 0.000 description 2
- 102100026169 Phosphatidylinositol 3-kinase regulatory subunit alpha Human genes 0.000 description 2
- 102100026883 Phosphatidylinositol 4-kinase type 2-beta Human genes 0.000 description 2
- 102100027330 Phosphoribosylaminoimidazole carboxylase Human genes 0.000 description 2
- 102100032591 Pleckstrin homology domain-containing family G member 6 Human genes 0.000 description 2
- 102100030926 Pleckstrin homology-like domain family A member 2 Human genes 0.000 description 2
- 102100030470 Pleckstrin-2 Human genes 0.000 description 2
- 102100039685 Polypeptide N-acetylgalactosaminyltransferase 3 Human genes 0.000 description 2
- 102100030263 Probable G-protein coupled receptor 34 Human genes 0.000 description 2
- 102100030468 Probable phospholipid-transporting ATPase IM Human genes 0.000 description 2
- 102100039686 Protein AF-9 Human genes 0.000 description 2
- 102100040437 Protein ECT2 Human genes 0.000 description 2
- 102100034521 Protein FAM72B Human genes 0.000 description 2
- 102100026738 Protein FAM83H Human genes 0.000 description 2
- 102100023432 Protein NLRC5 Human genes 0.000 description 2
- 102100033737 Protein O-mannosyl-transferase TMTC4 Human genes 0.000 description 2
- 102100039461 Protein Wnt-10a Human genes 0.000 description 2
- 102100040790 Protein Z-dependent protease inhibitor Human genes 0.000 description 2
- 102100020916 Protein furry homolog-like Human genes 0.000 description 2
- 102100037708 Protein ripply3 Human genes 0.000 description 2
- 102100037132 Proteinase-activated receptor 2 Human genes 0.000 description 2
- 102100022868 Putative ADP-ribosylation factor-like protein 5C Human genes 0.000 description 2
- 102100031040 Putative heat shock protein HSP 90-beta 4 Human genes 0.000 description 2
- 102100025206 Putative trace amine-associated receptor 3 Human genes 0.000 description 2
- 102000001183 RAG-1 Human genes 0.000 description 2
- 108060006897 RAG1 Proteins 0.000 description 2
- 102100022371 RIMS-binding protein 2 Human genes 0.000 description 2
- 102100038203 Rab15 effector protein Human genes 0.000 description 2
- 102100028190 Ras-related protein Rab-19 Human genes 0.000 description 2
- 102100038506 Ras-related protein Rab-42 Human genes 0.000 description 2
- 102100030849 Receptor-transporting protein 3 Human genes 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 102100021434 Regulator of hemoglobinization and erythroid cell expansion protein Human genes 0.000 description 2
- 102100022100 Relaxin-3 receptor 2 Human genes 0.000 description 2
- 102100037855 Replication factor C subunit 3 Human genes 0.000 description 2
- 102100027558 Respirasome Complex Assembly Factor 1 Human genes 0.000 description 2
- 102100031176 Retinoid isomerohydrolase Human genes 0.000 description 2
- 102100038057 Retinol dehydrogenase 16 Human genes 0.000 description 2
- 102100033206 Rho guanine nucleotide exchange factor 35 Human genes 0.000 description 2
- 102100035749 Rhophilin-2 Human genes 0.000 description 2
- 102100028029 SCL-interrupting locus protein Human genes 0.000 description 2
- 108091006156 SLC17A2 Proteins 0.000 description 2
- 108091006744 SLC22A1 Proteins 0.000 description 2
- 108091006755 SLC22A10 Proteins 0.000 description 2
- 108091006740 SLC22A9 Proteins 0.000 description 2
- 108091006700 SLC24A2 Proteins 0.000 description 2
- 108091006569 SLC32A1 Proteins 0.000 description 2
- 102100032491 Serine protease 1 Human genes 0.000 description 2
- 102100023663 Serine/arginine repetitive matrix protein 4 Human genes 0.000 description 2
- 102100031081 Serine/threonine-protein kinase Chk1 Human genes 0.000 description 2
- 102100036202 Serum amyloid P-component Human genes 0.000 description 2
- 102100029957 Sialic acid-binding Ig-like lectin 5 Human genes 0.000 description 2
- 102100027040 Sister chromatid cohesion protein DCC1 Human genes 0.000 description 2
- 102100038685 Small nuclear ribonucleoprotein Sm D2 Human genes 0.000 description 2
- 102100038435 Sodium-dependent phosphate transport protein 3 Human genes 0.000 description 2
- 102100031998 Sodium/potassium/calcium exchanger 2 Human genes 0.000 description 2
- 102100032416 Solute carrier family 22 member 1 Human genes 0.000 description 2
- 102100035282 Solute carrier family 22 member 10 Human genes 0.000 description 2
- 102100035246 Solute carrier family 22 member 9 Human genes 0.000 description 2
- 238000000692 Student's t-test Methods 0.000 description 2
- 102100037342 Substance-K receptor Human genes 0.000 description 2
- 102100031797 Sulfiredoxin-1 Human genes 0.000 description 2
- 102100037942 Suppressor of tumorigenicity 14 protein Human genes 0.000 description 2
- 102100026185 Syncollin Human genes 0.000 description 2
- 102100028676 T-cell leukemia/lymphoma protein 1A Human genes 0.000 description 2
- 102100026917 TLC domain-containing protein 1 Human genes 0.000 description 2
- 102100027653 TRAF-interacting protein with FHA domain-containing protein B Human genes 0.000 description 2
- 102100038618 Thymidylate synthase Human genes 0.000 description 2
- 102100039390 Toll-like receptor 7 Human genes 0.000 description 2
- 102100034204 Transcription factor SOX-9 Human genes 0.000 description 2
- 102100030780 Transcriptional activator Myb Human genes 0.000 description 2
- 102100023935 Transmembrane glycoprotein NMB Human genes 0.000 description 2
- 102100032471 Transmembrane protease serine 4 Human genes 0.000 description 2
- 102100032476 Transmembrane protein 238 Human genes 0.000 description 2
- 108010088412 Trefoil Factor-1 Proteins 0.000 description 2
- 102100039175 Trefoil factor 1 Human genes 0.000 description 2
- 102100026893 Troponin T, cardiac muscle Human genes 0.000 description 2
- 102100028786 Tumor necrosis factor receptor superfamily member 12A Human genes 0.000 description 2
- 102100030810 Tumor necrosis factor receptor superfamily member EDAR Human genes 0.000 description 2
- 102100039989 UL16-binding protein 2 Human genes 0.000 description 2
- 102100038573 Uncharacterized protein C19orf48 Human genes 0.000 description 2
- 102100030915 Uncharacterized protein encoded by SND1-IT1 Human genes 0.000 description 2
- 102100030366 Unconventional myosin-Vb Human genes 0.000 description 2
- 102100026175 Ventricular zone-expressed PH domain-containing protein homolog 1 Human genes 0.000 description 2
- 102100038170 Vesicular inhibitory amino acid transporter Human genes 0.000 description 2
- 102100035843 Zinc finger protein 460 Human genes 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 108010029483 alpha 1 Chain Collagen Type I Proteins 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 230000006907 apoptotic process Effects 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 230000033590 base-excision repair Effects 0.000 description 2
- 210000000941 bile Anatomy 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000004820 blood count Methods 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 210000002230 centromere Anatomy 0.000 description 2
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000004883 computer application Methods 0.000 description 2
- 230000001268 conjugating effect Effects 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 230000001973 epigenetic effect Effects 0.000 description 2
- 210000000632 euchromatin Anatomy 0.000 description 2
- 210000003722 extracellular fluid Anatomy 0.000 description 2
- 238000007667 floating Methods 0.000 description 2
- 230000008014 freezing Effects 0.000 description 2
- 238000007710 freezing Methods 0.000 description 2
- 108010004142 galactose-3-O-sulfotransferase 3 Proteins 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 210000005003 heart tissue Anatomy 0.000 description 2
- 210000004458 heterochromatin Anatomy 0.000 description 2
- 210000004251 human milk Anatomy 0.000 description 2
- 235000020256 human milk Nutrition 0.000 description 2
- 108010019691 inhibin beta A subunit Proteins 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 210000003593 megakaryocyte Anatomy 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 108091089992 miR-9-1 stem-loop Proteins 0.000 description 2
- 108091076838 miR-9-3 stem-loop Proteins 0.000 description 2
- 102000035118 modified proteins Human genes 0.000 description 2
- 108091005573 modified proteins Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 230000020520 nucleotide-excision repair Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000009054 pathological process Effects 0.000 description 2
- 239000013610 patient sample Substances 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 230000010412 perfusion Effects 0.000 description 2
- 210000005259 peripheral blood Anatomy 0.000 description 2
- 239000011886 peripheral blood Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 230000001902 propagating effect Effects 0.000 description 2
- 230000004850 protein–protein interaction Effects 0.000 description 2
- 210000003296 saliva Anatomy 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 210000000582 semen Anatomy 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 238000000528 statistical test Methods 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 238000012353 t test Methods 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 238000010798 ubiquitination Methods 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- YDRYQBCOLJPFFX-REOHCLBHSA-N (2r)-2-amino-3-(1,1,2,2-tetrafluoroethylsulfanyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CSC(F)(F)C(F)F YDRYQBCOLJPFFX-REOHCLBHSA-N 0.000 description 1
- 102100024341 10 kDa heat shock protein, mitochondrial Human genes 0.000 description 1
- 102100038794 17-beta-hydroxysteroid dehydrogenase type 6 Human genes 0.000 description 1
- 102100027621 2'-5'-oligoadenylate synthase 2 Human genes 0.000 description 1
- 102100035389 2'-5'-oligoadenylate synthase 3 Human genes 0.000 description 1
- 102100033260 2'-deoxynucleoside 5'-phosphate N-hydrolase 1 Human genes 0.000 description 1
- APHFXDBDLKPMTA-UHFFFAOYSA-N 2-(3-decanoyl-4,5,7-trihydroxynaphthalen-2-yl)acetic acid Chemical compound CCCCCCCCCC(=O)c1c(CC(O)=O)cc2cc(O)cc(O)c2c1O APHFXDBDLKPMTA-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 102100023817 26S proteasome complex subunit SEM1 Human genes 0.000 description 1
- 102100039082 3 beta-hydroxysteroid dehydrogenase/Delta 5->4-isomerase type 1 Human genes 0.000 description 1
- 102100040842 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Human genes 0.000 description 1
- 102100029077 3-hydroxy-3-methylglutaryl-coenzyme A reductase Human genes 0.000 description 1
- 102100026146 39S ribosomal protein L13, mitochondrial Human genes 0.000 description 1
- 102100026433 39S ribosomal protein L14, mitochondrial Human genes 0.000 description 1
- 102100027556 39S ribosomal protein L17, mitochondrial Human genes 0.000 description 1
- 102100037710 40S ribosomal protein S21 Human genes 0.000 description 1
- 102100024088 40S ribosomal protein S7 Human genes 0.000 description 1
- 102100036183 5'-3' exonuclease PLD4 Human genes 0.000 description 1
- 102100027493 5-hydroxytryptamine receptor 1D Human genes 0.000 description 1
- 102100036311 5-hydroxytryptamine receptor 1F Human genes 0.000 description 1
- 102100023990 60S ribosomal protein L17 Human genes 0.000 description 1
- 102100038008 60S ribosomal protein L22-like 1 Human genes 0.000 description 1
- 102100028348 60S ribosomal protein L26 Human genes 0.000 description 1
- 102100031002 60S ribosomal protein L36a Human genes 0.000 description 1
- 102100035988 60S ribosomal protein L39 Human genes 0.000 description 1
- 102100040623 60S ribosomal protein L41 Human genes 0.000 description 1
- 102100036512 7-dehydrocholesterol reductase Human genes 0.000 description 1
- 102100040077 A-kinase anchor protein 6 Human genes 0.000 description 1
- 102100040084 A-kinase anchor protein 9 Human genes 0.000 description 1
- 108091007507 ADAM12 Proteins 0.000 description 1
- 102100031585 ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Human genes 0.000 description 1
- 102100022911 ADP-ribosylation factor-like protein 17 Human genes 0.000 description 1
- 102100025678 APC membrane recruitment protein 2 Human genes 0.000 description 1
- 101710146210 APC membrane recruitment protein 2 Proteins 0.000 description 1
- 108010088547 ARNTL Transcription Factors Proteins 0.000 description 1
- 101150009379 AS1 gene Proteins 0.000 description 1
- 102100038511 AT-rich interactive domain-containing protein 3A Human genes 0.000 description 1
- 102100036618 ATP-binding cassette sub-family A member 13 Human genes 0.000 description 1
- 102100037129 ATP-binding cassette sub-family C member 11 Human genes 0.000 description 1
- 102100020969 ATP-binding cassette sub-family E member 1 Human genes 0.000 description 1
- 102100033106 ATP-binding cassette sub-family G member 5 Human genes 0.000 description 1
- 102100033092 ATP-binding cassette sub-family G member 8 Human genes 0.000 description 1
- 102100030089 ATP-dependent RNA helicase DHX8 Human genes 0.000 description 1
- 102100024825 ATPase MORC2 Human genes 0.000 description 1
- 102100039864 ATPase family AAA domain-containing protein 2 Human genes 0.000 description 1
- 102100028080 ATPase family AAA domain-containing protein 5 Human genes 0.000 description 1
- 102100022144 Achaete-scute homolog 2 Human genes 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- 102100033757 Acyl-coenzyme A thioesterase 11 Human genes 0.000 description 1
- 102100030959 Acyl-coenzyme A thioesterase MBLAC2 Human genes 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- 102100035990 Adenosine receptor A2a Human genes 0.000 description 1
- 102100036006 Adenosine receptor A3 Human genes 0.000 description 1
- 102100032157 Adenylate cyclase type 10 Human genes 0.000 description 1
- 102100031934 Adhesion G-protein coupled receptor G1 Human genes 0.000 description 1
- 208000006468 Adrenal Cortex Neoplasms Diseases 0.000 description 1
- 102000054930 Agouti-Related Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102100024092 Aldo-keto reductase family 1 member C4 Human genes 0.000 description 1
- 102100024086 Aldo-keto reductase family 1 member D1 Human genes 0.000 description 1
- 102100027714 Alpha-(1,3)-fucosyltransferase 10 Human genes 0.000 description 1
- 102100022712 Alpha-1-antitrypsin Human genes 0.000 description 1
- 102100021787 Alpha-2,8-sialyltransferase 8F Human genes 0.000 description 1
- 102100038778 Amphiregulin Human genes 0.000 description 1
- 102100031993 Amphoterin-induced protein 3 Human genes 0.000 description 1
- 102100034594 Angiopoietin-1 Human genes 0.000 description 1
- 102100034604 Angiopoietin-like protein 8 Human genes 0.000 description 1
- 102100027150 Ankyrin repeat and SAM domain-containing protein 4B Human genes 0.000 description 1
- 102100033898 Ankyrin repeat and SOCS box protein 18 Human genes 0.000 description 1
- 102100034282 Ankyrin repeat domain-containing protein 23 Human genes 0.000 description 1
- 102100034280 Ankyrin repeat domain-containing protein 26 Human genes 0.000 description 1
- 102100034564 Ankyrin repeat domain-containing protein 36A Human genes 0.000 description 1
- 102100034566 Ankyrin repeat domain-containing protein 36B Human genes 0.000 description 1
- 102100034565 Ankyrin repeat domain-containing protein 36C Human genes 0.000 description 1
- 102100031316 Ankyrin repeat domain-containing protein 63 Human genes 0.000 description 1
- 102100036830 Annexin A9 Human genes 0.000 description 1
- 102100031936 Anterior gradient protein 2 homolog Human genes 0.000 description 1
- 102100031930 Anterior gradient protein 3 Human genes 0.000 description 1
- 102100025511 Anti-Muellerian hormone type-2 receptor Human genes 0.000 description 1
- 108010032595 Antibody Binding Sites Proteins 0.000 description 1
- 102100032252 Antizyme inhibitor 2 Human genes 0.000 description 1
- 102100030942 Apolipoprotein A-II Human genes 0.000 description 1
- 102100029463 Aquaporin-8 Human genes 0.000 description 1
- 101100177160 Arabidopsis thaliana HAC2 gene Proteins 0.000 description 1
- 102100038238 Aromatic-L-amino-acid decarboxylase Human genes 0.000 description 1
- 102100026376 Artemin Human genes 0.000 description 1
- 102100021661 Aryl hydrocarbon receptor nuclear translocator-like protein 2 Human genes 0.000 description 1
- 241000501754 Astronotus ocellatus Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 102100027205 B-cell antigen receptor complex-associated protein alpha chain Human genes 0.000 description 1
- 102100027203 B-cell antigen receptor complex-associated protein beta chain Human genes 0.000 description 1
- 102100025218 B-cell differentiation antigen CD72 Human genes 0.000 description 1
- 102100035634 B-cell linker protein Human genes 0.000 description 1
- 102100024222 B-lymphocyte antigen CD19 Human genes 0.000 description 1
- 102100022005 B-lymphocyte antigen CD20 Human genes 0.000 description 1
- 102100027360 BAH and coiled-coil domain-containing protein 1 Human genes 0.000 description 1
- 102100021247 BCL-6 corepressor Human genes 0.000 description 1
- 102000036365 BRCA1 Human genes 0.000 description 1
- 108700020463 BRCA1 Proteins 0.000 description 1
- 101150072950 BRCA1 gene Proteins 0.000 description 1
- 102100021694 BTB/POZ domain-containing protein 17 Human genes 0.000 description 1
- 102100024288 BTB/POZ domain-containing protein 8 Human genes 0.000 description 1
- 102100022790 BTB/POZ domain-containing protein KCTD11 Human genes 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 102100021663 Baculoviral IAP repeat-containing protein 5 Human genes 0.000 description 1
- 102100024747 Band 4.1-like protein 1 Human genes 0.000 description 1
- 102100021896 Bcl-2-like protein 15 Human genes 0.000 description 1
- 102100039888 Beta-1,3-galactosyl-O-glycosyl-glycoprotein beta-1,6-N-acetylglucosaminyltransferase Human genes 0.000 description 1
- 102100020857 Beta-1,3-glucuronyltransferase LARGE2 Human genes 0.000 description 1
- 102100027386 Beta-1,4-galactosyltransferase 6 Human genes 0.000 description 1
- 102100030802 Beta-2-glycoprotein 1 Human genes 0.000 description 1
- 102100031006 Beta-Ala-His dipeptidase Human genes 0.000 description 1
- 102100027950 Bile acid-CoA:amino acid N-acyltransferase Human genes 0.000 description 1
- 102100028282 Bile salt export pump Human genes 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- 102100035631 Bloom syndrome protein Human genes 0.000 description 1
- 108091009167 Bloom syndrome protein Proteins 0.000 description 1
- 102100037086 Bone marrow stromal antigen 2 Human genes 0.000 description 1
- 102100024506 Bone morphogenetic protein 2 Human genes 0.000 description 1
- 102100031746 Bone sialoprotein 2 Human genes 0.000 description 1
- 102100028253 Breast cancer anti-estrogen resistance protein 3 Human genes 0.000 description 1
- 102100026435 Breast carcinoma-amplified sequence 4 Human genes 0.000 description 1
- 102100032312 Brevican core protein Human genes 0.000 description 1
- 102100029892 Bromodomain and WD repeat-containing protein 1 Human genes 0.000 description 1
- 102100021714 Bystin Human genes 0.000 description 1
- 102100023700 C-C motif chemokine 16 Human genes 0.000 description 1
- 102100023701 C-C motif chemokine 18 Human genes 0.000 description 1
- 102100036842 C-C motif chemokine 19 Human genes 0.000 description 1
- 102100028989 C-X-C chemokine receptor type 2 Human genes 0.000 description 1
- 102100031658 C-X-C chemokine receptor type 5 Human genes 0.000 description 1
- 102100036189 C-X-C motif chemokine 3 Human genes 0.000 description 1
- 102100032528 C-type lectin domain family 11 member A Human genes 0.000 description 1
- 102100023458 C-type lectin-like domain family 1 Human genes 0.000 description 1
- 102100032996 C5a anaphylatoxin chemotactic receptor 2 Human genes 0.000 description 1
- 102100024080 CASP8-associated protein 2 Human genes 0.000 description 1
- 102100038077 CD226 antigen Human genes 0.000 description 1
- 101150013553 CD40 gene Proteins 0.000 description 1
- 108010062802 CD66 antigens Proteins 0.000 description 1
- 102100025221 CD70 antigen Human genes 0.000 description 1
- 102100035793 CD83 antigen Human genes 0.000 description 1
- 108700020472 CDC20 Proteins 0.000 description 1
- 108010083123 CDX2 Transcription Factor Proteins 0.000 description 1
- 101150061077 CERS6 gene Proteins 0.000 description 1
- 102100038015 CHRNA7-FAM7A fusion protein Human genes 0.000 description 1
- 102100025659 Cadherin EGF LAG seven-pass G-type receptor 1 Human genes 0.000 description 1
- 102100024153 Cadherin-15 Human genes 0.000 description 1
- 102100024152 Cadherin-17 Human genes 0.000 description 1
- 102100039372 Calcium uniporter regulatory subunit MCUb, mitochondrial Human genes 0.000 description 1
- 102100020671 Calcium-transporting ATPase type 2C member 2 Human genes 0.000 description 1
- 102100032539 Calpain-3 Human genes 0.000 description 1
- 101000841393 Candida albicans Probable NADPH dehydrogenase Proteins 0.000 description 1
- 102100027668 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 Human genes 0.000 description 1
- 102100024533 Carcinoembryonic antigen-related cell adhesion molecule 1 Human genes 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 208000017897 Carcinoma of esophagus Diseases 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 101150023302 Cdc20 gene Proteins 0.000 description 1
- 102100026770 Cell cycle control protein 50B Human genes 0.000 description 1
- 102100038099 Cell division cycle protein 20 homolog Human genes 0.000 description 1
- 102100034231 Cell surface A33 antigen Human genes 0.000 description 1
- 102100038504 Cellular retinoic acid-binding protein 2 Human genes 0.000 description 1
- 102100031214 Centromere protein N Human genes 0.000 description 1
- 102100038125 Centromere protein Q Human genes 0.000 description 1
- 102100023310 Centrosomal protein of 128 kDa Human genes 0.000 description 1
- 101710170458 Centrosomal protein of 128 kDa Proteins 0.000 description 1
- 102100035434 Ceramide synthase 6 Human genes 0.000 description 1
- 102100038216 Charged multivesicular body protein 4c Human genes 0.000 description 1
- 102100025944 Chemokine-like protein TAFA-4 Human genes 0.000 description 1
- 108010066813 Chitinase-3-Like Protein 1 Proteins 0.000 description 1
- 102100038196 Chitinase-3-like protein 1 Human genes 0.000 description 1
- 102100038731 Chitinase-3-like protein 2 Human genes 0.000 description 1
- 101100324551 Chlamydomonas reinhardtii ARSA1 gene Proteins 0.000 description 1
- 102100023457 Chloride channel protein 1 Human genes 0.000 description 1
- 101150055427 Chmp4c gene Proteins 0.000 description 1
- 102100039497 Choline transporter-like protein 3 Human genes 0.000 description 1
- 102100039496 Choline transporter-like protein 4 Human genes 0.000 description 1
- 102100039505 Choline transporter-like protein 5 Human genes 0.000 description 1
- 102100032404 Cholinesterase Human genes 0.000 description 1
- 102100030499 Chorion-specific transcription factor GCMa Human genes 0.000 description 1
- 102100032918 Chromobox protein homolog 5 Human genes 0.000 description 1
- 102100038164 Chromodomain-helicase-DNA-binding protein 9 Human genes 0.000 description 1
- 206010008805 Chromosomal abnormalities Diseases 0.000 description 1
- 241001630118 Chrysomphalus bifasciculatus Species 0.000 description 1
- 102100039511 Chymotrypsin-C Human genes 0.000 description 1
- 102100023336 Chymotrypsin-like elastase family member 3B Human genes 0.000 description 1
- 102100039514 Chymotrypsinogen B2 Human genes 0.000 description 1
- 102100026328 Ciliogenesis and planar polarity effector 1 Human genes 0.000 description 1
- 102100031058 Clarin-3 Human genes 0.000 description 1
- 102100028736 Claudin-10 Human genes 0.000 description 1
- 102100039537 Claudin-14 Human genes 0.000 description 1
- 102100038445 Claudin-2 Human genes 0.000 description 1
- 102100040935 Claudin-20 Human genes 0.000 description 1
- 102100038423 Claudin-3 Human genes 0.000 description 1
- 102100026098 Claudin-7 Human genes 0.000 description 1
- 102100023173 Clavesin-2 Human genes 0.000 description 1
- 102100031045 Coiled-coil domain-containing protein 14 Human genes 0.000 description 1
- 102100025411 Coiled-coil domain-containing protein 144B Human genes 0.000 description 1
- 102100034963 Coiled-coil domain-containing protein 167 Human genes 0.000 description 1
- 102100031087 Coiled-coil domain-containing protein 18 Human genes 0.000 description 1
- 102100033825 Collagen alpha-1(XI) chain Human genes 0.000 description 1
- 102100037285 Collagen alpha-1(XXIV) chain Human genes 0.000 description 1
- 102100039551 Collagen triple helix repeat-containing protein 1 Human genes 0.000 description 1
- 102100025882 Complement C1q-like protein 3 Human genes 0.000 description 1
- 102100040132 Complement factor H-related protein 1 Human genes 0.000 description 1
- 102100035321 Complement factor H-related protein 3 Human genes 0.000 description 1
- 102100035324 Complement factor H-related protein 4 Human genes 0.000 description 1
- 102100025150 Complex III assembly factor LYRM7 Human genes 0.000 description 1
- 102100032952 Condensin complex subunit 3 Human genes 0.000 description 1
- 102100024342 Contactin-2 Human genes 0.000 description 1
- 102100040497 Contactin-associated protein-like 4 Human genes 0.000 description 1
- 102100032636 Copine-1 Human genes 0.000 description 1
- 102100032644 Copine-2 Human genes 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 102100041023 Coronin-2A Human genes 0.000 description 1
- 241001275954 Cortinarius caperatus Species 0.000 description 1
- 102100025278 Coxsackievirus and adenovirus receptor Human genes 0.000 description 1
- 108050006400 Cyclin Proteins 0.000 description 1
- 108010025464 Cyclin-Dependent Kinase 4 Proteins 0.000 description 1
- 108010025468 Cyclin-Dependent Kinase 6 Proteins 0.000 description 1
- 102000009503 Cyclin-Dependent Kinase Inhibitor p18 Human genes 0.000 description 1
- 108010009367 Cyclin-Dependent Kinase Inhibitor p18 Proteins 0.000 description 1
- 102000000577 Cyclin-Dependent Kinase Inhibitor p27 Human genes 0.000 description 1
- 108010016777 Cyclin-Dependent Kinase Inhibitor p27 Proteins 0.000 description 1
- 102100036252 Cyclin-dependent kinase 4 Human genes 0.000 description 1
- 102100026804 Cyclin-dependent kinase 6 Human genes 0.000 description 1
- 102100026278 Cysteine sulfinic acid decarboxylase Human genes 0.000 description 1
- 102100038695 Cysteine-rich secretory protein LCCL domain-containing 1 Human genes 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 102100035300 Cystine/glutamate transporter Human genes 0.000 description 1
- 102000009666 Cytochrome P-450 CYP2B6 Human genes 0.000 description 1
- 108010020070 Cytochrome P-450 CYP2B6 Proteins 0.000 description 1
- 108010000561 Cytochrome P-450 CYP2C8 Proteins 0.000 description 1
- 108010000543 Cytochrome P-450 CYP2C9 Proteins 0.000 description 1
- 102100036194 Cytochrome P450 2A6 Human genes 0.000 description 1
- 102100029359 Cytochrome P450 2C8 Human genes 0.000 description 1
- 102100029358 Cytochrome P450 2C9 Human genes 0.000 description 1
- 102100027422 Cytochrome P450 4A22 Human genes 0.000 description 1
- 102100030549 Cytochrome b5 type B Human genes 0.000 description 1
- 102100028181 Cytokine-like protein 1 Human genes 0.000 description 1
- 102100028630 Cytoskeleton-associated protein 2 Human genes 0.000 description 1
- 102100022768 D-beta-hydroxybutyrate dehydrogenase, mitochondrial Human genes 0.000 description 1
- 102100031515 D-ribitol-5-phosphate cytidylyltransferase Human genes 0.000 description 1
- 101700026669 DACH1 Proteins 0.000 description 1
- 108010014790 DAX-1 Orphan Nuclear Receptor Proteins 0.000 description 1
- 102100029021 DBIRD complex subunit ZNF326 Human genes 0.000 description 1
- 108091007703 DDX11-AS1 Proteins 0.000 description 1
- 102100025270 DENN domain-containing protein 4C Human genes 0.000 description 1
- 102100037753 DEP domain-containing protein 1A Human genes 0.000 description 1
- 102100024810 DNA (cytosine-5)-methyltransferase 3B Human genes 0.000 description 1
- 101710123222 DNA (cytosine-5)-methyltransferase 3B Proteins 0.000 description 1
- 102100040262 DNA dC->dU-editing enzyme APOBEC-3B Human genes 0.000 description 1
- 102100029133 DNA damage-induced apoptosis suppressor protein Human genes 0.000 description 1
- 108090000133 DNA helicases Proteins 0.000 description 1
- 102000003844 DNA helicases Human genes 0.000 description 1
- 102100034157 DNA mismatch repair protein Msh2 Human genes 0.000 description 1
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 1
- 102100022307 DNA polymerase alpha catalytic subunit Human genes 0.000 description 1
- 102100027830 DNA repair protein XRCC2 Human genes 0.000 description 1
- 102100033072 DNA replication ATP-dependent helicase DNA2 Human genes 0.000 description 1
- 102100036740 DNA replication complex GINS protein PSF3 Human genes 0.000 description 1
- 102100030380 DNA replication complex GINS protein SLD5 Human genes 0.000 description 1
- 102100033587 DNA topoisomerase 2-alpha Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 102100021045 DNA-binding protein RFX7 Human genes 0.000 description 1
- 102100021040 DNA-binding protein RFX8 Human genes 0.000 description 1
- 102100032883 DNA-binding protein SATB2 Human genes 0.000 description 1
- 102100039436 DNA-binding protein inhibitor ID-3 Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102100027491 DNA-directed RNA polymerase I subunit RPA43 Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 102100028735 Dachshund homolog 1 Human genes 0.000 description 1
- 102100036727 Deformed epidermal autoregulatory factor 1 homolog Human genes 0.000 description 1
- 102100036466 Delta-like protein 3 Human genes 0.000 description 1
- 102100029792 Dentin sialophosphoprotein Human genes 0.000 description 1
- 102100039487 Deoxyhypusine hydroxylase Human genes 0.000 description 1
- 102100037458 Dephospho-CoA kinase Human genes 0.000 description 1
- 102100032302 Diphosphoinositol polyphosphate phosphohydrolase NUDT4B Human genes 0.000 description 1
- 102100031112 Disintegrin and metalloproteinase domain-containing protein 12 Human genes 0.000 description 1
- 102100022258 Disks large homolog 5 Human genes 0.000 description 1
- 102100040381 Dol-P-Glc:Glc(2)Man(9)GlcNAc(2)-PP-Dol alpha-1,2-glucosyltransferase Human genes 0.000 description 1
- 102100035316 Doublesex- and mab-3-related transcription factor A2 Human genes 0.000 description 1
- 102100021074 Dynactin subunit 4 Human genes 0.000 description 1
- 102100023945 Dynein light chain Tctex-type 5 Human genes 0.000 description 1
- 102100032253 Dystrotelin Human genes 0.000 description 1
- 102100022867 E3 SUMO-protein ligase KIAA1586 Human genes 0.000 description 1
- 102100022183 E3 ubiquitin-protein ligase MIB1 Human genes 0.000 description 1
- 102100029944 E3 ubiquitin-protein ligase SHPRH Human genes 0.000 description 1
- 108010008796 ELAV-Like Protein 3 Proteins 0.000 description 1
- 102100021664 ELAV-like protein 3 Human genes 0.000 description 1
- 102000012804 EPCAM Human genes 0.000 description 1
- 101150084967 EPCAM gene Proteins 0.000 description 1
- 102100021558 ER lumen protein-retaining receptor 3 Human genes 0.000 description 1
- 102100036449 Early lymphoid activation gene protein Human genes 0.000 description 1
- 102100035087 Ectoderm-neural cortex protein 1 Human genes 0.000 description 1
- 102100036516 Ectonucleoside triphosphate diphosphohydrolase 7 Human genes 0.000 description 1
- 102100040465 Elongation factor 1-beta Human genes 0.000 description 1
- 102100040590 Endogenous Bornavirus-like nucleoprotein 2 Human genes 0.000 description 1
- 102100027847 Endonuclease ZRANB3 Human genes 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 102100030377 Endoplasmic reticulum metallopeptidase 1 Human genes 0.000 description 1
- 102100030011 Endoribonuclease Human genes 0.000 description 1
- 102100032460 Ensconsin Human genes 0.000 description 1
- 102100021600 Ephrin type-A receptor 10 Human genes 0.000 description 1
- 102100036443 Epiplakin Human genes 0.000 description 1
- 102100036725 Epithelial discoidin domain-containing receptor 1 Human genes 0.000 description 1
- 101710131668 Epithelial discoidin domain-containing receptor 1 Proteins 0.000 description 1
- 102100025403 Epoxide hydrolase 1 Human genes 0.000 description 1
- 102100039354 Epoxide hydrolase 4 Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- 102100038595 Estrogen receptor Human genes 0.000 description 1
- 102100032833 Exportin-4 Human genes 0.000 description 1
- 108010007457 Extracellular Signal-Regulated MAP Kinases Proteins 0.000 description 1
- 101710205374 Extracellular elastase Proteins 0.000 description 1
- 102100037789 FAST kinase domain-containing protein 1, mitochondrial Human genes 0.000 description 1
- 102100035438 FRAS1-related extracellular matrix protein 3 Human genes 0.000 description 1
- 102100027285 Fanconi anemia group B protein Human genes 0.000 description 1
- 102100034553 Fanconi anemia group J protein Human genes 0.000 description 1
- 102100037679 Fasciculation and elongation protein zeta-2 Human genes 0.000 description 1
- 102100036089 Fascin Human genes 0.000 description 1
- 102100031381 Fc receptor-like A Human genes 0.000 description 1
- 102100029104 Ferroxidase HEPHL1 Human genes 0.000 description 1
- 102100028044 Fetuin-B Human genes 0.000 description 1
- 102100038664 Fibrinogen-like protein 1 Human genes 0.000 description 1
- 102100040991 Fibronectin type III domain-containing protein 10 Human genes 0.000 description 1
- 206010016654 Fibrosis Diseases 0.000 description 1
- 102100037000 Fidgetin-like protein 1 Human genes 0.000 description 1
- 102100026121 Flap endonuclease 1 Human genes 0.000 description 1
- 108090000652 Flap endonucleases Proteins 0.000 description 1
- 108010008599 Forkhead Box Protein M1 Proteins 0.000 description 1
- 102100023374 Forkhead box protein M1 Human genes 0.000 description 1
- 102100027581 Forkhead box protein P3 Human genes 0.000 description 1
- 108090000123 Fos-related antigen 1 Proteins 0.000 description 1
- 102000003817 Fos-related antigen 1 Human genes 0.000 description 1
- 102100022642 Fructose-2,6-bisphosphatase Human genes 0.000 description 1
- 102100022272 Fructose-bisphosphate aldolase B Human genes 0.000 description 1
- 108020000949 Fungal DNA Proteins 0.000 description 1
- 102100041035 G-protein coupled receptor 151 Human genes 0.000 description 1
- 102100030279 G-protein coupled receptor 35 Human genes 0.000 description 1
- 102100033046 G-protein coupled receptor 52 Human genes 0.000 description 1
- 102100037854 G1/S-specific cyclin-E2 Human genes 0.000 description 1
- 102100037488 G2 and S phase-expressed protein 1 Human genes 0.000 description 1
- 102100033821 GDP-Man:Man(3)GlcNAc(2)-PP-Dol alpha-1,2-mannosyltransferase Human genes 0.000 description 1
- 108010062427 GDP-mannose 4,6-dehydratase Proteins 0.000 description 1
- 102000002312 GDPmannose 4,6-dehydratase Human genes 0.000 description 1
- 102100024405 GPI-linked NAD(P)(+)-arginine ADP-ribosyltransferase 1 Human genes 0.000 description 1
- 102100036838 GRAM domain-containing protein 2B Human genes 0.000 description 1
- 102100028464 Galactose-3-O-sulfotransferase 4 Human genes 0.000 description 1
- 102100022898 Galactoside-binding soluble lectin 13 Human genes 0.000 description 1
- 108010001515 Galectin 4 Proteins 0.000 description 1
- 102000000805 Galectin 4 Human genes 0.000 description 1
- 101001077417 Gallus gallus Potassium voltage-gated channel subfamily H member 6 Proteins 0.000 description 1
- 102100021023 Gamma-glutamyl hydrolase Human genes 0.000 description 1
- 102100040004 Gamma-glutamylcyclotransferase Human genes 0.000 description 1
- 102100025623 Gap junction delta-2 protein Human genes 0.000 description 1
- 102100025624 Gap junction delta-3 protein Human genes 0.000 description 1
- 102100037386 Gasdermin-C Human genes 0.000 description 1
- 208000015872 Gaucher disease Diseases 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 102100037390 Genetic suppressor element 1 Human genes 0.000 description 1
- 102100030479 Germinal center-associated signaling and motility protein Human genes 0.000 description 1
- 102100039719 Germinal center-associated signaling and motility-like protein Human genes 0.000 description 1
- 102100036769 Girdin Human genes 0.000 description 1
- 102100025894 Glomulin Human genes 0.000 description 1
- 102100038676 Glutamine and serine-rich protein 1 Human genes 0.000 description 1
- 102100033305 Glutathione S-transferase A3 Human genes 0.000 description 1
- 102100039272 Glycine N-acyltransferase-like protein 1 Human genes 0.000 description 1
- 102100023177 Glycoprotein endo-alpha-1,2-mannosidase Human genes 0.000 description 1
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 1
- 102100021186 Granulysin Human genes 0.000 description 1
- 102100022087 Granzyme M Human genes 0.000 description 1
- 102100026828 Group IID secretory phospholipase A2 Human genes 0.000 description 1
- 102100036685 Growth arrest-specific protein 2 Human genes 0.000 description 1
- 102100035911 Guanine nucleotide-binding protein G(T) subunit gamma-T1 Human genes 0.000 description 1
- 102100040754 Guanylate cyclase soluble subunit alpha-1 Human genes 0.000 description 1
- 102100035688 Guanylate-binding protein 1 Human genes 0.000 description 1
- 102100028533 Guanylate-binding protein 7 Human genes 0.000 description 1
- 102100022662 Guanylyl cyclase C Human genes 0.000 description 1
- 102100039334 HAUS augmin-like complex subunit 1 Human genes 0.000 description 1
- 102100030493 HEAT repeat-containing protein 1 Human genes 0.000 description 1
- 102100031547 HLA class II histocompatibility antigen, DO alpha chain Human genes 0.000 description 1
- 108010009907 HLA-DRB6 antigen Proteins 0.000 description 1
- 108700039143 HMGA2 Proteins 0.000 description 1
- 208000010496 Heart Arrest Diseases 0.000 description 1
- 102100022191 Hemogen Human genes 0.000 description 1
- 102100037532 Heparan sulfate N-sulfotransferase 4 Human genes 0.000 description 1
- 102100030500 Heparin cofactor 2 Human genes 0.000 description 1
- 238000012752 Hepatectomy Methods 0.000 description 1
- 102100029283 Hepatocyte nuclear factor 3-alpha Human genes 0.000 description 1
- 102100029274 Hexokinase HKDC1 Human genes 0.000 description 1
- 102100038006 High affinity immunoglobulin epsilon receptor subunit alpha Human genes 0.000 description 1
- 102100028999 High mobility group protein HMGI-C Human genes 0.000 description 1
- 102100022653 Histone H1.5 Human genes 0.000 description 1
- 102100039849 Histone H2A type 1 Human genes 0.000 description 1
- 102100033070 Histone acetyltransferase KAT6B Human genes 0.000 description 1
- 102100022103 Histone-lysine N-methyltransferase 2A Human genes 0.000 description 1
- 101150073387 Hmga2 gene Proteins 0.000 description 1
- 102100022107 Holliday junction recognition protein Human genes 0.000 description 1
- 102100031671 Homeobox protein CDX-2 Human genes 0.000 description 1
- 102100021090 Homeobox protein Hox-A9 Human genes 0.000 description 1
- 102100029423 Homeobox protein Hox-B8 Human genes 0.000 description 1
- 102100029433 Homeobox protein Hox-B9 Human genes 0.000 description 1
- 102100024208 Homeobox protein MIXL1 Human genes 0.000 description 1
- 102100030641 Homeobox protein orthopedia Human genes 0.000 description 1
- 101000980303 Homo sapiens 10 kDa heat shock protein, mitochondrial Proteins 0.000 description 1
- 101001031333 Homo sapiens 17-beta-hydroxysteroid dehydrogenase type 6 Proteins 0.000 description 1
- 101001008910 Homo sapiens 2'-5'-oligoadenylate synthase 2 Proteins 0.000 description 1
- 101000597332 Homo sapiens 2'-5'-oligoadenylate synthase 3 Proteins 0.000 description 1
- 101000927689 Homo sapiens 2'-deoxynucleoside 5'-phosphate N-hydrolase 1 Proteins 0.000 description 1
- 101000684297 Homo sapiens 26S proteasome complex subunit SEM1 Proteins 0.000 description 1
- 101000744065 Homo sapiens 3 beta-hydroxysteroid dehydrogenase/Delta 5->4-isomerase type 1 Proteins 0.000 description 1
- 101000866618 Homo sapiens 3-beta-hydroxysteroid-Delta(8),Delta(7)-isomerase Proteins 0.000 description 1
- 101000893701 Homo sapiens 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Proteins 0.000 description 1
- 101000988577 Homo sapiens 3-hydroxy-3-methylglutaryl-coenzyme A reductase Proteins 0.000 description 1
- 101000691550 Homo sapiens 39S ribosomal protein L13, mitochondrial Proteins 0.000 description 1
- 101000692875 Homo sapiens 39S ribosomal protein L14, mitochondrial Proteins 0.000 description 1
- 101000650291 Homo sapiens 39S ribosomal protein L17, mitochondrial Proteins 0.000 description 1
- 101001097814 Homo sapiens 40S ribosomal protein S21 Proteins 0.000 description 1
- 101000690200 Homo sapiens 40S ribosomal protein S7 Proteins 0.000 description 1
- 101001074382 Homo sapiens 5'-3' exonuclease PLD4 Proteins 0.000 description 1
- 101000724739 Homo sapiens 5-hydroxytryptamine receptor 1D Proteins 0.000 description 1
- 101000783605 Homo sapiens 5-hydroxytryptamine receptor 1F Proteins 0.000 description 1
- 101000682512 Homo sapiens 60S ribosomal protein L17 Proteins 0.000 description 1
- 101000661567 Homo sapiens 60S ribosomal protein L22-like 1 Proteins 0.000 description 1
- 101001080179 Homo sapiens 60S ribosomal protein L26 Proteins 0.000 description 1
- 101001127203 Homo sapiens 60S ribosomal protein L36a Proteins 0.000 description 1
- 101001127258 Homo sapiens 60S ribosomal protein L36a-like Proteins 0.000 description 1
- 101000716179 Homo sapiens 60S ribosomal protein L39 Proteins 0.000 description 1
- 101000674326 Homo sapiens 60S ribosomal protein L41 Proteins 0.000 description 1
- 101000928720 Homo sapiens 7-dehydrocholesterol reductase Proteins 0.000 description 1
- 101000890611 Homo sapiens A-kinase anchor protein 6 Proteins 0.000 description 1
- 101000890598 Homo sapiens A-kinase anchor protein 9 Proteins 0.000 description 1
- 101000777636 Homo sapiens ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Proteins 0.000 description 1
- 101000974511 Homo sapiens ADP-ribosylation factor-like protein 17 Proteins 0.000 description 1
- 101000808887 Homo sapiens AT-rich interactive domain-containing protein 3A Proteins 0.000 description 1
- 101000929660 Homo sapiens ATP-binding cassette sub-family A member 13 Proteins 0.000 description 1
- 101001029057 Homo sapiens ATP-binding cassette sub-family C member 11 Proteins 0.000 description 1
- 101000783786 Homo sapiens ATP-binding cassette sub-family E member 1 Proteins 0.000 description 1
- 101000864666 Homo sapiens ATP-dependent RNA helicase DHX8 Proteins 0.000 description 1
- 101001051808 Homo sapiens ATPase MORC2 Proteins 0.000 description 1
- 101000887284 Homo sapiens ATPase family AAA domain-containing protein 2 Proteins 0.000 description 1
- 101000789829 Homo sapiens ATPase family AAA domain-containing protein 5 Proteins 0.000 description 1
- 101000901109 Homo sapiens Achaete-scute homolog 2 Proteins 0.000 description 1
- 101000801224 Homo sapiens Acyl-coenzyme A thioesterase 11 Proteins 0.000 description 1
- 101000583826 Homo sapiens Acyl-coenzyme A thioesterase MBLAC2 Proteins 0.000 description 1
- 101000783751 Homo sapiens Adenosine receptor A2a Proteins 0.000 description 1
- 101000783645 Homo sapiens Adenosine receptor A3 Proteins 0.000 description 1
- 101000775498 Homo sapiens Adenylate cyclase type 10 Proteins 0.000 description 1
- 101000775042 Homo sapiens Adhesion G-protein coupled receptor G1 Proteins 0.000 description 1
- 101000690301 Homo sapiens Aldo-keto reductase family 1 member C4 Proteins 0.000 description 1
- 101000690251 Homo sapiens Aldo-keto reductase family 1 member D1 Proteins 0.000 description 1
- 101000862183 Homo sapiens Alpha-(1,3)-fucosyltransferase 10 Proteins 0.000 description 1
- 101000823116 Homo sapiens Alpha-1-antitrypsin Proteins 0.000 description 1
- 101000616701 Homo sapiens Alpha-2,8-sialyltransferase 8F Proteins 0.000 description 1
- 101000809450 Homo sapiens Amphiregulin Proteins 0.000 description 1
- 101000776186 Homo sapiens Amphoterin-induced protein 3 Proteins 0.000 description 1
- 101000924552 Homo sapiens Angiopoietin-1 Proteins 0.000 description 1
- 101000924544 Homo sapiens Angiopoietin-like protein 8 Proteins 0.000 description 1
- 101000694601 Homo sapiens Ankyrin repeat and SAM domain-containing protein 4B Proteins 0.000 description 1
- 101000925502 Homo sapiens Ankyrin repeat and SOCS box protein 18 Proteins 0.000 description 1
- 101000780120 Homo sapiens Ankyrin repeat domain-containing protein 23 Proteins 0.000 description 1
- 101000780116 Homo sapiens Ankyrin repeat domain-containing protein 26 Proteins 0.000 description 1
- 101000924343 Homo sapiens Ankyrin repeat domain-containing protein 36A Proteins 0.000 description 1
- 101000924345 Homo sapiens Ankyrin repeat domain-containing protein 36B Proteins 0.000 description 1
- 101000924344 Homo sapiens Ankyrin repeat domain-containing protein 36C Proteins 0.000 description 1
- 101000796078 Homo sapiens Ankyrin repeat domain-containing protein 63 Proteins 0.000 description 1
- 101000928294 Homo sapiens Annexin A9 Proteins 0.000 description 1
- 101000775021 Homo sapiens Anterior gradient protein 2 homolog Proteins 0.000 description 1
- 101000775037 Homo sapiens Anterior gradient protein 3 Proteins 0.000 description 1
- 101000693801 Homo sapiens Anti-Muellerian hormone type-2 receptor Proteins 0.000 description 1
- 101000793406 Homo sapiens Apolipoprotein A-II Proteins 0.000 description 1
- 101000771417 Homo sapiens Aquaporin-8 Proteins 0.000 description 1
- 101000785776 Homo sapiens Artemin Proteins 0.000 description 1
- 101000884385 Homo sapiens Arylamine N-acetyltransferase 1 Proteins 0.000 description 1
- 101000798902 Homo sapiens Atypical chemokine receptor 4 Proteins 0.000 description 1
- 101000914489 Homo sapiens B-cell antigen receptor complex-associated protein alpha chain Proteins 0.000 description 1
- 101000914491 Homo sapiens B-cell antigen receptor complex-associated protein beta chain Proteins 0.000 description 1
- 101000934359 Homo sapiens B-cell differentiation antigen CD72 Proteins 0.000 description 1
- 101000803266 Homo sapiens B-cell linker protein Proteins 0.000 description 1
- 101000980825 Homo sapiens B-lymphocyte antigen CD19 Proteins 0.000 description 1
- 101000897405 Homo sapiens B-lymphocyte antigen CD20 Proteins 0.000 description 1
- 101000937836 Homo sapiens BAH and coiled-coil domain-containing protein 1 Proteins 0.000 description 1
- 101100165236 Homo sapiens BCOR gene Proteins 0.000 description 1
- 101000896707 Homo sapiens BTB/POZ domain-containing protein 17 Proteins 0.000 description 1
- 101000761857 Homo sapiens BTB/POZ domain-containing protein 8 Proteins 0.000 description 1
- 101000974815 Homo sapiens BTB/POZ domain-containing protein KCTD11 Proteins 0.000 description 1
- 101000971075 Homo sapiens Bcl-2-like protein 15 Proteins 0.000 description 1
- 101000887645 Homo sapiens Beta-1,3-galactosyl-O-glycosyl-glycoprotein beta-1,6-N-acetylglucosaminyltransferase Proteins 0.000 description 1
- 101001138033 Homo sapiens Beta-1,3-glucuronyltransferase LARGE2 Proteins 0.000 description 1
- 101000937502 Homo sapiens Beta-1,4-galactosyltransferase 6 Proteins 0.000 description 1
- 101000793425 Homo sapiens Beta-2-glycoprotein 1 Proteins 0.000 description 1
- 101000919694 Homo sapiens Beta-Ala-His dipeptidase Proteins 0.000 description 1
- 101000765010 Homo sapiens Beta-galactosidase Proteins 0.000 description 1
- 101000697858 Homo sapiens Bile acid-CoA:amino acid N-acyltransferase Proteins 0.000 description 1
- 101000740785 Homo sapiens Bone marrow stromal antigen 2 Proteins 0.000 description 1
- 101000762366 Homo sapiens Bone morphogenetic protein 2 Proteins 0.000 description 1
- 101000707248 Homo sapiens Bone sialoprotein 2 Proteins 0.000 description 1
- 101000935648 Homo sapiens Breast cancer anti-estrogen resistance protein 3 Proteins 0.000 description 1
- 101000766275 Homo sapiens Breast carcinoma-amplified sequence 4 Proteins 0.000 description 1
- 101000731086 Homo sapiens Brevican core protein Proteins 0.000 description 1
- 101000794040 Homo sapiens Bromodomain and WD repeat-containing protein 1 Proteins 0.000 description 1
- 101000896419 Homo sapiens Bystin Proteins 0.000 description 1
- 101000716070 Homo sapiens C-C chemokine receptor type 9 Proteins 0.000 description 1
- 101000978375 Homo sapiens C-C motif chemokine 16 Proteins 0.000 description 1
- 101000978371 Homo sapiens C-C motif chemokine 18 Proteins 0.000 description 1
- 101000713106 Homo sapiens C-C motif chemokine 19 Proteins 0.000 description 1
- 101000922405 Homo sapiens C-X-C chemokine receptor type 5 Proteins 0.000 description 1
- 101000947193 Homo sapiens C-X-C motif chemokine 3 Proteins 0.000 description 1
- 101000942297 Homo sapiens C-type lectin domain family 11 member A Proteins 0.000 description 1
- 101000906643 Homo sapiens C-type lectin-like domain family 1 Proteins 0.000 description 1
- 101000868001 Homo sapiens C5a anaphylatoxin chemotactic receptor 2 Proteins 0.000 description 1
- 101000910382 Homo sapiens CASP8-associated protein 2 Proteins 0.000 description 1
- 101000934356 Homo sapiens CD70 antigen Proteins 0.000 description 1
- 101000946856 Homo sapiens CD83 antigen Proteins 0.000 description 1
- 101000878673 Homo sapiens CHRNA7-FAM7A fusion protein Proteins 0.000 description 1
- 101000914155 Homo sapiens Cadherin EGF LAG seven-pass G-type receptor 1 Proteins 0.000 description 1
- 101000762242 Homo sapiens Cadherin-15 Proteins 0.000 description 1
- 101000762247 Homo sapiens Cadherin-17 Proteins 0.000 description 1
- 101000714553 Homo sapiens Cadherin-3 Proteins 0.000 description 1
- 101000961406 Homo sapiens Calcium uniporter regulatory subunit MCUb, mitochondrial Proteins 0.000 description 1
- 101000785236 Homo sapiens Calcium-transporting ATPase type 2C member 2 Proteins 0.000 description 1
- 101000867715 Homo sapiens Calpain-3 Proteins 0.000 description 1
- 101000725945 Homo sapiens Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 Proteins 0.000 description 1
- 101000910843 Homo sapiens Carboxypeptidase N catalytic chain Proteins 0.000 description 1
- 101000909153 Homo sapiens Carboxypeptidase N subunit 2 Proteins 0.000 description 1
- 101000910820 Homo sapiens Cell cycle control protein 50B Proteins 0.000 description 1
- 101000996823 Homo sapiens Cell surface A33 antigen Proteins 0.000 description 1
- 101001099851 Homo sapiens Cellular retinoic acid-binding protein 2 Proteins 0.000 description 1
- 101000776412 Homo sapiens Centromere protein N Proteins 0.000 description 1
- 101000884553 Homo sapiens Centromere protein Q Proteins 0.000 description 1
- 101000788132 Homo sapiens Chemokine-like protein TAFA-4 Proteins 0.000 description 1
- 101000883325 Homo sapiens Chitinase-3-like protein 2 Proteins 0.000 description 1
- 101000906651 Homo sapiens Chloride channel protein 1 Proteins 0.000 description 1
- 101000943274 Homo sapiens Cholinesterase Proteins 0.000 description 1
- 101000862639 Homo sapiens Chorion-specific transcription factor GCMa Proteins 0.000 description 1
- 101000797581 Homo sapiens Chromobox protein homolog 5 Proteins 0.000 description 1
- 101000883548 Homo sapiens Chromodomain-helicase-DNA-binding protein 9 Proteins 0.000 description 1
- 101000889306 Homo sapiens Chymotrypsin-C Proteins 0.000 description 1
- 101000907951 Homo sapiens Chymotrypsin-like elastase family member 3B Proteins 0.000 description 1
- 101000889299 Homo sapiens Chymotrypsinogen B2 Proteins 0.000 description 1
- 101000855375 Homo sapiens Ciliogenesis and planar polarity effector 1 Proteins 0.000 description 1
- 101000992976 Homo sapiens Clarin-3 Proteins 0.000 description 1
- 101000766993 Homo sapiens Claudin-10 Proteins 0.000 description 1
- 101000888570 Homo sapiens Claudin-14 Proteins 0.000 description 1
- 101000882901 Homo sapiens Claudin-2 Proteins 0.000 description 1
- 101000749339 Homo sapiens Claudin-20 Proteins 0.000 description 1
- 101000882908 Homo sapiens Claudin-3 Proteins 0.000 description 1
- 101000912652 Homo sapiens Claudin-7 Proteins 0.000 description 1
- 101000907131 Homo sapiens Clavesin-2 Proteins 0.000 description 1
- 101000777340 Homo sapiens Coiled-coil domain-containing protein 14 Proteins 0.000 description 1
- 101000934952 Homo sapiens Coiled-coil domain-containing protein 144B Proteins 0.000 description 1
- 101000946681 Homo sapiens Coiled-coil domain-containing protein 167 Proteins 0.000 description 1
- 101000777411 Homo sapiens Coiled-coil domain-containing protein 18 Proteins 0.000 description 1
- 101000710623 Homo sapiens Collagen alpha-1(XI) chain Proteins 0.000 description 1
- 101000952969 Homo sapiens Collagen alpha-1(XXIV) chain Proteins 0.000 description 1
- 101000746121 Homo sapiens Collagen triple helix repeat-containing protein 1 Proteins 0.000 description 1
- 101000933641 Homo sapiens Complement C1q-like protein 3 Proteins 0.000 description 1
- 101000890732 Homo sapiens Complement factor H-related protein 1 Proteins 0.000 description 1
- 101000878136 Homo sapiens Complement factor H-related protein 3 Proteins 0.000 description 1
- 101000878133 Homo sapiens Complement factor H-related protein 4 Proteins 0.000 description 1
- 101001005524 Homo sapiens Complex III assembly factor LYRM7 Proteins 0.000 description 1
- 101000942622 Homo sapiens Condensin complex subunit 3 Proteins 0.000 description 1
- 101000909516 Homo sapiens Contactin-2 Proteins 0.000 description 1
- 101000749880 Homo sapiens Contactin-associated protein-like 4 Proteins 0.000 description 1
- 101000941754 Homo sapiens Copine-1 Proteins 0.000 description 1
- 101000941777 Homo sapiens Copine-2 Proteins 0.000 description 1
- 101000748858 Homo sapiens Coronin-2A Proteins 0.000 description 1
- 101000858031 Homo sapiens Coxsackievirus and adenovirus receptor Proteins 0.000 description 1
- 101000855583 Homo sapiens Cysteine sulfinic acid decarboxylase Proteins 0.000 description 1
- 101000957711 Homo sapiens Cysteine-rich secretory protein LCCL domain-containing 1 Proteins 0.000 description 1
- 101000761960 Homo sapiens Cytochrome P450 11B1, mitochondrial Proteins 0.000 description 1
- 101000761956 Homo sapiens Cytochrome P450 11B2, mitochondrial Proteins 0.000 description 1
- 101000875170 Homo sapiens Cytochrome P450 2A6 Proteins 0.000 description 1
- 101000725117 Homo sapiens Cytochrome P450 4A22 Proteins 0.000 description 1
- 101000726631 Homo sapiens Cytochrome b5 type B Proteins 0.000 description 1
- 101000916671 Homo sapiens Cytokine-like protein 1 Proteins 0.000 description 1
- 101000766848 Homo sapiens Cytoskeleton-associated protein 2 Proteins 0.000 description 1
- 101000903373 Homo sapiens D-beta-hydroxybutyrate dehydrogenase, mitochondrial Proteins 0.000 description 1
- 101000994204 Homo sapiens D-ribitol-5-phosphate cytidylyltransferase Proteins 0.000 description 1
- 101000915665 Homo sapiens DBIRD complex subunit ZNF326 Proteins 0.000 description 1
- 101000722273 Homo sapiens DENN domain-containing protein 4C Proteins 0.000 description 1
- 101000950642 Homo sapiens DEP domain-containing protein 1A Proteins 0.000 description 1
- 101000964385 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3B Proteins 0.000 description 1
- 101000918646 Homo sapiens DNA damage-induced apoptosis suppressor protein Proteins 0.000 description 1
- 101001134036 Homo sapiens DNA mismatch repair protein Msh2 Proteins 0.000 description 1
- 101000800646 Homo sapiens DNA nucleotidylexotransferase Proteins 0.000 description 1
- 101000902558 Homo sapiens DNA polymerase alpha catalytic subunit Proteins 0.000 description 1
- 101000649306 Homo sapiens DNA repair protein XRCC2 Proteins 0.000 description 1
- 101000927313 Homo sapiens DNA replication ATP-dependent helicase DNA2 Proteins 0.000 description 1
- 101001136564 Homo sapiens DNA replication complex GINS protein PSF3 Proteins 0.000 description 1
- 101000702463 Homo sapiens DNA replication complex GINS protein SLD5 Proteins 0.000 description 1
- 101001075459 Homo sapiens DNA-binding protein RFX7 Proteins 0.000 description 1
- 101001075468 Homo sapiens DNA-binding protein RFX8 Proteins 0.000 description 1
- 101000655236 Homo sapiens DNA-binding protein SATB2 Proteins 0.000 description 1
- 101001036287 Homo sapiens DNA-binding protein inhibitor ID-3 Proteins 0.000 description 1
- 101000650570 Homo sapiens DNA-directed RNA polymerase I subunit RPA43 Proteins 0.000 description 1
- 101000929421 Homo sapiens Deformed epidermal autoregulatory factor 1 homolog Proteins 0.000 description 1
- 101000928513 Homo sapiens Delta-like protein 3 Proteins 0.000 description 1
- 101000865404 Homo sapiens Dentin sialophosphoprotein Proteins 0.000 description 1
- 101000590225 Homo sapiens Diphosphoinositol polyphosphate phosphohydrolase NUDT4B Proteins 0.000 description 1
- 101000902114 Homo sapiens Disks large homolog 5 Proteins 0.000 description 1
- 101000890957 Homo sapiens Dol-P-Glc:Glc(2)Man(9)GlcNAc(2)-PP-Dol alpha-1,2-glucosyltransferase Proteins 0.000 description 1
- 101000949728 Homo sapiens Doublesex- and mab-3-related transcription factor A2 Proteins 0.000 description 1
- 101001041189 Homo sapiens Dynactin subunit 4 Proteins 0.000 description 1
- 101000904021 Homo sapiens Dynein light chain Tctex-type 5 Proteins 0.000 description 1
- 101001016173 Homo sapiens Dystrotelin Proteins 0.000 description 1
- 101000973503 Homo sapiens E3 ubiquitin-protein ligase MIB1 Proteins 0.000 description 1
- 101000863869 Homo sapiens E3 ubiquitin-protein ligase SHPRH Proteins 0.000 description 1
- 101000898776 Homo sapiens ER lumen protein-retaining receptor 3 Proteins 0.000 description 1
- 101000851938 Homo sapiens Early lymphoid activation gene protein Proteins 0.000 description 1
- 101000877456 Homo sapiens Ectoderm-neural cortex protein 1 Proteins 0.000 description 1
- 101000852006 Homo sapiens Ectonucleoside triphosphate diphosphohydrolase 7 Proteins 0.000 description 1
- 101000967447 Homo sapiens Elongation factor 1-beta Proteins 0.000 description 1
- 101000966722 Homo sapiens Endogenous Bornavirus-like nucleoprotein 2 Proteins 0.000 description 1
- 101000723417 Homo sapiens Endonuclease ZRANB3 Proteins 0.000 description 1
- 101001063315 Homo sapiens Endoplasmic reticulum metallopeptidase 1 Proteins 0.000 description 1
- 101001010787 Homo sapiens Endoribonuclease Proteins 0.000 description 1
- 101001016782 Homo sapiens Ensconsin Proteins 0.000 description 1
- 101000898673 Homo sapiens Ephrin type-A receptor 10 Proteins 0.000 description 1
- 101000851943 Homo sapiens Epiplakin Proteins 0.000 description 1
- 101001077852 Homo sapiens Epoxide hydrolase 1 Proteins 0.000 description 1
- 101000812405 Homo sapiens Epoxide hydrolase 4 Proteins 0.000 description 1
- 101000882584 Homo sapiens Estrogen receptor Proteins 0.000 description 1
- 101001034811 Homo sapiens Eukaryotic translation initiation factor 4 gamma 2 Proteins 0.000 description 1
- 101000847062 Homo sapiens Exportin-4 Proteins 0.000 description 1
- 101000878541 Homo sapiens FAST kinase domain-containing protein 1, mitochondrial Proteins 0.000 description 1
- 101000877877 Homo sapiens FRAS1-related extracellular matrix protein 3 Proteins 0.000 description 1
- 101000914679 Homo sapiens Fanconi anemia group B protein Proteins 0.000 description 1
- 101000848171 Homo sapiens Fanconi anemia group J protein Proteins 0.000 description 1
- 101001027414 Homo sapiens Fasciculation and elongation protein zeta-2 Proteins 0.000 description 1
- 101001021925 Homo sapiens Fascin Proteins 0.000 description 1
- 101000846860 Homo sapiens Fc receptor-like A Proteins 0.000 description 1
- 101000988798 Homo sapiens Ferroxidase HEPHL1 Proteins 0.000 description 1
- 101001060279 Homo sapiens Fetuin-B Proteins 0.000 description 1
- 101001031635 Homo sapiens Fibrinogen-like protein 1 Proteins 0.000 description 1
- 101000892776 Homo sapiens Fibronectin type III domain-containing protein 10 Proteins 0.000 description 1
- 101000878272 Homo sapiens Fidgetin-like protein 1 Proteins 0.000 description 1
- 101000861452 Homo sapiens Forkhead box protein P3 Proteins 0.000 description 1
- 101000823442 Homo sapiens Fructose-2,6-bisphosphatase Proteins 0.000 description 1
- 101000755933 Homo sapiens Fructose-bisphosphate aldolase B Proteins 0.000 description 1
- 101001039308 Homo sapiens G-protein coupled receptor 151 Proteins 0.000 description 1
- 101001009545 Homo sapiens G-protein coupled receptor 35 Proteins 0.000 description 1
- 101000871149 Homo sapiens G-protein coupled receptor 52 Proteins 0.000 description 1
- 101000738575 Homo sapiens G1/S-specific cyclin-E2 Proteins 0.000 description 1
- 101001026457 Homo sapiens G2 and S phase-expressed protein 1 Proteins 0.000 description 1
- 101000779347 Homo sapiens GDP-Man:Man(3)GlcNAc(2)-PP-Dol alpha-1,2-mannosyltransferase Proteins 0.000 description 1
- 101000981252 Homo sapiens GPI-linked NAD(P)(+)-arginine ADP-ribosyltransferase 1 Proteins 0.000 description 1
- 101100393249 Homo sapiens GPR148 gene Proteins 0.000 description 1
- 101001071433 Homo sapiens GRAM domain-containing protein 2B Proteins 0.000 description 1
- 101001061348 Homo sapiens Galactose-3-O-sulfotransferase 4 Proteins 0.000 description 1
- 101000620927 Homo sapiens Galactoside-binding soluble lectin 13 Proteins 0.000 description 1
- 101001075374 Homo sapiens Gamma-glutamyl hydrolase Proteins 0.000 description 1
- 101000886680 Homo sapiens Gamma-glutamylcyclotransferase Proteins 0.000 description 1
- 101000858028 Homo sapiens Gap junction alpha-9 protein Proteins 0.000 description 1
- 101000856653 Homo sapiens Gap junction delta-2 protein Proteins 0.000 description 1
- 101000856663 Homo sapiens Gap junction delta-3 protein Proteins 0.000 description 1
- 101001026279 Homo sapiens Gasdermin-C Proteins 0.000 description 1
- 101001026271 Homo sapiens Genetic suppressor element 1 Proteins 0.000 description 1
- 101000862655 Homo sapiens Germinal center-associated signaling and motility protein Proteins 0.000 description 1
- 101001034755 Homo sapiens Germinal center-associated signaling and motility-like protein Proteins 0.000 description 1
- 101001071367 Homo sapiens Girdin Proteins 0.000 description 1
- 101000857303 Homo sapiens Glomulin Proteins 0.000 description 1
- 101001100728 Homo sapiens Glutamine and serine-rich protein 1 Proteins 0.000 description 1
- 101000870590 Homo sapiens Glutathione S-transferase A3 Proteins 0.000 description 1
- 101000888230 Homo sapiens Glycine N-acyltransferase-like protein 1 Proteins 0.000 description 1
- 101000978837 Homo sapiens Glycoprotein endo-alpha-1,2-mannosidase Proteins 0.000 description 1
- 101000746373 Homo sapiens Granulocyte-macrophage colony-stimulating factor Proteins 0.000 description 1
- 101001040751 Homo sapiens Granulysin Proteins 0.000 description 1
- 101000900697 Homo sapiens Granzyme M Proteins 0.000 description 1
- 101000983153 Homo sapiens Group IID secretory phospholipase A2 Proteins 0.000 description 1
- 101001072710 Homo sapiens Growth arrest-specific protein 2 Proteins 0.000 description 1
- 101001073279 Homo sapiens Guanine nucleotide-binding protein G(T) subunit gamma-T1 Proteins 0.000 description 1
- 101001038755 Homo sapiens Guanylate cyclase soluble subunit alpha-1 Proteins 0.000 description 1
- 101001001336 Homo sapiens Guanylate-binding protein 1 Proteins 0.000 description 1
- 101001058844 Homo sapiens Guanylate-binding protein 7 Proteins 0.000 description 1
- 101000899808 Homo sapiens Guanylyl cyclase C Proteins 0.000 description 1
- 101001035831 Homo sapiens HAUS augmin-like complex subunit 1 Proteins 0.000 description 1
- 101000990572 Homo sapiens HEAT repeat-containing protein 1 Proteins 0.000 description 1
- 101000866278 Homo sapiens HLA class II histocompatibility antigen, DO alpha chain Proteins 0.000 description 1
- 101001045553 Homo sapiens Hemogen Proteins 0.000 description 1
- 101000601604 Homo sapiens Heparan sulfate N-sulfotransferase 4 Proteins 0.000 description 1
- 101001082432 Homo sapiens Heparin cofactor 2 Proteins 0.000 description 1
- 101001062353 Homo sapiens Hepatocyte nuclear factor 3-alpha Proteins 0.000 description 1
- 101000988521 Homo sapiens Hexokinase HKDC1 Proteins 0.000 description 1
- 101000878611 Homo sapiens High affinity immunoglobulin epsilon receptor subunit alpha Proteins 0.000 description 1
- 101001016841 Homo sapiens Histamine H1 receptor Proteins 0.000 description 1
- 101000899879 Homo sapiens Histone H1.5 Proteins 0.000 description 1
- 101001035431 Homo sapiens Histone H2A type 1 Proteins 0.000 description 1
- 101000944174 Homo sapiens Histone acetyltransferase KAT6B Proteins 0.000 description 1
- 101001045846 Homo sapiens Histone-lysine N-methyltransferase 2A Proteins 0.000 description 1
- 101001045907 Homo sapiens Holliday junction recognition protein Proteins 0.000 description 1
- 101000988994 Homo sapiens Homeobox protein Hox-B8 Proteins 0.000 description 1
- 101000989000 Homo sapiens Homeobox protein Hox-B9 Proteins 0.000 description 1
- 101001052462 Homo sapiens Homeobox protein MIXL1 Proteins 0.000 description 1
- 101000584427 Homo sapiens Homeobox protein orthopedia Proteins 0.000 description 1
- 101000988649 Homo sapiens Humanin-like 3 Proteins 0.000 description 1
- 101001079904 Homo sapiens Hyaluronan and proteoglycan link protein 1 Proteins 0.000 description 1
- 101001078445 Homo sapiens Hyaluronan and proteoglycan link protein 2 Proteins 0.000 description 1
- 101001011967 Homo sapiens IQ motif and ankyrin repeat domain-containing protein 1 Proteins 0.000 description 1
- 101001032334 Homo sapiens Immunity-related GTPase family M protein Proteins 0.000 description 1
- 101000854886 Homo sapiens Immunoglobulin iota chain Proteins 0.000 description 1
- 101000840267 Homo sapiens Immunoglobulin lambda-like polypeptide 1 Proteins 0.000 description 1
- 101001054793 Homo sapiens Importin subunit alpha-7 Proteins 0.000 description 1
- 101001054791 Homo sapiens Importin subunit alpha-8 Proteins 0.000 description 1
- 101000707660 Homo sapiens Inactive Rho GTPase-activating protein 11B Proteins 0.000 description 1
- 101001134447 Homo sapiens Inactive pancreatic lipase-related protein 1 Proteins 0.000 description 1
- 101000721401 Homo sapiens Inactive ubiquitin thioesterase OTULINL Proteins 0.000 description 1
- 101001037256 Homo sapiens Indoleamine 2,3-dioxygenase 1 Proteins 0.000 description 1
- 101001054832 Homo sapiens Inhibin beta C chain Proteins 0.000 description 1
- 101000852852 Homo sapiens Innate immunity activator protein Proteins 0.000 description 1
- 101000599779 Homo sapiens Insulin-like growth factor 2 mRNA-binding protein 2 Proteins 0.000 description 1
- 101000599782 Homo sapiens Insulin-like growth factor 2 mRNA-binding protein 3 Proteins 0.000 description 1
- 101001081567 Homo sapiens Insulin-like growth factor-binding protein 1 Proteins 0.000 description 1
- 101001044940 Homo sapiens Insulin-like growth factor-binding protein 2 Proteins 0.000 description 1
- 101000998810 Homo sapiens Insulin-like peptide INSL6 Proteins 0.000 description 1
- 101000998808 Homo sapiens Integrator complex subunit 2 Proteins 0.000 description 1
- 101000994378 Homo sapiens Integrin alpha-3 Proteins 0.000 description 1
- 101000994375 Homo sapiens Integrin alpha-4 Proteins 0.000 description 1
- 101000609406 Homo sapiens Inter-alpha-trypsin inhibitor heavy chain H3 Proteins 0.000 description 1
- 101001082070 Homo sapiens Interferon alpha-inducible protein 6 Proteins 0.000 description 1
- 101001054334 Homo sapiens Interferon beta Proteins 0.000 description 1
- 101001044447 Homo sapiens Interferon kappa Proteins 0.000 description 1
- 101001011382 Homo sapiens Interferon regulatory factor 3 Proteins 0.000 description 1
- 101001011441 Homo sapiens Interferon regulatory factor 4 Proteins 0.000 description 1
- 101001034846 Homo sapiens Interferon-induced transmembrane protein 3 Proteins 0.000 description 1
- 101001010600 Homo sapiens Interleukin-12 subunit alpha Proteins 0.000 description 1
- 101001044887 Homo sapiens Interleukin-22 receptor subunit alpha-2 Proteins 0.000 description 1
- 101000852980 Homo sapiens Interleukin-23 subunit alpha Proteins 0.000 description 1
- 101000852964 Homo sapiens Interleukin-27 subunit beta Proteins 0.000 description 1
- 101001036194 Homo sapiens Isopentenyl-diphosphate delta-isomerase 2 Proteins 0.000 description 1
- 101000691574 Homo sapiens Junction plakoglobin Proteins 0.000 description 1
- 101001046633 Homo sapiens Junctional adhesion molecule A Proteins 0.000 description 1
- 101001050282 Homo sapiens Jupiter microtubule associated homolog 2 Proteins 0.000 description 1
- 101001026918 Homo sapiens KRAB-A domain-containing protein 2 Proteins 0.000 description 1
- 101000971351 Homo sapiens KRR1 small subunit processome component homolog Proteins 0.000 description 1
- 101000605522 Homo sapiens Kallikrein-1 Proteins 0.000 description 1
- 101001091385 Homo sapiens Kallikrein-6 Proteins 0.000 description 1
- 101000945451 Homo sapiens Kelch domain-containing protein 8B Proteins 0.000 description 1
- 101001047047 Homo sapiens Kelch repeat and BTB domain-containing protein 8 Proteins 0.000 description 1
- 101001027207 Homo sapiens Kelch-like protein 40 Proteins 0.000 description 1
- 101000998027 Homo sapiens Keratin, type I cytoskeletal 17 Proteins 0.000 description 1
- 101000998011 Homo sapiens Keratin, type I cytoskeletal 19 Proteins 0.000 description 1
- 101000975496 Homo sapiens Keratin, type II cytoskeletal 8 Proteins 0.000 description 1
- 101000613871 Homo sapiens Keratinocyte-associated protein 3 Proteins 0.000 description 1
- 101001027621 Homo sapiens Kinesin-like protein KIF20A Proteins 0.000 description 1
- 101000605743 Homo sapiens Kinesin-like protein KIF23 Proteins 0.000 description 1
- 101001112162 Homo sapiens Kinetochore protein NDC80 homolog Proteins 0.000 description 1
- 101001006909 Homo sapiens Kinetochore-associated protein 1 Proteins 0.000 description 1
- 101001139130 Homo sapiens Krueppel-like factor 5 Proteins 0.000 description 1
- 101000588045 Homo sapiens Kunitz-type protease inhibitor 1 Proteins 0.000 description 1
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 1
- 101001047511 Homo sapiens LLGL scribble cell polarity complex component 2 Proteins 0.000 description 1
- 101000874532 Homo sapiens Lactosylceramide 1,3-N-acetyl-beta-D-glucosaminyltransferase Proteins 0.000 description 1
- 101001010037 Homo sapiens Ladinin-1 Proteins 0.000 description 1
- 101001005166 Homo sapiens Lens fiber membrane intrinsic protein Proteins 0.000 description 1
- 101001054848 Homo sapiens Leucine zipper protein 1 Proteins 0.000 description 1
- 101001039113 Homo sapiens Leucine-rich repeat-containing protein 15 Proteins 0.000 description 1
- 101001004923 Homo sapiens Leucine-rich repeat-containing protein 31 Proteins 0.000 description 1
- 101000619621 Homo sapiens Leucine-rich repeat-containing protein 4C Proteins 0.000 description 1
- 101000579912 Homo sapiens Leucine-rich repeat-containing protein 58 Proteins 0.000 description 1
- 101000984684 Homo sapiens Leucine-rich single-pass membrane protein 1 Proteins 0.000 description 1
- 101001063878 Homo sapiens Leukemia-associated protein 1 Proteins 0.000 description 1
- 101000970777 Homo sapiens Leukemia-associated protein 7 Proteins 0.000 description 1
- 101000984196 Homo sapiens Leukocyte immunoglobulin-like receptor subfamily A member 5 Proteins 0.000 description 1
- 101000608935 Homo sapiens Leukosialin Proteins 0.000 description 1
- 101001017968 Homo sapiens Leukotriene B4 receptor 1 Proteins 0.000 description 1
- 101000619643 Homo sapiens Ligand-dependent nuclear receptor-interacting factor 1 Proteins 0.000 description 1
- 101001017764 Homo sapiens Lipopolysaccharide-responsive and beige-like anchor protein Proteins 0.000 description 1
- 101001064870 Homo sapiens Lon protease homolog, mitochondrial Proteins 0.000 description 1
- 101000780205 Homo sapiens Long-chain-fatty-acid-CoA ligase 5 Proteins 0.000 description 1
- 101000923835 Homo sapiens Low density lipoprotein receptor adapter protein 1 Proteins 0.000 description 1
- 101001051093 Homo sapiens Low-density lipoprotein receptor Proteins 0.000 description 1
- 101000958332 Homo sapiens Lymphocyte antigen 6 complex locus protein G6d Proteins 0.000 description 1
- 101000958312 Homo sapiens Lymphocyte antigen 6 complex locus protein G6f Proteins 0.000 description 1
- 101001038043 Homo sapiens Lysophosphatidic acid receptor 4 Proteins 0.000 description 1
- 101001038037 Homo sapiens Lysophosphatidic acid receptor 5 Proteins 0.000 description 1
- 101001122938 Homo sapiens Lysosomal protective protein Proteins 0.000 description 1
- 101001115414 Homo sapiens M-phase phosphoprotein 9 Proteins 0.000 description 1
- 101000980566 Homo sapiens MAPK regulated corepressor interacting protein 2 Proteins 0.000 description 1
- 101000972918 Homo sapiens MAX gene-associated protein Proteins 0.000 description 1
- 101000616456 Homo sapiens MEF2-activating motif and SAP domain-containing transcriptional regulator Proteins 0.000 description 1
- 101001052076 Homo sapiens Maltase-glucoamylase Proteins 0.000 description 1
- 101001056015 Homo sapiens Mannan-binding lectin serine protease 2 Proteins 0.000 description 1
- 101001128500 Homo sapiens Marginal zone B- and B1-cell-specific protein Proteins 0.000 description 1
- 101001028025 Homo sapiens Mdm2-binding protein Proteins 0.000 description 1
- 101000592685 Homo sapiens Meiotic nuclear division protein 1 homolog Proteins 0.000 description 1
- 101001057131 Homo sapiens Melanoma-associated antigen D4 Proteins 0.000 description 1
- 101000625226 Homo sapiens Melanoregulin Proteins 0.000 description 1
- 101000687968 Homo sapiens Membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase Proteins 0.000 description 1
- 101001027295 Homo sapiens Metabotropic glutamate receptor 8 Proteins 0.000 description 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 1
- 101000653360 Homo sapiens Methylcytosine dioxygenase TET1 Proteins 0.000 description 1
- 101000968916 Homo sapiens Methylsterol monooxygenase 1 Proteins 0.000 description 1
- 101000588145 Homo sapiens Microtubule-associated tumor suppressor 1 Proteins 0.000 description 1
- 101000581272 Homo sapiens Midasin Proteins 0.000 description 1
- 101000578867 Homo sapiens Mis18-binding protein 1 Proteins 0.000 description 1
- 101000763951 Homo sapiens Mitochondrial import inner membrane translocase subunit Tim8 A Proteins 0.000 description 1
- 101000764239 Homo sapiens Mitochondrial import receptor subunit TOM5 homolog Proteins 0.000 description 1
- 101001013832 Homo sapiens Mitochondrial peptide methionine sulfoxide reductase Proteins 0.000 description 1
- 101001019013 Homo sapiens Mitotic interactor and substrate of PLK1 Proteins 0.000 description 1
- 101000623900 Homo sapiens Mucin-13 Proteins 0.000 description 1
- 101000972286 Homo sapiens Mucin-4 Proteins 0.000 description 1
- 101001015220 Homo sapiens Myelin-associated oligodendrocyte basic protein Proteins 0.000 description 1
- 101000934338 Homo sapiens Myeloid cell surface antigen CD33 Proteins 0.000 description 1
- 101001059654 Homo sapiens Myeloid-associated differentiation marker-like protein 2 Proteins 0.000 description 1
- 101000589002 Homo sapiens Myogenin Proteins 0.000 description 1
- 101000635895 Homo sapiens Myosin light chain 4 Proteins 0.000 description 1
- 101000584208 Homo sapiens Myosin light chain kinase 2, skeletal/cardiac muscle Proteins 0.000 description 1
- 101000990929 Homo sapiens Myosin regulatory light chain 2, skeletal muscle isoform Proteins 0.000 description 1
- 101000982032 Homo sapiens Myosin-binding protein C, cardiac-type Proteins 0.000 description 1
- 101000635965 Homo sapiens Myosin-binding protein C, slow-type Proteins 0.000 description 1
- 101001030184 Homo sapiens Myotilin Proteins 0.000 description 1
- 101001030169 Homo sapiens Myozenin-1 Proteins 0.000 description 1
- 101000874528 Homo sapiens N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase 3 Proteins 0.000 description 1
- 101000588232 Homo sapiens N-alpha-acetyltransferase 11 Proteins 0.000 description 1
- 101001128489 Homo sapiens N-alpha-acetyltransferase 25, NatB auxiliary subunit Proteins 0.000 description 1
- 101001059802 Homo sapiens N-formyl peptide receptor 3 Proteins 0.000 description 1
- 101000962076 Homo sapiens NADH-cytochrome b5 reductase-like Proteins 0.000 description 1
- 101000603359 Homo sapiens NADPH oxidase organizer 1 Proteins 0.000 description 1
- 101000928259 Homo sapiens NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 1
- 101000601130 Homo sapiens NHL-repeat-containing protein 4 Proteins 0.000 description 1
- 101001109503 Homo sapiens NKG2-C type II integral membrane protein Proteins 0.000 description 1
- 101000634783 Homo sapiens NOP protein chaperone 1 Proteins 0.000 description 1
- 101000604461 Homo sapiens NUT family member 2G Proteins 0.000 description 1
- 101001128158 Homo sapiens Nanos homolog 2 Proteins 0.000 description 1
- 101000978730 Homo sapiens Nephrin Proteins 0.000 description 1
- 101000624947 Homo sapiens Nesprin-1 Proteins 0.000 description 1
- 101000822072 Homo sapiens Neuronal acetylcholine receptor subunit alpha-6 Proteins 0.000 description 1
- 101000822093 Homo sapiens Neuronal acetylcholine receptor subunit alpha-9 Proteins 0.000 description 1
- 101001108246 Homo sapiens Neuronal pentraxin-2 Proteins 0.000 description 1
- 101000577541 Homo sapiens Neuronal regeneration-related protein Proteins 0.000 description 1
- 101001128911 Homo sapiens Neutral cholesterol ester hydrolase 1 Proteins 0.000 description 1
- 101000583239 Homo sapiens Nicotinate-nucleotide pyrophosphorylase [carboxylating] Proteins 0.000 description 1
- 101001124991 Homo sapiens Nitric oxide synthase, inducible Proteins 0.000 description 1
- 101000972834 Homo sapiens Normal mucosa of esophagus-specific gene 1 protein Proteins 0.000 description 1
- 101000577309 Homo sapiens Notch-regulated ankyrin repeat-containing protein Proteins 0.000 description 1
- 101000973615 Homo sapiens Nuclear envelope integral membrane protein 1 Proteins 0.000 description 1
- 101000996563 Homo sapiens Nuclear pore complex protein Nup214 Proteins 0.000 description 1
- 101000974340 Homo sapiens Nuclear receptor corepressor 1 Proteins 0.000 description 1
- 101000978926 Homo sapiens Nuclear receptor subfamily 1 group D member 1 Proteins 0.000 description 1
- 101000603882 Homo sapiens Nuclear receptor subfamily 1 group I member 3 Proteins 0.000 description 1
- 101000633503 Homo sapiens Nuclear receptor subfamily 2 group E member 1 Proteins 0.000 description 1
- 101001008429 Homo sapiens Nucleobindin-2 Proteins 0.000 description 1
- 101000634768 Homo sapiens Nucleolar protein 16 Proteins 0.000 description 1
- 101001109512 Homo sapiens Nucleoplasmin-3 Proteins 0.000 description 1
- 101001024723 Homo sapiens Nucleoporin NDC1 Proteins 0.000 description 1
- 101000979629 Homo sapiens Nucleoside diphosphate kinase A Proteins 0.000 description 1
- 101000979623 Homo sapiens Nucleoside diphosphate kinase B Proteins 0.000 description 1
- 101001086785 Homo sapiens Occludin Proteins 0.000 description 1
- 101000986821 Homo sapiens P2Y purinoceptor 4 Proteins 0.000 description 1
- 101001071238 Homo sapiens PHD finger protein 14 Proteins 0.000 description 1
- 101001095089 Homo sapiens PML-RARA-regulated adapter molecule 1 Proteins 0.000 description 1
- 101000572986 Homo sapiens POU domain, class 3, transcription factor 2 Proteins 0.000 description 1
- 101000738965 Homo sapiens POU domain, class 6, transcription factor 2 Proteins 0.000 description 1
- 101000604540 Homo sapiens PRKCA-binding protein Proteins 0.000 description 1
- 101000613575 Homo sapiens Paired box protein Pax-1 Proteins 0.000 description 1
- 101000601724 Homo sapiens Paired box protein Pax-5 Proteins 0.000 description 1
- 101001129850 Homo sapiens Paired immunoglobulin-like type 2 receptor beta Proteins 0.000 description 1
- 101000786370 Homo sapiens Palmitoyltransferase ZDHHC21 Proteins 0.000 description 1
- 101000600748 Homo sapiens Pancreatic progenitor cell differentiation and proliferation factor-like protein Proteins 0.000 description 1
- 101001084266 Homo sapiens Parathyroid hormone 2 receptor Proteins 0.000 description 1
- 101000617151 Homo sapiens Pentraxin-4 Proteins 0.000 description 1
- 101000735594 Homo sapiens Peptidyl-prolyl cis-trans isomerase A-like 4G Proteins 0.000 description 1
- 101001091191 Homo sapiens Peptidyl-prolyl cis-trans isomerase F, mitochondrial Proteins 0.000 description 1
- 101000987581 Homo sapiens Perforin-1 Proteins 0.000 description 1
- 101001117010 Homo sapiens Pericentrin Proteins 0.000 description 1
- 101001095308 Homo sapiens Periostin Proteins 0.000 description 1
- 101001124867 Homo sapiens Peroxiredoxin-1 Proteins 0.000 description 1
- 101001090065 Homo sapiens Peroxiredoxin-2 Proteins 0.000 description 1
- 101000873719 Homo sapiens Phakinin Proteins 0.000 description 1
- 101000733743 Homo sapiens Phorbol-12-myristate-13-acetate-induced protein 1 Proteins 0.000 description 1
- 101000741974 Homo sapiens Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 1 protein Proteins 0.000 description 1
- 101000730493 Homo sapiens Phosphatidylinositol-glycan-specific phospholipase D Proteins 0.000 description 1
- 101000582978 Homo sapiens Phospholipid phosphatase-related protein type 1 Proteins 0.000 description 1
- 101000582986 Homo sapiens Phospholipid phosphatase-related protein type 3 Proteins 0.000 description 1
- 101000923320 Homo sapiens Phospholipid-transporting ATPase IF Proteins 0.000 description 1
- 101000923340 Homo sapiens Phospholipid-transporting ATPase VB Proteins 0.000 description 1
- 101001053641 Homo sapiens Plasma serine protease inhibitor Proteins 0.000 description 1
- 101000596041 Homo sapiens Plastin-1 Proteins 0.000 description 1
- 101001097889 Homo sapiens Platelet-activating factor acetylhydrolase Proteins 0.000 description 1
- 101000609363 Homo sapiens Platelet-activating factor acetylhydrolase IB subunit alpha1 Proteins 0.000 description 1
- 101000600766 Homo sapiens Podoplanin Proteins 0.000 description 1
- 101000872170 Homo sapiens Polycomb complex protein BMI-1 Proteins 0.000 description 1
- 101000886182 Homo sapiens Polypeptide N-acetylgalactosaminyltransferase 4 Proteins 0.000 description 1
- 101000886222 Homo sapiens Polypeptide N-acetylgalactosaminyltransferase 5 Proteins 0.000 description 1
- 101000705615 Homo sapiens Polypyrimidine tract-binding protein 3 Proteins 0.000 description 1
- 101001049841 Homo sapiens Potassium channel subfamily K member 1 Proteins 0.000 description 1
- 101000974732 Homo sapiens Potassium channel subfamily K member 17 Proteins 0.000 description 1
- 101001026192 Homo sapiens Potassium voltage-gated channel subfamily A member 6 Proteins 0.000 description 1
- 101000606310 Homo sapiens Pre T-cell antigen receptor alpha Proteins 0.000 description 1
- 101000617708 Homo sapiens Pregnancy-specific beta-1-glycoprotein 1 Proteins 0.000 description 1
- 101000617725 Homo sapiens Pregnancy-specific beta-1-glycoprotein 2 Proteins 0.000 description 1
- 101000617726 Homo sapiens Pregnancy-specific beta-1-glycoprotein 3 Proteins 0.000 description 1
- 101000617721 Homo sapiens Pregnancy-specific beta-1-glycoprotein 6 Proteins 0.000 description 1
- 101000617536 Homo sapiens Presenilin-1 Proteins 0.000 description 1
- 101000952078 Homo sapiens Probable ATP-dependent RNA helicase DDX60 Proteins 0.000 description 1
- 101000914997 Homo sapiens Probable C-mannosyltransferase DPY19L4 Proteins 0.000 description 1
- 101001130147 Homo sapiens Probable D-lactate dehydrogenase, mitochondrial Proteins 0.000 description 1
- 101000857740 Homo sapiens Probable G-protein coupled receptor 160 Proteins 0.000 description 1
- 101001040717 Homo sapiens Probable G-protein coupled receptor 174 Proteins 0.000 description 1
- 101001069607 Homo sapiens Probable G-protein coupled receptor 75 Proteins 0.000 description 1
- 101001069601 Homo sapiens Probable G-protein coupled receptor 82 Proteins 0.000 description 1
- 101001024626 Homo sapiens Probable N-acetyltransferase 16 Proteins 0.000 description 1
- 101000919183 Homo sapiens Probable carboxypeptidase X1 Proteins 0.000 description 1
- 101000630121 Homo sapiens Probable cysteine-tRNA ligase, mitochondrial Proteins 0.000 description 1
- 101000788412 Homo sapiens Probable methyltransferase TARBP1 Proteins 0.000 description 1
- 101000679303 Homo sapiens Probable non-functional T cell receptor gamma variable Proteins 0.000 description 1
- 101001056707 Homo sapiens Proepiregulin Proteins 0.000 description 1
- 101001117312 Homo sapiens Programmed cell death 1 ligand 2 Proteins 0.000 description 1
- 101000741708 Homo sapiens Proline-rich protein 15 Proteins 0.000 description 1
- 101001068552 Homo sapiens Proline-rich protein 15-like protein Proteins 0.000 description 1
- 101001090538 Homo sapiens Proline-rich protein 7 Proteins 0.000 description 1
- 101001069757 Homo sapiens Prospero homeobox protein 2 Proteins 0.000 description 1
- 101001125574 Homo sapiens Prostasin Proteins 0.000 description 1
- 101001115830 Homo sapiens Prostate-associated microseminoprotein Proteins 0.000 description 1
- 101000803296 Homo sapiens Protein BNIP5 Proteins 0.000 description 1
- 101000766826 Homo sapiens Protein CIP2A Proteins 0.000 description 1
- 101000813321 Homo sapiens Protein FAM124B Proteins 0.000 description 1
- 101000882138 Homo sapiens Protein FAM131C Proteins 0.000 description 1
- 101001062760 Homo sapiens Protein FAM13A Proteins 0.000 description 1
- 101000911541 Homo sapiens Protein FAM214A Proteins 0.000 description 1
- 101000937717 Homo sapiens Protein FAM222A Proteins 0.000 description 1
- 101000882231 Homo sapiens Protein FAM43B Proteins 0.000 description 1
- 101000848922 Homo sapiens Protein FAM72A Proteins 0.000 description 1
- 101000877861 Homo sapiens Protein FAM83B Proteins 0.000 description 1
- 101000931462 Homo sapiens Protein FosB Proteins 0.000 description 1
- 101001046896 Homo sapiens Protein HIDE1 Proteins 0.000 description 1
- 101001033832 Homo sapiens Protein INSYN2B Proteins 0.000 description 1
- 101001065830 Homo sapiens Protein LRATD1 Proteins 0.000 description 1
- 101001019885 Homo sapiens Protein MGARP Proteins 0.000 description 1
- 101000577964 Homo sapiens Protein MIX23 Proteins 0.000 description 1
- 101000977257 Homo sapiens Protein MMS22-like Proteins 0.000 description 1
- 101000963905 Homo sapiens Protein MOST-1 Proteins 0.000 description 1
- 101000727775 Homo sapiens Protein S100-A7A Proteins 0.000 description 1
- 101000740224 Homo sapiens Protein SCAI Proteins 0.000 description 1
- 101000652104 Homo sapiens Protein SMIM45 Proteins 0.000 description 1
- 101000652263 Homo sapiens Protein SOGA1 Proteins 0.000 description 1
- 101000596012 Homo sapiens Protein TASOR 2 Proteins 0.000 description 1
- 101000814380 Homo sapiens Protein Wnt-7b Proteins 0.000 description 1
- 101000726148 Homo sapiens Protein crumbs homolog 1 Proteins 0.000 description 1
- 101000726113 Homo sapiens Protein crumbs homolog 3 Proteins 0.000 description 1
- 101000928535 Homo sapiens Protein delta homolog 1 Proteins 0.000 description 1
- 101000920916 Homo sapiens Protein eva-1 homolog A Proteins 0.000 description 1
- 101000971468 Homo sapiens Protein kinase C zeta type Proteins 0.000 description 1
- 101000695187 Homo sapiens Protein patched homolog 1 Proteins 0.000 description 1
- 101000688355 Homo sapiens Protein phosphatase 1 regulatory subunit 14D Proteins 0.000 description 1
- 101000686996 Homo sapiens Protein phosphatase 1 regulatory subunit 1B Proteins 0.000 description 1
- 101000685298 Homo sapiens Protein sel-1 homolog 3 Proteins 0.000 description 1
- 101000825257 Homo sapiens Protein spinster homolog 3 Proteins 0.000 description 1
- 101000844010 Homo sapiens Protein tweety homolog 3 Proteins 0.000 description 1
- 101000649077 Homo sapiens Protein-tyrosine sulfotransferase 2 Proteins 0.000 description 1
- 101000579425 Homo sapiens Proto-oncogene tyrosine-protein kinase receptor Ret Proteins 0.000 description 1
- 101000610015 Homo sapiens Protocadherin beta-9 Proteins 0.000 description 1
- 101000655540 Homo sapiens Protransforming growth factor alpha Proteins 0.000 description 1
- 101000590549 Homo sapiens Pseudouridylate synthase 7 homolog Proteins 0.000 description 1
- 101001086519 Homo sapiens Pseudouridylate synthase 7 homolog-like protein Proteins 0.000 description 1
- 101000974905 Homo sapiens Putative ATP synthase subunit g 2, mitochondrial Proteins 0.000 description 1
- 101000890956 Homo sapiens Putative Dol-P-Glc:Glc(2)Man(9)GlcNAc(2)-PP-Dol alpha-1,2-glucosyltransferase Proteins 0.000 description 1
- 101000915594 Homo sapiens Putative KRAB domain-containing protein ZNF788 Proteins 0.000 description 1
- 101001133957 Homo sapiens Putative POU domain, class 5, transcription factor 1B Proteins 0.000 description 1
- 101000848490 Homo sapiens Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 Proteins 0.000 description 1
- 101000697601 Homo sapiens Putative STAG3-like protein 2 Proteins 0.000 description 1
- 101000658082 Homo sapiens Putative U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 1 Proteins 0.000 description 1
- 101000797918 Homo sapiens Putative aldo-keto reductase family 1 member C8 Proteins 0.000 description 1
- 101000823113 Homo sapiens Putative ankyrin repeat domain-containing protein 26-like 1 Proteins 0.000 description 1
- 101000966772 Homo sapiens Putative apolipoprotein(a)-like protein 2 Proteins 0.000 description 1
- 101000868776 Homo sapiens Putative coiled-coil domain-containing protein 26 Proteins 0.000 description 1
- 101000836459 Homo sapiens Putative glycosyltransferase ALG1-like Proteins 0.000 description 1
- 101000890962 Homo sapiens Putative glycosyltransferase ALG1L2 Proteins 0.000 description 1
- 101001124901 Homo sapiens Putative histone-lysine N-methyltransferase PRDM6 Proteins 0.000 description 1
- 101000602504 Homo sapiens Putative homeobox protein NANOG2 Proteins 0.000 description 1
- 101001074414 Homo sapiens Putative phospholipase B-like 2 Proteins 0.000 description 1
- 101001087362 Homo sapiens Putative pituitary tumor-transforming gene 3 protein Proteins 0.000 description 1
- 101001134263 Homo sapiens Putative protein MSS51 homolog, mitochondrial Proteins 0.000 description 1
- 101000873438 Homo sapiens Putative protein SEM1, isoform 2 Proteins 0.000 description 1
- 101000723918 Homo sapiens Putative protein ZNF321 Proteins 0.000 description 1
- 101000648676 Homo sapiens Putative protein encoded by LINC02912 Proteins 0.000 description 1
- 101000734542 Homo sapiens Putative pyridoxal-dependent decarboxylase domain-containing protein 2 Proteins 0.000 description 1
- 101000777204 Homo sapiens Putative ubiquitin carboxyl-terminal hydrolase 41 Proteins 0.000 description 1
- 101000957894 Homo sapiens Putative uncharacterized protein CRYM-AS1 Proteins 0.000 description 1
- 101000937675 Homo sapiens Putative uncharacterized protein FAM30A Proteins 0.000 description 1
- 101000892082 Homo sapiens Putative uncharacterized protein FBXL19-AS1 Proteins 0.000 description 1
- 101001044397 Homo sapiens Putative uncharacterized protein IGF2BP2-AS1 Proteins 0.000 description 1
- 101000634182 Homo sapiens Putative uncharacterized protein NTM-AS1 Proteins 0.000 description 1
- 101000651610 Homo sapiens Putative uncharacterized protein SSBP3-AS1 Proteins 0.000 description 1
- 101000945539 Homo sapiens Putative uncharacterized protein encoded by LINC01554 Proteins 0.000 description 1
- 101000725919 Homo sapiens Putative uncharacterized protein encoded by LINC02910 Proteins 0.000 description 1
- 101000760018 Homo sapiens Putative zinc finger and SCAN domain-containing protein 5D Proteins 0.000 description 1
- 101000760178 Homo sapiens Putative zinc finger protein 66 Proteins 0.000 description 1
- 101000609335 Homo sapiens Pyrroline-5-carboxylate reductase 1, mitochondrial Proteins 0.000 description 1
- 101001100186 Homo sapiens RBBP8 N-terminal-like protein Proteins 0.000 description 1
- 101000580317 Homo sapiens RNA 3'-terminal phosphate cyclase-like protein Proteins 0.000 description 1
- 101001048703 Homo sapiens RNA polymerase II elongation factor ELL3 Proteins 0.000 description 1
- 101000668168 Homo sapiens RNA-binding motif, single-stranded-interacting protein 3 Proteins 0.000 description 1
- 101001111928 Homo sapiens RNA-binding protein 41 Proteins 0.000 description 1
- 101001100309 Homo sapiens RNA-binding protein 47 Proteins 0.000 description 1
- 101100181207 Homo sapiens RSKR gene Proteins 0.000 description 1
- 101000620788 Homo sapiens Rab proteins geranylgeranyltransferase component A 2 Proteins 0.000 description 1
- 101000579758 Homo sapiens Raftlin Proteins 0.000 description 1
- 101000848718 Homo sapiens Rap guanine nucleotide exchange factor 5 Proteins 0.000 description 1
- 101000870945 Homo sapiens Ras guanyl-releasing protein 3 Proteins 0.000 description 1
- 101000744536 Homo sapiens Ras-related protein Rab-27B Proteins 0.000 description 1
- 101001100204 Homo sapiens Ras-related protein Rab-40A-like Proteins 0.000 description 1
- 101000670549 Homo sapiens RecQ-mediated genome instability protein 2 Proteins 0.000 description 1
- 101000831949 Homo sapiens Receptor for retinol uptake STRA6 Proteins 0.000 description 1
- 101001089248 Homo sapiens Receptor-interacting serine/threonine-protein kinase 4 Proteins 0.000 description 1
- 101000932478 Homo sapiens Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 description 1
- 101000606546 Homo sapiens Receptor-type tyrosine-protein phosphatase H Proteins 0.000 description 1
- 101000606535 Homo sapiens Receptor-type tyrosine-protein phosphatase epsilon Proteins 0.000 description 1
- 101000584743 Homo sapiens Recombining binding protein suppressor of hairless Proteins 0.000 description 1
- 101000712891 Homo sapiens Recombining binding protein suppressor of hairless-like protein Proteins 0.000 description 1
- 101000700402 Homo sapiens Regulatory solute carrier protein family 1 member 1 Proteins 0.000 description 1
- 101000869654 Homo sapiens Relaxin receptor 2 Proteins 0.000 description 1
- 101000582404 Homo sapiens Replication factor C subunit 4 Proteins 0.000 description 1
- 101000927776 Homo sapiens Rho guanine nucleotide exchange factor 11 Proteins 0.000 description 1
- 101000637411 Homo sapiens Rho guanine nucleotide exchange factor TIAM2 Proteins 0.000 description 1
- 101000846336 Homo sapiens Ribitol-5-phosphate transferase FKTN Proteins 0.000 description 1
- 101001132643 Homo sapiens Ribonucleoprotein PTB-binding 2 Proteins 0.000 description 1
- 101001074727 Homo sapiens Ribonucleoside-diphosphate reductase large subunit Proteins 0.000 description 1
- 101000794048 Homo sapiens Ribosome biogenesis protein BRX1 homolog Proteins 0.000 description 1
- 101000803747 Homo sapiens Ribosome biogenesis protein WDR12 Proteins 0.000 description 1
- 101000650694 Homo sapiens Roundabout homolog 1 Proteins 0.000 description 1
- 101000711466 Homo sapiens SAM pointed domain-containing Ets transcription factor Proteins 0.000 description 1
- 101000650872 Homo sapiens SH2 domain-containing adapter protein D Proteins 0.000 description 1
- 101000880302 Homo sapiens SH3 and cysteine-rich domain-containing protein 3 Proteins 0.000 description 1
- 101000616556 Homo sapiens SH3 domain-containing protein 19 Proteins 0.000 description 1
- 101000863815 Homo sapiens SHC SH2 domain-binding protein 1 Proteins 0.000 description 1
- 101000633782 Homo sapiens SLAM family member 8 Proteins 0.000 description 1
- 101000617796 Homo sapiens SPARC-related modular calcium-binding protein 1 Proteins 0.000 description 1
- 101000868747 Homo sapiens SPOC domain-containing protein 1 Proteins 0.000 description 1
- 101000697602 Homo sapiens STAG3-like protein 3 Proteins 0.000 description 1
- 101000687715 Homo sapiens SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1 Proteins 0.000 description 1
- 101000650820 Homo sapiens Semaphorin-4A Proteins 0.000 description 1
- 101000707474 Homo sapiens Serine incorporator 2 Proteins 0.000 description 1
- 101000705949 Homo sapiens Serine protease 57 Proteins 0.000 description 1
- 101000998893 Homo sapiens Serine protease HTRA4 Proteins 0.000 description 1
- 101000598311 Homo sapiens Serine protease inhibitor Kazal-type 13 Proteins 0.000 description 1
- 101001053311 Homo sapiens Serine protease inhibitor Kazal-type 8 Proteins 0.000 description 1
- 101000829208 Homo sapiens Serine/arginine repetitive matrix protein 3 Proteins 0.000 description 1
- 101000780111 Homo sapiens Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A Proteins 0.000 description 1
- 101000642456 Homo sapiens Serpin A11 Proteins 0.000 description 1
- 101000637835 Homo sapiens Serum amyloid A-4 protein Proteins 0.000 description 1
- 101001094647 Homo sapiens Serum paraoxonase/arylesterase 1 Proteins 0.000 description 1
- 101000631848 Homo sapiens Sex comb on midleg-like protein 2 Proteins 0.000 description 1
- 101000863880 Homo sapiens Sialic acid-binding Ig-like lectin 6 Proteins 0.000 description 1
- 101000654500 Homo sapiens Signal-induced proliferation-associated 1-like protein 2 Proteins 0.000 description 1
- 101000836849 Homo sapiens Signal-induced proliferation-associated 1-like protein 3 Proteins 0.000 description 1
- 101000709256 Homo sapiens Signal-regulatory protein beta-1 Proteins 0.000 description 1
- 101000709188 Homo sapiens Signal-regulatory protein beta-1 isoform 3 Proteins 0.000 description 1
- 101000648030 Homo sapiens Signal-transducing adaptor protein 2 Proteins 0.000 description 1
- 101000826125 Homo sapiens Single-stranded DNA-binding protein 2 Proteins 0.000 description 1
- 101000653812 Homo sapiens Small nuclear ribonucleoprotein E Proteins 0.000 description 1
- 101000713494 Homo sapiens Small nuclear ribonucleoprotein F Proteins 0.000 description 1
- 101000713459 Homo sapiens Small nuclear ribonucleoprotein G Proteins 0.000 description 1
- 101000665150 Homo sapiens Small nuclear ribonucleoprotein Sm D1 Proteins 0.000 description 1
- 101000941138 Homo sapiens Small subunit processome component 20 homolog Proteins 0.000 description 1
- 101000864006 Homo sapiens Smith-Magenis syndrome chromosomal region candidate gene 5 protein Proteins 0.000 description 1
- 101001012599 Homo sapiens Sodium-dependent glucose transporter 1 Proteins 0.000 description 1
- 101000639975 Homo sapiens Sodium-dependent noradrenaline transporter Proteins 0.000 description 1
- 101001125064 Homo sapiens Sodium/potassium-transporting ATPase subunit beta-1-interacting protein 1 Proteins 0.000 description 1
- 101000980900 Homo sapiens Sororin Proteins 0.000 description 1
- 101000825179 Homo sapiens Spindle and centriole-associated protein 1 Proteins 0.000 description 1
- 101000825632 Homo sapiens Spindle and kinetochore-associated protein 1 Proteins 0.000 description 1
- 101000828807 Homo sapiens Spindlin-2A Proteins 0.000 description 1
- 101000629413 Homo sapiens Spindlin-4 Proteins 0.000 description 1
- 101000663181 Homo sapiens Splicing regulatory glutamine/lysine-rich protein 1 Proteins 0.000 description 1
- 101001056878 Homo sapiens Squalene monooxygenase Proteins 0.000 description 1
- 101000716931 Homo sapiens Sterile alpha motif domain-containing protein 12 Proteins 0.000 description 1
- 101000641015 Homo sapiens Sterile alpha motif domain-containing protein 9 Proteins 0.000 description 1
- 101000633445 Homo sapiens Structural maintenance of chromosomes protein 2 Proteins 0.000 description 1
- 101000825726 Homo sapiens Structural maintenance of chromosomes protein 4 Proteins 0.000 description 1
- 101000879116 Homo sapiens Succinate receptor 1 Proteins 0.000 description 1
- 101000585365 Homo sapiens Sulfotransferase 2A1 Proteins 0.000 description 1
- 101000820777 Homo sapiens Syncytin-1 Proteins 0.000 description 1
- 101000820789 Homo sapiens Syncytin-2 Proteins 0.000 description 1
- 101000585049 Homo sapiens Syntaxin-19 Proteins 0.000 description 1
- 101000820476 Homo sapiens Syntaxin-binding protein 4 Proteins 0.000 description 1
- 101000666775 Homo sapiens T-box transcription factor TBX3 Proteins 0.000 description 1
- 101000914484 Homo sapiens T-lymphocyte activation antigen CD80 Proteins 0.000 description 1
- 101000889527 Homo sapiens TOG array regulator of axonemal microtubules protein 1 Proteins 0.000 description 1
- 101000679548 Homo sapiens TOX high mobility group box family member 3 Proteins 0.000 description 1
- 101000892439 Homo sapiens Taste receptor type 2 member 10 Proteins 0.000 description 1
- 101000714920 Homo sapiens Taste receptor type 2 member 13 Proteins 0.000 description 1
- 101000714926 Homo sapiens Taste receptor type 2 member 14 Proteins 0.000 description 1
- 101000674770 Homo sapiens Taste receptor type 2 member 19 Proteins 0.000 description 1
- 101000626662 Homo sapiens Taste receptor type 2 member 3 Proteins 0.000 description 1
- 101000674777 Homo sapiens Taste receptor type 2 member 30 Proteins 0.000 description 1
- 101000787841 Homo sapiens Taste receptor type 2 member 31 Proteins 0.000 description 1
- 101000626629 Homo sapiens Taste receptor type 2 member 4 Proteins 0.000 description 1
- 101000596174 Homo sapiens Taste receptor type 2 member 41 Proteins 0.000 description 1
- 101000838279 Homo sapiens Taste receptor type 2 member 42 Proteins 0.000 description 1
- 101000838273 Homo sapiens Taste receptor type 2 member 43 Proteins 0.000 description 1
- 101000838287 Homo sapiens Taste receptor type 2 member 46 Proteins 0.000 description 1
- 101000836166 Homo sapiens Taste receptor type 2 member 50 Proteins 0.000 description 1
- 101000836159 Homo sapiens Taste receptor type 2 member 60 Proteins 0.000 description 1
- 101000759314 Homo sapiens Tau-tubulin kinase 1 Proteins 0.000 description 1
- 101001095487 Homo sapiens Telomere-associated protein RIF1 Proteins 0.000 description 1
- 101000800633 Homo sapiens Teneurin-2 Proteins 0.000 description 1
- 101000835745 Homo sapiens Teratocarcinoma-derived growth factor 1 Proteins 0.000 description 1
- 101000759882 Homo sapiens Tetraspanin-12 Proteins 0.000 description 1
- 101000847107 Homo sapiens Tetraspanin-8 Proteins 0.000 description 1
- 101000795815 Homo sapiens Tetratricopeptide repeat protein 24 Proteins 0.000 description 1
- 101000759409 Homo sapiens Tetratricopeptide repeat protein 39C Proteins 0.000 description 1
- 101001027052 Homo sapiens Thymidylate kinase Proteins 0.000 description 1
- 101000625739 Homo sapiens Thymosin beta-15A Proteins 0.000 description 1
- 101000837626 Homo sapiens Thyroid hormone receptor alpha Proteins 0.000 description 1
- 101000785517 Homo sapiens Tight junction protein ZO-3 Proteins 0.000 description 1
- 101000893741 Homo sapiens Tissue alpha-L-fucosidase Proteins 0.000 description 1
- 101000830710 Homo sapiens Torsin-4A Proteins 0.000 description 1
- 101001067250 Homo sapiens Transcription cofactor HES-6 Proteins 0.000 description 1
- 101000622239 Homo sapiens Transcription cofactor vestigial-like protein 2 Proteins 0.000 description 1
- 101000596772 Homo sapiens Transcription factor 7-like 1 Proteins 0.000 description 1
- 101000666382 Homo sapiens Transcription factor E2-alpha Proteins 0.000 description 1
- 101000866292 Homo sapiens Transcription factor E2F7 Proteins 0.000 description 1
- 101000837837 Homo sapiens Transcription factor EC Proteins 0.000 description 1
- 101001057127 Homo sapiens Transcription factor ETV7 Proteins 0.000 description 1
- 101000825086 Homo sapiens Transcription factor SOX-11 Proteins 0.000 description 1
- 101000825082 Homo sapiens Transcription factor SOX-12 Proteins 0.000 description 1
- 101000825182 Homo sapiens Transcription factor Spi-B Proteins 0.000 description 1
- 101000933296 Homo sapiens Transcription factor TFIIIB component B'' homolog Proteins 0.000 description 1
- 101000845264 Homo sapiens Transcription termination factor 2 Proteins 0.000 description 1
- 101001010792 Homo sapiens Transcriptional regulator ERG Proteins 0.000 description 1
- 101000685104 Homo sapiens Transcriptional repressor scratch 1 Proteins 0.000 description 1
- 101000685107 Homo sapiens Transcriptional repressor scratch 2 Proteins 0.000 description 1
- 101000764619 Homo sapiens Translation machinery-associated protein 16 Proteins 0.000 description 1
- 101000658571 Homo sapiens Transmembrane 4 L6 family member 19 Proteins 0.000 description 1
- 101000638105 Homo sapiens Transmembrane channel-like protein 5 Proteins 0.000 description 1
- 101000764634 Homo sapiens Transmembrane gamma-carboxyglutamic acid protein 4 Proteins 0.000 description 1
- 101000655146 Homo sapiens Transmembrane protein 14EP Proteins 0.000 description 1
- 101000851563 Homo sapiens Transmembrane protein 211 Proteins 0.000 description 1
- 101000596324 Homo sapiens Transmembrane protein 229B Proteins 0.000 description 1
- 101000831862 Homo sapiens Transmembrane protein 45B Proteins 0.000 description 1
- 101000800065 Homo sapiens Treslin Proteins 0.000 description 1
- 101000680650 Homo sapiens Tripartite motif-containing protein 15 Proteins 0.000 description 1
- 101000766324 Homo sapiens Tripartite motif-containing protein 60 Proteins 0.000 description 1
- 101000850794 Homo sapiens Tropomyosin alpha-3 chain Proteins 0.000 description 1
- 101000851357 Homo sapiens Troponin T, slow skeletal muscle Proteins 0.000 description 1
- 101000838350 Homo sapiens Tubulin alpha-1C chain Proteins 0.000 description 1
- 101000834944 Homo sapiens Tubulin epsilon and delta complex protein 2 Proteins 0.000 description 1
- 101000830603 Homo sapiens Tumor necrosis factor ligand superfamily member 11 Proteins 0.000 description 1
- 101000830596 Homo sapiens Tumor necrosis factor ligand superfamily member 15 Proteins 0.000 description 1
- 101000597779 Homo sapiens Tumor necrosis factor ligand superfamily member 18 Proteins 0.000 description 1
- 101000638161 Homo sapiens Tumor necrosis factor ligand superfamily member 6 Proteins 0.000 description 1
- 101000610980 Homo sapiens Tumor protein D52 Proteins 0.000 description 1
- 101000889732 Homo sapiens Tyrosine-protein kinase Tec Proteins 0.000 description 1
- 101001087412 Homo sapiens Tyrosine-protein phosphatase non-receptor type 18 Proteins 0.000 description 1
- 101001135565 Homo sapiens Tyrosine-protein phosphatase non-receptor type 3 Proteins 0.000 description 1
- 101001135561 Homo sapiens Tyrosine-protein phosphatase non-receptor type 4 Proteins 0.000 description 1
- 101000708381 Homo sapiens U11/U12 small nuclear ribonucleoprotein 25 kDa protein Proteins 0.000 description 1
- 101001017894 Homo sapiens U6 snRNA-associated Sm-like protein LSm3 Proteins 0.000 description 1
- 101000807276 Homo sapiens UHRF1-binding protein 1 Proteins 0.000 description 1
- 101000607318 Homo sapiens UL16-binding protein 3 Proteins 0.000 description 1
- 101000932608 Homo sapiens Uncharacterized membrane protein C3orf80 Proteins 0.000 description 1
- 101000794430 Homo sapiens Uncharacterized protein C1orf53 Proteins 0.000 description 1
- 101000746200 Homo sapiens Uncharacterized protein C21orf62 Proteins 0.000 description 1
- 101000884595 Homo sapiens Uncharacterized protein C6orf223 Proteins 0.000 description 1
- 101000879000 Homo sapiens Uncharacterized protein CD300LD-AS1 Proteins 0.000 description 1
- 101000585623 Homo sapiens Unconventional myosin-X Proteins 0.000 description 1
- 101000768133 Homo sapiens Unhealthy ribosome biogenesis protein 2 homolog Proteins 0.000 description 1
- 101001061851 Homo sapiens V(D)J recombination-activating protein 2 Proteins 0.000 description 1
- 101000743490 Homo sapiens V-set and immunoglobulin domain-containing protein 2 Proteins 0.000 description 1
- 101000667104 Homo sapiens Vacuolar protein sorting-associated protein 13C Proteins 0.000 description 1
- 101000622427 Homo sapiens Vang-like protein 1 Proteins 0.000 description 1
- 101000641959 Homo sapiens Villin-1 Proteins 0.000 description 1
- 101000910758 Homo sapiens Voltage-dependent calcium channel gamma-2 subunit Proteins 0.000 description 1
- 101000910745 Homo sapiens Voltage-dependent calcium channel gamma-3 subunit Proteins 0.000 description 1
- 101000910748 Homo sapiens Voltage-dependent calcium channel gamma-4 subunit Proteins 0.000 description 1
- 101000814254 Homo sapiens WD repeat-containing protein 49 Proteins 0.000 description 1
- 101000666072 Homo sapiens WD repeat-containing protein 76 Proteins 0.000 description 1
- 101000983271 Homo sapiens Xaa-Arg dipeptidase Proteins 0.000 description 1
- 101000743863 Homo sapiens ZW10 interactor Proteins 0.000 description 1
- 101000976379 Homo sapiens ZZ-type zinc finger-containing protein 3 Proteins 0.000 description 1
- 101000786319 Homo sapiens Zinc finger BED domain-containing protein 3 Proteins 0.000 description 1
- 101000786321 Homo sapiens Zinc finger BED domain-containing protein 4 Proteins 0.000 description 1
- 101000786335 Homo sapiens Zinc finger BED domain-containing protein 6 Proteins 0.000 description 1
- 101000916507 Homo sapiens Zinc finger CCCH-type antiviral protein 1-like Proteins 0.000 description 1
- 101000788678 Homo sapiens Zinc finger MYM-type protein 1 Proteins 0.000 description 1
- 101000788736 Homo sapiens Zinc finger protein 100 Proteins 0.000 description 1
- 101000976576 Homo sapiens Zinc finger protein 121 Proteins 0.000 description 1
- 101000976577 Homo sapiens Zinc finger protein 124 Proteins 0.000 description 1
- 101000759241 Homo sapiens Zinc finger protein 138 Proteins 0.000 description 1
- 101000915543 Homo sapiens Zinc finger protein 14 homolog Proteins 0.000 description 1
- 101000759232 Homo sapiens Zinc finger protein 141 Proteins 0.000 description 1
- 101000782153 Homo sapiens Zinc finger protein 221 Proteins 0.000 description 1
- 101000818786 Homo sapiens Zinc finger protein 253 Proteins 0.000 description 1
- 101000723759 Homo sapiens Zinc finger protein 26 Proteins 0.000 description 1
- 101000818783 Homo sapiens Zinc finger protein 260 Proteins 0.000 description 1
- 101000785697 Homo sapiens Zinc finger protein 275 Proteins 0.000 description 1
- 101000723761 Homo sapiens Zinc finger protein 28 Proteins 0.000 description 1
- 101000964390 Homo sapiens Zinc finger protein 280D Proteins 0.000 description 1
- 101000785713 Homo sapiens Zinc finger protein 283 Proteins 0.000 description 1
- 101000723904 Homo sapiens Zinc finger protein 296 Proteins 0.000 description 1
- 101000788732 Homo sapiens Zinc finger protein 367 Proteins 0.000 description 1
- 101000788735 Homo sapiens Zinc finger protein 37A Proteins 0.000 description 1
- 101000976622 Homo sapiens Zinc finger protein 42 homolog Proteins 0.000 description 1
- 101000760180 Homo sapiens Zinc finger protein 43 Proteins 0.000 description 1
- 101000818845 Homo sapiens Zinc finger protein 439 Proteins 0.000 description 1
- 101000744940 Homo sapiens Zinc finger protein 491 Proteins 0.000 description 1
- 101000785689 Homo sapiens Zinc finger protein 519 Proteins 0.000 description 1
- 101000802315 Homo sapiens Zinc finger protein 551 Proteins 0.000 description 1
- 101000964699 Homo sapiens Zinc finger protein 566 Proteins 0.000 description 1
- 101000964759 Homo sapiens Zinc finger protein 573 Proteins 0.000 description 1
- 101000818706 Homo sapiens Zinc finger protein 618 Proteins 0.000 description 1
- 101000782278 Homo sapiens Zinc finger protein 621 Proteins 0.000 description 1
- 101000782292 Homo sapiens Zinc finger protein 625 Proteins 0.000 description 1
- 101000785611 Homo sapiens Zinc finger protein 660 Proteins 0.000 description 1
- 101000743802 Homo sapiens Zinc finger protein 675 Proteins 0.000 description 1
- 101000743807 Homo sapiens Zinc finger protein 678 Proteins 0.000 description 1
- 101000743810 Homo sapiens Zinc finger protein 681 Proteins 0.000 description 1
- 101000964746 Homo sapiens Zinc finger protein 69 Proteins 0.000 description 1
- 101000723641 Homo sapiens Zinc finger protein 695 Proteins 0.000 description 1
- 101000964729 Homo sapiens Zinc finger protein 70 Proteins 0.000 description 1
- 101000964755 Homo sapiens Zinc finger protein 708 Proteins 0.000 description 1
- 101000964738 Homo sapiens Zinc finger protein 714 Proteins 0.000 description 1
- 101000964744 Homo sapiens Zinc finger protein 717 Proteins 0.000 description 1
- 101000964743 Homo sapiens Zinc finger protein 718 Proteins 0.000 description 1
- 101000760267 Homo sapiens Zinc finger protein 724 Proteins 0.000 description 1
- 101000760266 Homo sapiens Zinc finger protein 726 Proteins 0.000 description 1
- 101000760277 Homo sapiens Zinc finger protein 736 Proteins 0.000 description 1
- 101000760276 Homo sapiens Zinc finger protein 737 Proteins 0.000 description 1
- 101000915602 Homo sapiens Zinc finger protein 772 Proteins 0.000 description 1
- 101000976457 Homo sapiens Zinc finger protein 805 Proteins 0.000 description 1
- 101000785577 Homo sapiens Zinc finger protein 850 Proteins 0.000 description 1
- 101000785578 Homo sapiens Zinc finger protein 852 Proteins 0.000 description 1
- 101000785587 Homo sapiens Zinc finger protein 878 Proteins 0.000 description 1
- 101000818740 Homo sapiens Zinc finger protein 888 Proteins 0.000 description 1
- 101000818742 Homo sapiens Zinc finger protein 891 Proteins 0.000 description 1
- 101000743781 Homo sapiens Zinc finger protein 91 Proteins 0.000 description 1
- 101000743788 Homo sapiens Zinc finger protein 92 Proteins 0.000 description 1
- 101000856554 Homo sapiens Zinc finger protein Gfi-1b Proteins 0.000 description 1
- 101000782089 Homo sapiens Zinc finger protein ZFAT Proteins 0.000 description 1
- 101000788690 Homo sapiens Zinc fingers and homeoboxes protein 1 Proteins 0.000 description 1
- 101001032478 Homo sapiens cAMP-dependent protein kinase inhibitor alpha Proteins 0.000 description 1
- 101000818522 Homo sapiens fMet-Leu-Phe receptor Proteins 0.000 description 1
- 101000591280 Homo sapiens mRNA turnover protein 4 homolog Proteins 0.000 description 1
- 101000830183 Homo sapiens tRNA (guanine-N(7)-)-methyltransferase Proteins 0.000 description 1
- 101000615747 Homo sapiens tRNA-splicing endonuclease subunit Sen2 Proteins 0.000 description 1
- 101000667350 Homo sapiens von Willebrand factor A domain-containing protein 2 Proteins 0.000 description 1
- 102100029072 Humanin-like 3 Human genes 0.000 description 1
- 108090000320 Hyaluronan Synthases Proteins 0.000 description 1
- 102000003918 Hyaluronan Synthases Human genes 0.000 description 1
- 102100028084 Hyaluronan and proteoglycan link protein 1 Human genes 0.000 description 1
- 102100025264 Hyaluronan and proteoglycan link protein 2 Human genes 0.000 description 1
- 102100030211 IQ motif and ankyrin repeat domain-containing protein 1 Human genes 0.000 description 1
- 102100038249 Immunity-related GTPase family M protein Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 102100020744 Immunoglobulin iota chain Human genes 0.000 description 1
- 102100029616 Immunoglobulin lambda-like polypeptide 1 Human genes 0.000 description 1
- 102100027001 Importin subunit alpha-8 Human genes 0.000 description 1
- 102100031355 Inactive Rho GTPase-activating protein 11B Human genes 0.000 description 1
- 102100033358 Inactive pancreatic lipase-related protein 1 Human genes 0.000 description 1
- 102100025186 Inactive ubiquitin thioesterase OTULINL Human genes 0.000 description 1
- 102100040061 Indoleamine 2,3-dioxygenase 1 Human genes 0.000 description 1
- 102100026812 Inhibin beta C chain Human genes 0.000 description 1
- 102100036724 Innate immunity activator protein Human genes 0.000 description 1
- 102100037919 Insulin-like growth factor 2 mRNA-binding protein 2 Human genes 0.000 description 1
- 102100037920 Insulin-like growth factor 2 mRNA-binding protein 3 Human genes 0.000 description 1
- 102100027636 Insulin-like growth factor-binding protein 1 Human genes 0.000 description 1
- 102100022710 Insulin-like growth factor-binding protein 2 Human genes 0.000 description 1
- 102100033235 Insulin-like peptide INSL6 Human genes 0.000 description 1
- 102100033265 Integrator complex subunit 2 Human genes 0.000 description 1
- 102100032819 Integrin alpha-3 Human genes 0.000 description 1
- 102100032818 Integrin alpha-4 Human genes 0.000 description 1
- 102100039460 Inter-alpha-trypsin inhibitor heavy chain H3 Human genes 0.000 description 1
- 102100027354 Interferon alpha-inducible protein 6 Human genes 0.000 description 1
- 102100026720 Interferon beta Human genes 0.000 description 1
- 102100022469 Interferon kappa Human genes 0.000 description 1
- 102100029843 Interferon regulatory factor 3 Human genes 0.000 description 1
- 102100030126 Interferon regulatory factor 4 Human genes 0.000 description 1
- 102100040035 Interferon-induced transmembrane protein 3 Human genes 0.000 description 1
- 102100030698 Interleukin-12 subunit alpha Human genes 0.000 description 1
- 108010017411 Interleukin-21 Receptors Proteins 0.000 description 1
- 102100030699 Interleukin-21 receptor Human genes 0.000 description 1
- 102100022703 Interleukin-22 receptor subunit alpha-2 Human genes 0.000 description 1
- 102100036705 Interleukin-23 subunit alpha Human genes 0.000 description 1
- 102100036712 Interleukin-27 subunit beta Human genes 0.000 description 1
- 108010018951 Interleukin-8B Receptors Proteins 0.000 description 1
- 102100039349 Intraflagellar transport protein 25 homolog Human genes 0.000 description 1
- 101710134808 Intraflagellar transport protein 25 homolog Proteins 0.000 description 1
- 102100039618 Isopentenyl-diphosphate delta-isomerase 2 Human genes 0.000 description 1
- 102100026153 Junction plakoglobin Human genes 0.000 description 1
- 102100022304 Junctional adhesion molecule A Human genes 0.000 description 1
- 108091036429 KCNQ1OT1 Proteins 0.000 description 1
- 101710169267 KIAA1586 Proteins 0.000 description 1
- 102100037321 KRAB-A domain-containing protein 2 Human genes 0.000 description 1
- 102100021559 KRR1 small subunit processome component homolog Human genes 0.000 description 1
- 102100034866 Kallikrein-6 Human genes 0.000 description 1
- 102100033604 Kelch domain-containing protein 8B Human genes 0.000 description 1
- 102100022830 Kelch repeat and BTB domain-containing protein 8 Human genes 0.000 description 1
- 102100037656 Kelch-like protein 40 Human genes 0.000 description 1
- 102100033511 Keratin, type I cytoskeletal 17 Human genes 0.000 description 1
- 102100033420 Keratin, type I cytoskeletal 19 Human genes 0.000 description 1
- 102100023972 Keratin, type II cytoskeletal 8 Human genes 0.000 description 1
- 102100040542 Keratinocyte-associated protein 3 Human genes 0.000 description 1
- 101710197581 Ketoisovalerate oxidoreductase subunit VorC Proteins 0.000 description 1
- 102100037694 Kinesin-like protein KIF20A Human genes 0.000 description 1
- 102100038406 Kinesin-like protein KIF23 Human genes 0.000 description 1
- 102100023890 Kinetochore protein NDC80 homolog Human genes 0.000 description 1
- 102100028394 Kinetochore-associated protein 1 Human genes 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 102100020680 Krueppel-like factor 5 Human genes 0.000 description 1
- 102100031607 Kunitz-type protease inhibitor 1 Human genes 0.000 description 1
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 1
- 102100022957 LLGL scribble cell polarity complex component 2 Human genes 0.000 description 1
- 102100035655 Lactosylceramide 1,3-N-acetyl-beta-D-glucosaminyltransferase Human genes 0.000 description 1
- 102100030931 Ladinin-1 Human genes 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- 102100038204 Large neutral amino acids transporter small subunit 1 Human genes 0.000 description 1
- 102100026038 Lens fiber membrane intrinsic protein Human genes 0.000 description 1
- 102100026915 Leucine zipper protein 1 Human genes 0.000 description 1
- 102100040645 Leucine-rich repeat-containing protein 15 Human genes 0.000 description 1
- 102100025952 Leucine-rich repeat-containing protein 31 Human genes 0.000 description 1
- 102100022187 Leucine-rich repeat-containing protein 4C Human genes 0.000 description 1
- 102100027507 Leucine-rich repeat-containing protein 58 Human genes 0.000 description 1
- 102100027109 Leucine-rich single-pass membrane protein 1 Human genes 0.000 description 1
- 102100030893 Leukemia-associated protein 1 Human genes 0.000 description 1
- 102100021900 Leukemia-associated protein 7 Human genes 0.000 description 1
- 102100025574 Leukocyte immunoglobulin-like receptor subfamily A member 5 Human genes 0.000 description 1
- 102100033374 Leukotriene B4 receptor 1 Human genes 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 101710147185 Light-dependent protochlorophyllide reductase Proteins 0.000 description 1
- 108091036060 Linker DNA Proteins 0.000 description 1
- 102100033353 Lipopolysaccharide-responsive and beige-like anchor protein Human genes 0.000 description 1
- 102100031955 Lon protease homolog, mitochondrial Human genes 0.000 description 1
- 102100034318 Long-chain-fatty-acid-CoA ligase 5 Human genes 0.000 description 1
- 102100034389 Low density lipoprotein receptor adapter protein 1 Human genes 0.000 description 1
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 1
- 101000680845 Luffa aegyptiaca Ribosome-inactivating protein luffin P1 Proteins 0.000 description 1
- 102100038210 Lymphocyte antigen 6 complex locus protein G6d Human genes 0.000 description 1
- 102100040405 Lysophosphatidic acid receptor 4 Human genes 0.000 description 1
- 102100040404 Lysophosphatidic acid receptor 5 Human genes 0.000 description 1
- 102100028524 Lysosomal protective protein Human genes 0.000 description 1
- 102100023269 M-phase phosphoprotein 9 Human genes 0.000 description 1
- 102100024163 MAPK regulated corepressor interacting protein 2 Human genes 0.000 description 1
- 102100022621 MAX gene-associated protein Human genes 0.000 description 1
- 102000003626 MCOLN2 Human genes 0.000 description 1
- 102100021795 MEF2-activating motif and SAP domain-containing transcriptional regulator Human genes 0.000 description 1
- 102100024997 MOB kinase activator 1B Human genes 0.000 description 1
- 101700028414 MOB1B Proteins 0.000 description 1
- 229910015837 MSH2 Inorganic materials 0.000 description 1
- 108091007877 MYCBP2 Proteins 0.000 description 1
- 108700012912 MYCN Proteins 0.000 description 1
- 101150022024 MYCN gene Proteins 0.000 description 1
- 102100026046 Mannan-binding lectin serine protease 2 Human genes 0.000 description 1
- 102100031826 Marginal zone B- and B1-cell-specific protein Human genes 0.000 description 1
- 102100030612 Mast cell carboxypeptidase A Human genes 0.000 description 1
- 108010072582 Matrilin Proteins Proteins 0.000 description 1
- 102100033689 Matrilin-4 Human genes 0.000 description 1
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 101150056690 Mcoln2 gene Proteins 0.000 description 1
- 102100037572 Mdm2-binding protein Human genes 0.000 description 1
- 102000008135 Mechanistic Target of Rapamycin Complex 1 Human genes 0.000 description 1
- 108010035196 Mechanistic Target of Rapamycin Complex 1 Proteins 0.000 description 1
- 102100033679 Meiotic nuclear division protein 1 homolog Human genes 0.000 description 1
- 102100027257 Melanoma-associated antigen D4 Human genes 0.000 description 1
- 102100024976 Melanoregulin Human genes 0.000 description 1
- 108010093662 Member 11 Subfamily B ATP Binding Cassette Transporter Proteins 0.000 description 1
- 102000047724 Member 2 Solute Carrier Family 12 Human genes 0.000 description 1
- 108010090837 Member 5 Subfamily G ATP Binding Cassette Transporter Proteins 0.000 description 1
- 108010090822 Member 8 Subfamily G ATP Binding Cassette Transporter Proteins 0.000 description 1
- 102100024262 Membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase Human genes 0.000 description 1
- 102100037636 Metabotropic glutamate receptor 8 Human genes 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 1
- 102100030819 Methylcytosine dioxygenase TET1 Human genes 0.000 description 1
- 102100021091 Methylsterol monooxygenase 1 Human genes 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 108091093085 MiR-338 Proteins 0.000 description 1
- 102100031550 Microtubule-associated tumor suppressor 1 Human genes 0.000 description 1
- 102100027666 Midasin Human genes 0.000 description 1
- 102100028344 Mis18-binding protein 1 Human genes 0.000 description 1
- 102100026808 Mitochondrial import inner membrane translocase subunit Tim8 A Human genes 0.000 description 1
- 102100026902 Mitochondrial import receptor subunit TOM5 homolog Human genes 0.000 description 1
- 102100031767 Mitochondrial peptide methionine sulfoxide reductase Human genes 0.000 description 1
- 102100033607 Mitotic interactor and substrate of PLK1 Human genes 0.000 description 1
- 102100025394 Monofunctional C1-tetrahydrofolate synthase, mitochondrial Human genes 0.000 description 1
- 108091006676 Monovalent cation:proton antiporter-3 Proteins 0.000 description 1
- 102100023124 Mucin-13 Human genes 0.000 description 1
- 102100022693 Mucin-4 Human genes 0.000 description 1
- 101100162168 Mus musculus Adam1a gene Proteins 0.000 description 1
- 102100032977 Myelin-associated oligodendrocyte basic protein Human genes 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 102100025243 Myeloid cell surface antigen CD33 Human genes 0.000 description 1
- 102100028794 Myeloid-associated differentiation marker-like protein 2 Human genes 0.000 description 1
- 102100032970 Myogenin Human genes 0.000 description 1
- 102100030739 Myosin light chain 4 Human genes 0.000 description 1
- 102100030788 Myosin light chain kinase 2, skeletal/cardiac muscle Human genes 0.000 description 1
- 102100030419 Myosin regulatory light chain 2, skeletal muscle isoform Human genes 0.000 description 1
- 102100026771 Myosin-binding protein C, cardiac-type Human genes 0.000 description 1
- 102100030735 Myosin-binding protein C, slow-type Human genes 0.000 description 1
- 102100038894 Myotilin Human genes 0.000 description 1
- 102100038898 Myozenin-1 Human genes 0.000 description 1
- 108700026495 N-Myc Proto-Oncogene Proteins 0.000 description 1
- 102100035629 N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase 3 Human genes 0.000 description 1
- 102100031640 N-alpha-acetyltransferase 11 Human genes 0.000 description 1
- 102100031832 N-alpha-acetyltransferase 25, NatB auxiliary subunit Human genes 0.000 description 1
- 102100028130 N-formyl peptide receptor 3 Human genes 0.000 description 1
- 102100030124 N-myc proto-oncogene protein Human genes 0.000 description 1
- 102100039232 NADH-cytochrome b5 reductase-like Human genes 0.000 description 1
- 102000004019 NADPH Oxidase 1 Human genes 0.000 description 1
- 108090000424 NADPH Oxidase 1 Proteins 0.000 description 1
- 102100039033 NADPH oxidase organizer 1 Human genes 0.000 description 1
- 102100023897 NADPH-cytochrome P450 reductase Human genes 0.000 description 1
- 102100036777 NADPH:adrenodoxin oxidoreductase, mitochondrial Human genes 0.000 description 1
- 108050006691 NEDD4-binding protein 2 Proteins 0.000 description 1
- 102100036542 NEDD4-binding protein 2 Human genes 0.000 description 1
- 102100037367 NHL-repeat-containing protein 4 Human genes 0.000 description 1
- 102100022683 NKG2-C type II integral membrane protein Human genes 0.000 description 1
- 102100029093 NOP protein chaperone 1 Human genes 0.000 description 1
- 102100038706 NUT family member 2G Human genes 0.000 description 1
- 102100023195 Nephrin Human genes 0.000 description 1
- 102100023306 Nesprin-1 Human genes 0.000 description 1
- 102000048238 Neuregulin-1 Human genes 0.000 description 1
- 108090000556 Neuregulin-1 Proteins 0.000 description 1
- 102000007530 Neurofibromin 1 Human genes 0.000 description 1
- 108010085793 Neurofibromin 1 Proteins 0.000 description 1
- 102100021518 Neuronal acetylcholine receptor subunit alpha-6 Human genes 0.000 description 1
- 102100021520 Neuronal acetylcholine receptor subunit alpha-9 Human genes 0.000 description 1
- 102100021878 Neuronal pentraxin-2 Human genes 0.000 description 1
- 102100028745 Neuronal regeneration-related protein Human genes 0.000 description 1
- 102100032087 Neutral cholesterol ester hydrolase 1 Human genes 0.000 description 1
- 102100030830 Nicotinate-nucleotide pyrophosphorylase [carboxylating] Human genes 0.000 description 1
- 102100029438 Nitric oxide synthase, inducible Human genes 0.000 description 1
- 102100022646 Normal mucosa of esophagus-specific gene 1 protein Human genes 0.000 description 1
- 102100028809 Notch-regulated ankyrin repeat-containing protein Human genes 0.000 description 1
- 108010062309 Nuclear Receptor Interacting Protein 1 Proteins 0.000 description 1
- 102100022220 Nuclear envelope integral membrane protein 1 Human genes 0.000 description 1
- 102100033819 Nuclear pore complex protein Nup214 Human genes 0.000 description 1
- 102100022935 Nuclear receptor corepressor 1 Human genes 0.000 description 1
- 102100039019 Nuclear receptor subfamily 0 group B member 1 Human genes 0.000 description 1
- 102100023170 Nuclear receptor subfamily 1 group D member 1 Human genes 0.000 description 1
- 102100038512 Nuclear receptor subfamily 1 group I member 3 Human genes 0.000 description 1
- 102100029534 Nuclear receptor subfamily 2 group E member 1 Human genes 0.000 description 1
- 102100029558 Nuclear receptor-interacting protein 1 Human genes 0.000 description 1
- 102100027441 Nucleobindin-2 Human genes 0.000 description 1
- 102100029102 Nucleolar protein 16 Human genes 0.000 description 1
- 102100022684 Nucleoplasmin-3 Human genes 0.000 description 1
- 102100037826 Nucleoporin NDC1 Human genes 0.000 description 1
- 102100023252 Nucleoside diphosphate kinase A Human genes 0.000 description 1
- 102100023258 Nucleoside diphosphate kinase B Human genes 0.000 description 1
- 102100032604 Occludin Human genes 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 102000016304 Origin Recognition Complex Human genes 0.000 description 1
- 108010067244 Origin Recognition Complex Proteins 0.000 description 1
- 102100028070 P2Y purinoceptor 4 Human genes 0.000 description 1
- 102100036866 PHD finger protein 14 Human genes 0.000 description 1
- 102100037019 PML-RARA-regulated adapter molecule 1 Human genes 0.000 description 1
- 102100026459 POU domain, class 3, transcription factor 2 Human genes 0.000 description 1
- 102100037484 POU domain, class 6, transcription factor 2 Human genes 0.000 description 1
- 102000036938 POU2AF1 Human genes 0.000 description 1
- 108060006456 POU2AF1 Proteins 0.000 description 1
- 102100024894 PR domain zinc finger protein 1 Human genes 0.000 description 1
- 102100038730 PRKCA-binding protein Human genes 0.000 description 1
- 102100040851 Paired box protein Pax-1 Human genes 0.000 description 1
- 102100037504 Paired box protein Pax-5 Human genes 0.000 description 1
- 102100031652 Paired immunoglobulin-like type 2 receptor beta Human genes 0.000 description 1
- 102100025790 Palmitoyltransferase ZDHHC21 Human genes 0.000 description 1
- 102100037264 Pancreatic progenitor cell differentiation and proliferation factor-like protein Human genes 0.000 description 1
- 102100030869 Parathyroid hormone 2 receptor Human genes 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- 102100021653 Pentraxin-4 Human genes 0.000 description 1
- 102100035580 Peptidyl-prolyl cis-trans isomerase A-like 4G Human genes 0.000 description 1
- 102100034943 Peptidyl-prolyl cis-trans isomerase F, mitochondrial Human genes 0.000 description 1
- 102100028467 Perforin-1 Human genes 0.000 description 1
- 102100024315 Pericentrin Human genes 0.000 description 1
- 102100037765 Periostin Human genes 0.000 description 1
- 102100029139 Peroxiredoxin-1 Human genes 0.000 description 1
- 102100034763 Peroxiredoxin-2 Human genes 0.000 description 1
- 102100035832 Phakinin Human genes 0.000 description 1
- 102100033716 Phorbol-12-myristate-13-acetate-induced protein 1 Human genes 0.000 description 1
- 101710178747 Phosphatidate cytidylyltransferase 1 Proteins 0.000 description 1
- 102100038634 Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 1 protein Human genes 0.000 description 1
- 102100032538 Phosphatidylinositol-glycan-specific phospholipase D Human genes 0.000 description 1
- 102100030288 Phospholipid phosphatase-related protein type 1 Human genes 0.000 description 1
- 102100030383 Phospholipid phosphatase-related protein type 3 Human genes 0.000 description 1
- 102100032687 Phospholipid-transporting ATPase IF Human genes 0.000 description 1
- 102100032666 Phospholipid-transporting ATPase VB Human genes 0.000 description 1
- 102100024078 Plasma serine protease inhibitor Human genes 0.000 description 1
- 102100035181 Plastin-1 Human genes 0.000 description 1
- 102100037518 Platelet-activating factor acetylhydrolase Human genes 0.000 description 1
- 102100039446 Platelet-activating factor acetylhydrolase IB subunit alpha1 Human genes 0.000 description 1
- 102100037265 Podoplanin Human genes 0.000 description 1
- 101100146539 Podospora anserina RPS15 gene Proteins 0.000 description 1
- 108010064218 Poly (ADP-Ribose) Polymerase-1 Proteins 0.000 description 1
- 102100023712 Poly [ADP-ribose] polymerase 1 Human genes 0.000 description 1
- 102100033566 Polycomb complex protein BMI-1 Human genes 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 102100039682 Polypeptide N-acetylgalactosaminyltransferase 4 Human genes 0.000 description 1
- 102100039697 Polypeptide N-acetylgalactosaminyltransferase 5 Human genes 0.000 description 1
- 102100031243 Polypyrimidine tract-binding protein 3 Human genes 0.000 description 1
- 108010009975 Positive Regulatory Domain I-Binding Factor 1 Proteins 0.000 description 1
- 102100023242 Potassium channel subfamily K member 1 Human genes 0.000 description 1
- 102100022757 Potassium channel subfamily K member 17 Human genes 0.000 description 1
- 102100037448 Potassium voltage-gated channel subfamily A member 6 Human genes 0.000 description 1
- 102100022807 Potassium voltage-gated channel subfamily H member 2 Human genes 0.000 description 1
- 102100039824 Pre T-cell antigen receptor alpha Human genes 0.000 description 1
- 102100022019 Pregnancy-specific beta-1-glycoprotein 2 Human genes 0.000 description 1
- 102100022020 Pregnancy-specific beta-1-glycoprotein 3 Human genes 0.000 description 1
- 102100022026 Pregnancy-specific beta-1-glycoprotein 6 Human genes 0.000 description 1
- 102100021983 Pregnancy-specific beta-1-glycoprotein 9 Human genes 0.000 description 1
- 102100022033 Presenilin-1 Human genes 0.000 description 1
- 102100037439 Probable ATP-dependent RNA helicase DDX60 Human genes 0.000 description 1
- 102100028695 Probable C-mannosyltransferase DPY19L4 Human genes 0.000 description 1
- 102100031708 Probable D-lactate dehydrogenase, mitochondrial Human genes 0.000 description 1
- 102100039878 Probable G-protein coupled receptor 148 Human genes 0.000 description 1
- 102100025346 Probable G-protein coupled receptor 160 Human genes 0.000 description 1
- 102100021199 Probable G-protein coupled receptor 174 Human genes 0.000 description 1
- 102100033860 Probable G-protein coupled receptor 75 Human genes 0.000 description 1
- 102100033866 Probable G-protein coupled receptor 82 Human genes 0.000 description 1
- 102100037014 Probable N-acetyltransferase 16 Human genes 0.000 description 1
- 102100029401 Probable carboxypeptidase X1 Human genes 0.000 description 1
- 102100026179 Probable cysteine-tRNA ligase, mitochondrial Human genes 0.000 description 1
- 102100025214 Probable methyltransferase TARBP1 Human genes 0.000 description 1
- 102100022588 Probable non-functional T cell receptor gamma variable Human genes 0.000 description 1
- 102100025498 Proepiregulin Human genes 0.000 description 1
- 102100024213 Programmed cell death 1 ligand 2 Human genes 0.000 description 1
- 102100036691 Proliferating cell nuclear antigen Human genes 0.000 description 1
- 102100038788 Proline-rich protein 15 Human genes 0.000 description 1
- 102100033950 Proline-rich protein 15-like protein Human genes 0.000 description 1
- 102100034740 Proline-rich protein 7 Human genes 0.000 description 1
- 102100033878 Prospero homeobox protein 2 Human genes 0.000 description 1
- 102100029500 Prostasin Human genes 0.000 description 1
- 102100025013 Prostate-associated microseminoprotein Human genes 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 102100035651 Protein BNIP5 Human genes 0.000 description 1
- 102100028634 Protein CIP2A Human genes 0.000 description 1
- 102100039198 Protein FAM124B Human genes 0.000 description 1
- 102100038986 Protein FAM131C Human genes 0.000 description 1
- 102100030557 Protein FAM13A Human genes 0.000 description 1
- 102100026957 Protein FAM214A Human genes 0.000 description 1
- 102100027298 Protein FAM222A Human genes 0.000 description 1
- 102100038921 Protein FAM43B Human genes 0.000 description 1
- 102100034514 Protein FAM72A Human genes 0.000 description 1
- 102100035443 Protein FAM83B Human genes 0.000 description 1
- 102100020847 Protein FosB Human genes 0.000 description 1
- 102100022876 Protein HIDE1 Human genes 0.000 description 1
- 102100039740 Protein INSYN2B Human genes 0.000 description 1
- 102100032095 Protein LRATD1 Human genes 0.000 description 1
- 102100035797 Protein MGARP Human genes 0.000 description 1
- 102100028174 Protein MIX23 Human genes 0.000 description 1
- 102100023475 Protein MMS22-like Human genes 0.000 description 1
- 102100040098 Protein MOST-1 Human genes 0.000 description 1
- 102100032446 Protein S100-A7 Human genes 0.000 description 1
- 102100030102 Protein S100-A7A Human genes 0.000 description 1
- 102100037197 Protein SCAI Human genes 0.000 description 1
- 102100030585 Protein SMIM45 Human genes 0.000 description 1
- 102100030527 Protein SOGA1 Human genes 0.000 description 1
- 102100030244 Protein SOX-15 Human genes 0.000 description 1
- 102100035171 Protein TASOR 2 Human genes 0.000 description 1
- 102100039470 Protein Wnt-7b Human genes 0.000 description 1
- 102100038012 Protein cramped-like Human genes 0.000 description 1
- 102100027331 Protein crumbs homolog 1 Human genes 0.000 description 1
- 102100027316 Protein crumbs homolog 3 Human genes 0.000 description 1
- 102100036467 Protein delta homolog 1 Human genes 0.000 description 1
- 102100031798 Protein eva-1 homolog A Human genes 0.000 description 1
- 102100021538 Protein kinase C zeta type Human genes 0.000 description 1
- 102100028680 Protein patched homolog 1 Human genes 0.000 description 1
- 102100024202 Protein phosphatase 1 regulatory subunit 14D Human genes 0.000 description 1
- 102100024556 Protein phosphatase 1 regulatory subunit 1B Human genes 0.000 description 1
- 102100023163 Protein sel-1 homolog 3 Human genes 0.000 description 1
- 102100022321 Protein spinster homolog 3 Human genes 0.000 description 1
- 102100032186 Protein tweety homolog 3 Human genes 0.000 description 1
- 108091000520 Protein-Arginine Deiminase Type 4 Proteins 0.000 description 1
- 102100035731 Protein-arginine deiminase type-4 Human genes 0.000 description 1
- 102100028087 Protein-tyrosine sulfotransferase 2 Human genes 0.000 description 1
- 102100028286 Proto-oncogene tyrosine-protein kinase receptor Ret Human genes 0.000 description 1
- 102100040144 Protocadherin beta-9 Human genes 0.000 description 1
- 101710193909 Protochlorophyllide reductase, chloroplastic Proteins 0.000 description 1
- 102100032350 Protransforming growth factor alpha Human genes 0.000 description 1
- 102100032494 Pseudouridylate synthase 7 homolog Human genes 0.000 description 1
- 102100032779 Pseudouridylate synthase 7 homolog-like protein Human genes 0.000 description 1
- 102100022993 Putative ATP synthase subunit g 2, mitochondrial Human genes 0.000 description 1
- 102100040383 Putative Dol-P-Glc:Glc(2)Man(9)GlcNAc(2)-PP-Dol alpha-1,2-glucosyltransferase Human genes 0.000 description 1
- 102100028594 Putative KRAB domain-containing protein ZNF788 Human genes 0.000 description 1
- 102100038206 Putative L-type amino acid transporter 1-like protein MLAS Human genes 0.000 description 1
- 102100034145 Putative POU domain, class 5, transcription factor 1B Human genes 0.000 description 1
- 102100034621 Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 Human genes 0.000 description 1
- 102100028010 Putative STAG3-like protein 2 Human genes 0.000 description 1
- 102100035035 Putative U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 1 Human genes 0.000 description 1
- 102100032387 Putative aldo-keto reductase family 1 member C8 Human genes 0.000 description 1
- 102100022713 Putative ankyrin repeat domain-containing protein 26-like 1 Human genes 0.000 description 1
- 102100040609 Putative apolipoprotein(a)-like protein 2 Human genes 0.000 description 1
- 102100032409 Putative coiled-coil domain-containing protein 26 Human genes 0.000 description 1
- 102100027276 Putative glycosyltransferase ALG1-like Human genes 0.000 description 1
- 102100040378 Putative glycosyltransferase ALG1L2 Human genes 0.000 description 1
- 102100029134 Putative histone-lysine N-methyltransferase PRDM6 Human genes 0.000 description 1
- 102100037201 Putative homeobox protein NANOG2 Human genes 0.000 description 1
- 102100036164 Putative phospholipase B-like 2 Human genes 0.000 description 1
- 102100033003 Putative pituitary tumor-transforming gene 3 protein Human genes 0.000 description 1
- 102100034191 Putative protein MSS51 homolog, mitochondrial Human genes 0.000 description 1
- 102100028441 Putative protein ZNF321 Human genes 0.000 description 1
- 102100028834 Putative protein encoded by LINC02912 Human genes 0.000 description 1
- 102100034760 Putative pyridoxal-dependent decarboxylase domain-containing protein 2 Human genes 0.000 description 1
- 102100031285 Putative ubiquitin carboxyl-terminal hydrolase 41 Human genes 0.000 description 1
- 102100029919 Putative uncharacterized protein C2orf48 Human genes 0.000 description 1
- 102100038714 Putative uncharacterized protein CRYM-AS1 Human genes 0.000 description 1
- 102100027323 Putative uncharacterized protein FAM30A Human genes 0.000 description 1
- 102100040780 Putative uncharacterized protein FBXL19-AS1 Human genes 0.000 description 1
- 102100022482 Putative uncharacterized protein IGF2BP2-AS1 Human genes 0.000 description 1
- 102100029277 Putative uncharacterized protein NTM-AS1 Human genes 0.000 description 1
- 102100027312 Putative uncharacterized protein SSBP3-AS1 Human genes 0.000 description 1
- 102100034829 Putative uncharacterized protein encoded by LINC01554 Human genes 0.000 description 1
- 102100027606 Putative uncharacterized protein encoded by LINC02910 Human genes 0.000 description 1
- 102100025010 Putative zinc finger and SCAN domain-containing protein 5D Human genes 0.000 description 1
- 102100024668 Putative zinc finger protein 66 Human genes 0.000 description 1
- 102100039407 Pyrroline-5-carboxylate reductase 1, mitochondrial Human genes 0.000 description 1
- 101710109491 Pyruvate synthase subunit PorA Proteins 0.000 description 1
- 101710109487 Pyruvate synthase subunit PorB Proteins 0.000 description 1
- 101710109489 Pyruvate synthase subunit PorC Proteins 0.000 description 1
- 101710109484 Pyruvate synthase subunit PorD Proteins 0.000 description 1
- 101150040459 RAS gene Proteins 0.000 description 1
- 101150076031 RAS1 gene Proteins 0.000 description 1
- 102100038421 RBBP8 N-terminal-like protein Human genes 0.000 description 1
- 102100027566 RNA 3'-terminal phosphate cyclase-like protein Human genes 0.000 description 1
- 102100023749 RNA polymerase II elongation factor ELL3 Human genes 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 102100039689 RNA-binding motif, single-stranded-interacting protein 3 Human genes 0.000 description 1
- 102100023862 RNA-binding protein 41 Human genes 0.000 description 1
- 102100038822 RNA-binding protein 47 Human genes 0.000 description 1
- 102100022880 Rab proteins geranylgeranyltransferase component A 2 Human genes 0.000 description 1
- 102100028208 Raftlin Human genes 0.000 description 1
- 102100034590 Rap guanine nucleotide exchange factor 5 Human genes 0.000 description 1
- 102100033450 Ras guanyl-releasing protein 3 Human genes 0.000 description 1
- 102100039765 Ras-related protein Rab-27B Human genes 0.000 description 1
- 102100038416 Ras-related protein Rab-40A-like Human genes 0.000 description 1
- 101100322557 Rattus norvegicus Adam1 gene Proteins 0.000 description 1
- 102100039613 RecQ-mediated genome instability protein 2 Human genes 0.000 description 1
- 102100024235 Receptor for retinol uptake STRA6 Human genes 0.000 description 1
- 102100033734 Receptor-interacting serine/threonine-protein kinase 4 Human genes 0.000 description 1
- 102100020718 Receptor-type tyrosine-protein kinase FLT3 Human genes 0.000 description 1
- 102100039664 Receptor-type tyrosine-protein phosphatase H Human genes 0.000 description 1
- 102100039665 Receptor-type tyrosine-protein phosphatase epsilon Human genes 0.000 description 1
- 102100030000 Recombining binding protein suppressor of hairless Human genes 0.000 description 1
- 102100033134 Recombining binding protein suppressor of hairless-like protein Human genes 0.000 description 1
- 101710148333 Regulator of G-protein signaling 13 Proteins 0.000 description 1
- 102100021035 Regulator of G-protein signaling 18 Human genes 0.000 description 1
- 102100037415 Regulator of G-protein signaling 3 Human genes 0.000 description 1
- 101710140411 Regulator of G-protein signaling 3 Proteins 0.000 description 1
- 102100029521 Regulatory solute carrier protein family 1 member 1 Human genes 0.000 description 1
- 102100032445 Relaxin receptor 2 Human genes 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 102100030542 Replication factor C subunit 4 Human genes 0.000 description 1
- 208000007660 Residual Neoplasm Diseases 0.000 description 1
- 108010002342 Retinoblastoma-Like Protein p107 Proteins 0.000 description 1
- 102000000582 Retinoblastoma-Like Protein p107 Human genes 0.000 description 1
- 102100033194 Rho guanine nucleotide exchange factor 11 Human genes 0.000 description 1
- 102100032206 Rho guanine nucleotide exchange factor TIAM2 Human genes 0.000 description 1
- 102100031754 Ribitol-5-phosphate transferase FKTN Human genes 0.000 description 1
- 102100033918 Ribonucleoprotein PTB-binding 2 Human genes 0.000 description 1
- 102100036320 Ribonucleoside-diphosphate reductase large subunit Human genes 0.000 description 1
- 102100024914 Ribosomal protein S6 kinase-related protein Human genes 0.000 description 1
- 102100029834 Ribosome biogenesis protein BRX1 homolog Human genes 0.000 description 1
- 102100035119 Ribosome biogenesis protein WDR12 Human genes 0.000 description 1
- 102100027702 Roundabout homolog 1 Human genes 0.000 description 1
- 108010005256 S100 Calcium Binding Protein A7 Proteins 0.000 description 1
- 102100034018 SAM pointed domain-containing Ets transcription factor Human genes 0.000 description 1
- 108091005487 SCARB1 Proteins 0.000 description 1
- 108010005173 SERPIN-B5 Proteins 0.000 description 1
- 102100027719 SH2 domain-containing adapter protein D Human genes 0.000 description 1
- 102100037647 SH3 and cysteine-rich domain-containing protein 3 Human genes 0.000 description 1
- 102100021782 SH3 domain-containing protein 19 Human genes 0.000 description 1
- 102100029989 SHC SH2 domain-binding protein 1 Human genes 0.000 description 1
- 102100029214 SLAM family member 8 Human genes 0.000 description 1
- 108091006620 SLC12A2 Proteins 0.000 description 1
- 108091006632 SLC13A3 Proteins 0.000 description 1
- 108091006593 SLC15A2 Proteins 0.000 description 1
- 108091006741 SLC22A8 Proteins 0.000 description 1
- 108091006697 SLC24A1 Proteins 0.000 description 1
- 108091006467 SLC25A36 Proteins 0.000 description 1
- 108091006500 SLC25A6P5 Proteins 0.000 description 1
- 108091006505 SLC26A2 Proteins 0.000 description 1
- 108091006951 SLC35E1P1 Proteins 0.000 description 1
- 108091007000 SLC44A3 Proteins 0.000 description 1
- 108091007561 SLC44A4 Proteins 0.000 description 1
- 108091007564 SLC44A5 Proteins 0.000 description 1
- 102000005021 SLC6A13 Human genes 0.000 description 1
- 108060007752 SLC6A13 Proteins 0.000 description 1
- 108091006241 SLC7A11 Proteins 0.000 description 1
- 108091006232 SLC7A5 Proteins 0.000 description 1
- 108091006235 SLC7A5P1 Proteins 0.000 description 1
- 108091006656 SLC9A7 Proteins 0.000 description 1
- 108091006687 SLCO1A2 Proteins 0.000 description 1
- 108091006730 SLCO1B3 Proteins 0.000 description 1
- 102100021995 SPARC-related modular calcium-binding protein 1 Human genes 0.000 description 1
- 102100032395 SPOC domain-containing protein 1 Human genes 0.000 description 1
- 102100028009 STAG3-like protein 3 Human genes 0.000 description 1
- 102100024792 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1 Human genes 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 102100037118 Scavenger receptor class B member 1 Human genes 0.000 description 1
- 101100010298 Schizosaccharomyces pombe (strain 972 / ATCC 24843) pol2 gene Proteins 0.000 description 1
- 102100023843 Selenoprotein P Human genes 0.000 description 1
- 108091061939 Selfish DNA Proteins 0.000 description 1
- 102100027718 Semaphorin-4A Human genes 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 102100031733 Serine incorporator 2 Human genes 0.000 description 1
- 102100031056 Serine protease 57 Human genes 0.000 description 1
- 102100033196 Serine protease HTRA4 Human genes 0.000 description 1
- 102100036970 Serine protease inhibitor Kazal-type 13 Human genes 0.000 description 1
- 102100024377 Serine protease inhibitor Kazal-type 8 Human genes 0.000 description 1
- 102100023665 Serine/arginine repetitive matrix protein 3 Human genes 0.000 description 1
- 102100031075 Serine/threonine-protein kinase Chk2 Human genes 0.000 description 1
- 102100034285 Serine/threonine-protein phosphatase 6 regulatory ankyrin repeat subunit A Human genes 0.000 description 1
- 102100030420 Serpin A9 Human genes 0.000 description 1
- 102100030333 Serpin B5 Human genes 0.000 description 1
- 102100032016 Serum amyloid A-4 protein Human genes 0.000 description 1
- 102100035476 Serum paraoxonase/arylesterase 1 Human genes 0.000 description 1
- 102100028816 Sex comb on midleg-like protein 2 Human genes 0.000 description 1
- 102100029947 Sialic acid-binding Ig-like lectin 6 Human genes 0.000 description 1
- 102100031451 Signal-induced proliferation-associated 1-like protein 2 Human genes 0.000 description 1
- 102100027099 Signal-induced proliferation-associated 1-like protein 3 Human genes 0.000 description 1
- 102100032770 Signal-regulatory protein beta-1 isoform 3 Human genes 0.000 description 1
- 102100025259 Signal-transducing adaptor protein 2 Human genes 0.000 description 1
- 102100023007 Single-stranded DNA-binding protein 2 Human genes 0.000 description 1
- 102100029809 Small nuclear ribonucleoprotein E Human genes 0.000 description 1
- 102100036758 Small nuclear ribonucleoprotein F Human genes 0.000 description 1
- 102100036768 Small nuclear ribonucleoprotein G Human genes 0.000 description 1
- 102100031321 Small subunit processome component 20 homolog Human genes 0.000 description 1
- 102100029935 Smith-Magenis syndrome chromosomal region candidate gene 5 protein Human genes 0.000 description 1
- 102100029795 Sodium-dependent glucose transporter 1 Human genes 0.000 description 1
- 102100033929 Sodium-dependent noradrenaline transporter Human genes 0.000 description 1
- 102100029971 Sodium/hydrogen exchanger 7 Human genes 0.000 description 1
- 102100029410 Sodium/potassium-transporting ATPase subunit beta-1-interacting protein 1 Human genes 0.000 description 1
- 102100036947 Sodium/potassium/calcium exchanger 1 Human genes 0.000 description 1
- 102100035208 Solute carrier family 13 member 3 Human genes 0.000 description 1
- 102100021488 Solute carrier family 15 member 2 Human genes 0.000 description 1
- 102100035227 Solute carrier family 22 member 8 Human genes 0.000 description 1
- 102100030106 Solute carrier family 25 member 36 Human genes 0.000 description 1
- 102100032846 Solute carrier organic anion transporter family member 1A2 Human genes 0.000 description 1
- 102100027239 Solute carrier organic anion transporter family member 1B3 Human genes 0.000 description 1
- 102100024483 Sororin Human genes 0.000 description 1
- 101710168938 Sphingosine-1-phosphate phosphatase 2 Proteins 0.000 description 1
- 102100022282 Spindle and centriole-associated protein 1 Human genes 0.000 description 1
- 102100022915 Spindle and kinetochore-associated protein 1 Human genes 0.000 description 1
- 102100023505 Spindlin-2A Human genes 0.000 description 1
- 102100026815 Spindlin-4 Human genes 0.000 description 1
- 102100037079 Splicing regulatory glutamine/lysine-rich protein 1 Human genes 0.000 description 1
- 102100025560 Squalene monooxygenase Human genes 0.000 description 1
- 102100020929 Sterile alpha motif domain-containing protein 12 Human genes 0.000 description 1
- 102100034291 Sterile alpha motif domain-containing protein 9 Human genes 0.000 description 1
- 108010048349 Steroidogenic Factor 1 Proteins 0.000 description 1
- 102100029856 Steroidogenic factor 1 Human genes 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 102100029540 Structural maintenance of chromosomes protein 2 Human genes 0.000 description 1
- 102100022842 Structural maintenance of chromosomes protein 4 Human genes 0.000 description 1
- 102100037464 Succinate receptor 1 Human genes 0.000 description 1
- 102100030113 Sulfate transporter Human genes 0.000 description 1
- 102100029867 Sulfotransferase 2A1 Human genes 0.000 description 1
- 108010002687 Survivin Proteins 0.000 description 1
- 102100021696 Syncytin-1 Human genes 0.000 description 1
- 102100021742 Syncytin-2 Human genes 0.000 description 1
- 102100029960 Syntaxin-19 Human genes 0.000 description 1
- 102100021678 Syntaxin-binding protein 4 Human genes 0.000 description 1
- 102100038409 T-box transcription factor TBX3 Human genes 0.000 description 1
- 102100027222 T-lymphocyte activation antigen CD80 Human genes 0.000 description 1
- 101150057140 TACSTD1 gene Proteins 0.000 description 1
- 102000003715 TNF receptor-associated factor 4 Human genes 0.000 description 1
- 108090000008 TNF receptor-associated factor 4 Proteins 0.000 description 1
- 102100039142 TOG array regulator of axonemal microtubules protein 1 Human genes 0.000 description 1
- 102100022608 TOX high mobility group box family member 3 Human genes 0.000 description 1
- 108091007283 TRIM24 Proteins 0.000 description 1
- 102000003621 TRPC5 Human genes 0.000 description 1
- 101150042815 TRPC5 gene Proteins 0.000 description 1
- 102100040649 Taste receptor type 2 member 10 Human genes 0.000 description 1
- 102100036737 Taste receptor type 2 member 13 Human genes 0.000 description 1
- 102100036720 Taste receptor type 2 member 14 Human genes 0.000 description 1
- 102100021233 Taste receptor type 2 member 19 Human genes 0.000 description 1
- 102100024842 Taste receptor type 2 member 3 Human genes 0.000 description 1
- 102100021235 Taste receptor type 2 member 30 Human genes 0.000 description 1
- 102100025866 Taste receptor type 2 member 31 Human genes 0.000 description 1
- 102100024854 Taste receptor type 2 member 4 Human genes 0.000 description 1
- 102100035216 Taste receptor type 2 member 41 Human genes 0.000 description 1
- 102100029030 Taste receptor type 2 member 42 Human genes 0.000 description 1
- 102100029031 Taste receptor type 2 member 43 Human genes 0.000 description 1
- 102100029029 Taste receptor type 2 member 46 Human genes 0.000 description 1
- 102100027220 Taste receptor type 2 member 50 Human genes 0.000 description 1
- 102100027216 Taste receptor type 2 member 60 Human genes 0.000 description 1
- 102100023277 Tau-tubulin kinase 1 Human genes 0.000 description 1
- 102100033227 Teneurin-2 Human genes 0.000 description 1
- 102100026404 Teratocarcinoma-derived growth factor 1 Human genes 0.000 description 1
- 102100024991 Tetraspanin-12 Human genes 0.000 description 1
- 102100032802 Tetraspanin-8 Human genes 0.000 description 1
- 102100031768 Tetratricopeptide repeat protein 24 Human genes 0.000 description 1
- 102100023273 Tetratricopeptide repeat protein 39C Human genes 0.000 description 1
- 101150050472 Tfr2 gene Proteins 0.000 description 1
- 102100037357 Thymidylate kinase Human genes 0.000 description 1
- 102100024702 Thymosin beta-15A Human genes 0.000 description 1
- 102100026640 Tight junction protein ZO-3 Human genes 0.000 description 1
- 102100040526 Tissue alpha-L-fucosidase Human genes 0.000 description 1
- 102100024605 Torsin-4A Human genes 0.000 description 1
- 102100034424 Transcription cofactor HES-6 Human genes 0.000 description 1
- 102100023477 Transcription cofactor vestigial-like protein 2 Human genes 0.000 description 1
- 102100035097 Transcription factor 7-like 1 Human genes 0.000 description 1
- 102100031556 Transcription factor E2F7 Human genes 0.000 description 1
- 102100028503 Transcription factor EC Human genes 0.000 description 1
- 102100027263 Transcription factor ETV7 Human genes 0.000 description 1
- 102100022415 Transcription factor SOX-11 Human genes 0.000 description 1
- 102100022281 Transcription factor Spi-B Human genes 0.000 description 1
- 102100022011 Transcription intermediary factor 1-alpha Human genes 0.000 description 1
- 102100031080 Transcription termination factor 2 Human genes 0.000 description 1
- 102100023185 Transcriptional repressor scratch 1 Human genes 0.000 description 1
- 102100023178 Transcriptional repressor scratch 2 Human genes 0.000 description 1
- 102100026143 Transferrin receptor protein 2 Human genes 0.000 description 1
- 102100026239 Translation machinery-associated protein 16 Human genes 0.000 description 1
- 102100034906 Transmembrane 4 L6 family member 19 Human genes 0.000 description 1
- 102100031994 Transmembrane channel-like protein 5 Human genes 0.000 description 1
- 102100026222 Transmembrane gamma-carboxyglutamic acid protein 4 Human genes 0.000 description 1
- 102100033021 Transmembrane protein 14EP Human genes 0.000 description 1
- 102100036759 Transmembrane protein 211 Human genes 0.000 description 1
- 102100035058 Transmembrane protein 229B Human genes 0.000 description 1
- 102100024181 Transmembrane protein 45B Human genes 0.000 description 1
- 108010078184 Trefoil Factor-3 Proteins 0.000 description 1
- 102100039145 Trefoil factor 3 Human genes 0.000 description 1
- 102100033387 Treslin Human genes 0.000 description 1
- 102100022347 Tripartite motif-containing protein 15 Human genes 0.000 description 1
- 102100026412 Tripartite motif-containing protein 60 Human genes 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 102100033080 Tropomyosin alpha-3 chain Human genes 0.000 description 1
- 102000004987 Troponin T Human genes 0.000 description 1
- 108090001108 Troponin T Proteins 0.000 description 1
- 102100036860 Troponin T, slow skeletal muscle Human genes 0.000 description 1
- 102100028985 Tubulin alpha-1C chain Human genes 0.000 description 1
- 102100026157 Tubulin epsilon and delta complex protein 2 Human genes 0.000 description 1
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 1
- 102100024568 Tumor necrosis factor ligand superfamily member 11 Human genes 0.000 description 1
- 102100024587 Tumor necrosis factor ligand superfamily member 15 Human genes 0.000 description 1
- 102100035283 Tumor necrosis factor ligand superfamily member 18 Human genes 0.000 description 1
- 102100031988 Tumor necrosis factor ligand superfamily member 6 Human genes 0.000 description 1
- 102100040245 Tumor necrosis factor receptor superfamily member 5 Human genes 0.000 description 1
- 102100040418 Tumor protein D52 Human genes 0.000 description 1
- 108010046308 Type II DNA Topoisomerases Proteins 0.000 description 1
- 108010035075 Tyrosine decarboxylase Proteins 0.000 description 1
- 102100033018 Tyrosine-protein phosphatase non-receptor type 18 Human genes 0.000 description 1
- 102100033131 Tyrosine-protein phosphatase non-receptor type 3 Human genes 0.000 description 1
- 102100033136 Tyrosine-protein phosphatase non-receptor type 4 Human genes 0.000 description 1
- 102100031474 U11/U12 small nuclear ribonucleoprotein 25 kDa protein Human genes 0.000 description 1
- 102100033313 U6 snRNA-associated Sm-like protein LSm3 Human genes 0.000 description 1
- 102100029633 UDP-glucuronosyltransferase 2B15 Human genes 0.000 description 1
- 101710200683 UDP-glucuronosyltransferase 2B15 Proteins 0.000 description 1
- 102100029785 UDP-glucuronosyltransferase 2B4 Human genes 0.000 description 1
- 101710200334 UDP-glucuronosyltransferase 2B4 Proteins 0.000 description 1
- 102100037610 UHRF1-binding protein 1 Human genes 0.000 description 1
- 102100040011 UL16-binding protein 3 Human genes 0.000 description 1
- 102100025739 Uncharacterized membrane protein C3orf80 Human genes 0.000 description 1
- 102100030193 Uncharacterized protein C1orf53 Human genes 0.000 description 1
- 102100039599 Uncharacterized protein C21orf62 Human genes 0.000 description 1
- 102100038119 Uncharacterized protein C6orf223 Human genes 0.000 description 1
- 102100037992 Uncharacterized protein CD300LD-AS1 Human genes 0.000 description 1
- 102100029827 Unconventional myosin-X Human genes 0.000 description 1
- 102100028185 Unhealthy ribosome biogenesis protein 2 homolog Human genes 0.000 description 1
- 102100029591 V(D)J recombination-activating protein 2 Human genes 0.000 description 1
- 102100038295 V-set and immunoglobulin domain-containing protein 2 Human genes 0.000 description 1
- 101150102979 VIL1 gene Proteins 0.000 description 1
- 102100039112 Vacuolar protein sorting-associated protein 13C Human genes 0.000 description 1
- 102100023517 Vang-like protein 1 Human genes 0.000 description 1
- 102100038286 Vasoactive intestinal polypeptide receptor 2 Human genes 0.000 description 1
- 101710137651 Vasoactive intestinal polypeptide receptor 2 Proteins 0.000 description 1
- 102100033419 Villin-1 Human genes 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 102100024141 Voltage-dependent calcium channel gamma-2 subunit Human genes 0.000 description 1
- 102100024138 Voltage-dependent calcium channel gamma-3 subunit Human genes 0.000 description 1
- 102100024143 Voltage-dependent calcium channel gamma-4 subunit Human genes 0.000 description 1
- 108010020277 WD repeat containing planar cell polarity effector Proteins 0.000 description 1
- 102100020877 WD repeat-containing and planar cell polarity effector protein fritz homolog Human genes 0.000 description 1
- 102100039413 WD repeat-containing protein 49 Human genes 0.000 description 1
- 102100038092 WD repeat-containing protein 76 Human genes 0.000 description 1
- 230000006682 Warburg effect Effects 0.000 description 1
- 102000013814 Wnt Human genes 0.000 description 1
- 108050003627 Wnt Proteins 0.000 description 1
- 210000001766 X chromosome Anatomy 0.000 description 1
- 102100026867 Xaa-Arg dipeptidase Human genes 0.000 description 1
- 102100039102 ZW10 interactor Human genes 0.000 description 1
- 102100023893 ZZ-type zinc finger-containing protein 3 Human genes 0.000 description 1
- 102100025798 Zinc finger BED domain-containing protein 3 Human genes 0.000 description 1
- 102100025788 Zinc finger BED domain-containing protein 4 Human genes 0.000 description 1
- 102100025792 Zinc finger BED domain-containing protein 6 Human genes 0.000 description 1
- 102100028877 Zinc finger CCCH-type antiviral protein 1-like Human genes 0.000 description 1
- 102100025100 Zinc finger MYM-type protein 1 Human genes 0.000 description 1
- 102100025439 Zinc finger protein 100 Human genes 0.000 description 1
- 102100023570 Zinc finger protein 121 Human genes 0.000 description 1
- 102100023573 Zinc finger protein 124 Human genes 0.000 description 1
- 102100023394 Zinc finger protein 138 Human genes 0.000 description 1
- 102100028616 Zinc finger protein 14 homolog Human genes 0.000 description 1
- 102100023391 Zinc finger protein 141 Human genes 0.000 description 1
- 102100036556 Zinc finger protein 221 Human genes 0.000 description 1
- 102100021361 Zinc finger protein 253 Human genes 0.000 description 1
- 102100028392 Zinc finger protein 26 Human genes 0.000 description 1
- 102100021360 Zinc finger protein 260 Human genes 0.000 description 1
- 102100026334 Zinc finger protein 275 Human genes 0.000 description 1
- 102100028399 Zinc finger protein 28 Human genes 0.000 description 1
- 102100040319 Zinc finger protein 280D Human genes 0.000 description 1
- 102100026418 Zinc finger protein 283 Human genes 0.000 description 1
- 102100028430 Zinc finger protein 296 Human genes 0.000 description 1
- 102100025438 Zinc finger protein 367 Human genes 0.000 description 1
- 102100025435 Zinc finger protein 37A Human genes 0.000 description 1
- 102100023550 Zinc finger protein 42 homolog Human genes 0.000 description 1
- 102100024666 Zinc finger protein 43 Human genes 0.000 description 1
- 102100021414 Zinc finger protein 439 Human genes 0.000 description 1
- 102100039970 Zinc finger protein 491 Human genes 0.000 description 1
- 102100026528 Zinc finger protein 519 Human genes 0.000 description 1
- 102100034649 Zinc finger protein 551 Human genes 0.000 description 1
- 102100040787 Zinc finger protein 566 Human genes 0.000 description 1
- 102100040656 Zinc finger protein 573 Human genes 0.000 description 1
- 102100021103 Zinc finger protein 618 Human genes 0.000 description 1
- 102100035818 Zinc finger protein 621 Human genes 0.000 description 1
- 102100035801 Zinc finger protein 625 Human genes 0.000 description 1
- 102100026454 Zinc finger protein 660 Human genes 0.000 description 1
- 102100039039 Zinc finger protein 675 Human genes 0.000 description 1
- 102100039054 Zinc finger protein 678 Human genes 0.000 description 1
- 102100039053 Zinc finger protein 681 Human genes 0.000 description 1
- 102100040717 Zinc finger protein 69 Human genes 0.000 description 1
- 102100027855 Zinc finger protein 695 Human genes 0.000 description 1
- 102100040709 Zinc finger protein 70 Human genes 0.000 description 1
- 102100040660 Zinc finger protein 708 Human genes 0.000 description 1
- 102100040725 Zinc finger protein 714 Human genes 0.000 description 1
- 102100040719 Zinc finger protein 717 Human genes 0.000 description 1
- 102100040722 Zinc finger protein 718 Human genes 0.000 description 1
- 102100024711 Zinc finger protein 724 Human genes 0.000 description 1
- 102100024708 Zinc finger protein 726 Human genes 0.000 description 1
- 102100024698 Zinc finger protein 736 Human genes 0.000 description 1
- 102100024715 Zinc finger protein 737 Human genes 0.000 description 1
- 102100028578 Zinc finger protein 772 Human genes 0.000 description 1
- 102100023624 Zinc finger protein 805 Human genes 0.000 description 1
- 102100026589 Zinc finger protein 850 Human genes 0.000 description 1
- 102100026590 Zinc finger protein 852 Human genes 0.000 description 1
- 102100026474 Zinc finger protein 878 Human genes 0.000 description 1
- 102100021373 Zinc finger protein 888 Human genes 0.000 description 1
- 102100021375 Zinc finger protein 891 Human genes 0.000 description 1
- 102100039070 Zinc finger protein 91 Human genes 0.000 description 1
- 102100039046 Zinc finger protein 92 Human genes 0.000 description 1
- 102100025531 Zinc finger protein Gfi-1b Human genes 0.000 description 1
- 102100036606 Zinc finger protein ZFAT Human genes 0.000 description 1
- 102100025105 Zinc fingers and homeoboxes protein 1 Human genes 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 208000009956 adenocarcinoma Diseases 0.000 description 1
- 208000037842 advanced-stage tumor Diseases 0.000 description 1
- 208000006682 alpha 1-Antitrypsin Deficiency Diseases 0.000 description 1
- 125000003275 alpha amino acid group Chemical group 0.000 description 1
- 108010075843 alpha-2-HS-Glycoprotein Proteins 0.000 description 1
- 102000012005 alpha-2-HS-Glycoprotein Human genes 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 208000007502 anemia Diseases 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000001640 apoptogenic effect Effects 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 101150056596 azin2 gene Proteins 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 1
- 208000005980 beta thalassemia Diseases 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000017531 blood circulation Effects 0.000 description 1
- 238000010241 blood sampling Methods 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 230000006931 brain damage Effects 0.000 description 1
- 231100000874 brain damage Toxicity 0.000 description 1
- 201000007983 brain glioma Diseases 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 102100038086 cAMP-dependent protein kinase inhibitor alpha Human genes 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000003683 cardiac damage Effects 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 231100000005 chromosome aberration Toxicity 0.000 description 1
- 108091092240 circulating cell-free DNA Proteins 0.000 description 1
- 230000004087 circulation Effects 0.000 description 1
- 230000007882 cirrhosis Effects 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- 201000010897 colon adenocarcinoma Diseases 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000005574 cross-species transmission Effects 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 229940127276 delta-like ligand 3 Drugs 0.000 description 1
- 229960003964 deoxycholic acid Drugs 0.000 description 1
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 1
- 108010028753 deoxyhypusine hydroxylase Proteins 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 108010049285 dephospho-CoA kinase Proteins 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 1
- FOCAHLGSDWHSAH-UHFFFAOYSA-N difluoromethanethione Chemical compound FC(F)=S FOCAHLGSDWHSAH-UHFFFAOYSA-N 0.000 description 1
- CZWHMRTTWFJMBC-UHFFFAOYSA-N dinaphtho[2,3-b:2',3'-f]thieno[3,2-b]thiophene Chemical compound C1=CC=C2C=C(SC=3C4=CC5=CC=CC=C5C=C4SC=33)C3=CC2=C1 CZWHMRTTWFJMBC-UHFFFAOYSA-N 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 238000002224 dissection Methods 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 108010041998 erythrocyte membrane protein band 4.1-like 1 Proteins 0.000 description 1
- 229940105423 erythropoietin Drugs 0.000 description 1
- 201000005619 esophageal carcinoma Diseases 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 238000013213 extrapolation Methods 0.000 description 1
- 102100021145 fMet-Leu-Phe receptor Human genes 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 108010022790 formyl-methenyl-methylenetetrahydrofolate synthetase Proteins 0.000 description 1
- 230000004547 gene signature Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 230000037442 genomic alteration Effects 0.000 description 1
- 230000008826 genomic mutation Effects 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 150000003278 haem Chemical class 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 231100000753 hepatic injury Toxicity 0.000 description 1
- 108010027263 homeobox protein HOXA9 Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 208000024312 invasive carcinoma Diseases 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000000302 ischemic effect Effects 0.000 description 1
- 238000011528 liquid biopsy Methods 0.000 description 1
- 206010025135 lupus erythematosus Diseases 0.000 description 1
- 102100034098 mRNA turnover protein 4 homolog Human genes 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000008774 maternal effect Effects 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000004066 metabolic change Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108091074368 miR-216 stem-loop Proteins 0.000 description 1
- 108091086642 miR-216a stem-loop Proteins 0.000 description 1
- 108091043104 miR-3065 stem-loop Proteins 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 230000033607 mismatch repair Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000001343 mnemonic effect Effects 0.000 description 1
- 238000004802 monitoring treatment efficacy Methods 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 230000003680 myocardial damage Effects 0.000 description 1
- 230000002107 myocardial effect Effects 0.000 description 1
- 230000001338 necrotic effect Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000012753 partial hepatectomy Methods 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000001991 pathophysiological effect Effects 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000002861 polymer material Substances 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010000627 pregnancy-specific beta-1-glycoprotein 7 Proteins 0.000 description 1
- 238000003793 prenatal diagnosis Methods 0.000 description 1
- 238000009609 prenatal screening Methods 0.000 description 1
- 230000000135 prohibitive effect Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 238000012514 protein characterization Methods 0.000 description 1
- 238000002005 protein protein interaction detection Methods 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000002762 protein-protein interaction assay Methods 0.000 description 1
- 238000010384 proximity ligation assay Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 102000016914 ras Proteins Human genes 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000020874 response to hypoxia Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 210000003765 sex chromosome Anatomy 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 206010041823 squamous cell carcinoma Diseases 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000010741 sumoylation Effects 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 102100025028 tRNA (guanine-N(7)-)-methyltransferase Human genes 0.000 description 1
- 102100021774 tRNA-splicing endonuclease subunit Sen2 Human genes 0.000 description 1
- 238000010381 tandem affinity purification Methods 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011277 treatment modality Methods 0.000 description 1
- 239000000439 tumor marker Substances 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 102000009310 vitamin D receptors Human genes 0.000 description 1
- 108050000156 vitamin D receptors Proteins 0.000 description 1
- 102100039762 von Willebrand factor A domain-containing protein 2 Human genes 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1003—Extracting or separating nucleic acids from biological samples, e.g. pure separation or isolation methods; Conditions, buffers or apparatuses therefor
- C12N15/1006—Extracting or separating nucleic acids from biological samples, e.g. pure separation or isolation methods; Conditions, buffers or apparatuses therefor by means of a solid support carrier, e.g. particles, polymers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6834—Enzymatic or biochemical coupling of nucleic acids to a solid phase
Definitions
- the present invention is in the field of cell free DNA-protein complex analysis.
- cfDNA circulating cell-free DNA
- ctDNA circulating tumor DNA
- the presence of cfDNA has been recognized for decades, with typical length of cfDNA fragments around multiples of ⁇ 166 bp—the length of mononucleosomal DNA ( ⁇ 146 bp), with some additional linker DNA.
- the plasma of healthy people contains the equivalent of ⁇ 1000 genomes per ml, with up to 100 times more cfDNA present in many pathologies (e.g. cancer) and some physiological conditions (e.g. following exercise).
- cfDNA as a diagnostic tool has greatly expanded in recent years. For example, next-generation sequencing of fetal cfDNA in maternal blood is now used for non-invasive prenatal screening/diagnosis of chromosomal abnormalities and parental derived mutations. Because of the very low amount of cfDNA present in a blood sample, most current cfDNA diagnostic methods rely on mutations with the cfDNA to distinguish it from cfDNA from healthy tissues and blood cells. Indeed, white blood cell cfDNA is by far the greatest contributor to the total cfDNA pool and can make cfDNA diagnosis of conditions from other tissues difficult.
- cfDNA-based methods rely on detecting genomic alterations in cfDNA to quantify the contribution of cfDNA from cells with altered genomic sequence, such as fetus, a transplant, or mutated genes in tumors. Thus, these methods are biased towards a set of pre-selected genes and are blind to events that involve turnover and death of cells whose genome is identical to the host genome. More recent approaches leverage epigenetic information in cell free DNA. Extremely deep sequencing of total cfDNA can provide data that reflect tissue of origin and gene expression.
- DNA methylation serves as a stable epigenetic memory and is mostly unchanged after differentiation. As such, it is highly informative regarding cell lineage, but much less about transient changes in expression, and cells originating from a close or same lineage. Moreover, unbiased analysis of DNA methylation requires high sequencing depth since most CpGs are methylated.
- a method that allows for accurately determining the origin of cfDNA, as well as providing information on the molecular events that occurred in the cell close to the time of cell death would allow not just for early diagnosis of conditions that were unknown to the doctor or patient, but also may help in tailoring a treatment to newly discovered malady.
- the present invention provides methods of determining the origin of cell free DNA (cfDNA), for detecting death of a cell type or tissue, for determining a cellular state of a cell in a subject, and combinations thereof, by sequencing cfDNA isolated by extracting proteins and modified proteins bound to that cfDNA. Computer program products for doing same are also provided.
- cfDNA cell free DNA
- a method of determining a cellular state, tissue of origin, cell type or a combination thereof of a cell that released its DNA comprising:
- a computer program product for determining a cell or tissue of origin of cell free DNA (cfDNA), comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to
- a computer program product for determining a cellular state, tissue of origin, cell type or a combination thereof of a cell in a subject as the cell died, comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to
- a solid support comprising a capturing agent and a barcoding reagent.
- a method for multiplexing an assay on more than one molecule of interest in a single solution comprising:
- the sample is from a subject.
- the cell that released its DNA is a dead cell and the method is for detecting death of at least one of:
- the cellular state is a disease state.
- the disease state is selected from bacteremia, cancer, pre-cancer, infection, neurodegenerative disease, tissue damage, cardiac disease, liver disease, inflammation, autoimmune disease, arthritis, liver inflammation, bowel inflammation, autoimmune disease, tissue damage from drug side effects, tissue necrosis, and diabetes.
- the disease state is selected from cardiac disease or damage, brain disease or damage, gastrointestinal disease or damage, cancer, bacteremia, infection and liver disease or damage.
- At least 500 genomes of cfDNA are provided.
- the designating can be performed with as little as 0.1% of the cfDNA in the sample being from the cell type, the tissue, or the cellular state.
- the reagent is selected from an antibody or antigen binding fragment thereof, a protein, or a small molecule.
- the reagent is conjugated to a physical support.
- the DNA-associated protein is selected from a histone, a high-mobility group (HMG) protein and a member of the transcriptional machinery.
- the histone is a histone variant and/or a modified histone.
- the histone variant is selected from Histone 3 monomethylated lysine 4 (H3K4me1), Histone 3 demethylated lysine 4 (H3K4me2), Histone 3 trimethylated lysine 36 (H3K36me3) and Histone 3 trimethylated lysine 4 (H3K4me3).
- the reagent is an anti-modified histone antibody or fragment thereof.
- association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is within a tissue, cell type or cellular state specific gene or enhancer element or is at a disease-specific mutation.
- association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is within a repressor element, or a gene silenced in the tissue, cell-type or cellular state, or is at a disease-specific mutation.
- the method of the invention further comprises performing steps a-d again using a reagent that binds to a second DNA-associated protein, and wherein the second DNA associated protein is different from the first DNA-associated protein.
- the method of the invention comprises contacting the sample with at least 2 reagents, wherein each regent is bound to a physical support and the support comprises a short DNA tag unique to each reagent, wherein upon sequencing the isolated cfDNA the short DNA tag identifies the reagent that isolated the cfDNA.
- the designating comprises comparing the sequenced cfDNA to at least 10 genomic locations with the greatest unique association of the DNA-associated protein in a tissue, cell type or cellular state, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 10 genomic locations is considered to be from the tissue, cell type or cellular state.
- the DNA-associated protein is a marker of active transcription and the designating comprises comparing the sequenced cfDNA to a known transcriptional program of a tissue, cell type or cellular state, wherein a cfDNA with a sequence that is from a gene transcribed in the transcriptional program is from the tissue, cell type or cellular state.
- the designating comprises comparing the sequenced cfDNA to a DNA-protein association atlas of at least 5 cell types or tissues, wherein the atlas comprises at least 10 genomic location with the greatest unique association of the DNA-associated protein in each of the 5 cell types or tissues, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 10 genomic locations is considered to be from the tissue or cell type.
- the designating comprises comparing the sequenced cfDNA to a transcriptional program atlas of at least 5 transcriptional programs, wherein the atlas comprises at least one genomic location with the greatest unique association of the DNA-associated protein in each of the 5 transcriptional programs and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least one genomic location indicates activation of the transcriptional program.
- the cellular state is selected from: hypoxia, inflammation, ER stress, mitochondrial stress, interferon response, quiescence, senescence, cycling, malignant, and calcium flux.
- the informative genomic location is selected from a promoter, an enhancer element, a silencer element, a gene body and a disease-associated mutation.
- the method of the invention is wherein:
- the method of the invention is for use in detecting a disease state in the subject.
- the method of the invention is wherein the detecting a disease state comprises at least one of:
- the method of the invention further comprises treating the subject with a suitable treatment based on the cellular state, tissue of origin, cell type or a combination thereof of the cell that died in the subject.
- the solid support is a magnetic or paramagnetic bead, or an agarose bead.
- the capturing agent is a protein.
- the capturing protein is an antibody, or antigen binding fragment thereof.
- the barcoding reagent is a short nucleic acid molecule.
- the nucleic acid molecule is between 5 and 30 nucleotides.
- the capturing agent and barcoding reagent are conjugated to the solid support.
- the molecule of interest is a protein or a nucleic acid molecule.
- the assay is chromatin immunoprecipitation followed by sequencing (ChIP-Seq).
- FIGS. 1 A-I ( 1 A) Outline of the proposed method. Chromatin fragments from different cells in the body are released to the blood. These are immunoprecipitated, and sequenced. Interpretation of the resulting sequences informs of tissue-of-origin and gene activity program. Inset-cfChIP protocol, using antibody covalently bound to paramagnetic beads. Target fragments are immunoprecipitated directly from plasma. After removing the plasma, and washing the beads bound to target fragments, on-bead-ligation is used to add sequencing adapters (possibly with indexing barcodes) to the fragments and isolation of ligated DNA, and PCR amplification a sequencing-ready library is ready.
- sequencing adapters possibly with indexing barcodes
- FIG. 1 E Histogram of the distribution of sizes of sequenced cfChIP fragments shows clear mono- and di-nucleosome sizes.
- 1 F Browser view of cfChIP-seq signal over two regions with megakaryocyte-specific genes that appear in healthy subject but not in ChIP of blood cells and solid tissues.
- 1 G Browser view of non-PBMC H3K4me3 signal at promoters of selected genes (similar to FIG. 1 F ). Upper and lower panels depict cfChIP and tissue ChIP signal, respectively.
- FIG. 1 H Browser view of mouse CTCF signal at known CTCF sites. The sites are confirmed by the depletion of H3K4me3 signal.
- 1 I Meta-analysis of mouse CTCF (upper) and H3K4me3 (lower) signals throughout the mouse genome.
- FIGS. 2 A-M ( 2 A- 2 B) Scatter plots comparing cfChIP with anti-H3K4me1 and anti-H3K4me3 antibodies of ( 2 A) technical replicates and ( 2 B) 1 male and 1 female healthy individual. Each point is a 2 kb window in the genome, the x and y axises are the number of reads (in log(x+1) scale) mapped to the window in the two samples. Color code reflects density of points.
- 2 C Histograms of the correlations between H3K4me3 cfChIP samples of healthy subjects.
- Correlations of counts in 2 kb windows (as FIG.
- ( 2 G) Line graph estimation of the probability of detecting a specific location with different number of reads. The probability of detection was estimated by down-sampling from the actual result. Bars represent 95% confidence interval for the estimate.
- ( 2 H) Line graph extrapolation of spike-in for larger signatures size. Shown are the probability of detecting 0.1% male for two sample sizes.
- top ChIP-seq of PBMC (leukocytes) vs RNA-seq of PBMC.
- FIGS. 3 A-J A bar chart of H3K4me3 cfChIP-seq signal in heart-specific windows in four healthy subjects and samples from myocardial infarction (MI) patients. Inset, measured troponin levels at the time the blood samples were drawn.
- 3 B Examples of browser views of signals at heart-specific windows. Each browser section displays 20 kb region around windows (marked with gray background). Tracks are all normalized and shown to the same scale. Top tracks show cfChIP samples and bottom ChIP-seq of tissue samples (bottom) from Roadmap Epigenomics Atlas.
- 3 C Dot plot comparisons to external indications of cardiomyocyte death.
- x-axis measured Troponin levels (top panel) cardiomyocyte fraction as measured using DNA methylation markers (bottom panel).
- y-axis strength of Heart-specific signature (relative to healthy subjects).
- 3 D Heatmap showing the level (Brown scale), and significance (Blue scale) of selected cell-type signatures in healthy subjects and myocardial infarction patients. Each cell in the map is divided in half, the top left half represents statistical significance (FDR corrected q-value) and the bottom half density of reads in the signature (normalized reads per kb).
- 3 E Heatmap of the tissue signatures for all samples; and extension of 3 D and 3 I.
- 3 F Examples of browser view of of Liver-specific windows that are part of evaluated signature (see FIG. 3 B ).
- ( 3 G) Line graph of the change in signature strength in a myocardial infarction patient before/after PCI. Signatures strength are normalized to healthy subjects. The variability among healthy subjects is shown on the left. We can see initially high level of liver cells and elevated levels of heart cells. Following PCI liver cell decline and heart cells increase.
- ( 3 H) Dot plot comparisons to external indications of cancer patients and liver signature. Presented as in FIG. 3 C .
- FIGS. 4 A-H ( 4 A) Heatmap (as in FIG. 3 D ) showing processes Hallmark genes that are over-represented in subjects (compared to healthy baseline). See FIG. 4 C for full table with all hallmarks and subjects. ( 4 B) Examples of browser view of genes with higher than expected signal in these expression signatures (see FIG. 3 B ). ( 4 C) Heatmap of hallmarks signatures for all samples and signatures. Extends FIG. 4 A . ( 4 D) Browser view of H3K4me3 cfChIP and tissue ChIP signals at promoters of selected glycolytic genes (see FIG. 4 B ). ( 4 E) Exemplary scatter plots of method for defining genes with elevated signal at a specific sample.
- FIGS. 5 A-M ( 5 A) Histograms of meta-analysis of cfChIP signal over active promoters and enhancers.
- 5 B-C Browser view of tracks of cfChIP of H3K4me3, H3K4me2, and H3K4me13 from a healthy subject.
- 5 B Shown is a region of highly expressed genes. We can see di-methylation and mono-methylation extending out from the tri-methylation signal.
- 5 C Shown is the locus surrounding IFNB1. ChromHMM tracks show prediction of promoters and enhancers according to combination of histone modification and chromatin accessibility assays. Arrows mark regions with enriched di- and mono-methylation.
- Color dots represent genes that are overexpressed in colorectal adenocarcinoma (COAD—red) or glioblastoma multiforme (GBM—green).
- COAD—red colorectal adenocarcinoma
- GBM—green glioblastoma multiforme
- 5 M Browser view of H3K4me3 and H3K36me3 signals at genes that show differential levels of these marks between healthy subject and colorectal adenocarcinoma patient.
- VIL1 gene shows differential signal for the two marks while CTDSP1 shows similar levels of H3K4me3 but marked increase in H3K36me3 in the sample from the colorectal adenocarcinoma patient.
- FIGS. 6 A-C ( 6 A-C) Line graph examples of the background estimate for ( 6 A) a healthy male sample, ( 6 B) a healthy female sample and ( 6 C) a cancer patient.
- FIG. 7 Work flow of processing and analysis of cf-ChIP.
- FIG. 8 A meta-plot (top) and heatmap (bottom) for 1,000 highly expressed promoters and the location relative to the transcriptional start site (TSS) of H3K4me3 in cf-nucleosomes from plasma that had already undergone cfChIP with anti-H3K4me1 antibodies.
- TSS transcriptional start site
- FIG. 9 Meta-plots (above) and heatmaps (below) for 1000 highly expressed promoters and the location relative to the TSS of H3K9Ac, H3K27ac and H2A.Zac in cf-nucleosomes from plasma of healthy patient and one colorectal cancer patient.
- FIGS. 10 A-D ( 10 A) Schematic drawing depicting the scheme of multiplexed ChIP-Seq.
- 10 B Schematic drawing of the experiment performed to test mixing during MPL based ChIP-Seq.
- Each rectangle represent an MPL barcoded surface that combines a unique barcode (BC1-BC4) in combination with anti H3K4me3 (K4) or anti H3K36me3 (K36) antibody each targeting a chromatin modification with distinct genomic location.
- Ellipses are chromatin from two yeast species: S. cerevisiae and K. lactis .
- Various mixing is then performed either before library prep (pink circle in the middle) or after.
- the present invention provides methods of determining the origin of cell free DNA (cfDNA), detecting death of a cell type or tissue in a subject and for determining a cellular state of a cell as it died by determining DNA-protein associations from transcriptionally active or inactive chromatin in the subject.
- the methods of the invention are based on the surprising finding that cell free nucleosomes retain protein-DNA associations that are informative about not only the tissue/cell of origin of the nucleosome, but also the pathways that were active and inactive in the cell as it died. Further, this was surprisingly possible even though the amount of cf-nucleosomes that can be captured was very small. As little as a thousand genomes worth of cf-nucleosomes was sufficient to perform the methods of the invention.
- the methods of the invention can be performed with very little input cfDNA, as little as 1000 genomes, and with very shallow sequencing, as little as 0.5M reads.
- the technique can be so performed because only positive associations are examined.
- the entire cfDNA is not sequenced, rather only cfDNA bound by a particular protein (e.g. a modified histone) is isolated and sequenced. Since only a small fraction of the cfDNA is sequenced the process is cheaper, faster and can be done with a lower depth of sequencing.
- Even within this smaller sample only informative genomic loci are examined; most locations are not informative about the tissue/cell of origin or the pathways active in the cell that died. By only examining informative loci, much of the noise present in the cfDNA can be ignored.
- cfChIP has the potential to circumvent many of the limitations that exist in current analysis of cfDNA.
- Targeted enrichment of active marks results in reduced representation of the genome such that fewer sequencing reads ( ⁇ two orders of magnitude less) are required to obtain informative signal. Since we target marks associated with active transcription, we are assaying a positive signal, where few reads are indicative to the presence of a particular cell type or expression program. This is in contrast to methods such as occupancy or DNA methylation that either measure negative signal (lack of nucleosome occupancy) or both negative and positive signals (e.g., % methylated).
- the cfChIP assay leaves most of the original sample intact, enabling using the same material for multiple assays (e.g., genomic sequencing, methylation analysis, or cfChIP with additional antibodies), which is important where blood volume is a limiting factor.
- cfChIP A key factor in using cfDNA-based assay for detection of cfDNA from rare cells, such as in early cancer diagnosis, is low detection limit.
- cfChIP detects “positive” signal, thus even low signals contribute significantly.
- cfChIP can be performed with various antibodies targeting different genomic regions and states thus generating large signatures and the range of hundreds or thousands of sites with differential signal between different tissues or transcriptional programs.
- cfChIP is by nature a low representation method cfChIP is unbiased, as all the captured DNA fragments are sequenced.
- Assaying modified cf-nucleosomes has multiple potential medical applications, such as early disease detection (e.g., detecting unknown tumors), improved diagnosis (e.g., replacing tissue biopsy with liquid biopsy), and non-invasive monitoring of disease progression and treatment efficacy.
- a method of determining a cellular state, tissue of origin, cell type or a combination thereof of a cell that released its DNA comprising:
- a method of determining a cell or tissue of origin of cell free DNA comprising:
- a method of determining a cell or tissue of origin of cell free DNA comprising sequencing cfDNA isolated by the cfDNA's binding to a DNA-associated protein; wherein the isolated cfDNA comprises a DNA sequence of an informative genomic location and association of the DNA-associated protein with the informative genomic location is indicative of a cell type or tissue; thereby determining the cell or tissue of origin of cfDNA.
- a method of determining a cellular state of a cell in a subject comprising:
- a method of determining a cellular state of a cell in a subject, as the cell died comprising:
- a method of determining a cellular state of a cell in a subject comprising sequencing cfDNA isolated by the cfDNA's binding to a DNA-associated protein; wherein the isolated cfDNA comprises a DNA sequence of an informative genomic location and association of the DNA-associated protein with the informative genomic location is indicative of a cellular state; thereby determining a cellular state of a cell in a subject.
- the cell is a cell that has died in the subject.
- a method of determining a cellular state, tissue of origin or cell type of a cell in a subject, as the cell died comprising:
- a method of determining a cellular state, tissue of origin or cell type of a cell in a subject, as the cell died comprising:
- the method is for determining a cellular state of the cell. In some embodiments, the method is for determining a tissue of origin of the cell. In some embodiments, the method is for determining the cell type of the cell. In some embodiments, the sample is from a subject and the method is for detecting death of any one of: cells of a tissue, a cell type and cells in a cellular state in the subject. In some embodiments, the sample is from a subject and the method is for detecting a disease in the subject, wherein death of cells of a tissue, of a cell type or in a cellular state are indicative so the disease. For non-limiting example, death of liver cells may be indicative of liver disease, death of GI cells may indicate GI cancer, death of cells with active interferon response may indicate an infection and death of beta cells may indicate pancreatic damage/disease.
- detecting a disease state comprises at least one of: early detection of the disease state, detection of residual disease and monitoring disease progression. In some embodiments, detecting a disease state comprises early detection. In some embodiments, early detection comprises detection during routine blood work. In some embodiments, early detection comprises detection before development of symptoms. In some embodiments, the residual disease is residual metastatic disease. In some embodiments, residual disease is residual cancer after surgery. In some embodiments, disease monitoring comprises monitoring before treatment. In some embodiments, disease monitoring comprises monitoring after treatment. In some embodiments, disease monitoring comprises monitoring disease relapse. In some embodiments, disease monitoring comprises monitoring treatment efficacy.
- the cell died. In some embodiments, the cell released its DNA. In some embodiments, a cell that released its DNA is a dead and/or dying cell or a cell that denucleated. In some embodiments, a cell that released its DNA is a dead and/or dying cell. In some embodiments, the cell death is selected from apoptotic death and necrotic death. In some embodiments, a denucleated cell is an erythrocyte. In some embodiments a cell that is losing its nucleus is an erythroblast. Erythroblasts lose their nucleus to become erythrocytes, and as such the lost nucleus may appear in cfDNA.
- the sample is from a subject.
- the cfDNA is from a subject and detecting a cfDNA molecule of a cell tissue of origin or cellular state indicates detection of death of that cell type, tissue or cellular state.
- the subject is suspected of having increased cell death.
- the subject is not suspected of having increased cell death.
- the subject appears healthy and/or is not known to suffer from a disease or condition.
- the determining is determining the cellular state of the cell as it died.
- the methods of the invention are further for determining a cellular state of a cell in a subject as the cell died and further comprise designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating from a cell in a cellular state, which association of the DNA-associated protein with the informative genomic location is indicative of the cellular state.
- the sample is a bodily fluid.
- the bodily fluid is blood.
- the bodily fluid is selected from at least one of: blood, serum, gastric fluid, intestinal fluid, saliva, bile, tumor fluid, cerebrospinal fluid, breast milk, semen, urine, vaginal fluid, interstitial fluid, and stool.
- Standard techniques for cell-free DNA extraction are known to a skilled artisan, a non-limiting example of which is the QIAamp Circulating Nucleic Acid kit (QIAGEN).
- a reagent that binds refers to any protein binding molecule or composition. Protein binding is well known in the art and may be assessed by any assay known in the art, including but not limited to yeast-2-hybrid, immunoprecipitation, competition assay, phage display, tandem affinity purification, and proximity ligation assay.
- the reagent is a proteinaceous molecule.
- the reagent is selected from an antibody or antigen binding fragment thereof, a protein and a small molecule. Small molecules that bind to specific proteins are well known in the art and may be used for pull-down experiments. Additionally, well characterized protein-protein interactions may be used for pull-downs. Indeed, any reagent that may be used for precipitation, immunoprecipitation (IP) or chromatin immunoprecipitation (ChIP), may be used as the reagent.
- the reagent is an antibody or antigen binding fragment thereof.
- antibody refers to a polypeptide or group of polypeptides that include at least one binding domain that is formed from the folding of polypeptide chains having three-dimensional binding spaces with internal surface shapes and charge distributions complementary to the features of an antigenic determinant of an antigen.
- An antibody typically has a tetrameric form, comprising two identical pairs of polypeptide chains, each pair having one “light” and one “heavy” chain. The variable regions of each light/heavy chain pair form an antibody binding site.
- An antibody may be oligoclonal, polyclonal, monoclonal, chimeric, camelised, CDR-grafted, multi-specific, bi-specific, catalytic, humanized, fully human, anti-idiotypic and antibodies that can be labeled in soluble or bound form as well as fragments, including epitope-binding fragments, variants or derivatives thereof, either alone or in combination with other amino acid sequences.
- An antibody may be from any species.
- the term antibody also includes binding fragments, including, but not limited to Fv, Fab, Fab′, F(ab′)2 single stranded antibody (svFC), dimeric variable region (Diabody) and disulphide-linked variable region (dsFv).
- antibodies include immunoglobulin molecules and immunologically active fragments of immunoglobulin molecules, i.e., molecules that contain an antigen binding site.
- Antibody fragments may or may not be fused to another immunoglobulin domain including but not limited to, an Fc region or fragment thereof.
- fusion products may be generated including but not limited to, scFv-Fc fusions, variable region (e.g., VL and VH) ⁇ Fc fusions and scFv-scFv-Fc fusions.
- Immunoglobulin molecules can be of any type (e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.g., IgG1, IgG2, IgG3, IgG4, IgA1 and IgA2) or subclass.
- type e.g., IgG, IgE, IgM, IgD, IgA and IgY
- class e.g., IgG1, IgG2, IgG3, IgG4, IgA1 and IgA2 or subclass.
- one reagent is contacted. In some embodiments, at least one reagent is contacted. In some embodiments, more than one reagent is contacted. In some embodiments, each reagent binds a different DNA-associated protein. In some embodiments, the DNA-associated protein is a sequence specific DNA-binder, and more than one reagent targeting more than one protein is contacted. Since the target sequence of binding is known, following sequencing of the isolated cfDNA the sequences can be assigned to each binding reagent based on the target motif present in the sequence. Sequences containing more than one motif can either be discarded or included as bound by multiple DNA-associated proteins.
- the reagent is conjugated to a physical support.
- the term “physical support” refers to a solid and stable molecule that gives support to the reagent.
- the support is a scaffold or scaffolding agent.
- the support is a resin.
- the support is a bead.
- the support is a magnetic or paramagnetic bead. Magnetic beads may be purchased for examples from Dynabeads or Pierce.
- the support is an agarose bead.
- the support is a protein A/G bead.
- the reagent is conjugated to the physical support before the contacting.
- the conjugating is a covalent linkage.
- the conjugating is by epoxy chemistry.
- the support aids in isolation of the reagent, wherein the isolating is isolating the physical support.
- DNA-associated protein refers to any protein that can be precipitated with DNA or when precipitated brings along DNA.
- the DNA-associated protein directly binds DNA.
- the DNA-associated protein is a component of chromatin.
- the DNA-associated protein binds-indirectly to DNA.
- the DNA-associated protein binds to genomic DNA.
- the DNA-binding protein binds in the promoter.
- the DNA-binding protein binds in a gene body.
- the DNA-binding protein binds to a cis or trans regulatory element.
- the DNA-associated protein binds DNA and is a non-sequence specific DNA binder. In some embodiments, the DNA-associated protein binds DNA is a sequence specific DNA binder or a non-sequence specific DNA binder. Examples of non-sequence specific DNA binders include histones, high-mobility group (HMG) proteins, members of the DNA damage repair machinery and members of the general transcriptional machinery.
- HMG high-mobility group
- the general transcriptional machinery is well defined and includes, but is not limited to, RNA polymerases, DNA helicases, general cofactors, the splicing machinery and the polyA machinery.
- the DNA damage repair machinery is also well defined and includes, but is not limited to, members of the nucleotide excision repair pathway, base excision repair pathway and the mismatch repair system.
- the DNA-associated protein is a modified protein.
- the modification is a post-translational modification.
- the reagent binds to the modified form of the protein. In some embodiments, the reagent binds only or predominantly to the modified form of the protein.
- the DNA-associated protein is a histone, modified histone or histone variant.
- Modifications to the histone tail are well known in the art, and include but are not limited to methylation, acetylation, sumoylation, ubiquitylation and phosphorylation. Modifications may be multiple such as tri-methylation or poly-ubiquitylation. In some embodiments, a tail may have multiple modifications such as methylation and phosphorylation.
- the histone may be one of the core histones, H1, H2A, H2B, H3 and H4, or it may be a histone variant such as, for non-limiting example, H2A.z, gammaH2AX, H1T, and H3.3.
- the modified or variant histone has an activating function or a repressing function on transcription. In some embodiments, the modified or variant contributes to the formation of euchromatin or heterochromatin. In some embodiments, the modified histone is selected from Histone 3 monomethylated lysine 4 (H3K4me1), Histone 3 demethylated lysine 4 (H3K4me2), Histone 3 trimethylated lysine 36 (H3K36me3) and Histone 3 trimethylated lysine 4 (H3K4me3).
- the DNA-associated protein binds DNA and is a sequence specific DNA binder.
- sequence specific DNA binders include but are not limited to transcription factors (TFs), activators, repressors, insulators, DNA modifying enzymes and members of the general transcriptional machinery.
- the DNA-associated protein is a transcription factor.
- the DNA-associated protein is an insulator.
- the transcription factor is selected from an activator, a repressor, an insulator, a DNA modifying enzyme and a member of the general transcriptional machinery.
- the transcription factor is selected from an activator, a repressor, and an insulator.
- the transcription factor is an insulator.
- the transcription factor is CTCF.
- transcription factor refers to any protein that is not part of the general transcriptional machinery but controls/modulates the rate of transcription of a DNA sequence.
- TFs are factors that bind in a promoter region. Transcription factors are well known in the art, as are reagents that bind to them. Performing ChIP with TFs is also well known.
- the agent binds a transcription factor that binds to tissue and/or cell type specific enhancer elements.
- the DNA sequence in the cfDNA is the sequence located at the tissue and/or cell type specific enhancer element. Due to the tissue/cell type specificity, association of the TF with this element in the cfDNA indicates the cfDNA is from that tissue and/or cell type. Since the enhancer element enhances transcription of a particular target the target may indicate the cellular state of the cell. In this one, one association of TF to genomic locus can be informative of both the tissue/cell of origin and the cellular state. A non-limiting example of this is tissue-specific NF-kB enhancer binding.
- NF-kB is known to bind at specific loci only in various tissues (cardiac for example) and mediate inflammation.
- tissue-specific enhancer sequence in the cfDNA indicates not only the cellular origin of the cfDNA, but also that the cell was in an inflammatory state at the time of death.
- the DNA-associated protein is a transcription factor (TF), and the binding site is the TF binding site.
- activators refer to proteins that increase transcription. In some embodiments, activators bind to enhancer elements in DNA. In some embodiments, activators bind to promoter proximal or distal elements. As used herein, “repressors” refer to proteins that decrease transcription. In some embodiments, repressors bind to repressor elements in DNA. In some embodiments, repressors bind to promoter proximal or distal elements.
- insulators refers to proteins that separate regions of DNA that have different chromatin architecture or transcriptional rates.
- an insulator is an enhancer-blocker.
- an insulator separates euchromatin and heterochromatin.
- Non-limiting examples of insulators in include CTCF, gypsy and BDF1.
- insulators bind outside of the promoter and gene bodies.
- DNA-modifying enzymes are well known in art, and examples include members of the base/nucleotide excision repair machinery, DNA methyltransferases and DNA demethylases.
- the DNA-associated protein does not bind DNA.
- the protein modifies a protein that binds DNA. Examples of such include but are not limited to histone modifying enzymes and polycomb proteins.
- association of the DNA-associated protein with an informative genetic locus is tissue or cell type specific. In some embodiments, association of the DNA-associated protein with an informative genetic locus is differentiation specific. In some embodiments, association of the DNA-associated protein with an informative genetic locus is cellular state specific. In some embodiments, association of the DNA-associated protein with an informative genetic locus is indicative of transcriptional activation, active transcription, transcriptionally active chromatin or a combination thereof. In some embodiments, association of the DNA-associated protein with an informative genetic locus is indicative of transcriptional silencing, lack of transcription, transcriptionally inactive chromatin or a combination thereof.
- the transcription need not be at the genetic locus of binding, but may be at a near or far gene, such as is the case with activators and suppressors.
- the term “genetic locus” and “genomic location” are synonymous and refer to a particular region of DNA that can be bound by a protein.
- the genetic locus is a TF binding site, or some other short sequence of DNA.
- the locus is between 2 and 20, 2 and 16, 2 and 12, 2 and 10, 2 and 8, 2 and 6, 2 and 4, 4 and 20, 4 and 16, 4 and 12, 4 and 10, 4 and 8 or 4 and 6 base pairs. Each possibility represents a separate embodiment of the invention.
- the locus is a nucleosome, or a nucleosome length of DNA ( ⁇ 170 bp).
- the genetic locus is between 150 and 190, or 160 and 180 bp.
- the terms “informative genomic location” and “informative genetic locus” are used synonymously and refer to a unique DNA sequence in a particular location in the genome that when associated with a given DNA-associated protein is informative of the cell in which the association occurs. In some embodiments, it is informative of the tissue of origin or cell type of the cell in which the association occurs. In some embodiments, the location is a tissue or cell type specific binding/association site. In some embodiments, the binding/association is not specific/unique, but highly enriched in the tissue or cell type. In some embodiments, it is informative of the cellular state of the cell in which the association occurs. In some embodiments, it is informative of both the tissue of origin and/or cell type and the cellular state of the cell in which the association occurs. In some embodiments, it is informative of a disease in the cell. In some embodiments, it is informative of a transcriptional program in the cell.
- transcriptional program refers to a group of genes that act in concert transcriptionally.
- the genes may be actively transcribed and/or repressed, and/or inactive, and/or accessible, and/or inaccessible.
- the genes are all transcriptional regulated together.
- a transcriptional program is indicative of a cellular state.
- a transcriptional program is indicative of an active signaling pathway.
- Signatures for tissue specific, cell type specific, cellular state specific and/or transcriptional programs can be found in, for example, the Roadmap Epigenomics Project (roadmapepigenomics.org), the Cancer Genome Atlas (cancergenome.nih.gov), the Genotype-Tissue Expression (GTEx) Project (gtexportal.org) or the Xena project (xena.ucsc.edu). Tables provided herein also provide such signatures.
- the reagent binds histones. In some embodiments, the reagent is an anti-histone antibody or fragment thereof. In some embodiments, the reagent binds modified or variant histones. In some embodiments, the reagent is an anti-modified histone antibody or fragment thereof. In some embodiments, the reagent is an anti-variant histone antibody or fragment thereof. In some embodiments, the reagent is selected from anti-Histone 3 monomethylated lysine 4 (H3K4me1) and anti-Histone 3 trimethylated lysine 4 (H3K4me3) antibodies.
- the isolating comprises isolating a physical support conjugated to the reagent. In some embodiments, the isolating comprises contacting the reagent and bound proteins and DNA with the physical support and then isolating the physical support. In some embodiments, the methods of the invention comprise ChIP. In some embodiments, the isolating comprises ChIP. In some embodiments, the isolating comprised washing steps.
- the sequencing comprises sequencing at least an average of 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequencing comprises sequencing at least 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the amplified cfDNA comprises less than 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequencing is at a depth of at most 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention.
- the sequencing comprises PCR amplification of the cfDNA. In some embodiments, the amplification comprises ligation of a barcode or additional DNA sequence. In some embodiments, the amplification is performed while the cfDNA is still associated with the protein. In some embodiments, the amplification is performed while the cfDNA is still associated with the physical support. In some embodiments, the amplification is performed without disassociating the cfDNA from the reagent and/or the support.
- the method further comprises comparing the sequencing data to tissue/cell type specific data of DNA binding proteins, wherein binding of a protein to a sequence that is specifically bound by that protein in a tissue/cell type indicates the cfDNA is from that tissue/cell type.
- Tissue/cell type specific binding data can be found in sources such as the Encode consortium, the NIH Epigenome Roadmap consortium and the Gene Transcription Regulation Database to name but a few.
- the genomic location is within a tissue or cell-type specific gene or element.
- the protein is associated with active transcription and the genomic location is within a tissue or cell-type specific gene or enhancer element.
- Non-limiting examples of this include H3K4me3 located in tissue specific genes, or H3K4me1 in enhancers.
- tissue specific genes include TNNI3 and MYBPC3 in heart cells, and C8a and C8b in liver cells.
- Tissue expression levels for specific proteins can be found on numerous websites including the Uniprot database (www.uniprot.org), and the GTEx portal (www.gtexportal.org) for example. Tissue specific gene expression and regulation can also be found in numerous locations, most notably the TiGER database (bioinfo.wilmer.jhu.edu/tiger), and the Human Protein Atlas (www.proteinatlas.org).
- the protein is associated with silenced transcription and the genomic location is within a repressor element, or a gene specifically silenced in the tissue or cell-type.
- Tissue-specific protein-DNA binding is well known in the art and can be found in the resources described herein above. Any informative locus binding may be used to determine the source of the cfDNA.
- the sequencing data is compared to at least the top 1, 2, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 peaks of binding in a particular tissue/cell type.
- Each possibility represents a separate embodiment of the invention.
- only one particular tissue/cell type is investigated.
- binding data from at least 1, 2, 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 tissues or cell/types is used to compare to the sequencing data.
- Each possibility represents a separate embodiment of the invention.
- the methods of the invention further comprise comparing the sequenced cfDNA to at least 1 genomic location with the greatest association of the DNA-associated protein in a tissue, cell type and/or cellular state, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location is considered to be from that tissue, cell type and/or cellular state.
- the at least one genomic location has the greatest unique association of the DNA-associated protein.
- unique association refers to an association that occurs only, or nearly exclusively, within a tissue or cell type, or a cell state.
- the 10 most unique locations are selected for example, locations with protein binding only in a certain tissue, cell type or state should be examined, and specifically the 10 with the highest binding should be selected. Should there not be the required number of sites with completely unique binding, then the site with the most unique binding should be selected. Any determination of greatest uniqueness may be used. Examples of such include, but are not limited to, having binding in the fewest other tissues, and having the highest difference in the amount of binding between the tissue of interest and another tissue.
- the methods of the invention further comprise comparing the sequenced cfDNA to a DNA-protein association atlas of at least 2 cell types and/or tissues, wherein the atlas comprises at least 1 genomic location with the greatest association of the DNA-associated protein in each of the 2 tissues, cell types and/or cellular states, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location is considered to be from that t tissue, cell type and/or cellular state.
- genomic locations have the greatest unique association of the DNA-associated protein.
- the atlas is of at least 1, 2, 3, 5, 7, 10, 15, or 20 cell types and/or tissues. Each possibility represents a separate embodiment of the invention.
- the atlas comprises at least 1, 2, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 75, 80, 90 or 100 genomic locations per tissue, cell type and/or cellular state. Each possibility represents a separate embodiment of the invention.
- genomic locations that when associated with a DNA-associated protein are indicative of a tissue or cell type can be found in Table 1.
- Table 1 gives exemplary locations for H3K4me3 tissue-informative locations.
- the atlas comprises all or a portion of the location in Table 1.
- the sequencing of cfDNA is compared to Table 1.
- Thymus RAG1 chr11 36579801-36602400 Thymus RAG1 chr11: 36582401-36604800 Thymus RAG2 chr11: 36608001-36630200 Thymus CD1E chr1: 158316001-158338000 Thymus CD1B chr1: 158287601-158309400 Thymus OR10R3P chr1: 158449201-158471600 Thymus CD1E chr1: 158314001-158335200 Thymus CCR9 chr3: 45919001-45940200 Thymus .
- chr17 74980401-75001800 Thymus AK057554
- chr1 158451801-158474200 Thymus .
- chr1 158322001-158345000 Thymus .
- chr15 58058601-58080800 Thymus .
- chr12 123546401-123567800 Thymus CTB-26E19.1 chr5: 177857801-177880400 Thymus CD1B chr1: 158291401-158313200 Thymus CD1B chr1: 158290401-158311200 Thymus RP11-35J23.1 chr10: 98057201-98079800 Thymus CD1E chr1: 158315201-158336000 Thymus AC005780.1 chr19: 28863601-28886000 Thymus CD1B chr1: 158289401-158310200 Thymus PTCRA chr6: 42873201-42894800 Thymus ARL5C chr17: 37315801-37338400 Thymus .
- chr2 57914401-57936600 Thymus ARPP21-AS1 chr3: 35681801-35704200 Brain DLX1 chr2: 172935201-172957800 Brain LINC00461 chr5: 87956001-87978400 Brain SRRM4 chr12: 119409601-119432400 Brain MIR9-3; LINC00925 chr15: 89896801-89919000 Brain SLC32A1 chr20: 37341401-37363400 Brain NCAN chr19: 19312401-19334400 Brain C14orf23 chr14: 29228001-29250400 Brain .
- chr7 96629601-96652200 Brain MIR9-3; LINC00925 chr15: 89894601-89916800 Brain NRXN3 chr14: 78627201-78649200 Brain TMEM161B-AS1 chr5: 87680201-87703000 Brain AKAP6 chr14: 32788401-32811000 Brain C14orf23 chr14: 29230401-29252800 Brain . chr14: 23945401-23966600 Brain GPR56 chr16: 57663601-57685200 Brain BC034423 chr14: 29233801-29256400 Brain SOX2-OT chr3: 181410201-181432400 Brain .
- chr2 172944801-172966800 Brain TIAM2 chr6: 155527601-155549800 Brain .
- chr20 37344401-37365600 Brain MBP chr18: 74716401-74738600 Brain .
- chr2 172946801-172968600 Brain NRXN3 chr14: 78629201-78651200 Brain TMEM161B-AS1 chr5: 87677201-87700200 Brain SRXN1; SCRT2 chr20: 642601-665400 Brain AK055364 chr16: 72448601-72471600 Brain .
- chr20 37343401-37364400 Brain LOC100130155 chr8: 65276601-65298200 Brain .
- chr7 96628601-96649600 Brain C10orf90 chr10: 128347801-128370000 Brain POU3F2 chr6: 99279201-99301800 Brain ERMN chr2: 158169201-158191600 Brain HAPLN2 chr1: 156578601-156601600 Brain MIR9-1 chr1: 156378001-156399600 Brain RAPGEF5 chr7: 22219201-22242000 Brain LOC100130155 chr8: 65275001-65296600 Brain CNTN2 chr1: 205002801-205024000 Brain PRKCZ chr1: 1995601-2017000 Brain SOX1 chr13: 112713201-112734400 Brain ROBO1 chr3: 78707201-78730200 Brain MBP chr18: 74713601
- chr13 112702401-112723200 Brain BCAS1 chr20: 52634401-52656200 Brain ELAVL3 chr19: 11580201-11601200 Brain CAPN3 chr15: 42686401-42708600 Brain EPHA10 chr1: 38220601-38242000 Brain .
- chr2 172932801-172954800 Brain SLC24A2 chr9: 19776001-19798400 Brain FOXG1 chr14: 29227001-29247600 Brain ABR chr17: 971001-991800 Brain SCRT1 chr8: 145547601-145568800 Brain SLCO1A2 chr12: 21466001-21488600 Brain DLX6-AS1 chr7: 96633201-96655200 Brain C14orf23 chr14: 29232801-29253800 Brain MSRA chr8: 9942801-9965800 Brain TTBK1 chr6: 43200801-43222200 Brain ANK3 chr10: 62136401-62158600 Brain SOX1 chr13: 112716001-112737600 Brain DLEU7 chr13: 51407201-51428000 Brain CACNG2 chr22: 37085801-37108800 Brain SRRM4 chr12: 119408401
- chr20 37345601-37366600 Brain OMG chr17: 29613001-29634400 Brain LINC00461 chr5: 87895401-87917800 Brain BCAS1 chr20: 52632401-52654400 Brain CNTNAP4 chr16: 76301601-76323600 Brain SLC24A2 chr9: 19773601-19796000 Brain BX537900 chr8: 65279001-65299800 Brain NR2E1 chr6: 108476201-108497000 Brain AMER2 chr13: 25730601-25752600 Brain .
- chr13 112700801-112721600 Brain AL832737 chr6: 168084401-168107400 Brain MIR9-1 chr1: 156376401-156398000 Brain . chr13: 112699401-112720800 Brain . chr7: 96632201-96653200 Brain NTM; NTM-AS1 chr11: 131522401-131544200 Brain TMTC4 chr13: 101277001-101299200 Brain .
- chr11 2601801-2623200 Brain AC017053.1 chr2: 6452401-6475400 Brain MTUS1 chr8: 17568601-17590400 Brain MOBP chr3: 39501201-39523800 Brain LRRC4C chr11: 41469601-41491600 Brain LPPR1 chr9: 104021401-104043200 Brain ANO4 chr12: 101178401-101199200 Brain ANLN chr7: 36436801-36458800 Brain LINC00634 chr22: 42337401-42358800 Brain TMEM161B-AS1 chr5: 87674201-87697200 Brain ST18 chr8: 53306801-53329800 Brain LOC400940 chr2: 6113001-6135200 Brain SOX2-OT chr3: 181318401-181341000 Brain BC040833 chr10: 91091601-91112600 Brain CTD-2049O4.1 chr16
- Muscle MYLK2 chr20 30397401-30420200 Sk. Muscle MYOG chr1: 203043801-203066400 Sk. Muscle . chr20: 3275401-3298400 Sk. Muscle ANKRD23 chr2: 97498401-97520800 Sk. Muscle . chr6: 51990401-52013000 Sk. Muscle MYF6 chr12: 81089601-81111800 Sk. Muscle AC005616.2 chr19: 28989401-29011400 Sk. Muscle NEB chr2: 152579201-152602000 Sk. Muscle MYOT chr5: 137193801-137216200 Sk.
- Muscle CLCN1 chr7 143002801-143025600 Sk. Muscle . chr6: 51993001-52015400 Sk. Muscle ART1 chr11: 3655401-3678000 Sk. Muscle PCNT chr21: 47758201-47780400 Sk. Muscle STAC3 chr12: 57633601-57656200 Sk. Muscle . chr20: 23119201-23141200 Sk. Muscle KLHL40 chr3: 42718001-42738800 Sk. Muscle FEZ2 chr2: 36808801-36831800 Sk. Muscle NEB chr2: 152576401-152599200 Sk. Muscle IDI2 chr10: 1059801-1082600 Sk.
- Muscle TPM3 chr1 154153401-154174400 Sk.
- Muscle PTPN3 chr9 112172401-112193800 Sk.
- Muscle CASQ1 chr1 160151201-160172600 Sk.
- Muscle MYADML2 chr17 79888401-79909400 Sk.
- Muscle C8orf22 chr8 49974801-49996400 Sk.
- Muscle MYOZ1 chr10 75390401-75411400 Sk. Muscle .
- chr8 6620801-6641800 Sk.
- Muscle RP11-18B3.3 chr9 108461201-108483200 Sk. Muscle . chr5: 137196601-137218200 Sk. Muscle .
- Muscle MYL1 chr2 211166001-211188600 Sk. Muscle VGLL2 chr6: 117578001-117599800 Sk. Muscle . chr13: 21384801-21407400 Sk. Muscle LSMEM1 chr7: 112110201-112133000 Sk. Muscle . chr4: 23381601-23403600 Sk. Muscle . chr15: 93507801-93529600 Sk. Muscle LOC100507537 chr3: 154998801-155020800 Sk. Muscle TNNT1 chr19: 55650601-55672600 Sk. Muscle MYBPC1 chr12: 101951201-101973800 Sk.
- chr1 231971401-231993600 Heart MYL4 chr17: 45276001-45298400 Heart TNNI3 chr19: 55658201-55679800 Heart NEBL chr10: 21171001-21193800 Heart PRDM6 chr5: 122423401-122444400 Heart . chr10: 15745201-15767200 Heart . chr3: 24220001-24242400 Heart RP11-405A12.2 chr12: 19918801-19940600 Heart SLC5A1 chr22: 32445001-32467200 Heart . chr17: 8516601-8539000 Heart . chr10: 92715801-92738400 GI Sm.
- Muscle TACR2 chr10 71165401-71187200 GI Sm. Muscle TACR2 chr10: 71163401-71185400 GI Sm. Muscle . chr19: 7409601-7432600 GI Sm. Muscle AC007392.3 chr2: 66909201-66931200 GI Sm. Muscle . chr15: 98548401-98570200 GI Sm. Muscle . chr1: 225636601-225658800 GI Sm.
- chr17 71941801-71963800
- Stomach FAM177B chr1 222900601-222922400
- Stomach ANXA10 chr4 169003801-169026000
- Stomach C5orf66-AS1 chr5 134364601-134385000
- Stomach PAX9 chr14 37122201-37142600 Stomach .
- chr14 37125401-37146400 GI Mucosa BTNL3 chr5: 180405801-180428000 GI Mucosa .
- chr1 233830001-233852800 GI Mucosa RBP2 chr3: 139183201-139205400 GI Mucosa .
- chr17 46794401-46815600 GI Mucosa FRYL chr4: 48671401-48693200 GI Mucosa UBE2V2P1 chr10: 19327801-19350000 GI Mucosa .
- chr17 71941801-71963800 GI Mucosa FAM177B chr1: 222900601-222922400 GI Mucosa REP15 chr12: 27840801-27862600 GI Mucosa .
- chr7 12830601-12853600 GI Mucosa ANXA10 chr4: 169003801-169026000 GI Mucosa FABP2 chr4: 120231801-120253400 GI Mucosa .
- chr10 3017801-3039200 GI Mucosa NXPE4 chr11: 114454001-114475600 GI Mucosa .
- chr11 76775401-76797200 GI Mucosa SLC5A1 chr22: 32429201-32450000 Digestive CLDN18 chr3: 137707601-137730000 Digestive CTSE chr1: 206306801-206329000 Digestive TFF1 chr21: 43774801-43797000 Digestive .
- chr1 233830001-233852800 Digestive SPDEF chr6: 34512401-34534000 Digestive CAPN8 chr1: 223840401-223862200 Digestive RBP2 chr3: 139183201-139205400 Digestive .
- chr17 46794401-46815600 Digestive FRYL chr4: 48671401-48693200 Digestive .
- chr11 119767401-119789600 Digestive UBE2V2P1 chr10: 19327801-19350000 Digestive AX747250 chr6: 1069601-1091400 Digestive .
- chr4 149590201-149612200 Digestive ERN2 chr16: 23713001-23734400 Digestive C5orf66-AS1 chr5: 134364601-134385000 Digestive NXPE4 chr11: 114454001-114475600 Digestive PAX9 chr14: 37122201-37142600 Digestive . chr16: 68407201-68429400 Digestive LOC93432 chr7: 141801601-141823000 Digestive . chr14: 37125401-37146400 Digestive .
- chr2 77425801-77448600 Pancreas Islet ELAVL4 chr1: 50561401-50583000 Pancreas Islet .
- chr4 43755001-43777000 Pancreas Islet GNAS chr20: 57405601-57427400 Pancreas Islet .
- chr14 62341201-62363800 Pancreas Islet CNTN4 chr3: 2544801-2567400 Pancreas Islet LINC01099 chr4: 178815401-178837400 Pancreas Islet IAPP chr12: 21517401-21539400 Pancreas SLC30A8 chr8: 118138601-118161400 Pancreas SLC30A8 chr8: 118135801-118158600 Pancreas PRSS1 chr7: 142447001-142468600 Pancreas TPST2 chr22: 26950201-26971200 Pancreas CTRB2 chr16: 75230001-75251200 Pancreas PAM chr5: 102134201-102156200 Pancreas CELA3A chr1: 22318001-22340600 Pancreas G6PC2 chr2: 169747201-169769400 Pancreas SYCN ch
- chr2 77425801-77448600 Pancreas ELAVL4 chr1: 50561401-50583000 Pancreas AQP8 chr16: 25218201-25239400 Pancreas .
- chr4 43755001-43777000 Pancreas NPHS1 chr19: 36331601-36354200 Pancreas RP11-20I20.2
- chr4 1117401-1139800 Pancreas GNAS chr20: 57405601-57427400 Pancreas .
- chr14 62341201-62363800 Pancreas CNTN4 chr3: 2544801-2567400 Pancreas LINC01099 chr4: 178815401-178837400 Pancreas IAPP chr12: 21517401-21539400 Placenta .
- chr19 9108001-9130600 Placenta LOC100506655
- chr16 25148801-25170400 Placenta
- DLG5 chr10 79605601-79627600 Placenta .
- chr6 140458801-140481400 Placenta .
- chr6 168803001-168825200 Placenta STRA6; HP11097 chr15: 74483601-74505400 Placenta ZFAT chr8: 135697001-135719600 Placenta .
- chr3 72495001-72517800 Placenta BCAR4
- chr16 11909801-11932800 Placenta ZFP42
- chr4 188906601-188928400 Placenta ERVW-1
- chr7 92095801-92117400 Placenta .
- chr11 2147001-2168000 Placenta HSD3B1 chr1: 120039801-120060800 Placenta .
- chr5 86536801-86559200 Placenta PSG1 chr19: 43371001-43394000 Placenta .
- chr1 245057601-245078800 Placenta LGALS13 chr19: 40083201-40104800 Placenta RPS20P15 chr3: 27494401-27516800 Placenta .
- chr1 120040801-120061800 Placenta GRAMD3 chr5: 125685801-125707800 Placenta AK127846 chr19: 53500801-53522400 Placenta PSG6 chr19: 43410601-43432000 Placenta BRWD1 chr21: 40682201-40703600 Placenta .
- chr7 114530401-114552000 Placenta CGA chr6: 87793201-87814800 Placenta ARID3A chr19: 949401-970800 Placenta LINC00577 chr6: 105374201-105396800 Placenta BC031304 chr2: 46645601-46667600 Placenta . chr20: 39599401-39621800 Placenta . chr11: 105376001-105397200 Placenta .
- chr1 204272601-204293400 Placenta PSG3; PSG7 chr19: 43428601-43451600 Placenta RP3-407E4.4 chr6: 64141401-64162200 Placenta .
- chr19 54140601-54162000 Placenta FAM13A chr4: 89845601-89867800 Liver .
- chr4 155518001-155542600 Liver CPS1 chr2: 211411001-211433600 Liver APOH chr17: 64212601-64235600 Liver BAAT chr9: 104134601-104157400 Liver C8B chr1: 57418401-57441000 Liver TAT chr16: 71599001-71621400 Liver CCL16 chr17: 34296601-34319200 Liver .
- chr7 45719401-45741600 Liver CCND2P1 chr11: 63001001-63023000 Liver .
- chr17 41530801-41553400 Liver .
- chr6 132157001-132179400 Liver C5orf27 chr5: 95178201-95200400 Liver CYP2B6 chr19: 41487401-41509600 Liver SERPINA10 chr14: 94747001-94768800 Liver SLC22A1 chr6: 160533601-160556200 Liver SEPP1 chr5: 42813801-42836200 Liver CPN1 chr10: 101829401-101852000 Liver CPB2 chr13: 46666201-46688000 Liver C9 chr5: 39353401-39375000 Liver KCTD21-AS1 chr11: 77870601-77892200 Liver AHSG chr3: 186320401-186342200
- chr12 111502801-111524800 Liver ADCY10 chr1: 167871201-167893400 Liver JB175316 chr12: 9269801-9291800 Liver AOC4; AF047486 chr17: 41009601-41031800 Liver TTC39C chr18: 21683801-21706400 Liver C19orf80 chr19: 11340201-11362000 Liver TFR2 chr7: 100227801-100249800 Liver APOC3 chr11: 116690001-116712000 Liver ALB chr4: 74262801-74285400 Liver AKR1C4 chr10: 5229201-5252200 Liver ITIH2 chr10: 7734201-7756400 Liver .
- chr4 77221201-77243600 Liver C8A chr1: 57310801-57332800 Liver MIR122 chr18: 56102601-56124400 Liver NCOR1 chr17: 15930001-15952600 Liver APOA2 chr1: 161182601-161204000 Liver CYP2C8 chr10: 96818001-96840200 Liver .
- chr3 16363001-16385400 Liver ABCB11 chr2: 169876201-169898000 Liver C9 chr5: 39351601-39373400 Liver HAO1 chr20: 7909801-7931600 Liver CYP2C9 chr10: 96689201-96710800 Liver FGL1 chr8: 17740801-17763200 Liver C8B chr1: 57421001-57443600 Liver PON1 chr7: 94942001-94963600 Liver CPS1 chr2: 211413601-211436000 Liver LEAP2 chr5: 132199401-132220400 Liver .
- chr3 148924401-148946600 Liver C5 chr9: 123801201-123822600 Liver AKR1D1 chr7: 137750801-137773000 Liver .
- chr5 68030601-68052600 Liver SLC22A10
- chr11 63046601-63068800 Liver C5 chr9: 123799401-123821200 Liver GBP7 chr1: 89629801-89651800 Liver .
- chr16 76555001-76577000 Liver APCS chr1: 159549001-159571200 Liver AOC4; AF047486 chr17: 41007201-41029600 Liver MASP2 chr1: 11095801-11117400 Liver U91324.1 chr2: 8269401-8292000 Liver .
- chr5 115064401-115086800 Liver GPLD1 chr6: 24477801-24500400 Liver .
- chr10 113975801-113997400 Liver PZP chr12: 9349001-9371000 Liver .
- chr2 138689001-138711400 Liver HAO1 chr20: 7907801-7929800 Liver .
- chr10 96817201-96838000 Liver APOC3
- chr11 116692001-116714000 Liver
- SERPINA11 chr14 94907801-94929400 Liver .
- chr2 31923201-31945400 Liver ITIH2 chr10: 7736401-7758600 Liver .
- chr12 24178601-24201400 Liver SERPINA10 chr14: 94748801-94769600 Liver RCL1 chr9: 4829201-4850800 Liver NR1I3 chr1: 161196001-161218400 Liver SLC17A2 chr6: 25919801-25942200 Liver RTP3 chr3: 46530201-46552000 Liver CFHR5 chr1: 196937001-196960000 Liver SLC17A2 chr6: 25917401-25939800 Liver HRG chr3: 186373201-186395000 Liver CYP2A6 chr19: 41345001-41366400 Liver GLYATL1 chr11: 58696801-58718800 Liver ADH1A chr4: 100199201-100221200 Liver .
- chr16 51074401-51096600 Liver .
- chr12 7635801-7658000 Liver .
- chr2 151010601-151033400 Liver BDH1 chr3: 197289201-197311000 Liver SLC22A10 chr11: 63048801-63071000 Liver A1BG chr19: 58853801-58875400 Liver CFHR4 chr1: 196847601-196869600 Liver RTP3 chr3: 46528401-46550200 Liver CYP4A22 chr1: 47593001-47615200 Liver .
- chr8 63909801-63931800 Liver C4BPA chr1: 207270001-207292200 Liver EPHX1 chr1: 226004401-226025800 Liver .
- chr6 132154601-132177000 Liver LDHD chr16: 75143001-75165600 Liver INHBC chr12: 57818401-57839600 Liver F12 chr5: 176825201-176846600 Liver LPAL2 chr6: 160921201-160943000 Liver FETUB chr3: 186349001-186371200 Liver SLC6A13 chr12: 336001-357400 Liver SLC22A1 chr6: 160530801-160553600 Liver ANXA10 chr4: 169075401-169098400 Liver .
- chr22 43860201-43881800 Liver IGFBP1 chr7: 45919601-45941800 Liver SLC22A9 chr11: 63126601-63148200 Liver .
- chr2 21306401-21328800 Liver APOC4-APOC2 chr19: 45435401-45457000 Liver CSAD chr12: 53552001-53574400 Liver .
- chr1 84222001-84244800 Liver HABP2
- chr10 115304201-115326000 Liver INHBA chr7: 41737001-41758800 Liver AKR1CL1
- chr10 5215201-5236800 Liver ALB
- chr4 74260001-74281200 Liver CLDN14
- chr21 37842201-37864600 Liver
- AC004862.6 chr7 79991801-80014200 Liver HRG chr3: 186375001-186396800 Liver .
- chr6 136139401-136161600 Liver SAA4 chr11: 18247401-18269200 Liver SERPINC1 chr1: 173876001-173897600 Liver .
- chr2 21308801-21331000 Liver ADH1A chr4: 100201201-100223200 Liver .
- chr16 28526401-28548400 Liver CFHR5 chr1: 196934001-196957000 Liver .
- chr13 74729801-74752200 Liver .
- chr1 119912601-119935400 Liver LINC01485
- chr5 173206601-173228000 Liver
- chr6 8896801-8919400 Liver
- chr4 155495201-155517000 Liver ITIH3
- chr3 52818201-52839800 Liver
- chr9 4604001-4626000 Liver .
- chr5 133740201-133762400 Liver EVA1A chr2: 75734201-75756200 Liver DQ590166; HSD17B6 chr12: 57148201-57170200 Liver RDH16 chr12: 57341001-57362600 Liver CRP chr1: 159670801-159693400 Liver FAM214A chr15: 52988401-53010800 Liver SERPIND1 chr22: 21117801-21139600 Liver . chr10: 77781001-77803200 Liver RP11-328K4.1 chr4: 104335601-104357400 Liver .
- chr16 76552801-76575000 Liver GAS2 chr11: 22687201-22710200 Liver .
- chr1 161182001-161202600 Liver CFHR3; CFHR1 chr1: 196733201-196756000 Liver SMUG1P1 chr18: 47166801-47189200 Liver CP
- chr3 148926601-148947800 Liver GYS2 chr12: 21742801-21765400 Liver
- CPN2 chr3 194061201-194082800 Liver ABCG5; ABCG8 chr2: 44058201-44080000 Liver RNA5SP288 chr9: 97895001-97916600 Liver .
- chr8 85387401-85410200 Liver ATP11B chr3: 182602601-182624400 Liver LEAP2 chr5: 132200401-132222200 Liver HABP2 chr10: 115301601-115323400 Liver F13B chr1: 197023001-197045200 Liver LOC157273 chr8: 9173001-9195800 Liver MIR122 chr18: 56100601-56122600 Liver . chr4: 155522601-155543600 Liver . chr10: 72073201-72093800 Liver .
- chr12 92884601-92906800 Liver SLCO1B3 chr12: 20953001-20975200 Liver .
- chr5 56936401-56957600 Liver MIR122 chr18: 56104601-56125200 Liver SDS
- chr12 113830001-113851400 Liver .
- chr3 125326001-125348200 Liver F13B chr1: 197025201-197047400 Liver .
- chr6 23572401-23595000 Liver UGT2B15
- chr4 69523801-69546000 Liver .
- chr10 93857601-93880200 Vasculary .
- chr12 105852801-105875400 Vasculary AC073130.3 chr7: 115956001-115977800 Vasculary PTPRE chr10: 129774601-129797600 Vasculary .
- chr12 96820801-96843600 Vasculary CTD-2337A12.1
- chr5 95910001-95932000 Vasculary ESM1 chr5: 54269201-54289800 Vasculary VEPH1 chr3: 157240001-157261000 Vasculary RP11-322E11.5 chr18: 33034401-33055800 Vasculary .
- chr2 216702601-216724000 Vasculary LOC729987 chr1: 98666201-98687400 Vasculary .
- chr6 12284601-12306200 Vasculary .
- chr1 98542601-98565600 Vasculary .
- chr4 85779401-85801400 Vasculary RP3-390M24.1
- chr6 79303001-79325000 Vasculary RP11-679B17.2
- chr6 11598201-11619800 Vasculary RP11-115J23.1
- chr11 28714401-28736800
- Vasculary ESM1 chr5 54269801-54290400
- Vasculary VEPH1 chr3 157237601-157260000 Vasculary .
- chr2 56133801-56155600 Vasculary CTD-2337A12.1
- chr5 95912001-95933800 Vasculary TNFSF18
- chr1 173008201-1730600 Vasculary RP3-390M24.1
- chr6 79305001-79325600 Vasculary .
- chr5 28145601-28167600 Vasculary PCAT19 chr19: 41974401-41995000
- Vasculary MIR216A chr2 56204201-56227000 Vasculary LOC100507254
- chr6 132445801-132466600 Vasculary .
- chr8 78257401-78280400 T-Helper-Cells .
- chr2 228661001-228683200 T-Helper-Cells .
- chr21 26929001-26949600 T-Helper-Cells .
- chr17 80269601-80291000 T-Helper-Cells BC045668;
- IL21 chr4 123529201-123551000 T-Helper-Cells .
- chr1 206746601-206768000 T-Helper-Cells .
- chr7 130631401-130652200 T-Helper-Cells .
- chr6 161493201-161514800 T-Helper-Cells .
- chr6 16427801-16449400 T-Helper-Cells .
- chr10 6090801-6111800 T-Cells ARHGAP15 chr2: 143995201-144017000 T-Cells CCL20 chr2: 228666201-228688600 T-Cells .
- chr21 26931801-26953600 T-Cells CTLA4 chr2: 204722601-204743400 T-Cells .
- chr2 204707801-204730400 T-Cells MAP3K4 chr6: 161491601-161513200 T-Cells ICOS chr2: 204788401-204810600 T-Cells RNU6-933P chr11: 60744201-60765800 T-Cells AX747844 chr12: 47593001-47614600 T-Cells TNIP3 chr4: 122135801-122158200 T-Cells . chr12: 94157201-94179600 T-Cells . chr15: 60853601-60876400 T-Cells . chr8: 128992001-129013600 T-Cells .
- chr9 123672801-123693800 T-Cells .
- chr9 123675001-123695800 T-Cells .
- chr8 134065601-134087400 T-Cells TLDC1
- chr16 84575801-84597200 T-Cells .
- chr2 228661001-228683200 T-Cells .
- chr21 26929001-26949600 T-Cells .
- chr3 16333201-16354200 T-Cells CDC14A chr1: 100875801-100898600 T-Cells .
- chr4 90199801-90222200 T-Cells CYTIP chr2: 158293001-158314600 T-Cells . chr2: 204796001-204817800 T-Cells . chr22: 37605601-37626400 T-Cells . chr17: 80269601-80291000 T-Cells . chr21: 26933601-26955200 T-Cells RP11-291B21.2 chr12: 10695601-10716800 T-Cells CTLA4 chr2: 204724801-204745600 T-Cells .
- chr15 60863601-60886200 T-Cells
- RTKN2 chr10 63983001-64005400 T-Cells RP11-799D4.2
- chr17 33505001-33527400 T-Cells
- ETS1 chr11 128324001-128346600 T-Cells .
- chr1 160518401-160540200 T-Cells BC045668;
- IL21 chr4 123529201-123551000 T-Cells RP11-61O1.1
- chr14 98655401-98678000 T-Cells .
- chr6 128289201-128312200 T-Cells .
- chr4 143305601-143328400 T-Cells .
- chr12 9938801-9960200 T-Cells .
- chr14 61791401-61814000 T-Cells .
- chr1 206746601-206768000 T-Cells .
- chr2 204566201-204588800 T-Cells .
- chr1 90062201-90084400 T-Cells .
- chr7 130631401-130652200 T-Cells .
- chr22 40294801-40317200 T-Cells .
- chr6 161493201-161514800 T-Cells .
- chr16 27409001-27431000 T-Cells .
- chr1 117292801-117314600 T-Cells .
- chr6 16427801-16449400 T-Cells .
- chr6 33302001-33323400 T-Cells .
- chr4 143308401-143331200 T-Cells .
- chr22 47061801-47083200 T-Cells CTLA4 chr2: 204723801-204744400 T-Cells .
- chr4 143287401-143310000 T-Cells .
- chr8 128993601-129014600 T-Cells .
- chr21 36404601-36426800 T-Cells .
- chr14 103262801-103285200 T-Cells .
- chr3 108542801-108564600 T-Cells .
- chr15 60982201-61004600 T-Cells .
- chr2 228663201-228685200 T-Cells BC062769
- chr2 197115801-197137400 T-Cells .
- chr1 90066401-90088800 T-Cells USP44 chr12: 95934801-95957200 T-Cells .
- chr3 59982801-60005400 T-Cells FANK1 chr10: 127674201-127696600 T-Cells .
- chr2 106344201-106367000 T-Cells ANK3 chr10: 62475401-62498200 T-Cells .
- chr19 9963401-9985000 T-Cells .
- chr8 121733601-121754600 T-Cells .
- chr17 38758401-38780400 T-Cells .
- chr18 9079201-9101400 T-Cells .
- chr2 228703401-228725600 T-Cells .
- chr5 156640201-156661200 T-Cells .
- chr14 98640001-98662400 T-Cells .
- chr3 42669401-42691200 T-Cells .
- chr8 129539401-129562400 T-Cells .
- chr7 50411401-50433000 T-Cells .
- chr15 60856401-60879200 T-Cells .
- chr2 181994801-182017400 T-Cells TRABD2A chr2: 85056801-85079600 T-Cells RN7SL328P chr9: 134593001-134614600 T-Cells TIGIT chr3: 114003001-114024400 T-Cells .
- chr16 27407801-27428600 T-Cells GIMAP4 chr7: 150250801-150272800 T-Cells .
- chr5 156608401-156630800 T-Cells .
- chr21 26928401-26949000 T-Cells .
- chr2 157174801-157195600 T-Cells
- CLEC2D chr12: 9818401-9840200 T-Cells
- chr1 12560601-12581600 T-Cells
- RP11-61O1.1 chr14: 98660401-98681600 T-Cells .
- chr14 102271201-102293000 T-Cells .
- chr6 154541401-154564200 T-Cells .
- chr1 214803401-214825800 T-Cells .
- chr2 143999001-144021000 T-Cells RP1-281H8.3
- chr6 149806001-149828200
- NK TARP chr7 38304201-38325800 NK
- chr2 8411201-8432600 NK KLRF1
- chr12 9970201-9991600 NK .
- chr17 66206801-66229600 NK CMC1 chr3: 28323201-28344800 NK KLRD1 chr12: 10451601-10472400 NK TRDC chr14: 22910401-22932800 NK AK096766 chr7: 38333401-38355000 NK KLRD1 chr12: 10452401-10473400 NK CMC1 chr3: 28324801-28346400 NK SH2D1B chr1: 162369401-162391200 NK TCRDV2 chr14: 22916401-22938600 NK KLRD1 chr12: 10448001-10470000 Lymphocytes .
- chr6 112176801-112199600 Lymphocytes
- RNU6-933P chr11: 60740601-60763400 Lymphocytes
- chr21 26927401-26948400 Lymphocytes
- HLA-DOB chr6 32774001-32794800 Lymphocytes
- chr10 6087801-6110800 Lymphocytes
- chr5 156641201-156666200 Lymphocytes
- TARP chr7 38304201-38325800 Lymphocytes
- RP11-61O1.1 chr14 98658001-98680400 Lymphocytes .
- chr10 6090801-6111800 Lymphocytes AK128525 chr2: 89147801-89169600 Lymphocytes .
- chr2 8411201-8432600 Lymphocytes KLRF1 chr12: 9970201-9991600 Lymphocytes ARHGAP15 chr2: 143995201-144017000 Lymphocytes .
- chr3 40655601-40676800 Lymphocytes
- chr21 26931801-26953600 Lymphocytes CTLA4 chr2: 204722601-204743400 Lymphocytes .
- chr2 204707801-204730400 Lymphocytes .
- chr17 66206801-66229600 Lymphocytes
- MAP3K4 chr6: 161491601-161513200 Lymphocytes
- ICOS chr2: 204788401-204810600 Lymphocytes
- RNU6-933P chr11: 60744201-60765800 Lymphocytes
- AX747844 chr12 47593001-47614600 Lymphocytes
- TNIP3 chr4 122135801-122158200 Lymphocytes .
- chr12 94157201-94179600 Lymphocytes .
- chr15 60853601-60876400 Lymphocytes .
- chr8 128992001-129013600 Lymphocytes .
- chr9 123672801-123693800 Lymphocytes .
- chr9 123675001-123695800 Lymphocytes KLRD1
- chr12 10451601-10472400 Lymphocytes .
- chr8 134065601-134087400 Lymphocytes TLDC1
- chr16 84575801-84597200 Lymphocytes .
- chr2 228661001-228683200 Lymphocytes FYN chr6: 112103401-112125600 Lymphocytes LILRB1 chr19: 55118401-55140200 Lymphocytes .
- chr21 26929001-26949600 Lymphocytes .
- chr3 16333201-16354200 Lymphocytes CDC14A
- chr1 100875801-100898600 Lymphocytes .
- chr4 90199801-90222200 Lymphocytes CYTIP chr2: 158293001-158314600 Lymphocytes .
- chr2 204796001-204817800 Lymphocytes TRDC chr14: 22910401-22932800 Lymphocytes .
- chr22 37605601-37626400 Lymphocytes .
- chr17 80269601-80291000 Lymphocytes SAMD3
- chr6 130525801-130546600 Lymphocytes .
- chr21 26933601-26955200 Lymphocytes
- AK096766 chr7 38333401-38355000 Lymphocytes RP11-291B21.2
- chr12 10695601-10716800 Lymphocytes CTLA4 chr2: 204724801-204745600 Lymphocytes .
- chr15 60863601-60886200 Lymphocytes
- FCRL3 chr1: 157659001-157680800 Lymphocytes
- RTKN2 chr10: 63983001-64005400 Lymphocytes RP11-799D4.2
- chr17 33505001-33527400 Lymphocytes
- ETS1 chr11 128324001-128346600 Lymphocytes .
- chr1 160518401-160540200 Lymphocytes BC045668;
- IL21 chr4 123529201-123551000 Lymphocytes RP11-61O1.1
- chr14 98655401-98678000 Lymphocytes .
- chr6 128289201-128312200 Lymphocytes .
- chr4 143305601-143328400 Lymphocytes .
- chr12 9938801-9960200 Lymphocytes .
- chr14 61791401-61814000 Lymphocytes .
- chr1 206746601-206768000 Lymphocytes KLRD1
- chr12 10452401-10473400 Lymphocytes .
- chr2 204566201-204588800 Lymphocytes .
- chr1 90062201-90084400 Lymphocytes .
- chr7 130631401-130652200 Lymphocytes .
- chr22 40294801-40317200 Lymphocytes .
- chr6 161493201-161514800 Lymphocytes .
- chr16 27409001-27431000 Lymphocytes .
- chr1 117292801-117314600 Lymphocytes .
- chr6 16427801-16449400 Lymphocytes .
- chr6 33302001-33323400 Lymphocytes .
- chr4 143308401-143331200 Lymphocytes CCL5 chr17: 34196201-34217200 Lymphocytes .
- chr22 47061801-47083200 Lymphocytes
- CTLA4 chr2 204723801-204744400 Lymphocytes .
- chr4 143287401-143310000 Lymphocytes .
- chr8 128993601-129014600 Lymphocytes .
- chr21 36404601-36426800 Lymphocytes .
- chr14 103262801-103285200 Lymphocytes .
- chr3 108542801-108564600 Lymphocytes .
- chr15 60982201-61004600 Lymphocytes .
- chr2 228663201-228685200 Lymphocytes
- CMC1 chr3 28324801-28346400 Lymphocytes
- CYB561A3 chr11: 61112201-61134000 Lymphocytes BC062769 chr2: 197115801-197137400 Lymphocytes .
- chr1 90066401-90088800 Lymphocytes USP44 chr12: 95934801-95957200 Lymphocytes SH2D1B chr1: 162369401-162391200 Lymphocytes .
- chr3 59982801-60005400 Lymphocytes FANK1 chr10: 127674201-127696600 Lymphocytes .
- chr2 106344201-106367000 Lymphocytes ANK3 chr10: 62475401-62498200 Lymphocytes .
- chr19 9963401-9985000 Lymphocytes .
- chr1 25241001-25262600 Lymphocytes .
- chr8 121733601-121754600 Lymphocytes .
- chr17 38758401-38780400 Lymphocytes .
- chr18 9079201-9101400 Lymphocytes .
- chr2 228703401-228725600 Lymphocytes .
- chr5 156640201-156661200 Lymphocytes .
- chr14 98640001-98662400 Lymphocytes .
- chr3 42669401-42691200 Lymphocytes .
- chr8 12953 9401-129562400 Lymphocytes .
- chr17 61995401-62017200 Lymphocytes AC002480.5 chr7: 22689001-22711600 Lymphocytes .
- chr7 50411401-50433000 Lymphocytes
- CCL5 chr17 34192801-34214800 Lymphocytes
- GTF3C1 chr16 27464001-27486000 Lymphocytes
- chr15 60856401-60879200 Lymphocytes .
- chr2 181994801-182017400 Lymphocytes AK128525 chr2: 89149801-89170400 Lymphocytes TRABD2A chr2: 85056801-85079600 Lymphocytes RN7SL328P chr9: 134593001-134614600 Lymphocytes TIGIT chr3: 114003001-114024400 Lymphocytes TCL1A; BX247990 chr14: 96170201-96192800 Lymphocytes TCRDV2 chr14: 22916401-22938600 Lymphocytes .
- chr16 27407801-27428600 Lymphocytes GIMAP4 chr7: 150250801-150272800 Lymphocytes .
- chr5 156608401-156630800 Lymphocytes .
- chr21 26928401-26949000 Lymphocytes .
- chr2 157174801-157195600 Lymphocytes KLRD1
- chr12 10448001-10470000 Lymphocytes
- CLEC2D chr12 9818401-9840200 Lymphocytes .
- chr1 12560601-12581600 Lymphocytes RP11-61O1.1 chr14: 98660401-98681600 Lymphocytes .
- chr14 102271201-102293000 Lymphocytes .
- chr6 154541401-154564200 Lymphocytes .
- chr1 214803401-214825800 Lymphocytes .
- chr2 143999001-144021000 Lymphocytes RP11-428G5.5
- chr12 32029401-32050200 Lymphocytes RP1-281H8.3
- chr6 149806001-149828200 Neutrophils SIGLEC5 chr19: 52120201-52141600 Neutrophils
- MMP9 chr20 44627401-44649000 Neutrophils
- CMTM2 chr16 66611001-66632200 Neutrophils RP1-229K20.5
- chr6 41228801-41249800 Neutrophils
- PGLYRP1 chr19 46515801-46536800 Neutrophils .
- chr3 128940401-128961200 Neutrophils .
- chr16 84800001-84821600 Neutrophils HCAR3
- chr12 123188801-123210200 Neutrophils RP1-229K20.5
- chr6 41227401-41248800 Neutrophils IL1R2 chr2: 102599401-102620000 Neutrophils .
- chr16 1978001-1999200 Neutrophils FFAR2 chr19: 35931001-35951800 Neutrophils APOBEC3A chr22: 39338601-39359200 Neutrophils 8-Mar chr10: 45948801-45969800 Neutrophils FFAR2 chr19: 35930201-35950800 Neutrophils .
- chr1 180091001-180111800 Neutrophils BTNL8 chr5: 180326601-180347800 Neutrophils .
- chr16 84799401-84820000 Neutrophils .
- chr6 112176801-112199600 Leukocytes
- PRKCQ-AS1 chr10 6616201-6637800 Leukocytes
- PRF1 chr10 72351601-72372600 Leukocytes
- RPL7AP64 chr17 7029801-7052600 Leukocytes
- SEMA4A chr1 156114001-156135400 Leukocytes
- RNU6-933P chr11 60740601-60763400 Leukocytes
- chr21 26927401-26948400 Leukocytes HLA-DOB chr6: 32774001-32794800 Leukocytes .
- chr10 6087801-6110800 Leukocytes
- KLRK1 chr12 10531401-10552400 Leukocytes
- NUP214 chr9 134093401-134114400 Leukocytes
- KLRK1 chr12 10529601-10551400 Leukocytes .
- chr5 156641201-156666200 Leukocytes SIGLEC5 chr19: 52120201-52141600 Leukocytes TARP chr7: 38304201-38325800 Leukocytes FASLG chr1: 172618601-172640600 Leukocytes RP4-576H24.4; SIRPB1 chr20: 1557601-1579000 Leukocytes RP11-61O1.1 chr14: 98658001-98680400 Leukocytes .
- chr10 6090801-6111800 Leukocytes SYNE1 chr6: 152495001-152516800 Leukocytes AK128525 chr2: 89147801-89169600 Leukocytes .
- chr1 206729201-206751200 Leukocytes KLRC4-KLRK1
- chr12 10549001-10571400 Leukocytes CD226 chr18: 67613201-67634600 Leukocytes MMP9
- chr20 44627401-44649000 Leukocytes .
- chr8 128978201-128999200 Leukocytes .
- chr2 8411201-8432600 Leukocytes KLRK1 chr12: 10527001-10549600 Leukocytes KLRF1 chr12: 9970201-9991600 Leukocytes CD86 chr3: 121764201-121785400 Leukocytes ARHGAP15 chr2: 143995201-144017000 Leukocytes ARRB2 chr17: 4607201-4628800 Leukocytes . chr8: 126949201-126969800 Leukocytes . chr3: 40655601-40676800 Leukocytes LINC01259 chr4: 38501601-38522800 Leukocytes .
- chr20 24922601-24943800 Leukocytes MORC2 chr22: 31325201-31347800 Leukocytes E02193 chr12: 69735601-69758600 Leukocytes CMTM2 chr16: 66611001-66632200 Leukocytes CD33 chr19: 51718401-51739400 Leukocytes CCL20 chr2: 228666201-228688600 Leukocytes .
- chr21 26931801-26953600 Leukocytes CTLA4 chr2: 204722601-204743400 Leukocytes RP1-229K20.5 chr6: 41228801-41249800 Leukocytes RGS3 chr9: 116269601-116290600 Leukocytes . chr8: 126942201-126967800 Leukocytes . chr6: 90778601-90800000 Leukocytes . chr2: 204707801-204730400 Leukocytes .
- chr20 57724601-57745800 Leukocytes SAMD3 chr6: 130522801-130544800 Leukocytes CLEC2D chr12: 9815001-9836000 Leukocytes .
- chr17 66206801-66229600 Leukocytes CD300E chr17: 72608201-72629400 Leukocytes MAP3K4 chr6: 161491601-161513200 Leukocytes LIM2 chr19: 51880401-51901800 Leukocytes ICOS chr2: 204788401-204810600 Leukocytes NLRC5 chr16: 57064001-57085400 Leukocytes RNU6-933P chr11: 60744201-60765800 Leukocytes .
- chr6 90780001-90801200 Leukocytes SPATA13-AS1 chr13: 24817401-24838800 Leukocytes CMC1 chr3: 28323201-28344800 Leukocytes AX747844 chr12: 47593001-47614600 Leukocytes TNIP3 chr4: 122135801-122158200 Leukocytes . chr12: 94157201-94179600 Leukocytes FPR1 chr19: 52241201-52262600 Leukocytes PGLYRP1 chr19: 46515801-46536800 Leukocytes . chr17: 8846601-8868800 Leukocytes .
- chr15 60853601-60876400 Leukocytes .
- chr8 128992001-129013600 Leukocytes .
- chr9 123672801-123693800 Leukocytes CD300E chr17: 72606001-72628200 Leukocytes RBPJ chr4: 26264601-26286000 Leukocytes .
- chr9 123675001-123695800 Leukocytes CD300LB chr17: 72516401-72537400 Leukocytes LTB4R chr14: 24773201-24794600 Leukocytes KLRD1 chr12: 10451601-10472400 Leukocytes .
- chr3 128940401-128961200 Leukocytes .
- chr20 54993801-55015400 Leukocytes AK291611 chr7: 38364201-38386000 Leukocytes .
- chr8 134065601-134087400 Leukocytes CD300C chr17: 72531201-72552200 Leukocytes TLDC1 chr16: 84575801-84597200 Leukocytes .
- chr2 228661001-228683200 Leukocytes FYN chr6: 112103401-112125600 Leukocytes PADI4 chr1: 17624801-17646200 Leukocytes CD300E chr17: 72609401-72631000 Leukocytes LILRB1 chr19: 55118401-55140200 Leukocytes ARHGEF11 chr1: 156922801-156943800 Leukocytes . chr21: 26929001-26949600 Leukocytes . chr3: 16333201-16354200 Leukocytes .
- chr10 71842201-71863600 Leukocytes PTPN4 chr2: 120677201-120698400 Leukocytes .
- chr16 84800001-84821600 Leukocytes CDC14A chr1: 100875801-100898600 Leukocytes .
- chr4 90199801-90222200 Leukocytes .
- chr17 76902001-76923000 Leukocytes
- CYTIP chr2 158293001-158314600 Leukocytes .
- chr2 204796001-204817800 Leukocytes TRDC chr14: 22910401-22932800 Leukocytes .
- chr22 37605601-37626400 Leukocytes .
- chr17 80269601-80291000 Leukocytes SAMD3 chr6: 130525801-130546600 Leukocytes HCAR3 chr12: 123188801-123210200 Leukocytes .
- chr21 26933601-26955200 Leukocytes .
- chr1 206726201-206747200 Leukocytes .
- chr3 58329201-58350200 Leukocytes AK096766 chr7: 38333401-38355000 Leukocytes .
- chr14 99492001-99513200 Leukocytes .
- chr13 40952601-40973800 Leukocytes RP11-291B21.2
- chr12 10695601-10716800 Leukocytes CTLA4 chr2: 204724801-204745600 Leukocytes RP1-229K20.5
- chr6 41227401-41248800 Leukocytes IL1R2 chr2: 102599401-102620000 Leukocytes
- PRAM1 chr19 8556201-8577200 Leukocytes .
- chr1 150575601-150597000 Leukocytes
- SIRPG chr20 1626801-1647400 Leukocytes .
- chr15 60863601-60886200 Leukocytes .
- chr1 206728001-206749200 Leukocytes .
- chr1 28442001-28463200 Leukocytes
- FCRL3 chr1 157659001-157680800 Leukocytes
- RTKN2 chr10 63983001-64005400 Leukocytes RP11-799D4.2
- chr17 33505001-33527400 Leukocytes
- PIK3R1 chr5 67566601-67587600 Leukocytes
- ETS1 chr11 128324001-128346600 Leukocytes .
- chr19 18494001-18515200 Leukocytes .
- chr16 1978001-1999200 Leukocytes .
- chr1 160518401-160540200 Leukocytes .
- chr2 235388801-235410200 Leukocytes BC045668;
- IL21 chr4 123529201-123551000 Leukocytes RP11-61O1.1 chr14: 98655401-98678000 Leukocytes .
- chr6 128289201-128312200 Leukocytes .
- chr4 143305601-143328400 Leukocytes FFAR2 chr19: 35931001-35951800 Leukocytes AK056689 chr13: 74795201-74816400 Leukocytes .
- chr12 9938801-9960200 Leukocytes .
- chr14 61791401-61814000 Leukocytes FPR2 chr19: 52256801-52278200 Leukocytes .
- chr1 206746601-206768000 Leukocytes APOBEC3A chr22: 39338601-39359200 Leukocytes .
- chr2 69394401-69415600 Leukocytes .
- chr17 75419601-75440800 Leukocytes .
- chr20 57728401-57749600 Leukocytes
- SIRPG chr20 1625401-1646800 Leukocytes .
- chr7 130631401-130652200 Leukocytes .
- chr22 40294801-40317200 Leukocytes .
- chr6 161493201-161514800 Leukocytes .
- chr16 27409001-27431000 Leukocytes .
- chr1 180091001-180111800 Leukocytes .
- chr1 117292801-117314600 Leukocytes .
- chr6 16427801-16449400 Leukocytes .
- chr5 156604801-156626600 Leukocytes .
- chr6 33302001-33323400 Leukocytes .
- chr4 143308401-143331200 Leukocytes BTNL8 chr5: 180326601-180347800 Leukocytes CCL5 chr17: 34196201-34217200 Leukocytes PIK3R1 chr5: 67567801-67588800 Leukocytes PILRB chr7: 99939201-99961000 Leukocytes RP11-326C3.13; chr11: 316201-337800 RP11-326C3.12; IFITM3 Leukocytes 9-Sep chr17: 75443201-75464400 Leukocytes CARS2 chr13: 111307601-111329000 Leukocytes .
- chr7 36746601-36769200 Leukocytes .
- chr1 111754201-111775600 Leukocytes .
- chr22 47061801-47083200 Leukocytes
- PRKCQ-AS1 chr10 6615601-6636200 Leukocytes CTLA4 chr2: 204723801-204744400 Leukocytes .
- chr8 142234601-142255800 Leukocytes .
- chr4 143287401-143310000 Leukocytes .
- chr2 20614001-20635000 Leukocytes .
- chr8 128993601-129014600 Leukocytes .
- chr15 60982201-61004600 Leukocytes .
- chr2 228663201-228685200 Leukocytes
- CMC1 chr3 28324801-28346400 Leukocytes .
- chr17 76402401-76423000 Leukocytes CYB561A3
- chr11 61112201-61134000 Leukocytes
- SIRPG chr20 1627801-1648400 Leukocytes .
- chr13 110373601-110394600 Leukocytes .
- chr16 84799401-84820000 Leukocytes BC062769 chr2: 197115801-197137400 Leukocytes .
- chr1 90066401-90088800 Leukocytes USP44 chr12: 95934801-95957200 Leukocytes SH2D1B chr1: 162369401-162391200 Leukocytes .
- chr3 59982801-60005400 Leukocytes RP11-14I17.3 chr8: 26287801-26309200 Leukocytes DCTN4 chr5: 150103601-150124800 Leukocytes .
- chr17 79113601-79134400 Leukocytes AC018816.3 chr3: 4862201-4883600 Leukocytes FANK1 chr10: 127674201-127696600 Leukocytes C19orf38 chr19: 10949001-10969800 Leukocytes CLEC12A chr12: 10115801-10136600 Leukocytes LDLRAP1 chr1: 25880601-25901400 Leukocytes . chr2: 106344201-106367000 Leukocytes ANK3 chr10: 62475401-62498200 Leukocytes LINC01259 chr4: 38503001-38525400 Leukocytes .
- chr16 3090401-3111200 Leukocytes . chr19: 9963401-9985000 Leukocytes . chr1: 25241001-25262600 Leukocytes . chr8: 121733601-121754600 Leukocytes . chr17: 38758401-38780400 Leukocytes . chr18: 9079201-9101400 Leukocytes . chr2: 228703401-228725600 Leukocytes PAM16 chr16: 4385601-4406600 Leukocytes . chr5: 156640201-156661200 Leukocytes .
- chr14 98640001-98662400 Leukocytes FPR2 chr19: 52254001-52275000 Leukocytes .
- chr8 126947801-126968800 Leukocytes .
- chr3 42669401-42691200 Leukocytes .
- chr8 129539401-129562400 Leukocytes SPIDR chr8: 48262801-48285200 Leukocytes
- CLEC12A chr12 10115201-10135800 Leukocytes .
- chr17 61995401-62017200 Leukocytes AC002480.5 chr7: 22689001-22711600 Leukocytes .
- chr12 105066001-105087000 Leukocytes AK057187 chr2: 38044801-38066400 Leukocytes .
- chr7 50411401-50433000 Leukocytes
- CCL5 chr17 34192801-34214800 Leukocytes
- GTF3C1 chr16 27464001-27486000 Leukocytes
- THEMIS chr6 128228601-128249600 Leukocytes NLRC5 chr16: 57063001-57084000 Leukocytes .
- chr7 142614001-142635600 Leukocytes .
- chr19 52121601-52142200 Leukocytes .
- chr15 60856401-60879200 Leukocytes .
- chr5 66478801-66499800 Leukocytes RP11-127L20.3
- chr10 106073401-106094600 Leukocytes .
- chr2 181994801-182017400 Leukocytes AK128525 chr2: 89149801-89170400 Leukocytes TRABD2A chr2: 85056801-85079600 Leukocytes RN7SL328P chr9: 134593001-134614600 Leukocytes
- PRKCQ-AS1 chr10 6617801-6639400 Leukocytes C5AR2 chr19: 47830001-47851000 Leukocytes TIGIT chr3: 114003001-114024400 Leukocytes TCL1A; BX247990 chr14: 96170201-96192800 Leukocytes TCRDV2 chr14: 22916401-22938
- chr1 42194201-42215600 Leukocytes IRF4 chr6: 384001-404600 Leukocytes .
- chr16 27407801-27428600 Leukocytes GIMAP4 chr7: 150250801-150272800 Leukocytes .
- chr5 156608401-156630800 Leukocytes .
- chr17 76903801-76924800 Leukocytes .
- chr16 84621001-84642000 Leukocytes .
- chr8 142120401-142141800 Leukocytes DNAJC19P3 chr19: 42134001-42155800 Leukocytes PGLYRP1 chr19: 46515201-46535800 Leukocytes . chr21: 26928401-26949000 Leukocytes . chr2: 157174801-157195600 Leukocytes KLRD1 chr12: 10448001-10470000 Leukocytes CLEC2D chr12: 9818401-9840200 Leukocytes . chr3: 71818801-71839800 Leukocytes .
- chr1 12560601-12581600 Leukocytes AC002511.3 chr19: 35911201-35933000 Leukocytes . chr5: 118665601-118687000 Leukocytes . chr3: 196407201-196427800 Leukocytes ARRB2 chr17: 4610401-4631400 Leukocytes RP11-61O1.1 chr14: 98660401-98681600 Leukocytes . chr1: 244377801-244398600 Leukocytes . chr14: 102271201-102293000 Leukocytes . chr6: 154541401-154564200 Leukocytes .
- chr1 214803401-214825800 Leukocytes .
- chr2 143999001-144021000 Leukocytes .
- chr6 24917801-24939400 Leukocytes CTSS chr1: 150724601-150745200 Leukocytes RP11-428G5.5
- chr12 32029401-32050200 Leukocytes
- OSCAR chr19 54593601-54614200
- Leukocytes CLEC12A chr12: 10114601-10135200 Leukocytes BC016366 chr4: 185765601-185787000 Leukocytes RP1-281H8.3 chr6: 149806001-149828200 HSC .
- chr2 124658401-124680600 HSC MLLT3 chr9: 20370801-20392400 HSC ATP8B4 chr15: 50397801-50420200 HSC MLLT3 chr9: 20369001-20390400 HSC BC016361 chr4: 75184201-75206000 HSC AP001171.1 chr21: 21618801-21640600 HSC . chr6: 142691401-142714400 HSC .
- chr2 124656201-124678400 HSC GFI1B chr9: 135844201-135865200 HSC RP11-598F7.3 chr12: 207601-230400 HSC RP13-786C16.1 chr11: 33891201-33913400 HSC . chr2: 16648401-16670400 HSC . chr21: 16803001-16825200 HSC HEMGN chr9: 100687801-100710000 HSC . chr6: 142694401-142717400 Kidney CLDN10 chr13: 96074201-96097200 Kidney . chr17: 26939601-26964600 Kidney .
- chr4 22705601-22730600 Kidney SLC22A8 chr11: 62771001-62793000 Kidney . chr10: 17154801-17179000 Kidney SLC13A3 chr20: 45268401-45289600 Kidney . chr2: 209398201-209419000 Kidney . chr11: 128724001-128745800 Skin . chr6: 106932801-106955800 Skin . chr6: 143644001-143666400 Skin . chr2: 239753801-239774600 Skin . chr7: 19131001-19153200 Skin . chr11: 71852001-71874000 Skin AC003986.6 chr7: 19138401-19160400 Skin . chr2: 239754601-239776200 Skin GSDMC chr8: 130787001-130808400
- the sequencing data from a subject is deconvoluted by comparison to the DNA-protein association atlas. In this way the percent contribution of different tissues, cell types and/or cellular states to the total cfnucleosomes in a sample can be determined. In some embodiments, the deconvolution gives the percent contribution of only informative cfnucleosomes.
- the cfDNA and cf-nucleosomes are not analyzed by comparison to healthy tissue data but rather are analyzed by machine learning.
- Machine learning is well known in the art, and by performing the methods of the invention of patients with known conditions the machine learning algorithm can learn to recognize specific disease states and conditions in the cfDNA sequences provided when specific DNA-associated proteins are isolated.
- at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 subjects with a specific condition are analyzed before the algorithm can identify that condition in a new subject.
- the term “cellular state” refers to a condition or cellular response or pathway that is active in a cell.
- the cellular state is the cellular condition that led to and/or caused cell death.
- the cellular state was the state of the cell just before it died but is not directly the cause of that death.
- the cellular state is a pathway that was active or inactive in the cell when it died.
- the cellular state is a cellular response that was active or inactive in the cell when it died.
- the cellular state comprises expression of at least one gene informative of the cause of cell death.
- the cellular state is a disease state.
- the cellular state comprises expression of at least one gene informative of an active pathway in the cell when it died. In some embodiments, at least 1, 2, 3, 4, or 5 genes from a pathway is indicative of an active pathway.
- Signaling pathways are well known in the art and online resources for determining the members of various pathways can be found by examining Gene Ontology or Thermo Fisher Scientific for example.
- determining a cellular state comprises determining a cellular pathway active in the cell. In some embodiments, determining a cellular state comprises determining a transcriptional program active in the cell. In some embodiments, determining a cellular state comprises determining active transcription of at least one gene informative of an active pathway. In some embodiments, at least 1, 2, 3, 4, or 5 genes from a pathway are determined. Each possibility represents a separate embodiment of the invention. In some embodiments, determining a cellular state comprises determining association of the DNA-associated protein to at least 1 genomic region that regulates a gene of the pathway.
- the cellular state is any one of hypoxia, inflammation, ER stress, mitochondrial stress, quiescence, senescence, interferon response, cycling, malignant, and calcium flux. Any cellular state which can be defined by the expression of a gene or set of genes can be investigated by the methods of the invention.
- the methods of the invention further comprise comparing the sequenced cfDNA to at least 1 genomic location with the greatest association of the DNA-associated protein during activation of a cellular pathway, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location indicates activation of the cellular pathway.
- the genomic locations with the greatest unique association are compared.
- the methods of the invention further comprise comparing the sequenced cfDNA to a pathway atlas of at least 2 cellular pathways, wherein the atlas comprises at least 1 genomic location with the greatest association of the DNA-associated protein in each of the 2 cellular pathways, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location indicates activation of that cellular pathway.
- the sequenced cfDNA is compared to at least 1, 2, 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or 100 genomic locations. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequenced cfDNA is compared to at least 10 genomic locations. In some embodiments, the sequenced cfDNA is compared to at least 25 genomic locations. In some embodiments the designating comprises comparing the sequenced DNA to a given number of genomic locations.
- the atlas is of at least 1, 2, 3, 5, 10, 10, 15, 20, 25, 30, 35, 40, 45 or 50 cell types and/or tissues. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas is of at least 1, 2, 3, 5, 10, 10, 15, 20, 25, 30, 35, 40, 45 or 50 cellular pathways. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas comprises at least 1, 2, 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, or 100 genomic locations. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas comprises at least 2 genomic locations. In some embodiments, the atlas comprises at least 10 genomic locations. In some embodiments, the atlas comprises at least 25 genomic locations.
- association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is within a tissue, cell type or cellular state specific gene, enhancer element or is at a disease specific mutation. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is within a tissue, cell type or cellular state specific gene, enhancer element. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is at a disease-specific mutation. In some embodiments, the DNA-associated protein is a marker of active transcription and association of the DNA-associated protein with the disease associated mutation is indicative of the disease state.
- association of the DNA-associated protein with the genomic location is indicative of active transcription and the disease-associated mutation is within an oncogene. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is within a repressor element or a gene silenced in the tissue, cell type or cellular state or is at a disease specific mutation. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is within a repressor element or a gene silenced in the tissue, cell type or cellular state. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is at a disease-specific mutation.
- association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the disease-associated mutation is within a tumor suppressor gene.
- Oncogenes and tumor suppressor genes are well known in the art. Examples of oncogenes include, but are not limited to, WNT, RAS, MYC and ERK. Examples of tumor suppressor genes include, but are not limited to, p53, PTCH, NF1, p27Kip1, and APC.
- cfDNA refers to any DNA obtained from an organism which existed in the organism outside of a cell.
- cfnucleosome refers to cfDNA and any proteins bound and/or associated with the cfDNA.
- cfnucleosomes comprises cfDNA and cfhistones.
- the cfDNA is associated with DNA-associated protein.
- the cfDNA is not naked.
- the cfDNA is in the sample as cfnucleosomes.
- the cfDNA is not crosslinked.
- the methods of the invention further comprise crosslinking the cfDNA and DNA-associated proteins prior to the contacting. In some embodiments, the methods do not comprise crosslinking the cfDNA and DNA-associated proteins prior to the contacting.
- the cfDNA is DNA obtained from an organism and existed in the organism outside of any vesicle.
- Cell-free DNA is well known in the art, and generally refers to DNA that is free floating within a bodily fluid. This DNA is generally not enclosed in a vesicle and thus DNA in transport, such as by exosomes or other vesicular transporters, in not considered cfDNA.
- cfDNA is DNA from a dying and/or dead cell. When a cell dies the DNA is generally fragmented and released from the cell as it lyses. This DNA however, is not all immediately removed or cleaned up and thus persists in the organism. Frequently the DNA from the dead cell enters the bloodstream.
- vesicular chromatin that is lysed is also included in the cfDNA.
- the cfDNA is mammalian cfDNA. In some embodiments, the cfDNA is human cfDNA. In some embodiments, the cfDNA is from a mammalian or human genome. In some embodiments, the cfDNA is fetal DNA. In some embodiments, the DNA is viral DNA. In some embodiments, the DNA is bacterial DNA. In some embodiments, the DNA is fungal DNA. In some embodiments, the DNA parasitic DNA. In some embodiments, the DNA if from a pathogen. In some embodiments, the DNA is from an organism that lives in a healthy subject. In some embodiments, the cfDNA is extracted from bodily fluid.
- the providing comprises providing a bodily fluid and isolating the cfDNA from the bodily fluid. In some embodiments, the providing comprises providing a bodily fluid comprising cfDNA and performing the contacting in the bodily fluid. In some embodiments, the methods of the invention are performed on as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 1.5, 2, 2.5 or 3 ml of bodily fluid. In some embodiments, the methods of the invention are performed on as little as 2 ml of bodily fluid. Each possibility represents a separate embodiment of the invention.
- the methods of the invention are performed with less than 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, or 5 ml of bodily fluid.
- the bodily fluid is blood.
- the cfDNA is fetal cell free DNA (cffDNA).
- the method is for non-invasive fetal monitoring.
- the subject is the mother of the fetus.
- the method is for determining a cellular state of a cell in a fetus.
- the method is for determining a disease in a fetus.
- the method is for determining a genetic abnormality in a fetus.
- the method is for determining the origin of cell death in a fetus.
- cfDNA Since cfDNA has a short half-life in an organism, it provides a snapshot of the cell death occurring in the organism at that moment.
- the methods of the invention detect cell death that has occurred within the last 1 minute, 2 minutes, 3 minutes, 4 minutes, 5 minutes, 10 minutes, 15 minutes, 20 minutes, 25 minutes, 30 minutes, 35 minutes, 40 minutes, 45 minutes, 50 minutes, 55 minutes, 1 hour 2 hours, 3 hours, 6 hours, 12 hours, 18 hours, 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 1 week 2 weeks, 3 weeks or a month from the time the sample was taken from the subject.
- the methods of the invention detect cell death that occurred immediately before taking the sample from the subject.
- the methods of the invention further comprise extracting a sample from the subject before the providing, wherein the sample comprises cfDNA. In some embodiments, the methods of the invention further comprise freezing or keeping at about 4 degrees the sample after it is taken from the subject and before it is contacted. By freezing or keeping the sample cold the methods of the invention can still detect the cell death that occurred just before the sample was taken from the subject.
- At least 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 ng of cfDNA are provided.
- as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 ng of cfDNA are provided.
- at least 50 ng are provided.
- the provided cfDNA is between 0.1 and 1000, 0.1 and 900, 0.1 and 800, 0.1 and 700, 0.1 and 600, 0.1 and 500, 0.1 and 400, 0.1 and 300, 0.1 and 250, 0.1 and 200, 0.1 and 150, 0.1 and 100, 0.1.
- each possibility represents a separate embodiment of the invention.
- at most 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 400, 500, 600, 700, 800, 900 or 1000 ng of cfDNA are provided.
- 1000 genomes are roughly equivalent to 6.6 ng of cfDNA.
- the cfDNA comprises at least 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 5, 10, 50, 100, 200, 300, 500, 700, 800, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10000 genomes.
- Each possibility represents a separate embodiment of the invention.
- the cfDNA comprises as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 5, 10, 50, 100, 200, 300, 500, 700, 800, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10000 genomes.
- Each possibility represents a separate embodiment of the invention.
- the cfDNA comprises between 0.1 and 10000, 0.1 and 9000, 0.1 and 8000, 0.1 and 7000, 0.1 and 6000, 0.1 and 5000, 0.1 and 4000, 0.1 and 3000, 0.1 and 2000, 0.1 and 1000, 1 and 10000, 1 and 9000, 1 and 8000, 1 and 7000, 1 and 6000, 1 and 5000, 1 and 4000, 1 and 3000, 1 and 2000, 1 and 1000, 5 and 10000, 5 and 9000, 5 and 8000, 5 and 7000, 5 and 6000, 5 and 5000, 5 and 4000, 5 and 3000, 5 and 2000, 5 and 1000, 10 and 10000, 10 and 9000, 10 and 8000, 10 and 7000, 10 and 6000, 10 and 5000, 10 and 4000, 10 and 3000, 10 and 2000, 10 and 1000, 100 and 10000, 100 and 9000, 100 and 8000, 100 and 7000, 100 and 6000, 100 and 5000, 100 and 4000, 100 and 3000, 100
- the cfDNA is from the cell type, tissue or cell in the cellular state.
- each possibility represents a separate embodiment of the invention.
- as little as 0.1% of the cfDNA is from the cell type, tissue or cell in the cellular state.
- as little as 1% of the cfDNA is from the cell type, tissue or cell in the cellular state.
- the limit of detection of the method is 0.1% of the cfDNA in the sample being from the cell type, the tissue, or the cellular state.
- the limit of detection of the method is 1% of the cfDNA in the sample being from the cell type, the tissue, or the cellular state. In some embodiments, as little as 0.1% of the cfDNA is from the cell type, the tissue, or the cellular state and at least 45 peaks corresponding to the cell type, tissue or cellular state are detected. In some embodiments, as little as 0.1% of the cfDNA is from the cell type, the tissue, or the cellular state and at least 45-200 peaks corresponding to the cell type, tissue or cellular state are detected.
- analyzing at least 25 peaks provides a limit of detection of 1% of the cfDNA being from the cell type, the tissue, or the cellular state. In some embodiments, analyzing at least 45 peaks provides a limit of detection of 0.1% of the cfDNA being from the cell type, the tissue, or the cellular state. In some embodiments, analyzing a number of peaks comprises detecting cfDNA from at least that number of peaks.
- the cfDNA comprises between 0.001-10, 0.001-5, 0.001-3, 0.001-2, 0.001-1.5, 0.001-1, 0.01-10, 0.01-5, 0.01-3, 0.01-2, 0.01-1.5, 0.01-1, 0.1-10, 0.1-5, 0.1-3, 0.1-2, 0.1-1.5, 0.1-1, 0.5-10, 0.5-5, 0.5-3, 0.5-2, 0.5-1.5 0.5-1, 1-10, 1-5, 1-3, or 1-2% cfDNA from the cell type, the tissue, or the cellular state.
- the cfDNA comprises between 0.1-1% cfDNA from the cell type, the tissue, or the cellular state.
- the cfDNA comprises between 0.1-3% cfDNA from the cell type, the tissue, or the cellular state.
- the sequencing is at a low depth. In some embodiments, the sequencing is at a depth of less than 1 billion, 750 million, 500 million, 400 million, 300 million, 200 million, 100 million, 90 million, 80 million, 70 million, 60 million, 50 million, 40 million, 30 million, 20 million, 10 million, 9 million, 8 million, 7 million, 6 million, 5 million, 4 million, 3 million, 2 million, 1 million, 0.5 million, 0.1 million, 0.05 million, 0.01 million, 0.005 million or 0.001 million reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequencing is at a depth of less than 10 million reads. In some embodiments, the sequencing is at a depth of less than 1 million reads.
- the providing comprises providing a bodily fluid comprising the cfDNA.
- the contacting occurs in the bodily fluid.
- the contacting comprises providing a bodily fluid and isolating the cfDNA from the bodily fluid.
- the bodily fluid is selected from: blood, serum, gastric fluid, intestinal fluid, saliva, bile, tumor fluid, interstitial fluid, breast milk, cerebrospinal fluid, urine, semen, vaginal fluid, and stool.
- the bodily fluid is any bodily fluid that contains cfDNA.
- the bodily fluid is blood.
- the bodily fluid is any one of whole blood, partially lysed whole blood, plasma, or partially processed whole blood.
- the sample of blood can be obtained by standard techniques, such as using a needle and syringe.
- the blood sample is a peripheral blood sample.
- the blood sample can be a fractionated portion of peripheral blood, such as a plasma sample.
- total DNA can be extracted from the sample using standard techniques known to one skilled in the art.
- intact cells are removed before DNA extraction, so that only free-floating DNA is extracted.
- Intact cells can be removed by any method known in the art, such as for non-limiting example by centrifugation or by gradient separation, such as by Ficol gradient separation.
- a non-limiting example for DNA extraction is the FlexiGene DNA kit (QIAGEN). Standard techniques for receiving cell-free DNA extraction are known to a skilled artisan, a non-limiting example of which is the QIAamp Circulating Nucleic Acid kit (QIAGEN).
- the sequencing is next generation sequencing.
- Next generation sequencing also known as high-throughput sequencing or massively parallel sequencing, is any sequencing method that allows for rapid high-throughput sequencing of base pairs from DNA or RNA samples.
- the sequencing is high-throughput sequencing.
- the sequencing is massively parallel sequencing.
- Such sequencing is well known in the art and can include the use of Illumina arrays, pore and nanopore sequencers and ion torrent as non-limiting examples. Sequencing machines such as the Illumina Nextseq 500 machine may be used, and processing may be performed using the Illumina 500/550 V2 kit, for non-limiting example.
- the sequencing is whole genome sequencing. In some embodiments, only a portion of the genome is sequenced. In some embodiments, a chip or array pertaining to only a portion of the genome is used for next generation sequencing.
- the sequencing is methylation sensitive sequencing.
- the methods of the invention further comprise bisulfite conversion before sequencing.
- the methylation status of the DNA can also be discerned, in this way the protein-DNA association data can also be combined with DNA methylation data. This can provide further information about the activity of genes in the cell as it died, which can provide insight into the cell or tissue of origin or the cellular state of the cell.
- the methods of the invention can be used to determine the origin of cfDNA even when the cfDNA from one tissue/cell types is a very small percentage of the whole cfDNA.
- the cfDNA of a tissue and/or cell type comprises as little 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1% 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA.
- Each possibility represents a separate embodiment of the invention.
- the cfDNA of a tissue and/or cell type comprises more than 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1% 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA.
- a tissue and/or cell type comprises more than 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1% 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA.
- the cfDNA of a tissue and/or cell type comprises less than 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1%, 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA.
- a tissue and/or cell type comprises less than 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1%, 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA.
- the cfDNA of a tissue and/or cell type comprises between 0.0001%-10%, 0.001%-10%, 0.01%-10%, 0.1%-10%, 0.5%-10%, 1%-10%, 1.5%-10%, 2%-10%, 0.0001%-9%, 0.001%-9%, 0.01%-9%, 0.1%-9%, 0.5%-9%, 1%-9%, 1.5%-9%, 2%-9%, 0.0001%-8%, 0.001%-8%, 0.01%-8%, 0.1%-8%, 0.5%-8%, 1%-8%, 1.5%-8%, 2%-8%, 0.0001%-7%, 0.001%-7%, 0.01%-7%, 0.1%-7%, 0.5%-7%, 1%-7%, 1.5%-7%, 2%-7%, 0.0001%-6%, 0.001%-6%, 0.01%-6%, 0.1%-6%, 0.5%-6%, 1%-6%, 1.5%-6%, 2%-6%, 0.0001%-5%, 0.001%-5%, 0.0.01%-5%, 0.5%, 0.1%
- the contacting is incubating the reagent in the bodily fluid comprising the cfDNA. In some embodiments, the contacting is incubating the reagent in blood comprising the cfDNA. In some embodiments, the contacting is incubating the reagent and cfDNA in a binding/incubating solution. Buffers for performing ChIP, and specifically incubating buffers are well known in the art. Such buffers may be purchased from companies such as Abcam and Cell Signaling Technology who cell ChIP kits.
- the contacting is performed with constant mixing. In some embodiments, the contacting is done with constant rotation. In some embodiments, the contacting is done at room temperature or at 4 degrees. In some embodiments, the contacting is done on ice. In some embodiments, the contacting is for at least 1, 2, 3, 4, 5, 6, 12, 18, or 24 hours. Each possibility represents a separate embodiment of the invention. In some embodiments, the contacting is for a time sufficient for the reagent to bind to the DNA-associated protein. In some embodiments, the contacting is for a time sufficient for the reagent to bind at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97%, or 99% of the DNA-associated protein provided. Each possibility represents a separate embodiment of the invention.
- the methods of the invention are for use in detecting a disease state or condition in a subject in need thereof and wherein the cfDNA is from the subject. In some embodiments, the methods of the invention are for diagnosing a disease, and/or condition in a subject in need thereof and wherein the cfDNA is from the subject. In some embodiments, the methods of the invention are for diagnosing an increased risk of a disease or condition.
- the disease state or condition is selected from cardiac disease or damage and liver disease or damage.
- the disease state or condition is selected from cardiac disease or damage, liver disease or damage, and cancer.
- the disease state is cancer.
- the disease state is a pre-cancerous state.
- the disease state is cancer or a pre-cancerous state.
- the disease state or condition is selected from cardiac arrest and liver shock.
- the disease state is brain damage.
- the disease state is bacteremia.
- the disease state is an infection.
- the disease state or condition is selected from cancer, neurodegenerative disease, infection, tissue damage, inflammation, autoimmune disease, arthritis, liver inflammation, bowel inflammation, autoimmune disease, bacteremia, tissue damage from drug side effects, tissue necrosis, and diabetes.
- the neurodegenerative disease is Parkinson's disease or Alzheimer's disease.
- the autoimmune disease is lupus or multiple sclerosis.
- the disease is cancer and the methods of the invention determine the cell or tissue of origin of the cancer. It will be well understood by a person of skill in the art, that association of proteins that indicate active transcription and sequences of cfDNA from oncogenes is indicative of a cancerous or pre-cancerous state.
- association of proteins that indicate transcriptional silencing and sequences of cfDNA from tumor suppressors is also indicative of a cancerous or pre-cancerous state.
- activation or repression of enhancer regions for oncogenes and tumor suppressors respectively is also indicative of a cancerous or pre-cancerous state.
- the methods of the invention further comprise performing steps a-d again using a reagent that binds to a second DNA-associated protein, and wherein the second DNA-associated protein is a different protein than the DNA associated protein already used.
- the second DNA-associated protein is different than the first DNA-associated protein.
- the methods of the invention can be repeated at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 times wherein each time a different DNA-associated protein is bound. Each possibility represents a separate embodiment of the invention.
- the methods of the invention comprise contacting the sample with at least 2 reagents, wherein each reagent is bound to a physical support and the support comprises a short DNA tag unique to each reagent, wherein upon sequencing the isolated cfDNA the short DNA tag identifies the reagent that isolated the cfDNA.
- the DNA tag is ligated to a cfDNA molecule before sequencing.
- the support is a bead conjugated to a single reagent and a short DNA tag that is unique to the reagent.
- the reagent is an antibody and the short DNA tag is a DNA barcode.
- This can, for example be paramagnetic beads covalently bound to a ChIP antibody such as H3K4me1 and bound to a barcode for identifying H3K4me1 associated DNA, and other paramagnetic beads covalently bound to a ChIP antibody such as H3K4me3 and bound to a barcode identifying H3K4me3 associated DNA which are added to the sample simultaneously, ligated to cfDNA simultaneously and sequenced simultaneously.
- the method further comprises treating the subject.
- the treatment is for the detected disease.
- the treatment is a suitable treatment based on the cellular state, tissue of origin, cell type of a combination thereof of the cell that died in the subject. It will be understood by a skilled artisan that if for example a cancer is found in a particular organ that therapy can be tailored to that type of cancer. Similarly, in a particular pathway is active in a cancer or disease, then one treatment modality may be more suitable than another.
- RNA EGFR-AS1 that mediates cancer addiction to EGFR and when highly expressed can render tumors insensitive to EGFR inhibition by anti EGFR antibody treatment, would indicate that EGFR inhibitory treatments should be avoided.
- a method of detecting a disease state in a subject comprising:
- a method for improving disease detection in cfDNA from a subject comprising performing chromatin immunoprecipitation on the cfDNA from the subject before performing the disease detection in the immunoprecipitated cfDNA.
- the chromatin immunoprecipitation comprises:
- disease detection comprises sequencing of the cfDNA. In some embodiments, disease detection comprises sequencing of the cfDNA from the subject. In some embodiments, disease detection comprises sequencing of the immunoprecipitated cfDNA. In some embodiments, disease detection and/or sequencing comprises amplification of the cfDNA. In some embodiments, disease detection and/or sequencing does not comprise amplification of the cfDNA. In some embodiments, disease detection in cfDNA from the subject comprises amplification of the cfDNA.
- non-improved disease detection comprises amplification of the cfDNA.
- disease detection in the immunoprecipitated cfDNA does not comprise amplification of the immunoprecipitated cfDNA.
- improved disease detection does not comprise amplification of the immunoprecipitated cfDNA.
- the amplification is PCR amplification.
- the amplification is non-specific amplification.
- the amplification is amplification of a disease-associated sequence.
- disease-associated mutation refers to a DNA mutation that is known to cause or increase the risk of developing the disease.
- Disease-associated mutations are well known, and include for example, deletion of 1522A, 1523T and 1524C (deletion of F508) of CFTR in cystic fibrosis, 1226A to G (N370S) in the GBA locus in Gaucher disease, mutations in SERPINA1 in alpha1-antitrypsin deficiency, mutations in HBB in Beta-thalassemia, and mutations in PSEN1 in Alzheimer's disease.
- Many disease-associated mutations are known in cancer, some are common to many types of cancer and some are specific to specific cancers.
- Mutations in p53, MYC, BREF, BRCA, to name but a few, are well known in the art. Panels of disease associated mutations can also be investigated. In some embodiments, at least 1, 2, 3, 5, 7, 10, 12, 15, 17, or 10 mutations are investigated. Each possibility represents a separate embodiment of the invention. In some embodiments, instead of sequencing, PCR with mutation specific-primers is employed. Any method of detecting DNA mutations may be employed in place of sequencing; however, sequencing provides the advantage of checking multiple mutations simultaneously, including a panel of mutations.
- association of the DNA-associated protein with the disease-associated mutation is indicative of the disease state. It will be well understood by a person of skill in the art, that association of proteins that indicate active transcription mutations in the coding region of a gene would indicate the mutant gene is being transcribed and would be indicative of a cancerous or pre-cancerous state. Similarly, mutations in a regulatory region would be associated with a protein that is indicative of that regulatory region and thus would also be indicative of a cancerous or pre-cancerous state.
- the disease-associated mutation is in a coding region of a gene and association of the DNA-associated protein with DNA is indicative of active transcription.
- the gene is an oncogene or a tumor suppressor gene.
- the disease-associated mutation is in a regulatory region, and association of the DNA-associated protein with DNA is indicative of the regulatory region.
- the method of the invention further comprises performing steps b-e again using a reagent that binds to a second DNA-associated protein, and wherein the second DNA-associated protein is different from the first DNA-associated protein. If multiple mutations are to be investigated and they are located in different genomic regions (gene body and an enhancer for instance), the ChIP can be repeated for a different DNA-associated protein.
- Immunoprecipitation of only a portion of the cfDNA greatly enhances the concentration of informative DNA.
- sequencing can be performed much more cheaply and using fewer reagents.
- this methodology allows detection of mutations associated with specific genomic annotations (active gene, active promoter, active enhancer etc.) without predefining a limited set of genomic locations (such as a set of cancer risk genes) and designing specific reagents to amplify and/or detect those predefined sequences.
- sequencing cost is dramatically reduced.
- sequencing at a given depth provides more reads over the sequences that are informative.
- the improving comprises at least one of: decreasing signal to noise ratio, increasing confidence in positive detection of the disease, decreasing false detection of the disease and accurately detecting the disease with less cfDNA from the subject.
- the increasing is at least a 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500 or 1000% increase.
- the decreasing is at least a 10, 20, 30, 40, 50, 60, 70, 80, 90, 85, 97, 99 or 100% decrease. Each possibility represents a separate embodiment of the invention.
- the less cfDNA from the subject is less than 1000, 900, 800, 700, 600, 500, 400, 300, 200, 100, 90, 80, 70, 60, or 50 ng of cfDNA.
- the same accuracy can be achieved with the less cfDNA as compared to a larger amount of cfDNA.
- the same sequencing coverage at the mutation can be achieved with the less cfDNA as compared to the coverage achieved with a larger amount of cfDNA.
- a computer program product for determining a cell or tissue of origin of cell free DNA comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to:
- a computer program product for determining a cellular state of a cell in a subject as the cell died, comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to:
- a system for determining the cell or tissue of origin of cfDNA comprising:
- a system for determining a cellular state of a cell in a subject as the cell died comprising:
- a computer program product for detecting a disease state in a subject, comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to
- the computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device.
- the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.
- RAM random access memory
- ROM read-only memory
- EPROM or Flash memory erasable programmable read-only memory
- SRAM static random access memory
- CD-ROM compact disc read-only memory
- DVD digital versatile disk
- memory stick a floppy disk
- a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon
- a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
- Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
- the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
- a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
- the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
- the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
- electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
- These computer readable program instructions may be provided to a processor of a general-purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
- These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- Embodiments may comprise a computer program that embodies the functions described and illustrated herein, wherein the computer program is implemented in a computer system that comprises instructions stored in a machine-readable medium and a processor that executes the instructions.
- the embodiments should not be construed as limited to any one set of computer program instructions.
- a skilled programmer would be able to write such a computer program to implement one or more of the disclosed embodiments described herein. Therefore, disclosure of a particular set of program code instructions is not considered necessary for an adequate understanding of how to make and use embodiments.
- testing device for sequencing it is meant a combination of components that allows the sequence of a piece of DNA to be determined.
- the testing device allows for the high-throughput sequencing of DNA.
- the testing device allows for massively parallel sequencing of DNA.
- the components may include any of those described above with respect to the methods for sequencing.
- system or test kit further comprises a display for the output from the processor.
- a solid support comprising a capturing agent and a barcoding reagent.
- the term “capturing agent” refers to a molecule that binds to a protein and can thus capture and retain the protein to the solid support.
- the capturing agent is a small molecule.
- the capturing agent is a protein.
- the capturing protein captures a second protein by protein-protein interaction.
- the capture protein is an antibody or antigen binding fragment thereof.
- the capturing agent can be any molecule that specifically binds chromatin or nucleic acids.
- barcoding reagent refers to any substrate that comprises a unique molecule or moiety that can be used as a barcode for identifying a molecule of interest. Barcodes are well known in the art and any molecule or moiety that is unique enough to identify a molecule of interest can be used as a barcode.
- the barcoding reagent is the barcode itself.
- the barcode is a protein barcode.
- the barcode is a protein tag.
- the barcode is a fluorescent protein.
- the barcode is a nucleic acid barcode.
- the nucleic acid molecule is a short nucleic acid molecule. In some embodiments, it is less than 3, 5, 7, 10, 12, 15, 17, 20 or 25 nucleotides long. Each possibility represents a separate embodiment of the invention.
- the nucleic acid molecule is between 3 and 10, 3 and 15, 3 and 20, 3 and 25, 3 and 30, 3 and 35, 3 and 40, 4 and 45, 3 and 50, 5 and 10, 5 and 15, 5 and 20, 5 and 25, 5 and 30, 5 and 35, 5 and 40, 5 and 45, or 5 and 50 nucleotides long. Each possibility represents a separate embodiment of the invention.
- the barcoding reagent is an enzyme for joining the barcode to a molecule of interest. In some embodiments, the barcoding reagent is a ligase. In some embodiments, the barcoding reagent is the barcode and the solid support further comprises an enzyme for joining the barcode to a molecule of interest.
- the molecule of interest is a protein. In some embodiments, the molecule of interest is a nucleic acid molecule. In some embodiments, the molecule of interest in a DNA or RNA. In some embodiments, the capturing agent captures a protein associated with the molecule of interest. In some embodiments, the capturing agent captures a DNA-associated protein and the molecule of interest is DNA. In some embodiments, the DNA is cfDNA. In some embodiments, the molecule of interest is in complex with the protein captured by the capturing agent.
- the solid support may be any polymer, or inorganic material to which biological macromolecules can be attached. The attachment can be direct or indirect.
- the solid support is made from material used to assemble microfluidic devices.
- the solid support is a bead.
- the bead is an agarose bead.
- the solid support is a magnetic or paramagnetic bead.
- the solid support is an agarose or magnetic or paramagnetic bead.
- the support is conjugated to the capturing agent.
- the support is conjugated to a ChIP antibody.
- the support is conjugated to the barcoding reagent.
- the support is conjugated to capturing agent and the barcoding reagent.
- Conjugation can be performed by any method known in the art including, but not limited to covalent bonding, charge-based bonding, and hydrophobic interactions.
- the conjugation is biotin to avidin conjugation.
- the conjugation is by amine binding technologies.
- the amine binding technology is epoxy.
- the conjugation is by carboxyl group capturing.
- a method for multiplexing an assay on more than one molecule of interest in a single solution comprising:
- multiplexing an assay refers to performing one assay on multiple samples simultaneously. Multiplexing is useful when samples are limited, the assay is costly in terms of time, money, reagents or sample input.
- assays can be performed from start to finish simultaneously so that variation between samples is reduced ( FIG. 5 B ).
- Chromatation-Seq next generation sequencing
- the protein capture for all antibodies used is performed in one tube and at one time. The ligation of the barcode all occurs at once as well and washing and sequencing is all also performed as one. This greatly limits any inter-sample variability in assay performance.
- the assay is any one of ChIP, ChIP-Seq, cfChIP, cfChIP-Seq, protein quantification, and a protein-protein interaction assay. Protein quantification can be achieved by adding a generic DNA adapter/sequence to the protein and then ligating the barcode.
- the methods of the invention further comprise attaching an adapter to the protein of interest.
- the assay is chromatin immunoprecipitation followed by sequencing (ChiP-Seq).
- the molecule of interest is a protein. In some embodiments, the molecule of interest is a nucleic acid molecule. In some embodiments, the molecule of interest is a protein and/or a nucleic acid molecule.
- the identifying by the barcode comprises quantification of the amount and/or number of the molecules of interest. In some embodiments, the amount and/or number of barcodes is equal to the amount and/or number of molecules of interest. In some embodiments, the amount and/or number of barcodes is proportional to the amount and/or number of molecules of interest. In some embodiments, the amount and/or number of barcodes is equal or proportional to the amount and/or number of molecules of interest.
- nm nanometers
- antibody-beads 50 ⁇ g of antibody were conjugated to 5 mg of epoxy M270 Dynabeads (Invitrogen) according to manufacturer instructions.
- the antibody-beads complexes were kept at 4° C. in PBS, 0.02% azide solution.
- conjugated beads 0.2 mg were used per cfChIP sample.
- the antibody-beads complexes were added directly into the plasma (1-2 ml of plasma) and allowed to bind to cf-nucleosomes by rotating overnight at 4° C.
- the beads were magnetized and washed 6 times with blood wash buffer (BWB 50 mM Tris-HCl, 150 mM NaCl, 1% Triton X-100, 0.1% Sodium DeoxyCholate, 2 mM EDTA, lx protease inhibitors cocktail), twice with BWB-500 (same as BWB only with 500 mM NaCl), and three times with 10 mM Tris pH 7.4.
- blood wash buffer BWB 50 mM Tris-HCl, 150 mM NaCl, 1% Triton X-100, 0.1% Sodium DeoxyCholate, 2 mM EDTA, lx protease inhibitors cocktail
- On-beads chromatin barcoding and library amplification was done to overcome the issue of low input material. This procedure supports preparation of cfChIP from as little as 1000 cells. The following steps were all performed on the beads so as to reduce loss of cfDNA that might occur during freeing of the cfDNA and tube transfers. DNA ends were repaired by T4 DNA polymerase and T4 polynucleotide kinase. Following a wash Klenow exo minus was employed to add an adenine base to the repaired end of the DNA. Following another wash DNA adapters were ligated on; in this case Illumina adapters with DNA barcode sequences were used. For the DNA elution and cleanup steps the beads were incubated for 1 hour at 55° C.
- chromatin elution buffer (10 mM Tris pH 8.0, 5 mM EDTA, 300 mM NaCl, 0.6% SDS) supplemented with 50 units of proteinase K (Epicenter), and the DNA was purified by 1.2 ⁇ SPRI cleanup (Ampure xp, agencourt).
- the purified DNA is eluted in 25 ⁇ l EB (10 mM tris pH 8.0) and 23 ⁇ l of the eluted DNA were used for PCR amplification with Kapa hotstart polymerase (16 cycles).
- the amplified DNA was purified by 1.2 ⁇ SPRI cleanup and eluted in 12 ⁇ l EB.
- Kidney samples egg2.wustl.edu/roadmap/data/byFileType/alignments/unconsolidated/H3K4me3/BI.Adult_Kidney.H3K4me3.27.filt.tagAlign.gz and egg2.wustl.edu/roadmap/data/byFileType/alignments/unconsolidated/H3K4me3/BI.Adult_Kidney.H3K4me3.153.filt.tagAlign.gz).
- Kidney samples egg2.wustl.edu/roadmap/data/byFileType/alignments/unconsolidated/H3K4me3/BI.Adult_Kidney.H3K4me3.153.filt.tagAlign.gz.
- RNA-seq data from the TCGA and GTEx projects as analyzed by the Xena project (Toil enables reproducible, open source, big biomedical data analyses, Vivian J, et al., Nat. Biotechnol., 2017, and Toil RNAseq Recompute database, tcga-data.nci.nih.gov).
- tissue specific signatures of a specific modification we examined binned representation of the atlas. For each tissue we defined a signature of unique windows with signal in one of the samples of the target tissue and without coverage in all others (see below).
- the first test is whether a signature is present.
- Rejection of the null hypothesis for a specific signature is an indication that some of the windows in the signature carry the modification in question in a subpopulation of cells contributing to the cf-nucleosome pool.
- the second test is whether a signature is over-represented than expected in healthy baseline subjects.
- a signature is over-represented than expected in healthy baseline subjects.
- we use average signal from 5 healthy samples to define the average number of reads (per million) in each window.
- p-value of the actual number of observed reads in signature windows as the probability of having this number or higher according to the null hypothesis. Rejection of the null hypothesis for a specific signature is an indication that some of the windows in the signature have higher signal than we expected in healthy subjects. The interpretation is that these are abnormal processes active in the cells contributing to the subject's cf-nucleosome pool.
- the resulting catalogue was saved as BED file (TSS.bed).
- Base calling was performed with bcl2fastq (2.18). Paired-end reads were mapped to the human genomes (hg19) using bowtie2 with “no-mixed” and “no-discordant” flags descarding reads with quality 0. BEDPE files (start and end of every fragment) were obtained using BEDtools “bamtobed” with “bedpe” flag, discarding duplicate fragments. BEDPE files were converted to coverage counts over windows in the catalogue using BEDtools “intersect” command and also using BioConductor “GenomicRanges” countOverlaps( ) function. Both methods count for each window the number of sequenced fragments that overlap with the window.
- the estimate at each level used the estimate of the previous level as a prior (using pseudo-counts of 1000 windows in levels 2 and 3, and 500 windows in level 4).
- FIG. 6 A shows the background estimate for a healthy male sample.
- FIG. 6 B shows a healthy female sample, wherein the chrX background is lower than autosomal chromosomes (slightly more than half) and chrY background is a bit lower. Many locations in chrY are orthologous to ones in chrX leading to skewed estimates.
- W g is the set of windows assigned to gene g, C[w,s] the coverage of window w in sample s, and ⁇ circumflex over ( ) ⁇ [w, s] the estimated background rate of window w in sample s.
- C[g,s,] be a real signal if it is larger than two standard deviations from the mean of the background level for the gene.
- the scaling factors were rescaled so that the total normalized signal (below) at the set of reference healthy samples will be a million on average.
- N[w,s] ⁇ [s ]*max( C[w,s] ⁇ B[w,s], 0)
- Roadmap Epigenomics metadata table we defined sets of Roadmap samples that belonged to a tissue or group of tissues. These definitions included some redundancies. For example, the group Lymphocytes included B-Cells, T-Cells, and NK samples, and thus subsumed each of these groups.
- W is a set of windows, it can be the windows associated with a gene or tissue-specific signature as above.
- Over-expression test To test whether the observed signal of a set of genes is higher than expected in healthy samples, we used a reference of healthy subjects to define the expected normalized signal of the gene H [g] as the average of N [g, s] in the reference samples.
- the main difference from the previous test is that we included the contribution of healthy samples after we transform from normalized units to the units of the specific sample.
- the second difference is that we work at the level of genes.
- cfDNA is likely in the form of nucleosomal DNA (cf-nucleosomes) with intact histone modifications. Whether extracting and sequencing DNA from cf-nucleosomes carrying specific histone marks could be used to determine information about the cell of origin of the cfDNA was investigated ( FIG. 1 A ). This is approach is attractive for several reasons. First, ChIP experiments by design sequence only the positive signal reducing the number of reads required for a positive signal and thus lower the costs and work associated with the assay. Second, positive targets are relatively rare; promoter marks, such as H3K4me3, appear in ⁇ 50,000 locations in the genome ( ⁇ 1% of the genome).
- Enhancer marks can appear in many regions ( ⁇ 10% of the genome) but are limited in each cell.
- histone marks are mostly tissue-specific. In particular, most enhancers are tissue-specific and thus provide strong H3K4me1/3 tissue specificity ( FIG. 1 B ).
- histone modifications reflect transcriptional activity and respond to changes in cell state. Thus, they create an opportunity to detect changes in activity in cells as they die.
- the cfChIP signal surrounding ubiquitously-expressed genes shows high correlation with reference ChIP-seq from tissues (NIH Epigenome Roadmap consortium) ( FIG. 1 C ).
- FIG. 1 C Globally, meta-analysis of the cfChIP signal for H3K4me1 and H3K4me3 yields the expected typical distribution for these marks around enhancers and promoters ( FIG. 1 D ).
- Non-histone DNA associated proteins can also be used for cf-ChIP.
- Meta-analysis of the data shows a clear signal at CTCF sites throughout the genome, and similar analysis of cfChIP with anti-H3K4me3 antibody showed depletion of the histone mark at the same sites ( FIG. 1 I ).
- CTCF binding and H3K4 trimethylation are generally mutually exclusive, so this result helps to confirm that the CTCF signal is genuine.
- H3K4me3 The level of H3K4me3 at promoters from tissue samples correlates with the level of transcription and is strongly predictive of gene expression levels.
- cfChIP H3K4me3 correlates with leukocytes RNA-seq at constitutive genes (based on Roadmap Epigenomics Consortium and GTEx Consortium 2015) and this correlation is similar to that of ChIP-seq from leukocytes ( FIG. 2 J ).
- leukocyte expressed genes in agreement with their major contribution to the cfDNA pool in healthy individuals ( FIG. 2 K ).
- tissue-specific signatures in samples from healthy subjects.
- a previous study on cfDNA CpG methylation estimated that ⁇ 55% of cfDNA emanates from leukocytes, and ⁇ 1% from the liver, with minimal or no contribution of cfDNA from the heart and brain.
- we defined tissue-specific signatures in an unbiased manner see Materials and Methods, Table 1).
- liver cfDNA is also clearly and significantly detected in contrast to brain and heart signatures that are absent in blood, as expected. These signals are specific with high statistical confidence (q ⁇ 10-20, see Materials and Methods), and they demonstrate the ability of cfChIP to detect cf-nucleosomes from rare populations of cells.
- heart-specific H3K4me3 peaks were strongly and significantly detected in post-PCI patient samples, but not in samples from healthy individuals or from pre-PCI patients ( FIG. 3 A ).
- the heart signal includes clear peaks in promoters of heart-specific genes.
- TNNT2 and TNNI3 that encode for the heart-specific Troponin T2 and 13 are clearly active and observed only in these samples.
- These two genes encode canonical protein markers for myocardial damage ( FIG. 3 B ). Indeed, we see good correlation between the strength of the cfChIP heart signature, the levels of troponin measured in the blood, and the estimate of heart cfDNA based on heart-specific differentially methylated CpGs ( FIG. 3 C ).
- FIG. 3 D-E For an unbiased view of our ability to describe tissue-of-origin, we evaluated a panel of cell-type specific signatures across cfChIP samples ( FIG. 3 D-E ). This analysis shows that in all samples we can detect signatures of a range of cell types from the blood (e.g., monocytes and neutrophils), and organs (e.g., liver).
- One of our subjects was four months pregnant carrying a male fetus and her sample exhibited a significant placenta signature. Indeed, we were able to detect low but significant Y-chromosome signal in her plasma as well ( FIG. 3 D ).
- AMI patients sampled several hours post PCI show a clear cardiomyocyte signature ( FIG. 3 A, 3 D ).
- AMI patients both before and shortly after PCI we observed a significant increase in hepatocyte signature.
- This signature includes clear signal at liver-specific genes, such as Albumin and complement genes ( FIG. 3 F ). This unanticipated observation is presumably a result of the well-known phenomenon of liver injury in AMI patients secondary to low organ perfusion and liver hypoxia.
- AMI patients also exhibited increased levels of active chromatin from erythroblast-associated genes, including the hemoglobin locus (see below), and the erythropoietin locus (EPO), most likely from hepatocytes due to liver response to hypoxia. Together this data suggests a systemic response to oxygen shortage.
- the liver and erythroblasts signals were likely due to transient damage caused by reduced systemic perfusion associated with AMI. Indeed, a follow up cfChIP-seq on this patient 11 months later appeared normal.
- a second AMI patient we observe a gradual decrease in liver signature within hours following PCI, suggesting a prompt relief of the liver oxygen shortage ( FIG. 3 G ).
- cfChIP cancer tissue of origin. Advanced stage cancer is often accompanied by higher cfDNA content in the plasma, much of it from tumor cells (ctDNA).
- GI gastrointestinal
- FIG. 3 I, 3 E Plasma samples from cancer patients contains signal from tissues that are not observed in healthy subjects. Most evidently, we observe signal originating from gastrointestinal (GI) tissue and from GI smooth muscle, which is in agreement with the primary locations of the tumors. A weaker but significant GI signature was evident even when the primary tumor was removed by surgery and only residual metastatic disease was evident (patients C004 and C005).
- HCC localized hepatocellular carcinoma
- PHx partial hepatectomy
- a major challenge for cfDNA analysis is the inference of gene expression in the tissue of origin.
- the major approach proposed to this problem so far relied on under-representation of specific promoter elements in cfDNA as an indication of gene expression; however, this approach requires extremely deep sequencing and is limited to situations where cfDNA from the tissue of interest comprises a majority population in blood.
- cfChIP can report on non-constitutive gene expression programs that took place in the cells of origin.
- H3K4me3 is closely associated with transcriptional activity and dynamically changes in response to changes in transcriptional programs, raising the exciting possibility that cfChIP might be able to detect more dynamic transcriptional programs beyond the information on tissue of origin.
- H3K4me3 cfChIP signal from patients with AMI or cancer was compared to a collection of hallmark gene expression signatures representing different cellular processes and responses ( FIG. 4 A-C ).
- This analysis uncovered multiple signatures that have higher than expected signal—that is, the amount of cf-nucleosomes captured for the signature is significantly higher than what we see in healthy subjects. For example, we see a strong signature of Heme Metabolism in M002 and C005, patients who suffered from hypoxia and bacteremia, respectively.
- the blood count of C005 indeed shows high red cell distribution width (RDW) and low red blood cell count (RBC) and hemoglobin (HGB) indicating high production of red blood cells due to anemia.
- This signal could be from enhanced cell death of erythrocytes progenitors or closely related cells, or due to nuclear loss in erythroblasts undergoing maturation into erythrocytes.
- this signature is indicative of a specific hematopoietic cellular differentiation process.
- glycolysis or interferon-alpha response reflect processes that can take place in multiple cell types.
- M002 we also see an increased signal of several liver-specific glycolysis genes such as ALDOB and PFKFB1 while in cancer patients the enhanced glycolysis signature does not include signal from these genes.
- Interferon-alpha response that is normally induced due to the presence of pathogens such as viruses and bacteria.
- pathogens such as viruses and bacteria.
- M004 and C005. In the latter this is likely due to severe bacteremia for which he was hospitalized.
- M004 whose samples show high interferon and inflammation signatures, seems to have experienced more severe heart damage as compared to other AMI patients in terms of troponin levels and cfChIP heart markers ( FIG. 3 A, 3 C ). This could be due to induction of IRF3/interferon I response in M004 which was recently shown to promote a fatal response to AMI.
- Plasma Chromatin Allows Dissection of Patient-Specific Molecular Phenotypes
- a hallmark of cancer cells is genetic alterations that lead to dysregulated gene expression programs. Identification of such cancer-specific transcriptional programs can assist diagnosis and treatment choice. For each sample, we tested for genes whose signal was elevated compared to five “reference” healthy samples. As a control, unrelated healthy samples outside the reference set were in high correlation with healthy reference, with few genes (usually less than 50) showing significantly elevated signal ( FIG. 4 E ). In contrast, samples from patients revealed hundreds to thousands of genes with significantly elevated signal ( FIG. 4 E ). Examining these genes for enrichment in annotated gene lists recapitulated some of the results discussed above. For example, genes in C001 were enriched for gene sets of GI tract and Brain consistent with the pathology of this patient.
- RNA CCAT1 colonrectal cancer associated transcript 1
- FIG. 411 Focusing on specific genes that are known to be upregulated in gastric and colorectal cancers we observe a clear increase of the H3K4me3 cfChIP signal in these patients compared to healthy reference ( FIG. 411 ).
- these genes we find the carcinoma markers CEACAM5 and CEACAM6.
- the protein products of these genes are used in an antibody-based assay for clinical cancer diagnosis.
- a second colorectal cancer marker, the long non-coding RNA CCAT1 colonrectal cancer associated transcript 1
- Another example is the long non-coding RNA EGFR-AS1 that mediates cancer addiction to EGFR and when highly expressed can render tumors insensitive to EGFR inhibition.
- cfChIP signal for EGFR is detected in all cancers, EGFR-AS1 is only detected in C002 but not in other patients. This finding, which would not be detected by cfDNA mutation analysis, raises the exciting possibility that cfChIP can be informative for treatment choice beyond genomic mutations.
- Adrenocortical Cancer EBP, FAM64A, C1orf53, EPB41L1, TSPAN12, RP5-940J5.9, CTSA, NR5A1, GSTA3, TNFRSF12A, PPIF, ZNF275, RP11-215A21.2, NANOGP1, MINOS1-NBL1, SERPINA5, RP11-649E7.5, FAM222A, PLBD2, SRRM3, LDLR, AGRP, FAM19A4, LONP1, GRTP1-AS1, DBP, HSPE1, RP11-688G15.3, NPTX2, KLHDC8B, TECRP1, SPINK13, C1orf233, MGARP, RP11-320G24.1, BST2, UBE2C, RP11-40C6.2, FAM131C, FDXR, SCARB1, PRDX2, HMGCR, SLC26A2, NETO2, C19orf48, RP1-241P17.4, RP11-
- RP11-229P13.23 Brain Lower Grade Glioma: RP11-229P13.23, RP5-940J5.9, FERMT1, TAAR3, RP11-215A21.2, RP11-64K12.4, NUDT4P1, RP11-698N11.2, UHRF1, RP4-791C19.1, RP11-565P22.6, AC000089.3, VIPR2, BTBD17, MDFI, NMB, BMP2, RP5-1177M21.1, SHD, DLL3, CX3CR1, MAGED4, NES, BCHE, RPE65, RP11-231C18.1, SOX2OT_exon3, PAX1, RP11-40C6.2, FREM3, GSX1, SOX11, HES6, PCDHB9, NDST4, TREM2, RP1-241P17.4, SOX4, KLRC2, SMOC1, PIGY, PAPD7, RPS2P55, HAPLN1, BCAN, GPR34, MEX
- Colon Adenocarcinoma ITGA2, POU5F1B, GCNT1, RP11-357H14.17, FAM64A, CHMP4C, RP11-229P13.23, SKA3, LAMB3, RP5-940J5.9, LY6G6D, RBBP8NL, HMGB3, SEL1L3, CKMT1A, BCL2L15, CEP55, ATP10B, TRABD2A, TDGF1, FERMT1, CDH17, HTR1D, RRM2, KRT8P45, NEIL3, MAP7, FAM111B, RP11-93K22.13, ILDR1, CDCA7, RP11-215A21.2, MAD2L1, CEMIP, EPCAM, ANKS4B, AC012363.4, PPP1R14D, VWA2, MUC13, KRTCAP3, HKDC1, F2RL1, AP000439.3, AC007099.1, NOX1, AC005255.3,
- Diffuse Large B-Cell Lymphoma CKS1B, DLGAP5, MZB1, C16orf59, PLA2G7, RMI2, PCNA, DNPH1, CCNA2, SKA1, SKA3, WNT10A, PLA2G2D, ZWINT, AC023590.1, EAF2, PARP1, SNRPD2, SNRPD1, MND1, EEF1A1P19, LSM3, CEP55, TYMS, HSPA8P1, FCRLA, PDCD1LG2, KBTBD8, RRM2, RRM1, NEIL3, HLA-DOA, HLA-DRB6, HLA- DQA1, FAM111B, CDCA7, CDCA5, RP11-215A21.2, MAD2L1, MRPL13, MRPL14, MRPL17, MS4A1, CCL18, RASGRP3, RPS21, POU2AF1, RFC4, RP11-815N9.2, FEN1, RP4-706A16.3, IL21R, CXCR
- Esophageal Carcinoma ITGA2, ITGA6, DLGAP5, RP11-357H14.17, FAM64A, SKA3, LAMB3, RP11-397A16.1, BIVM-ERCC5, SIPA1L3, RNU1-28P, RAB27B, AC002543.2, HIST1H1B, CKMT1A, CEP55, FERMT1, RPSAP52, PI3, RRM2, RIPPLY3, MXRA5, NEIL3, PMAIP1, SAMD9, FAM111B, RP11-93K22.13, FOXM1, ILDR1, MUC4, KLK6, MAD2L1, PRDM1, SNORD17, COL1A1, CEMIP, AC012363.4, RP11-421E14.2, CLSPN, WNT7B, ENTPD7, HEPHL1, RSC1A1, POU6F2, CKAP2, HIST1H4C, CTD-2555C10.3, SERINC2, F2RL1, KLF5,
- Glioblastoma Multiforme FAM64A, OTP, MSMP, TUBA1C, TNFRSF12A, RRM2, NEIL3, TAAR3, DMRTA2, FAM111B, CHRNA9, CRISPLD1, CHI3L1, RP11-351I24.3, HIST1H4J, SPINK8, HSPE1-MOB4, RP11-134N1.2, ADORA3, IGFBP2, GAL3ST4, TMEM189-UBE2V1, RP11-698N11.2, UHRF1, WARS2-IT1, KIF4A, F2R, TK1, RP11- 565P22.2, TMSB15A, MEST, AC092675.3, RP11-84A19.3, TP53, CENPI, MDFI, IBSP, NMB, SPC24, PDPN, CX3CR1, NES, MKI67, HAS2, UBE2C, RPE65, SPOCD1, CDKN2C, MCM2,
- Kidney Chromophobe RP11-2N1.2, RP11-215A21.2, KLK15, DNTT, CR1L, AC106869.2, AC005255.3, RP4- 791C19.1, RP11-459C13.1, NCOA7-AS1, PSG9, KRTAP5-3, RP11-40C6.2, RP11-1E11.1.
- Kidney Clear Cell Carcinoma AP000439.3, RP11-283G6.5, FABP7, RP11-2L8.1, RP11-40C6.2, AC114803.3, RP11-10G12.1, LDHAP7, PI4K2B.
- Liver Hepatocellular Carcinoma GPC3, RP11-334L9.1, RRM2, FAM111B, RP11-215A21.2, FTH1P20, AC005255.3, KIF4A, UBD, ALG1L, SPC24, RP11-40C6.2, RP11-10G12.1, PIGY, CTC-518P12.6, RP11-556E13.1, ACSL4, CXCL10.
- Lung Adenocarcinoma IGKV1-12, RP11-229P13.23, RP5-940J5.9, HMGB3, RP11-334L9.1, HS6ST2, RRM2, LRRC15, FAM111B, RP11-93K22.13, EPCAM, RP11-649E7.5, RP11-350J20.12, AC005255.3, CST1, HMGN2P17, UHRF1, EPYC, KPNA7, BPIFA1, KIF4A, AGR2, ARAP1-AS1, MMP1, UBD, CASC9, RP3-407E4.4, MMP11, MMP12, MMP13, ALG1L, KRTAP4-1, ETV4, PAEP, UBE2C, YWHAZP4, DSG2, RP11-40C6.2, SGPP2, RP11-10G12.1, KIAA0101, CTHRC1, LDHAP7, IL22RA2, RP1-241P17.4, MCM10,
- Lung Squamous Cell Carcinoma ARTN, FAM64A, RP11-229P13.23, ADAM12, CCNA2, RP11-397A16.1, RP5- 940J5.9, HMGB3, BMP7, RP13-463N16.6, CEP55, TP63, FERMT1, PTHLH, RRM2, PVRL1, TICRR, NEIL3, LRRC15, FAM111B, CHRNA5, FOXM1, CDCA7, RP11-215A21.2, MAD2L1, PCDH19, KRT6B, WNT7B, POU6F2, NPBWR1, NKX1-2, RP11-649E7.5, RP4-594A5.1, RAD51AP1, CDT1, CST1, IGFBP2, UHRF1, AURKB, ZP3, KRT17, KRT16, AF127577.8, COL11A1, RP11-108M12.3, IGHV1-69, KIF4A, HIST1H3G
- Ovarian Serous Cystadenocarcinoma CKS1B, PHOX2A, KRTAP2-3, SRD5A3, NACC1, FAM64A, CNGB1, RP11- 624M8.1, WNT10A, RP5-1065P14.2, CITED4, CRABP2, TTC30B, KLHL14, LDLRAD1, TUBA1C, RNU1-28P, HMGB3, AC144450.2, LYPD1, HIST1H1B, AC004870.4, CEP55, LGALS17A, PNOC, USP18, TNFRSF12A, SCNN1A, RRM2, RIPPLY3, MXRA5, IGSF9, BASP1P1, NEIL3, MESP2, FAM111B, RP11-93K22.13, FOXM1, CTC-513N18.6, ILDR1, PTH2R, KLK6, KLK7, KLK8, MAD2L1, SNORD17, C19orf33, EPCAM, KLK
- Pancreatic Adenocarcinoma ITGA2, GCNT3, LIF, ADAM12, LAMB3, TIMP1, RP5-940J5.9, RP13-463N16.6, CTSE, STRA6, RRM2, MXRA5, ITGB6, HLA-DRB6, LRRC15, FAM111B, KLK6, SFRP2, COL1A2, COL1A1, CEMIP, C19orf33, CCL17, WNT7A, CGB8, CGB5, NREP, F2RL2, S100A6, RP11-350J20.12, LCN2, AC005255.3, S100A10, CST2, CST1, CST4, PLAU, PLAT, PHLDA2, IGFBP3, VILL, KCNK1, CCL20, CCL22, SULF1, FNDC1, MATN3, EPYC, GABRP, KRT19, COL11A1, LY75, ONECUT3, KPNA7, RP5-907D15.4, KIF4A, TMS
- Pheochromocytoma & Paraganglioma CHST1, PHOX2A, PHOX2B, SEZ6L, BEX1, SLC18A2, SLC18A1, SYT1, SYT2, SYT4, SYT5, CNGB1, UNC5B, VGF, UNC5A, PCSK2, DYNC1I1, ASTN2, PCDHAC2, PCDHAC1, H2AFY2, FAIM2, RP5- 940J5.9, AC005944.2, DCX, FLRT1, RP11-334L9.1, BMP7, RP11-124O11.1, RP11-294J22.6, HAND2, PENK, TMEM35, DLX1, DLX2, STRA6, HS6ST2, FAM163A, HAND2-AS1, SMIM18, EPHA8, GLB1L3, TH, CRYBA2, RIPPLY2, C18orf42, CHRNA5, CHRNA7, CHRNA3, PRPH, GDAP1, IGF2, PCD
- Prostate Adenocarcinoma GCNT1, RP11-170M17.1, PLA2G7, PCAT1, RP11-229P13.23, WI2-85898F10.1, AC144450.2, DLX1, DLX2, RP11-215A21.2, AMACR, MINOS1-NBL1, TMEFF2, AP006748.1, NPY4R, PODXL2, HMGN2P17, ACSM1, UBD, TRPM8, AC000089.3, TMSB15A, TRGC1, XXbac-BPG308K3.5, MMP10, RPL7P16, LUZP2, RP11-1084J3.3, RP11-40C6.2, VSTM2L, RP11-10G12.1, ABCC4, PIGY, RP11-579D7.2, COL10A1, CLDN8, CXCL11, RPS2P55, DNASE2B, GDF15, GOLM1, PCA3, AP001610.9, SIM2, RP11-483P2
- Rectum Adenocarcinoma POU5F1B, GCNT1, RP11-357H14.17, FAM64A, CHMP4C, RP11-229P13.23, SKA3, LAMB3, GPR143, LY6G6D, RBBP8NL, HMGB3, SEL1L3, CKMT1A, BCL2L15, CEP55, ATP10B, TRABD2A, TDGF1, PRSS33, FERMT1, CDH17, RRM2, RIPPLY3, KRT8P45, NEIL3, MAP7, FAM111B, RP11-93K22.13, ILDR1, CDCA7, RP11-215A21.2, MAD2L1, CEMIP, EPCAM, RNF128, ANKS4B, PPP1R14D, VWA2, MUC13, KRTCAP3, CKAP2, HKDC1, F2RL1, RP11-649E7.5, AP000439.3, AC007099.1, NOX
- Testicular Germ Cell Tumor CHST2, ALPP, FAM64A, SYT2, GBX2, AC068134.6, PLA2G2D, SLC7A3, RP5-940J5.9, GLDC, ZFP42, TUBA1C, PQLC3, CTC-260E6.6, PRDM14, MAN1C1, CPXM1, TDGF1, RRM2, AICDA, PMAIP1, TEAD4, VRTN, RP11-215A21.2, PRDM1, MRPL15, NANOGP1, ALPPL2, NANOG, CGB8, CTC-260E6.2, NKX1-2, ZSCAN10, SALL4, HIST1H2AJ, CCR5, CXCR3, KLRG2, TUBB, PHLDA3, BTBD6P1, RP11-148B6.2, CCL26, FOXH1, DACT2, TFAP2C, AURKB, TBPL2, SNHG16, CRYGEP, LIN28A, RPRM, FZD5,
- Thyroid Carcinoma NPC2, LAMB3, CYSLTR2, CITED1, RP11-221N13.3, RP5-940J5.9, PSG1, SIGLEC15, TNFRSF12A, RPSAP52, LRP4, FAM111B, RP11-215A21.2, ZCCHC12, ZCCHC16, CCL17, AC018816.3, RP11- 649E7.5, RP11-474O21.5, RP11-16K12.1, CST2, GABRB2, TGFA, HMGN2P17, MET, RP4-791C19.1, SLC27A6, LPAR5, DCSTAMP, SLC34A2, HMGA2, AC079630.2, AC079630.4, SDC4, RXRG, ETV4, PTCHD4, FN1, SLC25A47P1, ACTBL2, RP11-40C6.2, PRR15, RP11-542B15.1, AE000661.37, RP11-498C9.3, PDLIM4, RP
- Uterine Carcinosarcoma FAM64A, C16orf59, WNT10A, CRABP1, CRABP2, CLUL1, RP5-940J5.9, HMGB3, RP11- 334L9.1, PLAC1, HSPA8P1, RRM2, RIPPLY3, RNASEH2A, NEIL3, FAM111B, RP11-215A21.2, MAD2L1, MINOS1- NBL1, NREP, SLC29A4, CBX2, RCOR2, RAD51AP1, CDT1, RPL13AP20, HMGN2P17, VAX2, UHRF1, AURKB, KIF4A, PAFAH1B3, KIF26B, KIFC1, PARPBP, TK1, RP11-608O21.1, CCNF, KIF18B, ERCC6L, TMSB15A, MEST, SBK1, RHPN1-AS1, KIF14, KIF11, IGF2BP1, IGF2BP3, HMGA2,
- Uterine Corpus Endometrioid Carcinoma RP6-65G23.3, FAM64A, C16orf59, RP11-638I2.6, RP5-940J5.9, RBBP8NL, HMGB3, RP11-334L9.1, AC004870.4, RP11-519G16.5, CEP55, DLX5, DLX6, RPSAP52, RRM2, RIPPLY3, KRT8P45, NEIL3, FAM111B, DLX6-AS1, ILDR1, RP11-215A21.2, MAD2L1, EPCAM, ALPPL2, OCIAD2, MUC16, WNT7A, KRTCAP3, HIST1H4J, SERINC2, FEN1, WFDC2, RP11-649E7.5, FTH1P20, PRSS8, CDT1, RHPN2, RPL13AP20, TMEM132A, KRT8, DACT2, HMGN2P17, UHRF1, AURKB, RP11-
- H3K4me3 Our analysis of the active promoter mark H3K4me3 provided rich information regarding transcriptional programs in tissue of origin. Can we gain information from chromatin marks that are associated with enhancer and gene activity? Mono and di-methylation of H3 lysine 4 (H3K4me1 and H3K4me2, respectively) are found in two types of genomic regions: 1) promoter flanking regions at the boundaries of regions marked with H3K4me3, or 2) poised/active enhancers, where the H3K4me3 is barely detected. ChIP-seq for these marks in tissues shows H3K4me2 peaks in the near vicinity of enhancers, while H3K4me1 peaks are wider around enhancers (on the order of ⁇ 10 kb).
- H3K4me2 and H3K4me1 flank the main H3K4me3 peak ( FIG. 5 A-B ) and correlate with H3K4me3 and RNA level ( FIG. 5 D ). Additionally, in gene-poor regions, such as the IFNB1 locus, we clearly see marks at enhancers, which match experimentally verified enhancers for IFNB1 (Banerjee et al. 2014) ( FIG. 5 C ).
- H3K4me2 Since the H3K4me2 is more condensed and the signal between healthy subjects is highly similar ( FIG. 5 E ) we chose to focus on this mark for enhancer analysis.
- H3K4me2 peaks that have low or no H3K4me3 signal.
- ChromHMM chromatin marks
- FIG. 5 E The differences between healthy subjects and cancer samples illustrate the additional information that can be gained from enhancers over promoter signatures.
- a few examples for cancer-specific enhancers signals include TCF3, CDX1, and CEACAM5 ( FIG. 5 F-H ).
- the transcription factor TCF3 has promoter H3K4me3 signal in every subject tested.
- the enhancer activity marks in the vicinity of the gene are strikingly different, with the cancer samples showing clear H3K4me2 peaks in regions that correspond to putative colon enhancers ( FIG. 5 F ).
- CEACAM5 has H3K4me3 signal at its promoter in the cancer sample. This is accompanied by a signal in GI-specific enhancers. However, in healthy subjects there is a strong H3K4me2 signal in a neighboring enhancer/promoter region. This suggests that some of these enhancers might be involved in repression of CEACAM5 in monocytes and neutrophils.
- H3K36me3 Tri-methylation of H3 lysine 36 (H3K36me3) is found at the body of transcribed genes. Unlike H3K4me3, which marks transcription start sites at both poised and active genes, H3K36me3 requires active transcription elongation to be deposited, and is hence more indicative of gene activity.
- cfChIP of H3K36me3 results in a typical enrichment at gene bodies ( FIG. 5 I ) and the signal is in correlation to leukocyte H3K36me3 and RNA-seq ( FIG. 5 J-K ).
- cf-ChIP-seq can probe the state of various genomic functionalities including promoters, enhancers, and gene bodies, and this information is highly informative on transcriptional activity in cells of origin.
- lactis that can be differentiated by their genomic sequences.
- K4 and K36 MPL barcoded surface with chromatin from a single source and follow the genomic distribution of the immobilized chromatin.
- K4 MPL barcoded surface and K36 MPL barcoded surface should ChIP chromatin skewed towards the 5′ or 3′ end of genes respectively.
- MPL barcoded surfaces which were incubated with different yeast strains, after the IP step to test mixing during library preparation and/or later on in the process. Mixing the outputs of the IPs into a single tube prior to library preparation will reveal mixing that occurs at that stage, for example if K. lactis DNA is sequenced with barcode 1 or barcode 2 we know that undesired mixing occurred during library preparation).
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Genetics & Genomics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Analytical Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Plant Pathology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
Methods of determining the origin of cell free DNA (cfDNA), for detecting death of a cell type or tissue in a subject, for determining a cellular state of a cell as it died, and combinations thereof, are provided. As are computer program products for doing same.
Description
- This application is a continuation of U.S. Ser. No. 16/980,497 filed Sep. 14, 2020, which is a National Phase of PCT Patent Application No. PCT/IL2019/050281 having International filing date of Mar. 13, 2019, which claims the benefit of priority of U.S. Provisional Patent Application Nos. 62/642,158, filed Mar. 13, 2018, and 62/667,528, filed May 6, 2018, the contents of which are all incorporated herein by reference in their entirety.
- The present invention is in the field of cell free DNA-protein complex analysis.
- Upon cell death (via apoptosis or necrosis) short DNA fragments are released to the blood plasma. These are often termed circulating cell-free DNA (cfDNA) or circulating tumor DNA (ctDNA, if originating from tumor cells). The presence of cfDNA has been recognized for decades, with typical length of cfDNA fragments around multiples of ˜166 bp—the length of mononucleosomal DNA (˜146 bp), with some additional linker DNA. The plasma of healthy people contains the equivalent of ˜1000 genomes per ml, with up to 100 times more cfDNA present in many pathologies (e.g. cancer) and some physiological conditions (e.g. following exercise). These fragments are short-lived, with an estimated half-life of less than an hour, making them ideal biomarkers for monitoring physiological and pathological processes in a noninvasive manner. Use of cfDNA as a diagnostic tool has greatly expanded in recent years. For example, next-generation sequencing of fetal cfDNA in maternal blood is now used for non-invasive prenatal screening/diagnosis of chromosomal abnormalities and parental derived mutations. Because of the very low amount of cfDNA present in a blood sample, most current cfDNA diagnostic methods rely on mutations with the cfDNA to distinguish it from cfDNA from healthy tissues and blood cells. Indeed, white blood cell cfDNA is by far the greatest contributor to the total cfDNA pool and can make cfDNA diagnosis of conditions from other tissues difficult.
- Most current cfDNA-based methods rely on detecting genomic alterations in cfDNA to quantify the contribution of cfDNA from cells with altered genomic sequence, such as fetus, a transplant, or mutated genes in tumors. Thus, these methods are biased towards a set of pre-selected genes and are blind to events that involve turnover and death of cells whose genome is identical to the host genome. More recent approaches leverage epigenetic information in cell free DNA. Extremely deep sequencing of total cfDNA can provide data that reflect tissue of origin and gene expression. However, it relies on detecting changes in coverage over target regions, with a signal of source tissue imposed on the background of normal cells (e.g., detection of an event causing nucleosome depletion in 10% of the cells requires 90% occupancy to be distinguished from 100% occupancy). Thus, such methods avoid sampling noise by using extremely deep sequencing coverage (100s of million reads per sample). Even with such sequencing depth, there is a prohibitive harsh detection limit for events in rare subsets of cells. A promising alternative is assaying DNA CpG methylation along the sequence to identify cell of origin. DNA methylation serves as a stable epigenetic memory and is mostly unchanged after differentiation. As such, it is highly informative regarding cell lineage, but much less about transient changes in expression, and cells originating from a close or same lineage. Moreover, unbiased analysis of DNA methylation requires high sequencing depth since most CpGs are methylated.
- A method that allows for accurately determining the origin of cfDNA, as well as providing information on the molecular events that occurred in the cell close to the time of cell death would allow not just for early diagnosis of conditions that were unknown to the doctor or patient, but also may help in tailoring a treatment to newly discovered malady.
- The present invention provides methods of determining the origin of cell free DNA (cfDNA), for detecting death of a cell type or tissue, for determining a cellular state of a cell in a subject, and combinations thereof, by sequencing cfDNA isolated by extracting proteins and modified proteins bound to that cfDNA. Computer program products for doing same are also provided.
- According to a first aspect, there is provided a method of determining a cellular state, tissue of origin, cell type or a combination thereof of a cell that released its DNA, comprising:
-
- a. providing a sample, wherein the sample comprises cell free DNA (cfDNA);
- b. contacting the sample with at least one reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA;
- d. sequencing the isolated cfDNA; and
- e. designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating from a cell in a cellular state, originating from a tissue, originative from a cell type or a combination thereof, wherein association of the DNA-associated protein with the informative genomic location is indicative of the cellular state, tissue of origin, cell type or combination thereof in the cell that released the cfDNA;
- thereby determining a cellular state, tissue of origin cell type or combination thereof of a cell that released its DNA.
- According to another aspect, there is provided a computer program product for determining a cell or tissue of origin of cell free DNA (cfDNA), comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to
-
- a. measure or access sequencing of cfDNA isolated with a reagent that binds a DNA-associated protein;
- b. assign a cfDNA molecule from the cfDNA to a cell or tissue of origin by comparing a DNA sequence of the molecule to sequences associated with the DNA-associated protein in the cell type or tissue; and
- c. provide an output regarding the cell or tissue of origin of cfDNA.
- According to another aspect, there is provided a computer program product for determining a cellular state, tissue of origin, cell type or a combination thereof of a cell in a subject as the cell died, comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to
-
- a. measure or access sequencing of cfDNA from the subject isolated with a reagent that binds a DNA-associated protein;
- b. assign a cfDNA molecule from the cfDNA to a cellular state, tissue of origin, cell type or combination thereof by comparing a DNA sequence of the molecule to sequences associated with the DNA-associated protein in the cellular state, tissue, cell type or combination thereof; and
- c. provide an output regarding the cellular state, tissue of origin, cell type or combination thereof of a cell in the subject as the cell died.
- According to another aspect, there is provided a solid support, comprising a capturing agent and a barcoding reagent.
- According to another aspect, there is provided a method for multiplexing an assay on more than one molecule of interest in a single solution, the method comprising:
-
- a. capturing within the solution a first molecule of interest to a first solid support of the invention;
- b. capturing within the solution at least a second molecule of interest to a second solid support of the invention;
- c. attaching the first molecule of interest and a first barcode and at least the second molecule of interest and a second barcode;
- d. simultaneously performing the assay on the first and second molecules of interest, wherein the result of the assay on the first molecule of interest is identified by the first barcode and the result of the assay on the second molecule of interest is identified by the second barcode;
- thereby multiplexing an assay on more than one molecule of interest in a single solution.
- According to some embodiments, the sample is from a subject.
- According to some embodiments, the cell that released its DNA is a dead cell and the method is for detecting death of at least one of:
-
- a. a cell type in a subject,
- b. a tissue in a subject, and
- c. a cell in a cellular state in a subject.
- According to some embodiments, the cellular state is a disease state. According to some embodiments, the disease state is selected from bacteremia, cancer, pre-cancer, infection, neurodegenerative disease, tissue damage, cardiac disease, liver disease, inflammation, autoimmune disease, arthritis, liver inflammation, bowel inflammation, autoimmune disease, tissue damage from drug side effects, tissue necrosis, and diabetes. According to some embodiments, the disease state is selected from cardiac disease or damage, brain disease or damage, gastrointestinal disease or damage, cancer, bacteremia, infection and liver disease or damage.
- According to some embodiments, at least 500 genomes of cfDNA are provided. According to some embodiments, the designating can be performed with as little as 0.1% of the cfDNA in the sample being from the cell type, the tissue, or the cellular state.
- According to some embodiments, the reagent is selected from an antibody or antigen binding fragment thereof, a protein, or a small molecule.
- According to some embodiments, the reagent is conjugated to a physical support.
- According to some embodiments, the DNA-associated protein is selected from a histone, a high-mobility group (HMG) protein and a member of the transcriptional machinery. According to some embodiments, the histone is a histone variant and/or a modified histone. According to some embodiments, the histone variant is selected from
Histone 3 monomethylated lysine 4 (H3K4me1),Histone 3 demethylated lysine 4 (H3K4me2),Histone 3 trimethylated lysine 36 (H3K36me3) andHistone 3 trimethylated lysine 4 (H3K4me3). According to some embodiments, the reagent is an anti-modified histone antibody or fragment thereof. - According to some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is within a tissue, cell type or cellular state specific gene or enhancer element or is at a disease-specific mutation. According to some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is within a repressor element, or a gene silenced in the tissue, cell-type or cellular state, or is at a disease-specific mutation.
- According to some embodiments, the method of the invention further comprises performing steps a-d again using a reagent that binds to a second DNA-associated protein, and wherein the second DNA associated protein is different from the first DNA-associated protein.
- According to some embodiments, the method of the invention comprises contacting the sample with at least 2 reagents, wherein each regent is bound to a physical support and the support comprises a short DNA tag unique to each reagent, wherein upon sequencing the isolated cfDNA the short DNA tag identifies the reagent that isolated the cfDNA.
- According to some embodiments, the designating comprises comparing the sequenced cfDNA to at least 10 genomic locations with the greatest unique association of the DNA-associated protein in a tissue, cell type or cellular state, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 10 genomic locations is considered to be from the tissue, cell type or cellular state.
- According to some embodiments, the DNA-associated protein is a marker of active transcription and the designating comprises comparing the sequenced cfDNA to a known transcriptional program of a tissue, cell type or cellular state, wherein a cfDNA with a sequence that is from a gene transcribed in the transcriptional program is from the tissue, cell type or cellular state.
- According to some embodiments, the designating comprises comparing the sequenced cfDNA to a DNA-protein association atlas of at least 5 cell types or tissues, wherein the atlas comprises at least 10 genomic location with the greatest unique association of the DNA-associated protein in each of the 5 cell types or tissues, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 10 genomic locations is considered to be from the tissue or cell type.
- According to some embodiments, the designating comprises comparing the sequenced cfDNA to a transcriptional program atlas of at least 5 transcriptional programs, wherein the atlas comprises at least one genomic location with the greatest unique association of the DNA-associated protein in each of the 5 transcriptional programs and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least one genomic location indicates activation of the transcriptional program.
- According to some embodiments, the cellular state is selected from: hypoxia, inflammation, ER stress, mitochondrial stress, interferon response, quiescence, senescence, cycling, malignant, and calcium flux.
- According to some embodiments, the informative genomic location is selected from a promoter, an enhancer element, a silencer element, a gene body and a disease-associated mutation.
- According to some embodiments, the method of the invention is wherein:
-
- a. the DNA-associated protein is a marker of active transcription and the disease associated mutation is within an oncogene or
- b. the DNA-associated protein is a marker of silenced transcription and the disease associated mutation is within a tumor suppressor gene.
- According to some embodiments, the method of the invention is for use in detecting a disease state in the subject.
- According to some embodiments, the method of the invention is wherein the detecting a disease state comprises at least one of:
-
- a. early detection of the disease state;
- b. detection of residual metastatic disease; and
- c. monitoring of disease progression with or without treatment.
- According to some embodiments, the method of the invention further comprises treating the subject with a suitable treatment based on the cellular state, tissue of origin, cell type or a combination thereof of the cell that died in the subject.
- According to some embodiments, the solid support is a magnetic or paramagnetic bead, or an agarose bead.
- According to some embodiments, the capturing agent is a protein. According to some embodiments, the capturing protein is an antibody, or antigen binding fragment thereof.
- According to some embodiments, the barcoding reagent is a short nucleic acid molecule. According to some embodiments, the nucleic acid molecule is between 5 and 30 nucleotides.
- According to some embodiments, the capturing agent and barcoding reagent are conjugated to the solid support.
- According to some embodiments, the molecule of interest is a protein or a nucleic acid molecule.
- According to some embodiments, the assay is chromatin immunoprecipitation followed by sequencing (ChIP-Seq).
- Further embodiments and the full scope of applicability of the present invention will become apparent from the detailed description given hereinafter. However, it should be understood that the detailed description and specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
- Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description together with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.
-
FIGS. 1A-I : (1A) Outline of the proposed method. Chromatin fragments from different cells in the body are released to the blood. These are immunoprecipitated, and sequenced. Interpretation of the resulting sequences informs of tissue-of-origin and gene activity program. Inset-cfChIP protocol, using antibody covalently bound to paramagnetic beads. Target fragments are immunoprecipitated directly from plasma. After removing the plasma, and washing the beads bound to target fragments, on-bead-ligation is used to add sequencing adapters (possibly with indexing barcodes) to the fragments and isolation of ligated DNA, and PCR amplification a sequencing-ready library is ready. (1B) Heatmaps of reads for cell type-specific H3K4me1 and H3K4me3 sites as compiled from Roadmap Epigenomics data. Shown are the sites that are specific to a single tissue/cell-type and/or related group of cells. (1C) Aligned segment ofchromosome 2, showing a cfChIP-seq signal. Top tracks are cfChIP-seq signals from four subjects identified as healthy. The lower tracks are published ChIP-seq results from human white blood cells (leukocytes) and tissues. Below is a 100-fold zoom in showing agreement in location of peaks. (1D) Histograms of meta-analysis of cfChIP signals over active promoters and enhancers. (1E) Histogram of the distribution of sizes of sequenced cfChIP fragments shows clear mono- and di-nucleosome sizes. (1F) Browser view of cfChIP-seq signal over two regions with megakaryocyte-specific genes that appear in healthy subject but not in ChIP of blood cells and solid tissues. (1G) Browser view of non-PBMC H3K4me3 signal at promoters of selected genes (similar toFIG. 1F ). Upper and lower panels depict cfChIP and tissue ChIP signal, respectively. (1H) Browser view of mouse CTCF signal at known CTCF sites. The sites are confirmed by the depletion of H3K4me3 signal. (1I) Meta-analysis of mouse CTCF (upper) and H3K4me3 (lower) signals throughout the mouse genome. -
FIGS. 2A-M : (2A-2B) Scatter plots comparing cfChIP with anti-H3K4me1 and anti-H3K4me3 antibodies of (2A) technical replicates and (2B) 1 male and 1 female healthy individual. Each point is a 2 kb window in the genome, the x and y axises are the number of reads (in log(x+1) scale) mapped to the window in the two samples. Color code reflects density of points. (2C) Histograms of the correlations between H3K4me3 cfChIP samples of healthy subjects. (left) Correlations of counts in 2 kb windows (asFIG. 2A-B ) and (right) Correlation in counts at gene promoters. We see that samples from the same subject (red histograms) tend to be slightly more correlated to each other than samples of different subjects (blue histograms). (2D) Browser examples of gender-specific peaks. Male and female plasma samples were mixed in known proportions, and cfChIP of H3K4me3 performed. (2E) Bar chart of detection of male-specific chrY signature in samples shown in 2D. FDR adjusted q-values for background signal are shown. (2F) Chart showing H3K4me3 male signal is linear with the fraction. Compared are read counts in simulations based on the 100% male sample and Poisson samples vs observed numbers. (2G) Line graph estimation of the probability of detecting a specific location with different number of reads. The probability of detection was estimated by down-sampling from the actual result. Bars represent 95% confidence interval for the estimate. (2H) Line graph extrapolation of spike-in for larger signatures size. Shown are the probability of detecting 0.1% male for two sample sizes. (2I) Bar chart of the size of tissue highly specific signatures (see Table 1). (2J) Scatter plots of the correlations between H3K4me3 at promoters of constitutively expressed genes and RNA levels. (top) ChIP-seq of PBMC (leukocytes) vs RNA-seq of PBMC. (bottom) cfChIP of healthy subjects vs RNA-seq of PBMC. (2K) Scatter plot comparison of H3K4me3 cfChIP-seq and expression levels. Each dot is a gene. x-axis: number of H3K4me3 reads (after normalization; Methods) in the gene promoter. y-axis: Leukocytes RNA-seq counts of the gene. (2L) Dot plot of tissue specific signatures detected in cfChIP of healthy subjects. Shown are the signature counts for cells whose cfDNA is expected to be represented in cfDNA: neutrophils, 35% cfDNA; monocytes, 25% cfDNA; and hepatocytes, 1% and for negative control (heart). The points in each column are the counts for specific individuals. (2M) A dot plot showing the significance of the signatures fromFIG. 2L . -
FIGS. 3A-J : (3A) A bar chart of H3K4me3 cfChIP-seq signal in heart-specific windows in four healthy subjects and samples from myocardial infarction (MI) patients. Inset, measured troponin levels at the time the blood samples were drawn. (3B) Examples of browser views of signals at heart-specific windows. Each browser section displays 20 kb region around windows (marked with gray background). Tracks are all normalized and shown to the same scale. Top tracks show cfChIP samples and bottom ChIP-seq of tissue samples (bottom) from Roadmap Epigenomics Atlas. (3C) Dot plot comparisons to external indications of cardiomyocyte death. x-axis: measured Troponin levels (top panel) cardiomyocyte fraction as measured using DNA methylation markers (bottom panel). y-axis: strength of Heart-specific signature (relative to healthy subjects). (3D) Heatmap showing the level (Brown scale), and significance (Blue scale) of selected cell-type signatures in healthy subjects and myocardial infarction patients. Each cell in the map is divided in half, the top left half represents statistical significance (FDR corrected q-value) and the bottom half density of reads in the signature (normalized reads per kb). (3E) Heatmap of the tissue signatures for all samples; and extension of 3D and 3I. (3F) Examples of browser view of of Liver-specific windows that are part of evaluated signature (seeFIG. 3B ). (3G) Line graph of the change in signature strength in a myocardial infarction patient before/after PCI. Signatures strength are normalized to healthy subjects. The variability among healthy subjects is shown on the left. We can see initially high level of liver cells and elevated levels of heart cells. Following PCI liver cell decline and heart cells increase. (3H) Dot plot comparisons to external indications of cancer patients and liver signature. Presented as inFIG. 3C . (3I) Heatmap showing cell-type signatures in cancer patients (seeFIGS. 3D and 3E ). (3J) Combined line graph and bar graph of changes in liver signature (bars) and ALT levels (liver damage biomarker, black line) from blood samples of a patient undergoing hepatectomy. -
FIGS. 4A-H : (4A) Heatmap (as inFIG. 3D ) showing processes Hallmark genes that are over-represented in subjects (compared to healthy baseline). SeeFIG. 4C for full table with all hallmarks and subjects. (4B) Examples of browser view of genes with higher than expected signal in these expression signatures (seeFIG. 3B ). (4C) Heatmap of hallmarks signatures for all samples and signatures. ExtendsFIG. 4A . (4D) Browser view of H3K4me3 cfChIP and tissue ChIP signals at promoters of selected glycolytic genes (seeFIG. 4B ). (4E) Exemplary scatter plots of method for defining genes with elevated signal at a specific sample. Scatter plot of the normalized H3K4me3 counts at promoters of each gene. x-axis: average of reference healthy samples. y-axis: counts in the sample in question. Color dots represent genes in cancer signatures. Larger dots are significantly over-represented. (4F) Heatmap showing enrichment of tumor-specific signatures in the over-expressed genes. Each cell is divided in half, the top left half represents statistical significance (FDR corrected q-value) and the bottom half overlap with the signature (% number of genes in signature). SeeFIG. 4G for full table with all tumors and subjects. (4G) Heatmap of cancer signatures for all samples. (4H) Examples of browser view of cancer-associated genes and their signal in different samples. -
FIGS. 5A-M : (5A) Histograms of meta-analysis of cfChIP signal over active promoters and enhancers. (5B-C) Browser view of tracks of cfChIP of H3K4me3, H3K4me2, and H3K4me13 from a healthy subject. (5B) Shown is a region of highly expressed genes. We can see di-methylation and mono-methylation extending out from the tri-methylation signal. (5C) Shown is the locus surrounding IFNB1. ChromHMM tracks show prediction of promoters and enhancers according to combination of histone modification and chromatin accessibility assays. Arrows mark regions with enriched di- and mono-methylation. (5D-E) Scatter plots showing (5D) correlation of H3K4me2 and H3K4me3 at promoters in two samples, a healthy subject and a cancer patient and (5E) agreement between H3K4me2 of healthy subjects, and between H3K4me2 of two samples taken months apart from a cancer patient. Notable differences between healthy and cancer samples. (5F-H) Browser view of tracks comparing H3K4 methylation marks between healthy sample and a cancer sample (C002.2) for (5F) TCF3, (5G) CDX1 and (5H) CEACAM5 and CEACAM6. (5I) Histogram of meta-analysis of cfChIP for H3K36me3 signal over gene bodies with 5 kb flanking the transcription start site (TSS) and transcription end site (TES). Genes length is scaled. (5J) Scatter plot of correlation of H3K36me3 between Leukocytes and healthy sample (5K) Box plot of H3K36me3 marks active genes—healthy sample H3K36me3 counts (normalized by gene length) broken by quantiles of RNA levels of Leukocytes. (5L) Scatter plot of the raw Hf3K36me3 counts at gene bodies. Each dot represents a gene. x-axis: healthy samples. y-axis: colorectal adenocarcinoma sample. Color dots represent genes that are overexpressed in colorectal adenocarcinoma (COAD—red) or glioblastoma multiforme (GBM—green). (5M) Browser view of H3K4me3 and H3K36me3 signals at genes that show differential levels of these marks between healthy subject and colorectal adenocarcinoma patient. VIL1 gene shows differential signal for the two marks while CTDSP1 shows similar levels of H3K4me3 but marked increase in H3K36me3 in the sample from the colorectal adenocarcinoma patient. -
FIGS. 6A-C : (6A-C) Line graph examples of the background estimate for (6A) a healthy male sample, (6B) a healthy female sample and (6C) a cancer patient. -
FIG. 7 : Work flow of processing and analysis of cf-ChIP. -
FIG. 8 : A meta-plot (top) and heatmap (bottom) for 1,000 highly expressed promoters and the location relative to the transcriptional start site (TSS) of H3K4me3 in cf-nucleosomes from plasma that had already undergone cfChIP with anti-H3K4me1 antibodies. -
FIG. 9 : Meta-plots (above) and heatmaps (below) for 1000 highly expressed promoters and the location relative to the TSS of H3K9Ac, H3K27ac and H2A.Zac in cf-nucleosomes from plasma of healthy patient and one colorectal cancer patient. -
FIGS. 10A-D . (10A) Schematic drawing depicting the scheme of multiplexed ChIP-Seq. (10B) Schematic drawing of the experiment performed to test mixing during MPL based ChIP-Seq. Each rectangle represent an MPL barcoded surface that combines a unique barcode (BC1-BC4) in combination with anti H3K4me3 (K4) or anti H3K36me3 (K36) antibody each targeting a chromatin modification with distinct genomic location. Ellipses are chromatin from two yeast species: S. cerevisiae and K. lactis. Various mixing is then performed either before library prep (pink circle in the middle) or after. (10C) A bar graph of the fraction of immobilized chromatin (shown as a % of the input) captured. (10D) A line graph meta-analysis of H3K4me3 and H3K36me3 distribution over a gene body from ChIP-seq signal from MPL barcoded surfaces. - The present invention provides methods of determining the origin of cell free DNA (cfDNA), detecting death of a cell type or tissue in a subject and for determining a cellular state of a cell as it died by determining DNA-protein associations from transcriptionally active or inactive chromatin in the subject. The methods of the invention are based on the surprising finding that cell free nucleosomes retain protein-DNA associations that are informative about not only the tissue/cell of origin of the nucleosome, but also the pathways that were active and inactive in the cell as it died. Further, this was surprisingly possible even though the amount of cf-nucleosomes that can be captured was very small. As little as a thousand genomes worth of cf-nucleosomes was sufficient to perform the methods of the invention.
- The methods of the invention can be performed with very little input cfDNA, as little as 1000 genomes, and with very shallow sequencing, as little as 0.5M reads. The technique can be so performed because only positive associations are examined. The entire cfDNA is not sequenced, rather only cfDNA bound by a particular protein (e.g. a modified histone) is isolated and sequenced. Since only a small fraction of the cfDNA is sequenced the process is cheaper, faster and can be done with a lower depth of sequencing. Even within this smaller sample only informative genomic loci are examined; most locations are not informative about the tissue/cell of origin or the pathways active in the cell that died. By only examining informative loci, much of the noise present in the cfDNA can be ignored. Lastly, because DNA sequences that only are associated with the protein of interest in certain circumstances are investigated, only a few reads in these regions are needed to identify a positive reading in the cfDNA. For example, if binding of a protein of interest to DNA sequences that are uniquely bound in cardiac tissue is investigated, a healthy person (who has none or negligible cardiac cell death) would have only a handful of reads within these regions (See
FIG. 3A-E ). Healthy subjects showed a very low variance in the tissues and reads that were found, and so detection of abnormal cell death could be performed even with very few reads that were different than healthy individuals. A subject with an elevated number of reads at genomic regions unique to heart tissue would be identified as having ongoing elevated cardiac cell death. Not every read within these regions would need to be measured because negative data is not relevant, rather just a significant elevation of reads over the baseline of a healthy person would be sufficient. This can also be done to investigate the pathways, and cellular state of the dying cells. As cfDNA from a healthy subject has very few reads showing activation of genes in the hypoxia pathway, reads within in these regions would be indicative of hypoxia being the cause of increased cell death in a patient. - cfChIP has the potential to circumvent many of the limitations that exist in current analysis of cfDNA. Targeted enrichment of active marks results in reduced representation of the genome such that fewer sequencing reads (˜two orders of magnitude less) are required to obtain informative signal. Since we target marks associated with active transcription, we are assaying a positive signal, where few reads are indicative to the presence of a particular cell type or expression program. This is in contrast to methods such as occupancy or DNA methylation that either measure negative signal (lack of nucleosome occupancy) or both negative and positive signals (e.g., % methylated). Moreover, the cfChIP assay leaves most of the original sample intact, enabling using the same material for multiple assays (e.g., genomic sequencing, methylation analysis, or cfChIP with additional antibodies), which is important where blood volume is a limiting factor.
- Intensive research during the last two decades established the connection between specific histone marks and chromatin-templated processes including transcription, replication, and damage repair. Leveraging this rich and complex information to blood cfDNA analysis has the potential to unravel physiological processes in remote organs, such as cell proliferation, hypoxia, inflammation, metabolic changes, and cancerous transformation, in real time and with minimal invasiveness. All of these processes involve activation of large transcriptional programs, which leave unique imprint on chromatin.
- A key factor in using cfDNA-based assay for detection of cfDNA from rare cells, such as in early cancer diagnosis, is low detection limit. Several features of cfChIP can dramatically improve detection limits. 1. cfChIP detects “positive” signal, thus even low signals contribute significantly. 2. cfChIP can be performed with various antibodies targeting different genomic regions and states thus generating large signatures and the range of hundreds or thousands of sites with differential signal between different tissues or transcriptional programs. 3. Since cfChIP is by nature a low representation method cfChIP is unbiased, as all the captured DNA fragments are sequenced.
- Assaying modified cf-nucleosomes, either used alone or in combination with existing biomarkers, has multiple potential medical applications, such as early disease detection (e.g., detecting unknown tumors), improved diagnosis (e.g., replacing tissue biopsy with liquid biopsy), and non-invasive monitoring of disease progression and treatment efficacy.
- By a first aspect, there is provided a method of determining a cell or tissue of origin of cell free DNA (cfDNA), the method comprising:
-
- a. providing a sample comprising cfDNA;
- b. contacting the sample with at least one reagent that binds a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA; and
- d. sequencing the isolated cfDNA;
wherein the isolated cfDNA comprises a DNA sequence of an informative genomic location and association of the DNA-associated protein with the informative genomic location is indicative of a cell type or tissue; thereby determining the cell or tissue of origin of cfDNA.
- By another aspect, there is provided a method of determining a cellular state, tissue of origin, cell type or a combination thereof of a cell that released its DNA, comprising:
-
- a. providing a sample, wherein the sample comprises cell free DNA (cfDNA);
- b. contacting the sample with at least one reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA;
- d. sequencing the isolated cfDNA; and
- e. designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating from a cell in a cellular state, originating from a tissue, originative from a cell type or a combination thereof, wherein association of the DNA-associated protein with the informative genomic location is indicative of the cellular state, tissue of origin, cell type or combination thereof in the cell that released the cfDNA;
thereby determining a cellular state, tissue of origin cell type or combination thereof of a cell that released its DNA
- By another aspect, there is provided a method of determining a cell or tissue of origin of cell free DNA (cfDNA), comprising:
-
- a. providing a sample comprising cfDNA;
- b. contacting the sample with at least one reagent that binds a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA;
- d. sequencing the isolated cfDNA; and
- e. designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating in a cell type or tissue, wherein association of the DNA-associated protein with the informative genomic location is indicative of the cell type or tissue;
thereby determining the cell or tissue of origin of cfDNA.
- By another aspect, there is provided a method of determining a cell or tissue of origin of cell free DNA comprising sequencing cfDNA isolated by the cfDNA's binding to a DNA-associated protein; wherein the isolated cfDNA comprises a DNA sequence of an informative genomic location and association of the DNA-associated protein with the informative genomic location is indicative of a cell type or tissue; thereby determining the cell or tissue of origin of cfDNA.
- By another aspect, there is provided a method of determining a cellular state of a cell in a subject, comprising:
-
- a. providing a sample from the subject, wherein the sample comprising cfDNA;
- b. contacting the cfDNA with at least one reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA; and
- d. sequencing the isolated cfDNA;
wherein the isolated cfDNA comprises a DNA sequence of an informative genomic location and association of the DNA-associated protein with the informative genomic location is indicative of a cellular state; thereby determining a cellular state of a cell in a subject.
- By another aspect there is provided a method of determining a cellular state of a cell in a subject, as the cell died, comprising:
-
- a. providing a sample from the subject, wherein the sample comprises cfDNA;
- b. contacting the sample with a reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA;
- d. sequencing the isolated cfDNA; and
- e. designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating from a cell in a cellular state, wherein association of the DNA-associated protein with the informative genomic location is indicative of the cellular state;
thereby determining a cellular state of a cell as the cell died in a subject.
- By another aspect, there is provided a method of determining a cellular state of a cell in a subject, comprising sequencing cfDNA isolated by the cfDNA's binding to a DNA-associated protein; wherein the isolated cfDNA comprises a DNA sequence of an informative genomic location and association of the DNA-associated protein with the informative genomic location is indicative of a cellular state; thereby determining a cellular state of a cell in a subject. In some embodiments, the cell is a cell that has died in the subject.
- By another aspect, there is provided a method of determining a cellular state, tissue of origin or cell type of a cell in a subject, as the cell died, comprising:
-
- a. providing a sample from the subject, wherein the sample comprises cfDNA;
- b. contacting the sample with a reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA; and
- d. sequencing the isolated cfDNA;
wherein the isolated cfDNA comprises a DNA sequence of a tissue or cell type-specific binding site of the DNA-associated protein indicating the cell type or tissue of origin and association of the DNA-associated protein with the tissue or cell type-specific binding site is indicative of a cellular state; thereby determining a cellular state of a cell in a subject.
- By another aspect, there is provided a method of determining a cellular state, tissue of origin or cell type of a cell in a subject, as the cell died, comprising:
-
- a. providing a sample from the subject, wherein the sample comprises cfDNA;
- b. contacting the sample with a reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA;
- d. sequencing the isolated cfDNA; and
- e. designating a cfDNA molecule comprising a DNA sequence of a tissue or cell type-specific binding site of the DNA-associated protein as originating from the tissue or cell type, and as originating from a cell in a cellular state, wherein association of the DNA-associated protein with the binding site is indicative of the cellular state;
thereby determining a cellular state and tissue of origin or cell type of a cell as the cell died in a subject.
- In some embodiments, the method is for determining a cellular state of the cell. In some embodiments, the method is for determining a tissue of origin of the cell. In some embodiments, the method is for determining the cell type of the cell. In some embodiments, the sample is from a subject and the method is for detecting death of any one of: cells of a tissue, a cell type and cells in a cellular state in the subject. In some embodiments, the sample is from a subject and the method is for detecting a disease in the subject, wherein death of cells of a tissue, of a cell type or in a cellular state are indicative so the disease. For non-limiting example, death of liver cells may be indicative of liver disease, death of GI cells may indicate GI cancer, death of cells with active interferon response may indicate an infection and death of beta cells may indicate pancreatic damage/disease.
- In some embodiments, detecting a disease state comprises at least one of: early detection of the disease state, detection of residual disease and monitoring disease progression. In some embodiments, detecting a disease state comprises early detection. In some embodiments, early detection comprises detection during routine blood work. In some embodiments, early detection comprises detection before development of symptoms. In some embodiments, the residual disease is residual metastatic disease. In some embodiments, residual disease is residual cancer after surgery. In some embodiments, disease monitoring comprises monitoring before treatment. In some embodiments, disease monitoring comprises monitoring after treatment. In some embodiments, disease monitoring comprises monitoring disease relapse. In some embodiments, disease monitoring comprises monitoring treatment efficacy.
- In some embodiments, the cell died. In some embodiments, the cell released its DNA. In some embodiments, a cell that released its DNA is a dead and/or dying cell or a cell that denucleated. In some embodiments, a cell that released its DNA is a dead and/or dying cell. In some embodiments, the cell death is selected from apoptotic death and necrotic death. In some embodiments, a denucleated cell is an erythrocyte. In some embodiments a cell that is losing its nucleus is an erythroblast. Erythroblasts lose their nucleus to become erythrocytes, and as such the lost nucleus may appear in cfDNA.
- In some embodiments, the sample is from a subject. In some embodiments, the cfDNA is from a subject and detecting a cfDNA molecule of a cell tissue of origin or cellular state indicates detection of death of that cell type, tissue or cellular state. In some embodiments, the subject is suspected of having increased cell death. In some embodiments, the subject is not suspected of having increased cell death. In some embodiments, the subject appears healthy and/or is not known to suffer from a disease or condition.
- In some embodiments, the determining is determining the cellular state of the cell as it died. In some embodiments, the methods of the invention are further for determining a cellular state of a cell in a subject as the cell died and further comprise designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating from a cell in a cellular state, which association of the DNA-associated protein with the informative genomic location is indicative of the cellular state.
- In some embodiments, the sample is a bodily fluid. In some embodiments, the bodily fluid is blood. In some embodiments, the bodily fluid is selected from at least one of: blood, serum, gastric fluid, intestinal fluid, saliva, bile, tumor fluid, cerebrospinal fluid, breast milk, semen, urine, vaginal fluid, interstitial fluid, and stool. Standard techniques for cell-free DNA extraction are known to a skilled artisan, a non-limiting example of which is the QIAamp Circulating Nucleic Acid kit (QIAGEN).
- As used herein, “a reagent that binds” refers to any protein binding molecule or composition. Protein binding is well known in the art and may be assessed by any assay known in the art, including but not limited to yeast-2-hybrid, immunoprecipitation, competition assay, phage display, tandem affinity purification, and proximity ligation assay. In some embodiments, the reagent is a proteinaceous molecule. In some embodiments, the reagent is selected from an antibody or antigen binding fragment thereof, a protein and a small molecule. Small molecules that bind to specific proteins are well known in the art and may be used for pull-down experiments. Additionally, well characterized protein-protein interactions may be used for pull-downs. Indeed, any reagent that may be used for precipitation, immunoprecipitation (IP) or chromatin immunoprecipitation (ChIP), may be used as the reagent. In some embodiments, the reagent is an antibody or antigen binding fragment thereof.
- As used herein, the term “antibody” refers to a polypeptide or group of polypeptides that include at least one binding domain that is formed from the folding of polypeptide chains having three-dimensional binding spaces with internal surface shapes and charge distributions complementary to the features of an antigenic determinant of an antigen. An antibody typically has a tetrameric form, comprising two identical pairs of polypeptide chains, each pair having one “light” and one “heavy” chain. The variable regions of each light/heavy chain pair form an antibody binding site. An antibody may be oligoclonal, polyclonal, monoclonal, chimeric, camelised, CDR-grafted, multi-specific, bi-specific, catalytic, humanized, fully human, anti-idiotypic and antibodies that can be labeled in soluble or bound form as well as fragments, including epitope-binding fragments, variants or derivatives thereof, either alone or in combination with other amino acid sequences. An antibody may be from any species. The term antibody also includes binding fragments, including, but not limited to Fv, Fab, Fab′, F(ab′)2 single stranded antibody (svFC), dimeric variable region (Diabody) and disulphide-linked variable region (dsFv). In particular, antibodies include immunoglobulin molecules and immunologically active fragments of immunoglobulin molecules, i.e., molecules that contain an antigen binding site. Antibody fragments may or may not be fused to another immunoglobulin domain including but not limited to, an Fc region or fragment thereof. The skilled artisan will further appreciate that other fusion products may be generated including but not limited to, scFv-Fc fusions, variable region (e.g., VL and VH)˜Fc fusions and scFv-scFv-Fc fusions.
- Immunoglobulin molecules can be of any type (e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.g., IgG1, IgG2, IgG3, IgG4, IgA1 and IgA2) or subclass.
- In some embodiments, one reagent is contacted. In some embodiments, at least one reagent is contacted. In some embodiments, more than one reagent is contacted. In some embodiments, each reagent binds a different DNA-associated protein. In some embodiments, the DNA-associated protein is a sequence specific DNA-binder, and more than one reagent targeting more than one protein is contacted. Since the target sequence of binding is known, following sequencing of the isolated cfDNA the sequences can be assigned to each binding reagent based on the target motif present in the sequence. Sequences containing more than one motif can either be discarded or included as bound by multiple DNA-associated proteins.
- In some embodiments, the reagent is conjugated to a physical support. As used herein, the term “physical support” refers to a solid and stable molecule that gives support to the reagent. In some embodiments, the support is a scaffold or scaffolding agent. In some embodiments, the support is a resin. In some embodiments, the support is a bead. In some embodiments, the support is a magnetic or paramagnetic bead. Magnetic beads may be purchased for examples from Dynabeads or Pierce. In some embodiments, the support is an agarose bead. In some embodiments, the support is a protein A/G bead. In some embodiments, the reagent is conjugated to the physical support before the contacting. In some embodiments, the conjugating is a covalent linkage. In some embodiments, the conjugating is by epoxy chemistry. In some embodiments, the support aids in isolation of the reagent, wherein the isolating is isolating the physical support.
- As used herein, the term “DNA-associated protein” refers to any protein that can be precipitated with DNA or when precipitated brings along DNA. In some embodiments, the DNA-associated protein directly binds DNA. In some embodiments, the DNA-associated protein is a component of chromatin. In some embodiments, the DNA-associated protein binds-indirectly to DNA. In some embodiments, the DNA-associated protein binds to genomic DNA. In some embodiments, the DNA-binding protein binds in the promoter. In some embodiments, the DNA-binding protein binds in a gene body. In some embodiments, the DNA-binding protein binds to a cis or trans regulatory element.
- In some embodiments, the DNA-associated protein binds DNA and is a non-sequence specific DNA binder. In some embodiments, the DNA-associated protein binds DNA is a sequence specific DNA binder or a non-sequence specific DNA binder. Examples of non-sequence specific DNA binders include histones, high-mobility group (HMG) proteins, members of the DNA damage repair machinery and members of the general transcriptional machinery. The general transcriptional machinery is well defined and includes, but is not limited to, RNA polymerases, DNA helicases, general cofactors, the splicing machinery and the polyA machinery. The DNA damage repair machinery is also well defined and includes, but is not limited to, members of the nucleotide excision repair pathway, base excision repair pathway and the mismatch repair system. In some embodiments, the DNA-associated protein is a modified protein. In some embodiments, the modification is a post-translational modification. In some embodiments, the reagent binds to the modified form of the protein. In some embodiments, the reagent binds only or predominantly to the modified form of the protein.
- In some embodiments, the DNA-associated protein is a histone, modified histone or histone variant. Modifications to the histone tail are well known in the art, and include but are not limited to methylation, acetylation, sumoylation, ubiquitylation and phosphorylation. Modifications may be multiple such as tri-methylation or poly-ubiquitylation. In some embodiments, a tail may have multiple modifications such as methylation and phosphorylation. The histone may be one of the core histones, H1, H2A, H2B, H3 and H4, or it may be a histone variant such as, for non-limiting example, H2A.z, gammaH2AX, H1T, and H3.3. In some embodiments, the modified or variant histone has an activating function or a repressing function on transcription. In some embodiments, the modified or variant contributes to the formation of euchromatin or heterochromatin. In some embodiments, the modified histone is selected from
Histone 3 monomethylated lysine 4 (H3K4me1),Histone 3 demethylated lysine 4 (H3K4me2),Histone 3 trimethylated lysine 36 (H3K36me3) andHistone 3 trimethylated lysine 4 (H3K4me3). - In some embodiments, the DNA-associated protein binds DNA and is a sequence specific DNA binder. Examples of sequence specific DNA binders include but are not limited to transcription factors (TFs), activators, repressors, insulators, DNA modifying enzymes and members of the general transcriptional machinery. In some embodiments, the DNA-associated protein is a transcription factor. In some embodiments, the DNA-associated protein is an insulator. In some embodiments, the transcription factor is selected from an activator, a repressor, an insulator, a DNA modifying enzyme and a member of the general transcriptional machinery. In some embodiments, the transcription factor is selected from an activator, a repressor, and an insulator. In some embodiments, the transcription factor is an insulator. In some embodiments, the transcription factor is CTCF.
- As used herein, the term “transcription factor” refers to any protein that is not part of the general transcriptional machinery but controls/modulates the rate of transcription of a DNA sequence. In some embodiments, TFs are factors that bind in a promoter region. Transcription factors are well known in the art, as are reagents that bind to them. Performing ChIP with TFs is also well known.
- In some embodiments, the agent binds a transcription factor that binds to tissue and/or cell type specific enhancer elements. In some embodiments, the DNA sequence in the cfDNA is the sequence located at the tissue and/or cell type specific enhancer element. Due to the tissue/cell type specificity, association of the TF with this element in the cfDNA indicates the cfDNA is from that tissue and/or cell type. Since the enhancer element enhances transcription of a particular target the target may indicate the cellular state of the cell. In this one, one association of TF to genomic locus can be informative of both the tissue/cell of origin and the cellular state. A non-limiting example of this is tissue-specific NF-kB enhancer binding. NF-kB is known to bind at specific loci only in various tissues (cardiac for example) and mediate inflammation. Thus, isolation with an anti-NF-kB agent, and then identification of the tissue-specific enhancer sequence in the cfDNA indicates not only the cellular origin of the cfDNA, but also that the cell was in an inflammatory state at the time of death. In some embodiments, the DNA-associated protein is a transcription factor (TF), and the binding site is the TF binding site.
- As used herein, “activators” refer to proteins that increase transcription. In some embodiments, activators bind to enhancer elements in DNA. In some embodiments, activators bind to promoter proximal or distal elements. As used herein, “repressors” refer to proteins that decrease transcription. In some embodiments, repressors bind to repressor elements in DNA. In some embodiments, repressors bind to promoter proximal or distal elements.
- As used herein, “insulators” refers to proteins that separate regions of DNA that have different chromatin architecture or transcriptional rates. In some embodiments, an insulator is an enhancer-blocker. In some embodiments, an insulator separates euchromatin and heterochromatin. Non-limiting examples of insulators in include CTCF, gypsy and BDF1. In some embodiments, insulators bind outside of the promoter and gene bodies.
- DNA-modifying enzymes are well known in art, and examples include members of the base/nucleotide excision repair machinery, DNA methyltransferases and DNA demethylases.
- In some embodiments, the DNA-associated protein does not bind DNA. In some embodiments, the protein modifies a protein that binds DNA. Examples of such include but are not limited to histone modifying enzymes and polycomb proteins.
- In some embodiments, association of the DNA-associated protein with an informative genetic locus is tissue or cell type specific. In some embodiments, association of the DNA-associated protein with an informative genetic locus is differentiation specific. In some embodiments, association of the DNA-associated protein with an informative genetic locus is cellular state specific. In some embodiments, association of the DNA-associated protein with an informative genetic locus is indicative of transcriptional activation, active transcription, transcriptionally active chromatin or a combination thereof. In some embodiments, association of the DNA-associated protein with an informative genetic locus is indicative of transcriptional silencing, lack of transcription, transcriptionally inactive chromatin or a combination thereof. The transcription need not be at the genetic locus of binding, but may be at a near or far gene, such as is the case with activators and suppressors. As used herein, the term “genetic locus” and “genomic location” are synonymous and refer to a particular region of DNA that can be bound by a protein. In some embodiments, the genetic locus is a TF binding site, or some other short sequence of DNA. In some embodiments, the locus is between 2 and 20, 2 and 16, 2 and 12, 2 and 10, 2 and 8, 2 and 6, 2 and 4, 4 and 20, 4 and 16, 4 and 12, 4 and 10, 4 and 8 or 4 and 6 base pairs. Each possibility represents a separate embodiment of the invention. In some embodiments, the locus is a nucleosome, or a nucleosome length of DNA (˜170 bp). In some embodiments, the genetic locus is between 150 and 190, or 160 and 180 bp.
- As used herein, the terms “informative genomic location” and “informative genetic locus” are used synonymously and refer to a unique DNA sequence in a particular location in the genome that when associated with a given DNA-associated protein is informative of the cell in which the association occurs. In some embodiments, it is informative of the tissue of origin or cell type of the cell in which the association occurs. In some embodiments, the location is a tissue or cell type specific binding/association site. In some embodiments, the binding/association is not specific/unique, but highly enriched in the tissue or cell type. In some embodiments, it is informative of the cellular state of the cell in which the association occurs. In some embodiments, it is informative of both the tissue of origin and/or cell type and the cellular state of the cell in which the association occurs. In some embodiments, it is informative of a disease in the cell. In some embodiments, it is informative of a transcriptional program in the cell.
- As used herein, the term “transcriptional program” refers to a group of genes that act in concert transcriptionally. The genes may be actively transcribed and/or repressed, and/or inactive, and/or accessible, and/or inaccessible. In some embodiments, the genes are all transcriptional regulated together. In some embodiments, a transcriptional program is indicative of a cellular state. In some embodiments, a transcriptional program is indicative of an active signaling pathway. Signatures for tissue specific, cell type specific, cellular state specific and/or transcriptional programs can be found in, for example, the Roadmap Epigenomics Project (roadmapepigenomics.org), the Cancer Genome Atlas (cancergenome.nih.gov), the Genotype-Tissue Expression (GTEx) Project (gtexportal.org) or the Xena project (xena.ucsc.edu). Tables provided herein also provide such signatures.
- In some embodiments, the reagent binds histones. In some embodiments, the reagent is an anti-histone antibody or fragment thereof. In some embodiments, the reagent binds modified or variant histones. In some embodiments, the reagent is an anti-modified histone antibody or fragment thereof. In some embodiments, the reagent is an anti-variant histone antibody or fragment thereof. In some embodiments, the reagent is selected from
anti-Histone 3 monomethylated lysine 4 (H3K4me1) and anti-Histone 3 trimethylated lysine 4 (H3K4me3) antibodies. - In some embodiments, the isolating comprises isolating a physical support conjugated to the reagent. In some embodiments, the isolating comprises contacting the reagent and bound proteins and DNA with the physical support and then isolating the physical support. In some embodiments, the methods of the invention comprise ChIP. In some embodiments, the isolating comprises ChIP. In some embodiments, the isolating comprised washing steps.
- In some embodiments, the sequencing comprises sequencing at least an average of 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequencing comprises sequencing at least 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the amplified cfDNA comprises less than 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequencing is at a depth of at most 1, 2, 3, 5, or 10 million sequencing reads. Each possibility represents a separate embodiment of the invention.
- In some embodiments, the sequencing comprises PCR amplification of the cfDNA. In some embodiments, the amplification comprises ligation of a barcode or additional DNA sequence. In some embodiments, the amplification is performed while the cfDNA is still associated with the protein. In some embodiments, the amplification is performed while the cfDNA is still associated with the physical support. In some embodiments, the amplification is performed without disassociating the cfDNA from the reagent and/or the support.
- In some embodiments, the method further comprises comparing the sequencing data to tissue/cell type specific data of DNA binding proteins, wherein binding of a protein to a sequence that is specifically bound by that protein in a tissue/cell type indicates the cfDNA is from that tissue/cell type. Tissue/cell type specific binding data can be found in sources such as the Encode consortium, the NIH Epigenome Roadmap consortium and the Gene Transcription Regulation Database to name but a few. In some embodiments, the genomic location is within a tissue or cell-type specific gene or element. In some embodiments, the protein is associated with active transcription and the genomic location is within a tissue or cell-type specific gene or enhancer element. Non-limiting examples of this include H3K4me3 located in tissue specific genes, or H3K4me1 in enhancers. Non-limiting examples of tissue specific genes include TNNI3 and MYBPC3 in heart cells, and C8a and C8b in liver cells. Tissue expression levels for specific proteins can be found on numerous websites including the Uniprot database (www.uniprot.org), and the GTEx portal (www.gtexportal.org) for example. Tissue specific gene expression and regulation can also be found in numerous locations, most notably the TiGER database (bioinfo.wilmer.jhu.edu/tiger), and the Human Protein Atlas (www.proteinatlas.org). In some embodiments, the protein is associated with silenced transcription and the genomic location is within a repressor element, or a gene specifically silenced in the tissue or cell-type. Tissue-specific protein-DNA binding is well known in the art and can be found in the resources described herein above. Any informative locus binding may be used to determine the source of the cfDNA.
- In some embodiments, the sequencing data is compared to at least the top 1, 2, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 peaks of binding in a particular tissue/cell type. Each possibility represents a separate embodiment of the invention. In some embodiments, only one particular tissue/cell type is investigated. In some embodiments, binding data from at least 1, 2, 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 tissues or cell/types is used to compare to the sequencing data. Each possibility represents a separate embodiment of the invention.
- In some embodiments, the methods of the invention further comprise comparing the sequenced cfDNA to at least 1 genomic location with the greatest association of the DNA-associated protein in a tissue, cell type and/or cellular state, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location is considered to be from that tissue, cell type and/or cellular state. In some embodiments, the at least one genomic location has the greatest unique association of the DNA-associated protein. As used herein, the term “unique association” refers to an association that occurs only, or nearly exclusively, within a tissue or cell type, or a cell state. Thus, if the 10 most unique locations are selected for example, locations with protein binding only in a certain tissue, cell type or state should be examined, and specifically the 10 with the highest binding should be selected. Should there not be the required number of sites with completely unique binding, then the site with the most unique binding should be selected. Any determination of greatest uniqueness may be used. Examples of such include, but are not limited to, having binding in the fewest other tissues, and having the highest difference in the amount of binding between the tissue of interest and another tissue.
- In some embodiments, the methods of the invention further comprise comparing the sequenced cfDNA to a DNA-protein association atlas of at least 2 cell types and/or tissues, wherein the atlas comprises at least 1 genomic location with the greatest association of the DNA-associated protein in each of the 2 tissues, cell types and/or cellular states, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location is considered to be from that t tissue, cell type and/or cellular state. In some embodiments, genomic locations have the greatest unique association of the DNA-associated protein. In some embodiments, the atlas is of at least 1, 2, 3, 5, 7, 10, 15, or 20 cell types and/or tissues. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas comprises at least 1, 2, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 75, 80, 90 or 100 genomic locations per tissue, cell type and/or cellular state. Each possibility represents a separate embodiment of the invention.
- Examples of genomic locations that when associated with a DNA-associated protein are indicative of a tissue or cell type can be found in Table 1. Table 1 gives exemplary locations for H3K4me3 tissue-informative locations. In some embodiments, the atlas comprises all or a portion of the location in Table 1. In some embodiments, the sequencing of cfDNA is compared to Table 1.
-
TABLE 1 H3K4me3 tissue-specific locations Signature Gene Genomic Location Thymus RAG1 chr11: 36579801-36602400 Thymus RAG1 chr11: 36582401-36604800 Thymus RAG2 chr11: 36608001-36630200 Thymus CD1E chr1: 158316001-158338000 Thymus CD1B chr1: 158287601-158309400 Thymus OR10R3P chr1: 158449201-158471600 Thymus CD1E chr1: 158314001-158335200 Thymus CCR9 chr3: 45919001-45940200 Thymus . chr17: 74980401-75001800 Thymus AK057554 chr1: 158451801-158474200 Thymus . chr1: 158322001-158345000 Thymus . chr15: 58058601-58080800 Thymus . chr12: 123546401-123567800 Thymus CTB-26E19.1 chr5: 177857801-177880400 Thymus CD1B chr1: 158291401-158313200 Thymus CD1B chr1: 158290401-158311200 Thymus RP11-35J23.1 chr10: 98057201-98079800 Thymus CD1E chr1: 158315201-158336000 Thymus AC005780.1 chr19: 28863601-28886000 Thymus CD1B chr1: 158289401-158310200 Thymus PTCRA chr6: 42873201-42894800 Thymus ARL5C chr17: 37315801-37338400 Thymus . chr2: 57914401-57936600 Thymus ARPP21-AS1 chr3: 35681801-35704200 Brain DLX1 chr2: 172935201-172957800 Brain LINC00461 chr5: 87956001-87978400 Brain SRRM4 chr12: 119409601-119432400 Brain MIR9-3; LINC00925 chr15: 89896801-89919000 Brain SLC32A1 chr20: 37341401-37363400 Brain NCAN chr19: 19312401-19334400 Brain C14orf23 chr14: 29228001-29250400 Brain . chr7: 96629601-96652200 Brain MIR9-3; LINC00925 chr15: 89894601-89916800 Brain NRXN3 chr14: 78627201-78649200 Brain TMEM161B-AS1 chr5: 87680201-87703000 Brain AKAP6 chr14: 32788401-32811000 Brain C14orf23 chr14: 29230401-29252800 Brain . chr14: 23945401-23966600 Brain GPR56 chr16: 57663601-57685200 Brain BC034423 chr14: 29233801-29256400 Brain SOX2-OT chr3: 181410201-181432400 Brain . chr2: 172944801-172966800 Brain TIAM2 chr6: 155527601-155549800 Brain . chr20: 37344401-37365600 Brain MBP chr18: 74716401-74738600 Brain . chr2: 172946801-172968600 Brain NRXN3 chr14: 78629201-78651200 Brain TMEM161B-AS1 chr5: 87677201-87700200 Brain SRXN1; SCRT2 chr20: 642601-665400 Brain AK055364 chr16: 72448601-72471600 Brain . chr20: 37343401-37364400 Brain LOC100130155 chr8: 65276601-65298200 Brain . chr7: 96628601-96649600 Brain C10orf90 chr10: 128347801-128370000 Brain POU3F2 chr6: 99279201-99301800 Brain ERMN chr2: 158169201-158191600 Brain HAPLN2 chr1: 156578601-156601600 Brain MIR9-1 chr1: 156378001-156399600 Brain RAPGEF5 chr7: 22219201-22242000 Brain LOC100130155 chr8: 65275001-65296600 Brain CNTN2 chr1: 205002801-205024000 Brain PRKCZ chr1: 1995601-2017000 Brain SOX1 chr13: 112713201-112734400 Brain ROBO1 chr3: 78707201-78730200 Brain MBP chr18: 74713601-74735600 Brain SLC32A1 chr20: 37338801-37361000 Brain PREX1 chr20: 47261001-47283600 Brain CLVS2 chr6: 123308001-123329400 Brain MYT1 chr20: 62781201-62803600 Brain DLX1 chr2: 172943201-172964400 Brain BC034423 chr14: 29236401-29258800 Brain ERMN chr2: 158166801-158189200 Brain ST18 chr8: 53310401-53332800 Brain CNDP1 chr18: 72191201-72214200 Brain DOHH chr19: 3482401-3504800 Brain SOX2-OT chr3: 181412401-181434600 Brain NCAN chr19: 19314401-19336200 Brain NAT16 chr7: 100812801-100833800 Brain FOXG1 chr14: 29221801-29244200 Brain C10orf90 chr10: 128346001-128367800 Brain CRB1 chr1: 197227601-197250000 Brain CACNG3 chr16: 24257201-24279800 Brain HPN-AS1 chr19: 35585601-35608400 Brain MIR3065; MIR338 chr17: 79091201-79112600 Brain FOXG1 chr14: 29224801-29245600 Brain LOC150622 chr2: 6064001-6086000 Brain . chr13: 112702401-112723200 Brain BCAS1 chr20: 52634401-52656200 Brain ELAVL3 chr19: 11580201-11601200 Brain CAPN3 chr15: 42686401-42708600 Brain EPHA10 chr1: 38220601-38242000 Brain . chr2: 172932801-172954800 Brain SLC24A2 chr9: 19776001-19798400 Brain FOXG1 chr14: 29227001-29247600 Brain ABR chr17: 971001-991800 Brain SCRT1 chr8: 145547601-145568800 Brain SLCO1A2 chr12: 21466001-21488600 Brain DLX6-AS1 chr7: 96633201-96655200 Brain C14orf23 chr14: 29232801-29253800 Brain MSRA chr8: 9942801-9965800 Brain TTBK1 chr6: 43200801-43222200 Brain ANK3 chr10: 62136401-62158600 Brain SOX1 chr13: 112716001-112737600 Brain DLEU7 chr13: 51407201-51428000 Brain CACNG2 chr22: 37085801-37108800 Brain SRRM4 chr12: 119408401-119429600 Brain SOX1 chr13: 112714401-112736000 Brain . chr20: 37345601-37366600 Brain OMG chr17: 29613001-29634400 Brain LINC00461 chr5: 87895401-87917800 Brain BCAS1 chr20: 52632401-52654400 Brain CNTNAP4 chr16: 76301601-76323600 Brain SLC24A2 chr9: 19773601-19796000 Brain BX537900 chr8: 65279001-65299800 Brain NR2E1 chr6: 108476201-108497000 Brain AMER2 chr13: 25730601-25752600 Brain . chr13: 112700801-112721600 Brain AL832737 chr6: 168084401-168107400 Brain MIR9-1 chr1: 156376401-156398000 Brain . chr13: 112699401-112720800 Brain . chr7: 96632201-96653200 Brain NTM; NTM-AS1 chr11: 131522401-131544200 Brain TMTC4 chr13: 101277001-101299200 Brain . chr11: 2601801-2623200 Brain AC017053.1 chr2: 6452401-6475400 Brain MTUS1 chr8: 17568601-17590400 Brain MOBP chr3: 39501201-39523800 Brain LRRC4C chr11: 41469601-41491600 Brain LPPR1 chr9: 104021401-104043200 Brain ANO4 chr12: 101178401-101199200 Brain ANLN chr7: 36436801-36458800 Brain LINC00634 chr22: 42337401-42358800 Brain TMEM161B-AS1 chr5: 87674201-87697200 Brain ST18 chr8: 53306801-53329800 Brain LOC400940 chr2: 6113001-6135200 Brain SOX2-OT chr3: 181318401-181341000 Brain BC040833 chr10: 91091601-91112600 Brain CTD-2049O4.1 chr16: 27717401-27739600 Brain . chr17: 40112801-40134400 Sk. Muscle MYLK2 chr20: 30397401-30420200 Sk. Muscle MYOG chr1: 203043801-203066400 Sk. Muscle . chr20: 3275401-3298400 Sk. Muscle ANKRD23 chr2: 97498401-97520800 Sk. Muscle . chr6: 51990401-52013000 Sk. Muscle MYF6 chr12: 81089601-81111800 Sk. Muscle AC005616.2 chr19: 28989401-29011400 Sk. Muscle NEB chr2: 152579201-152602000 Sk. Muscle MYOT chr5: 137193801-137216200 Sk. Muscle CLCN1 chr7: 143002801-143025600 Sk. Muscle . chr6: 51993001-52015400 Sk. Muscle ART1 chr11: 3655401-3678000 Sk. Muscle PCNT chr21: 47758201-47780400 Sk. Muscle STAC3 chr12: 57633601-57656200 Sk. Muscle . chr20: 23119201-23141200 Sk. Muscle KLHL40 chr3: 42718001-42738800 Sk. Muscle FEZ2 chr2: 36808801-36831800 Sk. Muscle NEB chr2: 152576401-152599200 Sk. Muscle IDI2 chr10: 1059801-1082600 Sk. Muscle TPM3 chr1: 154153401-154174400 Sk. Muscle PTPN3 chr9: 112172401-112193800 Sk. Muscle CASQ1 chr1: 160151201-160172600 Sk. Muscle MYADML2 chr17: 79888401-79909400 Sk. Muscle C8orf22 chr8: 49974801-49996400 Sk. Muscle MYOZ1 chr10: 75390401-75411400 Sk. Muscle . chr8: 6620801-6641800 Sk. Muscle RP11-18B3.3 chr9: 108461201-108483200 Sk. Muscle . chr5: 137196601-137218200 Sk. Muscle . chr4: 23383601-23405400 Sk. Muscle ASB5 chr4: 177178801-177200400 Sk. Muscle MSS51 chr10: 75180601-75203600 Sk. Muscle . chr10: 908001-929200 Sk. Muscle ASB18 chr2: 237138601-237160200 Sk. Muscle MYL1 chr2: 211156201-211179000 Sk. Muscle RP11-6I2.4 chr8: 75004601-75027600 Sk. Muscle . chr10: 53877601-53900200 Sk. Muscle ASB5 chr4: 177177201-177198800 Sk. Muscle . chr13: 21393201-21416000 Sk. Muscle MYL1 chr2: 211166001-211188600 Sk. Muscle VGLL2 chr6: 117578001-117599800 Sk. Muscle . chr13: 21384801-21407400 Sk. Muscle LSMEM1 chr7: 112110201-112133000 Sk. Muscle . chr4: 23381601-23403600 Sk. Muscle . chr15: 93507801-93529600 Sk. Muscle LOC100507537 chr3: 154998801-155020800 Sk. Muscle TNNT1 chr19: 55650601-55672600 Sk. Muscle MYBPC1 chr12: 101951201-101973800 Sk. Muscle PKIA chr8: 79494401-79516600 Sk. Muscle . chr3: 16869601-16891000 Sk. Muscle PICK1 chr22: 38454201-38475400 Sk. Muscle MYLPF chr16: 30375801-30397600 Sk. Muscle . chr3: 69157401-69179000 Sk. Muscle . chr12: 117865601-117887200 Sk. Muscle . chr2: 97495801-97517400 Sk. Muscle . chr8: 42125001-42146200 Sk. Muscle . chr10: 76822201-76844000 Sk. Muscle . chr3: 15996801-16019400 Sk. Muscle . chr10: 75389201-75410400 Sk. Muscle CASQ1 chr1: 160150601-160171200 Sk. Muscle MYF6 chr12: 81093001-81115000 Heart TNNT2 chr1: 201335801-201356800 Heart TENM2 chr5: 167352201-167374400 Heart NEBL chr10: 21173801-21196600 Heart RBMS3 chr3: 29956401-29978800 Heart . chr1: 231971401-231993600 Heart MYL4 chr17: 45276001-45298400 Heart TNNI3 chr19: 55658201-55679800 Heart NEBL chr10: 21171001-21193800 Heart PRDM6 chr5: 122423401-122444400 Heart . chr10: 15745201-15767200 Heart . chr3: 24220001-24242400 Heart RP11-405A12.2 chr12: 19918801-19940600 Heart SLC5A1 chr22: 32445001-32467200 Heart . chr17: 8516601-8539000 Heart . chr10: 92715801-92738400 GI Sm. Muscle TACR2 chr10: 71165401-71187200 GI Sm. Muscle TACR2 chr10: 71163401-71185400 GI Sm. Muscle . chr19: 7409601-7432600 GI Sm. Muscle AC007392.3 chr2: 66909201-66931200 GI Sm. Muscle . chr15: 98548401-98570200 GI Sm. Muscle . chr1: 225636601-225658800 GI Sm. Muscle SIPA1L2 chr1: 232638201-232660600 Stomach CLDN18 chr3: 137707601-137730000 Stomach CTSE chr1: 206306801-206329000 Stomach TFF1 chr21: 43774801-43797000 Stomach . chr8: 124945601-124968400 Stomach CLDN18 chr3: 137710001-137732400 Stomach MUC5B chr11: 1141601-1163800 Stomach CAPN8 chr1: 223840401-223862200 Stomach AX747250 chr6: 1069601-1091400 Stomach C5orf66-AS1 chr5: 134363801-134384600 Stomach . chr17: 71941801-71963800 Stomach FAM177B chr1: 222900601-222922400 Stomach ANXA10 chr4: 169003801-169026000 Stomach C5orf66-AS1 chr5: 134364601-134385000 Stomach PAX9 chr14: 37122201-37142600 Stomach . chr14: 37125401-37146400 GI Mucosa BTNL3 chr5: 180405801-180428000 GI Mucosa . chr1: 233830001-233852800 GI Mucosa RBP2 chr3: 139183201-139205400 GI Mucosa . chr17: 46794401-46815600 GI Mucosa FRYL chr4: 48671401-48693200 GI Mucosa UBE2V2P1 chr10: 19327801-19350000 GI Mucosa . chr17: 71941801-71963800 GI Mucosa FAM177B chr1: 222900601-222922400 GI Mucosa REP15 chr12: 27840801-27862600 GI Mucosa . chr7: 12830601-12853600 GI Mucosa ANXA10 chr4: 169003801-169026000 GI Mucosa FABP2 chr4: 120231801-120253400 GI Mucosa . chr10: 3017801-3039200 GI Mucosa NXPE4 chr11: 114454001-114475600 GI Mucosa . chr16: 68407201-68429400 GI Mucosa LOC93432 chr7: 141801601-141823000 GI Mucosa . chr11: 76775401-76797200 GI Mucosa SLC5A1 chr22: 32429201-32450000 Digestive CLDN18 chr3: 137707601-137730000 Digestive CTSE chr1: 206306801-206329000 Digestive TFF1 chr21: 43774801-43797000 Digestive . chr8: 124945601-124968400 Digestive CLDN18 chr3: 137710001-137732400 Digestive MUC5B chr11: 1141601-1163800 Digestive BTNL3 chr5: 180405801-180428000 Digestive . chr1: 233830001-233852800 Digestive SPDEF chr6: 34512401-34534000 Digestive CAPN8 chr1: 223840401-223862200 Digestive RBP2 chr3: 139183201-139205400 Digestive . chr17: 46794401-46815600 Digestive FRYL chr4: 48671401-48693200 Digestive . chr11: 119767401-119789600 Digestive UBE2V2P1 chr10: 19327801-19350000 Digestive AX747250 chr6: 1069601-1091400 Digestive . chr11: 119765001-119787400 Digestive C5orf66-AS1 chr5: 134363801-134384600 Digestive TMC5 chr16: 19419201-19440600 Digestive TRIM31 chr6: 30068401-30090000 Digestive . chr17: 71941801-71963800 Digestive FAM177B chr1: 222900601-222922400 Digestive REP15 chr12: 27840801-27862600 Digestive . chr7: 12830601-12853600 Digestive ANXA10 chr4: 169003801-169026000 Digestive FABP2 chr4: 120231801-120253400 Digestive . chr10: 3017801-3039200 Digestive BCAR3 chr1: 94037801-94059200 Digestive . chr4: 149590201-149612200 Digestive ERN2 chr16: 23713001-23734400 Digestive C5orf66-AS1 chr5: 134364601-134385000 Digestive NXPE4 chr11: 114454001-114475600 Digestive PAX9 chr14: 37122201-37142600 Digestive . chr16: 68407201-68429400 Digestive LOC93432 chr7: 141801601-141823000 Digestive . chr14: 37125401-37146400 Digestive . chr11: 76775401-76797200 Digestive VSIG2 chr11: 124610401-124631800 Digestive SLC5A1 chr22: 32429201-32450000 Pancreas Islet SLC30A8 chr8: 118138601-118161400 Pancreas Islet SLC30A8 chr8: 118135801-118158600 Pancreas Islet PAM chr5: 102134201-102156200 Pancreas Islet G6PC2 chr2: 169747201-169769400 Pancreas Islet G6PC2 chr2: 169749401-169771600 Pancreas Islet KCNMB2-IT1; chr3: 178127201-178149200 KCNMB2 Pancreas Islet IAPP chr12: 21515201-21537400 Pancreas Islet GCK chr7: 44216201-44239000 Pancreas Islet . chr2: 77425801-77448600 Pancreas Islet ELAVL4 chr1: 50561401-50583000 Pancreas Islet . chr4: 43755001-43777000 Pancreas Islet GNAS chr20: 57405601-57427400 Pancreas Islet . chr14: 62341201-62363800 Pancreas Islet CNTN4 chr3: 2544801-2567400 Pancreas Islet LINC01099 chr4: 178815401-178837400 Pancreas Islet IAPP chr12: 21517401-21539400 Pancreas SLC30A8 chr8: 118138601-118161400 Pancreas SLC30A8 chr8: 118135801-118158600 Pancreas PRSS1 chr7: 142447001-142468600 Pancreas TPST2 chr22: 26950201-26971200 Pancreas CTRB2 chr16: 75230001-75251200 Pancreas PAM chr5: 102134201-102156200 Pancreas CELA3A chr1: 22318001-22340600 Pancreas G6PC2 chr2: 169747201-169769400 Pancreas SYCN chr19: 39684001-39705600 Pancreas PNLIPRP1 chr10: l18341201-118364200 Pancreas LOC644838 chr2: 67431001-67452000 Pancreas G6PC2 chr2: 169749401-169771600 Pancreas PTF1A chr10: 23469201-23491000 Pancreas KCNMB2-IT1; chr3: 178127201-178149200 KCNMB2 Pancreas RBPJL; MATN4 chr20: 43925001-43946800 Pancreas . chr1: 236428801-236451600 Pancreas . chr10: 23477601-23500000 Pancreas IAPP chr12: 21515201-21537400 Pancreas CELA3A chr1: 22320601-22343200 Pancreas CTRC chr1: 15754801-15776800 Pancreas SYCN chr19: 39682401-39704000 Pancreas GCK chr7: 44216201-44239000 Pancreas PTF1A chr10: 23471601-23492400 Pancreas CELA3B chr1: 22292601-22315000 Pancreas AC011298.2 chr2: 241616201-241638400 Pancreas PRSS1 chr7: 142448601-142470200 Pancreas . chr2: 77425801-77448600 Pancreas ELAVL4 chr1: 50561401-50583000 Pancreas AQP8 chr16: 25218201-25239400 Pancreas . chr4: 43755001-43777000 Pancreas NPHS1 chr19: 36331601-36354200 Pancreas RP11-20I20.2 chr4: 1117401-1139800 Pancreas GNAS chr20: 57405601-57427400 Pancreas . chr14: 62341201-62363800 Pancreas CNTN4 chr3: 2544801-2567400 Pancreas LINC01099 chr4: 178815401-178837400 Pancreas IAPP chr12: 21517401-21539400 Placenta . chr19: 9108001-9130600 Placenta LOC100506655 chr16: 25148801-25170400 Placenta DLG5 chr10: 79605601-79627600 Placenta . chr6: 140458801-140481400 Placenta . chr6: 168803001-168825200 Placenta STRA6; HP11097 chr15: 74483601-74505400 Placenta ZFAT chr8: 135697001-135719600 Placenta . chr3: 72495001-72517800 Placenta BCAR4 chr16: 11909801-11932800 Placenta ZFP42 chr4: 188906601-188928400 Placenta ERVW-1 chr7: 92095801-92117400 Placenta . chr11: 2147001-2168000 Placenta HSD3B1 chr1: 120039801-120060800 Placenta . chr19: 6204401-6225800 Placenta TRIM60 chr4: 165942401-165964000 Placenta NAA11 chr4: 80236001-80257600 Placenta . chr4: 133255401-133277000 Placenta RNU6-964P chr2: 230792601-230815000 Placenta GCM1 chr6: 53002201-53024000 Placenta CCBP2 chr3: 42840801-42862800 Placenta . chr11: 2152001-2173800 Placenta . chr5: 86536801-86559200 Placenta PSG1 chr19: 43371001-43394000 Placenta . chr1: 245057601-245078800 Placenta LGALS13 chr19: 40083201-40104800 Placenta RPS20P15 chr3: 27494401-27516800 Placenta . chr1: 120040801-120061800 Placenta GRAMD3 chr5: 125685801-125707800 Placenta AK127846 chr19: 53500801-53522400 Placenta PSG6 chr19: 43410601-43432000 Placenta BRWD1 chr21: 40682201-40703600 Placenta . chr7: 114530401-114552000 Placenta CGA chr6: 87793201-87814800 Placenta ARID3A chr19: 949401-970800 Placenta LINC00577 chr6: 105374201-105396800 Placenta BC031304 chr2: 46645601-46667600 Placenta . chr20: 39599401-39621800 Placenta . chr11: 105376001-105397200 Placenta . chr1: 204272601-204293400 Placenta PSG3; PSG7 chr19: 43428601-43451600 Placenta RP3-407E4.4 chr6: 64141401-64162200 Placenta . chr19: 54140601-54162000 Placenta FAM13A chr4: 89845601-89867800 Liver . chr4: 155518001-155542600 Liver CPS1 chr2: 211411001-211433600 Liver APOH chr17: 64212601-64235600 Liver BAAT chr9: 104134601-104157400 Liver C8B chr1: 57418401-57441000 Liver TAT chr16: 71599001-71621400 Liver CCL16 chr17: 34296601-34319200 Liver . chr17: 44000401-44023000 Liver UGT2B4 chr4: 70349201-70372200 Liver C4BPA chr1: 207267801-207290000 Liver APCS chr1: 159546801-159569000 Liver GYS2 chr12: 21745401-21767800 Liver CCND2P1 chr11: 62998801-63021000 Liver SULT2A1 chr19: 48378401-48399600 Liver RDH16 chr12: 57339401-57361000 Liver SERPINC1 chr1: 173874401-173896000 Liver . chr7: 45719401-45741600 Liver CCND2P1 chr11: 63001001-63023000 Liver . chr17: 41530801-41553400 Liver . chr6: 132157001-132179400 Liver C5orf27 chr5: 95178201-95200400 Liver CYP2B6 chr19: 41487401-41509600 Liver SERPINA10 chr14: 94747001-94768800 Liver SLC22A1 chr6: 160533601-160556200 Liver SEPP1 chr5: 42813801-42836200 Liver CPN1 chr10: 101829401-101852000 Liver CPB2 chr13: 46666201-46688000 Liver C9 chr5: 39353401-39375000 Liver KCTD21-AS1 chr11: 77870601-77892200 Liver AHSG chr3: 186320401-186342200 Liver CPB2 chr13: 46668001-46689600 Liver SPP2 chr2: 234949001-234971600 Liver . chr12: 111502801-111524800 Liver ADCY10 chr1: 167871201-167893400 Liver JB175316 chr12: 9269801-9291800 Liver AOC4; AF047486 chr17: 41009601-41031800 Liver TTC39C chr18: 21683801-21706400 Liver C19orf80 chr19: 11340201-11362000 Liver TFR2 chr7: 100227801-100249800 Liver APOC3 chr11: 116690001-116712000 Liver ALB chr4: 74262801-74285400 Liver AKR1C4 chr10: 5229201-5252200 Liver ITIH2 chr10: 7734201-7756400 Liver . chr4: 77221201-77243600 Liver C8A chr1: 57310801-57332800 Liver MIR122 chr18: 56102601-56124400 Liver NCOR1 chr17: 15930001-15952600 Liver APOA2 chr1: 161182601-161204000 Liver CYP2C8 chr10: 96818001-96840200 Liver . chr3: 16363001-16385400 Liver ABCB11 chr2: 169876201-169898000 Liver C9 chr5: 39351601-39373400 Liver HAO1 chr20: 7909801-7931600 Liver CYP2C9 chr10: 96689201-96710800 Liver FGL1 chr8: 17740801-17763200 Liver C8B chr1: 57421001-57443600 Liver PON1 chr7: 94942001-94963600 Liver CPS1 chr2: 211413601-211436000 Liver LEAP2 chr5: 132199401-132220400 Liver . chr3: 148924401-148946600 Liver C5 chr9: 123801201-123822600 Liver AKR1D1 chr7: 137750801-137773000 Liver . chr5: 68030601-68052600 Liver SLC22A10 chr11: 63046601-63068800 Liver C5 chr9: 123799401-123821200 Liver GBP7 chr1: 89629801-89651800 Liver . chr16: 76555001-76577000 Liver APCS chr1: 159549001-159571200 Liver AOC4; AF047486 chr17: 41007201-41029600 Liver MASP2 chr1: 11095801-11117400 Liver U91324.1 chr2: 8269401-8292000 Liver . chr5: 115064401-115086800 Liver GPLD1 chr6: 24477801-24500400 Liver . chr10: 113975801-113997400 Liver PZP chr12: 9349001-9371000 Liver . chr2: 138689001-138711400 Liver HAO1 chr20: 7907801-7929800 Liver . chr10: 96817201-96838000 Liver APOC3 chr11: 116692001-116714000 Liver SERPINA11 chr14: 94907801-94929400 Liver . chr2: 31923201-31945400 Liver ITIH2 chr10: 7736401-7758600 Liver . chr12: 24178601-24201400 Liver SERPINA10 chr14: 94748801-94769600 Liver RCL1 chr9: 4829201-4850800 Liver NR1I3 chr1: 161196001-161218400 Liver SLC17A2 chr6: 25919801-25942200 Liver RTP3 chr3: 46530201-46552000 Liver CFHR5 chr1: 196937001-196960000 Liver SLC17A2 chr6: 25917401-25939800 Liver HRG chr3: 186373201-186395000 Liver CYP2A6 chr19: 41345001-41366400 Liver GLYATL1 chr11: 58696801-58718800 Liver ADH1A chr4: 100199201-100221200 Liver . chr16: 51074401-51096600 Liver . chr12: 7635801-7658000 Liver . chr2: 151010601-151033400 Liver BDH1 chr3: 197289201-197311000 Liver SLC22A10 chr11: 63048801-63071000 Liver A1BG chr19: 58853801-58875400 Liver CFHR4 chr1: 196847601-196869600 Liver RTP3 chr3: 46528401-46550200 Liver CYP4A22 chr1: 47593001-47615200 Liver . chr8: 63909801-63931800 Liver C4BPA chr1: 207270001-207292200 Liver EPHX1 chr1: 226004401-226025800 Liver . chr6: 132154601-132177000 Liver LDHD chr16: 75143001-75165600 Liver INHBC chr12: 57818401-57839600 Liver F12 chr5: 176825201-176846600 Liver LPAL2 chr6: 160921201-160943000 Liver FETUB chr3: 186349001-186371200 Liver SLC6A13 chr12: 336001-357400 Liver SLC22A1 chr6: 160530801-160553600 Liver ANXA10 chr4: 169075401-169098400 Liver . chr22: 43860201-43881800 Liver IGFBP1 chr7: 45919601-45941800 Liver SLC22A9 chr11: 63126601-63148200 Liver . chr2: 21306401-21328800 Liver APOC4-APOC2 chr19: 45435401-45457000 Liver CSAD chr12: 53552001-53574400 Liver . chr1: 84222001-84244800 Liver HABP2 chr10: 115304201-115326000 Liver INHBA chr7: 41737001-41758800 Liver AKR1CL1 chr10: 5215201-5236800 Liver ALB chr4: 74260001-74281200 Liver CLDN14 chr21: 37842201-37864600 Liver AC004862.6 chr7: 79991801-80014200 Liver HRG chr3: 186375001-186396800 Liver . chr6: 136139401-136161600 Liver SAA4 chr11: 18247401-18269200 Liver SERPINC1 chr1: 173876001-173897600 Liver . chr2: 21308801-21331000 Liver ADH1A chr4: 100201201-100223200 Liver . chr16: 28526401-28548400 Liver CFHR5 chr1: 196934001-196957000 Liver . chr13: 74729801-74752200 Liver . chr1: 119912601-119935400 Liver LINC01485 chr5: 173206601-173228000 Liver SLC22A9 chr11: 63128201-63149800 Liver . chr6: 8896801-8919400 Liver . chr4: 155495201-155517000 Liver ITIH3 chr3: 52818201-52839800 Liver . chr9: 4604001-4626000 Liver . chr5: 133740201-133762400 Liver EVA1A chr2: 75734201-75756200 Liver DQ590166; HSD17B6 chr12: 57148201-57170200 Liver RDH16 chr12: 57341001-57362600 Liver CRP chr1: 159670801-159693400 Liver FAM214A chr15: 52988401-53010800 Liver SERPIND1 chr22: 21117801-21139600 Liver . chr10: 77781001-77803200 Liver RP11-328K4.1 chr4: 104335601-104357400 Liver . chr16: 76552801-76575000 Liver GAS2 chr11: 22687201-22710200 Liver . chr1: 161182001-161202600 Liver CFHR3; CFHR1 chr1: 196733201-196756000 Liver SMUG1P1 chr18: 47166801-47189200 Liver CP chr3: 148926601-148947800 Liver GYS2 chr12: 21742801-21765400 Liver CPN2 chr3: 194061201-194082800 Liver ABCG5; ABCG8 chr2: 44058201-44080000 Liver RNA5SP288 chr9: 97895001-97916600 Liver . chr8: 85387401-85410200 Liver ATP11B chr3: 182602601-182624400 Liver LEAP2 chr5: 132200401-132222200 Liver HABP2 chr10: 115301601-115323400 Liver F13B chr1: 197023001-197045200 Liver LOC157273 chr8: 9173001-9195800 Liver MIR122 chr18: 56100601-56122600 Liver . chr4: 155522601-155543600 Liver . chr10: 72073201-72093800 Liver . chr12: 92884601-92906800 Liver SLCO1B3 chr12: 20953001-20975200 Liver . chr5: 56936401-56957600 Liver MIR122 chr18: 56104601-56125200 Liver SDS chr12: 113830001-113851400 Liver . chr3: 125326001-125348200 Liver F13B chr1: 197025201-197047400 Liver . chr6: 23572401-23595000 Liver UGT2B15 chr4: 69523801-69546000 Liver . chr10: 93857601-93880200 Vasculary . chr12: 105852801-105875400 Vasculary AC073130.3 chr7: 115956001-115977800 Vasculary PTPRE chr10: 129774601-129797600 Vasculary . chr12: 96820801-96843600 Vasculary CTD-2337A12.1 chr5: 95910001-95932000 Vasculary ESM1 chr5: 54269201-54289800 Vasculary VEPH1 chr3: 157240001-157261000 Vasculary RP11-322E11.5 chr18: 33034401-33055800 Vasculary . chr2: 216702601-216724000 Vasculary LOC729987 chr1: 98666201-98687400 Vasculary . chr6: 12284601-12306200 Vasculary . chr1: 98542601-98565600 Vasculary . chr4: 85779401-85801400 Vasculary RP3-390M24.1 chr6: 79303001-79325000 Vasculary RP11-679B17.2 chr6: 11598201-11619800 Vasculary RP11-115J23.1 chr11: 28714401-28736800 Vasculary ESM1 chr5: 54269801-54290400 Vasculary VEPH1 chr3: 157237601-157260000 Vasculary . chr2: 56133801-56155600 Vasculary CTD-2337A12.1 chr5: 95912001-95933800 Vasculary TNFSF18 chr1: 173008201-173030600 Vasculary RP3-390M24.1 chr6: 79305001-79325600 Vasculary . chr5: 28145601-28167600 Vasculary PCAT19 chr19: 41974401-41995000 Vasculary MIR216A chr2: 56204201-56227000 Vasculary LOC100507254 chr6: 132445801-132466600 Vasculary . chr8: 78257401-78280400 T-Helper-Cells . chr21: 26927401-26948400 T-Helper-Cells . chr5: 156641201-156666200 T-Helper-Cells CCL20 chr2: 228666201-228688600 T-Helper-Cells . chr21: 26931801-26953600 T-Helper-Cells MAP3K4 chr6: 161491601-161513200 T-Helper-Cells . chr12: 94157201-94179600 T-Helper-Cells . chr8: 128992001-129013600 T-Helper-Cells . chr9: 123675001-123695800 T-Helper-Cells . chr2: 228661001-228683200 T-Helper-Cells . chr21: 26929001-26949600 T-Helper-Cells . chr17: 80269601-80291000 T-Helper-Cells BC045668; IL21 chr4: 123529201-123551000 T-Helper-Cells . chr1: 206746601-206768000 T-Helper-Cells . chr7: 130631401-130652200 T-Helper-Cells . chr6: 161493201-161514800 T-Helper-Cells . chr6: 16427801-16449400 T-Helper-Cells . chr22: 47061801-47083200 T-Helper-Cells . chr8: 128993601-129014600 T-Helper-Cells . chr14: 103262801-103285200 T-Helper-Cells . chr15: 60982201-61004600 T-Helper-Cells . chr2: 228663201-228685200 T-Helper-Cells . chr18: 9079201-9101400 T-Helper-Cells . chr2: 228703401-228725600 T-Helper-Cells . chr5: 156640201-156661200 T-Helper-Cells . chr21: 26928401-26949000 T-Helper-Cells . chr2: 157174801-157195600 T-Cells . chr6: 112176801-112199600 T-Cells RNU6-933P chr11: 60740601-60763400 T-Cells . chr21: 26927401-26948400 T-Cells . chr10: 6087801-6110800 T-Cells . chr5: 156641201-156666200 T-Cells RP11-61O1.1 chr14: 98658001-98680400 T-Cells . chr10: 6090801-6111800 T-Cells ARHGAP15 chr2: 143995201-144017000 T-Cells CCL20 chr2: 228666201-228688600 T-Cells . chr21: 26931801-26953600 T-Cells CTLA4 chr2: 204722601-204743400 T-Cells . chr2: 204707801-204730400 T-Cells MAP3K4 chr6: 161491601-161513200 T-Cells ICOS chr2: 204788401-204810600 T-Cells RNU6-933P chr11: 60744201-60765800 T-Cells AX747844 chr12: 47593001-47614600 T-Cells TNIP3 chr4: 122135801-122158200 T-Cells . chr12: 94157201-94179600 T-Cells . chr15: 60853601-60876400 T-Cells . chr8: 128992001-129013600 T-Cells . chr9: 123672801-123693800 T-Cells . chr9: 123675001-123695800 T-Cells . chr8: 134065601-134087400 T-Cells TLDC1 chr16: 84575801-84597200 T-Cells . chr2: 228661001-228683200 T-Cells . chr21: 26929001-26949600 T-Cells . chr3: 16333201-16354200 T-Cells CDC14A chr1: 100875801-100898600 T-Cells . chr4: 90199801-90222200 T-Cells CYTIP chr2: 158293001-158314600 T-Cells . chr2: 204796001-204817800 T-Cells . chr22: 37605601-37626400 T-Cells . chr17: 80269601-80291000 T-Cells . chr21: 26933601-26955200 T-Cells RP11-291B21.2 chr12: 10695601-10716800 T-Cells CTLA4 chr2: 204724801-204745600 T-Cells . chr15: 60863601-60886200 T-Cells RTKN2 chr10: 63983001-64005400 T-Cells RP11-799D4.2 chr17: 33505001-33527400 T-Cells ETS1 chr11: 128324001-128346600 T-Cells . chr1: 160518401-160540200 T-Cells BC045668; IL21 chr4: 123529201-123551000 T-Cells RP11-61O1.1 chr14: 98655401-98678000 T-Cells . chr6: 128289201-128312200 T-Cells . chr4: 143305601-143328400 T-Cells . chr12: 9938801-9960200 T-Cells . chr14: 61791401-61814000 T-Cells . chr1: 206746601-206768000 T-Cells . chr2: 204566201-204588800 T-Cells . chr1: 90062201-90084400 T-Cells . chr7: 130631401-130652200 T-Cells . chr22: 40294801-40317200 T-Cells . chr6: 161493201-161514800 T-Cells . chr16: 27409001-27431000 T-Cells . chr1: 117292801-117314600 T-Cells . chr6: 16427801-16449400 T-Cells . chr6: 33302001-33323400 T-Cells . chr4: 143308401-143331200 T-Cells . chr22: 47061801-47083200 T-Cells CTLA4 chr2: 204723801-204744400 T-Cells . chr4: 143287401-143310000 T-Cells . chr8: 128993601-129014600 T-Cells . chr21: 36404601-36426800 T-Cells . chr14: 103262801-103285200 T-Cells . chr3: 108542801-108564600 T-Cells . chr15: 60982201-61004600 T-Cells . chr2: 228663201-228685200 T-Cells BC062769 chr2: 197115801-197137400 T-Cells . chr1: 90066401-90088800 T-Cells USP44 chr12: 95934801-95957200 T-Cells . chr3: 59982801-60005400 T-Cells FANK1 chr10: 127674201-127696600 T-Cells . chr2: 106344201-106367000 T-Cells ANK3 chr10: 62475401-62498200 T-Cells . chr19: 9963401-9985000 T-Cells . chr8: 121733601-121754600 T-Cells . chr17: 38758401-38780400 T-Cells . chr18: 9079201-9101400 T-Cells . chr2: 228703401-228725600 T-Cells . chr5: 156640201-156661200 T-Cells . chr14: 98640001-98662400 T-Cells . chr3: 42669401-42691200 T-Cells . chr8: 129539401-129562400 T-Cells . chr7: 50411401-50433000 T-Cells . chr15: 60856401-60879200 T-Cells . chr2: 181994801-182017400 T-Cells TRABD2A chr2: 85056801-85079600 T-Cells RN7SL328P chr9: 134593001-134614600 T-Cells TIGIT chr3: 114003001-114024400 T-Cells . chr16: 27407801-27428600 T-Cells GIMAP4 chr7: 150250801-150272800 T-Cells . chr5: 156608401-156630800 T-Cells . chr21: 26928401-26949000 T-Cells . chr2: 157174801-157195600 T-Cells CLEC2D chr12: 9818401-9840200 T-Cells . chr1: 12560601-12581600 T-Cells RP11-61O1.1 chr14: 98660401-98681600 T-Cells . chr14: 102271201-102293000 T-Cells . chr6: 154541401-154564200 T-Cells . chr1: 214803401-214825800 T-Cells . chr2: 143999001-144021000 T-Cells RP1-281H8.3 chr6: 149806001-149828200 NK TARP chr7: 38304201-38325800 NK . chr2: 8411201-8432600 NK KLRF1 chr12: 9970201-9991600 NK . chr17: 66206801-66229600 NK CMC1 chr3: 28323201-28344800 NK KLRD1 chr12: 10451601-10472400 NK TRDC chr14: 22910401-22932800 NK AK096766 chr7: 38333401-38355000 NK KLRD1 chr12: 10452401-10473400 NK CMC1 chr3: 28324801-28346400 NK SH2D1B chr1: 162369401-162391200 NK TCRDV2 chr14: 22916401-22938600 NK KLRD1 chr12: 10448001-10470000 Lymphocytes . chr6: 112176801-112199600 Lymphocytes RNU6-933P chr11: 60740601-60763400 Lymphocytes . chr21: 26927401-26948400 Lymphocytes HLA-DOB chr6: 32774001-32794800 Lymphocytes . chr10: 6087801-6110800 Lymphocytes . chr5: 156641201-156666200 Lymphocytes TARP chr7: 38304201-38325800 Lymphocytes RP11-61O1.1 chr14: 98658001-98680400 Lymphocytes . chr10: 6090801-6111800 Lymphocytes AK128525 chr2: 89147801-89169600 Lymphocytes . chr2: 8411201-8432600 Lymphocytes KLRF1 chr12: 9970201-9991600 Lymphocytes ARHGAP15 chr2: 143995201-144017000 Lymphocytes . chr3: 40655601-40676800 Lymphocytes CCL20 chr2: 228666201-228688600 Lymphocytes . chr21: 26931801-26953600 Lymphocytes CTLA4 chr2: 204722601-204743400 Lymphocytes . chr2: 204707801-204730400 Lymphocytes . chr17: 66206801-66229600 Lymphocytes MAP3K4 chr6: 161491601-161513200 Lymphocytes ICOS chr2: 204788401-204810600 Lymphocytes RNU6-933P chr11: 60744201-60765800 Lymphocytes CMC1 chr3: 28323201-28344800 Lymphocytes AX747844 chr12: 47593001-47614600 Lymphocytes TNIP3 chr4: 122135801-122158200 Lymphocytes . chr12: 94157201-94179600 Lymphocytes . chr15: 60853601-60876400 Lymphocytes . chr8: 128992001-129013600 Lymphocytes . chr9: 123672801-123693800 Lymphocytes . chr9: 123675001-123695800 Lymphocytes KLRD1 chr12: 10451601-10472400 Lymphocytes . chr8: 134065601-134087400 Lymphocytes TLDC1 chr16: 84575801-84597200 Lymphocytes . chr2: 228661001-228683200 Lymphocytes FYN chr6: 112103401-112125600 Lymphocytes LILRB1 chr19: 55118401-55140200 Lymphocytes . chr21: 26929001-26949600 Lymphocytes . chr3: 16333201-16354200 Lymphocytes CDC14A chr1: 100875801-100898600 Lymphocytes . chr4: 90199801-90222200 Lymphocytes CYTIP chr2: 158293001-158314600 Lymphocytes . chr2: 204796001-204817800 Lymphocytes TRDC chr14: 22910401-22932800 Lymphocytes . chr22: 37605601-37626400 Lymphocytes . chr17: 80269601-80291000 Lymphocytes SAMD3 chr6: 130525801-130546600 Lymphocytes . chr21: 26933601-26955200 Lymphocytes AK096766 chr7: 38333401-38355000 Lymphocytes RP11-291B21.2 chr12: 10695601-10716800 Lymphocytes CTLA4 chr2: 204724801-204745600 Lymphocytes . chr15: 60863601-60886200 Lymphocytes FCRL3 chr1: 157659001-157680800 Lymphocytes RTKN2 chr10: 63983001-64005400 Lymphocytes RP11-799D4.2 chr17: 33505001-33527400 Lymphocytes ETS1 chr11: 128324001-128346600 Lymphocytes . chr1: 160518401-160540200 Lymphocytes BC045668; IL21 chr4: 123529201-123551000 Lymphocytes RP11-61O1.1 chr14: 98655401-98678000 Lymphocytes . chr6: 128289201-128312200 Lymphocytes . chr4: 143305601-143328400 Lymphocytes . chr12: 9938801-9960200 Lymphocytes . chr14: 61791401-61814000 Lymphocytes . chr1: 206746601-206768000 Lymphocytes KLRD1 chr12: 10452401-10473400 Lymphocytes . chr2: 204566201-204588800 Lymphocytes . chr1: 90062201-90084400 Lymphocytes . chr7: 130631401-130652200 Lymphocytes . chr22: 40294801-40317200 Lymphocytes . chr6: 161493201-161514800 Lymphocytes . chr16: 27409001-27431000 Lymphocytes . chr1: 117292801-117314600 Lymphocytes . chr6: 16427801-16449400 Lymphocytes . chr6: 33302001-33323400 Lymphocytes . chr4: 143308401-143331200 Lymphocytes CCL5 chr17: 34196201-34217200 Lymphocytes . chr22: 47061801-47083200 Lymphocytes CTLA4 chr2: 204723801-204744400 Lymphocytes . chr4: 143287401-143310000 Lymphocytes . chr8: 128993601-129014600 Lymphocytes . chr21: 36404601-36426800 Lymphocytes . chr14: 103262801-103285200 Lymphocytes . chr3: 108542801-108564600 Lymphocytes . chr15: 60982201-61004600 Lymphocytes . chr2: 228663201-228685200 Lymphocytes CMC1 chr3: 28324801-28346400 Lymphocytes CYB561A3 chr11: 61112201-61134000 Lymphocytes BC062769 chr2: 197115801-197137400 Lymphocytes . chr1: 90066401-90088800 Lymphocytes USP44 chr12: 95934801-95957200 Lymphocytes SH2D1B chr1: 162369401-162391200 Lymphocytes . chr3: 59982801-60005400 Lymphocytes FANK1 chr10: 127674201-127696600 Lymphocytes . chr2: 106344201-106367000 Lymphocytes ANK3 chr10: 62475401-62498200 Lymphocytes . chr19: 9963401-9985000 Lymphocytes . chr1: 25241001-25262600 Lymphocytes . chr8: 121733601-121754600 Lymphocytes . chr17: 38758401-38780400 Lymphocytes . chr18: 9079201-9101400 Lymphocytes . chr2: 228703401-228725600 Lymphocytes . chr5: 156640201-156661200 Lymphocytes . chr14: 98640001-98662400 Lymphocytes . chr3: 42669401-42691200 Lymphocytes . chr8: 12953 9401-129562400 Lymphocytes . chr17: 61995401-62017200 Lymphocytes AC002480.5 chr7: 22689001-22711600 Lymphocytes . chr7: 50411401-50433000 Lymphocytes CCL5 chr17: 34192801-34214800 Lymphocytes GTF3C1 chr16: 27464001-27486000 Lymphocytes . chr15: 60856401-60879200 Lymphocytes . chr2: 181994801-182017400 Lymphocytes AK128525 chr2: 89149801-89170400 Lymphocytes TRABD2A chr2: 85056801-85079600 Lymphocytes RN7SL328P chr9: 134593001-134614600 Lymphocytes TIGIT chr3: 114003001-114024400 Lymphocytes TCL1A; BX247990 chr14: 96170201-96192800 Lymphocytes TCRDV2 chr14: 22916401-22938600 Lymphocytes . chr16: 27407801-27428600 Lymphocytes GIMAP4 chr7: 150250801-150272800 Lymphocytes . chr5: 156608401-156630800 Lymphocytes . chr21: 26928401-26949000 Lymphocytes . chr2: 157174801-157195600 Lymphocytes KLRD1 chr12: 10448001-10470000 Lymphocytes CLEC2D chr12: 9818401-9840200 Lymphocytes . chr1: 12560601-12581600 Lymphocytes RP11-61O1.1 chr14: 98660401-98681600 Lymphocytes . chr14: 102271201-102293000 Lymphocytes . chr6: 154541401-154564200 Lymphocytes . chr1: 214803401-214825800 Lymphocytes . chr2: 143999001-144021000 Lymphocytes RP11-428G5.5 chr12: 32029401-32050200 Lymphocytes RP1-281H8.3 chr6: 149806001-149828200 Neutrophils SIGLEC5 chr19: 52120201-52141600 Neutrophils MMP9 chr20: 44627401-44649000 Neutrophils CMTM2 chr16: 66611001-66632200 Neutrophils RP1-229K20.5 chr6: 41228801-41249800 Neutrophils PGLYRP1 chr19: 46515801-46536800 Neutrophils . chr3: 128940401-128961200 Neutrophils . chr16: 84800001-84821600 Neutrophils HCAR3 chr12: 123188801-123210200 Neutrophils RP1-229K20.5 chr6: 41227401-41248800 Neutrophils IL1R2 chr2: 102599401-102620000 Neutrophils . chr16: 1978001-1999200 Neutrophils FFAR2 chr19: 35931001-35951800 Neutrophils APOBEC3A chr22: 39338601-39359200 Neutrophils 8-Mar chr10: 45948801-45969800 Neutrophils FFAR2 chr19: 35930201-35950800 Neutrophils . chr1: 180091001-180111800 Neutrophils BTNL8 chr5: 180326601-180347800 Neutrophils . chr16: 84798801-84819400 Neutrophils . chr17: 76402401-76423000 Neutrophils . chr16: 84799401-84820000 Neutrophils . chr17: 79113601-79134400 Neutrophils . chr16: 3090401-3111200 Neutrophils PAM16 chr16: 4385601-4406600 Neutrophils . chr19: 52121601-52142200 Neutrophils PGLYRP1 chr19: 46515201-46535800 Neutrophils AC002511.3 chr19: 35911201-35933000 Neutrophils ARRB2 chr17: 4610401-4631400 Neutrophils BC016366 chr4: 185765601-185787000 Monocytes RPL7AP64 chr17: 7029801-7052600 Monocytes E02193 chr12: 69735601-69758600 Monocytes CD300E chr17: 72608201-72629400 Monocytes CD300E chr17: 72606001-72628200 Monocytes CD300LB chr17: 72516401-72537400 Monocytes CD300C chr17: 72531201-72552200 Monocytes CD300E chr17: 72609401-72631000 Monocytes . chr7: 36746601-36769200 Monocytes CLEC12A chr12: 10115801-10136600 Monocytes SPIDR chr8: 48262801-48285200 Monocytes CLEC12A chr12: 10115201-10135800 Monocytes . chr3: 196407201-196427800 Monocytes CTSS chr1: 150724601-150745200 Monocytes CLEC12A chr12: 10114601-10135200 Leukocytes GNLY chr2: 85912001-85933000 Leukocytes . chr6: 112176801-112199600 Leukocytes PRKCQ-AS1 chr10: 6616201-6637800 Leukocytes PRF1 chr10: 72351601-72372600 Leukocytes RPL7AP64 chr17: 7029801-7052600 Leukocytes SEMA4A chr1: 156114001-156135400 Leukocytes RNU6-933P chr11: 60740601-60763400 Leukocytes . chr21: 26927401-26948400 Leukocytes HLA-DOB chr6: 32774001-32794800 Leukocytes . chr10: 6087801-6110800 Leukocytes KLRK1 chr12: 10531401-10552400 Leukocytes NUP214 chr9: 134093401-134114400 Leukocytes KLRK1 chr12: 10529601-10551400 Leukocytes . chr5: 156641201-156666200 Leukocytes SIGLEC5 chr19: 52120201-52141600 Leukocytes TARP chr7: 38304201-38325800 Leukocytes FASLG chr1: 172618601-172640600 Leukocytes RP4-576H24.4; SIRPB1 chr20: 1557601-1579000 Leukocytes RP11-61O1.1 chr14: 98658001-98680400 Leukocytes . chr10: 6090801-6111800 Leukocytes SYNE1 chr6: 152495001-152516800 Leukocytes AK128525 chr2: 89147801-89169600 Leukocytes . chr1: 206729201-206751200 Leukocytes KLRC4-KLRK1 chr12: 10549001-10571400 Leukocytes CD226 chr18: 67613201-67634600 Leukocytes MMP9 chr20: 44627401-44649000 Leukocytes . chr8: 128978201-128999200 Leukocytes . chr2: 8411201-8432600 Leukocytes KLRK1 chr12: 10527001-10549600 Leukocytes KLRF1 chr12: 9970201-9991600 Leukocytes CD86 chr3: 121764201-121785400 Leukocytes ARHGAP15 chr2: 143995201-144017000 Leukocytes ARRB2 chr17: 4607201-4628800 Leukocytes . chr8: 126949201-126969800 Leukocytes . chr3: 40655601-40676800 Leukocytes LINC01259 chr4: 38501601-38522800 Leukocytes . chr20: 24922601-24943800 Leukocytes MORC2 chr22: 31325201-31347800 Leukocytes E02193 chr12: 69735601-69758600 Leukocytes CMTM2 chr16: 66611001-66632200 Leukocytes CD33 chr19: 51718401-51739400 Leukocytes CCL20 chr2: 228666201-228688600 Leukocytes . chr21: 26931801-26953600 Leukocytes CTLA4 chr2: 204722601-204743400 Leukocytes RP1-229K20.5 chr6: 41228801-41249800 Leukocytes RGS3 chr9: 116269601-116290600 Leukocytes . chr8: 126942201-126967800 Leukocytes . chr6: 90778601-90800000 Leukocytes . chr2: 204707801-204730400 Leukocytes . chr20: 57724601-57745800 Leukocytes SAMD3 chr6: 130522801-130544800 Leukocytes CLEC2D chr12: 9815001-9836000 Leukocytes . chr17: 66206801-66229600 Leukocytes CD300E chr17: 72608201-72629400 Leukocytes MAP3K4 chr6: 161491601-161513200 Leukocytes LIM2 chr19: 51880401-51901800 Leukocytes ICOS chr2: 204788401-204810600 Leukocytes NLRC5 chr16: 57064001-57085400 Leukocytes RNU6-933P chr11: 60744201-60765800 Leukocytes . chr6: 90780001-90801200 Leukocytes SPATA13-AS1 chr13: 24817401-24838800 Leukocytes CMC1 chr3: 28323201-28344800 Leukocytes AX747844 chr12: 47593001-47614600 Leukocytes TNIP3 chr4: 122135801-122158200 Leukocytes . chr12: 94157201-94179600 Leukocytes FPR1 chr19: 52241201-52262600 Leukocytes PGLYRP1 chr19: 46515801-46536800 Leukocytes . chr17: 8846601-8868800 Leukocytes . chr15: 60853601-60876400 Leukocytes . chr8: 128992001-129013600 Leukocytes . chr9: 123672801-123693800 Leukocytes CD300E chr17: 72606001-72628200 Leukocytes RBPJ chr4: 26264601-26286000 Leukocytes . chr9: 123675001-123695800 Leukocytes CD300LB chr17: 72516401-72537400 Leukocytes LTB4R chr14: 24773201-24794600 Leukocytes KLRD1 chr12: 10451601-10472400 Leukocytes . chr3: 128940401-128961200 Leukocytes . chr20: 54993801-55015400 Leukocytes AK291611 chr7: 38364201-38386000 Leukocytes . chr8: 134065601-134087400 Leukocytes CD300C chr17: 72531201-72552200 Leukocytes TLDC1 chr16: 84575801-84597200 Leukocytes . chr2: 228661001-228683200 Leukocytes FYN chr6: 112103401-112125600 Leukocytes PADI4 chr1: 17624801-17646200 Leukocytes CD300E chr17: 72609401-72631000 Leukocytes LILRB1 chr19: 55118401-55140200 Leukocytes ARHGEF11 chr1: 156922801-156943800 Leukocytes . chr21: 26929001-26949600 Leukocytes . chr3: 16333201-16354200 Leukocytes . chr10: 71842201-71863600 Leukocytes PTPN4 chr2: 120677201-120698400 Leukocytes . chr16: 84800001-84821600 Leukocytes CDC14A chr1: 100875801-100898600 Leukocytes . chr4: 90199801-90222200 Leukocytes . chr17: 76902001-76923000 Leukocytes CYTIP chr2: 158293001-158314600 Leukocytes . chr2: 204796001-204817800 Leukocytes TRDC chr14: 22910401-22932800 Leukocytes . chr22: 37605601-37626400 Leukocytes . chr17: 80269601-80291000 Leukocytes SAMD3 chr6: 130525801-130546600 Leukocytes HCAR3 chr12: 123188801-123210200 Leukocytes . chr21: 26933601-26955200 Leukocytes . chr1: 206726201-206747200 Leukocytes . chr3: 58329201-58350200 Leukocytes AK096766 chr7: 38333401-38355000 Leukocytes . chr14: 99492001-99513200 Leukocytes . chr13: 40952601-40973800 Leukocytes RP11-291B21.2 chr12: 10695601-10716800 Leukocytes CTLA4 chr2: 204724801-204745600 Leukocytes RP1-229K20.5 chr6: 41227401-41248800 Leukocytes IL1R2 chr2: 102599401-102620000 Leukocytes PRAM1 chr19: 8556201-8577200 Leukocytes . chr1: 150575601-150597000 Leukocytes SIRPG chr20: 1626801-1647400 Leukocytes . chr15: 60863601-60886200 Leukocytes . chr1: 206728001-206749200 Leukocytes . chr1: 28442001-28463200 Leukocytes FCRL3 chr1: 157659001-157680800 Leukocytes RTKN2 chr10: 63983001-64005400 Leukocytes RP11-799D4.2 chr17: 33505001-33527400 Leukocytes PIK3R1 chr5: 67566601-67587600 Leukocytes ETS1 chr11: 128324001-128346600 Leukocytes . chr19: 18494001-18515200 Leukocytes . chr16: 1978001-1999200 Leukocytes . chr1: 160518401-160540200 Leukocytes . chr2: 235388801-235410200 Leukocytes BC045668; IL21 chr4: 123529201-123551000 Leukocytes RP11-61O1.1 chr14: 98655401-98678000 Leukocytes . chr6: 128289201-128312200 Leukocytes . chr4: 143305601-143328400 Leukocytes FFAR2 chr19: 35931001-35951800 Leukocytes AK056689 chr13: 74795201-74816400 Leukocytes . chr12: 9938801-9960200 Leukocytes . chr14: 61791401-61814000 Leukocytes FPR2 chr19: 52256801-52278200 Leukocytes . chr1: 206746601-206768000 Leukocytes APOBEC3A chr22: 39338601-39359200 Leukocytes . chr2: 69394401-69415600 Leukocytes . chr19: 8623801-8644800 Leukocytes KLRD1 chr12: 10452401-10473400 Leukocytes 8-Mar chr10: 45948801-45969800 Leukocytes FFAR2 chr19: 35930201-35950800 Leukocytes CXCR2 chr2: 218984401-219005400 Leukocytes . chr20: 47366601-47388000 Leukocytes LILRA5 chr19: 54813801-54834600 Leukocytes . chr2: 204566201-204588800 Leukocytes . chr1: 90062201-90084400 Leukocytes . chr17: 75419601-75440800 Leukocytes . chr20: 57728401-57749600 Leukocytes SIRPG chr20: 1625401-1646800 Leukocytes . chr7: 130631401-130652200 Leukocytes . chr22: 40294801-40317200 Leukocytes . chr6: 161493201-161514800 Leukocytes . chr16: 27409001-27431000 Leukocytes . chr1: 180091001-180111800 Leukocytes . chr1: 117292801-117314600 Leukocytes . chr6: 16427801-16449400 Leukocytes . chr5: 156604801-156626600 Leukocytes . chr6: 33302001-33323400 Leukocytes . chr4: 143308401-143331200 Leukocytes BTNL8 chr5: 180326601-180347800 Leukocytes CCL5 chr17: 34196201-34217200 Leukocytes PIK3R1 chr5: 67567801-67588800 Leukocytes PILRB chr7: 99939201-99961000 Leukocytes RP11-326C3.13; chr11: 316201-337800 RP11-326C3.12; IFITM3 Leukocytes 9-Sep chr17: 75443201-75464400 Leukocytes CARS2 chr13: 111307601-111329000 Leukocytes . chr7: 36746601-36769200 Leukocytes . chr1: 111754201-111775600 Leukocytes . chr22: 47061801-47083200 Leukocytes PRKCQ-AS1 chr10: 6615601-6636200 Leukocytes CTLA4 chr2: 204723801-204744400 Leukocytes . chr8: 142234601-142255800 Leukocytes . chr4: 143287401-143310000 Leukocytes . chr2: 20614001-20635000 Leukocytes . chr8: 128993601-129014600 Leukocytes . chr21: 36404601-36426800 Leukocytes GZMM chr19: 534001-554800 Leukocytes . chr14: 103262801-103285200 Leukocytes . chr17: 74238001-74259600 Leukocytes AC092580.4 chr2: 7857401-7879200 Leukocytes . chr19: 4079001-4100000 Leukocytes IL23A chr12: 56722601-56743600 Leukocytes . chr16: 84798801-84819400 Leukocytes . chr3: 108542801-108564600 Leukocytes . chr15: 60982201-61004600 Leukocytes . chr2: 228663201-228685200 Leukocytes CMC1 chr3: 28324801-28346400 Leukocytes . chr17: 76402401-76423000 Leukocytes CYB561A3 chr11: 61112201-61134000 Leukocytes SIRPG chr20: 1627801-1648400 Leukocytes . chr13: 110373601-110394600 Leukocytes . chr16: 84799401-84820000 Leukocytes BC062769 chr2: 197115801-197137400 Leukocytes . chr1: 90066401-90088800 Leukocytes USP44 chr12: 95934801-95957200 Leukocytes SH2D1B chr1: 162369401-162391200 Leukocytes . chr3: 59982801-60005400 Leukocytes RP11-14I17.3 chr8: 26287801-26309200 Leukocytes DCTN4 chr5: 150103601-150124800 Leukocytes . chr17: 79113601-79134400 Leukocytes AC018816.3 chr3: 4862201-4883600 Leukocytes FANK1 chr10: 127674201-127696600 Leukocytes C19orf38 chr19: 10949001-10969800 Leukocytes CLEC12A chr12: 10115801-10136600 Leukocytes LDLRAP1 chr1: 25880601-25901400 Leukocytes . chr2: 106344201-106367000 Leukocytes ANK3 chr10: 62475401-62498200 Leukocytes LINC01259 chr4: 38503001-38525400 Leukocytes . chr16: 3090401-3111200 Leukocytes . chr19: 9963401-9985000 Leukocytes . chr1: 25241001-25262600 Leukocytes . chr8: 121733601-121754600 Leukocytes . chr17: 38758401-38780400 Leukocytes . chr18: 9079201-9101400 Leukocytes . chr2: 228703401-228725600 Leukocytes PAM16 chr16: 4385601-4406600 Leukocytes . chr5: 156640201-156661200 Leukocytes . chr14: 98640001-98662400 Leukocytes FPR2 chr19: 52254001-52275000 Leukocytes . chr8: 126947801-126968800 Leukocytes . chr3: 42669401-42691200 Leukocytes . chr8: 129539401-129562400 Leukocytes SPIDR chr8: 48262801-48285200 Leukocytes CLEC12A chr12: 10115201-10135800 Leukocytes . chr17: 61995401-62017200 Leukocytes AC002480.5 chr7: 22689001-22711600 Leukocytes . chr12: 105066001-105087000 Leukocytes AK057187 chr2: 38044801-38066400 Leukocytes . chr7: 50411401-50433000 Leukocytes CCL5 chr17: 34192801-34214800 Leukocytes GTF3C1 chr16: 27464001-27486000 Leukocytes THEMIS chr6: 128228601-128249600 Leukocytes NLRC5 chr16: 57063001-57084000 Leukocytes . chr7: 142614001-142635600 Leukocytes . chr19: 52121601-52142200 Leukocytes . chr15: 60856401-60879200 Leukocytes . chr5: 66478801-66499800 Leukocytes RP11-127L20.3 chr10: 106073401-106094600 Leukocytes . chr2: 181994801-182017400 Leukocytes AK128525 chr2: 89149801-89170400 Leukocytes TRABD2A chr2: 85056801-85079600 Leukocytes RN7SL328P chr9: 134593001-134614600 Leukocytes PRKCQ-AS1 chr10: 6617801-6639400 Leukocytes C5AR2 chr19: 47830001-47851000 Leukocytes TIGIT chr3: 114003001-114024400 Leukocytes TCL1A; BX247990 chr14: 96170201-96192800 Leukocytes TCRDV2 chr14: 22916401-22938600 Leukocytes . chr1: 42194201-42215600 Leukocytes IRF4 chr6: 384001-404600 Leukocytes . chr16: 27407801-27428600 Leukocytes GIMAP4 chr7: 150250801-150272800 Leukocytes . chr5: 156608401-156630800 Leukocytes . chr17: 76903801-76924800 Leukocytes . chr16: 84621001-84642000 Leukocytes . chr8: 142120401-142141800 Leukocytes DNAJC19P3 chr19: 42134001-42155800 Leukocytes PGLYRP1 chr19: 46515201-46535800 Leukocytes . chr21: 26928401-26949000 Leukocytes . chr2: 157174801-157195600 Leukocytes KLRD1 chr12: 10448001-10470000 Leukocytes CLEC2D chr12: 9818401-9840200 Leukocytes . chr3: 71818801-71839800 Leukocytes . chr1: 12560601-12581600 Leukocytes AC002511.3 chr19: 35911201-35933000 Leukocytes . chr5: 118665601-118687000 Leukocytes . chr3: 196407201-196427800 Leukocytes ARRB2 chr17: 4610401-4631400 Leukocytes RP11-61O1.1 chr14: 98660401-98681600 Leukocytes . chr1: 244377801-244398600 Leukocytes . chr14: 102271201-102293000 Leukocytes . chr6: 154541401-154564200 Leukocytes . chr1: 214803401-214825800 Leukocytes . chr2: 143999001-144021000 Leukocytes . chr6: 24917801-24939400 Leukocytes CTSS chr1: 150724601-150745200 Leukocytes RP11-428G5.5 chr12: 32029401-32050200 Leukocytes OSCAR chr19: 54593601-54614200 Leukocytes CLEC12A chr12: 10114601-10135200 Leukocytes BC016366 chr4: 185765601-185787000 Leukocytes RP1-281H8.3 chr6: 149806001-149828200 HSC . chr2: 124658401-124680600 HSC MLLT3 chr9: 20370801-20392400 HSC ATP8B4 chr15: 50397801-50420200 HSC MLLT3 chr9: 20369001-20390400 HSC BC016361 chr4: 75184201-75206000 HSC AP001171.1 chr21: 21618801-21640600 HSC . chr6: 142691401-142714400 HSC . chr2: 124656201-124678400 HSC GFI1B chr9: 135844201-135865200 HSC RP11-598F7.3 chr12: 207601-230400 HSC RP13-786C16.1 chr11: 33891201-33913400 HSC . chr2: 16648401-16670400 HSC . chr21: 16803001-16825200 HSC HEMGN chr9: 100687801-100710000 HSC . chr6: 142694401-142717400 Kidney CLDN10 chr13: 96074201-96097200 Kidney . chr17: 26939601-26964600 Kidney . chr4: 22705601-22730600 Kidney SLC22A8 chr11: 62771001-62793000 Kidney . chr10: 17154801-17179000 Kidney SLC13A3 chr20: 45268401-45289600 Kidney . chr2: 209398201-209419000 Kidney . chr11: 128724001-128745800 Skin . chr6: 106932801-106955800 Skin . chr6: 143644001-143666400 Skin . chr2: 239753801-239774600 Skin . chr7: 19131001-19153200 Skin . chr11: 71852001-71874000 Skin AC003986.6 chr7: 19138401-19160400 Skin . chr2: 239754601-239776200 Skin GSDMC chr8: 130787001-130808400 - In some embodiments, the sequencing data from a subject is deconvoluted by comparison to the DNA-protein association atlas. In this way the percent contribution of different tissues, cell types and/or cellular states to the total cfnucleosomes in a sample can be determined. In some embodiments, the deconvolution gives the percent contribution of only informative cfnucleosomes.
- In some embodiments, the cfDNA and cf-nucleosomes are not analyzed by comparison to healthy tissue data but rather are analyzed by machine learning. Machine learning is well known in the art, and by performing the methods of the invention of patients with known conditions the machine learning algorithm can learn to recognize specific disease states and conditions in the cfDNA sequences provided when specific DNA-associated proteins are isolated. In some embodiments, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 subjects with a specific condition are analyzed before the algorithm can identify that condition in a new subject.
- As used herein, the term “cellular state” refers to a condition or cellular response or pathway that is active in a cell. In some embodiments, the cellular state is the cellular condition that led to and/or caused cell death. In some embodiments, the cellular state was the state of the cell just before it died but is not directly the cause of that death. In some embodiments, the cellular state is a pathway that was active or inactive in the cell when it died. In some embodiments, the cellular state is a cellular response that was active or inactive in the cell when it died. In some embodiments, the cellular state comprises expression of at least one gene informative of the cause of cell death. In some embodiments, the cellular state is a disease state. In some embodiments, the cellular state comprises expression of at least one gene informative of an active pathway in the cell when it died. In some embodiments, at least 1, 2, 3, 4, or 5 genes from a pathway is indicative of an active pathway. Each possibility represents a separate embodiment of the invention. Signaling pathways are well known in the art and online resources for determining the members of various pathways can be found by examining Gene Ontology or Thermo Fisher Scientific for example.
- In some embodiments, determining a cellular state comprises determining a cellular pathway active in the cell. In some embodiments, determining a cellular state comprises determining a transcriptional program active in the cell. In some embodiments, determining a cellular state comprises determining active transcription of at least one gene informative of an active pathway. In some embodiments, at least 1, 2, 3, 4, or 5 genes from a pathway are determined. Each possibility represents a separate embodiment of the invention. In some embodiments, determining a cellular state comprises determining association of the DNA-associated protein to at least 1 genomic region that regulates a gene of the pathway. In some embodiments, the cellular state is any one of hypoxia, inflammation, ER stress, mitochondrial stress, quiescence, senescence, interferon response, cycling, malignant, and calcium flux. Any cellular state which can be defined by the expression of a gene or set of genes can be investigated by the methods of the invention.
- In some embodiments, the methods of the invention further comprise comparing the sequenced cfDNA to at least 1 genomic location with the greatest association of the DNA-associated protein during activation of a cellular pathway, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location indicates activation of the cellular pathway. In some embodiments, the genomic locations with the greatest unique association are compared.
- In some embodiments, the methods of the invention further comprise comparing the sequenced cfDNA to a pathway atlas of at least 2 cellular pathways, wherein the atlas comprises at least 1 genomic location with the greatest association of the DNA-associated protein in each of the 2 cellular pathways, and wherein a cfDNA with a sequence that is the same as a DNA sequence within the at least 1 genomic location indicates activation of that cellular pathway.
- In some embodiments, the sequenced cfDNA is compared to at least 1, 2, 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or 100 genomic locations. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequenced cfDNA is compared to at least 10 genomic locations. In some embodiments, the sequenced cfDNA is compared to at least 25 genomic locations. In some embodiments the designating comprises comparing the sequenced DNA to a given number of genomic locations.
- In some embodiments, the atlas is of at least 1, 2, 3, 5, 10, 10, 15, 20, 25, 30, 35, 40, 45 or 50 cell types and/or tissues. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas is of at least 1, 2, 3, 5, 10, 10, 15, 20, 25, 30, 35, 40, 45 or 50 cellular pathways. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas comprises at least 1, 2, 3, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, or 100 genomic locations. Each possibility represents a separate embodiment of the invention. In some embodiments, the atlas comprises at least 2 genomic locations. In some embodiments, the atlas comprises at least 10 genomic locations. In some embodiments, the atlas comprises at least 25 genomic locations.
- In some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is within a tissue, cell type or cellular state specific gene, enhancer element or is at a disease specific mutation. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is within a tissue, cell type or cellular state specific gene, enhancer element. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the genomic location is at a disease-specific mutation. In some embodiments, the DNA-associated protein is a marker of active transcription and association of the DNA-associated protein with the disease associated mutation is indicative of the disease state. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of active transcription and the disease-associated mutation is within an oncogene. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is within a repressor element or a gene silenced in the tissue, cell type or cellular state or is at a disease specific mutation. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is within a repressor element or a gene silenced in the tissue, cell type or cellular state. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the genomic location is at a disease-specific mutation. In some embodiments, association of the DNA-associated protein with the genomic location is indicative of silenced transcription and the disease-associated mutation is within a tumor suppressor gene. Oncogenes and tumor suppressor genes are well known in the art. Examples of oncogenes include, but are not limited to, WNT, RAS, MYC and ERK. Examples of tumor suppressor genes include, but are not limited to, p53, PTCH, NF1, p27Kip1, and APC.
- As used herein, “cfDNA” refers to any DNA obtained from an organism which existed in the organism outside of a cell. As used herein, “cfnucleosome” refers to cfDNA and any proteins bound and/or associated with the cfDNA. In some embodiments, cfnucleosomes comprises cfDNA and cfhistones. In some embodiments, the cfDNA is associated with DNA-associated protein. In some embodiments, the cfDNA is not naked. In some embodiments, the cfDNA is in the sample as cfnucleosomes. In some embodiments, the cfDNA is not crosslinked. In some embodiments, the methods of the invention further comprise crosslinking the cfDNA and DNA-associated proteins prior to the contacting. In some embodiments, the methods do not comprise crosslinking the cfDNA and DNA-associated proteins prior to the contacting.
- In some embodiments, the cfDNA is DNA obtained from an organism and existed in the organism outside of any vesicle. Cell-free DNA is well known in the art, and generally refers to DNA that is free floating within a bodily fluid. This DNA is generally not enclosed in a vesicle and thus DNA in transport, such as by exosomes or other vesicular transporters, in not considered cfDNA. In some embodiments, cfDNA is DNA from a dying and/or dead cell. When a cell dies the DNA is generally fragmented and released from the cell as it lyses. This DNA however, is not all immediately removed or cleaned up and thus persists in the organism. Frequently the DNA from the dead cell enters the bloodstream. In some embodiments, vesicular chromatin that is lysed is also included in the cfDNA.
- In some embodiments, the cfDNA is mammalian cfDNA. In some embodiments, the cfDNA is human cfDNA. In some embodiments, the cfDNA is from a mammalian or human genome. In some embodiments, the cfDNA is fetal DNA. In some embodiments, the DNA is viral DNA. In some embodiments, the DNA is bacterial DNA. In some embodiments, the DNA is fungal DNA. In some embodiments, the DNA parasitic DNA. In some embodiments, the DNA if from a pathogen. In some embodiments, the DNA is from an organism that lives in a healthy subject. In some embodiments, the cfDNA is extracted from bodily fluid. In some embodiments, the providing comprises providing a bodily fluid and isolating the cfDNA from the bodily fluid. In some embodiments, the providing comprises providing a bodily fluid comprising cfDNA and performing the contacting in the bodily fluid. In some embodiments, the methods of the invention are performed on as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 1.5, 2, 2.5 or 3 ml of bodily fluid. In some embodiments, the methods of the invention are performed on as little as 2 ml of bodily fluid. Each possibility represents a separate embodiment of the invention. In some embodiments, the methods of the invention are performed with less than 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, or 5 ml of bodily fluid. Each possibility represents a separate embodiment of the invention. In some embodiments, the bodily fluid is blood.
- In some embodiments, the cfDNA is fetal cell free DNA (cffDNA). In some embodiments, the method is for non-invasive fetal monitoring. In some embodiments, the subject is the mother of the fetus. In some embodiments, the method is for determining a cellular state of a cell in a fetus. In some embodiments, the method is for determining a disease in a fetus. In some embodiments, the method is for determining a genetic abnormality in a fetus. In some embodiments, the method is for determining the origin of cell death in a fetus.
- Since cfDNA has a short half-life in an organism, it provides a snapshot of the cell death occurring in the organism at that moment. In some embodiments, the methods of the invention detect cell death that has occurred within the last 1 minute, 2 minutes, 3 minutes, 4 minutes, 5 minutes, 10 minutes, 15 minutes, 20 minutes, 25 minutes, 30 minutes, 35 minutes, 40 minutes, 45 minutes, 50 minutes, 55 minutes, 1
hour 2 hours, 3 hours, 6 hours, 12 hours, 18 hours, 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 1week 2 weeks, 3 weeks or a month from the time the sample was taken from the subject. Each possibility represents a separate embodiment of the invention. In some embodiments, the methods of the invention detect cell death that occurred immediately before taking the sample from the subject. In some embodiments, the methods of the invention further comprise extracting a sample from the subject before the providing, wherein the sample comprises cfDNA. In some embodiments, the methods of the invention further comprise freezing or keeping at about 4 degrees the sample after it is taken from the subject and before it is contacted. By freezing or keeping the sample cold the methods of the invention can still detect the cell death that occurred just before the sample was taken from the subject. - In some embodiments, at least 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 ng of cfDNA are provided. Each possibility represents a separate embodiment of the invention. In some embodiments, as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 ng of cfDNA are provided. Each possibility represents a separate embodiment of the invention. In some embodiments, at least 50 ng are provided. In some embodiments, as little as 50 ng are provided. In some embodiments, at least 7 ng are provided. In some embodiments, as little as 7 ng are provided. In some embodiments, at least 0.5 ng are provided. In some embodiments, as little as 0.5 ng are provided. In some embodiments, the provided cfDNA is between 0.1 and 1000, 0.1 and 900, 0.1 and 800, 0.1 and 700, 0.1 and 600, 0.1 and 500, 0.1 and 400, 0.1 and 300, 0.1 and 250, 0.1 and 200, 0.1 and 150, 0.1 and 100, 0.1. and 90, 0.1 and 80, 0.1 and 70, 0.1 and 60, 0.1 and 50, 0.1 and 40, 0.1 and 30 or 0.1 and 20 ng, 0.1 and 10 ng, 0.1 and 5 ng, 0.1 and 1 ng, 0.5 and 1000, 0.5 and 900, 0.5 and 800, 0.5 and 700, 0.5 and 600, 0.5 and 500, 0.5 and 400, 0.5 and 300, 0.5 and 250, 05 and 200, 0.5 and 150, 0.5 and 100, 0.5. and 90, 0.5 and 80, 0.5 and 70, 0.5 and 60, 0.5 and 50, 0.5 and 40, 0.5 and 30 or 0.5 and 20 ng, 0.1 and 10 ng, 0.5 and 5 ng, 0.5 and 1 ng, 1 and 1000, 1 and 900, 1 and 800, 1 and 700, 1 and 600, 1 and 500, 1 and 400, 1 and 300, 1 and 250, 1 and 200, 1 and 150, 1 and 100, 1 and 90, 1 and 80, 1 and 70, 1 and 60, 1 and 50, 1 and 40, 1 and 30 or 1 and 20 ng, 10 and 1000, 10 and 900, 10 and 800, 10 and 700, 10 and 600, 10 and 500, 10 and 400, 10 and 300, 10 and 250, 10 and 200, 10 and 150, 10 and 100, 10 and 90, 10 and 80, 10 and 70, 10 and 60, 10 and 50, 10 and 40, 10 and 30 or 10 and 20 ng. Each possibility represents a separate embodiment of the invention. In some embodiments, at most 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 400, 500, 600, 700, 800, 900 or 1000 ng of cfDNA are provided. Each possibility represents a separate embodiment of the invention. 1000 genomes are roughly equivalent to 6.6 ng of cfDNA.
- In some embodiments, the cfDNA comprises at least 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 5, 10, 50, 100, 200, 300, 500, 700, 800, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10000 genomes. Each possibility represents a separate embodiment of the invention. In some embodiments, the cfDNA comprises as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 5, 10, 50, 100, 200, 300, 500, 700, 800, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10000 genomes. Each possibility represents a separate embodiment of the invention. In some embodiments, the cfDNA comprises between 0.1 and 10000, 0.1 and 9000, 0.1 and 8000, 0.1 and 7000, 0.1 and 6000, 0.1 and 5000, 0.1 and 4000, 0.1 and 3000, 0.1 and 2000, 0.1 and 1000, 1 and 10000, 1 and 9000, 1 and 8000, 1 and 7000, 1 and 6000, 1 and 5000, 1 and 4000, 1 and 3000, 1 and 2000, 1 and 1000, 5 and 10000, 5 and 9000, 5 and 8000, 5 and 7000, 5 and 6000, 5 and 5000, 5 and 4000, 5 and 3000, 5 and 2000, 5 and 1000, 10 and 10000, 10 and 9000, 10 and 8000, 10 and 7000, 10 and 6000, 10 and 5000, 10 and 4000, 10 and 3000, 10 and 2000, 10 and 1000, 100 and 10000, 100 and 9000, 100 and 8000, 100 and 7000, 100 and 6000, 100 and 5000, 100 and 4000, 100 and 3000, 100 and 2000, 100 and 1000, 500 and 10000, 500 and 9000, 500 and 8000, 500 and 7000, 500 and 6000, 500 and 5000, 500 and 4000, 500 and 3000, 500 and 2000, 500 and 1000, 1000 and 10000, 1000 and 9000, 1000 and 8000, 1000 and 7000, 1000 and 6000, 1000 and 5000, 1000 and 4000, 1000 and 3000, 1000 and 2000 genomes. Each possibility represents a separate embodiment of the invention.
- In some embodiments, as little as 0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.2, 0.3, 0.5, 1, 2, 3, 4, 5, or 10% of the cfDNA is from the cell type, tissue or cell in the cellular state. Each possibility represents a separate embodiment of the invention. In some embodiments, as little as 0.1% of the cfDNA is from the cell type, tissue or cell in the cellular state. In some embodiments, as little as 1% of the cfDNA is from the cell type, tissue or cell in the cellular state. In some embodiments, the limit of detection of the method is 0.1% of the cfDNA in the sample being from the cell type, the tissue, or the cellular state. In some embodiments, the limit of detection of the method is 1% of the cfDNA in the sample being from the cell type, the tissue, or the cellular state. In some embodiments, as little as 0.1% of the cfDNA is from the cell type, the tissue, or the cellular state and at least 45 peaks corresponding to the cell type, tissue or cellular state are detected. In some embodiments, as little as 0.1% of the cfDNA is from the cell type, the tissue, or the cellular state and at least 45-200 peaks corresponding to the cell type, tissue or cellular state are detected. In some embodiments, as little as 1% of the cfDNA is from the cell type, the tissue, or the cellular state and at least 25 peaks corresponding to the cell type, tissue or cellular state are detected. In some embodiments, analyzing at least 25 peaks provides a limit of detection of 1% of the cfDNA being from the cell type, the tissue, or the cellular state. In some embodiments, analyzing at least 45 peaks provides a limit of detection of 0.1% of the cfDNA being from the cell type, the tissue, or the cellular state. In some embodiments, analyzing a number of peaks comprises detecting cfDNA from at least that number of peaks. In some embodiments, the cfDNA comprises between 0.001-10, 0.001-5, 0.001-3, 0.001-2, 0.001-1.5, 0.001-1, 0.01-10, 0.01-5, 0.01-3, 0.01-2, 0.01-1.5, 0.01-1, 0.1-10, 0.1-5, 0.1-3, 0.1-2, 0.1-1.5, 0.1-1, 0.5-10, 0.5-5, 0.5-3, 0.5-2, 0.5-1.5 0.5-1, 1-10, 1-5, 1-3, or 1-2% cfDNA from the cell type, the tissue, or the cellular state. Each possibility represents a separate embodiment of the invention. In some embodiments, the cfDNA comprises between 0.1-1% cfDNA from the cell type, the tissue, or the cellular state. In some embodiments, the cfDNA comprises between 0.1-3% cfDNA from the cell type, the tissue, or the cellular state.
- In some embodiments, the sequencing is at a low depth. In some embodiments, the sequencing is at a depth of less than 1 billion, 750 million, 500 million, 400 million, 300 million, 200 million, 100 million, 90 million, 80 million, 70 million, 60 million, 50 million, 40 million, 30 million, 20 million, 10 million, 9 million, 8 million, 7 million, 6 million, 5 million, 4 million, 3 million, 2 million, 1 million, 0.5 million, 0.1 million, 0.05 million, 0.01 million, 0.005 million or 0.001 million reads. Each possibility represents a separate embodiment of the invention. In some embodiments, the sequencing is at a depth of less than 10 million reads. In some embodiments, the sequencing is at a depth of less than 1 million reads. It will be understood by a skilled artisan that as the amount of information increases the limit of detection goes down. Further, as the amount of input data is increased (increasing the number of reagents, i.e. antibodies used for ChIP, increasing the number of informative loci for a given cell type/issue/state, increasing the amount of cfDNA from the cell type/issue/state) so too the required sequencing depth is decreased, and the limit of detection is decreased.
- In some embodiments, the providing comprises providing a bodily fluid comprising the cfDNA. In some embodiments, the contacting occurs in the bodily fluid. In some embodiments, the contacting comprises providing a bodily fluid and isolating the cfDNA from the bodily fluid. In some embodiments, the bodily fluid is selected from: blood, serum, gastric fluid, intestinal fluid, saliva, bile, tumor fluid, interstitial fluid, breast milk, cerebrospinal fluid, urine, semen, vaginal fluid, and stool. In some embodiments, the bodily fluid is any bodily fluid that contains cfDNA. In some embodiments, the bodily fluid is blood. In some embodiments, the bodily fluid is any one of whole blood, partially lysed whole blood, plasma, or partially processed whole blood.
- The sample of blood can be obtained by standard techniques, such as using a needle and syringe. In another embodiment, the blood sample is a peripheral blood sample. Alternatively, the blood sample can be a fractionated portion of peripheral blood, such as a plasma sample. In another embodiment, once the blood sample is obtained, total DNA can be extracted from the sample using standard techniques known to one skilled in the art. In some embodiments, intact cells are removed before DNA extraction, so that only free-floating DNA is extracted. Intact cells can be removed by any method known in the art, such as for non-limiting example by centrifugation or by gradient separation, such as by Ficol gradient separation. A non-limiting example for DNA extraction is the FlexiGene DNA kit (QIAGEN). Standard techniques for receiving cell-free DNA extraction are known to a skilled artisan, a non-limiting example of which is the QIAamp Circulating Nucleic Acid kit (QIAGEN).
- In some embodiments, the sequencing is next generation sequencing. Next generation sequencing, also known as high-throughput sequencing or massively parallel sequencing, is any sequencing method that allows for rapid high-throughput sequencing of base pairs from DNA or RNA samples. In some embodiments, the sequencing is high-throughput sequencing. In some embodiments, the sequencing is massively parallel sequencing. Such sequencing is well known in the art and can include the use of Illumina arrays, pore and nanopore sequencers and ion torrent as non-limiting examples. Sequencing machines such as the
Illumina Nextseq 500 machine may be used, and processing may be performed using theIllumina 500/550 V2 kit, for non-limiting example. In some embodiments, the sequencing is whole genome sequencing. In some embodiments, only a portion of the genome is sequenced. In some embodiments, a chip or array pertaining to only a portion of the genome is used for next generation sequencing. - In some embodiments, the sequencing is methylation sensitive sequencing. In some embodiments, the methods of the invention further comprise bisulfite conversion before sequencing. In performing sequencing, the methylation status of the DNA can also be discerned, in this way the protein-DNA association data can also be combined with DNA methylation data. This can provide further information about the activity of genes in the cell as it died, which can provide insight into the cell or tissue of origin or the cellular state of the cell.
- In some embodiments the methods of the invention can be used to determine the origin of cfDNA even when the cfDNA from one tissue/cell types is a very small percentage of the whole cfDNA. In some embodiments, the cfDNA of a tissue and/or cell type comprises as little 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1% 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA. Each possibility represents a separate embodiment of the invention. In some embodiments, the cfDNA of a tissue and/or cell type comprises more than 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1% 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA. Each possibility represents a separate embodiment of the invention. In some embodiments, the cfDNA of a tissue and/or cell type comprises less than 0.0001%, 0.0005%, 0.001%, 0.005%, 0.01%, 0.05%, 0.1%, 0.5%, 1%, 1.5%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9% or 10% of all of the cfDNA. Each possibility represents a separate embodiment of the invention. In some embodiments, the cfDNA of a tissue and/or cell type comprises between 0.0001%-10%, 0.001%-10%, 0.01%-10%, 0.1%-10%, 0.5%-10%, 1%-10%, 1.5%-10%, 2%-10%, 0.0001%-9%, 0.001%-9%, 0.01%-9%, 0.1%-9%, 0.5%-9%, 1%-9%, 1.5%-9%, 2%-9%, 0.0001%-8%, 0.001%-8%, 0.01%-8%, 0.1%-8%, 0.5%-8%, 1%-8%, 1.5%-8%, 2%-8%, 0.0001%-7%, 0.001%-7%, 0.01%-7%, 0.1%-7%, 0.5%-7%, 1%-7%, 1.5%-7%, 2%-7%, 0.0001%-6%, 0.001%-6%, 0.01%-6%, 0.1%-6%, 0.5%-6%, 1%-6%, 1.5%-6%, 2%-6%, 0.0001%-5%, 0.001%-5%, 0.01%-5%, 0.1%-5%, 0.5%-5%, 1%-5%, 1.5%-5%, 2%-5%, 0.0001%-4%, 0.001%-4%, 0.01%-4%, 0.1%-4%, 0.5%-4%, 1%-4%, 1.5%-4%, 2%-4%, 0.0001%-3%, 0.001%-3%, 0.01%-3%, 0.1%-3%, 0.5%-3%, 1%-3%, 1.5%-3%, 2%-3%, 0.0001%-2%, 0.001%-2%, 0.01%-2%, 0.1%-2%, 0.5%-2%, 1%-2%, 1.5%-2%, 0.0001%-1%, 0.001%-1%, 0.01%-1%, 0.1%-1.5%, 0.5%-1.5%, 0.1%-1%, 0.5%-1%, 0.0001%-0.1%, 0.001%-0.1%, or 0.0001%-0.001% of all of the cfDNA. Each possibility represents a separate embodiment of the invention.
- In some embodiments, the contacting is incubating the reagent in the bodily fluid comprising the cfDNA. In some embodiments, the contacting is incubating the reagent in blood comprising the cfDNA. In some embodiments, the contacting is incubating the reagent and cfDNA in a binding/incubating solution. Buffers for performing ChIP, and specifically incubating buffers are well known in the art. Such buffers may be purchased from companies such as Abcam and Cell Signaling Technology who cell ChIP kits.
- In some embodiments, the contacting is performed with constant mixing. In some embodiments, the contacting is done with constant rotation. In some embodiments, the contacting is done at room temperature or at 4 degrees. In some embodiments, the contacting is done on ice. In some embodiments, the contacting is for at least 1, 2, 3, 4, 5, 6, 12, 18, or 24 hours. Each possibility represents a separate embodiment of the invention. In some embodiments, the contacting is for a time sufficient for the reagent to bind to the DNA-associated protein. In some embodiments, the contacting is for a time sufficient for the reagent to bind at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97%, or 99% of the DNA-associated protein provided. Each possibility represents a separate embodiment of the invention.
- In some embodiments, the methods of the invention are for use in detecting a disease state or condition in a subject in need thereof and wherein the cfDNA is from the subject. In some embodiments, the methods of the invention are for diagnosing a disease, and/or condition in a subject in need thereof and wherein the cfDNA is from the subject. In some embodiments, the methods of the invention are for diagnosing an increased risk of a disease or condition. A skilled artisan will recognize that many if not all disease states induce cell death in the tissue or cells in which the disease is manifest. As such knowledge of the origin of cell death be a surrogate for the disease. In some embodiments, the disease state or condition is selected from cardiac disease or damage and liver disease or damage. In some embodiments, the disease state or condition is selected from cardiac disease or damage, liver disease or damage, and cancer. In some embodiments, the disease state is cancer. In some embodiments, the disease state is a pre-cancerous state. In some embodiments, the disease state is cancer or a pre-cancerous state. In some embodiments, the disease state or condition is selected from cardiac arrest and liver shock. In some embodiments, the disease state is brain damage. In some embodiments, the disease state is bacteremia. In some embodiments, the disease state is an infection. In some embodiments, the disease state or condition is selected from cancer, neurodegenerative disease, infection, tissue damage, inflammation, autoimmune disease, arthritis, liver inflammation, bowel inflammation, autoimmune disease, bacteremia, tissue damage from drug side effects, tissue necrosis, and diabetes. In some embodiments, the neurodegenerative disease is Parkinson's disease or Alzheimer's disease. In some embodiments, the autoimmune disease is lupus or multiple sclerosis. In some embodiments, the disease is cancer and the methods of the invention determine the cell or tissue of origin of the cancer. It will be well understood by a person of skill in the art, that association of proteins that indicate active transcription and sequences of cfDNA from oncogenes is indicative of a cancerous or pre-cancerous state. Further, association of proteins that indicate transcriptional silencing and sequences of cfDNA from tumor suppressors is also indicative of a cancerous or pre-cancerous state. Similarly, activation or repression of enhancer regions for oncogenes and tumor suppressors respectively is also indicative of a cancerous or pre-cancerous state.
- In some embodiments, the methods of the invention further comprise performing steps a-d again using a reagent that binds to a second DNA-associated protein, and wherein the second DNA-associated protein is a different protein than the DNA associated protein already used. In some embodiments, the second DNA-associated protein is different than the first DNA-associated protein. In some embodiments, the methods of the invention can be repeated at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 times wherein each time a different DNA-associated protein is bound. Each possibility represents a separate embodiment of the invention.
- In some embodiments, the methods of the invention comprise contacting the sample with at least 2 reagents, wherein each reagent is bound to a physical support and the support comprises a short DNA tag unique to each reagent, wherein upon sequencing the isolated cfDNA the short DNA tag identifies the reagent that isolated the cfDNA. In some embodiments, the DNA tag is ligated to a cfDNA molecule before sequencing. In some embodiments, the support is a bead conjugated to a single reagent and a short DNA tag that is unique to the reagent. In some embodiments, the reagent is an antibody and the short DNA tag is a DNA barcode. This can, for example be paramagnetic beads covalently bound to a ChIP antibody such as H3K4me1 and bound to a barcode for identifying H3K4me1 associated DNA, and other paramagnetic beads covalently bound to a ChIP antibody such as H3K4me3 and bound to a barcode identifying H3K4me3 associated DNA which are added to the sample simultaneously, ligated to cfDNA simultaneously and sequenced simultaneously.
- In some embodiments, the method further comprises treating the subject. In some embodiments, the treatment is for the detected disease. In some embodiments, the treatment is a suitable treatment based on the cellular state, tissue of origin, cell type of a combination thereof of the cell that died in the subject. It will be understood by a skilled artisan that if for example a cancer is found in a particular organ that therapy can be tailored to that type of cancer. Similarly, in a particular pathway is active in a cancer or disease, then one treatment modality may be more suitable than another. For example, detection of active transcription of the long non-coding RNA EGFR-AS1 that mediates cancer addiction to EGFR and when highly expressed can render tumors insensitive to EGFR inhibition by anti EGFR antibody treatment, would indicate that EGFR inhibitory treatments should be avoided.
- By another aspect, there is provided a method of detecting a disease state in a subject, the method comprising:
-
- a. providing a sample from the subject, wherein the sample comprises cfDNA;
- b. contacting the sample with at least one reagent that binds to a DNA-associated protein;
- c. isolating the reagent and any thereto bound proteins and cfDNA;
- d. sequencing the isolated cfDNA; and
- e. designating a cfDNA molecule comprising a disease-associated mutation as originating from a cell in the disease state;
thereby detecting a disease state in a subject.
- By another aspect, there is provided a method for improving disease detection in cfDNA from a subject, the method comprising performing chromatin immunoprecipitation on the cfDNA from the subject before performing the disease detection in the immunoprecipitated cfDNA.
- In some embodiments, the chromatin immunoprecipitation comprises:
-
- a. contacting the cfDNA from the subject with at least one reagent that binds to a DNA-associated protein; and
- b. isolating the reagent and any thereto bound proteins and cfDNA.
- In some embodiments, disease detection comprises sequencing of the cfDNA. In some embodiments, disease detection comprises sequencing of the cfDNA from the subject. In some embodiments, disease detection comprises sequencing of the immunoprecipitated cfDNA. In some embodiments, disease detection and/or sequencing comprises amplification of the cfDNA. In some embodiments, disease detection and/or sequencing does not comprise amplification of the cfDNA. In some embodiments, disease detection in cfDNA from the subject comprises amplification of the cfDNA.
- In some embodiments, non-improved disease detection comprises amplification of the cfDNA. In some embodiments, disease detection in the immunoprecipitated cfDNA does not comprise amplification of the immunoprecipitated cfDNA. In some embodiments, improved disease detection does not comprise amplification of the immunoprecipitated cfDNA. In some embodiments, the amplification is PCR amplification. In some embodiments, the amplification is non-specific amplification. In some embodiments, the amplification is amplification of a disease-associated sequence.
- As used herein, “disease-associated mutation” refers to a DNA mutation that is known to cause or increase the risk of developing the disease. Disease-associated mutations are well known, and include for example, deletion of 1522A, 1523T and 1524C (deletion of F508) of CFTR in cystic fibrosis, 1226A to G (N370S) in the GBA locus in Gaucher disease, mutations in SERPINA1 in alpha1-antitrypsin deficiency, mutations in HBB in Beta-thalassemia, and mutations in PSEN1 in Alzheimer's disease. Many disease-associated mutations are known in cancer, some are common to many types of cancer and some are specific to specific cancers. Mutations in p53, MYC, BREF, BRCA, to name but a few, are well known in the art. Panels of disease associated mutations can also be investigated. In some embodiments, at least 1, 2, 3, 5, 7, 10, 12, 15, 17, or 10 mutations are investigated. Each possibility represents a separate embodiment of the invention. In some embodiments, instead of sequencing, PCR with mutation specific-primers is employed. Any method of detecting DNA mutations may be employed in place of sequencing; however, sequencing provides the advantage of checking multiple mutations simultaneously, including a panel of mutations.
- In some embodiments, association of the DNA-associated protein with the disease-associated mutation is indicative of the disease state. It will be well understood by a person of skill in the art, that association of proteins that indicate active transcription mutations in the coding region of a gene would indicate the mutant gene is being transcribed and would be indicative of a cancerous or pre-cancerous state. Similarly, mutations in a regulatory region would be associated with a protein that is indicative of that regulatory region and thus would also be indicative of a cancerous or pre-cancerous state.
- In some embodiments, the disease-associated mutation is in a coding region of a gene and association of the DNA-associated protein with DNA is indicative of active transcription. In some embodiments, the gene is an oncogene or a tumor suppressor gene. In some embodiments, the disease-associated mutation is in a regulatory region, and association of the DNA-associated protein with DNA is indicative of the regulatory region.
- In some embodiments, the method of the invention further comprises performing steps b-e again using a reagent that binds to a second DNA-associated protein, and wherein the second DNA-associated protein is different from the first DNA-associated protein. If multiple mutations are to be investigated and they are located in different genomic regions (gene body and an enhancer for instance), the ChIP can be repeated for a different DNA-associated protein.
- Immunoprecipitation of only a portion of the cfDNA (such as enrichment for gene body sequences of actively transcribed genes by H3K36me3 immunoprecipitation) greatly enhances the concentration of informative DNA. As a result, sequencing can be performed much more cheaply and using fewer reagents. Further, this methodology allows detection of mutations associated with specific genomic annotations (active gene, active promoter, active enhancer etc.) without predefining a limited set of genomic locations (such as a set of cancer risk genes) and designing specific reagents to amplify and/or detect those predefined sequences. By first reducing the effective size of the sequenced portion of the genome (to only what is immunoprecipitated) sequencing cost is dramatically reduced. Additionally, because there are fewer repeat sequences and uninformative sequences there is less background and fewer false positive results. Lastly, sequencing at a given depth provides more reads over the sequences that are informative.
- In some embodiments, the improving comprises at least one of: decreasing signal to noise ratio, increasing confidence in positive detection of the disease, decreasing false detection of the disease and accurately detecting the disease with less cfDNA from the subject. In some embodiments, the increasing is at least a 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500 or 1000% increase. Each possibility represents a separate embodiment of the invention. In some embodiments, the decreasing is at least a 10, 20, 30, 40, 50, 60, 70, 80, 90, 85, 97, 99 or 100% decrease. Each possibility represents a separate embodiment of the invention.
- In some embodiments, the less cfDNA from the subject is less than 1000, 900, 800, 700, 600, 500, 400, 300, 200, 100, 90, 80, 70, 60, or 50 ng of cfDNA. Each possibility represents a separate embodiment of the invention. In some embodiments, the same accuracy can be achieved with the less cfDNA as compared to a larger amount of cfDNA. In some embodiments, the same sequencing coverage at the mutation can be achieved with the less cfDNA as compared to the coverage achieved with a larger amount of cfDNA.
- By another aspect, there is provided a computer program product for determining a cell or tissue of origin of cell free DNA (cfDNA), comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to:
-
- a. sequence or access sequencing of cfDNA isolated with a reagent that binds a DNA-associated protein;
- b. assign a cfDNA molecule from the cfDNA to a cell or tissue of origin by comparing a DNA sequence of the molecule to sequences associated with the DNA-associated protein in the cell type or tissue; and
- c. provide an output regarding the cell or tissue of origin of cfDNA.
- By another aspect, there is provided a computer program product for determining a cellular state of a cell in a subject as the cell died, comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to:
-
- a. sequence or access sequencing of cfDNA from the subject isolated with a reagent that binds a DNA-associated protein;
- b. assign a cfDNA molecule from the cfDNA to a cellular state by comparing a DNA sequence of the molecule to sequences associated with the DNA-associated protein in the cellular state; and
- c. provide an output regarding the cellular state of a cell in the subject as the cell died.
- By another aspect, there is provided a system for determining the cell or tissue of origin of cfDNA, comprising:
-
- a. one or more devices for sequencing cfDNA isolated with a reagent that binds a DNA-associated protein;
- b. a processor; and
- c. storage medium comprising a computer application that, when executed by the processor, is configured to:
- i. sequence or access sequencing of cfDNA isolated with a reagent that binds a DNA-associated protein;
- ii. assign a cfDNA molecule from the cfDNA to a cell or tissue of origin by comparing a DNA sequence of the molecule to sequences associated with the DNA-associated protein in the cell type or tissue; and
- iii. output from the processor the cell or tissue of origin of cfDNA.
- By another aspect, there is provided a system for determining a cellular state of a cell in a subject as the cell died, comprising:
-
- a. one or more devices for sequencing cfDNA isolated with a reagent that binds a DNA-associated protein;
- b. a processor; and
- c. storage medium comprising a computer application that, when executed by the processor, is configured to:
- i. sequence or access sequencing of cfDNA isolated with a reagent that binds a DNA-associated protein;
- ii. assign a cfDNA molecule from the cfDNA to a cellular state by comparing a DNA sequence of the molecule to sequences associated with the DNA-associated protein in the cellular state; and
- iii. output from the processor the cell or tissue of origin of cfDNA.
- By another aspect, there is provided a computer program product for detecting a disease state in a subject, comprising a non-transitory computer-readable storage medium having program code embodied thereon, the program code executable by at least one hardware processor to
-
- a. assign a cfDNA molecule from the cfDNA to a disease state by comparing a DNA sequence of the molecule to mutant sequences associated with the disease state;
- b. provide an output regarding the disease state in a subject.
- The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
- Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
- These computer readable program instructions may be provided to a processor of a general-purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- Embodiments may comprise a computer program that embodies the functions described and illustrated herein, wherein the computer program is implemented in a computer system that comprises instructions stored in a machine-readable medium and a processor that executes the instructions. However, it should be apparent that there could be many different ways of implementing embodiments in computer programming, and the embodiments should not be construed as limited to any one set of computer program instructions. Further, a skilled programmer would be able to write such a computer program to implement one or more of the disclosed embodiments described herein. Therefore, disclosure of a particular set of program code instructions is not considered necessary for an adequate understanding of how to make and use embodiments. Further, those skilled in the art will appreciate that one or more aspects of embodiments described herein may be performed by hardware, software, or a combination thereof, as may be embodied in one or more computing systems. Moreover, any reference to an act being performed by a computer should not be construed as being performed by a single computer as more than one computer may perform the act.
- By device for sequencing it is meant a combination of components that allows the sequence of a piece of DNA to be determined. In some embodiments, the testing device allows for the high-throughput sequencing of DNA. In some embodiments, the testing device allows for massively parallel sequencing of DNA. The components may include any of those described above with respect to the methods for sequencing.
- In certain embodiments the system or test kit further comprises a display for the output from the processor.
- By another aspect, there is provided a solid support, comprising a capturing agent and a barcoding reagent.
- As used herein, the term “capturing agent” refers to a molecule that binds to a protein and can thus capture and retain the protein to the solid support. In some embodiments, the capturing agent is a small molecule. In some embodiments, the capturing agent is a protein. In some embodiments, the capturing protein captures a second protein by protein-protein interaction. In some embodiments, the capture protein is an antibody or antigen binding fragment thereof. The capturing agent can be any molecule that specifically binds chromatin or nucleic acids.
- As used herein, the term “barcoding reagent” refers to any substrate that comprises a unique molecule or moiety that can be used as a barcode for identifying a molecule of interest. Barcodes are well known in the art and any molecule or moiety that is unique enough to identify a molecule of interest can be used as a barcode. In some embodiments, the barcoding reagent is the barcode itself. In some embodiments, the barcode is a protein barcode. In some embodiments, the barcode is a protein tag. In some embodiments, the barcode is a fluorescent protein.
- In some embodiments, the barcode is a nucleic acid barcode. In some embodiments, the nucleic acid molecule is a short nucleic acid molecule. In some embodiments, it is less than 3, 5, 7, 10, 12, 15, 17, 20 or 25 nucleotides long. Each possibility represents a separate embodiment of the invention. In some embodiments, the nucleic acid molecule is between 3 and 10, 3 and 15, 3 and 20, 3 and 25, 3 and 30, 3 and 35, 3 and 40, 4 and 45, 3 and 50, 5 and 10, 5 and 15, 5 and 20, 5 and 25, 5 and 30, 5 and 35, 5 and 40, 5 and 45, or 5 and 50 nucleotides long. Each possibility represents a separate embodiment of the invention. In some embodiments, the barcoding reagent is an enzyme for joining the barcode to a molecule of interest. In some embodiments, the barcoding reagent is a ligase. In some embodiments, the barcoding reagent is the barcode and the solid support further comprises an enzyme for joining the barcode to a molecule of interest.
- In some embodiments, the molecule of interest is a protein. In some embodiments, the molecule of interest is a nucleic acid molecule. In some embodiments, the molecule of interest in a DNA or RNA. In some embodiments, the capturing agent captures a protein associated with the molecule of interest. In some embodiments, the capturing agent captures a DNA-associated protein and the molecule of interest is DNA. In some embodiments, the DNA is cfDNA. In some embodiments, the molecule of interest is in complex with the protein captured by the capturing agent.
- The solid support may be any polymer, or inorganic material to which biological macromolecules can be attached. The attachment can be direct or indirect. In some embodiments, the solid support is made from material used to assemble microfluidic devices. In some embodiments, the solid support is a bead. In some embodiments, the bead is an agarose bead. In some embodiments, the solid support is a magnetic or paramagnetic bead. In some embodiments, the solid support is an agarose or magnetic or paramagnetic bead. In some embodiments, the support is conjugated to the capturing agent. In some embodiments, the support is conjugated to a ChIP antibody. In some embodiments, the support is conjugated to the barcoding reagent. In some embodiments, the support is conjugated to capturing agent and the barcoding reagent. Conjugation can be performed by any method known in the art including, but not limited to covalent bonding, charge-based bonding, and hydrophobic interactions. In some embodiments, the conjugation is biotin to avidin conjugation. In some embodiments, the conjugation is by amine binding technologies. In some embodiments, the amine binding technology is epoxy. In some embodiments, the conjugation is by carboxyl group capturing.
- By another aspect, there is provided a method for multiplexing an assay on more than one molecule of interest in a single solution, the method comprising:
-
- a. capturing within the solution a first molecule of interest to a first solid support of the invention;
- b. capturing within said solution at least a second molecule of interest to a second solid support on the invention;
- c. attaching the first molecule of interest and a first barcode and at least the second molecule of interest and a second barcode;
- d. simultaneously performing the assay on the first and second molecules of interest, wherein the result of the assay on the first molecule of interest is identified by the first barcode and the result of the assay on the second molecule of interest is identified by the second barcode;
thereby multiplexing an assay on more than one molecule of interest in a single solution.
- As used herein, “multiplexing an assay” refers to performing one assay on multiple samples simultaneously. Multiplexing is useful when samples are limited, the assay is costly in terms of time, money, reagents or sample input. By multiplexing using the method of the invention assays can be performed from start to finish simultaneously so that variation between samples is reduced (
FIG. 5B ). For example, in a multiplex chromatin immunoprecipitation followed by next generation sequencing (ChIP-Seq) assay using the method of the invention, the protein capture for all antibodies used is performed in one tube and at one time. The ligation of the barcode all occurs at once as well and washing and sequencing is all also performed as one. This greatly limits any inter-sample variability in assay performance. In some embodiments, the assay is any one of ChIP, ChIP-Seq, cfChIP, cfChIP-Seq, protein quantification, and a protein-protein interaction assay. Protein quantification can be achieved by adding a generic DNA adapter/sequence to the protein and then ligating the barcode. In some embodiments, the methods of the invention further comprise attaching an adapter to the protein of interest. In some embodiments, the assay is chromatin immunoprecipitation followed by sequencing (ChiP-Seq). - In some embodiments, the molecule of interest is a protein. In some embodiments, the molecule of interest is a nucleic acid molecule. In some embodiments, the molecule of interest is a protein and/or a nucleic acid molecule.
- In some embodiments, the identifying by the barcode comprises quantification of the amount and/or number of the molecules of interest. In some embodiments, the amount and/or number of barcodes is equal to the amount and/or number of molecules of interest. In some embodiments, the amount and/or number of barcodes is proportional to the amount and/or number of molecules of interest. In some embodiments, the amount and/or number of barcodes is equal or proportional to the amount and/or number of molecules of interest.
- As used herein, the term “about” when combined with a value refers to plus and minus 10% of the reference value. For example, a length of about 1000 nanometers (nm) refers to a length of 1000 nm+−100 nm.
- It is noted that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a polynucleotide” includes a plurality of such polynucleotides and reference to “the polypeptide” includes reference to one or more polypeptides and equivalents thereof known to those skilled in the art, and so forth. It is further noted that the claims may be drafted to exclude any optional element. As such, this statement is intended to serve as antecedent basis for use of such exclusive terminology as “solely,” “only” and the like in connection with the recitation of claim elements or use of a “negative” limitation.
- In those instances where a convention analogous to “at least one of A, B, and C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “a system having at least one of A, B, and C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). It will be further understood by those within the art that virtually any disjunctive word and/or phrase presenting two or more alternative terms, whether in the description, claims, or drawings, should be understood to contemplate the possibilities of including one of the terms, either of the terms, or both terms. For example, the phrase “A or B” will be understood to include the possibilities of “A” or “B” or “A and B.”
- It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. All combinations of the embodiments pertaining to the invention are specifically embraced by the present invention and are disclosed herein just as if each and every combination was individually and explicitly disclosed. In addition, all sub-combinations of the various embodiments and elements thereof are also specifically embraced by the present invention and are disclosed herein just as if each and every such sub-combination was individually and explicitly disclosed herein.
- Additional objects, advantages, and novel features of the present invention will become apparent to one ordinarily skilled in the art upon examination of the following examples, which are not intended to be limiting. Additionally, each of the various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below finds experimental support in the following examples.
- Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples.
- Generally, the nomenclature used herein, and the laboratory procedures utilized in the present invention include molecular, biochemical, microbiological and recombinant DNA techniques. Such techniques are thoroughly explained in the literature. See, for example, “Molecular Cloning: A laboratory Manual” Sambrook et al., (1989); “Current Protocols in Molecular Biology” Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al., “Current Protocols in Molecular Biology”, John Wiley and Sons, Baltimore, Maryland (1989); Perbal, “A Practical Guide to Molecular Cloning”, John Wiley & Sons, New York (1988); Watson et al., “Recombinant DNA”, Scientific American Books, New York; Birren et al. (eds) “Genome Analysis: A Laboratory Manual Series”, Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; “Cell Biology: A Laboratory Handbook”, Volumes I-III Cellis, J. E., ed. (1994); “Culture of Animal Cells—A Manual of Basic Technique” by Freshney, Wiley-Liss, N. Y. (1994), Third Edition; “Current Protocols in Immunology” Volumes I-III Coligan J. E., ed. (1994); Stites et al. (eds), “Basic and Clinical Immunology” (8th Edition), Appleton & Lange, Norwalk, C T (1994); Mishell and Shiigi (eds), “Strategies for Protein Purification and Characterization—A Laboratory Course Manual” CSHL Press (1996); all of which are incorporated by reference. Other general references are provided throughout this document.
- All clinical studies were approved by the relevant local ethics committees. The study was approved by the Ethics Committees of the Hebrew University-Hadassah Medical Center of Jerusalem. Informed consent was obtained from all subjects or from their legal guardians before blood sampling.
- Blood samples were collected in VACUETTE® K3 EDTA tubes, transferred immediately to ice and 1× protease inhibitor cocktail (Roche) and 10 mM EDTA were added. The blood was centrifuged (10 minutes, 1500×g, 4° C.), the supernatant was transferred to fresh 14 ml tubes, centrifuged again (10 minutes, 3000×g, 4° C.), and the supernatant was used as plasma for ChIP experiments. The plasma was used fresh or flash frozen and stored at −80° C. for long storage.
- 50 μg of antibody were conjugated to 5 mg of epoxy M270 Dynabeads (Invitrogen) according to manufacturer instructions. The antibody-beads complexes were kept at 4° C. in PBS, 0.02% azide solution.
- 0.2 mg of conjugated beads (˜2 μg of antibody) were used per cfChIP sample. The antibody-beads complexes were added directly into the plasma (1-2 ml of plasma) and allowed to bind to cf-nucleosomes by rotating overnight at 4° C. The beads were magnetized and washed 6 times with blood wash buffer (
BWB 50 mM Tris-HCl, 150 mM NaCl, 1% Triton X-100, 0.1% Sodium DeoxyCholate, 2 mM EDTA, lx protease inhibitors cocktail), twice with BWB-500 (same as BWB only with 500 mM NaCl), and three times with 10 mM Tris pH 7.4. All washes were done with 150 ul buffer on ice by shifting the beads from side to side on a magnet. Vacuum was not used to remove supernatant during washes in buffers that do not contain detergents. Following removal of the beads, the plasma was stored as it was suitable for further rounds of cfChIP. - On-beads chromatin barcoding and library amplification was done to overcome the issue of low input material. This procedure supports preparation of cfChIP from as little as 1000 cells. The following steps were all performed on the beads so as to reduce loss of cfDNA that might occur during freeing of the cfDNA and tube transfers. DNA ends were repaired by T4 DNA polymerase and T4 polynucleotide kinase. Following a wash Klenow exo minus was employed to add an adenine base to the repaired end of the DNA. Following another wash DNA adapters were ligated on; in this case Illumina adapters with DNA barcode sequences were used. For the DNA elution and cleanup steps the beads were incubated for 1 hour at 55° C. in 50 μl of chromatin elution buffer (10 mM Tris pH 8.0, 5 mM EDTA, 300 mM NaCl, 0.6% SDS) supplemented with 50 units of proteinase K (Epicenter), and the DNA was purified by 1.2×SPRI cleanup (Ampure xp, agencourt). The purified DNA is eluted in 25 μl EB (10 mM tris pH 8.0) and 23 μl of the eluted DNA were used for PCR amplification with Kapa hotstart polymerase (16 cycles). The amplified DNA was purified by 1.2×SPRI cleanup and eluted in 12 μl EB. The eluted DNA concentration was measured by Qubit and the fragments size was analysed by tapestation visualization. Note: If adapter dimers were still visible by tapestation post library amplification, samples that are not identically barcoded can be pooled, separated on a 4% agarose gel (E-Gel® EX Agarose Gels, 4%, Invitrogen), and fragments larger than adapter dimers (>200 bp) are gel purified. Alternatively, gel purification can be avoided by performing an additional X 0.8 SPRI cleanup post sample pooling, which remove most of the adapter dimers. DNA libraries were paired end sequenced by
Illumina NextSeq 500. - Reads were aligned to the human genome (hg19) using bowtie2 with ‘no-mixed’ and ‘no-discordant’ flags. We discarded reads with low alignment scores and duplicate fragments.
- We downloaded consolidated aligned data from the Roadmap Epigenome Consortium database (egg2.wustl.edu/roadmap/data/byFileType/alignments/consolidated/). To these we added Kidney samples (egg2.wustl.edu/roadmap/data/byFileType/alignments/unconsolidated/H3K4me3/BI.Adult_Kidney.H3K4me3.27.filt.tagAlign.gz and egg2.wustl.edu/roadmap/data/byFileType/alignments/unconsolidated/H3K4me3/BI.Adult_Kidney.H3K4me3.153.filt.tagAlign.gz). For our analysis we discarded pre-natal, ESC, and cell-line samples, resulting with 71 tissues and cell types.
- We downloaded RNA-seq data from the TCGA and GTEx projects as analyzed by the Xena project (Toil enables reproducible, open source, big biomedical data analyses, Vivian J, et al., Nat. Biotechnol., 2017, and Toil RNAseq Recompute database, tcga-data.nci.nih.gov). We defined the set of genes that are over-expressed in a tumor type to satisfy three requirements: 1) Significantly higher expression in tumor samples compared to the corresponding tissue samples (t-test, q<0.001 after FDR correction); 2) Significantly higher expression compared to all healthy samples (t-test, q<0.001 after FDR correction); and 3) Median expression in the tumor is higher than median expression in each of the healthy samples.
- We downloaded the Roadmap Epigenome Consortium ChromHMM annotation of all consolidated tissues (egg2.wustl.edu/roadmap/data/byFileType/chromhmmSegmentations/ChmmModels/coreMarks/jointModel/final/all.mnemonics.bedFiles.tgz). Using these annotations, we constructed a catalogue of potential TSS sites. We extended the catalogue to include 3 kb regions centered on TSS of annotated transcripts in the UCSC gene database and ENSEMBL transcript database (UCSC known genes: Bioconductor AnnotationHub AH5036; ENSEMBL transcripts Bioconductor AnnotationHub AH5046; Genomic annotations: Bioconductor AnnotationHub AH5040). We used the combined catalogue to define regions along the genome, which are either TSS or “background” (most likely not TSS). The later regions were tiled by 5 kb sized windows.
- We quantified the number of reads covering each region in the catalogue in each of our samples and atlas samples. We estimated locally adaptive model of non-specific reads along the genome for each of the samples and extracted counts that represent specific ChIP signal in the catalogue for each sample (see below). These were then normalized (Supplemental Text) and scaled to 1M reads in the reference healthy samples.
- To define tissue specific signatures of a specific modification, we examined binned representation of the atlas. For each tissue we defined a signature of unique windows with signal in one of the samples of the target tissue and without coverage in all others (see below).
- To define process signature, we converted gene specific annotations (e.g., GO) to genomic windows by including all windows that overlap with promoters of genes in the annotation.
- We consider two different statistical tests. For both tests we need to estimate the background coverage, that is reads from non-specific pulldown (see below).
- The first test is whether a signature is present. Formally, we examined whether we can reject the null hypothesis that the number of reads in signature windows will be Poisson distribution according to background rate (see below). We compute p-value of the actual number of observed reads in signature windows as the probability of having this number or higher according to the null hypothesis. Rejection of the null hypothesis for a specific signature is an indication that some of the windows in the signature carry the modification in question in a subpopulation of cells contributing to the cf-nucleosome pool.
- The second test is whether a signature is over-represented than expected in healthy baseline subjects. To define the latter expected we use average signal from 5 healthy samples to define the average number of reads (per million) in each window. We then estimate two sample-specific parameters—first a background rate (discussed above) and second a scaling factor that rescales average expectations to the sequencing depth of the specific sample (see below) Together, these define the expected coverage in each window under the null-hypothesis that the subject is from the healthy population. We compute p-value of the actual number of observed reads in signature windows as the probability of having this number or higher according to the null hypothesis. Rejection of the null hypothesis for a specific signature is an indication that some of the windows in the signature have higher signal than we expected in healthy subjects. The interpretation is that these are abnormal processes active in the cells contributing to the subject's cf-nucleosome pool.
- We constructed a TSS catalogue through the following steps. All steps were carried on human genome version “hg19”:
-
- 1. We downloaded ChromHMM calls for 111 tissues and cell types throughout the human genome from Roadmap Epigenomics website (egg2.wustl.edu/roadmap/data/byDataType/rna/expression/57epigenomes.RPKM.pc.gz). UCSC Browser known gene annotations and ENSEMBL transcript annotations were downloaded (UCSC known genes: Bioconductor AnnotationHub AH5036; ENSEMBL transcripts Bioconductor AnnotationHub AH5046; Genomic annotations: Bioconductor AnnotationHub AH5040).
- 2. We filtered all genomic ranges that were marked with states “1_TssA” or “2_TssAFlnk” and merged adjacent ranges that were marked as either state in exactly the same set of tissues. We call these “ChromHMM TSS windows”. We found 476,931 such windows. Each ChromHMM TSS window was assigned gene name(s) through the following steps.
- a. If it was within 2.5 Kb of one or more TSSs in UCSC known gene annotation, it was assigned the name of these genes.
- b. If not, we searched for an ENSEMBL transcript start within 2.5 Kb. Again, if such were found the TSS window received the gene name associated with the transcript.
- c. All other TSS windows remained without a name
- 3. To include transcripts that are not represented in the TSS catalogue, we examined all genes in the UCSC Known Gene database and all transcripts in the ENSEMBL database. For each we defined a TSS window of
size 3 Kb centered on the TSS. We discarded all such windows that overlapped with a TSS window fromstep 2. In total this step added 14,857 and 41,376 TSS windows from UCSC Known genes and ENSEMBL transcripts, respectively. - 4. We created windows that tile the remaining genomic regions between TSS windows. For each TSS window without an adjacent TSS window, we created “flanking” regions of
size 1 Kb (or less). This resulted in 370,332 flanking windows (as some of the TSS windows are adjacent to each other, depending on ChromHMM calls in different tissues). The remaining uncovered regions were tiled with “background” regions ofsize 5 Kb (or less). In total there were 502,263 such windows.
- The resulting catalogue was saved as BED file (TSS.bed).
- Base calling was performed with bcl2fastq (2.18). Paired-end reads were mapped to the human genomes (hg19) using bowtie2 with “no-mixed” and “no-discordant” flags descarding reads with
quality 0. BEDPE files (start and end of every fragment) were obtained using BEDtools “bamtobed” with “bedpe” flag, discarding duplicate fragments. BEDPE files were converted to coverage counts over windows in the catalogue using BEDtools “intersect” command and also using BioConductor “GenomicRanges” countOverlaps( ) function. Both methods count for each window the number of sequenced fragments that overlap with the window. - Every ChIP procedure has non-specific background signal. In the case of cfChIP the background is due to some forms of non-specific binding of DNA and chromatin fragments to the beads-antibody complex. Our experience showed that the background levels varied between samples and batches of bead-antibody ligation. Moreover, the sequencing depth varied between samples, and in a deeply sequenced sample the number of background reads increase. Thus, it was important to estimate background signal levels to be able to contrast them with actual signal.
- We initially applied a simple-minded procedure for removing background in H3K4me3 signal. We reasoned that virtually all of the specific signal in H3K4me3 is at TSS and
gene 5′ regions. Thus, reads in other locations represent background. To account for TSSs that are not annotated in our TSS catalogue, we reasoned that some small fraction of background windows might contain real signal, and thus we removed the ones with the highest values. - In more detail we did the following. We created a vector with the coverage of all “background” windows of size >=4 Kb (421,465 out of 549,385). And applied the following procedure:
-
estimateBackground(X) T ← quantile(95, X) // find the 95th quantile of X X ← X[X ≤ T] // restrict ourselves to values below T // maximum likelihood of truncated poisson Return {circumflex over (λ)}/5 // convert to reads/Kb - This procedure was relatively robust to the choice of quantile for removing outlier windows.
- However, in some samples the Poisson distribution was not a good fit for background values. Further examination revealed that much of this discrepancy was due to local background effects. One local effect is the sex chromosomes that appear in 50% levels in males, and 100% (X) and 0% (Y) in female. These were not the only local effects—some regions showed higher levels of background. This could be due to segmental duplication (regions close to centromeres and telomeres) or accessibility issues. Moreover, in cancer samples there were clear aberrations that were patient specific.
- To overcome these issues, we devised a localized background rate estimate. We used the above estimation procedure, but in successive levels of resolution.
-
- 1. Genome-wide background level.
- 2. Chromosome-specific background.
- 3. Tiles of 10 Mb covering each chromosome at offsets of 2.5 Mb.
- 4. Tiles of 5 Mb covering each chromosome at offset of 1.25 Mb.
- The estimate at each level used the estimate of the previous level as a prior (using pseudo-counts of 1000 windows in
levels - The result is an estimate of background coverage rate at overlapping tiles of 5 Mb. To get a single estimate, for each location we take the maximum of the estimate of the tiles covering it (typically 4 tiles). We choose the maximum as we reasoned that over-estimate of background might reduce the estimated signal but would reduce the number of background artifact.
FIG. 6A shows the background estimate for a healthy male sample.FIG. 6B shows a healthy female sample, wherein the chrX background is lower than autosomal chromosomes (slightly more than half) and chrY background is a bit lower. Many locations in chrY are orthologous to ones in chrX leading to skewed estimates. Other deviations occur close to centromeres where we find the background level to be higher in some chromosomes (e.g., chr1, chr9). When a patient with cancer is examined, the background estimate is much more variable, presumably reflecting chromosomal aberrations in the tumor (FIG. 6C ). - For each gene we assigned a set of TSS windows that are annotated with the gene name. For each sample we computed the actual total coverage over the windows assigned to the gene and the expected mean of background reads over these windows (using possibly different local rate at each window and the window size).
-
- Where W g is the set of windows assigned to gene g, C[w,s] the coverage of window w in sample s, and λ{circumflex over ( )}[w, s] the estimated background rate of window w in sample s.
- The null assumption is that the coverage Cg is distributed as a Poisson with parameter Gb. Thus, we argued that values much larger than expected are signal. We define the raw signal at gene g as:
-
S[g,s]=C[g,s]−B[g,s] if C[g,s]≥B[g,s]+2√B[g,s] and 0 otherwise - Thus, we consider C[g,s,] to be a real signal if it is larger than two standard deviations from the mean of the background level for the gene.
- Applying this procedure for each sample generates a matrix of counts for each gene in each sample. We also include in this matrix the samples from Roadmap Epigenomics data of H3K4me3 ChIP which were processed in the same manner.
- To normalize the effect of different coverage, we reasoned that the signal at promoters of “housekeeping” autosomal genes should be similar in different samples. We defined these genes as ones with highly significant signal in a set of reference healthy samples. The precise choice of significance level did not change the normalization.
- On the matrix of raw signal samples X housekeeping genes we applied quantile normalization (Bioconductor normalize.quantiles). This resulted in normalized values for housekeeping genes in each sample. However, it does not assign values for all other genes. We thus, estimate a multiplicative normalization factor for each sample to best match quantile-normalized values to raw values. For most samples the relation between the two was linear.
- The scaling factors were rescaled so that the total normalized signal (below) at the set of reference healthy samples will be a million on average.
- Using these normalization factors, levels ν [s] we computed for each sample the normalized gene levels:
-
N[g,s]=ν[s]*S[g,s] - Using the same normalization procedure, we also normalized the coverage at each window in each sample:
-
N[w,s]=ν[s]*max(C[w,s]−B[w,s],0) - Using the Roadmap Epigenomics metadata table we defined sets of Roadmap samples that belonged to a tissue or group of tissues. These definitions included some redundancies. For example, the group Lymphocytes included B-Cells, T-Cells, and NK samples, and thus subsumed each of these groups.
- We then defined for each group the set of specific windows, as windows w passing the following criteria:
-
- 1. The window w is on an autosomal chromosome
- 2. In at least one of the atlas samples in the group, N [w, s]≥35
- 3. In all atlas samples outside the group, N [w, s]<15
- 4. n all windows w′ within 1 Kb of w, N [w, s]<15
The last condition is added as we noticed that often when a gene is expressed there is “spill over” to neighboring windows.
- Groups for which we found less than 4 specific windows were considered to be without signature. For all other groups, we define the signature as the set of specific windows (see Table 1). The workflow of cfChIP processing and analysis is provided in
FIG. 7 . - We use two main tests herein:
- Detection test. To test whether a gene or a signature is present above background in a sample, we used a Poisson distribution. More specifically:
-
computeDetectionPValue(W, s) Return Pλ(X ≥ x) // Poisson p-value - Here W is a set of windows, it can be the windows associated with a gene or tissue-specific signature as above.
- Over-expression test. To test whether the observed signal of a set of genes is higher than expected in healthy samples, we used a reference of healthy subjects to define the expected normalized signal of the gene H [g] as the average of N [g, s] in the reference samples.
- We then used the following procedure:
-
computeOverExpressionPValue(G,s) Return Pλ(X ≥ x) // Poisson p-value - The main difference from the previous test is that we included the contribution of healthy samples after we transform from normalized units to the units of the specific sample. The second difference is that we work at the level of genes.
- Most plasma cfDNA is likely in the form of nucleosomal DNA (cf-nucleosomes) with intact histone modifications. Whether extracting and sequencing DNA from cf-nucleosomes carrying specific histone marks could be used to determine information about the cell of origin of the cfDNA was investigated (
FIG. 1A ). This is approach is attractive for several reasons. First, ChIP experiments by design sequence only the positive signal reducing the number of reads required for a positive signal and thus lower the costs and work associated with the assay. Second, positive targets are relatively rare; promoter marks, such as H3K4me3, appear in ˜50,000 locations in the genome (<1% of the genome). Enhancer marks (e.g., H3K4me1) can appear in many regions (˜10% of the genome) but are limited in each cell. Third, histone marks are mostly tissue-specific. In particular, most enhancers are tissue-specific and thus provide strong H3K4me1/3 tissue specificity (FIG. 1B ). Fourth, histone modifications reflect transcriptional activity and respond to changes in cell state. Thus, they create an opportunity to detect changes in activity in cells as they die. - We devised a simple protocol for cf-nucleosome ChIP-seq (cfChIP) from as little as 1-2 ml of plasma (
FIG. 1A inset, 1C). cfChIP and paired-end sequencing on plasma samples from 11 healthy individuals produced 0.3-1.7 and 0.9-25 million unique reads per sample for H3K4me3, and H3K4me1 respectively, suggesting that ˜1-2% of nucleosomes in the plasma with the respective mark (e.g., H3K4me3) were captured, adapter ligated, and sequenced (see Materials and Methods). Importantly, the cfChIP signal surrounding ubiquitously-expressed genes shows high correlation with reference ChIP-seq from tissues (NIH Epigenome Roadmap consortium) (FIG. 1C ). Globally, meta-analysis of the cfChIP signal for H3K4me1 and H3K4me3 yields the expected typical distribution for these marks around enhancers and promoters (FIG. 1D ). - A potential concern is contamination by chromatin released from the lysis of white blood cells during blood draw. Several lines of evidence argue that this is highly unlikely. (a) Fragment size distributions of cfChIP libraries show two peaks at ˜170 and ˜320 bp corresponding to DNA wrapped around mono- and di-nucleosomes (
FIG. 1E ), consistent with apoptotic and in some cases necrotic cell death, but not cell lysis, which results in fragments at the range of 10 kb or larger. (b) We identified thousands of enhancers carrying H3K4me1 and tens of promoters carrying H3K4me3 that are absent in ChIP-seq from leukocytes that constitute the largest portion of nucleated blood cells (peripheral blood mononuclear cells;FIG. 1F-1G ). Analysis of the promoters that are marked by H3K4me3 in cfChIP but not in leukocytes identifies strong signal from megakaryocytes that reside in the bone marrow. (c) We are able to detect disease-related chromatin from remote tissues from patients (see below). - Non-histone DNA associated proteins can also be used for cf-ChIP. We spiked human plasma with 90 ng of chromatin prepared by native MNase treatment of DNA from mouse embryonic stem cells and performed cfChIP with anti-CTCF antibody. Sequencing reads from the cf-ChIP were aligned to the mouse genome, and clear sharp peaks that overlapped with peaks obtained from ChIP-Seq for CTCF from mouse cells were observed (
FIG. 1H ). Meta-analysis of the data shows a clear signal at CTCF sites throughout the genome, and similar analysis of cfChIP with anti-H3K4me3 antibody showed depletion of the histone mark at the same sites (FIG. 1I ). CTCF binding and H3K4 trimethylation are generally mutually exclusive, so this result helps to confirm that the CTCF signal is genuine. - Together, these results strongly suggest that cf-nucleosomes preserve well established endogenous patterns of active histone marks and transcription factor binding. We focus our analysis on H3K4me3 at this point since it is relatively straightforward to assign H3K4me3 peaks to specific genes.
- To assess the reproducibility of cfChIP, we performed technical and biological replicates from several subjects. Replicates from the same individual and between healthy individuals show correlations of 0.94-0.97 and 0.92-0.94, respectively (
FIG. 2A-C ). Peaks with low reproducibility between subjects are enriched for X or Y-chromosome specific genes and are indeed not apparent when comparing two individuals of the same sex (FIG. 2B ). - To test the detection limit of cfChIP we took advantage of sequences unique to the Y chromosome and titrated male-derived plasma into female-derived plasma. We evaluated the sensitivity for a particular genomic location as well as for a genomic signature—an ensemble of differentially represented genomic locations that can define a certain cell type or a certain transcriptional program. The H3K4me3 cfChIP signal at male-specific peaks on the Y chromosome (
FIG. 2D ) shows that we can reliably identify single male-specific peaks even when male plasma accounted for less than 10% of the total plasma (FIG. 2D ). Moreover, the contribution at male specific peaks increases linearly with the fraction of male plasma mixed (FIG. 2E-F ), demonstrating that cfChIP sensitivity is linearly related to the fraction and size of the signature locations and sequencing depth. Indeed, combining signal from 25 male-specific peaks improves the detection sensitivity to 1% (FIG. 2E, 2G ). This is likely an underestimate since there is a single Y chromosome in a diploid genome. Extrapolating from our male spike-in experiments, we estimate that modest signature sizes of 45 to 200 peaks can detect cfDNA from cells that constitute 0.1% of the cfDNA pool with high probability (0.95 or higher) at low sequencing depths (FIG. 2H ). Signatures of this size can be identified for specific cell types or transcriptional programs (FIG. 2I ). - The level of H3K4me3 at promoters from tissue samples correlates with the level of transcription and is strongly predictive of gene expression levels. We find that the cfChIP H3K4me3 correlates with leukocytes RNA-seq at constitutive genes (based on Roadmap Epigenomics Consortium and GTEx Consortium 2015) and this correlation is similar to that of ChIP-seq from leukocytes (
FIG. 2J ). Similarly, we find high correlation with leukocyte expressed genes in agreement with their major contribution to the cfDNA pool in healthy individuals (FIG. 2K ). These results strongly suggest that cfChIP of transcription-related histone modifications can provide insights into the gene expression patterns at the cfDNA's cells of origin. - With these findings in mind, we set out to test the ability to detect tissue-specific signatures in samples from healthy subjects. A previous study on cfDNA CpG methylation estimated that ˜55% of cfDNA emanates from leukocytes, and ˜1% from the liver, with minimal or no contribution of cfDNA from the heart and brain. Using the Roadmap Epigenomics dataset of H3K4me3 ChIP-seq on multiple healthy tissues as a reference, we defined tissue-specific signatures in an unbiased manner (see Materials and Methods, Table 1). We then evaluated the normalized number of reads for each signature in each subject and the statistical significance of these counts (
FIG. 2L-M ; Materials and Methods). As expected, the presence of leukocytes can be detected when using a few specific peaks or even a single one. Using a larger signature, the presence of liver cfDNA is also clearly and significantly detected in contrast to brain and heart signatures that are absent in blood, as expected. These signals are specific with high statistical confidence (q<10-20, see Materials and Methods), and they demonstrate the ability of cfChIP to detect cf-nucleosomes from rare populations of cells. - The ability of cfChIP to identify signatures of cells from remote tissues suggested the exciting possibility for this tool to detect cf-nucleosomes originating from disease-related pathologic cell death. To test this hypothesis, we collected samples from patients diagnosed with acute myocardial infarction (AMI), a process that results in extensive cardiomyocyte cell death. We collected samples from patients admitted to the emergency room, with samples obtained both prior to, immediately after, and ˜12 hours following urgent percutaneous coronary intervention (PCI) to restore blood flow. We expect to observe cfChIP signal from myocardial cell death only in AMI patients but not in healthy subjects, particularly in samples following PCI.
- As predicted, heart-specific H3K4me3 peaks were strongly and significantly detected in post-PCI patient samples, but not in samples from healthy individuals or from pre-PCI patients (
FIG. 3A ). The heart signal includes clear peaks in promoters of heart-specific genes. For example, TNNT2 and TNNI3 that encode for the heart-specific Troponin T2 and 13 are clearly active and observed only in these samples. These two genes encode canonical protein markers for myocardial damage (FIG. 3B ). Indeed, we see good correlation between the strength of the cfChIP heart signature, the levels of troponin measured in the blood, and the estimate of heart cfDNA based on heart-specific differentially methylated CpGs (FIG. 3C ). - For an unbiased view of our ability to describe tissue-of-origin, we evaluated a panel of cell-type specific signatures across cfChIP samples (
FIG. 3D-E ). This analysis shows that in all samples we can detect signatures of a range of cell types from the blood (e.g., monocytes and neutrophils), and organs (e.g., liver). One of our subjects (H008) was four months pregnant carrying a male fetus and her sample exhibited a significant placenta signature. Indeed, we were able to detect low but significant Y-chromosome signal in her plasma as well (FIG. 3D ). - In AMI patient samples the picture is more complex. As discussed above, AMI patients sampled several hours post PCI show a clear cardiomyocyte signature (
FIG. 3A, 3D ). In addition, however, in AMI patients both before and shortly after PCI, we observed a significant increase in hepatocyte signature. This signature includes clear signal at liver-specific genes, such as Albumin and complement genes (FIG. 3F ). This unanticipated observation is presumably a result of the well-known phenomenon of liver injury in AMI patients secondary to low organ perfusion and liver hypoxia. One of these AMI patients (M002) also exhibited increased levels of active chromatin from erythroblast-associated genes, including the hemoglobin locus (see below), and the erythropoietin locus (EPO), most likely from hepatocytes due to liver response to hypoxia. Together this data suggests a systemic response to oxygen shortage. The liver and erythroblasts signals were likely due to transient damage caused by reduced systemic perfusion associated with AMI. Indeed, a follow up cfChIP-seq on thispatient 11 months later appeared normal. In a second AMI patient (M001), we observe a gradual decrease in liver signature within hours following PCI, suggesting a prompt relief of the liver oxygen shortage (FIG. 3G ). To confirm our cfChIP observations we analyzed the cfDNA methylation status for liver-specific genes for which the DNA methylation status is indicative of liver cell death. Indeed, we observe good agreement between liver cfChIP signature levels and liver cfDNA estimates (R2=0.97,FIG. 3H ). - An important potential application of cfChIP is identification of cancer tissue of origin. Advanced stage cancer is often accompanied by higher cfDNA content in the plasma, much of it from tumor cells (ctDNA). We collected plasma samples from patients with gastrointestinal (GI) tract tumors and analyzed their cf-nucleosomes tissue of origin (
FIG. 3I, 3E ). In general, plasma from cancer patients contains signal from tissues that are not observed in healthy subjects. Most evidently, we observe signal originating from gastrointestinal (GI) tissue and from GI smooth muscle, which is in agreement with the primary locations of the tumors. A weaker but significant GI signature was evident even when the primary tumor was removed by surgery and only residual metastatic disease was evident (patients C004 and C005). We also observe low but significant signal from additional tissues, such as the brain signal observed in C001. These signals can be due to treatment (C001 underwent brain radiotherapy) or due to collateral damage to normal (non-malignant) tissue. - We also examined a patient with localized hepatocellular carcinoma (HCC) that underwent partial hepatectomy (PHx). We collected blood samples before, during, and at different time points post operation, and analyzed both circulating cf-nucleosomes using cfChIP along with measurements of the classic marker for liver damage, the enzyme ALT (
FIG. 3J ). Surprisingly, ALT levels were normal prior to operation despite the fact that the patient had active cirrhosis in addition to the HCC. The levels of ALT increased during the first day following PHx and gradually decreased during the following days. cfChIP analysis of liver signature strongly agrees with the ALT test, suggesting again that cfChIP detects dynamic processes at remote tissues. One difference is that the cfChIP liver signature dropped back to normal levels about 2 days earlier than ALT. This difference is likely due to the shorter half-life of cfDNA (<2 hours) compared to ALT (˜47 hours) in the circulation. - Together, these results demonstrate that there are pronounced differences in cfChIP signal of healthy subjects compared to patients with ongoing pathological processes. These differences correspond to the tissue where these processes take place, such as heart, liver, and gastrointestinal tissue.
- A major challenge for cfDNA analysis is the inference of gene expression in the tissue of origin. The major approach proposed to this problem so far relied on under-representation of specific promoter elements in cfDNA as an indication of gene expression; however, this approach requires extremely deep sequencing and is limited to situations where cfDNA from the tissue of interest comprises a majority population in blood. We tested to what extent cfChIP can report on non-constitutive gene expression programs that took place in the cells of origin.
- H3K4me3 is closely associated with transcriptional activity and dynamically changes in response to changes in transcriptional programs, raising the exciting possibility that cfChIP might be able to detect more dynamic transcriptional programs beyond the information on tissue of origin. To test this hypothesis, we compared the H3K4me3 cfChIP signal from patients with AMI or cancer to a collection of hallmark gene expression signatures representing different cellular processes and responses (
FIG. 4A-C ). - This analysis uncovered multiple signatures that have higher than expected signal—that is, the amount of cf-nucleosomes captured for the signature is significantly higher than what we see in healthy subjects. For example, we see a strong signature of Heme Metabolism in M002 and C005, patients who suffered from hypoxia and bacteremia, respectively. The blood count of C005 indeed shows high red cell distribution width (RDW) and low red blood cell count (RBC) and hemoglobin (HGB) indicating high production of red blood cells due to anemia. This signal could be from enhanced cell death of erythrocytes progenitors or closely related cells, or due to nuclear loss in erythroblasts undergoing maturation into erythrocytes. Thus, this signature is indicative of a specific hematopoietic cellular differentiation process.
- Other signatures, such as glycolysis or interferon-alpha response reflect processes that can take place in multiple cell types. We observe higher glycolysis signatures in cancer patients, in agreement with the metabolic reprogramming, known as the Warburg effect, that is considered a hallmark of advanced cancers (
FIG. 4D ). We also see a clear increase in glycolysis signature in M002, who suffers from extensive liver damage. Interestingly, in M002, we also see an increased signal of several liver-specific glycolysis genes such as ALDOB and PFKFB1 while in cancer patients the enhanced glycolysis signature does not include signal from these genes. These results suggest that cfChIP can detect cell-specific transcriptional programs with relevance to an underlying pathophysiological state. As expected, in plasma from cancer patients we also observe increases in several proliferation associated signatures (Kras, Myc targets, E2F targets, G2M checkpoint), and the mTORC1 pathway that coordinates metabolism and cell growth. Interestingly, parts of these signatures are also observed in AMI patients that experience liver damage and may reflect liver recovery from ischemic damage following PCI. - Another example of detectable transcriptional programs is Interferon-alpha response that is normally induced due to the presence of pathogens such as viruses and bacteria. We observe dramatic increase in interferon signature in M004 and C005. In the latter this is likely due to severe bacteremia for which he was hospitalized. M004, whose samples show high interferon and inflammation signatures, seems to have experienced more severe heart damage as compared to other AMI patients in terms of troponin levels and cfChIP heart markers (
FIG. 3A, 3C ). This could be due to induction of IRF3/interferon I response in M004 which was recently shown to promote a fatal response to AMI. - Together, these observations demonstrate that cf-nucleosomes not only report on death of specific cell types but can also reflect detailed changes in gene expression programs in a broad range of cell types.
- A hallmark of cancer cells is genetic alterations that lead to dysregulated gene expression programs. Identification of such cancer-specific transcriptional programs can assist diagnosis and treatment choice. For each sample, we tested for genes whose signal was elevated compared to five “reference” healthy samples. As a control, unrelated healthy samples outside the reference set were in high correlation with healthy reference, with few genes (usually less than 50) showing significantly elevated signal (
FIG. 4E ). In contrast, samples from patients revealed hundreds to thousands of genes with significantly elevated signal (FIG. 4E ). Examining these genes for enrichment in annotated gene lists recapitulated some of the results discussed above. For example, genes in C001 were enriched for gene sets of GI tract and Brain consistent with the pathology of this patient. - We next looked for cancer-specific signatures in the H3K4me3 cfChIP signal. We analyzed expression profiles from The Cancer Genome Atlas and GTEx projects to identify for each tumor type a set of genes that are significantly higher in the tumor compared to normal tissues (see Materials and Methods, Table 2). We then tested for significant overlaps between the set of genes with higher H3K4me3 signal in a sample and the set of genes over-expressed in a tumor type (see Materials and Methods). For example, C002 has significant overlap (q<10-60) with GI-tract adenocarcinoma genes (
FIG. 4E ). The analysis of all samples against all tumor types (FIG. 4F-G ) shows that only samples from cancer patients have significant enrichment of tumor related gene expression, while healthy and MI patients do not. Importantly, the enrichment is for cancers of the GI tract in line with the diagnosed pathology. - Focusing on specific genes that are known to be upregulated in gastric and colorectal cancers we observe a clear increase of the H3K4me3 cfChIP signal in these patients compared to healthy reference (
FIG. 411 ). Among these genes we find the carcinoma markers CEACAM5 and CEACAM6. The protein products of these genes are used in an antibody-based assay for clinical cancer diagnosis. A second colorectal cancer marker, the long non-coding RNA CCAT1 (colorectal cancer associated transcript 1) shows strong signal in one of the cancer patients but not in healthy subjects. Another example is the long non-coding RNA EGFR-AS1 that mediates cancer addiction to EGFR and when highly expressed can render tumors insensitive to EGFR inhibition. While cfChIP signal for EGFR is detected in all cancers, EGFR-AS1 is only detected in C002 but not in other patients. This finding, which would not be detected by cfDNA mutation analysis, raises the exciting possibility that cfChIP can be informative for treatment choice beyond genomic mutations. -
TABLE 2 Cancer signatures Acute Myeloid Leukemia: ITGA4, RP11-1094M14.7, ITPK1-AS1, ABCE1, RP13-786C16.1, RP11-680G24.4, RPL23AP23, RP11-863H1.1, AC005757.6, GCNT6, RP1-308E4.1, SETP5, VPS13C, KRR1, RP11-252K23.2, RP11- 447H19.3, BRIP1, TTC24, RP11-306G20.1, ODCP, RP11-513G19.1, KB-1836B5.1, FTH1P4, RP11-182J1.5, ADAM1A, RP11-415J8.7, RP11-638I2.2, RP11-759F5.1, LDLRAD4-AS1, RPL13AP6, SPIN2A, CDC42-IT1, CCNA2, SKA3, RP11-330L19.2, MPHOSPH9, MGA, XRCC2, KCNA6, GAPDHP39, GUSBP3, GUSBP9, NEMP1, RP11- 932O9.10, BIVM-ERCC5, KB-1507C5.3, RP11-662B19.2, EAF2, RP11-766H1.1, TCTEX1D1, CAP2P1, RP11- 177C12.4, WDR12, RP11-434H14.1, SLC25A36, HNRNPA1P68, RP11-20L24.1, RP11-983P16.2, RP11-475E11.2, KIAA1524, CTC-260E6.6, AC002454.1, C5orf42, RP11-9E13.4, RP11-728K20.2, AC002543.2, PPAT, B3GALNT1P1, VN1R82P, KRT18P63, FAM196B, AC114730.2, RP4-633O19_A.1, EEF1A1P19, RIF1, SREK1, SMARCAD1, ERG, MTRNR2L3, STARD4-AS1, FCER1A, RP11-49O14.2, FANCB, IGLL1, HOXA9, TYMS, AC007919.19, RP3-431A14.4, UBE2D3P2, CTD-2349P21.5, CPXM1, ATP5L2, RPAP2, RP11-207C16.4, AC104634.3, RPL3P1, ZNF519, TAS2R12, AC002400.1, HSPA8P3, FBXL19-AS1, TUBG1P, RP11-392O18.2, RP11- 474L11.5, RPSAP51, GS1-259H13.7, LYST-AS1, RP11-142C4.4, RP11-169K17.2, RP11-491H9.3, BEND3P3, RP11- 165F24.3, NRBF2P5, GLTSCR1-AS1, RN7SKP80, TICRR, KNTC1, MOB1B, LA16c-361A3.3, SUMO2P8, NEIL3, PHBP2, IRGM, RP11-139E19.2, MYCBP2, FAM111B, RP11-118F19.1, C3orf80, SUGT1P3, CDCA7, HNRNPH1P1, PTH2R, RPL29P23, RP4-603I14.3, RP11-215A21.2, PSMA2P1, MAD2L1, RP1-168P16.2, RP1-168P16.1, BIRC6- AS1, RP11-231I16.1, RP11-615I2.1, RP11-731C17.2, SNORD17, RP3-472M2.2, SAMD11P1, RP11-430C7.2, AC010886.2, SACS, RAB40AL, HSPA8P15, RP13-46H24.1, GUCY1A3, RPS16P5, RN7SL505P, RP11-421E14.2, NRIP1, CLSPN, SMG1P3, SMG1P1, ZNF33BP1, ZDHHC21, ZNF141, DNTT, BCOR, PFN1P2, RP11-253I19.3, RP11- 288K12.1, RP11-540B6.2, RP11-666F17.2, FOXN3-AS2, C8orf17, BAHCC1, SNRPCP4, RP11-697N18.2, GMDS- AS1, ZFP14, RP11-443B20.1, SUCNR1, SNORD97, CCDC26, RP5-837J1.4, DISC1-IT1, GLUD1P3, NREP, RPS5P2, CBX5, CASP8AP2, RP11-180P8.3, HTR1F, CPA3, AC006116.22, RN7SL502P, RP11-493E3.1, HIST1H4C, ATAD5, ATAD2, GAPDHP2, RP11-16E23.4, RP11-184I16.3, RP11-1007J8.1, MTND5P25, RP11-497H17.1, BRD7P4, RP11- 319G9.3, RP11-319G9.5, MTND5P28, KCNQ1OT1, ZNF625, ZNF621, RP11-750H9.7, RPL21P65, CTD- 3037G24.3, RAD51AP1P1, RP11-792A8.3, WDR49, ANKRD26, RP11-381E24.1, RPL23AP18, RPL23AP11, LRRC37A4P, AC114776.1, SND1-IT1, TAS2R50, RP11-649E7.5, RP11-332O19.2, RP5-1099D15.1, RP11- 336A10.2, SCML2, RP11-393I2.2, CTB-50L17.5, RPS20P4, RP11-58K22.4, RP11-91A18.4, ZNF788, RP11- 430L17.1, AC002550.5, NAA25, ZBED4, ANKRD63, RN7SL334P, RP11-18F14.4, PSMD10P1, ASB9P1, RP6- 159A1.3, RPL21P123, RP11-252I14.1, DSCC1, CTB-26E19.1, RP11-177A2.5, RP11-318K15.2, RPS20P33, DNA2, SNORD3B-1, RP3-340B19.5, RAD51AP1, PPP1R26P1, RP11-124D2.7, EBLN2, RP11-511H9.3, AMIGO3, ZNF891, CERS6, CTC-265F19.2, ARGFXP2, VANGL1, ZNF708, DENND4C, GUSBP2, RFX8, RFX7, TGIF2P1, SATB1-AS1, SCOCP1, RNVU1-14, ZNF138, PPAN-P2RY11, RP11-720N19.2, RPL18AP2, RN7SL449P, H2AFZP1, AC013474.4, HSP90AB4P, ZNF367, RN7SL328P, RP11-489G11.3, RP11-477E3.2, UHRF1, WDHD1, RXFP2, ZHX1, DYTN, RN7SKP78, MANEA, EPHA1-AS1, RPL36AP15, RPL36AP13, RP11-574F11.3, IQCH-AS1, RN7SL778P, CHRFAM7A, AC108004.3, RP11-386I23.1, SNORD3B-2, B4GALT6, SOGA1, RP11-16P6.1, RP11-182I10.2, FOSB, WARS2-IT1, IL12A, ZNF675, ZNF678, RP3-497J21.1, PRSS57, KIF4A, RP11-453E17.4, EGOT, TPM3P1, RP11-493P1.2, LA16c- 390H2.4, NDC1, TRIM24, RP11-527J8.1, RP11-343B5.1, ZSCAN5D, C5orf66, C8orf44-SGK3, ACTG1P20, RP11- 90L20.2, TARBP1, RP11-685N10.1, RPL9P2, ANKRD36, AC007679.4, AE000661.50, SLC24A1, ANKRD36C, ANKRD36B, LPAR4, HNRNPCP7, CTD-2012K14.6, RP6-91H8.3, NUTM2G, CTB-134F13.1, RPS23P6, RBM12B- AS1, RP11-571F15.2, CTD-2017C7.3, RP5-867C24.4, RP5-867C24.5, HMGB1P10, HMGB1P14, RP11-69L16.4, CTD-2528A14.5, NADK2-AS1, RP11-124N14.3, AC008440.5, RPL23AP64, UHRF1BP1, QSER1, LA16c-385E7.1, CTD-2026D20.2, ZNF736, ZNF737, AC007787.2, SCARNA7, RP11-452H21.4, RP11-452H21.2, SNHG4, AP001432.14, RP11-648O15.1, ZNF69, ZNF66, AC096921.2, ZBED3, ZBED6, KB-1958F4.1, RFC3, RP11-383J24.2, RP3-522J7.5, RP5-997D24.3, RNASEH2B-AS1, CTC-325H20.7, MYCN, PARPBP, CTC-379B2.4, RPS7P11, RP3- 323N1.2, AP000936.1, RPL9P3, IFNK, GLMN, RP13-93L13.2, SPNS3, KRBA2, KIF18B, ERCC6L, LL22NC03-2H8.4, TWISTNB, RP11-709A23.1, MTND6P4, TTF2, PUS7, STAG3L3, STAG3L2, TAS2R41, TAS2R46, RP11-271C24.3, KCNK17, RP11-665C16.1, ALG10, ALG11, RP11-184B22.2, AC073283.7, AC109631.1, CKAP2L, YBX1P3, DUTP1, SNORA12, GCSAML, RP11-484L8.1, DAPK1-IT1, GPR174, SNX25P1, RP1-315G1.1, PNPT1P1, RP11-466F5.3, WRN, AC007365.3, RP11-677N16.1, SPIN4, RP11-173A16.1, RP11-10L12.2, KIF14, KIF11, RP11-295B17.6, TAS2R43, TAS2R42, RP11-69H7.4, VPREB1, MMS22L, KIF4CP, ATP5G2P3, CSNK1G2P1, MDN1, TET1, CD38, ZNF321P, RP11-166B2.5, MTBP, SMC4, RN7SL413P, FAM179B, RN7SL182P, TAS2R62P, DLEU2L, SEC13P1, RN7SL614P, SNORA71B, ATP5HP4, RP11-815J21.4, RP11-815J21.1, ST8SIA6, RPL23AP30, POLE2, RPL7P7, ZNF566, FAM60BP, NDUFA3P2, MRPL37P1, SIAH2-AS1, RP13-580F15.2, RP11-710F7.3, RP11-497G19.1, RP5- 894A10.6, RP11-383G10.3, RP11-45M22.3, CENPK, CENPI, CENPF, CENPE, CENPQ, MYCBP2-AS1, RP11- 552C15.1, RAVER2, CTD-2291D10.4, PDXDC2P, FUT10, RP11-333E13.4, RPL21P4, RPL21P1, RPS20P15, ZNF491, NPM1P29, PPP5D1, FAM35CP, ZNF260, UQCRBP1, HNRNPA1P34, INTS2, RP11-214J9.1, SCAI, RP11-968A15.8, AC005682.5, PTTG3P, LUZP1, ZNF878, SPN, HMGB3P4, AC100830.4, RPS4XP14, RP11-431N15.2, RP1- 180M12.1, CEP128, SETP11, CHD9, MPO, MPL, ACOT11, YPEL5P2, TAS2R10, TAS2R13, SGK494, SPECC1L- ADORA2A, FAM72D, RP11-253I19.4, RP11-616K22.1, GJA1P1, SERPINH1P1, BRI3BP, URB2, TMEM14E, PGAM1P7, BRCA1, BRCA2, FAM208B, FLT3, RP11-697N18.3, CTC-458I2.2, POLQ, RP11-307L3.4, ATP8B4, LY75- CD302, RN7SL128P, ANKRD44-IT1, AC114763.1, CTB-181H17.1, RP11-83A16.1, VDAC1P11, AC002306.1, RP11- 557C18.3, RP11-411G7.2, CCNE2, CYTL1, AC008746.3, AC008746.5, RP11-403A21.3, SRGAP2-AS1, YEATS2- AS1, SGOL1, RP11-345I18.4, GAPDHP70, OGFOD1P1, AC124914.3, PPIAL4G, RPL7P49, ADPGK-AS1, RPL7AP2, AC007036.4, HERC2P10, BANF1P3, TMEM75, ZNF618, SMCR5, RP3-340H11.2, RP11-64K12.9, SSBP3-AS1, RN7SL698P, KANTR, KIF20B, MKI67, ZNF551, PDHA1P1, RP11-53B2.1, RP11-53B2.2, RP11-53B2.4, AC093732.1, AP000487.4, RP11-90O23.1, BNIP3P42, BNIP3P41, GABPB1-AS1, RCC2P6, ARF1P2, RP11- 302M6.5, AC074286.1, RP11-95J11.1, RP11-214K3.20, RP11-553K8.5, RP11-377G16.2, RP11-172F4.2, RP11- 158M2.2, RP11-133K1.7, MARK2P8, ISPD, FXNP2, RGS17P1, YWHAZP4, RP11-193I22.2, RP11-252K23.1, ARHGAP11A, ARHGAP11B, TIFAB, QRSL1P3, ASPM, CTC-425O23.2, RP11-197K3.1, CTC-527H23.1, AC004383.3, POU5F1P4, RPL17P40, RP11-384C4.6, RP11-473O4.4, MORF4L1P4, CTD-2587H24.10, ATP5F1P1, C1orf186, IPPKP1, KMT2A, CLEC11A, CRYM-AS1, ZNF718, ZNF714, ZNF717, PTPN2P1, SIGLEC6, CTA-339C12.1, ZNF43, TAS2R19, TAS2R14, FKTN, RP5-1154L15.1, MRPL53P1, RP11-18B3.3, XPO4, SLC15A2, RP11-16C1.3, ZNF100, RP11-474I11.8, RP11-474I11.7, FAM53B-AS1, AOX2P, RP11-40C6.2, DCUN1D2-AS, DDX11-AS1, TEC, LRRC58, HMGN2P20, FNBP1P1, CTD-2381F24.1, RP11-876N24.2, RP11-77K12.3, ZNF805, AE000661.37, ZNF37A, RP13- 487P22.1, MCM4, MCM2, AC007041.2, RP11-201O14.1, RP11-96D1.9, PTPRJ-AS1, KRR1P1, AC003104.1, ZNF101P2, AC079922.2, RP11-525G13.2, TAS2R60, RP11-440L14.4, RP11-293A21.2, CTC-451P13.1, CATIP-AS2, RP4-635A23.4, OSTCP1, CTD-2542L18.1, ASS1P1, ZNF888, SEPT7P1, SOX12, FTX, RP11-158G18.1, GPR151, CICP13, RP11-306O13.1, MBLAC2, KIAA0101, TMTC4, ZNF660, RP11-550A5.2, RPL31P58, RP11-1379J22.5, HPDL, AP001171.1, KLF4P1, RP11-632K20.8, RP11-480C16.1, RN7SL118P, BNIP3P10, RP11-438N5.2, RP11- 476B13.2, RP1-5O6.4, RPS6P16, OFD1P17, AC010522.1, NDUFAF4P4, ANKRD28, SLC35E1P1, RP11-75C10.7, PTX4, RP11-754B17.1, N4BP2, HEATR1, RN7SL262P, EXO1, FIGNL1, AC017104.2, ZNF415P1, RPS3AP47, RHOQP2, RC3H1-IT1, CCDC88A, RXFP4, EEF1A1P24, HMGB1P23, TRH, RP11-216N14.7, KIAA1586, CDC42P1, GPR52, SRSF10P1, RAI1-AS1, PAICS, PUS7L, RP11-678G14.2, ATP6V0CP1, FDPSP4, POLA1, EIF5AP3, KIAA1919, SCARNA9, RP4-591N18.2, SLC7A5P1, TRGV1, AC011933.2, RP11-35J23.1, DTL, RP11-702F3.1, CYB5RL, AURKAPS1, SOX4, RP6-99M1.3, RPS12P26, ZNF283, RP11-1023L17.1, ZRSR1, AC003989.4, FLJ42393, RP11- 981G7.1, RP11-46A10.8, RP11-276H1.3, RP11-276H1.2, MCM10, GPR75, GJA9, RP11-56B16.1, RP5-916O11.3, ZNF852, ZNF850, ZNF326, RPS2P45, RYKP1, LPPR3, RP11-305O6.3, HELLS, TCEB3C, TGIF2-C20orf24, AKAP9, RP11-426C22.4, CTD-3060P21.1, TAS2R4, TAS2R3, CCDC144B, P2RY4, HOXB-AS2, ANGPT1, ZNF91, ZNF92, SPICE1, TAS2R30, TAS2R31, AC136289.1, RIMKLBP1, ZRANB3, HNRNPA1P37, DNMT3B, RP11-480I12.5, E2F8, RP11-212F11.1, RP11-290D2.3, ZZZ3, ZNF439, COQ10BP2, RPS12P27, RP11-69M1.4, PAPD7, LRBA, RP1- 168P16.3, RPS15AP10, RPS15AP16, RP11-874G11.1, RP11-417O11.5, RP11-312J18.3, LYRM7, RPL21P10, ARL5C, RP11-420A6.2, RP11-797H7.1, RPS17P13, SNRPCP19, CCDC14, CCDC18, RABGAP1L-IT1, HIGD1AP16, ZNF280D, RP11-69M1.3, RBL1, KIT, BNIP3P25, RP1-91J24.3, FAM124B, POU5F1P6, SHMT1P1, RPL7AP3, RP11- 157K17.5, UTP20, IGF2BP2-AS1, RP11-565F19.4, RP6-109B7.4, RP11-384B12.3, RP11-264L1.4, ULK4P1, ZNF221, RP11-17G12.2, RP11-402L5.1, MIB1, AC019097.7, GS1-184P14.2, CTB-102L5.7, RN7SKP180, RN7SKP185, MKRN5P, HTATSF1P2, STXBP4, ERVFRD-1, CCNJP2, CDK1, CDK6, PIN4P1, ZNF573, STIL, NTAN1P2, RPS15AP6, PI4K2B, GSE1, RP13-1056D16.2, TFEC, RN7SKP237, RP11-198M15.1, RP11-544A12.5, RN7SL743P, EEF1B2P1, SSBP2, KNOP1P4, RP11-496H15.2, SH3GL1P2, ZNF772, RP11-347P5.1, CTD-2331H12.7, BMI1, RP1- 56K13.2, RLIMP1, ZNF26, ZNF28, SLC9A7, RP11-43D4.2, CTD-2530N21.5, BDP1, ERMP1, AP001625.6, C1QL3, NPM1P19, ZNF253, RP11-2J18.1, RPL5P11, AC010240.2, RP11-265B8.5, PRR11, BMS1P1, WDPCP, AC066692.3, CHCHD4P5, RP11-576N17.3, CTD-2184D3.7, MIS18BP1, RP1-256G22.2, CLDN20, SNRPGP18, RP11-10N23.2, ORC1, PGAM1P11, TCEB1P33, RP11-477J21.6, RP4-591B8.2, PPIAP16, C20orf197, ATP6V0E1P1, RBM41, AC005546.2, RP11-307P22.1, CHEK1, PLD4, ALG10B, RP11-556E13.1, ZNF460, NUCB2, SRP72P2, TAS2R63P, SEC14L1P1, TDGF1P6, TDGF1P5, RP11-798K23.5, RP11-274B21.1, ZNF124, ZNF121, TRBVB, ACAP2-IT1, RP11- 16C1.2, RP11-539G18.1, PIGFP2, RP11-631M6.3, RP11-1250I15.3, MSH2, PM20D2, DBIL5P2, ZC3HAV1L, BTF3P12, ANKRD36BP1, RP11-540O11.4, RP11-540O11.6, RP11-102M11.1, ZNF681, TUBB8P1, RP4-614O4.11, SMC2, MPRIP-AS1, RP11-75C10.9, IQGAP3, RP11-12A2.1, RP1-209B5.2, MFSD1P1, UHRF2P1, RP11-356M20.1, CHML, MYB, SPC25, RP11-117N2.2, AC006042.8, RP11-384B12.2, SLC25A6P5, RP1-29C18.9, RP11-343C2.9, SHPRH, FASTKD1, VPS26BP1, CTC-435M10.10, GAPDHP62, KAT6B, RP11-5P18.1, RPL7P52, OCLM, AC005264.2, RP11-488L18.3, RP11-584P21.4, CTB-193M12.1, RP11-96D1.5, SLC2A3P2, RP11-571I18.5, KIAA0125, RP1- 224A6.8, RP11-346C16.1, EIF4BP5, ZNF724P, GAS5-AS1, FCF1P5, FCF1P7, ZRANB2-AS1, RP11-15F12.1, AP001469.5, USP41, RP11-118E18.4, RP11-797A18.6, RP11-830F9.7, RN7SL174P, RP1-278E11.3, PROX2, CTC- 244M17.1, ZMYM1, RP11-603K19.1, RP4-534N18.2, TYRO3P, KRT18P57, PDC, THRA1/BTR, TSEN2, ESCO2, DCAF13P3, AC010642.2, TRPC5, PHF14, LATS2-AS1, COL24A1, ALG1L2, AC004893.10, DLEU1, RP11-342M21.2, BCL11A, DPY19L4, RP4-752I6.1, RBMS2P1, RP11-674I16.1, TATDN1P1, ZNF726, AK3P5, ZNF70, E2F3-IT1, RP11- 697N18.4, BTBD8, DIAPH2-AS1. Adrenocortical Cancer: EBP, FAM64A, C1orf53, EPB41L1, TSPAN12, RP5-940J5.9, CTSA, NR5A1, GSTA3, TNFRSF12A, PPIF, ZNF275, RP11-215A21.2, NANOGP1, MINOS1-NBL1, SERPINA5, RP11-649E7.5, FAM222A, PLBD2, SRRM3, LDLR, AGRP, FAM19A4, LONP1, GRTP1-AS1, DBP, HSPE1, RP11-688G15.3, NPTX2, KLHDC8B, TECRP1, SPINK13, C1orf233, MGARP, RP11-320G24.1, BST2, UBE2C, RP11-40C6.2, FAM131C, FDXR, SCARB1, PRDX2, HMGCR, SLC26A2, NETO2, C19orf48, RP1-241P17.4, RP11-159K7.2, QPRT, MAMSTR, TMEM229B, PTPRH, AC009784.3, FAM43B, ESM1, AMHR2, ULBP2, RPS2P55, NHLRC4, CTC-518P12.6, MSMO1, TBX3, DHCR7, NR0B1, CYB5B, POR, DLK1, SRXN1, ANO4, FAM195A. Brain Lower Grade Glioma: RP11-229P13.23, RP5-940J5.9, FERMT1, TAAR3, RP11-215A21.2, RP11-64K12.4, NUDT4P1, RP11-698N11.2, UHRF1, RP4-791C19.1, RP11-565P22.6, AC000089.3, VIPR2, BTBD17, MDFI, NMB, BMP2, RP5-1177M21.1, SHD, DLL3, CX3CR1, MAGED4, NES, BCHE, RPE65, RP11-231C18.1, SOX2OT_exon3, PAX1, RP11-40C6.2, FREM3, GSX1, SOX11, HES6, PCDHB9, NDST4, TREM2, RP1-241P17.4, SOX4, KLRC2, SMOC1, PIGY, PAPD7, RPS2P55, HAPLN1, BCAN, GPR34, MEX3A, MKRN3-AS1. Breast Invasive Carcinoma: FAM64A, NKAIN1, RP11-229P13.23, IL20, CRABP2, NAT1, RP5-940J5.9, HMGB3, RRM2, RIPPLY3, MXRA5, LRRC15, FAM111B, CHRNA6, COL1A1, RP11-649E7.5, AC005255.3, CST2, CST1, UHRF1, EPYC, KRT18, COL11A1, KIF4A, AGR3, HIST1H3G, PAFAH1B3, KIF26B, UBD, TK1, ERCC6L, ANXA9, MMP10, MMP11, MMP13, GALNT6, RP11-400N13.3, CENPF, GRP, UBE2MP1, MKI67, POSTN, CXCL9, UBE2C, RET, YWHAZP4, HLA-DPB2, ERVMER34-1, RP11-40C6.2, TPD52, RP11-10G12.1, FOXA1, KIAA0101, LDHAP7, TREM2, SRMS, INHBA, MAL2, NUSAP1, PIGY, TGIF2-C20orf24, RP11-579D7.2, COL10A1, PAPD7, CXCL11, CDK1, CENPA, PI4K2B, TLR7, CTC-518P12.6, ESR1, PRR11, CCR8, CACNG4, ESRP1, SPC25, RP11-5P18.5, RP11- 379F12.4, ABCC11, CXCL10, SAPCD2, MEX3A. Colon Adenocarcinoma: ITGA2, POU5F1B, GCNT1, RP11-357H14.17, FAM64A, CHMP4C, RP11-229P13.23, SKA3, LAMB3, RP5-940J5.9, LY6G6D, RBBP8NL, HMGB3, SEL1L3, CKMT1A, BCL2L15, CEP55, ATP10B, TRABD2A, TDGF1, FERMT1, CDH17, HTR1D, RRM2, KRT8P45, NEIL3, MAP7, FAM111B, RP11-93K22.13, ILDR1, CDCA7, RP11-215A21.2, MAD2L1, CEMIP, EPCAM, ANKS4B, AC012363.4, PPP1R14D, VWA2, MUC13, KRTCAP3, HKDC1, F2RL1, AP000439.3, AC007099.1, NOX1, AC005255.3, CLRN3, PRSS8, RAD51AP1, CDT1, CST1, ATP2C2, PHLDA2, EREG, RHPN2, C15orf48, CCL20, LLGL2, GAL3ST2, KRT8, HMGN2P17, UHRF1, EPYC, AURKB, MET, KRT18, ABHD17C, LY75, FAM83H, FAM84A, GMDS, LSR, KIF4A, AGR2, PRR15L, AP1S3, FUT3, MMP1, UBD, CASC9, TLCD1, NOS2, KIFC1, PARPBP, TMEM45B, TK1, LRRC31, CCNF, AC000089.3, KIF18B, ERCC6L, RP11- 150O12.3, RP11-150O12.6, MMP10, MMP11, MMP12, GGH, GALNT4, GALNT5, GALNT6, GALNT3, KIF11, ASCL2, HMGA1, C6orf222, C6orf223, PLS1, AREG, GRM8, HOXB8, HOXB9, MARVELD2, KRTAP4-1, RP11- 400N13.3, ENC1, CENPF, MACC1, STAP2, EPHB2, MELK, GPA33, CASC21, ETV4, LGALS4, CTB-191K22.5, C10orf91, GRIN2D, KRT8P48, B3GNT3, SLC44A3, SLC44A4, TPX2, MISP, BRI3BP, LY75-CD302, VIL1, TMEM211, TJP3, MKI67, GUCY2C, CXCL3, CXCL9, UBE2C, UBE2T, CCAT1, YWHAZP4, ARHGAP11A, TNFSF11, CBLC, AP003774.1, RPL36A-HNRNPH2, EPHX4, DSG2, RP11-40C6.2, RNF43, AP1M2, PRR15, MCM4, MCM2, VDR, FAM105A, SGPP2, PPP1R1B, RP11-187E13.1, CCNB1, LAMC2, RP11-10G12.1, NOXO1, RAB19, KIAA0101, DACH1, CRB3, HPDL, TRIM15, LDHAP7, AL163953.2, FXYD3, EXO1, RXFP4, TOR4A, RP11-284F21.7, CDH1, RP11-22L13.1, ECT2, ST14, NFE2L3, SRMS, SOX9, MAL2, NUSAP1, PLEKHG6, MCM10, MYBL2, PIGY, TGIF2- C20orf24, RP11-579D7.2, USH1C, RP11-480I12.5, COL10A1, E2F8, PYCR1, GPR160, CDX1, CDX2, CDC6, CLDN2, CLDN3, CLDN4, CLDN7, BIK, EDAR, AC123023.1, TMEM238, CXCL11, RPS2P55, CDK1, CENPM, CENPA, ANLN, RP11-44F14.2, C17orf77, PRR11, SLC12A2, CCR8, ESRP1, GINS2, CYP2S1, PLEK2, TRIM31-AS1, FOXP4-AS1, AC021218.2, TOX3, IQGAP3, MYB, SPC25, RP5-881L22.5, IHH, TSPAN8, BBOX1-AS1, MYEOV, ACSL5, TRIM31, TFF3, TMPRSS4, CEACAM1, CEACAM5, CEACAM6, ARHGEF35, STX19, CXCL10, SATB2, GPR35, GPX2, FEZF1- AS1, CDCP1, SAPCD2, MYO5B, DDC. Diffuse Large B-Cell Lymphoma: CKS1B, DLGAP5, MZB1, C16orf59, PLA2G7, RMI2, PCNA, DNPH1, CCNA2, SKA1, SKA3, WNT10A, PLA2G2D, ZWINT, AC023590.1, EAF2, PARP1, SNRPD2, SNRPD1, MND1, EEF1A1P19, LSM3, CEP55, TYMS, HSPA8P1, FCRLA, PDCD1LG2, KBTBD8, RRM2, RRM1, NEIL3, HLA-DOA, HLA-DRB6, HLA- DQA1, FAM111B, CDCA7, CDCA5, RP11-215A21.2, MAD2L1, MRPL13, MRPL14, MRPL17, MS4A1, CCL18, RASGRP3, RPS21, POU2AF1, RFC4, RP11-815N9.2, FEN1, RP4-706A16.3, IL21R, CXCR5, RP11-16K12.1, DSCC1, RAD51AP1, CDT1, RGS13, GTSE1, RPL13AP20, RP11-627K11.1, HMGN2P17, CCDC58, UHRF1, WDHD1, AURKB, KIF4A, WDR76, KB-1980E6.3, MRTO4, C12orf45, UBD, NCAPG, BCAS4, CD80, CD83, RFC3, KIFC1, PARPBP, TK1, CCDC167, RPS7P11, CCNF, AC000089.3, TOMM5, KIF18B, ERCC6L, CDC20, MMP12, CLECL1, KIF14, KIF11, METTL1, AC007381.3, DBI, RPL17-C18orf32, PRDX1, TRAF4, RPS3AP26, ELL3, FAM72B, FAM72A, EBI3, POLE2, MIXL1, C12orf77, CENPN, CENPK, CENPI, MELK, RP11-151F5.2, RP11-203B7.2, SPC24, BFSP2, TIMM8A, FAM72D, TPX2, BRI3BP, MCOLN2, SGOL1, SHCBP1, AC020743.2, RP11-624L12.1, MKI67, RPL39, SNRNP25, CXCL9, UBE2C, UBE2T, BIRC5, HAUS1, RFTN1, MTHFD1L, ARHGAP11A, TIFAB, ASPM, EEF1B2, HLA-DPB2, C1orf186, PAX5, RP11-40C6.2, RP11-360L9.7, NME1, NME2, MCM4, MCM2, HSPB11, MREG, AC079922.2, CCNB1, CTC-451P13.1, RP11-10G12.1, KIAA0101, SNRPEP2, AC079767.4, CD19, FUCA1, EXO1, RPS3AP47, RPL35P1, NPM3, BYSL, PAICS, RPL41, C19orf48, NAPSB, RPL22L1, IL4I1, MCM10, GGCT, MYBL2, TGIF2- C20orf24, FPR3, RPL36A, SNRPF, SNRPG, SNRPE, E2F8, RAB42, CDC6, CD40, CD79B, CD79A, IDO1, TMA16, CXCL11, RPS2P55, CDK1, CDK4, RPL17, CENPM, CENPA, SLAMF8, DTYMK, NOP16, CHEK1, GINS2, GINS3, BLNK, SPC25, CD70, CD72, CCL19, GCSAM, HJURP, CTD-2342N23.3, NDC80, RPS7, RPL26, RP1-278E11.3, DEPDC1B, SHFM1, ESCO2, HTRA4, SPIB, CXCL10, TUBBP1, BCL11A, BRIX1, RP11-253E3.1. Esophageal Carcinoma: ITGA2, ITGA6, DLGAP5, RP11-357H14.17, FAM64A, SKA3, LAMB3, RP11-397A16.1, BIVM-ERCC5, SIPA1L3, RNU1-28P, RAB27B, AC002543.2, HIST1H1B, CKMT1A, CEP55, FERMT1, RPSAP52, PI3, RRM2, RIPPLY3, MXRA5, NEIL3, PMAIP1, SAMD9, FAM111B, RP11-93K22.13, FOXM1, ILDR1, MUC4, KLK6, MAD2L1, PRDM1, SNORD17, COL1A1, CEMIP, AC012363.4, RP11-421E14.2, CLSPN, WNT7B, ENTPD7, HEPHL1, RSC1A1, POU6F2, CKAP2, HIST1H4C, CTD-2555C10.3, SERINC2, F2RL1, KLF5, SND1-IT1, NRARP, RNU1-27P, AC002550.5, RP4-594A5.1, RAD51AP1, CDT1, CST2, CST1, RP4-669H2.1, TGFA, PHLDA2, RHPN2, RNVU1-18, KCNK1, CCL20, CCL22, GAL3ST2, RIPK4, HSP90AB4P, TMEM189-UBE2V1, UHRF1, WDHD1, AURKB, AC073046.25, MET, RMRP, KRT17, KRT19, ABHD17C, RP11-108M12.3, LY75, KPNA7, FAM83H, LSR, KIF4A, CTD-2357A8.3, RP11-757F18.3, TMEM30B, LPAR5, HIST1H3G, ABCA13, SLC7A5, RP5-867C24.5, AP1S3, MMP1, KIF26B, JUP, UBD, CASC9, TLCD1, SCARNA7, RAET1E-AS1, PTBP3, SQLE, ZNF296, FOXP3, PARPBP, RP11- 161I6.2, CELSR1, CCNF, KIF18B, ERCC6L, RP11-709A23.1, AC011288.2, MMP10, MMP11, MMP12, MMP13, ALG1L, CKAP2L, RP1-315G1.1, EPPK1, GALNT6, GALNT3, KIF14, KIF11, IGF2BP3, IGF2BP2, NRG1, SPINT1, HMGA2, HMGA1, DSPP, FAM72B, RP11-8L2.1, FAM83H-AS1, MARVELD2, KRTAP4-1, POLE2, TOP2A, MRPL37P1, CENPK, CENPI, CENPF, CENPE, MACC1, RP4-694A7.2, EPHB2, MELK, MDFI, S100A7A, NCEH1, CORO2A, ETV4, ETV7, AC005537.2, MYO10, C10orf91, GRIN2D, B3GNT5, RP11-510J16.5, FAM72D, SLC44A5, TPX2, RP11-429J17.7, RP11-616K22.1, S100A7, BRI3BP, ZNF695, PGAM1P7, IFI6, BRCA2, KIF23, POLQ, LY75- CD302, CTB-181H17.1, RP11-69L16.6, SGOL1, ARNTL2, RPPH1, C2orf48, FSCN1, RP11-1103G16.1, KIF20B, KIF20A, MKI67, RP11-90O23.1, CXCL9, UBE2C, UBE2T, CCAT1, MARK2P8, TNIP3, ARHGAP11A, ASPM, POU5F1P4, CBLC, ERVMER34-1, DDR1, DSG2, RNF43, AP1M2, RP5-884M6.1, MCM4, MCM2, SGPP2, RP11- 201O14.1, BLM, TNFSF15, CCNB1, LAMC2, CTD-2542L18.1, RAB19, KIAA0101, AC010761.10, RP11-550A5.2, HPDL, CTD-3051D23.4, CTHRC1, FXYD3, IL22RA2, EXO1, E2F7, RP11-216N14.7, TTYH3, FAM83B, RP11- 284F21.7, CDH1, CDH3, NETO2, ECT2, ST14, NFE2L3, SRMS, SOX9, SOX4, MAL2, NUSAP1, PLEKHG6, FU42393, MCM10, MYBL2, HELLS, HIST1H2AM, RP11-626H12.2, COL10A1, E2F8, CXADR, PAPD7, OAS3, OAS2, CDC6, CLDN4, RP11-499F3.2, BIK, EDAR, FOSL1, LAD1, RP11-157K17.5, PRRG4, RP11-27M24.1, ULBP2, ULBP3, F11R, TMEM238, KDELR3, CXCL11, GS1-184P14.2, CTB-102L5.7, DEPDC1, CDK1, STIL, CENPA, ANLN, RN7SL4P, RP11- 44F14.2, CTC-480C2.1, AC010240.2, PRR11, PRR7, GJD3, CCR8, CDS1, ESRP1, RBM47, GINS4, CYP2S1, ZNF460, RNU1-1, SAMD12, RP11-16C1.2, PLEK2, FRK, FOXP4-AS1, CSF2, IQGAP3, GYLTL1B, SPC25, C1orf106, OCLN, BBOX1-AS1, DDIAS, MYEOV, TMPRSS4, DEPDC1B, APOBEC3B, TYRO3P, CEACAM5, ARHGEF35, ESCO2, CXCL10, GPX2, FEZF1-AS1, CDCP1, SAPCD2, MYO5B, ZNRF2P1, SLC7A11, GNGT1. Glioblastoma Multiforme: FAM64A, OTP, MSMP, TUBA1C, TNFRSF12A, RRM2, NEIL3, TAAR3, DMRTA2, FAM111B, CHRNA9, CRISPLD1, CHI3L1, RP11-351I24.3, HIST1H4J, SPINK8, HSPE1-MOB4, RP11-134N1.2, ADORA3, IGFBP2, GAL3ST4, TMEM189-UBE2V1, RP11-698N11.2, UHRF1, WARS2-IT1, KIF4A, F2R, TK1, RP11- 565P22.2, TMSB15A, MEST, AC092675.3, RP11-84A19.3, TP53, CENPI, MDFI, IBSP, NMB, SPC24, PDPN, CX3CR1, NES, MKI67, HAS2, UBE2C, RPE65, SPOCD1, CDKN2C, MCM2, GSX1, AC114803.3, KIAA0101, C21orf62, EGFR, TREM2, NUSAP1, HRH1, MCM10, PIGY, TGIF2-C20orf24, RAB42, CCDC109B, CENPA, TLR7, PRR11, CHI3L2, ID3, GPR82, SPC25, CXCL10, GPR34, MEX3A, GBP1. Head & Neck Squamous Cell Carcinoma : ITGA2, ITGA3, ITGA6, CKS1B, ARTN, FAM64A, PCAT1, RP11- 229P13.23, ADAM12, CCNA2, WNT10A, TM4SF19, LAMB3, HN1L, RP11-397A16.1, IL20, DDX60, SERPINB5, RP5-940J5.9, KCTD11, TUBA1C, RP11-334L9.1, BCL2L12, RP13-463N16.6, CEP55, CTSC, AMMECR1, GPR68, OTOP3, TP63, FERMT1, TNFRSF12A, RPSAP52, PTHLH, PI3, EPHA2, RRM2, MXRA5, IGSF3, ITGB4, ITGB6, NEIL3, PMAIP1, CTA-384D8.35, SAMD9, LRRC15, FAM111B, FOXM1, CDCA4, KLK9, RP11-215A21.2, PRDM1, EPGN, COL1A1, C19orf33, KLK10, TMED7-TICAM2, MINOS1-NBL1, IL1A, KRT6B, KRT6C, KRT6A, DSC2, WNT7B, ENTPD7, CGB8, HEPHL1, CKAP4, CTD-2555C10.3, SERINC2, F2RL2, NPBWR1, DEFB4B, DEFB4A, AC007879.7, S100A10, S100A16, CDT1, CST1, VANGL1, IGFL1, PLAU, PHLDA2, SOWAHC, AKR1B15, GNA15, CCL20, CCL22, RND3, LMNA, TMEM102, KRT5, RIPK4, TMEM189-UBE2V1, C1orf74, LGALS3BP, UHRF1, AURKB, CD276, MET, KRT17, KRT16, KRT14, RP4-791C19.1, KPNA7, B4GALT1, FAM83H, LSR, IFIH1, KIF4A, CALML3, CTD-2357A8.3, FZD6, AJUBA, LPAR3, HIST1H3G, RP11-21B23.2, SFN, MMP1, JUP, UBD, CASC9, TNFRSF10A, CLCA2, APOL1, TNPO1P3, COL5A2, PTBP3, KIFC1, RP11-161I6.2, TK1, CELSR1, GSTP1, AC000089.3, RP11-404P21.8, KIF18B, ERCC6L, AC011288.2, ANXA1, ANXA2, ANXA8, PGM2, MMP14, MMP10, MMP11, MMP12, MMP13, ALG1L, CKAP2L, WNT5A, TCF19, GALNT6, GALNT3, KIF14, KIF11, IGF2BP3, IGF2BP2, CD109, LOXL2, SLFN5, CTD- 2008L17.1, HMGA2, AREG, IFI27, FAT2, RHOD, SNAI2, AC007389.3, KRTAP4-1, TPRXL, POLE2, RNASE7, RP11- 400N13.3, SDC1, CENPI, CENPF, RP4-694A7.2, MELK, MDFI, ITPRIPL2, TENM2, RTP4, S100A7A, HNRNPA1P33, MB21D1, ETV4, SPHK1, SPC24, C10orf99, C10orf91, ADH7, B3GNT5, KRT75, PDPN, FST, TPX2, S100A7, S100A2, KIF23, LY75-CD302, RP11-69L16.6, ARNTL2, FRRS1, RP11-104E19.1, TM4SF19-TCTEX1D2, KIF20A, MKI67, HAS3, ARSI, CXCL9, UBE2C, BIRC5, SPRR1B, DTX3L, ARSJ, TGFBI, AC010677.5, TNIP3, GBP1P1, YWHAZP4, EPSTI1, ARHGAP11A, MFAP2, TNFSF10, ASPM, TUBB6, TNC, ERVMER34-1, DSG2, DSG3, RP11-40C6.2, RAP2B, CDKN2B, CCL7, RP5-884M6.1, MCM5, MCM2, MICALL1, FAM129B, RASSF10, AC079922.2, CCNB1, LAMC2, HES2, TM4SF1, IER3, KIAA0101, EXT1, EHD4, CTHRC1, TRIM16, LDHAP7, FXYD3, KREMEN2, IL22RA2, CD1B, EXO1, E2F7, LRRC8E, FAM83A, RPL35P1, TP73, CDH1, CDH3, ECT2, ST14, RP1-241P17.4, SLC1A5, INHBA, CD9, NUSAP1, ABCC1, SERPINH1, MCM10, MYBL2, C16orf74, PIGY, TGIF2-C20orf24, RAET1L, ZDHHC12, COL10A1, E2F8, IL36G, IL12RB2, PAPD7, OAS1, OAS3, OAS2, PTGFRN, CDC6, ISG15, IRF6, FOSL1, LAD1, PRRG4, CD44, GAST, IDO1, KRT16P5, ULBP2, ULBP3, F11R, CCDC109B, CXCL11, RPS2P55, CLCA3P, CDK1, CDK2, CENPA, CTC- 518P12.6, RP11-44F14.2, PRR11, CCR8, PABPC1, ESRP1, RP11-69G7.1, FGFBP1, PLEK2, GPR87, PROM2, CSF2, GYLTL1B, SPC25, ANXA2P2, LAMA3, LY6D, BBOX1-AS1, AMTN, RAB38, BDKRB1, RP11-483L5.1, TMPRSS4, ANO1, COL3A1, APOBEC3B, KLHDC7B, SLC2A1, SPRR2D, SPRR2A, SPRR2B, EDARADD, BNC1, CXCL10, LCE3D, LCE3E, CDCP1, EFNB1, TGIF1, GBP1, GBP6, GNGT1. Kidney Chromophobe: RP11-2N1.2, RP11-215A21.2, KLK15, DNTT, CR1L, AC106869.2, AC005255.3, RP4- 791C19.1, RP11-459C13.1, NCOA7-AS1, PSG9, KRTAP5-3, RP11-40C6.2, RP11-1E11.1. Kidney Clear Cell Carcinoma: AP000439.3, RP11-283G6.5, FABP7, RP11-2L8.1, RP11-40C6.2, AC114803.3, RP11-10G12.1, LDHAP7, PI4K2B. Liver Hepatocellular Carcinoma: GPC3, RP11-334L9.1, RRM2, FAM111B, RP11-215A21.2, FTH1P20, AC005255.3, KIF4A, UBD, ALG1L, SPC24, RP11-40C6.2, RP11-10G12.1, PIGY, CTC-518P12.6, RP11-556E13.1, ACSL4, CXCL10. Lung Adenocarcinoma: IGKV1-12, RP11-229P13.23, RP5-940J5.9, HMGB3, RP11-334L9.1, HS6ST2, RRM2, LRRC15, FAM111B, RP11-93K22.13, EPCAM, RP11-649E7.5, RP11-350J20.12, AC005255.3, CST1, HMGN2P17, UHRF1, EPYC, KPNA7, BPIFA1, KIF4A, AGR2, ARAP1-AS1, MMP1, UBD, CASC9, RP3-407E4.4, MMP11, MMP12, MMP13, ALG1L, KRTAP4-1, ETV4, PAEP, UBE2C, YWHAZP4, DSG2, RP11-40C6.2, SGPP2, RP11-10G12.1, KIAA0101, CTHRC1, LDHAP7, IL22RA2, RP1-241P17.4, MCM10, RP11-579D7.2, SMPDL3B, RP3-340N1.2, COL10A1, E2F8, PYCR1, RPS2P55, CENPA, CTC-518P12.6, PRR11, LGSN, HTR3A, CEACAM5, AFAP1-AS1, FEZF1- AS1, MEX3A. Lung Squamous Cell Carcinoma: ARTN, FAM64A, RP11-229P13.23, ADAM12, CCNA2, RP11-397A16.1, RP5- 940J5.9, HMGB3, BMP7, RP13-463N16.6, CEP55, TP63, FERMT1, PTHLH, RRM2, PVRL1, TICRR, NEIL3, LRRC15, FAM111B, CHRNA5, FOXM1, CDCA7, RP11-215A21.2, MAD2L1, PCDH19, KRT6B, WNT7B, POU6F2, NPBWR1, NKX1-2, RP11-649E7.5, RP4-594A5.1, RAD51AP1, CDT1, CST1, IGFBP2, UHRF1, AURKB, ZP3, KRT17, KRT16, AF127577.8, COL11A1, RP11-108M12.3, IGHV1-69, KIF4A, HIST1H3G, ABCA13, MMP1, UBD, CASC9, CLCA2, KIFC1, PARPBP, RP11-161I6.2, TK1, KIF18B, ERCC6L, AC011288.2, MMP10, MMP11, MMP12, MMP13, ALG1L, CKAP2L, KIF14, KIF11, IGF2BP3, CTD-2008L17.1, HMGA2, RP11-8L2.1, FAM83H-AS1, KRTAP4-1, SDC1, MIXL1, CENPI, CENPF, MELK, S100A7A, ETV4, AC005537.2, SPC24, RP11-742B18.1, ADH7, RP11-510J16.5, TPX2, S100A2, KIF23, RP11-69L16.6, RP11-408B11.2, ARNTL2, ADAMTS20, KIF20A, MKI67, HAS3, UBE2C, UBE2T, BIRC5, CCAT1, KC6, YWHAZP4, ARHGAP11A, ASPM, DSG2, RP11-40C6.2, CDKN2A, MCM4, MCM2, CCNB1, RP11-10G12.1, KIAA0101, CTHRC1, LDHAP7, SNRPEP2, AKR1C2, NXPH4, IL22RA2, EXO1, E2F7, TP73, CDH3, ECT2, RP1-241P17.4, SOX2, NUSAP1, MCM10, MYBL2, C16orf74, PIGY, TGIF2-C20orf24, COL10A1, E2F8, PAPD7, PTGFRN, CDC6, RP11-499F3.2, BIK, GAST, RPS2P55, CDK1, CENPM, CENPA, ANLN, CTC-518P12.6, SLAMF9, CTC-480C2.1, PRR11, RP11-657O9.1, CHEK1, GINS4, CYP2S1, GPR87, GYLTL1B, SPC25, BBOX1-AS1, TMPRSS4, APOBEC3B, SLC2A1, EDARADD, NTS, FEZF1-AS1, SAPCD2, MEX3A, GBP6, GNGT1. Ovarian Serous Cystadenocarcinoma: CKS1B, PHOX2A, KRTAP2-3, SRD5A3, NACC1, FAM64A, CNGB1, RP11- 624M8.1, WNT10A, RP5-1065P14.2, CITED4, CRABP2, TTC30B, KLHL14, LDLRAD1, TUBA1C, RNU1-28P, HMGB3, AC144450.2, LYPD1, HIST1H1B, AC004870.4, CEP55, LGALS17A, PNOC, USP18, TNFRSF12A, SCNN1A, RRM2, RIPPLY3, MXRA5, IGSF9, BASP1P1, NEIL3, MESP2, FAM111B, RP11-93K22.13, FOXM1, CTC-513N18.6, ILDR1, PTH2R, KLK6, KLK7, KLK8, MAD2L1, SNORD17, C19orf33, EPCAM, KLK10, PPP1R14B, ALPPL2, OCIAD2, CHI3L1, MUC16, LRRN2, WNT7A, TMEM139, SCGB1D1, KRTCAP3, HIST1H4C, HOXD1, S100A5, WFDC2, RNU1- 27P, GLOD5, RPS26P47, LAPTM4B, PGAM1P4, SNORD15B, RAD51AP1, CDT1, CST1, CST5, PART1, STON2, OXGR1, PHLDA2, CDR2L, MSLNL, AC012123.1, RHPN2, RNVU1-18, CCL20, GAL3ST2, TMEM102, KRT8, RP11- 806H10.4, TMEM189-UBE2V1, AURKB, RMRP, KRT18, AF127577.8, RN7SL138P, FAM83H, SCGB2A1, FAM84B, LSR, KIF4A, AGRN, RP11-710M11.1, LPAR3, HIST1H3B, HIST1H3G, CMPK2, ATP5J2, VTCN1, MMP7, CASC9, FGF18, TLCD1, SCARNA7, KIFC1, PARPBP, TK1, MSLN, CELSR2, CELSR1, CCDC167, THEM6, RP11-608O21.1, RP11-565P22.6, RP11-565P22.2, KIF18B, ERCC6L, XPR1, HUNK, SBK1, KCNK15, RHPN1-AS1, SNORA12, ARL4C, GALNT6, GALNT3, KIF11, RP11-3K16.2, SPINT1, HMGA2, HMGA1, RP11-283G6.3, RP11-283G6.5, RP11-3J1.1, LYPD6, IFI27, EYA2, HOXB2, HOXB7, HOXB4, NR2F6, FAM72B, AC007389.3, ST8SIA2, PCDH1, NPW, CENPF, RP11-688G15.3, MELK, XAGE2, RTP4, SPON1, ETV4, THSD4, SNORA73B, TMC4, AC005537.2, SPC24, C10orf91, C10orf95, RP11-349N19.2, PARD6B, SNORA47, TPX2, RP11-429J17.7, FOXJ1, RP11-13K12.5, BRI3BP, ZNF695, IFI6, CRIP1, CCNE1, UNC5B-AS1, RP11-408B11.2, RPPH1, PAEP, C1orf233, C2orf48, FAM103A2P, KIF20A, MKI67, RP11-90O23.1, ARSH, CXCL9, RCC2P6, UBE2C, UBE2T, F8A1, HIST1H2BH, VAMP8, RP11-468N14.13, C1orf186, FOLR1, EPHX4, RNU4-2, RNU4-1, UCP2, OVOL2, LRRC55, AP1M2, TLR8-AS1, CDKN2A, CCL7, CCL8, MCM2, SGPP2, ZBTB42, RASSF10, CCNB1, TPD52, LAMC2, SOX17, C2orf15, KIAA0101, C9orf16, CRB2, HPDL, CTD-3051D23.4, CXXC5, CTHRC1, NXPH4, KREMEN2, LRRTM1, CDH6, TREM2, ECT2, ST14, MAL2, CD9, NUSAP1, UPK3B, MCM10, MYBL2, TGIF2-C20orf24, ATP6V1B1, RP11-579D7.2, FPR3, SMPDL3B, HIST1H2AM, ZDHHC12, COL10A1, E2F3, E2F8, RAB42, PAPD7, OAS1, OAS3, OAS2, ISG15, CLDN3, CLDN4, CLDN6, CLDN7, RP11-499F3.2, BIK, CMTM7, RP3-508I15.18, VCAN-AS1, RP11-27M24.1, IDO1, GRB7, TMEM238, KDELR3, CXCL11, GS1-184P14.2, CDK1, CENPA, NHLRC1, PRR11, PRR7, GJD3, FOXI3, OBP2A, RP11-657O9.1, ESRP1, RP11-231N3.1, ABHD11, RNU1-1, SAMD12, SAMD10, RP1-80N2.3, IQGAP3, GYLTL1B, SPC25, HTR3A, BHLHE41, LY6E, BBOX1-AS1, DOK5, FUT8-AS1, KRT87P, TMPRSS3, BCAM, CXCL10, VGLL1, CLDN16, SAPCD2, RP11-323N12.5. Pancreatic Adenocarcinoma: ITGA2, GCNT3, LIF, ADAM12, LAMB3, TIMP1, RP5-940J5.9, RP13-463N16.6, CTSE, STRA6, RRM2, MXRA5, ITGB6, HLA-DRB6, LRRC15, FAM111B, KLK6, SFRP2, COL1A2, COL1A1, CEMIP, C19orf33, CCL17, WNT7A, CGB8, CGB5, NREP, F2RL2, S100A6, RP11-350J20.12, LCN2, AC005255.3, S100A10, CST2, CST1, CST4, PLAU, PLAT, PHLDA2, IGFBP3, VILL, KCNK1, CCL20, CCL22, SULF1, FNDC1, MATN3, EPYC, GABRP, KRT19, COL11A1, LY75, ONECUT3, KPNA7, RP5-907D15.4, KIF4A, TMSB10, AGR2, F2R, CTD-2357A8.3, ARAP1-AS1, MMP7, MMP1, KIF26B, UBD, CASC9, FGF19, APOL1, COL5A2, COL5A1, DUOXA2, MSLN, ZPLD1, RP11-334E6.12, MMP10, MMP11, MMP12, MMP13, GALNT5, IFI27, ADAMTS12, RP11-400N13.3, OLFML2B, PMCH, MDFI, UGT1A10, GRP, TMC5, BMP4, CTB-191K22.5, GRIN2D, B3GNT3, HLA-DRB5, SLC44A4, LUM, IFI6, KERA, LY75-CD302, UBE2MP1, MKI67, POSTN, CXCL5, CXCL9, UBE2C, CCAT1, YWHAZP4, EPSTI1, TNFSF11, TNFSF18, HLA-DPB2, NPSR1, RP11-40C6.2, RP5-884M6.1, SGPP2, P4HA3, LAMC2, ZNF469, RP11-10G12.1, CTHRC1, TNFRSF6B, FXYD3, WISP1, IL22RA2, CD1B, TREM2, INHBA, PIGY, TGIF2-C20orf24, RP11-579D7.2, RP11-626H12.1, COL10A1, OAS1, ISG15, CLDN2, BIK, EDAR, FOSL1, CTC-518P12.6, PRR11, CCR8, PLEK2, RP11- 462L8.1, CSF2, SYT13, ANXA2P2, LAMA3, CCL19, MYEOV, TRIM31, TMPRSS4, ANO1, COL3A1, CEACAM5, CEACAM6, HTRA3, CXCL10, AFAP1-AS1, FEZF1-AS1, DIAPH2-AS1. Pheochromocytoma & Paraganglioma: CHST1, PHOX2A, PHOX2B, SEZ6L, BEX1, SLC18A2, SLC18A1, SYT1, SYT2, SYT4, SYT5, CNGB1, UNC5B, VGF, UNC5A, PCSK2, DYNC1I1, ASTN2, PCDHAC2, PCDHAC1, H2AFY2, FAIM2, RP5- 940J5.9, AC005944.2, DCX, FLRT1, RP11-334L9.1, BMP7, RP11-124O11.1, RP11-294J22.6, HAND2, PENK, TMEM35, DLX1, DLX2, STRA6, HS6ST2, FAM163A, HAND2-AS1, SMIM18, EPHA8, GLB1L3, TH, CRYBA2, RIPPLY2, C18orf42, CHRNA5, CHRNA7, CHRNA3, PRPH, GDAP1, IGF2, PCDHA11, PCDHA13, RP11-64K12.4, ISL1, LRRN2, TMEM130, AC106869.2, DRD2, MAGEE2, SLC29A4, RP4-607I7.1, CDH18, PCSK1, NPBWR2, FAM163B, RCOR2, RTBDN, TPBGL, GCH1, BSCL2, ADGRA1, SV2C, GPR19, RP4-555D20.4, HDAC9, GABRB3, C14orf132, KCNK9, TMEM132D, LRRC4C, NACAD, TFAP2B, VAX2, RP11-148L24.1, IL13RA2, HS3ST2, ROBO2, RP11- 256P1.1, CPE, CHRFAM7A, GNG4, GNG8, RAB39B, GABRQ, MMD, SLC24A2, ARAP1-AS1, HMP19, FBLL1, SCN3A, SCN3B, AMER3, RP11-161D15.3, KIF26A, SEZ6L2, FGF14, SLC8A2, FAM19A3, CACNA2D3, REEP2, FEV, DISP2, KCNH5, KCNH1, KCNH2, MYCN, RP11-334E6.12, NTNG1, GPR176, UCHL1, CDH22, PRCD, RP11- 1018N14.5, GALNT6, NEUROD4, NEFM, NRG1, RGS4, DBH, CYGB, RP11-445N20.3, RP11-8L2.1, ST8SIA3, ST8SIA2, SNAP25-AS1, NPY, AP006547.3, CDK5R2, SLC10A4, INA, MFI2, CHRNB2, CHRNB4, TCEAL7, ATP1A3, BEGAIN, PTCHD4, CPLX2, GALR1, RP11-588K22.2, FAM155A, DBN1, B3GNT4, GNAS, SHF, RP11-272L13.3, DRGX, NEFL, UBE2MP1, GLRA1, ALCAM, TUB, NALCN, CACNG2, RP11-248N22.1, ARSF, RET, RP11-12M5.3, GPR22, KCNMB2, CARTPT, ATP1B1, TLX2, TLX3, TUBB3, CHGA, CHGB, RP11-40C6.2, L1CAM, LMO1, MARCH4, TIAM1, VSTM2L, VSTM2A, OPRD1, IPO8P1, KCNB2, KCNB1, CNTNAP5, GNAS-AS1, GPR158, RP11-10G12.1, RAB15, INSM2, INSM1, GFRA2, NXPH4, CTB-158E9.1, CD200, IGF2-AS, LRRTM4, SCG5, SCG3, SCG2, PROKR1, RP1L1, SLC6A2, SPOCK2, SPOCK3, C2orf91, EGFLAM, MLLT11, MYT1, RP11-662M24.2, CAMK4, HMGCLL1, IGFBPL1, NTRK1, PRLHR, PCLO, RP11-579D7.2, NDUFA4L2, TANC2, ELAVL4, GPR162, CTD-2562J17.4, ACOT7, SLC35D3, IL1RAPL2, PTPRN, PCDHB10, TMIE, KIAA2022, STAC, ESM1, PAK3, KCNG4, RP11-21A7A.3, RP11- 21A7A.2, HCN1, SCRT2, CYB561, SYP, FAM162B, RP11-269G24.4, MYH15, PLCB4, PRRT4, PCBP3, C1QL1, INSRR, NPFF, TAGLN3, GJD2, TMEM179, PTPRN2, TMEM169, GPR83, TMEM114, KIAA1614, CNTN1, MRAP2, RP11-21A7A.4, GATA2, CXXC1P1, RTL1, RASD2, DOK5, RAB3C, PIRT, PPP1R17, ATCAY, RP11-379F12.4, GRID1, DPP6, DGKK, STMN2, RD3, MGAT4C, MCHR1, DDC, BMPR1B, ARC, APLP1, SLC7A14. Prostate Adenocarcinoma: GCNT1, RP11-170M17.1, PLA2G7, PCAT1, RP11-229P13.23, WI2-85898F10.1, AC144450.2, DLX1, DLX2, RP11-215A21.2, AMACR, MINOS1-NBL1, TMEFF2, AP006748.1, NPY4R, PODXL2, HMGN2P17, ACSM1, UBD, TRPM8, AC000089.3, TMSB15A, TRGC1, XXbac-BPG308K3.5, MMP10, RPL7P16, LUZP2, RP11-1084J3.3, RP11-40C6.2, VSTM2L, RP11-10G12.1, ABCC4, PIGY, RP11-579D7.2, COL10A1, CLDN8, CXCL11, RPS2P55, DNASE2B, GDF15, GOLM1, PCA3, AP001610.9, SIM2, RP11-483P21.2. Rectum Adenocarcinoma: POU5F1B, GCNT1, RP11-357H14.17, FAM64A, CHMP4C, RP11-229P13.23, SKA3, LAMB3, GPR143, LY6G6D, RBBP8NL, HMGB3, SEL1L3, CKMT1A, BCL2L15, CEP55, ATP10B, TRABD2A, TDGF1, PRSS33, FERMT1, CDH17, RRM2, RIPPLY3, KRT8P45, NEIL3, MAP7, FAM111B, RP11-93K22.13, ILDR1, CDCA7, RP11-215A21.2, MAD2L1, CEMIP, EPCAM, RNF128, ANKS4B, PPP1R14D, VWA2, MUC13, KRTCAP3, CKAP2, HKDC1, F2RL1, RP11-649E7.5, AP000439.3, AC007099.1, NOX1, AC005255.3, CLRN3, PRSS8, RAD51AP1, CDT1, CST2, CST1, ATP2C2, PHLDA2, EREG, RHPN2, CCL20, GAL3ST2, KRT8, HMGN2P17, UHRF1, AURKA, AURKB, ZP3, MET, KRT18, ABHD17C, LY75, KPNA7, FAM83H, FAM84A, GMDS, LSR, KIF4A, AGR2, MAPK13, PRR15L, FUT3, MMP1, UBD, CASC9, FGF19, TLCD1, NOS2, KIFC1, SQLE, PARPBP, TMEM45B, TK1, LRRC31, CCNF, AC000089.3, KIF18B, ERCC6L, FAM150A, RP11-150O12.3, RP11-150O12.6, MMP10, MMP11, MMP12, KIAA0226L, GGH, GALNT4, GALNT5, GALNT6, GALNT3, KIF11, ASCL2, HMGA1, C6orf222, C6orf223, PLS1, AREG, GRM8, HOXB9, AMELX, MARVELD2, KRTAP4-1, ENC1, CENPF, MACC1, EPHB2, MELK, GPA33, SPAG1, CASC21, RP11-151F5.2, ETV4, LGALS4, CTB-191K22.5, GRIN2D, KRT8P48, B3GNT3, FAM72D, SLC44A3, SLC44A4, TPX2, MISP, LGR5, BRI3BP, LY75-CD302, VIL1, TMEM211, MKI67, GUCY2C, NOTUM, CXCL3, CXCL9, UBE2C, UBE2T, CCAT1, YWHAZP4, ARHGAP11A, TNFSF11, ASPM, EFNA4, CBLC, AP003774.1, RPL36A-HNRNPH2, EPHX4, DSG2, RP11- 40C6.2, RNF43, AP1M2, PRR15, MCM2, VDR, FAM105A, PPP1R1B, BLM, RP11-187E13.1, CCNB1, LAMC2, RP11-10G12.1, NOXO1, KIAA0101, DACH1, CRB3, HPDL, TRIM15, LDHAP7, AL163953.2, FXYD3, EXO1, RXFP4, RP11-284F21.7, CDH1, RP11-22L13.1, ABRACL, ECT2, ST14, NFE2L3, SRMS, SOX9, MAL2, NUSAP1, PLEKHG6, MCM10, MYBL2, HNF4A, TGIF2-C20orf24, USH1C, RP11-480I12.5, COL10A1, E2F8, PYCR1, GPR160, CXADR, CDX1, CDX2, CDC6, CLDN3, CLDN4, CLDN7, BIK, EDAR, AC123023.1, TMEM238, CXCL11, RPS2P55, CDK1, CENPA, ANLN, RP11-44F14.2, PRR11, CCR8, RP1-276N6.2, ESRP1, CYP2S1, PLEK2, TRIM31-AS1, FOXP4-AS1, AC021218.2, TOX3, IQGAP3, MYB, SPC25, RP5-881L22.5, IHH, TSPAN8, BBOX1-AS1, MYEOV, ACSL5, TRIM31, TFF3, TMPRSS4, CEACAM1, CEACAM5, CEACAM6, ARHGEF35, STX19, CXCL10, SATB2, GPR39, GPR35, GPX2, CDCP1, SAPCD2, MEX3A, MYO5B, DDC. Skin Cutaneous Melanoma: PHACTR1, FAM64A, RP11-229P13.23, TRIM51, GPR143, TSPAN10, TYR, RP11- 221N13.3, RP5-940J5.9, AC005592.2, PLOD3, NRP2, HSPA8P1, PIR, RRM2, ABCB5, FAM111B, CHRNA6, RP11- 215A21.2, PLA1A, ST3GAL6-AS1, CTNNAP1, WARS, RP11-1055B8.3, SLC6A15, ATP6V1B1-AS1, PMEL, PLAT, BANCR, H2AFZP3, CCDC140, RP11-1055B8.2, EDNRB, LGALS3BP, KIF4A, AC144835.1, PRAME, SLC7A5, AP1S2, UBD, MAGEA12, RP11-1070A24.2, BAMBI, RHPN1-AS1, SNX10, EN2, SDC3, CENPF, TRIB2, RTP4, ITIH6, ETV1, ETV5, ETV4, ZNF697, IFI6, NES, CD63, CA14, RP11-104E19.1, GAPLINC, RP11-599J14.2, TRPV2, CXCL9, AC004988.1, BIRC7, FMN1, CTD-2380F24.1, YWHAZP4, STEAP1B, PAX3, CTD-2207A17.1, RP11-40C6.2, BLM, SERPINE2, SOX10, MITF, BACE2, RP11-10G12.1, KIAA0101, MAGEA3, MAGEA6, LDHAP7, EXO1, RHOQP2, TREM2, FAM167B, RENBP, PIGY, TGIF2-C20orf24, MAGEA2, RP11-94H18.1, PKNOX2, RLBP1, TMEM229B, IL12RB2, PAPD7, CDC6, IRF4, HMCN1, KDELR3, GDF15, LEF1, SLC45A2, MIA, LZTS1, PRR11, RGS20, ST3GAL4, GYG2, CSAG3, CSAG2, MLANA, TBC1D7, CNIH3, DUSP4, DUSP6, CXCL10, ALX1, SLC24A5, LHFPL3-AS1. Stomach Adenocarcinoma: ITGA2, GCNT3, RP11-357H14.17, FAM64A, CCNA2, SKA3, BIVM-ERCC5, RNU1-28P, HIST1H1E, HIST1H1B, CEP55, ATP10B, FERMT1, RRM2, TICRR, MXRA5, NEIL3, FAM111B, RP11-93K22.13, FOXM1, ILDR1, CDCA7, MAD2L1, SNORD17, COL1A1, AC012363.4, RP11-421E14.2, ALPPL2, MUC13, RSC1A1, CKAP2, HIST1H4C, F2RL2, SND1-IT1, LCN2, RNU1-27P, RPS26P47, S100A10, RP4-594A5.1, RAD51AP1, CDT1, CST2, CST1, CST4, PHLDA2, RNVU1-18, CCL20, CCL22, GAL3ST2, HSP90AB4P, TMEM189-UBE2V1, UHRF1, AURKB, RMRP, ONECUT2, KIF4A, CALML4, AGR3, HIST1H3G, UBD, CASC9, SCARNA7, RAET1E-AS1, FOXP3, PARPBP, TK1, CCNF, KIF18B, ERCC6L, RP11-709A23.1, MMP11, MMP12, KIF14, KIF11, ASCL2, PLS1, ADAMTS12, FAM72B, MRPL37P1, CENPF, CENPE, MACC1, EPHB2, MELK, ETV4, GRIN2D, FAM72D, TPX2, MISP, RP11-616K22.1, ZNF695, IFI6, LY75-CD302, MDK, CTB-181H17.1, SGOL1, RPPH1, KCNE3, RP11-404E16.1, KIF20B, KIF20A, MKI67, RP11-90O23.1, CXCL9, UBE2C, UBE2T, HIST1H2BL, CCAT1, NDUFC2-KCTD14, TNIP3, EPSTI1, ARHGAP11A, TNFSF11, ASPM, POU5F1P4, EFNA2, NPSR1, RNU4-2, CCL7, MCM2, BLM, CCNB1, LAMC2, KIAA0101, AC010761.10, CTHRC1, TRIM15, IL22RA2, EXO1, RP11-284F21.7, TREM2, ECT2, SLC7A5P2, SRMS, INHBA, NUSAP1, FU42393, MCM10, MYBL2, FPR3, HIST1H2AM, RP11-626H12.1, RP11-480I12.5, COL10A1, PAPD7, OAS1, OAS3, CDC6, CLDN3, CLDN4, CLDN7, EDAR, RP3-508I15.18, RP11-157K17.5, RP11-27M24.1, TMEM238, CXCL11, GS1-184P14.2, CDK1, STIL, CENPA, ANLN, RN7SL4P, PRR11, PRR7, GJD3, CCR8, ZNF460, RNU1-1, CHURC1-FNTB, IQGAP3, MYB, SPC25, LY6E, BBOX1-AS1, MYEOV, ZNF724P, TRIM31, TMPRSS4, DEPDC1B, CEACAM5, CEACAM6, ESCO2, CXCL10, FEZF1-AS1, SAPCD2, TNFRSF11A, RP11-323N12.5, ZNRF2P1. Testicular Germ Cell Tumor: CHST2, ALPP, FAM64A, SYT2, GBX2, AC068134.6, PLA2G2D, SLC7A3, RP5-940J5.9, GLDC, ZFP42, TUBA1C, PQLC3, CTC-260E6.6, PRDM14, MAN1C1, CPXM1, TDGF1, RRM2, AICDA, PMAIP1, TEAD4, VRTN, RP11-215A21.2, PRDM1, MRPL15, NANOGP1, ALPPL2, NANOG, CGB8, CTC-260E6.2, NKX1-2, ZSCAN10, SALL4, HIST1H2AJ, CCR5, CXCR3, KLRG2, TUBB, PHLDA3, BTBD6P1, RP11-148B6.2, CCL26, FOXH1, DACT2, TFAP2C, AURKB, TBPL2, SNHG16, CRYGEP, LIN28A, RPRM, FZD5, RP5-1028K7.2, DLG3, UBD, CASC9, MYCN, TK1, RPS7P11, ERCC6L, RP11-334E6.12, POU5F1, MMP12, TXN, GALNT6, NLRP7, NLRP9, LOXL2, ECEL1P1, OTX2, TET1, ZNF486, AC007389.3, VENTX, MIXL1, MDFI, FBN3, ETV4, AC005682.5, BFSP2, MMP24, PDPN, RP11-13K12.5, SHD, MAGED4, DPPA3, DPPA4, DPPA5, MDK, MX1, RP11-408B11.2, KHDC3L, PRODH, RPL39, ARSI, CXCL9, SUSD2, LIM2, POU5F1P4, HLA-DPB2, RP11-40C6.2, TLR8-AS1, PHB2, DSTNP2, RP11- 13K12.1, SOX15, RP11-10G12.1, RAB15, HPDL, FAM46B, BMF, GIP, CDH3, RPL41, NFE2L3, CD9, LDHB, MYBL2, AP000569.9, RP11-855O10.3, ZNF90, FPR3, RPL36A, DNMT3L, ISG15, SLC25A16, CLDN6, FAM124B, BICD1, SNRPFP1, TOR3A, CXCL11, RPS2P55, RP11-87G24.3, FAM162B, CENPA, SLAMF8, ESRG, PRR11, L1TD1, NPFF, FOXI3, CCR8, ESRP1, PPM1N, PPM1H, CYP2S1, GDF3, PIM2, GYLTL1B, CLEC6A, CCND2, RP11-253E3.3, NGEF, CCL19, TSPAN9, RPS4X, MLN, CXCL10, TRPC5, SAPCD2, MEX3A. Thyroid Carcinoma: NPC2, LAMB3, CYSLTR2, CITED1, RP11-221N13.3, RP5-940J5.9, PSG1, SIGLEC15, TNFRSF12A, RPSAP52, LRP4, FAM111B, RP11-215A21.2, ZCCHC12, ZCCHC16, CCL17, AC018816.3, RP11- 649E7.5, RP11-474O21.5, RP11-16K12.1, CST2, GABRB2, TGFA, HMGN2P17, MET, RP4-791C19.1, SLC27A6, LPAR5, DCSTAMP, SLC34A2, HMGA2, AC079630.2, AC079630.4, SDC4, RXRG, ETV4, PTCHD4, FN1, SLC25A47P1, ACTBL2, RP11-40C6.2, PRR15, RP11-542B15.1, AE000661.37, RP11-498C9.3, PDLIM4, RP11- 10G12.1, CD1B, CDH3, TREM2, NELL2, PIGY, RP11-579D7.2, RPS2P55, RP11-230G5.2, DTX4, CTC-518P12.6, PNPLA5, CSF2, SYT12, DPP4, DUSP6, TRPC5, CLDN16. Uterine Carcinosarcoma: FAM64A, C16orf59, WNT10A, CRABP1, CRABP2, CLUL1, RP5-940J5.9, HMGB3, RP11- 334L9.1, PLAC1, HSPA8P1, RRM2, RIPPLY3, RNASEH2A, NEIL3, FAM111B, RP11-215A21.2, MAD2L1, MINOS1- NBL1, NREP, SLC29A4, CBX2, RCOR2, RAD51AP1, CDT1, RPL13AP20, HMGN2P17, VAX2, UHRF1, AURKB, KIF4A, PAFAH1B3, KIF26B, KIFC1, PARPBP, TK1, RP11-608O21.1, CCNF, KIF18B, ERCC6L, TMSB15A, MEST, SBK1, RHPN1-AS1, KIF14, KIF11, IGF2BP1, IGF2BP3, HMGA2, HMGA1, TRAF4, FAM72B, MIXL1, COL2A1, CENPF, EPHB2, MELK, MDFI, FBN2, FBN3, ETV4, RAC3, SPC24, TPX2, ZNF695, MDK, DLX4, CCNE1, C2orf48, FSCN1, KIF20A, MKI67, NOTUM, UBE2C, UBE2T, BIRC5, COL26A1, MFAP2, ASPM, ERVMER34-1, RP11-40C6.2, CDKN2A, RP11-360L9.7, NME1, MCM4, MCM2, BLM, AC079922.2, CCNB1, RP11-10G12.1, KIAA0101, CTHRC1, NXPH4, KREMEN2, EXO1, ECT2, RP1-241P17.4, NUSAP1, MCM10, MYBL2, E2F1, E2F8, PYCR1, CDC6, ESM1, RPS2P55, CDK1, CENPM, CENPA, DRAXIN, CTC-518P12.6, PRR11, RP11-657O9.1, CHEK1, GINS4, SPC25, MARCKSL1, BBOX1-AS1, HJURP, SAPCD2, MEX3A, GNGT1. Uterine Corpus Endometrioid Carcinoma: RP6-65G23.3, FAM64A, C16orf59, RP11-638I2.6, RP5-940J5.9, RBBP8NL, HMGB3, RP11-334L9.1, AC004870.4, RP11-519G16.5, CEP55, DLX5, DLX6, RPSAP52, RRM2, RIPPLY3, KRT8P45, NEIL3, FAM111B, DLX6-AS1, ILDR1, RP11-215A21.2, MAD2L1, EPCAM, ALPPL2, OCIAD2, MUC16, WNT7A, KRTCAP3, HIST1H4J, SERINC2, FEN1, WFDC2, RP11-649E7.5, FTH1P20, PRSS8, CDT1, RHPN2, RPL13AP20, TMEM132A, KRT8, DACT2, HMGN2P17, UHRF1, AURKB, RP11-256P1.1, KRT18, RP11-108M12.3, RP11-465B22.3, SCGB2A1, LSR, KIF4A, ARAP1-AS1, RP11-710M11.1, HIST1H3G, VTCN1, PAFAH1B3, MMP7, UBD, CASC9, FGF18, TLCD1, KIFC1, TK1, CELSR1, RP11-608O21.1, CCNF, KIF18B, ERCC6L, SBK1, MMP11, MMP12, RHPN1-AS1, GALNT6, SPINT2, HMGA2, HMGA1, RP11-3J1.1, IFI27, EYA2, TRAF4, HOXB8, HOXB6, HOXB7, HOXB5, FAM72B, CENPF, MELK, ETV4, RP11-388M20.2, CAPS, SPC24, CTB-191K22.5, TECRP1, TPX2, LGR5, FOXJ1, ZNF695, IFI6, CCNE1, KIF20A, MKI67, ARSH, UBE2C, UBE2T, BIRC5, HIST1H2BH, COL26A1, CA8, C1orf186, ERVMER34-1, RP11-40C6.2, OVOL2, AP1M2, PRR15, CDKN2A, RP11-360L9.7, NME1, MCM2, BLM, CCNB1, LAMC2, RP11-498C9.3, SOX17, RP11-10G12.1, KIAA0101, EXO1, CDH1, ABRACL, ECT2, ST14, PLEKHG4B, RP1-241P17.4, NUSAP1, MTFP1, MCM10, MYBL2, RP11-579D7.2, SMPDL3B, HIST1H2AM, E2F8, CXADR, CDC6, ISG15, CLDN3, CLDN4, CLDN6, IDO1, F11R, CXCL11, RPS2P55, STXBP6, CDK1, HAPLN1, CENPM, CENPA, CTC-518P12.6, CTD-2531D15.5, PRR11, AP001615.9, ABHD11, RNF183, GYLTL1B, SPC25, MARCKSL1, TMPRSS3, TMPRSS4, MAP2K6, CXCL10, SAPCD2, MEX3A, GNGT1. - Our analysis of the active promoter mark H3K4me3 provided rich information regarding transcriptional programs in tissue of origin. Can we gain information from chromatin marks that are associated with enhancer and gene activity? Mono and di-methylation of H3 lysine 4 (H3K4me1 and H3K4me2, respectively) are found in two types of genomic regions: 1) promoter flanking regions at the boundaries of regions marked with H3K4me3, or 2) poised/active enhancers, where the H3K4me3 is barely detected. ChIP-seq for these marks in tissues shows H3K4me2 peaks in the near vicinity of enhancers, while H3K4me1 peaks are wider around enhancers (on the order of ˜10 kb). cfChIP of these marks recapitulates the expected distribution: Around active promoters, the H3K4me2 and H3K4me1 flank the main H3K4me3 peak (
FIG. 5A-B ) and correlate with H3K4me3 and RNA level (FIG. 5D ). Additionally, in gene-poor regions, such as the IFNB1 locus, we clearly see marks at enhancers, which match experimentally verified enhancers for IFNB1 (Banerjee et al. 2014) (FIG. 5C ). - Since the H3K4me2 is more condensed and the signal between healthy subjects is highly similar (
FIG. 5E ) we chose to focus on this mark for enhancer analysis. To identify enhancers, we looked for H3K4me2 peaks that have low or no H3K4me3 signal. We identified 8,000 putative enhancer peaks in healthy subjects. Of these peaks >90% were at sites predicted to be enhancer regions based on five chromatin marks (ChromHMM) in multiple tissues. Moreover, there is a good agreement between these putative enhancer peaks and ChromHMM annotations in the relevant cell types, e.g., monocytes and neutrophils (FIG. 5C ). Applying the same analysis to two samples from a colorectal cancer patient (C002.1 and C002.2) showed strong agreement between these samples with substantial differences from healthy subjects (FIG. 5E ). The differences between healthy subjects and cancer samples illustrate the additional information that can be gained from enhancers over promoter signatures. A few examples for cancer-specific enhancers signals include TCF3, CDX1, and CEACAM5 (FIG. 5F-H ). The transcription factor TCF3 has promoter H3K4me3 signal in every subject tested. However, the enhancer activity marks in the vicinity of the gene are strikingly different, with the cancer samples showing clear H3K4me2 peaks in regions that correspond to putative colon enhancers (FIG. 5F ). These results suggest that the gene is activated by different enhancers in the cells that contributed this signal and are consistent with the patient's clinical condition (FIG. 4A-G ). A subset of the H3K4me2 peaks around TCF3 are not observed in adult colon but only in fetal colon, consistent with de-repression of fetal oncogenes. The intestine-specific transcription factor CDX1 does not have H3K4me3 signal in healthy subjects. In the cancer sample it has a clear signal. This activity is accompanied by H3K4me2 peaks over large GI-specific enhancer regions in the vicinity of the gene (FIG. 5G ). This suggests that CDX1 is activated through these enhancers. Finally, examining the CEACAM5 locus a more complex picture emerges (FIG. 5H ). CEACAM5 has H3K4me3 signal at its promoter in the cancer sample. This is accompanied by a signal in GI-specific enhancers. However, in healthy subjects there is a strong H3K4me2 signal in a neighboring enhancer/promoter region. This suggests that some of these enhancers might be involved in repression of CEACAM5 in monocytes and neutrophils. - Tri-methylation of H3 lysine 36 (H3K36me3) is found at the body of transcribed genes. Unlike H3K4me3, which marks transcription start sites at both poised and active genes, H3K36me3 requires active transcription elongation to be deposited, and is hence more indicative of gene activity. cfChIP of H3K36me3 results in a typical enrichment at gene bodies (
FIG. 5I ) and the signal is in correlation to leukocyte H3K36me3 and RNA-seq (FIG. 5J-K ). Comparing the H3K36me3 signal from a healthy subject to that of a colorectal adenocarcinoma patient we see 1500 genes that have K36me3 that is increased 2-fold or more in the cancer sample (FIG. 5L ). Of these 1500 genes, 60 genes are known to be upregulated in colon adenocarcinoma (60 out of 172 COAD genes, p<10-20;FIG. 5L ). Interestingly, while most genes with higher H3K36me3 gene body signal in cancer also show higher H3K4me3 at promotes we find examples of genes that disagree (FIG. 5M ), attesting to the increased information that can be obtained by combining data from different histone marks. - Altogether, cf-ChIP-seq can probe the state of various genomic functionalities including promoters, enhancers, and gene bodies, and this information is highly informative on transcriptional activity in cells of origin.
- During cf-ChIP most of the material from each blood sample is not captured on beads as it does not carry the target modification/protein. Performing immobilization in successive fashion, where after incubation with one antibody the supernatant is passed to the next one, dramatically increases efficiency in using limited material. Even after multiple cfChIP steps, the remaining material (which still contains most of the original cfDNA) can be used for DNA based assays. To test feasibility, successive cfChIPs for H3K4me1 followed by H3K4me3, and vice versa, were performed (
FIG. 8 ). Good agreement between the results of the two experiments was found. - To test the ability of cf-ChIP to target acetylated histones we performed cf-ChIP with antibodies that target different acetylation sites on H3 (H3K9ac, H3K27ac) and acetylation of the H2A histone variant H2A.z (H2A.z_ac). All of these histone marks are associated with active transcription and indeed the histone acetylation marks showed typical patterns of enrichment around the transcription start site (TSS). Following cf-ChIP we aligned the sequenced DNA fragments to the human genome and performed meta gene analysis centered around the TSS of genes. As can be seen in
FIG. 9 , all acetyl marks showed marked enrichment around the TSS as would be expected. We also used the H3K27ac antibody on a colorectal cancer (CRC) patient and obtained a much higher signal compared to healthy donor suggesting that cf-ChIP of acetylated histones can have diagnostic value as well. - Even more efficient than successive rounds of ChIP, is to perform immobilization for all antibodies in the same tube. Beads are bound by an antibody and a DNA adapter with a matching barcode specific for that antibody. Beads with different antibody/barcode are mixed in the same blood sample and after ChIP a ligation reaction is run on all the beads. Due to the proximity and solid phase immobilization of the cfDNA and adapter/barcode the reaction is specific. The cfDNA on each bead is labeled with the DNA adapter containing a barcode specific to the antibody that pulled it down. Multiplex sequencing is performed on all cfDNA pulled down by any protein together (
FIG. 10A ). We call this method multiplexing by proximity ligation (MPL). This method minimizes loss of material during subsequent transfers and increases the chance of the antibody finding its target in the sample. - To prove the feasibility of MPL, we show that Chromatin can be Immunoprecipitated on surface containing a mixture of immobilized antibodies and barcoded DNA adapters (MPL barcoded surface, next generation sequencing (NGS) DNA libraries of chromatin immobilized on MPL barcoded surface can be generated and there is minimal mixing of chromatin between MPL barcoded surfaces. To test this, an experiment was devised as shown in
FIG. 10B . A unique barcode was combined with a particular pull-down antibody (anti-H3K4me3 or anti-H3K36me3). The MPL barcoded surface were combined to perform ChIP on chromatin derived from two yeast species (S. cerevisiae and K. lactis), that can be differentiated by their genomic sequences. To test whether mixing occurs during IP we mixed K4 and K36 MPL barcoded surface with chromatin from a single source and follow the genomic distribution of the immobilized chromatin. K4 MPL barcoded surface and K36 MPL barcoded surface should ChIP chromatin skewed towards the 5′ or 3′ end of genes respectively. We also combined MPL barcoded surfaces, which were incubated with different yeast strains, after the IP step to test mixing during library preparation and/or later on in the process. Mixing the outputs of the IPs into a single tube prior to library preparation will reveal mixing that occurs at that stage, for example if K. lactis DNA is sequenced withbarcode 1 orbarcode 2 we know that undesired mixing occurred during library preparation). - We performed qPCR following ChIP on MPL barcoded surface with decreasing amounts of DNA adapters (and hence increasing amounts of protein G that is used to recruit antibodies to MPL barcoded surface) and calculated the fraction of the chromatin that was immobilized compared to input (
FIG. 10C ). The fraction of Immunoprecipitated chromatin is comparable to results obtained by standard ChIP and is negatively correlated with the amount of immobilized barcoded DNA adapters (and hence positively correlated with the amount of immobilized antibodies). Note the extremely low levels of background signal (No protein G) proving that ChIP on MPL barcoded surface is dependent on immobilization of the antibody. - Next, we performed ChIP-seq with MPL barcoded surfaces containing antibodies against H3K4me3 or H3K36me3. Following sequencing the sequenced DNA fragments were aligned to the genome and the signal is presented as a meta plot along a typical gene (
FIG. 10D ). MPL barcoded surface with H3K4me3 and H3K36me3 yielded typical results, with H3K4me3 concentrated aroundgene 5′, and H3K36me3 spread around gene bodies towards the 3′ of the gene. - Finally, to test the amount of mixing, we count the number of reads obtained from MPL barcoded surface that is expected to be ligated only to K. lactis chromatin in 5 different samples. It was observed that the majority of the signal is indeed obtained from K. lactis DNA while only a small residual fraction is obtained from S. cerevisiae, suggesting that mixing is minimal even under conditions where the antibodies are not covalently bound to the MPL barcoded surface (binding is mediated by biotinylated protein G that is bound to the beads by strong biotin Streptavidin interaction). Indeed, mixed samples resulted in 80-90% correct alignment to S. cerevisiae.
- Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.
Claims (20)
1. A method of preparing cell free DNA (cfDNA) for analysis, the method comprising:
a. providing a bodily fluid sample, wherein said sample comprises cfDNA;
b. contacting said sample with an antibody immobilized on a physical support, wherein said antibody binds to a DNA-associated protein;
c. isolating said physical support and any thereto bound DNA-associated proteins and cfDNA; and
d. ligating a DNA adapter to said cfDNA bound to said protein and physical support;
thereby preparing cfDNA for analysis.
2. The method of claim 1 , wherein said bodily fluid is selected from whole blood and plasma.
3. The method of claim 1 , wherein said antibody is covalently immobilized to said physical support.
4. The method of claim 1 , wherein said physical support is a bead.
5. The method of claim 1 , wherein said DNA associated protein is selected from a histone, a modified histone and a histone variant.
6. The method of claim 5 , wherein said modified histone is selected from Histone 3 monomethylated lysine 4 (H3K4me1), Histone 3 demethylated lysine 4 (H3K4me2), Histone 3 trimethylated lysine 36 (H3K36me3) and Histone 3 trimethylated lysine 4 (H3K4me3), optionally wherein said reagent is an anti-modified histone antibody or fragment thereof.
7. The method of claim 1 , further comprising eluting cfDNA ligated to a DNA adapter from said beads.
8. The method of claim 7 , further comprising sequencing said eluted cfDNA.
9. The method of claim 8 , further comprising designating a cfDNA molecule comprising a DNA sequence of an informative genomic location as originating from a cell in a cellular state, originating from a tissue, originating from a cell type or a combination thereof, wherein association of said DNA-associated protein with said informative genomic location is indicative of said cellular state, tissue of origin, cell type or combination thereof in the cell that released said cfDNA and wherein said method is a method of determining a cellular state, tissue of origin cell type or combination thereof of a cell that released its DNA.
10. The method of claim 9 , wherein said cell that released its DNA is a dead cell and the method is for detecting death of at least one of:
a. a cell type in a subject,
b. a tissue in a subject, and
c. a cell in a cellular state in a subject.
11. The method of claim 9 , wherein said cellular state is a disease state, optionally wherein said disease state is selected from bacteremia, cancer, pre-cancer, infection, neurodegenerative disease, tissue damage, cardiac disease or damage, brain disease or damage, gastrointestinal disease or damage, liver disease, inflammation, autoimmune disease, arthritis, liver disease, damage or inflammation, bowel inflammation, autoimmune disease, tissue damage from drug side effects, tissue necrosis, and diabetes.
12. The method of claim 9 , wherein at least one of:
a. association of said DNA-associated protein with said genomic location is indicative of active transcription and said genomic location is within a tissue, cell type or cellular state specific gene or enhancer element or is at a disease-specific mutation;
b. wherein said DNA-associated protein is a marker of active transcription and said designating comprises comparing said sequenced cfDNA to a known transcriptional program of a tissue, cell type or cellular state, wherein a cfDNA with a sequence that is from a gene transcribed in said transcriptional program is from said tissue, cell type or cellular state; and
c. wherein association of said DNA-associated protein with said genomic location is indicative of silenced transcription and said genomic location is within a repressor element, or a gene silenced in said tissue, cell-type or cellular state, or is at a disease-specific mutation.
13. The method of claim 8 , wherein said sequencing is Next Generation Sequencing, High Throughput Sequencing or Massively Parallel Sequencing.
14. The method of claim 7 , wherein said adapter is a sequencing adapter.
15. The method of claim 14 , wherein said sequencing adapter is a barcoded sequencing adapter.
16. The method of claim 9 , wherein said designating comprises at least one of:
a. comparing said sequenced cfDNA to at least 10 genomic locations with the greatest unique association of said DNA-associated protein in a tissue, cell type or cellular state, and wherein a cfDNA with a sequence that is the same as a DNA sequence within said at least 10 genomic locations is considered to be from said tissue, cell type or cellular state;
b. comparing the sequenced cfDNA to a DNA-protein association atlas of at least 5 cell types or tissues, wherein said atlas comprises at least 10 genomic location with the greatest unique association of said DNA-associated protein in each of said 5 cell types or tissues, and wherein a cfDNA with a sequence that is the same as a DNA sequence within said at least 10 genomic locations is considered to be from said tissue or cell type; and
c. comparing the sequenced cfDNA to a transcriptional program atlas of at least 5 transcriptional programs, wherein said atlas comprises at least one genomic location with the greatest unique association of said DNA-associated protein in each of said 5 transcriptional programs and wherein a cfDNA with a sequence that is the same as a DNA sequence within said at least one genomic location indicates activation of said transcriptional program.
17. The method of claim 9 , wherein said cellular state is selected from: hypoxia, inflammation, ER stress, mitochondrial stress, interferon response, quiescence, senescence, cycling, malignant, and calcium flux.
18. The method of claim 9 , wherein said informative genomic location is selected from a promoter, an enhancer element, a silencer element, a gene body and a disease-associated mutation.
19. The method of claim 9 , wherein said cellular state is selected from: hypoxia, inflammation, ER stress, mitochondrial stress, interferon response, quiescence, senescence, cycling, malignant, and calcium flux.
20. The method of claim 9 , wherein said sample is from a subject and said method is a method of detecting and treating a disease in said subject and further comprises treating said subject with a suitable treatment based on the cellular state, tissue of origin, cell type or a combination thereof of said cell that died in said subject.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/217,575 US20230357837A1 (en) | 2018-03-13 | 2023-07-02 | Diagnostic use of cell free dna chromatin immunoprecipitation |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862642158P | 2018-03-13 | 2018-03-13 | |
US201862667528P | 2018-05-06 | 2018-05-06 | |
PCT/IL2019/050281 WO2019175876A2 (en) | 2018-03-13 | 2019-03-13 | Diagnostic use of cell free dna chromatin immunoprecipitation |
US202016980497A | 2020-09-14 | 2020-09-14 | |
US18/217,575 US20230357837A1 (en) | 2018-03-13 | 2023-07-02 | Diagnostic use of cell free dna chromatin immunoprecipitation |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IL2019/050281 Continuation WO2019175876A2 (en) | 2018-03-13 | 2019-03-13 | Diagnostic use of cell free dna chromatin immunoprecipitation |
US16/980,497 Continuation US11781183B2 (en) | 2018-03-13 | 2019-03-13 | Diagnostic use of cell free DNA chromatin immunoprecipitation |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230357837A1 true US20230357837A1 (en) | 2023-11-09 |
Family
ID=66182617
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/980,497 Active 2039-04-16 US11781183B2 (en) | 2018-03-13 | 2019-03-13 | Diagnostic use of cell free DNA chromatin immunoprecipitation |
US18/217,575 Pending US20230357837A1 (en) | 2018-03-13 | 2023-07-02 | Diagnostic use of cell free dna chromatin immunoprecipitation |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/980,497 Active 2039-04-16 US11781183B2 (en) | 2018-03-13 | 2019-03-13 | Diagnostic use of cell free DNA chromatin immunoprecipitation |
Country Status (4)
Country | Link |
---|---|
US (2) | US11781183B2 (en) |
EP (1) | EP3765638A2 (en) |
CN (1) | CN112119166A (en) |
WO (1) | WO2019175876A2 (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017096112A1 (en) | 2015-12-02 | 2017-06-08 | Bovie Medical Corporation | Mixing cold plasma beam jets with atmosphere |
US11898199B2 (en) | 2019-11-11 | 2024-02-13 | Universal Diagnostics, S.A. | Detection of colorectal cancer and/or advanced adenomas |
US11530453B2 (en) | 2020-06-30 | 2022-12-20 | Universal Diagnostics, S.L. | Systems and methods for detection of multiple cancer types |
JP2023553407A (en) * | 2020-12-02 | 2023-12-21 | エピゲニカ エービー | Multiplexed methods for assessing global or genomic locus-specific levels of chromatin modifications |
TW202242130A (en) * | 2020-12-29 | 2022-11-01 | 比利時商比利時意志有限公司 | Circulating transcription factor analysis |
CN113539355B (en) * | 2021-07-15 | 2022-11-25 | 云康信息科技(上海)有限公司 | Tissue-specific source for predicting cfDNA (deoxyribonucleic acid), related disease probability evaluation system and application |
CN114134223B (en) * | 2021-12-02 | 2022-10-11 | 青岛市中心血站 | Blood detection kit for hepatitis C virus-related liver fibrosis |
WO2023122623A1 (en) | 2021-12-21 | 2023-06-29 | Guardant Health, Inc. | Methods and systems for combinatorial chromatin-ip sequencing |
US20240043935A1 (en) * | 2022-07-29 | 2024-02-08 | Centre For Novostics Limited | Epigenetics analysis of cell-free dna |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4666828A (en) | 1984-08-15 | 1987-05-19 | The General Hospital Corporation | Test for Huntington's disease |
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
US4801531A (en) | 1985-04-17 | 1989-01-31 | Biotechnology Research Partners, Ltd. | Apo AI/CIII genomic polymorphisms predictive of atherosclerosis |
US5272057A (en) | 1988-10-14 | 1993-12-21 | Georgetown University | Method of detecting a predisposition to cancer by the use of restriction fragment length polymorphism of the gene for human poly (ADP-ribose) polymerase |
US5192659A (en) | 1989-08-25 | 1993-03-09 | Genetype Ag | Intron sequence analysis method for detection of adjacent and remote locus alleles as haplotypes |
AU5241100A (en) * | 1999-09-28 | 2001-04-30 | Universitat Zurich | Factors having prion-binding activity in serum and plasma and agents to detect transmissible spongiform encephalopathitis |
CA2419479A1 (en) * | 2000-08-14 | 2002-02-21 | Transgenetics Incorporated | Transcription factor target gene discovery |
JP2005517431A (en) | 2002-02-20 | 2005-06-16 | ユニバーシティ オブ バージニア パテント ファウンデーション | Noninvasive diagnostic test using histone modified markers |
GB0319376D0 (en) | 2003-08-18 | 2003-09-17 | Chroma Therapeutics Ltd | Histone modification detection |
ES2259270B1 (en) * | 2005-03-09 | 2007-11-01 | Consejo Superior De Investigaciones Cientificas | IN VITRO DIAGNOSTIC METHOD OF ALZHEIMER'S DISEASE BY A MONOCLONAL ANTIBODY. |
WO2010145035A1 (en) * | 2009-06-19 | 2010-12-23 | Siu K W Michael | Renal cell carcinoma biomarkers |
ES2564656T3 (en) * | 2009-10-26 | 2016-03-28 | Lifecodexx Ag | Means and methods for the non-invasive diagnosis of chromosomal aneuploidy |
SG11201402982TA (en) * | 2011-12-07 | 2014-07-30 | Singapore Volition Pte Ltd | Method for detecting nucleosome adducts |
US9920361B2 (en) * | 2012-05-21 | 2018-03-20 | Sequenom, Inc. | Methods and compositions for analyzing nucleic acid |
US11060145B2 (en) * | 2013-03-13 | 2021-07-13 | Sequenom, Inc. | Methods and compositions for identifying presence or absence of hypermethylation or hypomethylation locus |
US10174375B2 (en) | 2013-09-20 | 2019-01-08 | The Chinese University Of Hong Kong | Sequencing analysis of circulating DNA to detect and monitor autoimmune diseases |
JP2017522908A (en) * | 2014-07-25 | 2017-08-17 | ユニヴァーシティ オブ ワシントン | Method for determining tissue and / or cell type producing cell-free DNA, and method for identifying disease or abnormality using the same |
JP6721599B2 (en) * | 2014-10-29 | 2020-07-15 | ベルジアン ボリション エスピーアールエル | Method for concentrating circulating tumor DNA |
GB201518665D0 (en) * | 2015-10-21 | 2015-12-02 | Singapore Volition Pte Ltd | Method for enrichment of cell free nucleosomes |
US11261223B2 (en) * | 2016-05-11 | 2022-03-01 | The University Of Chicago | Methods of treating cancers with CT45 targeted therapies |
WO2018064208A1 (en) * | 2016-09-28 | 2018-04-05 | The Broad Institute, Inc. | Systematic screening and mapping of regulatory elements in non-coding genomic regions, methods, compositions, and applications thereof |
WO2019148141A1 (en) * | 2018-01-26 | 2019-08-01 | The Trustees Of Princeton University | Methods for analyzing genetic data to classify multifactorial traits including complex medical disorders |
-
2019
- 2019-03-13 US US16/980,497 patent/US11781183B2/en active Active
- 2019-03-13 EP EP19717978.1A patent/EP3765638A2/en active Pending
- 2019-03-13 CN CN201980031975.1A patent/CN112119166A/en active Pending
- 2019-03-13 WO PCT/IL2019/050281 patent/WO2019175876A2/en unknown
-
2023
- 2023-07-02 US US18/217,575 patent/US20230357837A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
CN112119166A (en) | 2020-12-22 |
WO2019175876A2 (en) | 2019-09-19 |
US20210024994A1 (en) | 2021-01-28 |
EP3765638A2 (en) | 2021-01-20 |
US11781183B2 (en) | 2023-10-10 |
WO2019175876A3 (en) | 2019-10-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230357837A1 (en) | Diagnostic use of cell free dna chromatin immunoprecipitation | |
US20240102095A1 (en) | Methods for profiling and quantitating cell-free rna | |
US20180089373A1 (en) | Integrated systems and methods for automated processing and analysis of biological samples, clinical information processing and clinical trial matching | |
US20180119137A1 (en) | Integrated systems and methods for automated processing and analysis of biological samples, clinical information processing and clinical trial matching | |
EP3421613B1 (en) | Identification and use of circulating nucleic acid tumor markers | |
US20210071262A1 (en) | Method of detecting cancer through generalized loss of stability of epigenetic domains and compositions thereof | |
AU2021224760A1 (en) | Capturing genetic targets using a hybridization approach | |
WO2017211947A1 (en) | Chemosensitivity predictive biomarkers | |
US20210325387A1 (en) | Cell atlas of the healthy and ulcerative colitis human colon | |
US20210115519A1 (en) | Methods and kits for diagnosis and triage of patients with colorectal liver metastases | |
US20160340740A1 (en) | Methylation haplotyping for non-invasive diagnosis (monod) | |
US20130022974A1 (en) | Dna methylation profiles in cancer | |
WO2016004387A1 (en) | Gene expression signature for cancer prognosis | |
WO2012031008A2 (en) | Cancer-related biological materials in microvesicles | |
WO2019079647A2 (en) | Statistical ai for advanced deep learning and probabilistic programing in the biosciences | |
US20190169691A1 (en) | Heritable epigenetic modifications as markers of chemotherapy exposure | |
US20200370132A1 (en) | Robust genomic predictor of breast and lung cancer metastasis | |
Liu et al. | Identification of liver metastasis-associated genes in human colon carcinoma by mRNA profiling | |
Solé‐Boldo et al. | Differentiation‐related epigenomic changes define clinically distinct keratinocyte cancer subclasses | |
WO2022217096A2 (en) | Cell-free dna sequence data analysis method to examine nucleosome protection and chromatin accessibility | |
US20230212674A1 (en) | Compositions and methods for identifying cell types | |
US20240009172A1 (en) | Compostions and methods for treating prostate cancer | |
US20230416833A1 (en) | Systems and methods for monitoring of cancer using minimal residual disease analysis | |
KR102566777B1 (en) | A method for providing information for predicting genome stability of stem cells | |
EP3995830A1 (en) | Method of prognosis of an individual having multiple myeloma to be sensitive to a treatment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: YISSUM RESEARCH DEVELOPMENT COMPANY OF THE HEBREW UNIVERSITY OF JERUSALEM LTD., ISRAEL Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SADEH, RONEN;FRIEDMAN, NIR;APPLEBOIM, ALON;SIGNING DATES FROM 20200913 TO 20201104;REEL/FRAME:064135/0322 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |