US20160053301A1 - Methods for quantitative genetic analysis of cell free dna - Google Patents
Methods for quantitative genetic analysis of cell free dna Download PDFInfo
- Publication number
- US20160053301A1 US20160053301A1 US14/466,741 US201414466741A US2016053301A1 US 20160053301 A1 US20160053301 A1 US 20160053301A1 US 201414466741 A US201414466741 A US 201414466741A US 2016053301 A1 US2016053301 A1 US 2016053301A1
- Authority
- US
- United States
- Prior art keywords
- cfdna
- genetic
- library
- dna
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000002068 genetic Effects 0.000 title claims abstract description 440
- 238000004458 analytical method Methods 0.000 title claims abstract description 232
- 229920003013 deoxyribonucleic acid Polymers 0.000 title claims description 298
- 238000001514 detection method Methods 0.000 claims abstract description 64
- 239000000523 sample Substances 0.000 claims description 402
- 239000002773 nucleotide Substances 0.000 claims description 384
- 125000003729 nucleotide group Chemical group 0.000 claims description 384
- 201000011510 cancer Diseases 0.000 claims description 108
- 230000004927 fusion Effects 0.000 claims description 82
- 201000002406 genetic disease Diseases 0.000 claims description 76
- 238000003753 real-time PCR Methods 0.000 claims description 76
- 230000003902 lesions Effects 0.000 claims description 74
- 230000035772 mutation Effects 0.000 claims description 74
- 102100011141 ALK Human genes 0.000 claims description 72
- 101710033641 ALK Proteins 0.000 claims description 72
- 210000002381 Plasma Anatomy 0.000 claims description 72
- 230000003321 amplification Effects 0.000 claims description 70
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 68
- 239000012530 fluid Substances 0.000 claims description 52
- 102000004190 Enzymes Human genes 0.000 claims description 46
- 108090000790 Enzymes Proteins 0.000 claims description 46
- 229920001405 Coding region Polymers 0.000 claims description 44
- 230000027455 binding Effects 0.000 claims description 40
- 239000012472 biological sample Substances 0.000 claims description 34
- 238000001712 DNA sequencing Methods 0.000 claims description 28
- 230000001605 fetal Effects 0.000 claims description 26
- 238000005259 measurement Methods 0.000 claims description 26
- 210000004369 Blood Anatomy 0.000 claims description 22
- 239000008280 blood Substances 0.000 claims description 22
- 238000003780 insertion Methods 0.000 claims description 20
- 229920002425 Alu element Polymers 0.000 claims description 18
- 102100002671 EML4 Human genes 0.000 claims description 18
- 101700040935 EML4 Proteins 0.000 claims description 18
- 238000003766 bioinformatics method Methods 0.000 claims description 18
- 239000000090 biomarker Substances 0.000 claims description 18
- 210000004381 Amniotic Fluid Anatomy 0.000 claims description 14
- 210000003296 Saliva Anatomy 0.000 claims description 14
- 210000000582 Semen Anatomy 0.000 claims description 14
- 210000002966 Serum Anatomy 0.000 claims description 14
- 210000004243 Sweat Anatomy 0.000 claims description 14
- 210000002700 Urine Anatomy 0.000 claims description 14
- 238000007622 bioinformatic analysis Methods 0.000 claims description 14
- 230000002490 cerebral Effects 0.000 claims description 14
- 230000001926 lymphatic Effects 0.000 claims description 14
- 241000894007 species Species 0.000 claims description 10
- 238000004166 bioassay Methods 0.000 abstract description 126
- 229920001850 Nucleic acid sequence Polymers 0.000 abstract description 52
- 230000014509 gene expression Effects 0.000 abstract description 8
- 230000002759 chromosomal Effects 0.000 abstract description 6
- 239000000203 mixture Substances 0.000 description 106
- 230000035945 sensitivity Effects 0.000 description 80
- 210000004027 cells Anatomy 0.000 description 76
- 102100019730 TP53 Human genes 0.000 description 56
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 56
- 201000010099 disease Diseases 0.000 description 54
- 241000282414 Homo sapiens Species 0.000 description 52
- 206010028980 Neoplasm Diseases 0.000 description 46
- 238000006243 chemical reaction Methods 0.000 description 46
- 101700041487 ACA2 Proteins 0.000 description 40
- -1 ATR Proteins 0.000 description 40
- 101710026113 CST6 Proteins 0.000 description 40
- 241000700605 Viruses Species 0.000 description 40
- 239000011324 bead Substances 0.000 description 34
- 108020004707 nucleic acids Proteins 0.000 description 28
- 150000007523 nucleic acids Chemical class 0.000 description 28
- 102100018882 MYCN Human genes 0.000 description 24
- 108010010748 N-Myc Proto-Oncogene Protein Proteins 0.000 description 24
- 238000010276 construction Methods 0.000 description 24
- 238000005516 engineering process Methods 0.000 description 24
- 238000009396 hybridization Methods 0.000 description 24
- 230000001717 pathogenic Effects 0.000 description 24
- 239000000243 solution Substances 0.000 description 24
- 210000001519 tissues Anatomy 0.000 description 24
- 238000010367 cloning Methods 0.000 description 22
- 238000000034 method Methods 0.000 description 22
- 210000000056 organs Anatomy 0.000 description 22
- 244000052769 pathogens Species 0.000 description 22
- 206010033128 Ovarian cancer Diseases 0.000 description 20
- 210000004602 germ cell Anatomy 0.000 description 20
- 229920000272 Oligonucleotide Polymers 0.000 description 18
- 108060009497 WRNexo Proteins 0.000 description 18
- 238000007374 clinical diagnostic method Methods 0.000 description 18
- 201000011231 colorectal cancer Diseases 0.000 description 18
- 230000000670 limiting Effects 0.000 description 18
- 239000000463 material Substances 0.000 description 18
- 206010068052 Mosaicism Diseases 0.000 description 16
- 238000003745 diagnosis Methods 0.000 description 16
- 239000003814 drug Substances 0.000 description 16
- 229940079593 drugs Drugs 0.000 description 16
- 230000002255 enzymatic Effects 0.000 description 16
- 238000002474 experimental method Methods 0.000 description 16
- 244000005700 microbiome Species 0.000 description 16
- 239000000047 product Substances 0.000 description 16
- 238000000746 purification Methods 0.000 description 16
- 238000010200 validation analysis Methods 0.000 description 16
- 229920002459 Intron Polymers 0.000 description 14
- 238000001574 biopsy Methods 0.000 description 14
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 14
- 230000001364 causal effect Effects 0.000 description 14
- 230000000295 complement Effects 0.000 description 14
- 102100007281 BRCA1 Human genes 0.000 description 12
- 101700076604 BRCA1 Proteins 0.000 description 12
- 238000007792 addition Methods 0.000 description 12
- 238000010790 dilution Methods 0.000 description 12
- 238000001914 filtration Methods 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- 102100004328 BRAF Human genes 0.000 description 10
- 101700004551 BRAF Proteins 0.000 description 10
- 101700011961 DPOM Proteins 0.000 description 10
- 102100005049 DPYD Human genes 0.000 description 10
- 101700025368 ERBB2 Proteins 0.000 description 10
- 102100016662 ERBB2 Human genes 0.000 description 10
- 229920002024 GDNA Polymers 0.000 description 10
- 101710029649 MDV043 Proteins 0.000 description 10
- 229920002393 Microsatellite Polymers 0.000 description 10
- 101700061424 POLB Proteins 0.000 description 10
- 101700054624 RF1 Proteins 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 238000006062 fragmentation reaction Methods 0.000 description 10
- 238000007481 next generation sequencing Methods 0.000 description 10
- 229920000023 polynucleotide Polymers 0.000 description 10
- 239000002157 polynucleotide Substances 0.000 description 10
- 108010000750 BRCA2 Protein Proteins 0.000 description 8
- 102000002280 BRCA2 Protein Human genes 0.000 description 8
- 101700016900 CDH1 Proteins 0.000 description 8
- 101700011568 DIB1 Proteins 0.000 description 8
- 101700031267 DPYD Proteins 0.000 description 8
- 102100018976 FZR1 Human genes 0.000 description 8
- 102100019126 HBB Human genes 0.000 description 8
- 102100009279 KRAS Human genes 0.000 description 8
- 101710033922 KRAS Proteins 0.000 description 8
- RWQNBRDOKXIBIV-UHFFFAOYSA-N Thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 8
- 230000000875 corresponding Effects 0.000 description 8
- 230000001186 cumulative Effects 0.000 description 8
- 101710030587 ligN Proteins 0.000 description 8
- 101700077585 ligd Proteins 0.000 description 8
- 201000005244 lung non-small cell carcinoma Diseases 0.000 description 8
- 238000010369 molecular cloning Methods 0.000 description 8
- 230000001629 suppression Effects 0.000 description 8
- 238000002560 therapeutic procedure Methods 0.000 description 8
- WYWHKKSPHMUBEB-UHFFFAOYSA-N tioguanine Chemical compound N1C(N)=NC(=S)C2=C1N=CN2 WYWHKKSPHMUBEB-UHFFFAOYSA-N 0.000 description 8
- GHASVSINZRGABV-UHFFFAOYSA-N 5-flurouricil Chemical compound FC1=CNC(=O)NC1=O GHASVSINZRGABV-UHFFFAOYSA-N 0.000 description 6
- 102100014001 ABCB1 Human genes 0.000 description 6
- 102100006705 ALDH4A1 Human genes 0.000 description 6
- 101710037561 ALDH4A1 Proteins 0.000 description 6
- 206010069754 Acquired gene mutation Diseases 0.000 description 6
- 210000001124 Body Fluids Anatomy 0.000 description 6
- 206010006187 Breast cancer Diseases 0.000 description 6
- 102000033243 CDKN2A Human genes 0.000 description 6
- 102100017371 CYP2C19 Human genes 0.000 description 6
- 101710007600 CYP2C19 Proteins 0.000 description 6
- 102100005543 CYP2D6 Human genes 0.000 description 6
- 101710007490 CYP2D6 Proteins 0.000 description 6
- 102100004057 CYP3A4 Human genes 0.000 description 6
- 101710007540 CYP3A4 Proteins 0.000 description 6
- 102100004059 CYP3A5 Human genes 0.000 description 6
- 101710007537 CYP3A5 Proteins 0.000 description 6
- 206010009944 Colon cancer Diseases 0.000 description 6
- 229920002676 Complementary DNA Polymers 0.000 description 6
- 108010009392 Cyclin-Dependent Kinase Inhibitor p16 Proteins 0.000 description 6
- 230000004544 DNA amplification Effects 0.000 description 6
- 101710028159 DNTT Proteins 0.000 description 6
- 101710030424 EPHX1 Proteins 0.000 description 6
- 102100017903 EPHX1 Human genes 0.000 description 6
- 229920000665 Exon Polymers 0.000 description 6
- 229960002949 Fluorouracil Drugs 0.000 description 6
- 102000007648 Glutathione S-Transferase pi Human genes 0.000 description 6
- 108010007355 Glutathione S-Transferase pi Proteins 0.000 description 6
- 102100006988 KCNH2 Human genes 0.000 description 6
- 101700085508 KCNH2 Proteins 0.000 description 6
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 6
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 6
- 101710019353 MARS2 Proteins 0.000 description 6
- 102100002074 MTHFR Human genes 0.000 description 6
- 102100015262 MYC Human genes 0.000 description 6
- 101700075357 MYC Proteins 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 101700067251 NQO1 Proteins 0.000 description 6
- 102100002509 NQO1 Human genes 0.000 description 6
- 102000007530 Neurofibromin 1 Human genes 0.000 description 6
- 108010085793 Neurofibromin 1 Proteins 0.000 description 6
- 206010035226 Plasma cell myeloma Diseases 0.000 description 6
- 102100000895 RB1 Human genes 0.000 description 6
- 101710003919 RB1 Proteins 0.000 description 6
- 101710017811 RB1 Proteins 0.000 description 6
- 102100010056 SLC19A1 Human genes 0.000 description 6
- 102100019647 SULT1A1 Human genes 0.000 description 6
- 101710017681 SULT1A1 Proteins 0.000 description 6
- 102100008790 TNFRSF14 Human genes 0.000 description 6
- 101710038601 TNFRSF14 Proteins 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 239000012491 analyte Substances 0.000 description 6
- 230000001093 anti-cancer Effects 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 230000001809 detectable Effects 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical class C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 6
- 238000007901 in situ hybridization Methods 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 201000005202 lung cancer Diseases 0.000 description 6
- 229960000485 methotrexate Drugs 0.000 description 6
- 230000003278 mimic Effects 0.000 description 6
- 230000000869 mutational Effects 0.000 description 6
- 238000004445 quantitative analysis Methods 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 239000007787 solid Substances 0.000 description 6
- 230000000392 somatic Effects 0.000 description 6
- 238000004450 types of analysis Methods 0.000 description 6
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 4
- 102100004948 ABCD1 Human genes 0.000 description 4
- 101700007462 ADH1A Proteins 0.000 description 4
- 102100000356 ADH1A Human genes 0.000 description 4
- 101710003370 ADH1C Proteins 0.000 description 4
- 102100010849 ADH1C Human genes 0.000 description 4
- 102100003861 ADRB1 Human genes 0.000 description 4
- 101700078529 ADRB1 Proteins 0.000 description 4
- 108060003354 ADRB2 Proteins 0.000 description 4
- 102100014183 ADRB2 Human genes 0.000 description 4
- 101700006583 AKT2 Proteins 0.000 description 4
- 101710036635 ALDH1A1 Proteins 0.000 description 4
- 102100017506 ALDH1A1 Human genes 0.000 description 4
- 102100009823 ALOX5 Human genes 0.000 description 4
- 206010000880 Acute myeloid leukaemia Diseases 0.000 description 4
- 229960000643 Adenine Drugs 0.000 description 4
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Natural products NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 4
- 208000009956 Adenocarcinoma Diseases 0.000 description 4
- 102000003984 Aryl hydrocarbon receptors Human genes 0.000 description 4
- 108090000448 Aryl hydrocarbon receptors Proteins 0.000 description 4
- 241000283690 Bos taurus Species 0.000 description 4
- 102100014737 CYP2B6 Human genes 0.000 description 4
- 101710007656 CYP2B6 Proteins 0.000 description 4
- 102100017367 CYP2C9 Human genes 0.000 description 4
- 101710007513 CYP2C9 Proteins 0.000 description 4
- 102100018445 CYP2J2 Human genes 0.000 description 4
- 101710007736 CYP2J2 Proteins 0.000 description 4
- 229960004117 Capecitabine Drugs 0.000 description 4
- GAGWJHPBXLXJQN-UORFTKCHSA-N Capecitabine Chemical compound C1=C(F)C(NC(=O)OCCCCC)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](C)O1 GAGWJHPBXLXJQN-UORFTKCHSA-N 0.000 description 4
- 241000713756 Caprine arthritis encephalitis virus Species 0.000 description 4
- 229960004630 Chlorambucil Drugs 0.000 description 4
- JCKYGMPEJWAADB-UHFFFAOYSA-N Chlorambucil Chemical compound OC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 JCKYGMPEJWAADB-UHFFFAOYSA-N 0.000 description 4
- 206010008958 Chronic lymphocytic leukaemia Diseases 0.000 description 4
- 229960004397 Cyclophosphamide Drugs 0.000 description 4
- CMSMOCZEIVJLDB-UHFFFAOYSA-N Cyclophosphamide Chemical compound ClCCN(CCCl)P1(=O)NCCCO1 CMSMOCZEIVJLDB-UHFFFAOYSA-N 0.000 description 4
- 102000002237 Cytochrome P-450 CYP2A6 Human genes 0.000 description 4
- 108010000080 Cytochrome P-450 CYP2A6 Proteins 0.000 description 4
- 102100014129 DMD Human genes 0.000 description 4
- 101700040453 DRD2 Proteins 0.000 description 4
- 102100014971 DRD2 Human genes 0.000 description 4
- 201000010374 Down syndrome Diseases 0.000 description 4
- 102000006378 EC 2.1.1.6 Human genes 0.000 description 4
- 108020002739 EC 2.1.1.6 Proteins 0.000 description 4
- 201000006360 Edwards syndrome Diseases 0.000 description 4
- 241000713730 Equine infectious anemia virus Species 0.000 description 4
- 102000013165 Exonucleases Human genes 0.000 description 4
- 108060002716 Exonucleases Proteins 0.000 description 4
- 102000027766 FGFR3 Human genes 0.000 description 4
- 241000713800 Feline immunodeficiency virus Species 0.000 description 4
- 241000714165 Feline leukemia virus Species 0.000 description 4
- 241000711950 Filoviridae Species 0.000 description 4
- 108090000652 Flap endonucleases Proteins 0.000 description 4
- 102000004150 Flap endonucleases Human genes 0.000 description 4
- CHPZKNULDCNCBW-UHFFFAOYSA-N Gallium nitrate Chemical compound [Ga+3].[O-][N+]([O-])=O.[O-][N+]([O-])=O.[O-][N+]([O-])=O CHPZKNULDCNCBW-UHFFFAOYSA-N 0.000 description 4
- 206010017758 Gastric cancer Diseases 0.000 description 4
- 206010051066 Gastrointestinal stromal tumour Diseases 0.000 description 4
- 241000713813 Gibbon ape leukemia virus Species 0.000 description 4
- 101700075495 HEXA Proteins 0.000 description 4
- 102100012716 HEXA Human genes 0.000 description 4
- 101710010865 HMGCR Proteins 0.000 description 4
- 102100010498 HMGCR Human genes 0.000 description 4
- 102100016790 HPRT1 Human genes 0.000 description 4
- 101710011940 HPRT1 Proteins 0.000 description 4
- 241000713858 Harvey murine sarcoma virus Species 0.000 description 4
- 108091005902 Hemoglobin subunit beta Proteins 0.000 description 4
- 206010073071 Hepatocellular carcinoma Diseases 0.000 description 4
- FDGQSTZJBFJUBT-UHFFFAOYSA-N Hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 4
- 229960001101 Ifosfamide Drugs 0.000 description 4
- HOMGKSMUEGBAAB-UHFFFAOYSA-N Ifosfamide Chemical compound ClCCNP1(=O)OCCCN1CCCl HOMGKSMUEGBAAB-UHFFFAOYSA-N 0.000 description 4
- 102100019516 JAK2 Human genes 0.000 description 4
- 101700016050 JAK2 Proteins 0.000 description 4
- 206010069755 K-ras gene mutation Diseases 0.000 description 4
- 102100002644 KCNJ11 Human genes 0.000 description 4
- 108060004074 KCNJ11 Proteins 0.000 description 4
- 102100012475 LDLR Human genes 0.000 description 4
- 201000000639 Leber hereditary optic neuropathy Diseases 0.000 description 4
- 208000000429 Leukemia, Lymphocytic, Chronic, B-Cell Diseases 0.000 description 4
- 208000008456 Leukemia, Myelogenous, Chronic, BCR-ABL Positive Diseases 0.000 description 4
- 208000007046 Leukemia, Myeloid, Acute Diseases 0.000 description 4
- 102000003960 Ligases Human genes 0.000 description 4
- 108090000364 Ligases Proteins 0.000 description 4
- 241000712899 Lymphocytic choriomeningitis mammarenavirus Species 0.000 description 4
- 101710033932 MTHFR Proteins 0.000 description 4
- 108010047230 Member 1 Subfamily B ATP Binding Cassette Transporter Proteins 0.000 description 4
- 108010049137 Member 1 Subfamily D ATP Binding Cassette Transporter Proteins 0.000 description 4
- 206010061289 Metastatic neoplasm Diseases 0.000 description 4
- 229960001156 Mitoxantrone Drugs 0.000 description 4
- KKZJGLLVHKMTCM-UHFFFAOYSA-N Mitoxantrone Chemical compound O=C1C2=C(O)C=CC(O)=C2C(=O)C2=C1C(NCCNCCO)=CC=C2NCCNCCO KKZJGLLVHKMTCM-UHFFFAOYSA-N 0.000 description 4
- 241000713869 Moloney murine leukemia virus Species 0.000 description 4
- 241000713862 Moloney murine sarcoma virus Species 0.000 description 4
- 206010028576 Myeloproliferative disease Diseases 0.000 description 4
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical class C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 4
- 206010029592 Non-Hodgkin's lymphomas Diseases 0.000 description 4
- 208000007610 Optic Atrophy, Hereditary, Leber Diseases 0.000 description 4
- 102100019727 P2RY1 Human genes 0.000 description 4
- 102100010469 P2RY12 Human genes 0.000 description 4
- 102100019471 PIK3CA Human genes 0.000 description 4
- 101710027440 PIK3CA Proteins 0.000 description 4
- 102100008799 PTEN Human genes 0.000 description 4
- 108010011536 PTEN Phosphohydrolase Proteins 0.000 description 4
- 102100017318 PTGIS Human genes 0.000 description 4
- 101700002755 PTGIS Proteins 0.000 description 4
- 201000009928 Patau syndrome Diseases 0.000 description 4
- 102000004270 Peptidyl-Dipeptidase A Human genes 0.000 description 4
- 108090000882 Peptidyl-Dipeptidase A Proteins 0.000 description 4
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 4
- 102000001253 Protein Kinases Human genes 0.000 description 4
- 108010028935 Purinergic P2Y1 Receptors Proteins 0.000 description 4
- GZUITABIAKMVPG-UHFFFAOYSA-N Raloxifene Chemical compound C1=CC(O)=CC=C1C1=C(C(=O)C=2C=CC(OCCN3CCCCC3)=CC=2)C2=CC=C(O)C=C2S1 GZUITABIAKMVPG-UHFFFAOYSA-N 0.000 description 4
- 229960004622 Raloxifene Drugs 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 241000714474 Rous sarcoma virus Species 0.000 description 4
- 102100017449 SCN5A Human genes 0.000 description 4
- 101700070146 SCN5A Proteins 0.000 description 4
- 108091006693 SLC19A1 Proteins 0.000 description 4
- 102100007110 SLCO1B1 Human genes 0.000 description 4
- 101710042004 SLCO1B1 Proteins 0.000 description 4
- 241000713311 Simian immunodeficiency virus Species 0.000 description 4
- 241000580858 Simian-Human immunodeficiency virus Species 0.000 description 4
- 108010090804 Streptavidin Proteins 0.000 description 4
- 102100009449 TPMT Human genes 0.000 description 4
- 108050009735 TPMT Proteins 0.000 description 4
- 102100008209 TSC1 Human genes 0.000 description 4
- 101700061326 TSC1 Proteins 0.000 description 4
- 101710013732 TYMS Proteins 0.000 description 4
- 102100011050 TYMS Human genes 0.000 description 4
- FOCVUCIESVLUNU-UHFFFAOYSA-N ThioTEPA Chemical compound C1CN1P(N1CC1)(=S)N1CC1 FOCVUCIESVLUNU-UHFFFAOYSA-N 0.000 description 4
- 229960005454 Thioguanine Drugs 0.000 description 4
- 229960001196 Thiotepa Drugs 0.000 description 4
- 229940113082 Thymine Drugs 0.000 description 4
- XFCLJVABOIYOMF-QPLCGJKRSA-N Toremifene Chemical compound C1=CC(OCCN(C)C)=CC=C1C(\C=1C=CC=CC=1)=C(\CCCl)C1=CC=CC=C1 XFCLJVABOIYOMF-QPLCGJKRSA-N 0.000 description 4
- 206010044686 Trisomy 13 Diseases 0.000 description 4
- 206010053884 Trisomy 18 Diseases 0.000 description 4
- 108010081267 Type 3 Fibroblast Growth Factor Receptor Proteins 0.000 description 4
- 102100007198 UBE3A Human genes 0.000 description 4
- 101700027248 UBE3A Proteins 0.000 description 4
- 108020003635 Untranslated Regions Proteins 0.000 description 4
- 229920000146 Untranslated region Polymers 0.000 description 4
- 241000711975 Vesicular stomatitis virus Species 0.000 description 4
- 230000001594 aberrant Effects 0.000 description 4
- 150000007513 acids Chemical class 0.000 description 4
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 4
- 201000005510 acute lymphocytic leukemia Diseases 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 102000004965 antibodies Human genes 0.000 description 4
- 108090001123 antibodies Proteins 0.000 description 4
- 239000002246 antineoplastic agent Substances 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 201000009030 carcinoma Diseases 0.000 description 4
- 239000002738 chelating agent Substances 0.000 description 4
- 108091006028 chimera Proteins 0.000 description 4
- 201000006934 chronic myeloid leukemia Diseases 0.000 description 4
- 230000002153 concerted Effects 0.000 description 4
- 238000010192 crystallographic characterization Methods 0.000 description 4
- 230000003247 decreasing Effects 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N edta Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- 238000005755 formation reaction Methods 0.000 description 4
- 201000011243 gastrointestinal stromal tumor Diseases 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 239000002955 immunomodulating agent Substances 0.000 description 4
- 230000002584 immunomodulator Effects 0.000 description 4
- 229940121354 immunomodulators Drugs 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 101700085186 lcb2 Proteins 0.000 description 4
- 230000001404 mediated Effects 0.000 description 4
- 238000002844 melting Methods 0.000 description 4
- GLVAUDGFNGKCSF-UHFFFAOYSA-N mercaptopurine Chemical compound S=C1NC=NC2=C1NC=N2 GLVAUDGFNGKCSF-UHFFFAOYSA-N 0.000 description 4
- 229960001428 mercaptopurine Drugs 0.000 description 4
- 238000007479 molecular analysis Methods 0.000 description 4
- 201000009251 multiple myeloma Diseases 0.000 description 4
- 101700057694 mvaA Proteins 0.000 description 4
- 201000003793 myelodysplastic syndrome Diseases 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 201000008968 osteosarcoma Diseases 0.000 description 4
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 4
- 238000006116 polymerization reaction Methods 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 238000004393 prognosis Methods 0.000 description 4
- 230000002829 reduced Effects 0.000 description 4
- 230000022983 regulation of cell cycle Effects 0.000 description 4
- 238000005070 sampling Methods 0.000 description 4
- 238000000527 sonication Methods 0.000 description 4
- 201000011549 stomach cancer Diseases 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000002626 targeted therapy Methods 0.000 description 4
- 201000002510 thyroid cancer Diseases 0.000 description 4
- 229960005026 toremifene Drugs 0.000 description 4
- 241001529453 unidentified herpesvirus Species 0.000 description 4
- 229960002066 vinorelbine Drugs 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- SHGAZHPCJJPHSC-ZVCIMWCZSA-N (2E,4E,6Z,8E)-3,7-dimethyl-9-(2,6,6-trimethylcyclohex-1-en-1-yl)nona-2,4,6,8-tetraenoic acid Chemical compound OC(=O)/C=C(\C)/C=C/C=C(/C)\C=C\C1=C(C)CCCC1(C)C SHGAZHPCJJPHSC-ZVCIMWCZSA-N 0.000 description 2
- WDQLRUYAYXDIFW-RWKIJVEZSA-N (2R,3R,4S,5R,6R)-4-[(2S,3R,4S,5R,6R)-3,5-dihydroxy-4-[(2R,3R,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-[[(2R,3R,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxymethyl]oxan-2-yl]oxy-6-(hydroxymethyl)oxane-2,3,5-triol Chemical compound O[C@@H]1[C@@H](CO)O[C@@H](O)[C@H](O)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)O1 WDQLRUYAYXDIFW-RWKIJVEZSA-N 0.000 description 2
- VFKZTMPDYBFSTM-GUCUJZIJSA-N (2R,3S,4R,5S)-1,6-dibromohexane-2,3,4,5-tetrol Chemical compound BrC[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)CBr VFKZTMPDYBFSTM-GUCUJZIJSA-N 0.000 description 2
- FLWWDYNPWOSLEO-HQVZTVAUSA-N (2S)-2-[[4-[1-(2-amino-4-oxo-1H-pteridin-6-yl)ethyl-methylamino]benzoyl]amino]pentanedioic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1C(C)N(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FLWWDYNPWOSLEO-HQVZTVAUSA-N 0.000 description 2
- PMATZTZNYRCHOR-CGLBZJNRSA-N (3S,6S,9S,12R,15S,18S,21S,24S,30S,33S)-30-ethyl-33-[(E,1R,2R)-1-hydroxy-2-methylhex-4-enyl]-1,4,7,10,12,15,19,25,28-nonamethyl-6,9,18,24-tetrakis(2-methylpropyl)-3,21-di(propan-2-yl)-1,4,7,10,13,16,19,22,25,28,31-undecazacyclotritriacontane-2,5,8,11,14,17 Chemical compound CC[C@@H]1NC(=O)[C@H]([C@H](O)[C@H](C)C\C=C\C)N(C)C(=O)[C@H](C(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)N(C)C(=O)CN(C)C1=O PMATZTZNYRCHOR-CGLBZJNRSA-N 0.000 description 2
- DLWOTOMWYCRPLK-UVTDQMKNSA-N (4Z)-5-amino-6-(7-amino-6-methoxy-5,8-dioxoquinolin-2-yl)-4-(4,5-dimethoxy-6-oxocyclohexa-2,4-dien-1-ylidene)-3-methyl-1H-pyridine-2-carboxylic acid Chemical compound C1=CC(OC)=C(OC)C(=O)\C1=C\1C(N)=C(C=2N=C3C(=O)C(N)=C(OC)C(=O)C3=CC=2)NC(C(O)=O)=C/1C DLWOTOMWYCRPLK-UVTDQMKNSA-N 0.000 description 2
- XRBSKUSTLXISAB-XVVDYKMHSA-N (5R,6R,7R,8R)-8-hydroxy-7-(hydroxymethyl)-5-(3,4,5-trimethoxyphenyl)-5,6,7,8-tetrahydrobenzo[f][1,3]benzodioxole-6-carboxylic acid Chemical compound COC1=C(OC)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@H](O)[C@@H](CO)[C@@H]2C(O)=O)=C1 XRBSKUSTLXISAB-XVVDYKMHSA-N 0.000 description 2
- NRUKOCRGYNPUPR-QBPJDGROSA-N (5S,5aR,8aR,9R)-5-[[(2R,4aR,6R,7R,8R,8aS)-7,8-dihydroxy-2-thiophen-2-yl-4,4a,6,7,8,8a-hexahydropyrano[3,2-d][1,3]dioxin-6-yl]oxy]-9-(4-hydroxy-3,5-dimethoxyphenyl)-5a,6,8a,9-tetrahydro-5H-[2]benzofuro[6,5-f][1,3]benzodioxol-8-one Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@@H](OC[C@H]4O3)C=3SC=CC=3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 NRUKOCRGYNPUPR-QBPJDGROSA-N 0.000 description 2
- JXVAMODRWBNUSF-KZQKBALLSA-N (7S,9R,10R)-7-[(2R,4S,5S,6S)-5-[[(2S,4aS,5aS,7S,9S,9aR,10aR)-2,9-dimethyl-3-oxo-4,4a,5a,6,7,9,9a,10a-octahydrodipyrano[4,2-a:4',3'-e][1,4]dioxin-7-yl]oxy]-4-(dimethylamino)-6-methyloxan-2-yl]oxy-10-[(2S,4S,5S,6S)-4-(dimethylamino)-5-hydroxy-6-methyloxan-2 Chemical compound O([C@@H]1C2=C(O)C=3C(=O)C4=CC=CC(O)=C4C(=O)C=3C(O)=C2[C@@H](O[C@@H]2O[C@@H](C)[C@@H](O[C@@H]3O[C@@H](C)[C@H]4O[C@@H]5O[C@@H](C)C(=O)C[C@@H]5O[C@H]4C3)[C@H](C2)N(C)C)C[C@]1(O)CC)[C@H]1C[C@H](N(C)C)[C@H](O)[C@H](C)O1 JXVAMODRWBNUSF-KZQKBALLSA-N 0.000 description 2
- KMSKQZKKOZQFFG-YXRRJAAWSA-N (7S,9S)-7-[(2R,4S,5S,6S)-4-amino-6-methyl-5-[(2R)-oxan-2-yl]oxyoxan-2-yl]oxy-6,9,11-trihydroxy-9-(2-hydroxyacetyl)-4-methoxy-8,10-dihydro-7H-tetracene-5,12-dione Chemical compound O([C@H]1[C@@H](N)C[C@@H](O[C@H]1C)O[C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@@H]1CCCCO1 KMSKQZKKOZQFFG-YXRRJAAWSA-N 0.000 description 2
- AESVUZLWRXEGEX-GJPCMZTKSA-N (7S,9S)-7-[(2S,4R,5R,6R)-4-amino-5-hydroxy-6-methyloxan-2-yl]oxy-6,9,11-trihydroxy-9-(2-hydroxyacetyl)-4-methoxy-8,10-dihydro-7H-tetracene-5,12-dione;iron(3+) Chemical compound [Fe+3].O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@@H]1C[C@@H](N)[C@@H](O)[C@@H](C)O1 AESVUZLWRXEGEX-GJPCMZTKSA-N 0.000 description 2
- IEXUMDBQLIVNHZ-YOUGDJEHSA-N (8S,11R,13R,14S,17S)-11-[4-(dimethylamino)phenyl]-17-hydroxy-17-(3-hydroxypropyl)-13-methyl-1,2,6,7,8,11,12,14,15,16-decahydrocyclopenta[a]phenanthren-3-one Chemical compound C1=CC(N(C)C)=CC=C1[C@@H]1C2=C3CCC(=O)C=C3CC[C@H]2[C@H](CC[C@]2(O)CCCO)[C@@]2(C)C1 IEXUMDBQLIVNHZ-YOUGDJEHSA-N 0.000 description 2
- AGNGYMCLFWQVGX-AGFFZDDWSA-N (E)-1-[(2S)-2-amino-2-carboxyethoxy]-2-diazonioethenolate Chemical compound OC(=O)[C@@H](N)CO\C([O-])=C\[N+]#N AGNGYMCLFWQVGX-AGFFZDDWSA-N 0.000 description 2
- GDHFOVCRYCPOTK-QBFSEMIESA-N (Z)-2-cyano-3-cyclopropyl-3-hydroxy-N-[3-methyl-4-(trifluoromethyl)phenyl]prop-2-enamide Chemical compound C1=C(C(F)(F)F)C(C)=CC(NC(=O)C(\C#N)=C(/O)C2CC2)=C1 GDHFOVCRYCPOTK-QBFSEMIESA-N 0.000 description 2
- 108091010603 1 family Proteins 0.000 description 2
- COVZYZSDYWQREU-UHFFFAOYSA-N 1,4-Butanediol, dimethanesulfonate Chemical compound CS(=O)(=O)OCCCCOS(C)(=O)=O COVZYZSDYWQREU-UHFFFAOYSA-N 0.000 description 2
- BTOTXLJHDSNXMW-POYBYMJQSA-N 1-[(2R,5S)-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O1[C@H](CO)CC[C@@H]1N1C(=O)NC(=O)C=C1 BTOTXLJHDSNXMW-POYBYMJQSA-N 0.000 description 2
- MHKBMNACOMRIAW-UHFFFAOYSA-N 2,3-dinitrophenol Chemical compound OC1=CC=CC([N+]([O-])=O)=C1[N+]([O-])=O MHKBMNACOMRIAW-UHFFFAOYSA-N 0.000 description 2
- BOMZMNZEXMAQQW-UHFFFAOYSA-N 2,5,11-trimethyl-6H-pyrido[4,3-b]carbazol-2-ium-9-ol;acetate Chemical compound CC([O-])=O.C[N+]1=CC=C2C(C)=C(NC=3C4=CC(O)=CC=3)C4=C(C)C2=C1 BOMZMNZEXMAQQW-UHFFFAOYSA-N 0.000 description 2
- FOYWNSCCNCUEPU-UHFFFAOYSA-N 2-[[2-[bis(2-hydroxyethyl)amino]-4-piperidin-1-ylpyrimido[5,4-d]pyrimidin-6-yl]-(2-hydroxyethyl)amino]ethanol Chemical compound C12=NC(N(CCO)CCO)=NC=C2N=C(N(CCO)CCO)N=C1N1CCCCC1 FOYWNSCCNCUEPU-UHFFFAOYSA-N 0.000 description 2
- QCXJFISCRQIYID-IAEPZHFASA-N 2-amino-1-N-[(3S,6S,7R,10S,16S)-3-[(2S)-butan-2-yl]-7,11,14-trimethyl-2,5,9,12,15-pentaoxo-10-propan-2-yl-8-oxa-1,4,11,14-tetrazabicyclo[14.3.0]nonadecan-6-yl]-4,6-dimethyl-3-oxo-9-N-[(3S,6S,7R,10S,16S)-7,11,14-trimethyl-2,5,9,12,15-pentaoxo-3,10-di(propa Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N=C2C(C(=O)N[C@@H]3C(=O)N[C@H](C(N4CCC[C@H]4C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]3C)=O)[C@@H](C)CC)=C(N)C(=O)C(C)=C2O2)C2=C(C)C=C1 QCXJFISCRQIYID-IAEPZHFASA-N 0.000 description 2
- FDAYLTPAFBGXAB-UHFFFAOYSA-N 2-chloro-N,N-bis(2-chloroethyl)ethanamine Chemical compound ClCCN(CCCl)CCCl FDAYLTPAFBGXAB-UHFFFAOYSA-N 0.000 description 2
- VNBAOSVONFJBKP-UHFFFAOYSA-N 2-chloro-N,N-bis(2-chloroethyl)propan-1-amine;hydrochloride Chemical compound Cl.CC(Cl)CN(CCCl)CCCl VNBAOSVONFJBKP-UHFFFAOYSA-N 0.000 description 2
- SYNHCENRCUAUNM-UHFFFAOYSA-N 2-chloro-N-(2-chloroethyl)-N-methylethanamine oxide;hydron;chloride Chemical compound Cl.ClCC[N+]([O-])(C)CCCl SYNHCENRCUAUNM-UHFFFAOYSA-N 0.000 description 2
- DBIGHPPNXATHOF-UHFFFAOYSA-N 3-(3-methylsulfonyloxypropylamino)propyl methanesulfonate Chemical compound CS(=O)(=O)OCCCNCCCOS(C)(=O)=O DBIGHPPNXATHOF-UHFFFAOYSA-N 0.000 description 2
- PWMYMKOUNYTVQN-UHFFFAOYSA-N 3-(8,8-diethyl-2-aza-8-germaspiro[4.5]decan-2-yl)-N,N-dimethylpropan-1-amine Chemical compound C1C[Ge](CC)(CC)CCC11CN(CCCN(C)C)CC1 PWMYMKOUNYTVQN-UHFFFAOYSA-N 0.000 description 2
- TVZGACDUOSZQKY-LBPRGKRZSA-N 4-aminofolic acid Chemical compound C1=NC2=NC(N)=NC(N)=C2N=C1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 TVZGACDUOSZQKY-LBPRGKRZSA-N 0.000 description 2
- YFTWHEBLORWGNI-UHFFFAOYSA-N 6-(3-methyl-5-nitroimidazol-4-yl)sulfanyl-7H-purin-2-amine Chemical compound CN1C=NC([N+]([O-])=O)=C1SC1=NC(N)=NC2=C1NC=N2 YFTWHEBLORWGNI-UHFFFAOYSA-N 0.000 description 2
- YCWQAMGASJSUIP-YFKPBYRVSA-N 6-Diazo-5-oxo-L-norleucine Chemical compound OC(=O)[C@@H](N)CCC(=O)C=[N+]=[N-] YCWQAMGASJSUIP-YFKPBYRVSA-N 0.000 description 2
- WYXSYVWAUAUWLD-SHUUEZRQSA-N 6-azauridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=N1 WYXSYVWAUAUWLD-SHUUEZRQSA-N 0.000 description 2
- 101700002015 ABCB1 Proteins 0.000 description 2
- 102100017861 ABCC2 Human genes 0.000 description 2
- 102100006348 ABCC4 Human genes 0.000 description 2
- 101710024119 ABCC4 Proteins 0.000 description 2
- 102100002706 ABCG2 Human genes 0.000 description 2
- 102100019002 ABL1 Human genes 0.000 description 2
- 101700000782 ABL1 Proteins 0.000 description 2
- 102100019003 ABL2 Human genes 0.000 description 2
- 101700048259 ABL2 Proteins 0.000 description 2
- 102100011408 ACACA Human genes 0.000 description 2
- 101700056202 ACACA Proteins 0.000 description 2
- 102100010371 ADGRA2 Human genes 0.000 description 2
- 101710005080 ADGRA2 Proteins 0.000 description 2
- 101700061133 ADH1B Proteins 0.000 description 2
- 102100000357 ADH1B Human genes 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N ADRIAMYCIN Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- 108060000207 AK3 Proteins 0.000 description 2
- 102100001248 AKT1 Human genes 0.000 description 2
- 101700006234 AKT1 Proteins 0.000 description 2
- 102100001250 AKT2 Human genes 0.000 description 2
- 102100006725 AKT3 Human genes 0.000 description 2
- 101700004058 AKT3 Proteins 0.000 description 2
- 229940100198 ALKYLATING AGENTS Drugs 0.000 description 2
- 101710036546 ALOX5 Proteins 0.000 description 2
- 229940030486 ANDROGENS Drugs 0.000 description 2
- 229940030495 ANTIANDROGEN SEX HORMONES AND MODULATORS OF THE GENITAL SYSTEM Drugs 0.000 description 2
- 229940100197 ANTIMETABOLITES Drugs 0.000 description 2
- 102100007788 APC Human genes 0.000 description 2
- 101700010938 APC Proteins 0.000 description 2
- 102100011069 ARAF Human genes 0.000 description 2
- 101700086422 ARAF Proteins 0.000 description 2
- 102100013036 ARFRP1 Human genes 0.000 description 2
- 101710036280 ARFRP1 Proteins 0.000 description 2
- 102100011882 ARID1A Human genes 0.000 description 2
- 101710037383 ARID1A Proteins 0.000 description 2
- 102100000648 ATM Human genes 0.000 description 2
- 108060006202 ATM Proteins 0.000 description 2
- 108060000721 ATR Proteins 0.000 description 2
- 102100010553 AURKB Human genes 0.000 description 2
- 101700037792 AURKB Proteins 0.000 description 2
- ZOZKYEHVNDEUCO-XUTVFYLZSA-N Aceglatone Chemical compound O1C(=O)[C@H](OC(C)=O)[C@@H]2OC(=O)[C@@H](OC(=O)C)[C@@H]21 ZOZKYEHVNDEUCO-XUTVFYLZSA-N 0.000 description 2
- 229950002684 Aceglatone Drugs 0.000 description 2
- 208000008919 Achondroplasia Diseases 0.000 description 2
- 208000004064 Acoustic Neuroma Diseases 0.000 description 2
- TXUZVZSFRXZGTL-QPLCGJKRSA-N Afimoxifene Chemical compound C=1C=CC=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=C(O)C=C1 TXUZVZSFRXZGTL-QPLCGJKRSA-N 0.000 description 2
- 229940045714 Alkyl sulfonate alkylating agents Drugs 0.000 description 2
- 241000710929 Alphavirus Species 0.000 description 2
- UUVWYPNAQBNQJQ-UHFFFAOYSA-N Altretamine Chemical compound CN(C)C1=NC(N(C)C)=NC(N(C)C)=N1 UUVWYPNAQBNQJQ-UHFFFAOYSA-N 0.000 description 2
- 206010001897 Alzheimer's disease Diseases 0.000 description 2
- 102000007325 Amelogenin Human genes 0.000 description 2
- 108010007570 Amelogenin Proteins 0.000 description 2
- 229960003437 Aminoglutethimide Drugs 0.000 description 2
- ROBVIMPUHSLWNV-UHFFFAOYSA-N Aminoglutethimide Chemical compound C=1C=C(N)C=CC=1C1(CC)CCC(=O)NC1=O ROBVIMPUHSLWNV-UHFFFAOYSA-N 0.000 description 2
- 229960003896 Aminopterin Drugs 0.000 description 2
- 229960001220 Amsacrine Drugs 0.000 description 2
- XCPGHVQEEXUHNC-UHFFFAOYSA-N Amsacrine Chemical compound COC1=CC(NS(C)(=O)=O)=CC=C1NC1=C(C=CC=C2)C2=NC2=CC=CC=C12 XCPGHVQEEXUHNC-UHFFFAOYSA-N 0.000 description 2
- 102000005906 Anaplastic Lymphoma Kinase Human genes 0.000 description 2
- 108010005474 Anaplastic Lymphoma Kinase Proteins 0.000 description 2
- 208000009575 Angelman Syndrome Diseases 0.000 description 2
- 206010002383 Angina pectoris Diseases 0.000 description 2
- 229940046836 Anti-estrogens Drugs 0.000 description 2
- 229940064005 Antibiotic throat preparations Drugs 0.000 description 2
- 229940083879 Antibiotics FOR TREATMENT OF HEMORRHOIDS AND ANAL FISSURES FOR TOPICAL USE Drugs 0.000 description 2
- 229940042052 Antibiotics for systemic use Drugs 0.000 description 2
- 229940042786 Antitubercular Antibiotics Drugs 0.000 description 2
- 206010073360 Appendix cancer Diseases 0.000 description 2
- 108010093579 Arachidonate 5-Lipoxygenase Proteins 0.000 description 2
- 241000712891 Arenavirus Species 0.000 description 2
- 201000009695 Argentine hemorrhagic fever Diseases 0.000 description 2
- 108010078554 Aromatase Proteins 0.000 description 2
- 206010003210 Arteriosclerosis Diseases 0.000 description 2
- 241001480043 Arthrodermataceae Species 0.000 description 2
- 206010003571 Astrocytoma Diseases 0.000 description 2
- 102000004000 Aurora Kinase A Human genes 0.000 description 2
- 108090000461 Aurora Kinase A Proteins 0.000 description 2
- 208000010061 Autosomal Dominant Polycystic Kidney Diseases 0.000 description 2
- 206010064097 Avian influenza Diseases 0.000 description 2
- 229960002756 Azacitidine Drugs 0.000 description 2
- 229950011321 Azaserine Drugs 0.000 description 2
- 102100013894 BCL2 Human genes 0.000 description 2
- 108060000885 BCL2 Proteins 0.000 description 2
- 102100018921 BCL2A1 Human genes 0.000 description 2
- 101710002713 BCL2A1 Proteins 0.000 description 2
- 102100015655 BCL2L1 Human genes 0.000 description 2
- 101710032374 BCL2L1 Proteins 0.000 description 2
- 102100015652 BCL2L2 Human genes 0.000 description 2
- 101710032376 BCL2L2-PABPN1 Proteins 0.000 description 2
- 102100011377 BCL6 Human genes 0.000 description 2
- 101700024247 BCL6 Proteins 0.000 description 2
- 101710019578 BZIP46 Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 206010004146 Basal cell carcinoma Diseases 0.000 description 2
- GQYIWUVLTXOXAJ-UHFFFAOYSA-N Belustine Chemical compound ClCCN(N=O)C(=O)NC1CCCCC1 GQYIWUVLTXOXAJ-UHFFFAOYSA-N 0.000 description 2
- VGGGPCQERPFHOB-MCIONIFRSA-N Bestatin Chemical compound CC(C)C[C@H](C(O)=O)NC(=O)[C@@H](O)[C@H](N)CC1=CC=CC=C1 VGGGPCQERPFHOB-MCIONIFRSA-N 0.000 description 2
- 206010004593 Bile duct cancer Diseases 0.000 description 2
- 206010005003 Bladder cancer Diseases 0.000 description 2
- 241000335423 Blastomyces Species 0.000 description 2
- OYVAGSVQBOHSSS-WXFSZRTFSA-O Bleomycin Chemical class N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1NC=NC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-WXFSZRTFSA-O 0.000 description 2
- 108010006654 Bleomycin Proteins 0.000 description 2
- 201000009694 Bolivian hemorrhagic fever Diseases 0.000 description 2
- 208000003362 Bronchogenic Carcinoma Diseases 0.000 description 2
- 101710009063 CARD11 Proteins 0.000 description 2
- 102100002586 CARD11 Human genes 0.000 description 2
- XREUEWVEMYWFFA-CSKJXFQVSA-N CARUBICIN Chemical compound C1[C@H](N)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1C2=C(O)C(C(=O)C3=C(O)C=CC=C3C3=O)=C3C(O)=C2C[C@@](O)(C(C)=O)C1 XREUEWVEMYWFFA-CSKJXFQVSA-N 0.000 description 2
- 229950001725 CARUBICIN Drugs 0.000 description 2
- 102100008372 CBL Human genes 0.000 description 2
- 108050009659 CBL Proteins 0.000 description 2
- 102100019529 CCND1 Human genes 0.000 description 2
- 101700059002 CCND2 Proteins 0.000 description 2
- 102100019530 CCND2 Human genes 0.000 description 2
- 101700079292 CCND3 Proteins 0.000 description 2
- 102100016486 CCND3 Human genes 0.000 description 2
- 102100016490 CCNE1 Human genes 0.000 description 2
- 101700061678 CCNE1 Proteins 0.000 description 2
- 101710005912 CDH2 Proteins 0.000 description 2
- 102100014481 CDH2 Human genes 0.000 description 2
- 101710035596 CDH5 Proteins 0.000 description 2
- 102100005687 CDH5 Human genes 0.000 description 2
- 101700008359 CDK4 Proteins 0.000 description 2
- 102100019398 CDK4 Human genes 0.000 description 2
- 102100006130 CDK6 Human genes 0.000 description 2
- 102100003970 CDK8 Human genes 0.000 description 2
- 102100019348 CEBPA Human genes 0.000 description 2
- 101700058775 CEBPA Proteins 0.000 description 2
- 101710015564 CHEK1 Proteins 0.000 description 2
- 102100019702 CHEK1 Human genes 0.000 description 2
- 102100019698 CHEK2 Human genes 0.000 description 2
- 108060006647 CHEK2 Proteins 0.000 description 2
- FDKXTQMXEQVLRF-ZHACJKMWSA-N CN(C)\N=N\c1[nH]cnc1C(N)=O Chemical compound CN(C)\N=N\c1[nH]cnc1C(N)=O FDKXTQMXEQVLRF-ZHACJKMWSA-N 0.000 description 2
- 101700072217 CRKL Proteins 0.000 description 2
- 102100011432 CRKL Human genes 0.000 description 2
- 102100005170 CRLF2 Human genes 0.000 description 2
- 101700020064 CRLF2 Proteins 0.000 description 2
- 102100002043 CTNNB1 Human genes 0.000 description 2
- 101710005974 CTNNB1 Proteins 0.000 description 2
- 102100005573 CYP19A1 Human genes 0.000 description 2
- 102100004047 CYP1B1 Human genes 0.000 description 2
- 101710036800 CYP1B1 Proteins 0.000 description 2
- 102100017368 CYP2C8 Human genes 0.000 description 2
- 229950009908 Cactinomycin Drugs 0.000 description 2
- HXCHCVDVKSCDHU-LULTVBGHSA-N Calicheamicin Chemical compound C1[C@H](OC)[C@@H](NCC)CO[C@H]1O[C@H]1[C@H](O[C@@H]2C\3=C(NC(=O)OC)C(=O)C[C@](C/3=C/CSSSC)(O)C#C\C=C/C#C2)O[C@H](C)[C@@H](NO[C@@H]2O[C@H](C)[C@@H](SC(=O)C=3C(=C(OC)C(O[C@H]4[C@@H]([C@H](OC)[C@@H](O)[C@H](C)O4)O)=C(I)C=3C)OC)[C@@H](O)C2)[C@@H]1O HXCHCVDVKSCDHU-LULTVBGHSA-N 0.000 description 2
- IVFYLRMMHVYGJH-PVPPCFLZSA-N Calusterone Chemical compound C1C[C@]2(C)[C@](O)(C)CC[C@H]2[C@@H]2[C@@H](C)CC3=CC(=O)CC[C@]3(C)[C@H]21 IVFYLRMMHVYGJH-PVPPCFLZSA-N 0.000 description 2
- 229950009823 Calusterone Drugs 0.000 description 2
- 241000589876 Campylobacter Species 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 2
- 229960004562 Carboplatin Drugs 0.000 description 2
- OLESAACUTLOWQZ-UHFFFAOYSA-L Carboplatin Chemical compound O=C1O[Pt]([N]([H])([H])[H])([N]([H])([H])[H])OC(=O)C11CCC1 OLESAACUTLOWQZ-UHFFFAOYSA-L 0.000 description 2
- SHHKQEUPHAENFK-UHFFFAOYSA-N Carboquone Chemical compound O=C1C(C)=C(N2CC2)C(=O)C(C(COC(N)=O)OC)=C1N1CC1 SHHKQEUPHAENFK-UHFFFAOYSA-N 0.000 description 2
- 208000002458 Carcinoid Tumor Diseases 0.000 description 2
- 241000700199 Cavia porcellus Species 0.000 description 2
- 206010008342 Cervix carcinoma Diseases 0.000 description 2
- 201000011470 Charcot-Marie-Tooth disease Diseases 0.000 description 2
- JWBOIMRXGHLCPP-UHFFFAOYSA-N Chloditan Chemical compound C=1C=CC=C(Cl)C=1C(C(Cl)Cl)C1=CC=C(Cl)C=C1 JWBOIMRXGHLCPP-UHFFFAOYSA-N 0.000 description 2
- 102000011045 Chloride Channels Human genes 0.000 description 2
- 108010062745 Chloride Channels Proteins 0.000 description 2
- HAWPXGHAZFHHAD-UHFFFAOYSA-N Chlormethine Chemical compound ClCCN(C)CCCl HAWPXGHAZFHHAD-UHFFFAOYSA-N 0.000 description 2
- 229950008249 Chlornaphazine Drugs 0.000 description 2
- XCDXSSFOJZZGQC-UHFFFAOYSA-N Chlornaphazine Chemical compound C1=CC=CC2=CC(N(CCCl)CCCl)=CC=C21 XCDXSSFOJZZGQC-UHFFFAOYSA-N 0.000 description 2
- MKQWTWSXVILIKJ-LXGUWJNJSA-N Chlorozotocin Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](C=O)NC(=O)N(N=O)CCCl MKQWTWSXVILIKJ-LXGUWJNJSA-N 0.000 description 2
- 206010008723 Chondrodystrophy Diseases 0.000 description 2
- 208000005243 Chondrosarcoma Diseases 0.000 description 2
- 208000006332 Choriocarcinoma Diseases 0.000 description 2
- 210000000349 Chromosomes Anatomy 0.000 description 2
- 206010065854 Chronic eosinophilic leukaemia Diseases 0.000 description 2
- 241000222290 Cladosporium Species 0.000 description 2
- 241000193403 Clostridium Species 0.000 description 2
- 210000001072 Colon Anatomy 0.000 description 2
- 241000711573 Coronaviridae Species 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- 206010070976 Craniocerebral injury Diseases 0.000 description 2
- 208000009798 Craniopharyngioma Diseases 0.000 description 2
- 241000150230 Crimean-Congo hemorrhagic fever orthonairovirus Species 0.000 description 2
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 2
- 108010058546 Cyclin D1 Proteins 0.000 description 2
- 108010025468 Cyclin-Dependent Kinase 6 Proteins 0.000 description 2
- 108010025415 Cyclin-Dependent Kinase 8 Proteins 0.000 description 2
- 108010009356 Cyclin-Dependent Kinase Inhibitor p15 Proteins 0.000 description 2
- 102000009512 Cyclin-Dependent Kinase Inhibitor p15 Human genes 0.000 description 2
- 108010009367 Cyclin-Dependent Kinase Inhibitor p18 Proteins 0.000 description 2
- 102000009503 Cyclin-Dependent Kinase Inhibitor p18 Human genes 0.000 description 2
- 229940119017 Cyclosporine Drugs 0.000 description 2
- 108010036949 Cyclosporine Proteins 0.000 description 2
- 229960000684 Cytarabine Drugs 0.000 description 2
- 108010000561 Cytochrome P-450 CYP2C8 Proteins 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- UHDGCWIWMRVCDJ-CCXZUQQUSA-N Cytosar Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](CO)O1 UHDGCWIWMRVCDJ-CCXZUQQUSA-N 0.000 description 2
- STQGQHZAVUOBTE-VGBVRHCVSA-N DAUNOMYCIN Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(C)=O)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 STQGQHZAVUOBTE-VGBVRHCVSA-N 0.000 description 2
- 239000003298 DNA probe Substances 0.000 description 2
- 102100006402 DNMT3A Human genes 0.000 description 2
- 101710038368 DNMT3A Proteins 0.000 description 2
- 102100002445 DNTT Human genes 0.000 description 2
- 101700067544 DOT1L Proteins 0.000 description 2
- 102100018143 DOT1L Human genes 0.000 description 2
- 229960000640 Dactinomycin Drugs 0.000 description 2
- 108010092160 Dactinomycin Proteins 0.000 description 2
- 229960000975 Daunorubicin Drugs 0.000 description 2
- NNJPGOLRFBJNIW-HNNXBMFYSA-N Demecolcine Chemical compound C1=C(OC)C(=O)C=C2[C@@H](NC)CCC3=CC(OC)=C(OC)C(OC)=C3C2=C1 NNJPGOLRFBJNIW-HNNXBMFYSA-N 0.000 description 2
- 208000001490 Dengue Diseases 0.000 description 2
- 206010012310 Dengue fever Diseases 0.000 description 2
- 229950003913 Detorubicin Drugs 0.000 description 2
- BMKDZUISNHGIBY-ZETCQYMHSA-N Dexrazoxane Chemical compound C([C@H](C)N1CC(=O)NC(=O)C1)N1CC(=O)NC(=O)C1 BMKDZUISNHGIBY-ZETCQYMHSA-N 0.000 description 2
- 208000000398 DiGeorge Syndrome Diseases 0.000 description 2
- 206010012818 Diffuse large B-cell lymphoma Diseases 0.000 description 2
- 108010066455 Dihydrouracil Dehydrogenase (NADP) Proteins 0.000 description 2
- ZDZOTLJHXYCWBA-VCVYQWHSSA-N Docetaxel Chemical compound O([C@H]1[C@H]2[C@@](C([C@H](O)C3=C(C)[C@@H](OC(=O)[C@H](O)[C@@H](NC(=O)OC(C)(C)C)C=4C=CC=CC=4)C[C@]1(O)C3(C)C)=O)(C)[C@@H](O)C[C@H]1OC[C@]12OC(=O)C)C(=O)C1=CC=CC=C1 ZDZOTLJHXYCWBA-VCVYQWHSSA-N 0.000 description 2
- 229960004679 Doxorubicin Drugs 0.000 description 2
- NOTIQUSPUUHHEH-UXOVVSIBSA-N Drostanolone propionate Chemical compound C([C@@H]1CC2)C(=O)[C@H](C)C[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H](OC(=O)CC)[C@@]2(C)CC1 NOTIQUSPUUHHEH-UXOVVSIBSA-N 0.000 description 2
- 206010013801 Duchenne muscular dystrophy Diseases 0.000 description 2
- 241001520695 Duvenhage lyssavirus Species 0.000 description 2
- 108010069091 Dystrophin Proteins 0.000 description 2
- 102000007698 EC 1.1.1.1 Human genes 0.000 description 2
- 108010021809 EC 1.1.1.1 Proteins 0.000 description 2
- 102000016354 EC 2.4.1.17 Human genes 0.000 description 2
- 108010092364 EC 2.4.1.17 Proteins 0.000 description 2
- 229940121647 EGFR inhibitors Drugs 0.000 description 2
- 229940022766 EGTA Drugs 0.000 description 2
- 102100001810 EPHA3 Human genes 0.000 description 2
- 101700049294 EPHA3 Proteins 0.000 description 2
- 102100001726 EPHA5 Human genes 0.000 description 2
- 101710025984 EPHA5 Proteins 0.000 description 2
- 102100001727 EPHA6 Human genes 0.000 description 2
- 101700082598 EPHA6 Proteins 0.000 description 2
- 102100001728 EPHA7 Human genes 0.000 description 2
- 101700020058 EPHA7 Proteins 0.000 description 2
- 102100009838 EPHB1 Human genes 0.000 description 2
- 101700067933 EPHB1 Proteins 0.000 description 2
- 102100009831 EPHB4 Human genes 0.000 description 2
- 102100009565 EPHB6 Human genes 0.000 description 2
- 101700059782 EPHB6 Proteins 0.000 description 2
- 102100017900 EPHX2 Human genes 0.000 description 2
- 101710030425 EPHX2 Proteins 0.000 description 2
- 229960001904 EPIRUBICIN Drugs 0.000 description 2
- AOJJSUZBOXZQNB-VTZDEGQISA-N EPIRUBICIN Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-VTZDEGQISA-N 0.000 description 2
- 101700041204 ERBB3 Proteins 0.000 description 2
- 102000027776 ERBB3 Human genes 0.000 description 2
- 101700023619 ERBB4 Proteins 0.000 description 2
- 102100009851 ERBB4 Human genes 0.000 description 2
- 102100010876 ERCC2 Human genes 0.000 description 2
- 101700055371 ERG Proteins 0.000 description 2
- ITSGNOIFAJAQHJ-BMFNZSJVSA-N ESORUBICIN Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)C[C@H](C)O1 ITSGNOIFAJAQHJ-BMFNZSJVSA-N 0.000 description 2
- 102100016692 ESR1 Human genes 0.000 description 2
- 102100016237 ESR2 Human genes 0.000 description 2
- 102100002090 ETV1 Human genes 0.000 description 2
- 101700001792 ETV1 Proteins 0.000 description 2
- 102100002087 ETV4 Human genes 0.000 description 2
- 101700082619 ETV4 Proteins 0.000 description 2
- 102100002088 ETV5 Human genes 0.000 description 2
- 101700050069 ETV5 Proteins 0.000 description 2
- 102100002089 ETV6 Human genes 0.000 description 2
- 101700053672 ETV6 Proteins 0.000 description 2
- 102100016041 EZH2 Human genes 0.000 description 2
- 101700041849 EZH2 Proteins 0.000 description 2
- 241000710945 Eastern equine encephalitis virus Species 0.000 description 2
- 206010014071 Ebola disease Diseases 0.000 description 2
- 201000011001 Ebola hemorrhagic fever Diseases 0.000 description 2
- 229950000549 Elliptinium acetate Drugs 0.000 description 2
- 206010014733 Endometrial cancer Diseases 0.000 description 2
- SAMRUMKYXPVKPA-VFKOLLTISA-N Enocitabine Chemical compound O=C1N=C(NC(=O)CCCCCCCCCCCCCCCCCCCCC)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](CO)O1 SAMRUMKYXPVKPA-VFKOLLTISA-N 0.000 description 2
- 229950011487 Enocitabine Drugs 0.000 description 2
- 206010014950 Eosinophilia Diseases 0.000 description 2
- 206010014967 Ependymoma Diseases 0.000 description 2
- 108010055323 EphB4 Receptor Proteins 0.000 description 2
- 241001455610 Ephemerovirus Species 0.000 description 2
- 229950002973 Epitiostanol Drugs 0.000 description 2
- 241000283086 Equidae Species 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- 229950002017 Esorubicin Drugs 0.000 description 2
- 208000002047 Essential Thrombocythemia Diseases 0.000 description 2
- 229960001842 Estramustine Drugs 0.000 description 2
- FRPJXPJMRWBBIH-RBRWEJTLSA-N Estramustine Chemical compound ClCCN(CCCl)C(=O)OC1=CC=C2[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 FRPJXPJMRWBBIH-RBRWEJTLSA-N 0.000 description 2
- 229960005237 Etoglucid Drugs 0.000 description 2
- UMILHIMHKXVDGH-UHFFFAOYSA-N Etoglucid Chemical compound C1OC1COCCOCCOCCOCC1CO1 UMILHIMHKXVDGH-UHFFFAOYSA-N 0.000 description 2
- 229960005420 Etoposide Drugs 0.000 description 2
- VJJPUSNTGOMMGY-MRVIYFEKSA-N Etoposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@H](C)OC[C@H]4O3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 VJJPUSNTGOMMGY-MRVIYFEKSA-N 0.000 description 2
- 241000579695 European bat 1 lyssavirus Species 0.000 description 2
- 208000006168 Ewing Sarcoma Diseases 0.000 description 2
- 108010062201 F-Box-WD Repeat-Containing Protein 7 Proteins 0.000 description 2
- 102100020361 F5 Human genes 0.000 description 2
- 102100000368 F8 Human genes 0.000 description 2
- 102100020077 FBXW7 Human genes 0.000 description 2
- 102100015541 FCGR3A Human genes 0.000 description 2
- 101710044656 FCGR3A Proteins 0.000 description 2
- 102000027757 FGF receptors Human genes 0.000 description 2
- 108091008101 FGF receptors Proteins 0.000 description 2
- 102100017996 FGFR1 Human genes 0.000 description 2
- 102100018000 FGFR2 Human genes 0.000 description 2
- 101700075612 FGFR4 Proteins 0.000 description 2
- 102100020189 FGFR4 Human genes 0.000 description 2
- 102100006565 FLT1 Human genes 0.000 description 2
- 101710030892 FLT1 Proteins 0.000 description 2
- 102100004573 FLT3 Human genes 0.000 description 2
- 101710009074 FLT3 Proteins 0.000 description 2
- 102100013182 FLT4 Human genes 0.000 description 2
- 102100014905 FMR1 Human genes 0.000 description 2
- 108060002977 FMR1 Proteins 0.000 description 2
- 102100014881 FOXP4 Human genes 0.000 description 2
- 101700084508 FOXP4 Proteins 0.000 description 2
- 108010014172 Factor V Proteins 0.000 description 2
- 108010054218 Factor VIII Proteins 0.000 description 2
- 102000001690 Factor VIII Human genes 0.000 description 2
- 108010087740 Fanconi Anemia Complementation Group A Protein Proteins 0.000 description 2
- 102000009095 Fanconi Anemia Complementation Group A Protein Human genes 0.000 description 2
- 201000000106 Fanconi anemia complementation group A Diseases 0.000 description 2
- 229940043168 Fareston Drugs 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 241000710781 Flaviviridae Species 0.000 description 2
- ODKNJVUHOIMIIZ-RRKCRQDMSA-N Floxuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(F)=C1 ODKNJVUHOIMIIZ-RRKCRQDMSA-N 0.000 description 2
- 229960000961 Floxuridine Drugs 0.000 description 2
- MKXKFYHWDHIYRV-UHFFFAOYSA-N Flutamide Chemical compound CC(C)C(=O)NC1=CC=C([N+]([O-])=O)C(C(F)(F)F)=C1 MKXKFYHWDHIYRV-UHFFFAOYSA-N 0.000 description 2
- 229960000304 Folic Acid Drugs 0.000 description 2
- 208000001914 Fragile X Syndrome Diseases 0.000 description 2
- 108009000484 Fragile X Syndrome Proteins 0.000 description 2
- 241000714188 Friend murine leukemia virus Species 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 241000223218 Fusarium Species 0.000 description 2
- 101700048985 GATA1 Proteins 0.000 description 2
- 102100010478 GATA1 Human genes 0.000 description 2
- 102100018836 GATC Human genes 0.000 description 2
- 108060003107 GATC Proteins 0.000 description 2
- 102100008019 GNA11 Human genes 0.000 description 2
- 101700048596 GNA11 Proteins 0.000 description 2
- 102100014305 GNAQ Human genes 0.000 description 2
- 101700035643 GNAQ Proteins 0.000 description 2
- 102000033185 GNAS Human genes 0.000 description 2
- 101700086896 GNAS Proteins 0.000 description 2
- 101710036434 GUCY1A2 Proteins 0.000 description 2
- 102100010488 GUCY1A2 Human genes 0.000 description 2
- 101710041546 Galphas Proteins 0.000 description 2
- 206010018048 Gaucher's disease Diseases 0.000 description 2
- SDUQYLNIPVEERB-QPPQHZFASA-N Gemcitabine Chemical compound O=C1N=C(N)C=CN1[C@H]1C(F)(F)[C@H](O)[C@@H](CO)O1 SDUQYLNIPVEERB-QPPQHZFASA-N 0.000 description 2
- 206010018338 Glioma Diseases 0.000 description 2
- 102000004547 Glucosylceramidase Human genes 0.000 description 2
- 108010017544 Glucosylceramidase Proteins 0.000 description 2
- BLCLNMBMMGCOAS-URPVMXJPSA-N Goserelin Chemical compound C([C@@H](C(=O)N[C@H](COC(C)(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(=O)NNC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 BLCLNMBMMGCOAS-URPVMXJPSA-N 0.000 description 2
- 229960002913 Goserelin Drugs 0.000 description 2
- 108010069236 Goserelin Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N Guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 229940093922 Gynecological Antibiotics Drugs 0.000 description 2
- 102100020458 HLA-A Human genes 0.000 description 2
- 108010075704 HLA-A Antigens Proteins 0.000 description 2
- 102100020459 HLA-B Human genes 0.000 description 2
- 108010058607 HLA-B Antigens Proteins 0.000 description 2
- 102100020457 HLA-C Human genes 0.000 description 2
- 108010052199 HLA-C Antigens Proteins 0.000 description 2
- 108010010378 HLA-DP Antigens Proteins 0.000 description 2
- 102000015789 HLA-DP Antigens Human genes 0.000 description 2
- 108010062347 HLA-DQ Antigens Proteins 0.000 description 2
- 108010058597 HLA-DR Antigens Proteins 0.000 description 2
- 102000006354 HLA-DR Antigens Human genes 0.000 description 2
- 101710027869 HOXA3 Proteins 0.000 description 2
- 102100014797 HOXA3 Human genes 0.000 description 2
- 102100009283 HRAS Human genes 0.000 description 2
- 101710033925 HRAS Proteins 0.000 description 2
- 102100017052 HSP90AA1 Human genes 0.000 description 2
- 101710033238 HSP90AA1 Proteins 0.000 description 2
- 206010061192 Haemorrhagic fever Diseases 0.000 description 2
- 230000036499 Half live Effects 0.000 description 2
- 208000001258 Hemangiosarcoma Diseases 0.000 description 2
- 241000893570 Hendra henipavirus Species 0.000 description 2
- 241000711549 Hepacivirus C Species 0.000 description 2
- 241000700721 Hepatitis B virus Species 0.000 description 2
- 241000724675 Hepatitis E virus Species 0.000 description 2
- 241000724709 Hepatitis delta virus Species 0.000 description 2
- 241000709721 Hepatovirus A Species 0.000 description 2
- 208000009889 Herpes Simplex Diseases 0.000 description 2
- 208000007514 Herpes Zoster Diseases 0.000 description 2
- 102000016871 Hexosaminidase A Human genes 0.000 description 2
- 108010053317 Hexosaminidase A Proteins 0.000 description 2
- 241000228402 Histoplasma Species 0.000 description 2
- 206010020243 Hodgkin's disease Diseases 0.000 description 2
- 201000006743 Hodgkin's lymphoma Diseases 0.000 description 2
- 229940088597 Hormone Drugs 0.000 description 2
- 241001502974 Human gammaherpesvirus 8 Species 0.000 description 2
- 241000701027 Human herpesvirus 6 Species 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 2
- 201000001971 Huntington's disease Diseases 0.000 description 2
- 206010048643 Hypereosinophilic syndrome Diseases 0.000 description 2
- 206010020772 Hypertension Diseases 0.000 description 2
- 101710019545 IARS2 Proteins 0.000 description 2
- 102100004121 IDH1 Human genes 0.000 description 2
- 101700024037 IDH1 Proteins 0.000 description 2
- 101700030371 IDH2 Proteins 0.000 description 2
- 102100002772 IDH2 Human genes 0.000 description 2
- 101700066748 IDH3B Proteins 0.000 description 2
- 102100014263 IGF1R Human genes 0.000 description 2
- 101700025802 IGF1R Proteins 0.000 description 2
- 102100013307 IGF2R Human genes 0.000 description 2
- 101710032496 IGF2R Proteins 0.000 description 2
- 102100008723 IKBKE Human genes 0.000 description 2
- 101710002884 IKBKE Proteins 0.000 description 2
- 102100008719 IKZF1 Human genes 0.000 description 2
- 101700005406 IKZF1 Proteins 0.000 description 2
- 102100008238 INHBA Human genes 0.000 description 2
- 102100002730 IRS2 Human genes 0.000 description 2
- 101700043815 IRS2 Proteins 0.000 description 2
- 102100002913 ITPA Human genes 0.000 description 2
- 101700030116 ITPA Proteins 0.000 description 2
- 229940015872 Ibandronate Drugs 0.000 description 2
- 229960000908 Idarubicin Drugs 0.000 description 2
- XDXDZDZNSLXDNA-TZNDIEGXSA-N Idarubicin hydrochloride Chemical compound C1[C@H](N)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1C2=C(O)C(C(=O)C3=CC=CC=C3C3=O)=C3C(O)=C2C[C@@](O)(C(C)=O)C1 XDXDZDZNSLXDNA-TZNDIEGXSA-N 0.000 description 2
- DOUYETYNHWVLEO-UHFFFAOYSA-N Imiquimod Chemical compound C1=CC=CC2=C3N(CC(C)C)C=NC3=C(N)N=C21 DOUYETYNHWVLEO-UHFFFAOYSA-N 0.000 description 2
- 206010067917 Inflammatory myofibroblastic tumour Diseases 0.000 description 2
- 241000713297 Influenza C virus Species 0.000 description 2
- RCINICONZNJXQF-MZXODVADSA-N Intaxel Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 2
- GURKHSYORGJETM-WAQYZQTGSA-N Irinotecan hydrochloride Chemical compound Cl.C1=C2C(CC)=C3CN(C(C4=C([C@@](C(=O)OC4)(O)CC)C=4)=O)C=4C3=NC2=CC=C1OC(=O)N(CC1)CCC1N1CCCCC1 GURKHSYORGJETM-WAQYZQTGSA-N 0.000 description 2
- 102100019517 JAK1 Human genes 0.000 description 2
- 101700034277 JAK1 Proteins 0.000 description 2
- 102100019518 JAK3 Human genes 0.000 description 2
- 101700007593 JAK3 Proteins 0.000 description 2
- 101700011826 KCNH6 Proteins 0.000 description 2
- 102100004455 KMT2A Human genes 0.000 description 2
- 101700000155 KMT2A Proteins 0.000 description 2
- 210000003734 Kidney Anatomy 0.000 description 2
- 201000001430 Klinefelter's syndrome Diseases 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 108010001831 LDL receptors Proteins 0.000 description 2
- 108060004326 LDLR Proteins 0.000 description 2
- 102100012762 LRP1B Human genes 0.000 description 2
- 101700083303 LRP1B Proteins 0.000 description 2
- 102100012759 LRP2 Human genes 0.000 description 2
- 229950009700 Laflunimus Drugs 0.000 description 2
- 241001520693 Lagos bat lyssavirus Species 0.000 description 2
- GKWPCEFFIHSJOE-UHFFFAOYSA-N Laquinimod Chemical compound OC=1C2=C(Cl)C=CC=C2N(C)C(=O)C=1C(=O)N(CC)C1=CC=CC=C1 GKWPCEFFIHSJOE-UHFFFAOYSA-N 0.000 description 2
- 206010023927 Lassa fever Diseases 0.000 description 2
- 206010024190 Leiomyosarcomas Diseases 0.000 description 2
- 229920001491 Lentinan Polymers 0.000 description 2
- 208000009625 Lesch-Nyhan Syndrome Diseases 0.000 description 2
- 229940008250 Leuprolide Drugs 0.000 description 2
- 108010000817 Leuprolide Proteins 0.000 description 2
- 229960004338 Leuprorelin Drugs 0.000 description 2
- 206010024627 Liposarcoma Diseases 0.000 description 2
- WDRYRZXSPDWGEB-UHFFFAOYSA-N Lonidamine Chemical compound C12=CC=CC=C2C(C(=O)O)=NN1CC1=CC=C(Cl)C=C1Cl WDRYRZXSPDWGEB-UHFFFAOYSA-N 0.000 description 2
- 206010025224 Lymphangiosarcomas Diseases 0.000 description 2
- 241000711828 Lyssavirus Species 0.000 description 2
- 102100002988 MAN1B1 Human genes 0.000 description 2
- 101710009304 MAN1B1 Proteins 0.000 description 2
- GLFNIEUTAYBVOC-UHFFFAOYSA-L MANGANESE CHLORIDE Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 2
- 108010068342 MAP Kinase Kinase 1 Proteins 0.000 description 2
- 108010068353 MAP Kinase Kinase 2 Proteins 0.000 description 2
- 102100006473 MAP2K1 Human genes 0.000 description 2
- 102100015877 MAP2K2 Human genes 0.000 description 2
- 102100015874 MAP2K4 Human genes 0.000 description 2
- 101710027749 MAP2K4 Proteins 0.000 description 2
- 101700074567 MCF2L Proteins 0.000 description 2
- 102100002383 MCL1 Human genes 0.000 description 2
- 101700031439 MCL1 Proteins 0.000 description 2
- 102100019155 MDM2 Human genes 0.000 description 2
- 101700032565 MDM2 Proteins 0.000 description 2
- 108050005300 MDM4 Proteins 0.000 description 2
- 102000017274 MDM4 Human genes 0.000 description 2
- 102100019218 MEN1 Human genes 0.000 description 2
- 101700045140 MEN1 Proteins 0.000 description 2
- 101710026102 MIC-ACT-2 Proteins 0.000 description 2
- 102100014646 MITF Human genes 0.000 description 2
- 102100014416 MLH1 Human genes 0.000 description 2
- 229950010718 MOPIDAMOL Drugs 0.000 description 2
- 102100000250 MPL Human genes 0.000 description 2
- 108060005135 MPL Proteins 0.000 description 2
- 108010000318 MRE11 Homologue Protein Proteins 0.000 description 2
- 102000002251 MRE11 Homologue Protein Human genes 0.000 description 2
- 102100013820 MSH2 Human genes 0.000 description 2
- 101700083509 MSH2 Proteins 0.000 description 2
- 229910015837 MSH2 Inorganic materials 0.000 description 2
- 102100002001 MSH6 Human genes 0.000 description 2
- 101700030163 MSH6 Proteins 0.000 description 2
- 102100013322 MTOR Human genes 0.000 description 2
- 101700036611 MTOR Proteins 0.000 description 2
- 102100003827 MUTYH Human genes 0.000 description 2
- 101700053678 MUTYH Proteins 0.000 description 2
- 101700048958 MYCL Proteins 0.000 description 2
- 102100018883 MYCL Human genes 0.000 description 2
- 241000712898 Machupo mammarenavirus Species 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- MQXVYODZCMMZEM-ZYUZMQFOSA-N Mannomustine Chemical compound ClCCNC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CNCCCl MQXVYODZCMMZEM-ZYUZMQFOSA-N 0.000 description 2
- 206010026798 Mantle cell lymphomas Diseases 0.000 description 2
- 208000000932 Marburg Virus Disease Diseases 0.000 description 2
- 201000011013 Marburg hemorrhagic fever Diseases 0.000 description 2
- 229960004961 Mechlorethamine Drugs 0.000 description 2
- 208000007054 Medullary Carcinoma Diseases 0.000 description 2
- 208000000172 Medulloblastoma Diseases 0.000 description 2
- SGDBTWWWUNNDEQ-LBPRGKRZSA-N Melphalan Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N(CCCl)CCCl)C=C1 SGDBTWWWUNNDEQ-LBPRGKRZSA-N 0.000 description 2
- 108010090306 Member 2 Subfamily G ATP Binding Cassette Transporter Proteins 0.000 description 2
- 206010027191 Meningioma Diseases 0.000 description 2
- IVDYZAAPOLNZKG-KWHRADDSSA-N Mepitiostane Chemical compound O([C@@H]1[C@]2(CC[C@@H]3[C@@]4(C)C[C@H]5S[C@H]5C[C@@H]4CC[C@H]3[C@@H]2CC1)C)C1(OC)CCCC1 IVDYZAAPOLNZKG-KWHRADDSSA-N 0.000 description 2
- 229950009246 Mepitiostane Drugs 0.000 description 2
- 206010027406 Mesothelioma Diseases 0.000 description 2
- 108020004999 Messenger RNA Proteins 0.000 description 2
- 108010030837 Methylenetetrahydrofolate Reductase (NADPH2) Proteins 0.000 description 2
- 108010050345 Microphthalmia-Associated Transcription Factor Proteins 0.000 description 2
- 241001480037 Microsporum Species 0.000 description 2
- VFKZTMPDYBFSTM-KVTDHHQDSA-N Mitobronitol Chemical compound BrC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CBr VFKZTMPDYBFSTM-KVTDHHQDSA-N 0.000 description 2
- 229960003539 Mitoguazone Drugs 0.000 description 2
- 229960004857 Mitomycin Drugs 0.000 description 2
- 229960000350 Mitotane Drugs 0.000 description 2
- 229910021380 MnCl2 Inorganic materials 0.000 description 2
- 241000725171 Mokola lyssavirus Species 0.000 description 2
- 208000005871 Monkeypox Diseases 0.000 description 2
- 241000711513 Mononegavirales Species 0.000 description 2
- 210000000214 Mouth Anatomy 0.000 description 2
- 241000761989 Mucoromycotina Species 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 108010026664 MutL Protein Homolog 1 Proteins 0.000 description 2
- 241000186359 Mycobacterium Species 0.000 description 2
- 229960000951 Mycophenolic Acid Drugs 0.000 description 2
- 206010028537 Myelofibrosis Diseases 0.000 description 2
- 208000010125 Myocardial Infarction Diseases 0.000 description 2
- 206010068871 Myotonic dystrophy Diseases 0.000 description 2
- 208000001611 Myxosarcoma Diseases 0.000 description 2
- YMVWGSQGCWCDGW-UHFFFAOYSA-N N',N'-dimethyl-N-(1-nitroacridin-9-yl)propane-1,3-diamine Chemical compound C1=CC([N+]([O-])=O)=C2C(NCCCN(C)C)=C(C=CC=C3)C3=NC2=C1 YMVWGSQGCWCDGW-UHFFFAOYSA-N 0.000 description 2
- NJSMWLQOCQIOPE-OCHFTUDZSA-N N-[(E)-[10-[(E)-(4,5-dihydro-1H-imidazol-2-ylhydrazinylidene)methyl]anthracen-9-yl]methylideneamino]-4,5-dihydro-1H-imidazol-2-amine Chemical compound N1CCN=C1N\N=C\C(C1=CC=CC=C11)=C(C=CC=C2)C2=C1\C=N\NC1=NCCN1 NJSMWLQOCQIOPE-OCHFTUDZSA-N 0.000 description 2
- 102100008929 NELL2 Human genes 0.000 description 2
- 102100010499 NF2 Human genes 0.000 description 2
- 101700071070 NF2 Proteins 0.000 description 2
- 102100018287 NKX2-1 Human genes 0.000 description 2
- 101710012901 NKX2-1 Proteins 0.000 description 2
- 102100012131 NOTCH1 Human genes 0.000 description 2
- 101710036042 NOTCH1 Proteins 0.000 description 2
- 102100020079 NPM1 Human genes 0.000 description 2
- 101710026364 NPM1 Proteins 0.000 description 2
- 102100001119 NRAS Human genes 0.000 description 2
- 101710033916 NRAS Proteins 0.000 description 2
- 102100016102 NTRK1 Human genes 0.000 description 2
- 101700043017 NTRK1 Proteins 0.000 description 2
- 102100016105 NTRK3 Human genes 0.000 description 2
- 229940086322 Navelbine Drugs 0.000 description 2
- QZGIWPZCWHMVQL-UIYAJPBUSA-N Neocarzinostatin Chemical compound O1[C@H](C)[C@H](O)[C@H](O)[C@@H](NC)[C@H]1O[C@@H]1C/2=C/C#C[C@H]3O[C@@]3([C@@H]3OC(=O)OC3)C#CC\2=C[C@H]1OC(=O)C1=C(O)C=CC2=C(C)C=C(OC)C=C12 QZGIWPZCWHMVQL-UIYAJPBUSA-N 0.000 description 2
- 206010029260 Neuroblastoma Diseases 0.000 description 2
- 208000003019 Neurofibromatosis 1 Diseases 0.000 description 2
- 108090000770 Neuropilin-2 Proteins 0.000 description 2
- XWXYUMMDTVBTOU-UHFFFAOYSA-N Nilutamide Chemical compound O=C1C(C)(C)NC(=O)N1C1=CC=C([N+]([O-])=O)C(C(F)(F)F)=C1 XWXYUMMDTVBTOU-UHFFFAOYSA-N 0.000 description 2
- VFEDRRNHLBGPNN-UHFFFAOYSA-N Nimustine Chemical compound CC1=NC=C(CNC(=O)N(CCCl)N=O)C(N)=N1 VFEDRRNHLBGPNN-UHFFFAOYSA-N 0.000 description 2
- 241000526636 Nipah henipavirus Species 0.000 description 2
- DLGOEMSEDOSKAD-UHFFFAOYSA-N Nitrumon Chemical compound ClCCNC(=O)N(N=O)CCCl DLGOEMSEDOSKAD-UHFFFAOYSA-N 0.000 description 2
- 208000002154 Non-Small-Cell Lung Carcinoma Diseases 0.000 description 2
- 241000714209 Norwalk virus Species 0.000 description 2
- 101710037310 OOP Proteins 0.000 description 2
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 2
- 241000725177 Omsk hemorrhagic fever virus Species 0.000 description 2
- 229950011093 Onapristone Drugs 0.000 description 2
- 241000150452 Orthohantavirus Species 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 241000283898 Ovis Species 0.000 description 2
- 101710037784 P2RY12 Proteins 0.000 description 2
- 102100008310 PAK3 Human genes 0.000 description 2
- 101700057981 PAX5 Proteins 0.000 description 2
- 102100018849 PAX5 Human genes 0.000 description 2
- 101710018349 PDGFRA Proteins 0.000 description 2
- 102100004940 PDGFRA Human genes 0.000 description 2
- 102100004939 PDGFRB Human genes 0.000 description 2
- 229950003180 PEPLOMYCIN Drugs 0.000 description 2
- 102100014818 PIK3R1 Human genes 0.000 description 2
- 101710039899 PIK3R1 Proteins 0.000 description 2
- 102100005877 PKHD1 Human genes 0.000 description 2
- 101700007671 PKHD1 Proteins 0.000 description 2
- 101700074329 PLCG1 Proteins 0.000 description 2
- 102100011802 PLCG1 Human genes 0.000 description 2
- 101710039569 POLM Proteins 0.000 description 2
- 102100016976 PRKD1 Human genes 0.000 description 2
- 102100003225 PRKDC Human genes 0.000 description 2
- 101700061400 PRKDC Proteins 0.000 description 2
- 101710018405 PTPN11 Proteins 0.000 description 2
- 102100017818 PTPN11 Human genes 0.000 description 2
- 102100005502 PTPRD Human genes 0.000 description 2
- 101700079089 PTPRD Proteins 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N PUROMYCIN Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 229950010131 PUROMYCIN Drugs 0.000 description 2
- 229960001592 Paclitaxel Drugs 0.000 description 2
- 208000008443 Pancreatic Carcinoma Diseases 0.000 description 2
- 208000004019 Papillary Adenocarcinoma Diseases 0.000 description 2
- 241001631646 Papillomaviridae Species 0.000 description 2
- 241000711504 Paramyxoviridae Species 0.000 description 2
- 108010065129 Patched-1 Receptor Proteins 0.000 description 2
- 102000012850 Patched-1 Receptor Human genes 0.000 description 2
- 229960002340 Pentostatin Drugs 0.000 description 2
- FPVKHBSQESCIEP-JQCXWYLXSA-N Pentostatin Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC[C@H]2O)=C2N=C1 FPVKHBSQESCIEP-JQCXWYLXSA-N 0.000 description 2
- 108010057150 Peplomycin Proteins 0.000 description 2
- 210000001428 Peripheral Nervous System Anatomy 0.000 description 2
- 206010034699 Peroneal muscular atrophy Diseases 0.000 description 2
- 210000003800 Pharynx Anatomy 0.000 description 2
- 229960005190 Phenylalanine Drugs 0.000 description 2
- 108091000081 Phosphotransferases Proteins 0.000 description 2
- KASDHRXLYQOAKZ-XDSKOBMDSA-N Pimecrolimus Chemical compound C/C([C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@]2(O)O[C@@H]([C@H](C[C@H]2C)OC)[C@@H](OC)C[C@@H](C)C/C(C)=C/[C@H](C(C[C@H](O)[C@H]1C)=O)CC)=C\[C@@H]1CC[C@H](Cl)[C@H](OC)C1 KASDHRXLYQOAKZ-XDSKOBMDSA-N 0.000 description 2
- 208000007641 Pinealoma Diseases 0.000 description 2
- NJBFOOCLYDNZJN-UHFFFAOYSA-N Pipobroman Chemical compound BrCCC(=O)N1CCN(C(=O)CCBr)CC1 NJBFOOCLYDNZJN-UHFFFAOYSA-N 0.000 description 2
- 206010035148 Plague Diseases 0.000 description 2
- 108010051742 Platelet-Derived Growth Factor beta Receptor Proteins 0.000 description 2
- 241000233870 Pneumocystis Species 0.000 description 2
- 208000008696 Polycythemia Vera Diseases 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- 201000010769 Prader-Willi syndrome Diseases 0.000 description 2
- HFVNWDWLWUCIHC-GUPDPFMOSA-N Prednimustine Chemical compound O=C([C@@]1(O)CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)[C@@H](O)C[C@@]21C)COC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 HFVNWDWLWUCIHC-GUPDPFMOSA-N 0.000 description 2
- 208000003476 Primary Myelofibrosis Diseases 0.000 description 2
- CPTBDICYNRMXFX-UHFFFAOYSA-N Procarbazine Chemical compound CNNCC1=CC=C(C(=O)NC(C)C)C=C1 CPTBDICYNRMXFX-UHFFFAOYSA-N 0.000 description 2
- 206010060862 Prostate cancer Diseases 0.000 description 2
- 108060006633 Protein Kinases Proteins 0.000 description 2
- 241000125945 Protoparvovirus Species 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 108010014270 Purinergic P2Y12 Receptors Proteins 0.000 description 2
- 102100016115 RAF1 Human genes 0.000 description 2
- 101700007719 RAF1 Proteins 0.000 description 2
- 102100013019 RARA Human genes 0.000 description 2
- 101700078798 RARA Proteins 0.000 description 2
- 102100006051 RET Human genes 0.000 description 2
- 101700001630 RET Proteins 0.000 description 2
- 108090000740 RNA-Binding Protein EWS Proteins 0.000 description 2
- 102000004229 RNA-Binding Protein EWS Human genes 0.000 description 2
- 102000020497 RNA-Binding Proteins Human genes 0.000 description 2
- 108091022184 RNA-Binding Proteins Proteins 0.000 description 2
- 101710007825 RNASE3 Proteins 0.000 description 2
- 102100012618 RPTOR Human genes 0.000 description 2
- 102100013038 RUNX1 Human genes 0.000 description 2
- 101700025439 RUNX1 Proteins 0.000 description 2
- 241000711798 Rabies lyssavirus Species 0.000 description 2
- AHHFEZNOXOZZQA-ZEBDFXRSSA-N Ranimustine Chemical compound CO[C@H]1O[C@H](CNC(=O)N(CCCl)N=O)[C@@H](O)[C@H](O)[C@H]1O AHHFEZNOXOZZQA-ZEBDFXRSSA-N 0.000 description 2
- 108010061204 Rapamycin-Insensitive Companion of mTOR Protein Proteins 0.000 description 2
- 102000012007 Rapamycin-Insensitive Companion of mTOR Protein Human genes 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108010029031 Regulatory-Associated Protein of mTOR Proteins 0.000 description 2
- 208000006265 Renal Cell Carcinoma Diseases 0.000 description 2
- 206010038389 Renal cancer Diseases 0.000 description 2
- 229920000970 Repeated sequence (DNA) Polymers 0.000 description 2
- 108050002653 Retinoblastoma Protein Proteins 0.000 description 2
- 241000713124 Rift Valley fever virus Species 0.000 description 2
- 229950004892 Rodorubicin Drugs 0.000 description 2
- 241000702670 Rotavirus Species 0.000 description 2
- 241000315672 SARS coronavirus Species 0.000 description 2
- 229950001403 SIZOFIRAN Drugs 0.000 description 2
- 101710027453 SLC19A1 Proteins 0.000 description 2
- 102100017015 SLC22A2 Human genes 0.000 description 2
- 108091006650 SLC22A2 Proteins 0.000 description 2
- 108091006645 SLCO1B3 Proteins 0.000 description 2
- 102100007112 SLCO1B3 Human genes 0.000 description 2
- 102100017669 SMAD2 Human genes 0.000 description 2
- 101700012842 SMAD2 Proteins 0.000 description 2
- 102100017668 SMAD3 Human genes 0.000 description 2
- 101700079713 SMAD3 Proteins 0.000 description 2
- 102100017680 SMAD4 Human genes 0.000 description 2
- 101700062085 SMAD4 Proteins 0.000 description 2
- 101710025703 SMARCA4 Proteins 0.000 description 2
- 102100019447 SMARCA4 Human genes 0.000 description 2
- 108010076630 SMARCB1 Protein Proteins 0.000 description 2
- 102000011740 SMARCB1 Protein Human genes 0.000 description 2
- 101700021542 SMO Proteins 0.000 description 2
- 102100015930 SMOX Human genes 0.000 description 2
- 101700076839 SMOX Proteins 0.000 description 2
- 102100004239 SOD2 Human genes 0.000 description 2
- 101700070970 SOX10 Proteins 0.000 description 2
- 102100008830 SOX10 Human genes 0.000 description 2
- 101700006931 SOX2 Proteins 0.000 description 2
- 102100018829 SOX2 Human genes 0.000 description 2
- 102000001332 SRC Human genes 0.000 description 2
- 101710009384 SRC Proteins 0.000 description 2
- 102100019330 STK11 Human genes 0.000 description 2
- 101700065463 STK11 Proteins 0.000 description 2
- 241000736032 Sabia <angiosperm> Species 0.000 description 2
- 206010061934 Salivary gland cancer Diseases 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 206010039447 Salmonellosis Diseases 0.000 description 2
- 206010039491 Sarcoma Diseases 0.000 description 2
- 210000001732 Sebaceous Glands Anatomy 0.000 description 2
- 241000607768 Shigella Species 0.000 description 2
- 206010040550 Shigella infection Diseases 0.000 description 2
- 208000007056 Sickle Cell Anemia Diseases 0.000 description 2
- 241000700584 Simplexvirus Species 0.000 description 2
- QFJCIRLUMZQUOT-HPLJOQBZSA-N Sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 2
- 229920000519 Sizofiran Polymers 0.000 description 2
- 208000000587 Small Cell Lung Carcinoma Diseases 0.000 description 2
- 206010041067 Small cell lung cancer Diseases 0.000 description 2
- 206010054184 Small intestine carcinoma Diseases 0.000 description 2
- 208000001203 Smallpox Diseases 0.000 description 2
- 229950006315 Spirogermanium Drugs 0.000 description 2
- 241001149962 Sporothrix Species 0.000 description 2
- 241000713675 Spumavirus Species 0.000 description 2
- 206010041823 Squamous cell carcinoma Diseases 0.000 description 2
- 241000191940 Staphylococcus Species 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- 229960001052 Streptozocin Drugs 0.000 description 2
- ZSJLQEPLLKMAKR-GKHCUFPYSA-N Streptozotocin Chemical compound O=NN(C)C(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O ZSJLQEPLLKMAKR-GKHCUFPYSA-N 0.000 description 2
- 208000006011 Stroke Diseases 0.000 description 2
- 241000282887 Suidae Species 0.000 description 2
- 206010042863 Synovial sarcoma Diseases 0.000 description 2
- 229940014598 TAC Drugs 0.000 description 2
- RCINICONZNJXQF-XAZOAEDWSA-N TAXOL® Chemical compound O([C@@H]1[C@@]2(CC(C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3(C21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-XAZOAEDWSA-N 0.000 description 2
- 102100017503 TBX22 Human genes 0.000 description 2
- 101700077983 TBX22 Proteins 0.000 description 2
- 101700048164 TET2 Proteins 0.000 description 2
- 102100003998 TET2 Human genes 0.000 description 2
- 102100003994 TGFBR2 Human genes 0.000 description 2
- 102100011827 TMPRSS2 Human genes 0.000 description 2
- 101710044694 TMPRSS2 Proteins 0.000 description 2
- 102100001029 TOP1 Human genes 0.000 description 2
- 101700042008 TOP1 Proteins 0.000 description 2
- 101700001571 TOP1A Proteins 0.000 description 2
- 101700021738 TOP1B Proteins 0.000 description 2
- 229950000212 TRIOXIFENE Drugs 0.000 description 2
- 102100008210 TSC2 Human genes 0.000 description 2
- 101700083014 TSC2 Proteins 0.000 description 2
- 229960001967 Tacrolimus Drugs 0.000 description 2
- QJJXYPPXXYFBGM-LFZNUXCKSA-N Tacrolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1\C=C(/C)[C@@H]1[C@H](C)[C@@H](O)CC(=O)[C@H](CC=C)/C=C(C)/C[C@H](C)C[C@H](OC)[C@H]([C@H](C[C@H]2C)OC)O[C@@]2(O)C(=O)C(=O)N2CCCC[C@H]2C(=O)O1 QJJXYPPXXYFBGM-LFZNUXCKSA-N 0.000 description 2
- 229960001603 Tamoxifen Drugs 0.000 description 2
- NKANXQFJJICGDU-QPLCGJKRSA-N Tamoxifen Chemical compound C=1C=CC=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 NKANXQFJJICGDU-QPLCGJKRSA-N 0.000 description 2
- NAVMQTYZDKMPEU-UHFFFAOYSA-N Targretin Chemical compound CC1=CC(C(CCC2(C)C)(C)C)=C2C=C1C(=C)C1=CC=C(C(O)=O)C=C1 NAVMQTYZDKMPEU-UHFFFAOYSA-N 0.000 description 2
- 201000008902 Tay-Sachs disease Diseases 0.000 description 2
- 229960001278 Teniposide Drugs 0.000 description 2
- KLBSRSYHIIAQTG-PZICIZFRSA-N Tenuazonic acid Chemical compound CC[C@H](C)[C@@H]1NC(=O)C(C(C)=O)C1=O KLBSRSYHIIAQTG-PZICIZFRSA-N 0.000 description 2
- 206010057644 Testis cancer Diseases 0.000 description 2
- 229960005353 Testolactone Drugs 0.000 description 2
- 206010043391 Thalassaemia beta Diseases 0.000 description 2
- 102000004377 Thiopurine S-methyltransferases Human genes 0.000 description 2
- 108090000958 Thiopurine S-methyltransferases Proteins 0.000 description 2
- 102000005497 Thymidylate Synthase Human genes 0.000 description 2
- 108010067449 Thymidylate Synthase Proteins 0.000 description 2
- 229950011457 Tiamiprine Drugs 0.000 description 2
- 208000004006 Tick-Borne Encephalitis Diseases 0.000 description 2
- 241000130764 Tinea Species 0.000 description 2
- 208000002474 Tinea Diseases 0.000 description 2
- 229940024982 Topical Antifungal Antibiotics Drugs 0.000 description 2
- 206010057589 Total hypoxanthine-guanine phosphoribosyl transferase deficiency Diseases 0.000 description 2
- 108010082684 Transforming Growth Factor-beta Type II Receptor Proteins 0.000 description 2
- 206010044390 Transient ischaemic attack Diseases 0.000 description 2
- 208000005765 Traumatic Brain Injury Diseases 0.000 description 2
- 241000589886 Treponema Species 0.000 description 2
- 229950007229 Tresperimus Drugs 0.000 description 2
- 229950001353 Tretamine Drugs 0.000 description 2
- 229960004560 Triaziquone Drugs 0.000 description 2
- PXSOHRWMIRDKMP-UHFFFAOYSA-N Triaziquone Chemical compound O=C1C(N2CC2)=C(N2CC2)C(=O)C=C1N1CC1 PXSOHRWMIRDKMP-UHFFFAOYSA-N 0.000 description 2
- 241000223238 Trichophyton Species 0.000 description 2
- NOYPYLRCIDNJJB-UHFFFAOYSA-N Trimetrexate Chemical compound COC1=C(OC)C(OC)=CC(NCC=2C(=C3C(N)=NC(N)=NC3=CC=2)C)=C1 NOYPYLRCIDNJJB-UHFFFAOYSA-N 0.000 description 2
- 229960001099 Trimetrexate Drugs 0.000 description 2
- 208000000625 Triple X syndrome Diseases 0.000 description 2
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Tris Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 208000006284 Trisomy 13 Syndrome Diseases 0.000 description 2
- 206010044688 Trisomy 21 Diseases 0.000 description 2
- 206010053871 Trisomy 8 Diseases 0.000 description 2
- UMKFEPPTGMDVMI-UHFFFAOYSA-N Trofosfamide Chemical compound ClCCN(CCCl)P1(=O)OCCCN1CCCl UMKFEPPTGMDVMI-UHFFFAOYSA-N 0.000 description 2
- HDZZVAMISRMYHH-LITAXDCLSA-N Tubercidin Chemical compound C1=CC=2C(N)=NC=NC=2N1[C@@H]1O[C@@H](CO)[C@H](O)[C@H]1O HDZZVAMISRMYHH-LITAXDCLSA-N 0.000 description 2
- 208000009999 Tuberous Sclerosis Diseases 0.000 description 2
- 108010007089 Tuberous Sclerosis Complex 2 Protein Proteins 0.000 description 2
- 102000007581 Tuberous Sclerosis Complex 2 Protein Human genes 0.000 description 2
- 206010045181 Turner's syndrome Diseases 0.000 description 2
- 108010081291 Type 1 Fibroblast Growth Factor Receptor Proteins 0.000 description 2
- 108010081268 Type 2 Fibroblast Growth Factor Receptor Proteins 0.000 description 2
- 206010045261 Type IIa hyperlipidaemia Diseases 0.000 description 2
- NMUSYJAQQFHJEW-KVTDHHQDSA-N U-18,496 Chemical compound O=C1N=C(N)N=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NMUSYJAQQFHJEW-KVTDHHQDSA-N 0.000 description 2
- 229950009811 UBENIMEX Drugs 0.000 description 2
- 101700073311 USP9X Proteins 0.000 description 2
- 102100009329 USP9X Human genes 0.000 description 2
- IDPUKCWIGUEADI-UHFFFAOYSA-N Uramustine Chemical compound ClCCN(CCCl)C1=CNC(=O)NC1=O IDPUKCWIGUEADI-UHFFFAOYSA-N 0.000 description 2
- 206010046766 Uterine cancer Diseases 0.000 description 2
- 102100015255 VDR Human genes 0.000 description 2
- 101700055048 VDR Proteins 0.000 description 2
- 241000700647 Variola virus Species 0.000 description 2
- 108010053100 Vascular Endothelial Growth Factor Receptor-3 Proteins 0.000 description 2
- 230000037115 Vdr Effects 0.000 description 2
- 241000710959 Venezuelan equine encephalitis virus Species 0.000 description 2
- 201000009693 Venezuelan hemorrhagic fever Diseases 0.000 description 2
- 241000711970 Vesiculovirus Species 0.000 description 2
- 229960003048 Vinblastine Drugs 0.000 description 2
- HOFQVRTUGATRFI-XQKSVPLYSA-N Vinblastine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](OC(C)=O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(=O)OC)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1N=C1[C]2C=CC=C1 HOFQVRTUGATRFI-XQKSVPLYSA-N 0.000 description 2
- JXLYSJRDGCGARV-WWYNWVTFSA-N Vinblastine Natural products O=C(O[C@H]1[C@](O)(C(=O)OC)[C@@H]2N(C)c3c(cc(c(OC)c3)[C@]3(C(=O)OC)c4[nH]c5c(c4CCN4C[C@](O)(CC)C[C@H](C3)C4)cccc5)[C@@]32[C@H]2[C@@]1(CC)C=CCN2CC3)C JXLYSJRDGCGARV-WWYNWVTFSA-N 0.000 description 2
- 229960004528 Vincristine Drugs 0.000 description 2
- 229960004355 Vindesine Drugs 0.000 description 2
- UGGWPQSBPIFKDZ-KOTLKJBCSA-N Vindesine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(N)=O)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1N=C1[C]2C=CC=C1 UGGWPQSBPIFKDZ-KOTLKJBCSA-N 0.000 description 2
- 108090000779 Vitamin K Epoxide Reductases Proteins 0.000 description 2
- 102000004210 Vitamin K Epoxide Reductases Human genes 0.000 description 2
- 241000710886 West Nile virus Species 0.000 description 2
- 241000710951 Western equine encephalitis virus Species 0.000 description 2
- 208000008383 Wilms Tumor Diseases 0.000 description 2
- RLQVKDVIBJCQGE-WDUCDQOOSA-N Win-25540 Chemical compound O=C1C(C#N)C[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CC[C@@]32O[C@@H]31 RLQVKDVIBJCQGE-WDUCDQOOSA-N 0.000 description 2
- 208000006254 Wolf-Hirschhorn Syndrome Diseases 0.000 description 2
- 206010056894 XYY syndrome Diseases 0.000 description 2
- 208000002622 XYY syndrome 47 Diseases 0.000 description 2
- 229940053867 Xeloda Drugs 0.000 description 2
- 108010041009 Xeroderma Pigmentosum Group D Protein Proteins 0.000 description 2
- AOCCBINRVIKJHY-UHFFFAOYSA-N Yamafur Chemical compound CCCCCCNC(=O)N1C=C(F)C(=O)NC1=O AOCCBINRVIKJHY-UHFFFAOYSA-N 0.000 description 2
- 241000607479 Yersinia pestis Species 0.000 description 2
- FBTUMDXHSRTGRV-ALTNURHMSA-N Zorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(\C)=N\NC(=O)C=1C=CC=CC=1)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 FBTUMDXHSRTGRV-ALTNURHMSA-N 0.000 description 2
- QIMGFXOHTOXMQP-GFAGFCTOSA-N [(2R,3S,4S,5R,6R)-2-[(2R,3S,4S,5S,6S)-2-[(1R,2S)-2-[[6-amino-2-[(1S)-3-amino-1-[[(2S)-2,3-diamino-3-oxopropyl]amino]-3-oxopropyl]-5-methylpyrimidine-4-carbonyl]amino]-3-[[(2R,3S,4S)-3-hydroxy-5-[[(2S,3R)-3-hydroxy-1-oxo-1-[2-[4-[4-[3-[[(1S)-1-phenylethyl] Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCCN[C@@H](C)C=1C=CC=CC=1)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1NC=NC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C QIMGFXOHTOXMQP-GFAGFCTOSA-N 0.000 description 2
- SPJCRMJCFSJKDE-ZWBUGVOYSA-N [(3S,8S,9S,10R,13R,14S,17R)-10,13-dimethyl-17-[(2R)-6-methylheptan-2-yl]-2,3,4,7,8,9,11,12,14,15,16,17-dodecahydro-1H-cyclopenta[a]phenanthren-3-yl] 2-[4-[bis(2-chloroethyl)amino]phenyl]acetate Chemical compound O([C@@H]1CC2=CC[C@H]3[C@@H]4CC[C@@H]([C@]4(CC[C@@H]3[C@@]2(C)CC1)C)[C@H](C)CCCC(C)C)C(=O)CC1=CC=C(N(CCCl)CCCl)C=C1 SPJCRMJCFSJKDE-ZWBUGVOYSA-N 0.000 description 2
- IFJUINDAXYAPTO-UUBSBJJBSA-N [(8R,9S,13S,14S,17S)-17-[2-[4-[4-[bis(2-chloroethyl)amino]phenyl]butanoyloxy]acetyl]oxy-13-methyl-6,7,8,9,11,12,14,15,16,17-decahydrocyclopenta[a]phenanthren-3-yl] benzoate Chemical compound C([C@@H]1[C@@H](C2=CC=3)CC[C@]4([C@H]1CC[C@@H]4OC(=O)COC(=O)CCCC=1C=CC(=CC=1)N(CCCl)CCCl)C)CC2=CC=3OC(=O)C1=CC=CC=C1 IFJUINDAXYAPTO-UUBSBJJBSA-N 0.000 description 2
- IHGLINDYFMDHJG-UHFFFAOYSA-N [2-(4-methoxyphenyl)-3,4-dihydronaphthalen-1-yl]-[4-(2-pyrrolidin-1-ylethoxy)phenyl]methanone Chemical compound C1=CC(OC)=CC=C1C(CCC1=CC=CC=C11)=C1C(=O)C(C=C1)=CC=C1OCCN1CCCC1 IHGLINDYFMDHJG-UHFFFAOYSA-N 0.000 description 2
- XZSRRNFBEIOBDA-CFNBKWCHSA-N [2-[(2S,4S)-4-[(2R,4S,5S,6S)-4-amino-5-hydroxy-6-methyloxan-2-yl]oxy-2,5,12-trihydroxy-7-methoxy-6,11-dioxo-3,4-dihydro-1H-tetracen-2-yl]-2-oxoethyl] 2,2-diethoxyacetate Chemical compound O([C@H]1C[C@](CC2=C(O)C=3C(=O)C4=CC=CC(OC)=C4C(=O)C=3C(O)=C21)(O)C(=O)COC(=O)C(OCC)OCC)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 XZSRRNFBEIOBDA-CFNBKWCHSA-N 0.000 description 2
- LVBMFPUTQOHXQE-UHFFFAOYSA-N [2-[6-(diaminomethylideneamino)hexylamino]-2-oxoethyl] N-[4-(3-aminopropylamino)butyl]carbamate Chemical compound NCCCNCCCCNC(=O)OCC(=O)NCCCCCCN=C(N)N LVBMFPUTQOHXQE-UHFFFAOYSA-N 0.000 description 2
- NUKCGLDCWQXYOQ-UHFFFAOYSA-N [3-[4-(3-methylsulfonyloxypropanoyl)piperazin-1-yl]-3-oxopropyl] methanesulfonate Chemical compound CS(=O)(=O)OCCC(=O)N1CCN(C(=O)CCOS(C)(=O)=O)CC1 NUKCGLDCWQXYOQ-UHFFFAOYSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 239000012082 adaptor molecule Substances 0.000 description 2
- 201000005188 adrenal gland cancer Diseases 0.000 description 2
- 201000011452 adrenoleukodystrophy Diseases 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N aldehydo-D-arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- 229960001445 alitretinoin Drugs 0.000 description 2
- 150000008052 alkyl sulfonates Chemical class 0.000 description 2
- 239000002168 alkylating agent Substances 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 229960000473 altretamine Drugs 0.000 description 2
- 229960002749 aminolevulinic acid Drugs 0.000 description 2
- ZGXJTSGNIOSYLO-UHFFFAOYSA-N aminolevulinic acid Chemical compound NCC(=O)CCC(O)=O ZGXJTSGNIOSYLO-UHFFFAOYSA-N 0.000 description 2
- BBDAGFIXKZCXAH-CCXZUQQUSA-N ancitabine Chemical compound N=C1C=CN2[C@@H]3O[C@H](CO)[C@@H](O)[C@@H]3OC2=N1 BBDAGFIXKZCXAH-CCXZUQQUSA-N 0.000 description 2
- 229950000242 ancitabine Drugs 0.000 description 2
- 239000003098 androgen Substances 0.000 description 2
- 201000003076 angiosarcoma Diseases 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 230000002280 anti-androgenic Effects 0.000 description 2
- 230000001833 anti-estrogenic Effects 0.000 description 2
- 230000003388 anti-hormone Effects 0.000 description 2
- 230000000340 anti-metabolite Effects 0.000 description 2
- 230000000692 anti-sense Effects 0.000 description 2
- 239000000051 antiandrogen Substances 0.000 description 2
- 239000002256 antimetabolite Substances 0.000 description 2
- 229940045687 antimetabolites Folic acid analogs Drugs 0.000 description 2
- 230000001640 apoptogenic Effects 0.000 description 2
- 101700017456 asd-1 Proteins 0.000 description 2
- 201000001320 atherosclerosis Diseases 0.000 description 2
- NOWKCMXCCJGMRR-UHFFFAOYSA-N aziridine Chemical class C1CN1 NOWKCMXCCJGMRR-UHFFFAOYSA-N 0.000 description 2
- 150000001541 aziridines Chemical class 0.000 description 2
- 230000001580 bacterial Effects 0.000 description 2
- 229960002938 bexarotene Drugs 0.000 description 2
- 229960000997 bicalutamide Drugs 0.000 description 2
- LKJPYSCBVHEWIU-KRWDZBQOSA-N bicalutamide Chemical compound C([C@@](O)(C)C(=O)NC=1C=C(C(C#N)=CC=1)C(F)(F)F)S(=O)(=O)C1=CC=C(F)C=C1 LKJPYSCBVHEWIU-KRWDZBQOSA-N 0.000 description 2
- 201000007180 bile duct carcinoma Diseases 0.000 description 2
- 201000009036 biliary tract cancer Diseases 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 230000003115 biocidal Effects 0.000 description 2
- 230000031018 biological processes and functions Effects 0.000 description 2
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 229950008548 bisantrene Drugs 0.000 description 2
- 201000001531 bladder carcinoma Diseases 0.000 description 2
- 239000010839 body fluid Substances 0.000 description 2
- MPBVHIBUJCELCL-UHFFFAOYSA-N bondronat Chemical compound CCCCCN(C)CCC(O)(P(O)(O)=O)P(O)(O)=O MPBVHIBUJCELCL-UHFFFAOYSA-N 0.000 description 2
- 201000000220 brain stem cancer Diseases 0.000 description 2
- 201000005200 bronchus cancer Diseases 0.000 description 2
- 229960002092 busulfan Drugs 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 229960002115 carboquone Drugs 0.000 description 2
- 231100000504 carcinogenesis Toxicity 0.000 description 2
- 229960003261 carmofur Drugs 0.000 description 2
- 229960005243 carmustine Drugs 0.000 description 2
- 108010047060 carzinophilin Proteins 0.000 description 2
- 150000001768 cations Chemical class 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 201000007455 central nervous system cancer Diseases 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000000973 chemotherapeutic Effects 0.000 description 2
- 229960001480 chlorozotocin Drugs 0.000 description 2
- 201000009047 chordoma Diseases 0.000 description 2
- 229960001265 ciclosporin Drugs 0.000 description 2
- 229920002080 circulating cell-free DNA Polymers 0.000 description 2
- 229960004316 cisplatin Drugs 0.000 description 2
- 201000010989 colorectal carcinoma Diseases 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000002596 correlated Effects 0.000 description 2
- 230000001808 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 201000003883 cystic fibrosis Diseases 0.000 description 2
- 229960003901 dacarbazine Drugs 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 230000002939 deleterious Effects 0.000 description 2
- 229960005052 demecolcine Drugs 0.000 description 2
- 201000002949 dengue disease Diseases 0.000 description 2
- 229960002923 denileukin diftitox Drugs 0.000 description 2
- 108010017271 denileukin diftitox Proteins 0.000 description 2
- 230000037304 dermatophytes Effects 0.000 description 2
- 230000001627 detrimental Effects 0.000 description 2
- 238000002405 diagnostic procedure Methods 0.000 description 2
- 238000000502 dialysis Methods 0.000 description 2
- 229950002389 diaziquone Drugs 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 2
- 239000003534 dna topoisomerase inhibitor Substances 0.000 description 2
- ZWAOHEXOSAUJHY-ZIYNGMLESA-N doxifluridine Chemical compound O[C@@H]1[C@H](O)[C@@H](C)O[C@H]1N1C(=O)NC(=O)C(F)=C1 ZWAOHEXOSAUJHY-ZIYNGMLESA-N 0.000 description 2
- 229950005454 doxifluridine Drugs 0.000 description 2
- 229940017743 dromostanolone propionate Drugs 0.000 description 2
- 229950004683 drostanolone propionate Drugs 0.000 description 2
- 230000036267 drug metabolism Effects 0.000 description 2
- 230000005014 ectopic expression Effects 0.000 description 2
- 229950007539 elliptinium Drugs 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 201000009051 embryonal carcinoma Diseases 0.000 description 2
- 230000002616 endonucleolytic Effects 0.000 description 2
- 102000017256 epidermal growth factor-activated receptor activity proteins Human genes 0.000 description 2
- 108040009258 epidermal growth factor-activated receptor activity proteins Proteins 0.000 description 2
- OBMLHUPNRURLOK-XGRAFVIBSA-N epitiostanol Chemical compound C1[C@@H]2S[C@@H]2C[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CC[C@H]21 OBMLHUPNRURLOK-XGRAFVIBSA-N 0.000 description 2
- 201000004101 esophageal cancer Diseases 0.000 description 2
- 239000000328 estrogen antagonist Substances 0.000 description 2
- 108091007910 estrogen receptors beta Proteins 0.000 description 2
- QSRLNKCNOLVZIR-KRWDZBQOSA-N ethyl (2S)-2-[[2-[4-[bis(2-chloroethyl)amino]phenyl]acetyl]amino]-4-methylsulfanylbutanoate Chemical compound CCOC(=O)[C@H](CCSC)NC(=O)CC1=CC=C(N(CCCl)CCCl)C=C1 QSRLNKCNOLVZIR-KRWDZBQOSA-N 0.000 description 2
- WVYXNIXAMZOZFK-UHFFFAOYSA-N ethyl N-[2,5-bis(aziridin-1-yl)-4-(ethoxycarbonylamino)-3,6-dioxocyclohexa-1,4-dien-1-yl]carbamate Chemical compound O=C1C(NC(=O)OCC)=C(N2CC2)C(=O)C(NC(=O)OCC)=C1N1CC1 WVYXNIXAMZOZFK-UHFFFAOYSA-N 0.000 description 2
- JOYRKODLDBILNP-UHFFFAOYSA-N ethyl urethane Chemical compound CCOC(N)=O JOYRKODLDBILNP-UHFFFAOYSA-N 0.000 description 2
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 2
- 230000001036 exonucleolytic Effects 0.000 description 2
- 201000003542 factor VIII deficiency Diseases 0.000 description 2
- 201000008808 fibrosarcoma Diseases 0.000 description 2
- 229960000390 fludarabine Drugs 0.000 description 2
- GIUYCYHIANZCFB-FJFJXFQQSA-N fludarabine phosphate Chemical compound C1=NC=2C(N)=NC(F)=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O GIUYCYHIANZCFB-FJFJXFQQSA-N 0.000 description 2
- 229960002074 flutamide Drugs 0.000 description 2
- 235000019152 folic acid Nutrition 0.000 description 2
- 239000011724 folic acid Substances 0.000 description 2
- 201000003444 follicular lymphoma Diseases 0.000 description 2
- 229960004783 fotemustine Drugs 0.000 description 2
- YAKWPXVTIGTRJH-UHFFFAOYSA-N fotemustine Chemical compound CCOP(=O)(OCC)C(C)NC(=O)N(CCCl)N=O YAKWPXVTIGTRJH-UHFFFAOYSA-N 0.000 description 2
- 238000007710 freezing Methods 0.000 description 2
- 230000002538 fungal Effects 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037240 fusion proteins Human genes 0.000 description 2
- 229940044658 gallium nitrate Drugs 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 229960005277 gemcitabine Drugs 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 201000010536 head and neck cancer Diseases 0.000 description 2
- 201000010238 heart disease Diseases 0.000 description 2
- 201000002222 hemangioblastoma Diseases 0.000 description 2
- 230000002489 hematologic Effects 0.000 description 2
- 201000000907 hemorrhagic fever with renal syndrome Diseases 0.000 description 2
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229960001330 hydroxycarbamide Drugs 0.000 description 2
- VSNHCAURESNICA-UHFFFAOYSA-N hydroxyurea Chemical compound NC(=O)NO VSNHCAURESNICA-UHFFFAOYSA-N 0.000 description 2
- 229940027318 hydroxyurea Drugs 0.000 description 2
- 229960002751 imiquimod Drugs 0.000 description 2
- 238000003364 immunohistochemistry Methods 0.000 description 2
- 229950008097 improsulfan Drugs 0.000 description 2
- 230000000415 inactivating Effects 0.000 description 2
- 201000003803 inflammatory myofibroblastic tumor Diseases 0.000 description 2
- 200000000003 influenza A Diseases 0.000 description 2
- 200000000004 influenza B Diseases 0.000 description 2
- 108010019691 inhibin beta A subunit Proteins 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 229940079866 intestinal antibiotics Drugs 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 201000010982 kidney cancer Diseases 0.000 description 2
- 229960004577 laquinimod Drugs 0.000 description 2
- 101700064822 lcb1 Proteins 0.000 description 2
- 229940115286 lentinan Drugs 0.000 description 2
- GFIJNRVAKGFPGQ-LIJARHBVSA-N leuprolide Chemical compound CCNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)CC1=CC=C(O)C=C1 GFIJNRVAKGFPGQ-LIJARHBVSA-N 0.000 description 2
- 201000007270 liver cancer Diseases 0.000 description 2
- 229960002247 lomustine Drugs 0.000 description 2
- 229960003538 lonidamine Drugs 0.000 description 2
- 239000011565 manganese chloride Substances 0.000 description 2
- 229950008612 mannomustine Drugs 0.000 description 2
- 201000001441 melanoma Diseases 0.000 description 2
- 229960001924 melphalan Drugs 0.000 description 2
- 230000003340 mental Effects 0.000 description 2
- 229920002106 messenger RNA Polymers 0.000 description 2
- VJRAUFKOOPNFIQ-TVEKBUMESA-N methyl (1R,2R,4S)-4-[(2R,4S,5S,6S)-5-[(2S,4S,5S,6S)-5-[(2S,4S,5S,6S)-4,5-dihydroxy-6-methyloxan-2-yl]oxy-4-hydroxy-6-methyloxan-2-yl]oxy-4-(dimethylamino)-6-methyloxan-2-yl]oxy-2-ethyl-2,5,7,10-tetrahydroxy-6,11-dioxo-3,4-dihydro-1H-tetracene-1-carboxylat Chemical compound O([C@H]1[C@@H](O)C[C@@H](O[C@H]1C)O[C@H]1[C@H](C[C@@H](O[C@H]1C)O[C@H]1C[C@]([C@@H](C2=CC=3C(=O)C4=C(O)C=CC(O)=C4C(=O)C=3C(O)=C21)C(=O)OC)(O)CC)N(C)C)[C@H]1C[C@H](O)[C@H](O)[C@H](C)O1 VJRAUFKOOPNFIQ-TVEKBUMESA-N 0.000 description 2
- 229960005485 mitobronitol Drugs 0.000 description 2
- MXWHMTNPTTVWDM-NXOFHUPFSA-N mitoguazone Chemical compound NC(N)=N\N=C(/C)\C=N\N=C(N)N MXWHMTNPTTVWDM-NXOFHUPFSA-N 0.000 description 2
- 229950010913 mitolactol Drugs 0.000 description 2
- ZAHQPTJLOCWVPG-UHFFFAOYSA-N mitoxantrone dihydrochloride Chemical compound Cl.Cl.O=C1C2=C(O)C=CC(O)=C2C(=O)C2=C1C(NCCNCCO)=CC=C2NCCNCCO ZAHQPTJLOCWVPG-UHFFFAOYSA-N 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000006011 modification reaction Methods 0.000 description 2
- 108010066419 multidrug resistance-associated protein 2 Proteins 0.000 description 2
- HPNSFSBZBAHARI-RUDMXATFSA-N mycophenolic acid Chemical compound OC1=C(C\C=C(/C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-RUDMXATFSA-N 0.000 description 2
- 201000000050 myeloid neoplasm Diseases 0.000 description 2
- 230000001338 necrotic Effects 0.000 description 2
- 230000001613 neoplastic Effects 0.000 description 2
- 201000008026 nephroblastoma Diseases 0.000 description 2
- 201000011682 nervous system cancer Diseases 0.000 description 2
- 230000000955 neuroendocrine Effects 0.000 description 2
- 229960002653 nilutamide Drugs 0.000 description 2
- 229960001420 nimustine Drugs 0.000 description 2
- 229950008607 nitracrine Drugs 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 229950009266 nogalamycin Drugs 0.000 description 2
- MAZYQGHSTXUZJF-ZBRHGPMOSA-N nogalamycin Chemical compound CO[C@@H]1[C@@](O)(C)[C@@H](OC)[C@H](C)O[C@H]1O[C@@H]1C2=C(O)C(C(=O)C3=C(O)C=C4[C@]5(C)C[C@H](C[C@@H](O5)CC4=C3C3=O)N(C)C)=C3C=C2[C@@H](C(=O)OC)[C@@](C)(O)C1 MAZYQGHSTXUZJF-ZBRHGPMOSA-N 0.000 description 2
- 238000009376 nuclear reprocessing Methods 0.000 description 2
- 238000001821 nucleic acid purification Methods 0.000 description 2
- 201000010133 oligodendroglioma Diseases 0.000 description 2
- 231100000590 oncogenic Toxicity 0.000 description 2
- 230000002246 oncogenic Effects 0.000 description 2
- 229940005935 ophthalmologic Antibiotics Drugs 0.000 description 2
- 201000003738 orofaciodigital syndrome VIII Diseases 0.000 description 2
- 230000002611 ovarian Effects 0.000 description 2
- 230000003647 oxidation Effects 0.000 description 2
- 238000007254 oxidation reaction Methods 0.000 description 2
- 201000002528 pancreatic cancer Diseases 0.000 description 2
- 201000010198 papillary carcinoma Diseases 0.000 description 2
- 230000036961 partial Effects 0.000 description 2
- 201000011252 phenylketonuria Diseases 0.000 description 2
- 229960005330 pimecrolimus Drugs 0.000 description 2
- 201000004123 pineal gland cancer Diseases 0.000 description 2
- 229960000952 pipobroman Drugs 0.000 description 2
- 229950001100 piposulfan Drugs 0.000 description 2
- 229960001221 pirarubicin Drugs 0.000 description 2
- 150000003057 platinum Chemical class 0.000 description 2
- 229910052697 platinum Inorganic materials 0.000 description 2
- 108010059054 polycystic kidney disease 1 protein Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 229960004694 prednimustine Drugs 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 229960000624 procarbazine Drugs 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 230000004853 protein function Effects 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 108090000623 proteins and genes Proteins 0.000 description 2
- 230000002685 pulmonary Effects 0.000 description 2
- 150000003212 purines Chemical class 0.000 description 2
- 150000003230 pyrimidines Chemical class 0.000 description 2
- 238000003908 quality control method Methods 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 229960002185 ranimustine Drugs 0.000 description 2
- 229960000460 razoxane Drugs 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000003252 repetitive Effects 0.000 description 2
- 108091007521 restriction endonucleases Proteins 0.000 description 2
- 201000000582 retinoblastoma Diseases 0.000 description 2
- 150000004508 retinoic acid derivatives Chemical class 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 2
- 102220227244 rs368435578 Human genes 0.000 description 2
- 201000010208 seminoma Diseases 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 201000003176 severe acute respiratory syndrome Diseases 0.000 description 2
- 229960002930 sirolimus Drugs 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 210000000130 stem cell Anatomy 0.000 description 2
- 108010045815 superoxide dismutase 2 Proteins 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 201000010965 sweat gland carcinoma Diseases 0.000 description 2
- 201000008736 systemic mastocytosis Diseases 0.000 description 2
- 229930003347 taxol Natural products 0.000 description 2
- 201000003120 testicular cancer Diseases 0.000 description 2
- BPEWUONYVDABNZ-DZBHQSCQSA-N testolactone Chemical compound O=C1C=C[C@]2(C)[C@H]3CC[C@](C)(OC(=O)CC4)[C@@H]4[C@@H]3CCC2=C1 BPEWUONYVDABNZ-DZBHQSCQSA-N 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 201000010875 transient cerebral ischemia Diseases 0.000 description 2
- LXZZYRPGZAFOLE-UHFFFAOYSA-L transplatin Chemical compound [H][N]([H])([H])[Pt](Cl)(Cl)[N]([H])([H])[H] LXZZYRPGZAFOLE-UHFFFAOYSA-L 0.000 description 2
- IUCJMVBFZDHPDX-UHFFFAOYSA-N tretamine Chemical compound C1CN1C1=NC(N2CC2)=NC(N2CC2)=N1 IUCJMVBFZDHPDX-UHFFFAOYSA-N 0.000 description 2
- 229960001670 trilostane Drugs 0.000 description 2
- 108010064892 trkC Receptor Proteins 0.000 description 2
- 229960000875 trofosfamide Drugs 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 229960001055 uracil mustard Drugs 0.000 description 2
- 201000005112 urinary bladder cancer Diseases 0.000 description 2
- 201000006266 variola major Diseases 0.000 description 2
- 201000000627 variola minor Diseases 0.000 description 2
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 2
- GBABOYUKABKIAF-IELIFDKJSA-N vinorelbine Chemical compound C1N(CC=2C3=CC=CC=C3NC=22)CC(CC)=C[C@H]1C[C@]2(C(=O)OC)C1=CC([C@]23[C@H]([C@@]([C@H](OC(C)=O)[C@]4(CC)C=CCN([C@H]34)CC2)(O)C(=O)OC)N2C)=C2C=C1OC GBABOYUKABKIAF-IELIFDKJSA-N 0.000 description 2
- CILBMBUYJCWATM-IJDPFCGHSA-N vinorelbine L-tartrate Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O.OC(=O)[C@H](O)[C@@H](O)C(O)=O.C1N(CC=2C3=CC=CC=C3NC=22)CC(CC)=C[C@H]1C[C@]2(C(=O)OC)C1=CC([C@]23[C@H]([C@]([C@H](OC(C)=O)[C@]4(CC)C=CCN([C@H]34)CC2)(O)C(=O)OC)N2C)=C2C=C1OC CILBMBUYJCWATM-IJDPFCGHSA-N 0.000 description 2
- 108050000156 vitamin D receptors Proteins 0.000 description 2
- 102000009310 vitamin D receptors Human genes 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 229950009268 zinostatin Drugs 0.000 description 2
- 229960000641 zorubicin Drugs 0.000 description 2
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- G06F19/20—
-
- G06F19/22—
-
- G06F19/28—
-
- G06F19/345—
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/10—Ploidy or copy number detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/10—Gene or protein expression profiling; Expression-ratio estimation or normalisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
Abstract
The invention provides a method for genetic analysis in individuals that reveals both the genetic sequences and chromosomal copy number of targeted and specific genomic loci in a single assay. The present invention further provides methods for the sensitive and specific detection of target gene sequences and gene expression profiles.
Description
- The Sequence Listing associated with this application is provided in text format in lieu of a paper copy, and is hereby incorporated by reference into the specification. The name of the text file containing the Sequence Listing is CLFK—002—00US_ST25.txt. The text file is 117 KB, was created on Aug. 22, 2014, and is being submitted electronically via EFS-Web.
- 1. Technical Field
- The invention relates generally to compositions and methods for the quantitative genetic analysis of cell free DNA (cfDNA). In particular, the present invention relates to improved targeted sequence capture compositions and methods for the genetic characterization and analysis of cfDNA.
- 2. Description of the Related Art
- It is becoming increasing clear that most, if not all, of the most common human cancers are diseases of the human genome. The emerging picture is that somatic mutations accumulate during an individual's lifetime, some of which increase the probability that the cell in which they are harbored can develop into a tumor (Vogelstein et al., Science 339(6127): 1546-1558 (2013)). With just the wrong combination of accumulated mutational events, a precancerous growth loses constraints that keep uncontrolled proliferation in check and the resulting cell mass becomes a cancer. The constellations of mutations that are necessary and sufficient to cause cancer are often collectively referred to as “driver mutations.” One of the themes that have emerged from recent and intensive molecular analysis is that cancer, once thought of as a single, tissue-specific disease, is in fact a group of related diseases, each with a unique molecular pathology. The human genome project laid the groundwork for genome-wide analysis of cancers.
- For example, the introduction of next-generation sequencing technologies (2004-present) has accelerated the discovery pace of causal genomic factors that underlie the diagnosis of NSCLC, making it clear that NSCLC is really a family of related diseases, each of which may be responsive to a different targeted therapy.
- The art lacks reliable and robust molecular analysis methods for the analysis of genetic diseases. Traditionally, molecular diagnostics have consisted of antibody-based tests (immunohistochemistry), in-situ hybridization with DNA probes (fluorescence in situ hybridization), and hybridization or PCR-based tests that query specific nucleotide sequences. Until recently, DNA sequencing as a molecular diagnostic tool has been generally limited to the coding exons of one or two genes. While DNA sequencing has been used in the diagnosis and treatment of solid cancers, one of the most significant drawbacks of these methods is that they require direct access to tumor tissues. Such material is often difficult to obtain from the initial biopsy used to diagnose the disease and virtually impossible to obtain in multiple repetitions over time. Similarly, biopsies are not possible in patients with inaccessible tumors and not practical in individuals suffering from metastatic disease.
- Thus, the vast potential of molecular diagnostics for genetic diseases; fetal testing; paternity testing; predicting response to drug treatment; diagnosing or monitoring a medical condition; mendelian disorders; genetic mosaicism; pathogen screening; microbiome profiling; and organ transplant monitoring; has yet to be realized. To date, existing molecular diagnostics approaches lack efficient solutions to clone and amplify individual DNA molecules, as well as solutions to efficiently target sequencing to specific genomic loci, with sensitivity sufficient to discriminate true positive test results from false positive signals that arise during sample processing.
- The invention relates generally to compositions and methods for improved compositions and methods for the genetic analysis of cfDNA.
- In various embodiments, a method for genetic analysis of cell-free DNA (cfDNA) is provided, comprising: treating cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA; ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate cfDNA library clones; determining the number of genome equivalents in the cfDNA clone library; and performing a quantitative genetic analysis of one or more target genetic loci in the cfDNA library clones.
- In a particular embodiment, the method further comprises isolating cfDNA from a biological sample of a subject.
- In an additional embodiment, the cfDNA is isolated from a biological sample selected from the group consisting of: amniotic fluid, blood, plasma, serum, semen, lymphatic fluid, cerebral spinal fluid, ocular fluid, urine, saliva, stool, mucous, and sweat.
- In a certain embodiment, the one or more adaptors comprise a plurality of adaptor species.
- In a particular embodiment, the one or more adaptors each comprise a primer binding site for amplification of the cfDNA library.
- In a further embodiment, the one or more adaptors each comprise one or more unique read codes.
- In an additional embodiment, the one or more adaptors each comprise one or more sample codes for sample multiplexing.
- In another embodiment, the one or more adaptors each comprise one or more sequences for DNA sequencing.
- In a particular embodiment, qPCR is performed on the cfDNA clone library and a qPCR measurement is compared to standards of known genome equivalents to determine the genome equivalents of the cfDNA clone library.
- In another particular embodiment, qPCR is performed with a primer that binds to an Alu sequence and a primer that binds to a sequence in an adaptor.
- In a certain embodiment, the quantitative genetic analysis is performed on a plurality of genetic loci in the cfDNA library clones.
- In a further embodiment, the quantitative genetic analysis is performed on a plurality of genetic loci in a plurality of cfDNA clone libraries.
- In an additional embodiment, the quantitative genetic analysis comprises hybridizing one or more capture probes to a target genetic locus to form capture probe-cfDNA clone complexes.
- In a particular embodiment, the quantitative genetic analysis comprises isolating the capture probe-cfDNA clone complexes.
- In a certain embodiment, the quantitative genetic analysis comprises amplification of the cfDNA clone sequence in the isolated hybridized capture probe-cfDNA clone complexes.
- In a further embodiment, the quantitative genetic analysis comprises DNA sequencing to generate a plurality of sequencing reads.
- In another embodiment, the quantitative genetic analysis comprises bioinformatic analysis of the plurality of sequencing reads.
- In a particular embodiment, bioinformatics analysis is used: to quantify the number of genome equivalents analyzed in the cfDNA clone library; to detect genetic variants in a target genetic locus; to detect mutations within a target genetic locus; to detect genetic fusions within a target genetic locus; and to measure copy number fluctuations within a target genetic locus.
- In an additional embodiment, the subject does not have a genetic disease.
- In a certain embodiment, the subject has not been diagnosed with a genetic disease.
- In another certain embodiment, the subject has been diagnosed with a genetic disease.
- In another embodiment, the quantitative genetic analysis is used to identify or detect one or more genetic lesions that cause or associated with the genetic disease.
- In a certain embodiment, the genetic lesion comprises a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion.
- In a particular embodiment, the genetic lesion comprises a genomic rearrangement that fuses the 3′ coding region of the ALK gene to another gene.
- In a particular embodiment, the 3′ coding region of the ALK gene is fused to the EML4 gene.
- In another embodiment, the genetic disease is cancer.
- In a further embodiment, the subject is pregnant.
- In an additional embodiment, the quantitative genetic analysis is used to identify or detect one or more genetic variants or genetic lesions of one or more target genetic loci in fetal cfDNA.
- In a particular embodiment, the subject is a transplant recipient.
- In an additional embodiment, the quantitative genetic analysis is used to identify or detect donor cfDNA in the subject.
- In various embodiments, a method of predicting, diagnosing, or monitoring a genetic disease in a subject is provided, comprising: isolating or obtaining cfDNA from a biological sample of a subject; treating the cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA; ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate a cfDNA clone library; determining the number of genome equivalents in the cfDNA clone library; and performing a quantitative genetic analysis of one or more target genetic loci associated with the genetic disease in the cfDNA clone library, wherein the identification or detection of one or more genetic lesions in the one or more target genetic loci is prognostic for, diagnostic of, or monitors the progression of the genetic disease.
- In an additional embodiment, the cfDNA is isolated from a biological sample selected from the group consisting of: amniotic fluid, blood, plasma, serum, semen, lymphatic fluid, cerebral spinal fluid, ocular fluid, urine, saliva, stool, mucous, and sweat.
- In a certain embodiment, the genetic lesion comprises a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion.
- In a particular embodiment, the genetic lesion comprises a genomic rearrangement that fuses the 3′ coding region of the ALK gene to another gene.
- In a further embodiment, the 3′ coding region of the ALK gene is fused to the EML4 gene.
- In a particular embodiment, the genetic disease is cancer.
- In various embodiments, a companion diagnostic for a genetic disease is provided comprising: isolating or obtaining cfDNA from a biological sample of a subject; treating the cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA; ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate a cfDNA clone library; determining the number of genome equivalents in the cfDNA clone library; and performing a quantitative genetic analysis of one or more biomarkers associated with the genetic disease in the cfDNA clone library, wherein detection of, or failure to detect, at least one of the one or more biomarkers indicates whether the subject should be treated for the genetic disease.
- In a particular embodiment, the cfDNA is isolated from a biological sample selected from the group consisting of: amniotic fluid, blood, plasma, serum, semen, lymphatic fluid, cerebral spinal fluid, ocular fluid, urine, saliva, stool, mucous, and sweat.
- In an additional embodiment, the biomarker is a genetic lesion.
- In a particular embodiment, the genetic lesion comprises a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion.
- In an additional embodiment, the genetic lesion comprises a genomic rearrangement that fuses the 3′ coding region of the ALK gene to another gene.
- In a further embodiment, the 3′ coding region of the ALK gene is fused to the EML4 gene.
- In a certain embodiment, the genetic disease is cancer.
-
FIG. 1 shows the expected versus observed variant frequencies as a function of admix dilution in the absence of unique read filtering. In the absence of unique read filtering, random base changes at these four selected positions occurred with measurable, non-zero frequencies; thus, demonstrating a lack of sensitivity to detect the particular single nucleotide variants (SNV). -
FIG. 2 shows that unique read filtering performed on the data generated inFIG. 1 . The left hand panel shows the data fromFIG. 1 on the BRAF I326T SNV without unique read filtering. The right hand panel shows that using unique read filtering of the same data increased the assay sensitivity and allowed the discrimination of true signal from error-prone noise. -
FIG. 3 shows capture probe performance as a function of length and wash temperature. The y-axis shows the total number of reads associated with each capture probe. The bars in the bar chart are broken into two categories, where open bars correspond to on-target reads that align to the intended capture probe targets and solid bars show off-target reads that are associated with a capture probe but that map to unintended regions of the genome. Overall, the 40-mer and 60-mer capture probes perform substantially the same with 44° C. and 47° C. washes. With the 50° C. wash, the 40-mer capture probes perform erratically. These data validate the use of 40-mer capture probes at wash temperatures in the range of about 44° C. to about 47° C. -
FIG. 4 shows a schematic for the targeted and oriented sequencing of intron 19 of the ALK gene. A) In the “wild-type” reference sequence, antisense-oriented ALK capture probes identify sequences from intron 19. B) In the case of pathogenic fusion genes, some ALK capture probes will identify junction sequences associated with the gene fusion event. -
FIG. 5 shows a schematic for high density capture probe placement for complete sequencing of target regions. Each capture probe captures a collection of sequences that provide cumulative coverage at each base position. Here, coverage is represented by a line, and the amplitude of the line denotes depth of coverage derived from a particular capture probe. Overlapping coverage from adjacent capture probes provides complete sequencing of target regions in both possible directions. In addition, the head-to-head placement of opposite strand capture probes ensures that all capture probe binding sites are sequenced. -
FIG. 6 shows a representative example of the size distribution of fragmented DNA used in library construction. -
FIG. 7 shows the performance of high-density 40-mer capture probes in a representative experiment. The y-axis shows the total number of reads, which are broken out as on-target reads, off-target reads, and unmappable reads. The x-axis enumerates each of the 105 capture probes used in this experiment for sequence capture. -
FIG. 8 shows a representative example of the cumulative coverage of a target region using high density 40-mer capture probes. Shown here is the cumulative coverage of TP53 coding exons. -
FIG. 9A shows a representative example of the size distribution of cell-free DNA (cfDNA) libraries. The dominant band is consistent with a collection of 170±10 bp fragments ligated to 90 bp of adaptors.FIG. 9B shows a published gel image of cfDNA and a representative cfDNA library generated using the methods disclosed and/or contemplated herein. The qualitative “ladder” appearance is conserved in the library, but the library is shifted to higher mass by the addition of 90 bp of adaptor sequences.FIG. 9C shows a representative example of genomic, plasma-derived cfDNA libraries from Ovarian cancer patients (OvC) and “healthy donors” (HD). -
FIG. 10 shows the unique read counts across eight cfDNA libraries derived from four plasma samples. Fragmentation (frag) prior to library construction with this sample 23407 increased the library yield by more than two-fold. -
FIG. 11 shows the representative read coverage of cfDNA across a region of the TP53 gene. Twenty four 131 bp reads captured by the “TP53_NM—000546_chr17:7579351:region—3:280nt:41:80:r” capture probe (SEQ ID NO:201) were chosen at random and aligned using the BLAT algorithm within the UCSC genome browser. Twenty one reads map to the target region, and they do so in a pattern of overlapping coverage. The probe used to capture these reads is marked with an arrow. -
FIG. 12 shows an overview of targeted DNA sequencing of the coding regions of the TP53 gene from a cfDNA genomic library. The coverage (horizontal axis) extends across all 10 coding regions and includes intronic regions involved in mRNA splicing. The sequencing depth (vertical axis) reaches a maximum of 4851 and is uniform across all coding exons. -
FIG. 13 shows a plot of unique read counts versus qPCR estimated genome equivalents in an ACA2-based assay. qPCR measurements are shown on the X-axis versus read counts on the Y-axis. Perfect agreement between these measurements is shown as the diagonal. There is very poor, if any, correlation between measurements, especially at lower genomic inputs. These data show that the ACA2-based qPCR assay chronically underestimates library complexity and is inadequate for measuring genome equivalents. -
FIG. 14 shows a schematic of the core elements of a qPCR genome equivalent measurement assay that couples an genomic repeat specific primer (e.g., Alu) and a long adaptor-specific primer. (A) Standard library amplification using a single, 25 nt primer named ACA2 (primer 1). (B) Longer, 58 nt versions of the ACA2 primer (primer 2) do not amplify genomic libraries because of stem-loop suppression. (C) Forward and reverse primers directed to a consensus human Alu repeat element (primers 3 and 4) recognize 1000′s of loci and readily amplify genomic DNA. (D) A single Alu primer alone, either forward or reverse (primer 3 or primer 4), coupled with the long ACA2 primer (primer 2) do not amplify genomic DNA. (E) The same primer pair as in (D) readily amplifies genomic cfDNA library clones that contain Alu sequences. -
FIG. 15 shows proof-of-concept data for an Alu plus adaptor-based qPCR assay of genome equivalents. (A) Amplification of 10 pg of a standard genomic library with various PCR primers. The x-axis specifies PCR primers used for amplification and the Y-axis (log scale) indicates the PCR signal measured in units of fg/μL. The standard ACA2 primer produced a strong signal, as expected. The ACA2 long primer failed to produce signal owing to PCR suppression. The two Alu primer pairs both produced signal at 1% the amount of ACA2, suggesting that 1% of clones possess an amplifiable Alu sequence. The combination of any Alu primer with the long ACA2 primer also produced signal in ˜1% of clones. (B) Validation against 10 pg of genomic DNA (left four samples) or 10 pg of library DNA (right four clones). Alu primer pairs amplify comparable signal from genomic DNA or a genomic library. In contrast, primer pairs consisting of an Alu primer and a long ACA2 primer amplify genomic DNA poorly (L+A1F) or not at all (L+A1R). These same pairs exhibit amplification of library that slightly exceeds the signal from Alu primer pairs. -
FIG. 16 shows a direct comparison of ACA2 primer qPCR assay with the Alu-ACA2 long-primer qPCR assay. The Alu ACA2 long-primer qPCR assay shows an 8-fold increase in detectable genome equivalents, which is more consistent with unique read counts derived from sequencing data. -
FIG. 17 shows a representative example of adaptor structure and function for high sensitivity, quantitative genetic assays that provide accurate determinations of genome equivalents analyzed. (A) Fine structure of the adaptor ligation strand. Details relating to each numbered element are provided in Example 4. (B) The duplex formed between 45 nt ligation strands and 12 nt partner oligo strand creates a blunt-end ligation substrate compatible with end-repaired cfDNA fragments (solid bars). (C) Following ligation, the complement to the ligation strand is created by a DNA polymerase-mediated fill-in reaction. -
FIG. 18 shows a representative example of the size distribution of two DNA samples (NA06994 & NCI-H2228) processed to mimic cfDNA. -
FIG. 19 shows a representative example of the sensitivity of detection of the TP53 point mutation Q331* in tumor sample DNA (H2228) admixed with normal genomic DNA (N). The most sensitive detection corresponds to ˜1 mutant copy of TP53 among 1000 normal copies of the gene. -
FIG. 20 shows the precise determination of the junction sequence for the EML4-ALK fusion gene harbored in cell line NCI-H2228 using the compositions and methods contemplated herein. -
FIG. 21 shows the detection of the EML4-ALK fusion gene tumor sample DNA (H2228) admixed with normal genomic DNA (N). Because the fusion is present as a heterozygote in the NCI-H2228 cell line, the most sensitive detection corresponds to one gene fusion among ˜100 normal chromosomal copies of the ALK gene (50 genome equivalents). -
FIG. 22 shows the detection of the MYCN gene amplification in admixtures of cell line NCI-H69 (H69) diluted into normal human DNA (N). The threshold value of two normal diploid copies is shown as a dashed red line. -
FIG. 23 shows the DNA mutations detected in the TP53 gene of three different cancer patients. The canonical gene model is shown at the top of the figure. The peaks represent DNA sequence coverage (X-axis) and depth (Y-axis). Sequencing depth was >4000 genome equivalents for all sample analyzed. An expanded view of exon 7 below the gene model shows where all detected mutations were localized. The frequency of mutant detection in cfDNA (plasma), tumor tissue, and normal adjacent tissue is shown, where available (NA—not available). OVA1 and OVA2 are ovarian cancer patients; CRC406 and CRC407 are colorectal cancer patients. No mutations in TP53 were found in any of the OVA1 samples. -
FIG. 24 shows the DNA sequencing of a larger, thirteen gene panel (boxed). The sequencing identified a KRAS mutation in cfDNA and tumor from ovarian cancer patient OVA1. -
FIG. 25 shows the DNA sequencing of a larger, twelve gene panel. The sequencing identified an ERBB2 gene amplification in the plasma of colorectal cancer patient CRC407. - The present invention contemplates, in part, compositions and methods for the quantitative genetic analysis of the genetic state of an individual using cell-free DNA (cfDNA). As used herein, the term “genetic state” refers to the sequence of one or more target genome sequences in the genome in relation to a non-causal normal sequence or in relation to a sequence that is causal for a genetic condition or disease. In one embodiment, analyzing the genetic state refers to identifying, quantifying, or monitoring a genetic variant in a target genetic locus, wherein the variant varies with respect to a reference sequence (e.g., a normal or mutated sequence). The present inventors have provided solutions to the molecular diagnostic problems of genetic conditions or diseases associated with lack of sensitivity to discriminate true positives from false positives, inefficient cloning and amplification of individual DNA molecules, and inefficient targeted sequencing to specific genomic loci. The solutions contemplated herein comprise compositions and methods for reliable and robust quantitative genetic analysis with sensitivity sufficient to discriminate true positive test results from false positive signals that arise during sample processing.
- Next-generation sequencing technology has afforded the opportunity to add broad genomic surveys to molecular diagnosis in a variety of scenarios including cancers, fetal diagnostics, paternity testing, pathogen screening and organ transplant monitoring. In the context of genetic diseases, next-generation sequencing information is being used in a clinical setting to identify mutations within genes that are likely to alter gene function, to identify the gain or loss of genetic material within cells, and to identify genomic rearrangements that are not found in normal, healthy cells. The results of these broad diagnostic surveys are often used to guide patient treatment.
- However, the potential benefits of DNA sequencing in diagnosis and treatment of the genetic state of an individual or genetic conditions or diseases is outweighed by the need to directly access affected tissues to obtain samples. Such material is often difficult to obtain from the initial biopsy used to diagnose the disease and virtually impossible to obtain in multiple repetitions over time. Similarly, in cancer patients, biopsies are not possible in patients with inaccessible tumors and not practical in individuals suffering from metastatic disease. In contrast, the present inventors' approach derives from the fact that all tissues require access to the vasculature to survive and as a consequence these masses deposit DNA into bodily fluids. One major depot of bodily fluid in which the DNA of diseased cells is found is the plasma of human blood.
- In contrast to testing methods that rely on shallow, genome-wide sequence coverage, molecular diagnostics contemplated herein for the genetic state of an individual; genetic diseases; mendelian disorders; genetic mosaicism; fetal testing; paternity testing; predicting response to drug treatment; diagnosing or monitoring a medical condition; pathogen screening; microbiome profiling; and organ transplant monitoring leverage the availability of cfDNA to provide deep sequence coverage of select target genes. In addition, the cfDNA-based cancer diagnostics contemplated herein possess the ability to detect a variety of genetic changes including somatic sequence variations that alter protein function, large-scale chromosomal rearrangements that create chimeric gene fusions, and copy number variation that includes loss or gain of gene copies. Using the contemplated compositions and methods, these changes are detectable and quantifiable in the face of significant dilution, or admixture, of normal sequences within cfDNA that are contributed by the normal turnover processes that happen within healthy tissues. The compositions and methods contemplated herein also successfully address the major challenges associated with detecting rare genetic changes causal of disease; namely, that cfDNA is highly fragmented, that cfDNA levels vary substantially between different individuals, and that the degree of admixture of diseased versus normal sequences is highly variable among patients, even within individuals suffering from the same molecular disease and stage.
- In various embodiments, compositions and methods for genetic analysis of comprise interrogating the DNA fraction within biological fluid samples and stool samples. The methods contemplated herein provide a novel comprehensive framework address molecular genetic analysis using cfDNA available from a variety of biological sources. Cloning of purified cfDNA introduces tagged cfDNA sequences that inform downstream analysis and enable amplification of the resulting clone libraries. Hybrid capture with target specific oligonucleotides is used to retrieve specific sequences for subsequent analysis. Independent measurements of the number of genomes present in the library are applied to each sample, and these assays provide a means to estimate the assay's sensitivity. The assays contemplated herein provide reliable, reproducible, and robust methods for the analysis, detection, diagnosis, or monitoring of genetic states, conditions, or disease.
- The practice of particular embodiments of the invention will employ, unless indicated specifically to the contrary, conventional methods of chemistry, biochemistry, organic chemistry, molecular biology, microbiology, recombinant DNA techniques, genetics, immunology, and cell biology that are within the skill of the art, many of which are described below for the purpose of illustration. Such techniques are explained fully in the literature. See, e.g., Sambrook, et al., Molecular Cloning: A Laboratory Manual (3rd Edition, 2001); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al., Molecular Cloning: A Laboratory Manual (1982); Ausubel et al., Current Protocols in Molecular Biology (John Wiley and Sons, updated July 2008); Short Protocols in Molecular Biology: A Compendium of Methods from Current Protocols in Molecular Biology, Greene Pub. Associates and Wiley-Interscience; Glover, DNA Cloning: A Practical Approach, vol. I & II (IRL Press, Oxford, 1985); Anand, Techniques for the Analysis of Complex Genomes, (Academic Press, New York, 1992); Transcription and Translation (B. Hames & S. Higgins, Eds., 1984); Perbal, A Practical Guide to Molecular Cloning (1984); and Harlow and Lane, Antibodies, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1998).
- Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by those of ordinary skill in the art to which the invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, preferred embodiments of compositions, methods and materials are described herein. For the purposes of the present invention, the following terms are defined below.
- The articles “a,” “an,” and “the” are used herein to refer to one or to more than one (i.e. to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element.
- The use of the alternative (e.g., “or”) should be understood to mean either one, both, or any combination thereof of the alternatives.
- The term “and/or” should be understood to mean either one, or both of the alternatives.
- As used herein, the term “about” or “approximately” refers to a quantity, level, value, number, frequency, percentage, dimension, size, amount, weight or length that varies by as much as 15%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2% or 1% to a reference quantity, level, value, number, frequency, percentage, dimension, size, amount, weight or length. In one embodiment, the term “about” or “approximately” refers a range of quantity, level, value, number, frequency, percentage, dimension, size, amount, weight or length±15%, ±10%, ±9%, ±8%, ±7%, ±6%, ±5%, ±4%, ±3%, ±2%, or ±1% about a reference quantity, level, value, number, frequency, percentage, dimension, size, amount, weight or length.
- Throughout this specification, unless the context requires otherwise, the words “comprise”, “comprises” and “comprising” will be understood to imply the inclusion of a stated step or element or group of steps or elements but not the exclusion of any other step or element or group of steps or elements. In particular embodiments, the terms “include,” “has,” “contains,” and “comprise” are used synonymously.
- By “consisting of” is meant including, and limited to, whatever follows the phrase “consisting of.” Thus, the phrase “consisting of” indicates that the listed elements are required or mandatory, and that no other elements may be present.
- By “consisting essentially of” is meant including any elements listed after the phrase, and limited to other elements that do not interfere with or contribute to the activity or action specified in the disclosure for the listed elements. Thus, the phrase “consisting essentially of” indicates that the listed elements are required or mandatory, but that no other elements are optional and may or may not be present depending upon whether or not they affect the activity or action of the listed elements.
- Reference throughout this specification to “one embodiment,” “an embodiment,” “a particular embodiment,” “a related embodiment,” “a certain embodiment,” “an additional embodiment,” or “a further embodiment” or combinations thereof means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the foregoing phrases in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
- As used herein, the term “isolated” means material that is substantially or essentially free from components that normally accompany it in its native state. In particular embodiments, the term “obtained” or “derived” is used synonymously with isolated.
- As used herein, the term “DNA” refers to deoxyribonucleic acid. In various embodiments, the term DNA refers to genomic DNA, recombinant DNA, synthetic DNA, or cDNA. In one embodiment, DNA refers to genomic DNA or cDNA. In particular embodiments, the DNA comprises a “target region.” DNA libraries contemplated herein include genomic DNA libraries and cDNA libraries constructed from RNA, e.g., an RNA expression library. In various embodiments, the DNA libraries comprise one or more additional DNA sequences and/or tags.
- A “target genetic locus” or “DNA target region” refers to a region of interest within a DNA sequence. In various embodiments, targeted genetic analyses are performed on the target genetic locus. In particular embodiments, the DNA target region is a region of a gene that is associated with a particular genetic state, genetic condition, genetic diseases; fetal testing; genetic mosaicism, paternity testing; predicting response to drug treatment; diagnosing or monitoring a medical condition; microbiome profiling; pathogen screening; or organ transplant monitoring.
- As used herein, the terms “circulating DNA,” “circulating cell-free DNA” and “cell-free DNA” are often used interchangeably and refer to DNA that is extracellular DNA, DNA that has been extruded from cells, or DNA that has been released from necrotic or apoptotic cells.
- A “subject,” “individual,” or “patient” as used herein, includes any animal that exhibits a symptom of a condition that can be detected or identified with compositions contemplated herein. Suitable subjects include laboratory animals (such as mouse, rat, rabbit, or guinea pig), farm animals (such as horses, cows, sheep, pigs), and domestic animals or pets (such as a cat or dog). In particular embodiments, the subject is a mammal. In certain embodiments, the subject is a non-human primate and, in preferred embodiments, the subject is a human.
- In various embodiments, a method for genetic analysis of cfDNA is provided.
- In particular embodiments, a method for genetic analysis of cfDNA comprises: generating and amplifying a cfDNA library, determining the number of genome equivalents in the cfDNA library; and performing a quantitative genetic analysis of one or more genomic target loci.
- A method for genetic analysis of cfDNA comprises treating cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA and ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate cfDNA library clones; determining the number of genome equivalents of cfDNA library clones; and performing a quantitative genetic analysis of one or more target genetic loci in the cfDNA library clones.
- 1. Generating a cfDNA Library
- In particular embodiments, methods of genetic analysis contemplated herein comprise generating a cfDNA library comprising treating cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA and ligating one or more adaptors to each end of the end-repaired cfDNA to generate the cfDNA library.
- (a) Cell-Free DNA (cfDNA)
- The methods and compositions contemplated herein are designed to efficiently analyze, detect, diagnose, and/or monitor genetic states, genetic conditions, genetic diseases, genetic mosaicism, fetal diagnostics, paternity testing, microbiome profiling, pathogen screening, and organ transplant monitoring using cell-free DNA (cfDNA) as an analyte. The size distribution of cfDNA ranges from about 150 bp to about 180 bp fragments. Fragmentation may be the result of endonucleolytic and/or exonucleolytic activity and presents a formidable challenge to the accurate, reliable, and robust analysis of cfDNA. Another challenge for analyzing cfDNA is its short half-life in the blood stream, on the order of about 15 minutes. Without wishing to be bound to any particular theory, the present invention contemplates, in part, that analysis of cfDNA is like a “liquid biopsy” and is a real-time snapshot of current biological processes.
- Moreover, because cfDNA is not found within cells and may be obtained from a number of suitable sources including, but not limited to, biological fluids and stool samples, it is not subject to the existing limitations that plague next generation sequencing analysis, such as direct access to the tissues being analyzed.
- Illustrative examples of biological fluids that are suitable sources from which to isolate cfDNA in particular embodiments include, but are not limited to amniotic fluid, blood, plasma, serum, semen, lymphatic fluid, cerebral spinal fluid, ocular fluid, urine, saliva, mucous, and sweat.
- In particular embodiments, the biological fluid is blood or blood plasma.
- In certain embodiments, commercially available kits and other methods known to the skilled artisan can used to isolate cfDNA directly from the biological fluids of a patient or from a previously obtained and optionally stabilized biological sample, e.g., by freezing and/or addition of enzyme chelating agents including, but not limited to EDTA, EGTA, or other chelating agents specific for divalent cations.
- (b) Generating End-Repaired cfDNA
- In particular embodiments, generating a cfDNA library comprises the end-repair of isolated cfDNA. The fragmented cfDNA is processed by end-repair enzymes to generate end-repaired cfDNA with blunt ends, 5′-overhangs, or 3′-overhangs. In some embodiments, the end-repair enzymes can yield for example. In some embodiments, the end-repaired cfDNA contains blunt ends. In some embodiments, the end-repaired cfDNA is processed to contain blunt ends. In some embodiments, the blunt ends of the end-repaired cfDNA are further modified to contain a single base pair overhang. In some embodiments, end-repaired cfDNA containing blunt ends can be further processed to contain adenine (A)/thymine (T) overhang. In some embodiments, end-repaired cfDNA containing blunt ends can be further processed to contain adenine (A)/thymine (T) overhang as the single base pair overhang. In some embodiments, the end-repaired cfDNA has non-templated 3′ overhangs. In some embodiments, the end-repaired cfDNA is processed to contain 3′ overhangs. In some embodiments, the end-repaired cfDNA is processed with terminal transferase (TdT) to contain 3′ overhangs. In some embodiments, a G-tail can be added by TdT. In some embodiments, the end-repaired cfDNA is processed to contain overhang ends using partial digestion with any known restriction enzymes (e.g., with the enzyme Sau3A, and the like.
- (c) Ligating Adaptor Molecules to End-Repaired cfDNA
- In particular embodiments, generating a cfDNA library comprises ligating one or more adaptors to each end of the end-repaired cfDNA. The present invention contemplates, in part, an adaptor module designed to accommodate large numbers of genome equivalents in cfDNA libraries. Adaptor modules are configured to measure the number of genome equivalents present in cfDNA libraries, and, by extension, the sensitivity of sequencing assays used to identify sequence mutations.
- As used herein, the term “adaptor module” refers to a polynucleotide comprising at least five elements: (i) a first element comprising a PCR primer binding site for the single-primer library amplification; (ii) a second element comprising a 5 nucleotide read code that acts to uniquely identified each sequencing read; (iii) a third element comprising a 3 nucleotide sample code to identify different samples and enable sample multiplexing within a sequencing run; (iv) a fourth element comprising a 12 nucleotide anchor sequence that enables calibration of proper base calls in sequencing reads and acts as an anchor for hybridization to a partner oligonucleotide; and (v) a fifth element comprising the two 3′ terminal nucleotides of Element 4 (
FIG. 17 and Tables 12-16). The adaptor module is hybridized to a partner oligonucleotide that is complementary to Element 4 to form an adaptor suitable for ligating to the ends of cfDNA, optionally end-repaired blunt-ended cfDNA. - In particular embodiments, an adaptor module comprises one or more PCR primer sequences, one or more read codes, one or more sample codes, one or more anchor sequences, and two or more 3′ nucleotides that are efficient ligation substrates. In additional embodiments, the adaptor module further comprises one or more sequencing primer binding sites.
- In particular embodiments, an adaptor module comprises a first element that comprises one or more PCR primer binding sequences for single-primer amplification of a cfDNA library. In one embodiment, the PCR primer binding sequence is about 12 to about 40 nucleotides, about 18 to about 40 nucleotides, about 20 to about 35 nucleotides, or about 20 to about 30 nucleotides. In another embodiment, the PCR primer binding sequence is about 12 nucleotides, about 13 nucleotides, about 14 nucleotides, about 15 nucleotides, about 16 nucleotides, about 17 nucleotides, about 18 nucleotides, about 19 nucleotides, about 20 nucleotides, about 21 nucleotides, about 22 nucleotides, about 23 nucleotides, about 24 nucleotides, about 25 nucleotides, about 26 nucleotides, about 27 nucleotides, about 28 nucleotides, about 29 nucleotides, about 30 nucleotides, about 31 nucleotides, about 32 nucleotides, about 33 nucleotides, about 34 nucleotides, about 35 nucleotides, about 36 nucleotides, about 37 nucleotides, about 38 nucleotides, about 39 nucleotides, or about 40 nucleotides or more.
- In one embodiment, the PCR primer binding sequence is about 25 nucleotides.
- In particular embodiments, an adaptor module comprises a second element that comprises one or more read code sequences. As used herein, the term “read code” refers to a polynucleotide that is used to identify unique sequencing reads. In one embodiment, the read code is a random sequence of nucleotides. In one embodiment, the read code is about 1 nucleotide, about 2 nucleotides, about 3 nucleotides, about 4 nucleotides, about 5 nucleotides, about 6 nucleotides, about 7 nucleotides, about 8 nucleotides, about 9 nucleotides, about 10 nucleotides, or more.
- By way of a non-limiting example, a 5 nucleotide codes consists of 256 possible unique sequences where each code chosen is 2 nucleotides different from every other code in the set. This feature enables unique and distinct reads to be differentiated from reads that appear to be unique owing to a sequencing error in the code region. In particular embodiments, codes that have been empirically determined to interfere with adaptor function, owing to particular sequence combinations, may be excluded from use, e.g., seven codes of the 256 had an overrepresentation of G nucleotides and were excluded.
- In other embodiments, each read code of 5, 6, 7, 8, 9, 10 or more nucleotides may differ by 2, 3, 4, or 5 nucleotides from every other read code.
- In one embodiment, the read code is about 5 nucleotides and differs from every other read code by 2 nucleotides.
- In particular embodiments, an adaptor module comprises a third element that comprises one or more sample code sequences. As used herein, the term “sample code” refers to a polynucleotide that is used to identify the sample. The sample code is also useful in establishing multiplex sequencing reactions because each sample code is unique to the sample and thus, can be used to identify a read from a particular sample within a multiplexed sequencing reaction.
- In one embodiment, the sample code comprises sequence that is about 1, about 2 nucleotides, about 3 nucleotides, about 4 nucleotides, or about 5 nucleotides, or more. In another embodiment, each sample code of 2, 3, 4, 5 or more nucleotides may differ from every other sample code by 2, 3, 4, or 5 nucleotides.
- In one embodiment, the sample code is about three nucleotides and differs from every other sample code used in other samples by two nucleotides.
- In particular embodiments, an adaptor module comprises a fourth element that comprises one or more anchor sequences. As used herein, an “anchor sequence” refers to a nucleotide sequence of at least 8 nucleotides, at least 10 nucleotides, at least 12 nucleotides, at least 14 nucleotides, or at least 16 nucleotides that hybridizes to a partner oligonucleotide and that comprises the following three properties: (1) each anchor sequence is part of a family of four anchor sequences that collectively represent each of the four possible DNA bases at each site within extension; this feature, balanced base representation, is useful to calibrate proper base calling in sequencing reads in particular embodiments; (2) each anchor sequence is composed of only two of four possible bases, and these are specifically chosen to be either and equal number of A+C or an equal number of G+T; an anchor sequence formed from only two bases reduces the possibility that the anchor sequence will participate in secondary structure formation that would preclude proper adaptor function; and (3) because each anchor sequence is composed of equal numbers of A+C or G+T, each anchor sequence shares roughly the same melting temperature and duplex stability as every other anchor sequence in a set of four.
- In particular embodiments, an adaptor module comprises a fifth element that is comprised of the two 3′ terminal nucleotides of Element 4. These two bases at the 3′ end of each anchor are chosen based on an empirical determination that shows that these two nucleotides are efficient substrates for ligation to the cfDNA. In particular embodiments, Element 5 comprises the sequences selected from the group consisting of: AA, CC, TT and GG. In particular embodiments, Element 5 does not comprise the dinucleotide combination CG or TG as the inventors have determined that these combinations are not efficient ligation substrates.
- In particular embodiments, a ligation step comprises ligating an adaptor module to the end-repaired cfDNA to generate a “tagged” cfDNA library. In some embodiments, a single adaptor module is employed. In some embodiments, two, three, four or five adaptor modules are employed. In some embodiments, an adaptor module of identical sequence is ligated to each end of the fragmented end-repaired DNA.
- In one embodiment, a plurality of adaptor species is ligated to an end-repaired cfDNA library. Each of the plurality of adaptors may comprise one or more primer binding site for amplification of the cfDNA library, one or more read code sequences, one or more sequences for sample multiplexing, and one or more sequences for DNA sequencing.
- Ligation of one or more adaptors contemplated herein may be carried out by methods known to those of ordinary skill in the art. In particular embodiments, one or more adaptors contemplated herein are ligated to end-repaired cfDNA that comprises blunt ends. In certain embodiments, one or more adaptors contemplated herein are ligated to end-repaired cfDNA that comprises complementary ends appropriate for the ligation method employed. In certain embodiments, one or more adaptors contemplated herein are ligated to end-repaired cfDNA that comprises a 3′ overhang.
- 2. cfDNA Library Amplification
- In particular embodiments, methods of genetic analysis contemplated herein comprise amplification of a cfDNA library to generate a cfDNA clone library or a library of cfDNA clones. Each molecule of the cfDNA library comprises an adaptor ligated to each end of an end-repaired cfDNA, and each adaptor comprises one or more PCR primer binding sites. In one embodiment, different adaptors are ligated to different ends of the end-repaired cfDNA.
- In a preferred embodiment, the same adaptor is ligated to both ends of the cfDNA. Ligation of the same adaptor to both ends of end-repaired cfDNA allows for PCR amplification with a single primer sequence. In particular embodiments, a portion of the adaptor ligated-cfDNA library will be amplified using standard PCR techniques with a single primer sequence driving amplification. In one embodiment, the single primer sequence is about 25 nucleotides, optionally with a projected Tm of ≧55° C. under standard ionic strength conditions.
- In particular embodiments, picograms of the initial cfDNA library are amplified into micrograms of cfDNA clones, implying a 10,000-fold amplification. The amount of amplified product can be measured using methods known in the art, e.g., quantification on a Qubit 2.0 or Nanodrop instrument.
- 3. Determining the Number of Genome Equivalents
- In various embodiments, a method for genetic analysis of cfDNA comprises determining the number of genome equivalents in the cfDNA clone library. As used herein, the term “genome equivalent” refers to the number of genome copies in each library. An important challenge met by the compositions and methods contemplated herein is achieving sufficient assay sensitivity to detect and analysis rare genetic mutations or differences in genetic sequence. To determine assay sensitivity value on a sample-by-sample basis, the numbers of different and distinct sequences that are present in each sample are measured, by measuring the number of genome equivalents that are present in a sequencing library. To establish sensitivity, the number of genome equivalents must be measured for each sample library.
- The number of genome equivalents can be determined by qPCR assay or by using bioinformatics-based counting after sequencing is performed. In the process flow of clinical samples, qPCR measurement of genome equivalents is used as a QC step for cfDNA libraries. It establishes an expectation for assay sensitivity prior to sequence analysis and allows a sample to be excluded from analysis if its corresponding cfDNA clone library lacks the required depth of genome equivalents. Ultimately, the bioinformatics-based counting of genome equivalents is also used to identify the genome equivalents—and hence the assay sensitivity and false negative estimates—for each given cfDNA clone library.
- The empirical qPCR assay and statistical counting assays should be well correlated. In cases where sequencing fails to reveal the sequence depth in a cfDNA clone library, reprocessing of the cfDNA clone library and/or additional sequencing may be required.
- In one embodiment, the genome equivalents in a cfDNA clone library are determined using a quantitative PCR (qPCR) assay. In a particular embodiment, a standard library of known concentration is used to construct a standard curve and the measurements from the qPCR assay are fit to the resulting standard curve and a value for genome equivalents is derived from the fit. Surprisingly, the present inventors have discovered that a qPCR “repeat-based” assay comprising one primer that specifically hybridizes to a common sequence in the genome, e.g., a repeat sequence, and another primer that binds to the primer binding site in the adaptor, measured an 8-fold increase in genome equivalents compared to methods using just the adaptor specific primer (present on both ends of the cfDNA clone). The number of genome equivalents measured by the repeat-based assays provides a more consistent library-to-library performance and a better alignment between qPCR estimates of genome equivalents and bioinformatically counted tag equivalents in sequencing runs.
- Illustrative examples of repeats suitable for use in the repeat-based genome equivalent assays contemplated herein include, but not limited to: short interspersed nuclear elements (SINEs), e.g., Alu repeats; long interspersed nuclear elements (LINEs), e.g., LINE1, LINE2, LINE3; microsatellite repeat elements, e.g., short tandem repeats (STRs), simple sequence repeats (SSRs); and mammalian-wide interspersed repeats (MIRs).
- In one embodiment, the repeat is an Alu repeat.
- 4. Quantitative Genetic Analysis
- In various embodiments, a method for genetic analysis of cfDNA comprises quantitative genetic analysis of one or more target genetic loci of the cfDNA library clones. Quantitative genetic analysis comprises one or more of, or all of, the following steps: capturing cfDNA clones comprising a target genetic locus; amplification of the captured targeted genetic locus; sequencing of the amplified captured targeted genetic locus; and bioinformatic analysis of the resulting sequence reads.
- (a) Capture of Target Genetic Locus
- The present invention contemplates, in part, a capture probe module designed to retain the efficiency and reliability of larger probes but that minimizes uninformative sequence generation in a cfDNA clone library. A “capture probe module” refers to a polynucleotide that comprises a capture probe sequence and a tail sequence. In particular embodiments, the capture probe module sequence or a portion thereof serves as a primer binding site for one or more sequencing primers.
- In particular embodiments, a capture probe module comprises a capture probe. As used herein a “capture probe” refers to a region capable of hybridizing to a specific DNA target region. Because the average size of cfDNA is about 150 to about 170 bp and is highly fragmented the compositions and methods contemplated herein comprise the use of high density and relatively short capture probes to interrogate DNA target regions of interest.
- One particular concern with using high density capture probes is that generally capture probes are designed using specific “sequence rules.” For example, regions of redundant sequence or that exhibit extreme base composition biases are generally excluded in designing capture probes. However, the present inventors have discovered that the lack of flexibility in capture probe design rules does not substantially impact probe performance. In contrast, capture probes chosen strictly by positional constraint provided on-target sequence information; exhibit very little off-target and unmappable read capture; and yield uniform, useful, on-target reads with only few exceptions. Moreover, the high redundancy at close probe spacing more than compensates for occasional poor-performing capture probes.
- In particular embodiments, a target region is targeted by a plurality of capture probes, wherein any two or more capture probes are designed to bind to the target region within 10 nucleotides of each other, within 15 nucleotides of each other, within 20 nucleotides of each other, within 25 nucleotides of each other, within 30 nucleotides of each other, within 35 nucleotides of each other, within 40 nucleotides of each other, within 45 nucleotides of each other, or within 50 nucleotides or more of each other, as well as all intervening nucleotide lengths.
- In one embodiment, the capture probe is about 25 nucleotides, about 26 nucleotides, about 27 nucleotides, about 28 nucleotides, about 29 nucleotides, about 30 nucleotides, about 31 nucleotides, about 32 nucleotides, about 33 nucleotides, about 34 nucleotides, about 35 nucleotides, about 36 nucleotides, about 37 nucleotides, about 38 nucleotides, about 39 nucleotides, about 40 nucleotides, about 41 nucleotides, about 42 nucleotides, about 43 nucleotides, about 44 nucleotides, or about 45 nucleotides.
- In one embodiment, the capture probe is about 100 nucleotides, about 200 nucleotides, about 300 nucleotides, about 400 nucleotides, or about 100 nucleotides. In another embodiment, the capture probe is from about 100 nucleotides to about 500 nucleotides, about 200 nucleotides to about 500 nucleotides, about 300 nucleotides to about 500 nucleotides, or about 400 nucleotides to about 500 nucleotides, or any intervening range thereof.
- In a particular embodiment, the capture probe is not 60 nucleotides.
- In another embodiment, the capture probe is substantially smaller than 60 nucleotides but hybridizes comparably, as well as, or better than a 60 nucleotide capture probe targeting the same DNA target region.
- In a certain embodiment, the capture probe is 40 nucleotides.
- In certain embodiments, a capture probe module comprises a tail sequence. As used herein, the term “tail sequence” refers to a polynucleotide at the 5′ end of the capture probe module, which in particular embodiments can serve as a primer binding site. In particular embodiments, a sequencing primer binds to the primer binding site in the tail region.
- In particular embodiments, the tail sequence is about 5 to about 100 nucleotides, about 10 to about 100 nucleotides, about 5 to about 75 nucleotides, about 5 to about 50 nucleotides, about 5 to about 25 nucleotides, or about 5 to about 20 nucleotides. In certain embodiments, the third region is from about 10 to about 50 nucleotides, about 15 to about 40 nucleotides, about 20 to about 30 nucleotides or about 20 nucleotides, or any intervening number of nucleotides.
- In particular embodiments, the tail sequence is about 30 nucleotides, about 31 nucleotides, about 32 nucleotides, about 33 nucleotides, about 34 nucleotides, about 35 nucleotides, about 36 nucleotides, about 37 nucleotides, about 38 nucleotides, about 39 nucleotides, or about 40 nucleotides.
- In various embodiments, the capture probe module comprises a specific member of a binding pair to enable isolation and/or purification of one or more captured fragments of a tagged and or amplified cfDNA library that hybridizes to the capture probe. In particular embodiments, the capture probe module is conjugate to biotin or another suitable hapten, e.g., dinitrophenol, digoxigenin.
- In various embodiments, the capture probe module is hybridized to a tagged and optionally amplified cfDNA library to form a complex. In some embodiments, the multifunctional capture probe module substantially hybridizes to a specific genomic target region in the cfDNA library.
- Hybridization or hybridizing conditions can include any reaction conditions where two nucleotide sequences form a stable complex; for example, the tagged cfDNA library and capture probe module forming a stable tagged cfDNA library-capture probe module complex. Such reaction conditions are well known in the art and those of skill in the art will appreciated that such conditions can be modified as appropriate, e.g., decreased annealing temperatures with shorter length capture probes, and within the scope of the present invention. Substantial hybridization can occur when the second region of the capture probe complex exhibits 100%, 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92% 91%, 90%, 89%, 88%, 85%, 80%, 75%, or 70% sequence identity, homology or complementarity to a region of the tagged cfDNA library.
- In particular embodiments, the capture probe is about 40 nucleotides and has an optimal annealing temperature of about 44° C. o about 47° C.
- In certain embodiments, the methods contemplated herein comprise isolating a tagged cfDNA library-capture probe module complex. In particular embodiments, methods for isolating DNA complexes are well known to those skilled in the art and any methods deemed appropriate by one of skill in the art can be employed with the methods of the present invention (Ausubel et al., Current Protocols in Molecular Biology, 2007-2012). In particular embodiments, the complexes are isolated using biotin-streptavidin isolation techniques.
- In particular embodiments, removal of the single stranded 3′-ends from the isolated tagged cfDNA library-capture probe module complex is contemplated. In certain embodiments, the methods comprise 3′-5′ exonuclease enzymatic processing of the isolated tagged DNA library-multifunctional capture probe module complex to remove the single stranded 3′ ends.
- In certain other embodiments, the methods comprise performing 5′-3′ DNA polymerase extension of multifunctional capture probe utilizing the isolated tagged DNA library fragments as template.
- In certain other embodiments, the methods comprise creating a hybrid capture probe-isolated tagged cfDNA target molecule through the concerted action of a 5′ FLAP endonuclease, DNA polymerization and nick closure by a DNA ligase.
- A variety of enzymes can be employed for the 3′-5′ exonuclease enzymatic processing of the isolated tagged cfDNA library-multifunctional capture probe module complex. Illustrative examples of suitable enzymes, which exhibit 3′-5′ exonuclease enzymatic activity, that can be employed in particular embodiments include, but are not limited to: T4 or Exonucleases I, III, V (see also, Shevelev I V, Hübscher U., “The 3′ 5′ exonucleases,” Nat Rev Mol Cell Biol. 3(5):364-76 (2002)). In particular embodiments, the enzyme comprising 3′-5′ exonuclease activity is T4 polymerase. In particular embodiments, an enzyme which exhibits 3′-5′ exonuclease enzymatic activity and is capable of primer template extension can be employed, including for example T4 or Exonucleases I, III, V. Id.
- In some embodiments, the methods contemplated herein comprise performing sequencing and/or PCR on the 3′-5′ exonuclease enzymatically processed complex discussed supra and elsewhere herein. In particular embodiments, a tail portion of a capture probe molecule is copied in order to generate a hybrid nucleic acid molecule. In one embodiment, the hybrid nucleic acid molecule generated comprises the target region capable of hybridizing to the capture probe module and the complement of the capture probe module tail sequence.
- In a particular embodiment, genetic analysis comprises a) hybridizing one or more capture probe modules to one or more target genetic loci in a plurality of cfDNA library clones to form one or more capture probe module-cfDNA library clone complexes; b) isolating the one or more capture probe module-cfDNA library clone complexes from a); c) enzymatically processing the one or more isolated capture probe module-cfDNA library clone complexes from step b); d) performing PCR on the enzymatically processed complex from c) wherein the tail portion of the capture probe molecule is copied in order to generate amplified hybrid nucleic acid molecules, wherein the amplified hybrid nucleic acid molecules comprise a target sequence in the target genomic locus capable of hybridizing to the capture probe and the complement of the capture probe module tail sequence; and e) performing quantitative genetic analysis on the amplified hybrid nucleic acid molecules from d).
- In a particular embodiment, methods for determining copy number of a specific target genetic locus are contemplated comprising: a) hybridizing one or more capture probe modules to one or more target genetic loci in a plurality of cfDNA library clones to form one or more capture probe module-cfDNA library clone complexes; b) isolating the one or more capture probe module-cfDNA library clone complexes from a); c) enzymatically processing the one or more isolated capture probe module-cfDNA library clone complexes from step b); d) performing PCR on the enzymatically processed complex from c) wherein the tail portion of the capture probe molecule is copied in order to generate amplified hybrid nucleic acid molecules, wherein the amplified hybrid nucleic acid molecules comprise a target sequence in the target genetic locus capable of hybridizing to the capture probe and the complement of the capture probe module tail sequence; e) performing PCR amplification of the amplified hybrid nucleic acid molecules in d); and f) quantitating the PCR reaction in e), wherein the quantitation allows for a determination of copy number of the specific target region.
- In one embodiment, the enzymatic processing of step c) comprises performing 3′-5′ exonuclease enzymatic processing on the one or more capture probe module-cfDNA library clone complexes from b) using an enzyme with 3′-5′ exonuclease activity to remove the single stranded 3′ ends; creating one or more hybrid capture probe module-cfDNA library clone molecules through the concerted action of a 5′ FLAP endonuclease, DNA polymerization and nick closure by a DNA ligase; or performing 5′-3′ DNA polymerase extension of the capture probe using the isolated cfDNA clone in the complex as a template.
- In one embodiment, the enzymatic processing of step c) comprises performing 5′-3′ DNA polymerase extension of the capture probe using the isolated cfDNA clone in the complex as a template.
- In particular embodiments, PCR can be performed using any standard PCR reaction conditions well known to those of skill in the art. In certain embodiments, the PCR reaction in e) employs two PCR primers. In one embodiment, the PCR reaction in e) employs a first PCR primer that hybridizes to a repeat within the target genetic locus. In a particular embodiment, the PCR reaction in e) employs a second PCR primer that hybridizes to the hybrid nucleic acid molecules at the target genetic locus/tail junction. In certain embodiments, the PCR reaction in e) employs a first PCR primer that hybridizes to the target genetic locus and a second PCR primer hybridizes to the amplified hybrid nucleic acid molecules at the target genetic locus/tail junction. In particular embodiments, the second primer hybridizes to the target genetic locus/tail junction such that at least one or more nucleotides of the primer hybridize to the target genetic locus and at least one or more nucleotides of the primer hybridize to the tail sequence.
- In certain embodiments, the amplified hybrid nucleic acid molecules obtained from step e) are sequenced and the sequences aligned horizontally, i.e., aligned to one another but not aligned to a reference sequence. In particular embodiments, steps a) through e) are repeated one or more times with one or more capture probe modules. The capture probe modules can be the same or different and designed to target either cfDNA strand of a target genetic locus. In some embodiments, when the capture probes are different, they hybridize at overlapping or adjacent target sequences within a target genetic locus in the tagged cfDNA clone library. In one embodiment, a high density capture probe strategy is used wherein a plurality of capture probes hybridize to a target genetic locus, and wherein each of the plurality of capture probes hybridizes to the target genetic locus within about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, 200, bp of any other capture probe that hybridizes to the target genetic locus in a tagged cfDNA clone library, including all intervening distances.
- In some embodiments, the method can be performed using two capture probe modules per target genetic locus, wherein one hybridizes to the “Watson” strand (non-coding or template strand) upstream of the target region and one hybridizes to the “Crick” strand (coding or non-template strand) downstream of the target region.
- In particular embodiments, the methods contemplated herein can further be performed multiple times with any number of capture probe modules, for example 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more capture probe modules per target genetic locus any number of which hybridize to the Watson or Crick strand in any combination. In some embodiments, the sequences obtained can be aligned to one another in order to identify any of a number of differences.
- In certain embodiments, a plurality of target genetic loci are interrogated, e.g., 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 10000, 50000, 100000, 500000 or more in a single reaction, using one or more capture probe modules.
- (b) Sequencing
- In particular embodiments, the quantitative genetic analysis comprises sequencing a plurality of hybrid nucleic acid molecules, as discussed elsewhere herein, supra, to generate sufficient sequencing depths to obtain a plurality of unique sequencing reads. A unique read is defined as the single consensus read from a “family” of reads that all share the same read code and sequence start point within cfDNA. Each capture probe yields a set of unique reads that are computationally distilled from total reads by grouping into families. The unique reads for a given sample are then computed as the average of all the unique reads observed on a probe-by-probe basis. Cases where there is an obvious copy number change are excluded from the data set used to compute the average. Unique reads are important because each unique read must be derived from a unique cfDNA clone. Each unique read represents the input and analysis of a haploid equivalent of genomic DNA. The sum of unique reads is the sum of haploid genomes analyzed. The number of genomes analyzed, in turn, defines the sensitivity of the sequencing assay. By way of a non-limiting example, if the average unique read count is 100 genome equivalents, then that particular assay has a sensitivity of being able to detect one mutant read in 100, or 1%. Any observation less than this is not defensible.
- In particular embodiments, the quantitative genetic analysis comprises multiplex sequencing of hybrid nucleic acid molecules derived from a plurality of samples.
- In various embodiments, the quantitative genetic analysis comprises obtaining one or more or a plurality of tagged DNA library clones, each clone comprising a first DNA sequence and a second DNA sequence, wherein the first DNA sequence comprises a sequence in a targeted genetic locus and the second DNA sequence comprises a capture probe sequence; performing a paired end sequencing reaction on the one or more clones and obtaining one or more sequencing reads or performing a sequencing reaction on the one or more clones in which a single long sequencing read of greater than about 100, 200, 300, 400, 500 or more nucleotides is obtained, wherein the read is sufficient to identify both the first DNA sequence and the second DNA sequence; and ordering or clustering the sequencing reads of the one or more clones according to the probe sequences of the sequencing reads.
- (c) Bioinformatics Analysis
- In various embodiments, the quantitative genetic analysis further comprises bioinformatic analysis of the sequencing reads. Bioinformatic analysis excludes any purely mental analysis performed in the absence of a composition or method for sequencing. In certain embodiments, bioinformatics analysis includes, but is not limited to: sequence alignments; genome equivalents analysis; single nucleotide variant (SNV) analysis; gene copy number variation (CNV) analysis; and detection of genetic lesions. In particular embodiments, bioinformatics analysis is useful to quantify the number of genome equivalents analyzed in the cfDNA clone library; to detect the genetic state of a target genetic locus; to detect genetic lesions in a target genetic locus; and to measure copy number fluctuations within a target genetic locus.
- Sequence alignments may be performed between the sequence reads and one or more human reference DNA sequences. In particular embodiments, sequencing alignments can be used to detect genetic lesions in a target genetic locus including, but not limited to detection of a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion. Detection of genetic lesions that are causal or prognostic indicators may be useful in the diagnosis, prognosis, treatment, and/or monitoring of a particular genetic condition or disease.
- Also contemplated herein, are methods for sequence alignment analysis that can be performed without the need for alignment to a reference sequence, referred to herein as horizontal sequence analysis. Such analysis can be performed on any sequences generated by the methods contemplated herein or any other methods. In particular embodiments, the sequence analysis comprises performing sequence alignments on the reads obtained by the methods contemplated herein.
- In one embodiment, the genome equivalents in a cfDNA clone library are determined using bioinformatics-based counting after sequencing is performed. Each sequencing read is associated with a particular capture probe, and the collection of reads assigned to each capture probe is parsed into groups. Within a group, sets of individual reads share the same read code and the same DNA sequence start position within genomic sequence. These individual reads are grouped into a “family” and a single consensus representative of this family is carried forward as a “unique read.” All of the individual reads that constituted a family are derived from a single ligation event and thus, they are amplification-derived “siblings” of one another. Each unique read is considered a unique ligation event and the sum of unique reads is considered equivalent to the number of genome equivalents analyzed.
- As the number of unique clones approaches the total number of possible sequence combinations, probability dictates that the same code and start site combinations will be created by independent events and that these independent events will be inappropriately grouped within single families. The net result will be an underestimate of genome equivalents analyzed, and rare mutant reads may be discarded as sequencing errors because they overlap with wild-type reads bearing the same identifiers.
- In particular embodiments, to provide an accurate analysis for cfDNA clone libraries, the number of genome equivalents analyzed is about 1/10, about 1/12, about 1/14, about 1/16, about 1/18, about 1/20, about 1/25 or less the number of possible unique clones. It should be understood that the procedure outlined above is merely illustrative and not limiting.
- In some embodiments, the number of genome equivalents to be analyzed may need to be increased. To expand the depth of genome equivalents, at least two solutions are contemplated. The first solution is to use more than one adaptor set per sample. By combining adaptors, it is possible to multiplicatively expand the total number of possible clones and therefore, expand the comfortable limits of genomic input. The second solution is to expand the read code by 1, 2, 3, 4, or 5 or more bases. The number of possible read codes that differ by at least 2 bases from every other read code scales as 4(n−1) where n is the number of bases within a read code. Thus, in a non-limiting example, if a read code is 5 nucleotides and 4(5−1)=256; therefore, the inclusion of additional bases expands the available repertoire by a factor of four for each additional base.
- In one embodiment, quantitative genetic analysis comprises bioinformatic analysis of sequencing reads to identify rare single nucleotide variants (SNV).
- Next-generation sequencing has an inherent error rate of roughly 0.02-0.02%, meaning that anywhere from 1/200 to 1/500 base calls are incorrect. To detect variants and other mutations that occur at frequencies lower than this, for example at frequencies of 1 per 1000 sequences, it is necessary to invoke molecular annotation strategies. By way of a non-limiting example, analysis of 5000 unique molecules using targeted sequence capture technology would generate—at sufficient sequencing depths of >50,000 reads—a collection of 5000 unique reads, with each unique read belonging to a “family” of reads that all possess the same read code. A SNV that occurs within a family is a candidate for being a rare variant. When this same variant is observed in more than one family, it becomes a very strong candidate for being a rare variant that exists within the starting sample. In contrast, variants that occur sporadically within families are likely to be sequencing errors and variants that occur within one and only one family are either rare or the result of a base alteration that occurred ex vivo (e.g., oxidation of a DNA base or PCR-introduced errors).
- In one embodiment, the methods of detecting SNVs comprise introducing 10-fold more genomic input (genomes or genome equivalents) as the desired target sensitivity of the assay. In one non-limiting example, if the desired sensitivity is 2% (2 in 100), then the experimental target is an input of 2000 genomes.
- In particular embodiments, bioinformatics analysis of sequencing data is used to detect or identify SNV associated with a genetic state, condition or disease, genetic mosaicism, fetal testing, paternity testing, predicting response to drug treatment, diagnosing or monitoring a medical condition, microbiome profiling, pathogen screening, and monitoring organ transplants.
- In various embodiments, a method for copy number determination analysis is provided comprising obtaining one or more or a plurality of clones, each clone comprising a first DNA sequence and a second DNA sequence, wherein the first DNA sequence comprises a sequence in a targeted genetic locus and the second DNA sequence comprises a capture probe sequence. In related embodiments, a paired end sequencing reaction on the one or more clones is performed and one or more sequencing reads are obtained. In another embodiment, a sequencing reaction on the one or more clones is performed in which a single long sequencing read of greater than about 100 nucleotides is obtained, wherein the read is sufficient to identify both the first DNA sequence and the second DNA sequence. The sequencing reads of the one or more clones can be ordered or clustered according to the probe sequence of the sequencing reads.
- Copy number analyses include, but are not limited to analyses, that examine the number of copies of a particular gene or mutation that occurs in a given genomic DNA sample and can further include quantitative determination of the number of copies of a given gene or sequence differences in a given sample. In particular embodiments, copy number analysis is used to detect or identify gene amplification associated with genetic states, conditions, or diseases, fetal testing, genetic mosaicism, paternity testing, predicting response to drug treatment, diagnosing or monitoring a medical condition, microbiome profiling, pathogen screening, and monitoring organ transplants.
- In particular embodiments, bioinformatics analysis of sequencing data is used to detect or identify one or more sequences or genetic lesions in a target locus including, but not limited to detection of a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion. Detection of genetic lesions that are causal or prognostic indicators may be useful in the diagnosis, prognosis, treatment, and/or monitoring of a particular genetic condition or disease. In one embodiment, genetic lesions are associated with genetic states, conditions, or diseases, fetal testing, genetic mosaicism, paternity testing, predicting response to drug treatment, diagnosing or monitoring a medical condition, microbiome profiling, pathogen screening, and monitoring organ transplants.
- In various embodiments, the present invention contemplates a method of detecting, identifying, predicting, diagnosing, or monitoring a condition or disease in a subject.
- In particular embodiments, a method of detecting, identifying, predicting, diagnosing, or monitoring a genetic state, condition or disease in a subject comprises performing a quantitative genetic analysis of one or more target genetic loci in a cfDNA clone library to detect or identify a change in the sequence at the one or more target genetic loci.
- In one embodiment, a method of detecting, identifying, predicting, diagnosing, or monitoring a genetic state, condition or disease comprises isolating or obtaining cfDNA from a biological sample of a subject; treating the cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA; ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate a cfDNA clone library; determining the number of genome equivalents in the cfDNA clone library; and performing a quantitative genetic analysis of one or more target genetic loci in a cfDNA clone library to detect or identify a change in the sequence at the one or more target genetic loci.
- In particular embodiments, a method of detecting, identifying, predicting, diagnosing, or monitoring a genetic state, or genetic condition or disease selected from the group consisting of: genetic diseases; genetic mosaicism; fetal testing; paternity testing; paternity testing; predicting response to drug treatment; diagnosing or monitoring a medical condition; microbiome profiling; pathogen screening; and organ transplant monitoring comprising isolating or obtaining cfDNA from a biological sample of a subject; treating the cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA; ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate a cfDNA clone library; determining the number of genome equivalents in the cfDNA clone library; and performing a quantitative genetic analysis of one or more target genetic loci in a cfDNA clone library to detect or identify a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion in the sequence at the one or more target genetic loci.
- Illustrative examples of genetic diseases that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include, but are not limited to cancer, Alzheimer's disease (APOE1), Charcot-Marie-Tooth disease, Leber hereditary optic neuropathy (LHON), Angelman syndrome (UBE3A, ubiquitin-protein ligase E3A), Prader-Willi syndrome (region in chromosome 15), β-Thalassaemia (HBB, β-Globin), Gaucher disease (type I) (GBA, Glucocerebrosidase), Cystic fibrosis (CFTR Epithelial chloride channel), Sickle cell disease (HBB, β-Globin), Tay-Sachs disease (HEXA, Hexosaminidase A), Phenylketonuria (PAH, Phenylalanine hydrolyase), Familial hypercholesterolaemia (LDLR, Low density lipoprotein receptor), Adult polycystic kidney disease (PKD1, Polycystin), Huntington disease (HDD, Huntingtin), Neurofibromatosis type I (NF1, NF1 tumour suppressor gene), Myotonic dystrophy (DM, Myotonin), Tuberous sclerosis (TSC1, Tuberin), Achondroplasia (FGFR3, Fibroblast growth factor receptor), Fragile X syndrome (FMR1, RNA-binding protein), Duchenne muscular dystrophy (DMD, Dystrophin), Haemophilia A (F8C, Blood coagulation factor VIII), Lesch-Nyhan syndrome (HPRT1, Hypoxanthine guanine ribosyltransferase 1), and Adrenoleukodystrophy (ABCD1).
- Illustrative examples of cancers that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include, but are not limited to: B cell cancer, e.g., multiple myeloma, melanomas, breast cancer, lung cancer (such as non-small cell lung carcinoma or NSCLC), bronchus cancer, colorectal cancer, prostate cancer, pancreatic cancer, stomach cancer, ovarian cancer, urinary bladder cancer, brain or central nervous system cancer, peripheral nervous system cancer, esophageal cancer, cervical cancer, uterine or endometrial cancer, cancer of the oral cavity or pharynx, liver cancer, kidney cancer, testicular cancer, biliary tract cancer, small bowel or appendix cancer, salivary gland cancer, thyroid gland cancer, adrenal gland cancer, osteosarcoma, chondrosarcoma, cancer of hematological tissues, adenocarcinomas, inflammatory myofibroblastic tumors, gastrointestinal stromal tumor (GIST), colon cancer, multiple myeloma (MM), myelodysplastic syndrome (MDS), myeloproliferative disorder (MPD), acute lymphocytic leukemia (ALL), acute myelocytic leukemia (AML), chronic myelocytic leukemia (CML), chronic lymphocytic leukemia (CLL), polycythemia Vera, Hodgkin lymphoma, non-Hodgkin lymphoma (NHL), soft-tissue sarcoma, fibrosarcoma, myxosarcoma, liposarcoma, osteogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary adenocarcinomas, medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, neuroblastoma, retinoblastoma, follicular lymphoma, diffuse large B-cell lymphoma, mantle cell lymphoma, hepatocellular carcinoma, thyroid cancer, gastric cancer, head and neck cancer, small cell cancers, essential thrombocythemia, agnogenic myeloid metaplasia, hypereosinophilic syndrome, systemic mastocytosis, familiar hypereosinophilia, chronic eosinophilic leukemia, neuroendocrine cancers, carcinoid tumors, and the like.
- In one embodiment, the genetic lesion is a lesion annotated in the Cosmic database (the lesions and sequence data can be downloaded from cancer.sanger.ac.uk/cosmic/census) or a lesion annotated in the Cancer Genome Atlas (the lesions and sequence data can be downloaded from tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp).
- Illustrative examples of genes that harbor one or more genetic lesions associated with cancer that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include, but are not limited to ABCB1, ABCC2, ABCC4, ABCG2, ABL1, ABL2, AKT1, AKT2, AKT3, ALDH4A1, ALK, APC, AR, ARAF, ARFRP1, ARID1A, ATM, ATR, AURKA, AURKB, BCL2, BCL2A1, BCL2L1, BCL2L2, BCL6, BRAF, BRCA1, BRCA2, Clorf144, CARD11, CBL, CCND1, CCND2, CCND3, CCNE1, CDH1, CDH2, CDH2O, CDH5, CDK4, CDK6, CDK8, CDKN2A, CDKN2B, CDKN2C, CEBPA, CHEK1, CHEK2, CRKL, CRLF2, CTNNB1, CYP1B1, CYP2C19, CYP2C8, CYP2D6, CYP3A4, CYP3A5, DNMT3A, DOT1L, DPYD, EGFR, EPHA3, EPHA5, EPHA6, EPHA7, EPHB1, EPHB4, EPHB6, EPHX1, ERBB2, ERBB3, ERBB4, ERCC2, ERG, ESR1, ESR2, ETV1, ETV4, ETV5, ETV6, EWSR1, EZH2, FANCA, FBXW7, FCGR3A, FGFR1, FGFR2, FGFR3, FGFR4, FLT1, FLT3, FLT4, FOXP4, GATA1, GNA11, GNAQ, GNAS, GPR124, GSTP1, GUCY1A2, HOXA3, HRAS, HSP90AA1, IDH1, IDH2, IGF1R, IGF2R, IKBKE, IKZF1, INHBA, IRS2, ITPA, JAK1, JAK2, JAK3, JUN, KDR, KIT, KRAS, LRP1B, LRP2, LTK, MAN1B1, MAP2K1, MAP2K2, MAP2K4, MCL1, MDM2, MDM4, MEN1, MET, MITF, MLH1, MLL, MPL, MRE11A, MSH2, MSH6, MTHFR, MTOR, MUTYH, MYC, MYCL1, MYCN, NF1, NF2, NKX2-1, NOTCH1, NPM1, NQO1, NRAS, NRP2, NTRK1, NTRK3, PAK3, PAX5, PDGFRA, PDGFRB, PIK3CA, PIK3R1, PKHD1, PLCG1, PRKDC, PTCH1, PTEN, PTPN11, PTPRD, RAF1, RARA, RB1, RET, RICTOR, RPTOR, RUNX1, SLC19A1, SLC22A2, SLCO1B3, SMAD2, SMAD3, SMAD4, SMARCA4, SMARCB1, SMO, SOD2, SOX10, SOX2, SRC, STK11, SULT1A1, TBX22, TET2, TGFBR2, TMPRSS2, TNFRSF14, TOP1, TP53, TPMT, TSC1, TSC2, TYMS, UGT1A1, UMPS, USP9X, VHL, and WT1.
- In particular embodiments, the genetic lesion comprises a nucleotide transition or transversion, a nucleotide insertion or deletion, a genomic rearrangement, a change in copy number, or a gene fusion.
- In one embodiment, the genetic lesion is a gene fusion that fuses the 3′ coding region of the ALK gene to another gene.
- In one embodiment, the genetic lesion is a gene fusion that fuses the 3′ coding region of the ALK gene to the EML4 gene.
- Illustrative examples of conditions suitable for fetal testing that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include but are not limited to: Down Syndrome (Trisomy 21), Edwards Syndrome (Trisomy 18), Patau Syndrome (Trisomy 13), Klinefelter's Syndrome (XXY), Triple X syndrome, XYY syndrome, Trisomy 8, Trisomy 16, Turner Syndrome (XO), Robertsonian translocation, DiGeorge Syndrome and Wolf-Hirschhorn Syndrome.
- Illustrative examples of alleles suitable for paternity testing that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include but are not limited to 16 or more of: D20S1082, D6S474, D12ATA63, D22S1045, D10S1248, D1S1677, D11S4463, D4S2364, D9S1122, D2S1776, D10S1425, D3S3053, D5S2500, D1S1627, D3S4529, D2S441, D175974, D6S1017, D4S2408, D9S2157, Amelogenin, D17S1301, D1GATA113, D18S853, D20S482, and D14S1434.
- Illustrative examples of genes suitable for predicting the response to drug treatment that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include, but are not limited to, one or more of the following genes: ABCB1 (ATP-binding cassette, sub-family B (MDR/TAP), member 1), ACE (angiotensin I converting enzyme), ADH1A (alcohol dehydrogenase 1A (class I), alpha polypeptide), ADH1B (alcohol dehydrogenase IB (class I), beta polypeptide), ADH1C (alcohol dehydrogenase 1C (class I), gamma polypeptide), ADRB1 (adrenergic, beta-1-, receptor), ADRB2 (adrenergic, beta-2-, receptor, surface), AHR (aryl hydrocarbon receptor), ALDH1A1 (aldehyde dehydrogenase 1 family, member A1), ALOX5 (arachidonate 5-lipoxygenase), BRCA1 (breast cancer 1, early onset), COMT (catechol-O-methyltransferase), CYP2A6 (cytochrome P450, family 2, subfamily A, polypeptide 6), CYP2B6 (cytochrome P450, family 2, subfamily B, polypeptide 6), CYP2C9 (cytochrome P450, family 2, subfamily C, polypeptide 9), CYP2C19 (cytochrome P450, family 2, subfamily C, polypeptide 19), CYP2D6 (cytochrome P450, family 2, subfamily D, polypeptide 6), CYP2J2 (cytochrome P450, family 2, subfamily J, polypeptide 2), CYP3A4 (cytochrome P450, family 3, subfamily A, polypeptide 4), CYP3A5 (cytochrome P450, family 3, subfamily A, polypeptide 5), DPYD (dihydropyrimidine dehydrogenase), DRD2 (dopamine receptor D2), F5 (coagulation factor V), GSTP1 (glutathione S-transferase pi), HMGCR (3-hydroxy-3-methylglutaryl-Coenzyme A reductase), KCNH2 (potassium voltage-gated channel, subfamily H (eag-related), member 2), KCNJ11 (potassium inwardly-rectifying channel, subfamily J, member 11), MTHFR (5,10-methylenetetrahydrofolate reductase (NADPH)), NQO1 (NAD(P)H dehydrogenase, quinone 1), P2RY1 (purinergic receptor P2Y, G-protein coupled, 1), P2RY12 (purinergic receptor P2Y, G-protein coupled, 12), PTGIS (prostaglandin I2 (prostacyclin) synthase), SCN5A (sodium channel, voltage-gated, type V, alpha (long QT syndrome 3)), SLC19A1 (solute carrier family 19 (folate transporter), member 1), SLCO1B1 (solute carrier organic anion transporter family, member 1B1), SULT1A1 (sulfotransferase family, cytosolic, 1A, phenol-preferring, member 1), TPMT (thiopurine S-methyltransferase), TYMS (thymidylate synthetase), UGT1A1 (UDP glucuronosyltransferase 1 family, polypeptide A1), VDR (vitamin D (1,25-dihydroxyvitamin D3) receptor), VKORC1 (vitamin K epoxide reductase complex, subunit 1).
- Illustrative examples of medical conditions that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include, but are not limited to: stroke, transient ischemic attack, traumatic brain injury, heart disease, heart attack, angina, atherosclerosis, and high blood pressure.
- Illustrative examples of pathogens that can be screened for with the compositions and methods contemplated herein include, but are not limited to: bacteria fungi, and viruses.
- Illustrative examples of bacterial species that can be screened for with the compositions and methods contemplated herein include, but are not limited to: a Mycobacterium spp., a Pneumococcus spp., an Escherichia spp., a Campylobacter spp., a Corynebacterium spp., a Clostridium spp., a Streptococcus spp., a Staphylococcus spp., a Pseudomonas spp., a Shigella spp., a Treponema spp., or a Salmonella spp.
- Illustrative examples of fungal species that can be screened for with the compositions and methods contemplated herein include, but are not limited to: an Aspergillis spp., a Blastomyces spp., a Candida spp., a Coccicioides spp., a Cryptococcus spp., dermatophytes, a Tinea spp., a Trichophyton spp., a Microsporum spp., a Fusarium spp., a Histoplasma spp., a Mucoromycotina spp., a Pneumocystis spp., a Sporothrix spp., an Exserophilum spp., or a Cladosporium spp.
- Illustrative examples of viruses that can be screened for with the compositions and methods contemplated herein include, but are not limited to: Influenza A such as H1N1, H1N2, H3N2 and H5N1 (bird flu), Influenza B, Influenza C virus, Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E virus, Rotavirus, any virus of the Norwalk virus group, enteric adenoviruses, parvovirus, Dengue fever virus, Monkey pox, Mononegavirales, Lyssavirus such as rabies virus, Lagos bat virus, Mokola virus, Duvenhage virus, European bat virus 1 & 2 and Australian bat virus, Ephemerovirus, Vesiculovirus, Vesicular Stomatitis Virus (VSV), Herpesviruses such as Herpes simplex virus types 1 and 2, varicella zoster, cytomegalovirus, Epstein-Bar virus (EBV), human herpesviruses (HHV), human herpesvirus type 6 and 8, Moloney murine leukemia virus (M-MuLV), Moloney murine sarcoma virus (MoMSV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), gibbon ape leukemia virus (GaLV), feline leukemia virus (FLV), Spumavirus, Friend murine leukemia virus, Murine Stem Cell Virus (MSCV) and Rous Sarcoma Virus (RSV), HIV (human immunodeficiency virus; including HIV type 1, and HIV type 2), visna-maedi virus (VMV) virus, the caprine arthritis-encephalitis virus (CAEV), equine infectious anemia virus (EIAV), feline immunodeficiency virus (FIV), bovine immune deficiency virus (BIV), and simian immunodeficiency virus (SIV), papilloma virus, murine gammaherpesvirus, Arenaviruses such as Argentine hemorrhagic fever virus, Bolivian hemorrhagic fever virus, Sabia-associated hemorrhagic fever virus, Venezuelan hemorrhagic fever virus, Lassa fever virus, Machupo virus, Lymphocytic choriomeningitis virus (LCMV), Bunyaviridiae such as Crimean-Congo hemorrhagic fever virus, Hantavirus, hemorrhagic fever with renal syndrome causing virus, Rift Valley fever virus, Filoviridae (filovirus) including Ebola hemorrhagic fever and Marburg hemorrhagic fever, Flaviviridae including Kaysanur Forest disease virus, Omsk hemorrhagic fever virus, Tick-borne encephalitis causing virus and Paramyxoviridae such as Hendra virus and Nipah virus, variola major and variola minor (smallpox), alphaviruses such as Venezuelan equine encephalitis virus, eastern equine encephalitis virus, western equine encephalitis virus, SARS-associated coronavirus (SARS-CoV), West Nile virus, and any encephaliltis causing virus.
- Illustrative examples of genes suitable for monitoring an organ transplant in a transplant recipient that can be detected, identified, predicted, diagnosed, or monitored with the compositions and methods contemplated herein include, but are not limited to, one or more of the following genes: HLA-A, HLA-B, HLA-C, HLA-DR, HLA-DP, and HLA-DQ.
- In particular embodiments, a bioinformatic analysis is used to quantify the number of genome equivalents analyzed in the cfDNA clone library; detect genetic variants in a target genetic locus; detect mutations within a target genetic locus; detect genetic fusions within a target genetic locus; or measure copy number fluctuations within a target genetic locus.
- In various embodiments, a companion diagnostic for a genetic disease is provided, comprising: isolating or obtaining cfDNA from a biological sample of a subject; treating the cfDNA with one or more end-repair enzymes to generate end-repaired cfDNA; ligating one or more adaptors to each end of the end-repaired cfDNA to generate a cfDNA library; amplifying the cfDNA library to generate a cfDNA clone library; determining the number of genome equivalents in the cfDNA clone library; and performing a quantitative genetic analysis of one or more biomarkers associated with the genetic disease in the cfDNA clone library, wherein detection of, or failure to detect, at least one of the one or more biomarkers indicates whether the subject should be treated for the genetic disease.
- As used herein, the term “companion diagnostic” refers to a diagnostic test that is linked to a particular anti-cancer therapy. In a particular embodiment, the diagnostic methods comprise detection of genetic lesion in a biomarker associated with in a biological sample, thereby allowing for prompt identification of patients should or should not be treated with the anti-cancer therapy.
- Anti-cancer therapy includes, but is not limited to surgery, radiation, chemotherapeutics, anti-cancer drugs, and immunomodulators.
- Illustrative examples of anti-cancer drugs include, but are not limited to: alkylating agents such as thiotepa and cyclophosphamide (CYTOXAN™); alkyl sulfonates such as busulfan, improsulfan and piposulfan; aziridines such as benzodopa, carboquone, meturedopa, and uredopa; ethylenimines and methylamelamines including altretamine, triethylenemelamine, trietylenephosphoramide, triethylenethiophosphaoramide and trimethylolomelamine resume; nitrogen mustards such as chlorambucil, chlornaphazine, cholophosphamide, estramustine, ifosfamide, mechlorethamine, mechlorethamine oxide hydrochloride, melphalan, novembichin, phenesterine, prednimustine, trofosfamide, uracil mustard; nitrosureas such as carmustine, chlorozotocin, fotemustine, lomustine, nimustine, ranimustine; antibiotics such as aclacinomysins, actinomycin, authramycin, azaserine, bleomycins, cactinomycin, calicheamicin, carabicin, carminomycin, carzinophilin, chromomycins, dactinomycin, daunorubicin, detorubicin, 6-diazo-5-oxo-L-norleucine, doxorubicin and its pegylated formulations, epirubicin, esorubicin, idarubicin, marcellomycin, mitomycins, mycophenolic acid, nogalamycin, olivomycins, peplomycin, potfiromycin, puromycin, quelamycin, rodorubicin, streptonigrin, streptozocin, tubercidin, ubenimex, zinostatin, zorubicin; anti-metabolites such as methotrexate and 5-fluorouracil (5-FU); folic acid analogues such as denopterin, methotrexate, pteropterin, trimetrexate; purine analogs such as fludarabine, 6-mercaptopurine, thiamiprine, thioguanine; pyrimidine analogs such as ancitabine, azacitidine, 6-azauridine, carmofur, cytarabine, dideoxyuridine, doxifluridine, enocitabine, floxuridine, 5-FU; androgens such as calusterone, dromostanolone propionate, epitiostanol, mepitiostane, testolactone; anti-adrenals such as aminoglutethimide, mitotane, trilostane; folic acid replenisher such as frolinic acid; aceglatone; aldophosphamide glycoside; aminolevulinic acid; amsacrine; bestrabucil; bisantrene; edatraxate; defofamine; demecolcine; diaziquone; elformithine; elliptinium acetate; etoglucid; gallium nitrate; hydroxyurea; lentinan; lonidamine; mitoguazone; mitoxantrone; mopidamol; nitracrine; pentostatin; phenamet; pirarubicin; podophyllinic acid; 2-ethylhydrazide; procarbazine; PSK®; razoxane; sizofiran; spirogermanium; tenuazonic acid; triaziquone; 2,2′,2″-trichlorotriethylamine; urethan; vindesine; dacarbazine; mannomustine; mitobronitol; mitolactol; pipobroman; gacytosine; arabinoside (“Ara-C”); cyclophosphamide; thiotepa; taxoids, e.g., paclitaxel (TAXOL®, Bristol-Myers Squibb Oncology, Princeton, N.J.) and doxetaxel (TAXOTERE®., Rhne-Poulenc Rorer, Antony, France); chlorambucil; gemcitabine; 6-thioguanine; mercaptopurine; methotrexate; platinum analogs such as cisplatin and carboplatin; vinblastine; platinum; etoposide (VP-16); ifosfamide; mitomycin C; mitoxantrone; vincristine; vinorelbine; navelbine; novantrone; teniposide; aminopterin; xeloda; ibandronate; CPT-11; topoisomerase inhibitor RFS 2000; difluoromethylomithine (DMFO); retinoic acid derivatives such as Targretin™ (bexarotene), Panretin™ (alitretinoin); ONTAK™ (denileukin diftitox); esperamicins; capecitabine; and pharmaceutically acceptable salts, acids or derivatives of any of the above. Also included in this definition are anti-hormonal agents that act to regulate or inhibit hormone action on cancers such as anti-estrogens including for example tamoxifen, raloxifene, aromatase inhibiting 4(5)-imidazoles, 4-hydroxytamoxifen, trioxifene, keoxifene, LY117018, onapristone, and toremifene (Fareston); and anti-androgens such as flutamide, nilutamide, bicalutamide, leuprolide, and goserelin; and pharmaceutically acceptable salts, acids or derivatives of any of the above.
- Illustrative examples of immunomodulators include, but are not limited to: cyclosporine, tacrolimus, tresperimus, pimecrolimus, sirolimus, verolimus, laflunimus, laquinimod and imiquimod, as well as analogs, derivatives, salts, ions and complexes thereof.
- All publications, patent applications, and issued patents cited in this specification are herein incorporated by reference as if each individual publication, patent application, or issued patent were specifically and individually indicated to be incorporated by reference.
- Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be readily apparent to one of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims. The following examples are provided by way of illustration only and not by way of limitation. Those of skill in the art will readily recognize a variety of noncritical parameters that could be changed or modified to yield essentially similar results.
- The purpose of this experiment was to provide a direct proof-of-principle demonstration of rare variant detection using targeted sequence capture technology.
- Target sequence capture technology provides quantitative, sequence-based genetic analysis of nucleic acids and can be exploited to perform a combined mutational and copy number analysis of drug metabolism genes. The present inventors used targeted sequence capture technology and subsequence genetic analysis to detect rare sequence variants.
- Genomic DNA inputs play a central role in rare variant detection, but quantitative analysis and control of genomic inputs places bounds on the estimated sensitivity of rare variant analysis. A genomic qPCR assay was used by the present inventors to estimate genomic inputs.
- One experimental goal for rare variant analysis is to introduce 10-fold more genomic input as the target sensitivity of the assay. In other words, to measure variants with a sensitivity of 1% (1 in 100), then the experimental target is to input 1000 genomes. Downstream of sequencing, bioinformatics analysis reveals the number of unique reads, and this has the desirable quality of being both an orthogonal and a more direct measure of genomic inputs.
- A cell line (ZR75-30) with known SNVs was admixed with a germ line DNA sample (NA12878) in a dilution series ranging from 1-to-1 through 1-to-1000. Target regions corresponding to known sequence differences were retrieved using targeted sequence capture technology and sequenced. Sequence variants that occur at a frequency of less than 1 per 1000 sequences were detected.
- Capture Probes
- The following table shows a collection of 62 capture probes that were used in this experiment.
-
TABLE 1 60 base probe sequences used in the admix proof- of-concept study SEQ ID Target NO: Probe Sequence BRAF 1 TAAACATTGGAAAGGTTTCTAATTAACCAGG AGATCCAAAAGAAAGCGGTTCAAGTAGCA 2 GATCTCAGTTTTTTTGGTTAACTATGTATTT TGGTATATGAAGCTTCTGGGTTTTGCACA MYCN 3 GACAGATAAGCATACATATTAACATGGATAT ATATGTGAATTTCATTCAAATGGTTCTCA 4 AGCTCTTAGCCTTTGGGGGGATGACACTCTT GAGCGGACGTGGGGACGCCTCGCTCTTTA BRAF 5 AAGCCCCCACCGCCGCCTCTTTCCAAAATAA ACACCAGCCAGCCGCCGAGCCCGGAGTCG 6 GCCTCCCTTCCCCCTCCCCGCCCGACAGCGG CCGCTCGGGCCCCGGCTCTCGGTTATAAG CDH1 7 GGTGTGGCAGCCAGGGGGGCGCACTCTGCTC TGGCTGGGCCCCTTCTCCCATGTTTTCTT 8 TTACACAACCTTTGGGCTTGGACAACACTTT GGGGTCCAAAGAACCTAAGAGTCTTTCTG EPHX2 9 TGATGAAACTTGGGCTGGATGGGGCACAGGT AGGGTGCTTGTTGCTTTCAGTCAGATGAA 10 AATGAAAGAAAAGGAGGCCAGATTGCTACTC CTGGTCCCTGCCACACACTAGGTACCCTA BRCA1 11 ATTGACAATACCTACATAAAACTCTTTCCAG AATGTTGTTAAGTCTTAGTCATTAGGGAG 12 GGATTTCCACCAACACTGTATTCATGTACCC ATTTTTCTCTTAACCTAACTTTATTGGTC BRCA2 13 CAAAGGGGGAAAACCATCAGGACATTATTTA ACAACGGAAATATCTAACTGAAAGGCAAA 14 CAGGCAGACCAACCAAAGTCTTTGTTCCACC TTTTAAAACTAAATCACATTTTCACAGAG MYCN 15 CCCCAGCCAGCGGTCCGCAACCCTTGCCGCA TCCACGAAACTTTGCCCATAGCAGCGGGC MYCr1_F1 16 CGACTCATCTCAGCATTAAAGTGATAAAAAA ATAAATTAAAAGGCAAGTGGACTTCGGTG MYCr1_R1 17 CTGTGGCGCGCACTGCGCGCTGCGCCAGGTT TCCGCACCAAGACCCCTTTAACTCAAGAC MYC_r2_F1 18 TTCTACTGCGACGAGGAGGAGAACTTCTACC AGCAGCAGCAGCAGAGCGAGCTGCAGCCC MYC_r2_F2 19 ACCGAGCTGCTGGGAGGAGACATGGTGAACC AGAGTTTCATCTGCGACCCGGACGACGAG MYC_r2_F3 20 GCCGCCGCCTCAGAGTGCATCGACCCCTCGG TGGTCTTCCCCTACCCTCTCAACGACAGC MYC_r2_F4 21 GGCGGCTAGGGGACAGGGGCGGGGTGGGCAG CAGCTCGAATTTCTTCCAGATATCCTCGC MYC_r2_R1 22 AGACGAGCTTGGCGGCGGCCGAGAAGCCGCT CCACATACAGTCCTGGATGATGATGTTTT MYC_r2_R2 23 AGGAGAGCAGAGAATCCGAGGACGGAGAGAA GGCGCTGGAGTCTTGCGAGGCGCAGGACT MYC_r2_R3 24 TAAGAGTGGCCCGTTAAATAAGCTGCCAATG AAAATGGGAAAGGTATCCAGCCGCCCACT MYC_r2_R4 25 TTGTATTTGTACAGCATTAATCTGGTAATTG ATTATTTTAATGTAACCTTGCTAAAGGAG MYC_r3_F1 26 GAGGCCACAGCAAACCTCCTCACAGCCCACT GGTCCTCAAGAGGTGCCACGTCTCCACAC MYC_r3_F2 27 AGAGGAGGAACGAGCTAAAACGGAGCTTTTT TGCCCTGCGTGACCAGATCCCGGAGTTGG MYC_r3_F3 28 TCCAACTTGACCCTCTTGGCAGCAGGATAGT CCTTCCGAGTGGAGGGAGGCGCTGCGTAG MYC_r3_R1 29 GCTTGGACGGACAGGATGTATGCTGTGGCTT TTTTAAGGATAACTACCTTGGGGGCCTTT MYC_r3_R2 30 GCATTTGATCATGCATTTGAAACAAGTTCAT AGGTGATTGCTCAGGACATTTCTGTTAGA MYC_r3_R3 31 CGCCCCGCGCCCTCCCAGCCGGGTCCAGCCG GAGCCATGGGGCCGGAGCCGCAGTGAGCA ERBB2r1r 32 CTCTGGCCCCGCCGGCCGCGGGACCTCGGCG GGGCATCCACAGGGCAGGGTCCCGCCGCT ERBB2r2f 33 GGCATGACTTGGAGTGAGTTTGGATGGGGTG GCCAGGTCTGAGAAGGTCCCCCGCCAGTG ERBB2r2r 34 GCAGGGCACCTTCTTCTGCCACCCACCTGTA AACAGAGGGCTCAGCCCAGCTGGAGGCAG ERBB2r3f 35 CCCAAGATCTCCAAGTACTGGGGAACCCCAG GGAGGCCCTGGGGGGTGGCAGTGTTCCTA ERBB2r3r 36 CTAATGCACACAAAGCCTCCCCCTGGTTAGC AGTGGCCCTGGTCAGCTCTGAATAACCAA ERBB2r4f 37 CTGCTCCTCTTTTAGAAGGCAGGAGGGCCCC AAGGGAAGCAGAAGGTGACAGAAGGGGAA ERBB2r4r 38 TGGGGCAGTGGCGGGCAGGCACTGGGTTGTA AGTTGGGAGTTTGCGGCTGGGGTCAGGCT ERBB2r5f 39 TCTGCTGCTGTTTGTGCCTCTCTCTGTTACT AACCCGTCCTCTCGCTGTTAGACATCTCT ERBB2r5r 40 CCCACCCCTCCCATGTCACCTGTATGACACC TGCATTCCACCCGGCCCCAGCCCTCCCCT ERBB2r6f 41 TGGGCCAGGTAGTCTCCCTAGAAGGTGATGC TGATGAGGGTCTGGTGCCCAGGGCGCCAC ERBB2r6r 42 GGTGCCCACCCCTTGCATCCTGGGGGGTAGA GCACATTGGGCACAAAGCAGAGGCACATA ERBB2r7f 43 CACCCTGCCTGGTACTGCCCTATTGCCCCTG GCACACCAGGGCAAAACAGCACAGTGAAA ERBB2r7r 44 CCATTTACAGAAACAAACCTCCCCACCAAAA TGAGAAAACTGTGTTTCTCCCTGGCACTC ERBB2r8f 45 TTATTCTTCTTGTGCCTGGGCACGGTAATGC TGCTCATGGTGGTGCACGAAGGGCCAGGG ERBB2r8r 46 GAAGGATAGGACAGGGTGGGCTGGGCCAGGC TGCATGCGCAGAGGGACAGGAACTGCAGC ERBB2r9f 47 GGGCCCGGACCCTGATGCTCATGTGGCTGTT GACCTGTCCCGGTATGAAGGCTGAGACGG ERBB2r9r 48 TCTGTCTCCTGCCATCCCCAAGAGATGCTGC CACATCTGGATCCTCAGGACTCTGTCTGC TYMSr2f 49 TCACGTCCCAGGGCAGTTTTCTTCCCTGAAG AAAGTTGGATGGCATGATCTGTCTTCCCA TYMSr2r 50 GTGTTGAGAACAGACTACTGACTTCTAATAG CAGCGACTTCTTTACCTTGATAAACCACA TYMSr3f 51 AAAAAAAGGATGGGTTCCATATGGGTGGTGT CAAGTGCCCACCTCCTAGCAAGTCAGCAG TYMSr3r 52 CCCTCACAAGGTCAAAGCTATACATCAGCTC