US10767220B2 - Methods of amplifying nucleic acids and compositions for practicing the same - Google Patents
Methods of amplifying nucleic acids and compositions for practicing the same Download PDFInfo
- Publication number
- US10767220B2 US10767220B2 US15/563,939 US201615563939A US10767220B2 US 10767220 B2 US10767220 B2 US 10767220B2 US 201615563939 A US201615563939 A US 201615563939A US 10767220 B2 US10767220 B2 US 10767220B2
- Authority
- US
- United States
- Prior art keywords
- copy number
- interest
- nucleic acid
- nucleic acids
- sequencing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 233
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 229
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 229
- 238000000034 method Methods 0.000 title claims abstract description 108
- 239000000203 mixture Substances 0.000 title abstract description 45
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 179
- 230000003321 amplification Effects 0.000 claims abstract description 106
- 238000003199 nucleic acid amplification method Methods 0.000 claims abstract description 106
- 239000011541 reaction mixture Substances 0.000 claims abstract description 21
- 238000012163 sequencing technique Methods 0.000 claims description 168
- 206010028980 Neoplasm Diseases 0.000 claims description 26
- 230000001413 cellular effect Effects 0.000 claims description 19
- 101001013799 Homo sapiens Metallothionein-1X Proteins 0.000 claims description 14
- 101001007909 Homo sapiens Nuclear pore complex protein Nup93 Proteins 0.000 claims description 14
- 102100031781 Metallothionein-1X Human genes 0.000 claims description 14
- 102100027585 Nuclear pore complex protein Nup93 Human genes 0.000 claims description 14
- 108091006480 SLC25A44 Proteins 0.000 claims description 14
- 102100032414 Solute carrier family 25 member 44 Human genes 0.000 claims description 14
- 102100029167 Calcipressin-3 Human genes 0.000 claims description 13
- 102100030208 Elongin-A Human genes 0.000 claims description 13
- 102100039417 Gap junction beta-5 protein Human genes 0.000 claims description 13
- 101001062199 Homo sapiens Calcipressin-3 Proteins 0.000 claims description 13
- 101001011859 Homo sapiens Elongin-A Proteins 0.000 claims description 13
- 101000889145 Homo sapiens Gap junction beta-5 protein Proteins 0.000 claims description 13
- 101001047853 Homo sapiens Heterogeneous nuclear ribonucleoprotein R Proteins 0.000 claims description 13
- 101001044883 Homo sapiens Interleukin-22 receptor subunit alpha-1 Proteins 0.000 claims description 13
- 101000621037 Homo sapiens Rab-like protein 2B Proteins 0.000 claims description 13
- 102100022723 Interleukin-22 receptor subunit alpha-1 Human genes 0.000 claims description 13
- 102100022836 Rab-like protein 2B Human genes 0.000 claims description 13
- 102100023999 Heterogeneous nuclear ribonucleoprotein R Human genes 0.000 claims description 9
- 101100355763 Mus musculus Rabl2 gene Proteins 0.000 claims 1
- 239000000523 sample Substances 0.000 description 118
- 108091093088 Amplicon Proteins 0.000 description 35
- 239000002773 nucleotide Substances 0.000 description 35
- 125000003729 nucleotide group Chemical group 0.000 description 35
- 238000007481 next generation sequencing Methods 0.000 description 28
- 210000004027 cell Anatomy 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 19
- 102000053602 DNA Human genes 0.000 description 19
- 230000000295 complement effect Effects 0.000 description 18
- 238000006243 chemical reaction Methods 0.000 description 16
- 150000002500 ions Chemical class 0.000 description 16
- 210000001519 tissue Anatomy 0.000 description 16
- 229920002477 rna polymer Polymers 0.000 description 10
- 201000011510 cancer Diseases 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 238000001514 detection method Methods 0.000 description 8
- 238000010008 shearing Methods 0.000 description 8
- 238000013459 approach Methods 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 238000013467 fragmentation Methods 0.000 description 6
- 238000006062 fragmentation reaction Methods 0.000 description 6
- 238000003556 assay Methods 0.000 description 5
- 208000035475 disorder Diseases 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000007790 solid phase Substances 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 239000003112 inhibitor Substances 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 238000007480 sanger sequencing Methods 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 102000016911 Deoxyribonucleases Human genes 0.000 description 3
- 108010053770 Deoxyribonucleases Proteins 0.000 description 3
- 201000010374 Down Syndrome Diseases 0.000 description 3
- 108020004459 Small interfering RNA Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000000654 additive Substances 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 208000036878 aneuploidy Diseases 0.000 description 3
- 231100001075 aneuploidy Toxicity 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- -1 hydrogen ions Chemical class 0.000 description 3
- 208000014018 liver neoplasm Diseases 0.000 description 3
- 238000002844 melting Methods 0.000 description 3
- 230000008018 melting Effects 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 239000003002 pH adjusting agent Substances 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 239000003161 ribonuclease inhibitor Substances 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000004055 small Interfering RNA Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 2
- 201000009030 Carcinoma Diseases 0.000 description 2
- 208000005443 Circulating Neoplastic Cells Diseases 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 108700011259 MicroRNAs Proteins 0.000 description 2
- 229940122426 Nuclease inhibitor Drugs 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 206010044688 Trisomy 21 Diseases 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 230000000996 additive effect Effects 0.000 description 2
- 239000013060 biological fluid Substances 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 201000007270 liver cancer Diseases 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 208000030454 monosomy Diseases 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 206010041823 squamous cell carcinoma Diseases 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- 206010044689 trisomy 22 Diseases 0.000 description 2
- JWBWJOKTZVXSRT-DWQAGKKUSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]-2-aminopentanoic acid Chemical compound N1C(=O)N[C@@H]2[C@H](CCCC(N)C(O)=O)SC[C@@H]21 JWBWJOKTZVXSRT-DWQAGKKUSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000269350 Anura Species 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 208000014392 Cat-eye syndrome Diseases 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 206010017993 Gastrointestinal neoplasms Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 206010061934 Salivary gland cancer Diseases 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 206010047741 Vulval cancer Diseases 0.000 description 1
- 206010049644 Williams syndrome Diseases 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 210000001367 artery Anatomy 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 201000000053 blastoma Diseases 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 230000008711 chromosomal rearrangement Effects 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 201000008184 embryoma Diseases 0.000 description 1
- 201000003914 endometrial carcinoma Diseases 0.000 description 1
- 230000002357 endometrial effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 230000002440 hepatic effect Effects 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- GPRLSGONYQIRFK-UHFFFAOYSA-N hydron Chemical compound [H+] GPRLSGONYQIRFK-UHFFFAOYSA-N 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 201000005249 lung adenocarcinoma Diseases 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000009826 neoplastic cell growth Effects 0.000 description 1
- 210000000944 nerve tissue Anatomy 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 238000003203 nucleic acid sequencing method Methods 0.000 description 1
- 230000005257 nucleotidylation Effects 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 201000002628 peritoneum cancer Diseases 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 201000003804 salivary gland carcinoma Diseases 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 208000000587 small cell lung carcinoma Diseases 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 208000017572 squamous cell neoplasm Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 125000003396 thiol group Chemical class [H]S* 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 206010046766 uterine cancer Diseases 0.000 description 1
- 208000012991 uterine carcinoma Diseases 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
- 201000005102 vulva cancer Diseases 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/686—Polymerase chain reaction [PCR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2535/00—Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
- C12Q2535/122—Massive parallel sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2537/00—Reactions characterised by the reaction format or use of a specific feature
- C12Q2537/10—Reactions characterised by the reaction format or use of a specific feature the purpose or use of
- C12Q2537/143—Multiplexing, i.e. use of multiple primers or probes in a single reaction, usually for simultaneously analyse of multiple analysis
Definitions
- Nucleic acid sequencing methods include the Sanger “dideoxy” method that relies upon the use of dideoxyribonucleoside triphosphates as chain terminators. The Sanger method has been adapted for use in automated sequencing with the use of chain terminators incorporating fluorescent labels. Other methods include “next-generation” sequencing methods, including those based on successive cycles of incorporation of fluorescently labeled nucleic acid analogues. In such “sequencing by synthesis” or “cycle sequencing” methods the identity of the added base is determined after each nucleotide addition by detecting the fluorescent label. Other next-generation sequencing methods include those based on the detection of hydrogen ions that are released during the polymerization of DNA.
- a microwell containing a template DNA strand to be sequenced is flooded with a single species of deoxyribonucleotide triphosphate (dNTP). If the introduced dNTP is complementary to the leading template nucleotide, it is incorporated into the growing complementary strand. This causes the release of a hydrogen ion that triggers an ISFET ion sensor, which indicates that a reaction has occurred. If homopolymer repeats are present in the template sequence, multiple dNTP molecules will be incorporated in a single cycle. This leads to a corresponding number of released hydrogens and a proportionally higher electronic signal.
- dNTP deoxyribonucleotide triphosphate
- Copy number variations are alterations observed in the genome that result in genes having an abnormal number of copies—either more or less than the expected number of two. CNVs may be indicative of large-scale chromosomal rearrangements, such as large insertions or deletions, which can be commonly found in cancer tissue. In some cases, entire chromosomes can be lost or duplicated, which is a common cause of genetic disorders, such as Down syndrome (trisomy 21), cat eye syndrome (trisomy 22), Williams syndrome (monosomy 7), and various others. Identifying copy number variations can help understand and diagnose cancer and aneuploidy-related disorders.
- CNVs play a large role in cancer and detection of altered numbers of copies of certain genes can provide physicians with information to guide therapy.
- CNVs have been detected with cytogenetic techniques such as array-based comparative genome hybridization and molecular techniques such as SNP arrays.
- cytogenetic techniques such as array-based comparative genome hybridization and molecular techniques such as SNP arrays.
- NGS next-generation sequencing
- the methods include combining a nucleic acid sample and one or more amplification primers adapted to amplify a region of one or more copy number stable genes in a reaction mixture under conditions sufficient to amplify the one or more copy number stable genes.
- compositions and kits that find use in practicing embodiments of the methods.
- FIG. 1 provides next generation sequencing (NGS) data obtained using a custom NGS sequencing panel according to one embodiment of the present disclosure.
- NGS next generation sequencing
- a sequencing library was prepared using a panel of primers adapted to amplify ten distinct copy number stable genes, as well as genes of interest present in the sample.
- FIG. 2 graphically illustrates copy number stable regions of the human genome identified using an approach according to one embodiment of the present disclosure.
- FIG. 3 shows non-limiting examples of genes which may be amplified, e.g., as part of preparing an NGS sequencing library, according to certain aspects of the present disclosure.
- the methods include combining a nucleic acid sample and one or more amplification primers adapted to amplify a region of one or more copy number stable genes in a reaction mixture under conditions sufficient to amplify the one or more copy number stable genes.
- compositions and kits that find use in practicing embodiments of the methods.
- aspects of the present disclosure include methods of amplifying nucleic acids.
- the methods include combining a nucleic acid sample and one or more amplification primers adapted to amplify a region of one or more copy number stable genes in a reaction mixture under conditions sufficient to amplify the one or more copy number stable genes.
- aspects of the invention include producing a reaction mixture from a sample and one or more amplification primers adapted to amplify a region of one or more copy number stable genes.
- copy number stable gene is meant a gene or genomic region that is refractory to duplication or loss (e.g., is refractory to copy number variation (CNV)).
- CNV copy number variation
- a copy number stable gene may be a gene or genomic region having a copy number that does not vary (or substantially vary) between a genome of interest and a reference genome.
- a copy number stable gene may be a gene or genomic region that is present in both a genome of interest (e.g., a human tumor genome) and a reference genome (e.g., a human non-tumor genome) and, based on available information relating to the copy number of the gene, is known to not vary (or substantially vary) in copy number between the genome of interest and the reference genome.
- a genome of interest e.g., a human tumor genome
- a reference genome e.g., a human non-tumor genome
- a copy number stable gene has the same copy number in 80% or more, 85% or more, 90% or more, 95% or more, 96% or more, 97% or more, 98% or more, 99% or more, or 100% of the genomes in a population of interest, e.g., humans, or human subpopulations of interest (e.g., racial/ethnic human subpopulations, or the like).
- amplicons produced from copy number stable genes find use, e.g., as internal copy number controls in amplification- and/or sequencing-based assays for determining the copy number (and optionally, the presence of CNVs) in one or more genes of interest in a nucleic acid sample of interest.
- genes of interest are clinically relevant, e.g., genes for which copy number variation is associated with a disease or disorder, such as cancer, aneuploidy-related disorders (e.g., trisomy 21, trisomy 22, monosomy 7, etc.), and the like.
- amplification of the one or more copy number stable genes finds use, e.g., in diagnosing a condition known to be associated with one or more CNVs.
- the diagnosis includes sequencing (e.g., by NGS sequencing) the amplicons of the one or more copy number stable genes and one or more genes of interest to determine the copy number (and optionally, detect CNV, if any) in the one or more genes of interest.
- the one or more copy number stable genes are located in ohnolog-rich regions (ORRs) of the genome.
- An “ohnolog” is a duplicated gene derived from whole genome duplication (WGD).
- an ORR is a genomic region (e.g., a human genomic region) having a proportion of ohnologs that is 50% or greater, 55% or greater, 60% or greater, 65% or greater, 70% or greater, or 75% or greater in a 2 Mb window.
- the one or more copy number stable genes include one or more ohnologs. A detailed description of human ohnolog-rich regions and human ohnologs is found in Makino et al. (2013) Nature Communications 4:2283, and in certain aspects, one or any combination of ohnologs or copy number stable genes within the ohnolog-rich regions described therein may be amplified according to the methods of the present disclosure.
- the combining comprises combining a known amount of nucleic acids corresponding to one or more copy number stable genes with the nucleic acid sample and the one or more amplification primers adapted to amplify a region of the nucleic acids corresponding to one or more copy number stable genes.
- the one or more amplification primers adapted to amplify a region of one or more copy number stable genes may be adapted to amplify any useful copy number stable genes, including any of the copy number stable genes described herein, e.g., ohnologs, genes or genomic regions that are refractory to duplication or loss within ohnolog-rich regions, and the like.
- Any amplification primer, or combination of two or more amplification primers, adapted to amplify the one or more copy number stable genes (and optionally, one or more nucleic acids of interest) may be employed.
- the one or more amplification primers are adapted to amplify one, each of, or any combination of the copy number stable genes HNRNPR (Entrez Gene ID 10236), TCEB3 (Entrez Gene ID 6924), IL22RA1 (Entrez Gene ID 58985), RCAN3 (Entrez Gene ID 11123), GJB5 (Entrez Gene ID 2707), SLC25A44 (Entrez Gene ID 9673), MT3 (Entrez Gene ID 4504), MT1X (Entrez Gene ID 4501), NUP93 (Entrez Gene ID 9688) and/or RABL2B (Entrez Gene ID 11158).
- HNRNPR Entrez Gene ID 10236
- TCEB3 Entrez Gene ID 6924
- IL22RA1 Entrez Gene ID 58985
- RCAN3 Entrez Gene ID 11123
- GJB5 Entrez Gene ID 2707
- SLC25A44 Entrez Gene ID 9673
- MT3 Entrez Gene ID 4504
- the one or more amplification primers are non-random primers specifically designed/selected to amplify one or more predetermined copy number stable genes, and optionally, one or more predetermined nucleic acids of interest in the sample.
- the one or more amplification primers may be designed/selected by a practitioner of the subject methods based both on the type of nucleic acid sample that will be present in the reaction mixture.
- the nucleic acid sample is a human genomic DNA sample
- the one or more amplification primers may be designed/selected by the practitioner to ensure that the one or more amplification primers are adapted to amplify one or more copy number stable genes present in the human genome.
- CNVs are known to occur in organisms other than humans, such as rodents (e.g., mice and rats), dogs, cattle, rhesus monkeys, chimpanzees, and birds.
- the one or more amplification primers may be designed/selected to amplify copy number stable genes present in the genome of the relevant non-human organism.
- a “panel” (or “pool”) of two or more amplification primers is employed. Such pools find use, e.g., when multiplexed amplification of two or more copy number stable genes is desirable.
- a panel of primers is employed and adapted to amplify 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 15 or more, 20 or more, 25 or more, 30 or more, 40 or more, 50 or more, or 100 or more copy number stable genes.
- Such pools may additionally include primers adapted to amplify one or more genes of interest, e.g., genes for which it is desirable to determine copy number information (e.g., for detecting CNV, if any), including but not limited to, genes for which copy number variation is associated with a disease or disorder (e.g., cancer or a particular type thereof, any aneuploidy-related disorder, etc.).
- genes for which it is desirable to determine copy number information e.g., for detecting CNV, if any
- a disease or disorder e.g., cancer or a particular type thereof, any aneuploidy-related disorder, etc.
- a panel of primers may be employed that is adapted to amplify any desirable number of copy number stable genes, and additionally, adapted to amplify any desirable number of genes of interest, including but not limited to, 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 15 or more, 20 or more, 25 or more, 30 or more, 40 or more, 50 or more, 100 or more, 150 or more, 200 or more, 250 or more, 500 or more, or 1000 or more genes of interest.
- genes of interest finds use, e.g., for SNP genotyping/variant detection by sequencing, genomic profiling, expression profiling, and/or the like.
- genes of interest which may be amplified according to the methods of the present disclosure include one, each, or any combination of the genes provided in FIG. 3 .
- a panel of amplification primers is employed, and the panel is adapted, in addition to amplifying one or more copy number stable genes, to amplify two or more regions of interest present in genomic DNA, including but not limited to, “hot spot” regions that are frequently mutated in human cancer genes.
- a panel may be specifically designed by one practicing the subject methods, or the practitioner may order one of the various commercially available panels, such as an Ion AmpliSeqTM Cancer Hotspot Panel available from Life Technologies, Inc. (Carlsbad, Calif.).
- the one or more amplification primers may be designed to be sufficiently complementary to their corresponding target nucleic acids in the nucleic acid sample, such that the primer specifically hybridizes to its target under hybridization conditions.
- complementary refers to a nucleotide sequence that base-pairs by non-covalent bonds to a region of the copy number stable gene or nucleic acid of interest.
- adenine (A) forms a base pair with thymine (T), as does guanine (G) with cytosine (C) in DNA.
- thymine is replaced by uracil (U).
- U uracil
- A is complementary to T and G is complementary to C.
- RNA is complementary to U and vice versa.
- “complementary” refers to a nucleotide sequence that is at least partially complementary.
- a nucleotide sequence may be partially complementary to a target, in which not all nucleotides are complementary to every nucleotide in the target nucleic acid in all the corresponding positions.
- the amplification primer may be perfectly (i.e., 100%) complementary to the copy number stable gene, or the primer and the copy number stable gene may share some degree of complementarity which is less than perfect (e.g., 70%, 75%, 85%, 90%, 95%, 99%).
- a non-limiting example of such a mathematical algorithm is described in Karlin et al., Proc. Natl. Acad. Sci. USA 90:5873-5877 (1993).
- NBLAST nucleic Acids Res. 25:389-3402
- hybridization conditions means conditions in which a primer specifically hybridizes to a region of a copy number stable gene or nucleic acid of interest. Whether a primer specifically hybridizes to a target nucleic acid is determined by such factors as the degree of complementarity between the polymer and the target nucleic acid and the temperature at which the hybridization occurs, which may be informed by the melting temperature (T M ) of the primer.
- T M melting temperature
- the melting temperature refers to the temperature at which half of the primer-target nucleic acid duplexes remain hybridized and half of the duplexes dissociate into single strands.
- nucleic acid sequences present in the genomes, transcriptomes, etc. of nucleic acid sources of interest are readily available from resources such as the nucleic acid sequence databases of the National Center for Biotechnology Information (NCBI), the European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), and the like. Based on such sequence information, one can design/select one or more amplification primers to amplify the one or more copy number stable genes, and optionally, one or more nucleic acids of interest.
- NCBI National Center for Biotechnology Information
- EBL-EBI European Molecular Biology Laboratory-European Bioinformatics Institute
- the one or more amplification primers include a sequencing adapter (e.g., 5′ relative to a 3′ hybridization region of the primer(s)).
- sequencing adapter is meant one or more nucleic acid domains that include at least a portion of a nucleic acid sequence (or complement thereof) utilized by a sequencing platform of interest, such as a sequencing platform provided by Illumina® (e.g., the HiSegTM, MiSegTM and/or Genome AnalyzerTM sequencing systems); Ion TorrentTM (e.g., the Ion PGMTM and/or Ion ProtonTM sequencing systems); Pacific Biosciences (e.g., the PACBIO RS II sequencing system); Life TechnologiesTM (e.g., a SOLiD sequencing system); Roche (e.g., the 454 GS FLX+ and/or GS Junior sequencing systems); or any other sequencing platform of interest.
- Illumina® e.g., the HiSegTM, MiSegTM and/or Genome Ana
- the one or more amplification primers include a sequencing adapter that includes a nucleic acid domain selected from: a domain (e.g., a “capture site” or “capture sequence”) that specifically binds to a surface-attached sequencing platform oligonucleotide (e.g., the P5 or P7 oligonucleotides attached to the surface of a flow cell in an Illumina® sequencing system); a sequencing primer binding domain (e.g., a domain to which the Read 1 or Read 2 primers of the Illumina® platform may bind); a barcode domain (e.g., a domain that uniquely identifies the sample source of the nucleic acid being sequenced to enable sample multiplexing by marking every molecule from a given sample with a specific barcode or “tag”); a barcode sequencing primer binding domain (a domain to which a primer used for sequencing a barcode binds); a molecular identification domain (e.g., a molecular index tag, such
- the one or more amplification primers may include a sequencing adapter of any length and sequence suitable for the sequencing platform of interest.
- the nucleic acid domains are from 4 to 100 nucleotides in length, such as from 6 to 75, from 8 to 50, or from 10 to 40 nucleotides in length.
- the one or more amplification primers may include one or more nucleotides (or analogs thereof) that are modified or otherwise non-naturally occurring.
- the amplification primers may include one or more nucleotide analogs (e.g., LNA, FANA, 2′-O-Me RNA, 2′-fluoro RNA, or the like), linkage modifications (e.g., phosphorothioates, 3′-3′ and 5′-5′ reversed linkages), 5′ and/or 3′ end modifications (e.g., 5′ and/or 3′ amino, biotin, DIG, phosphate, thiol, dyes, quenchers, etc.), one or more fluorescently labeled nucleotides, or any other feature that provides a desired functionality to the primers and/or resulting amplicons.
- nucleotide analogs e.g., LNA, FANA, 2′-O-Me RNA, 2′-fluoro RNA, or the like
- the nucleic acid sample may be any nucleic acid sample that includes, or is suspected of including, one or more copy number stable genes, and optionally, one or more nucleic acids of interest, e.g., one or more nucleic acids in addition to the one or more copy number stable genes for which amplification is desirable.
- Amplification of one or more copy number stable genes and one or more nucleic acids of interest may be desirable for a variety of reasons, including but not limited to, sequencing the amplification products (or “amplicons”) of the one or more copy number stable genes and one or more nucleic acids of interest.
- Sequencing the amplicons enables one to determine the nucleotide sequence(s) of—and number of sequencing runs corresponding to—the one or more copy number stable genes and one or more nucleic acids of interest, which may in turn be used to determine the copy number of one or more of the nucleic acids of interest as described in detail below.
- determining the copy number of one or more of the nucleic acids of interest enables one to determine whether copy number variation (CNV) exists for one or more of the one or more nucleic acids of interest in the sample.
- CNV copy number variation
- the nucleic acid sample may be one or more cells of interest, or a nucleic acid sample isolated from one or more cells of a cellular sample of interest.
- the nucleic acid sample may be a nucleic acid sample isolated from a single cell, a plurality of cells (e.g., cultured cells), a tissue, an organ, or an organism (e.g., bacteria, yeast, or the like).
- the nucleic acid sample is isolated from a cell(s), tissue, organ, and/or the like of a mammal (e.g., a human, a rodent (e.g., a mouse), or any other mammal of interest).
- the nucleic acid sample is isolated from a source other than a mammal, such as bacteria, yeast, insects (e.g., drosophila), amphibians (e.g., frogs (e.g., Xenopus)), viruses, plants, or any other non-mammalian nucleic acid sample source.
- a source other than a mammal such as bacteria, yeast, insects (e.g., drosophila), amphibians (e.g., frogs (e.g., Xenopus)), viruses, plants, or any other non-mammalian nucleic acid sample source.
- the nucleic acid sample is isolated from a biological sample, such as a biological fluid or a biological tissue.
- biological fluids include urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral spinal fluid, tears, mucus, sperm, amniotic fluid or the like.
- Biological tissues are aggregates of cells, usually of a particular kind together with their intercellular substance that form one of the structural materials of a human, animal, plant, bacterial, fungal or viral structure, including connective, epithelium, muscle and nerve tissues. Examples of biological tissues also include organs, tumors, lymph nodes, arteries and individual cells.
- the nucleic acid sample is a tumor nucleic acid sample (that is, a nucleic acid sample isolated from a tumor).
- Tumor refers to all neoplastic cell growth and proliferation, whether malignant or benign, and all pre-cancerous and cancerous cells and tissues.
- cancer and “cancerous” refer to or describe the physiological condition in mammals that is typically characterized by unregulated cell growth/proliferation. Examples of cancer include but are not limited to, carcinoma, lymphoma, blastoma, sarcoma, and leukemia.
- the nucleic acid sample includes nucleic acids from one or more circulating tumor cells (CTCs).
- CTCs circulating tumor cells
- the nucleic acid sample is a deoxyribonucleic acid (DNA) sample.
- DNA samples of interest include, but are not limited to, genomic DNA samples, mitochondrial DNA samples, complementary DNA (cDNA, synthesized from any RNA or DNA of interest) samples, recombinant DNA samples (e.g., plasmid DNA samples), and any other DNA samples of interest.
- the nucleic acid sample is a ribonucleic acid (RNA) sample.
- RNA samples of interest include, but are not limited to, messenger RNA (mRNA) samples, small/short interfering RNA (siRNA) samples, microRNA (miRNA) samples, any other DNA samples of interest.
- mRNA messenger RNA
- siRNA small/short interfering RNA
- miRNA microRNA
- kits for isolating DNA from a source of interest include the DNeasy®, RNeasy®, QIAamp®, QIAprep® and QIAquick® nucleic acid isolation/purification kits by Qiagen, Inc. (Germantown, Mdd); the DNAzol®, ChargeSwitch®, Purelink®, and GeneCatcher® nucleic acid isolation/purification kits by Life Technologies, Inc. (Carlsbad, Calif.); the NucleoMag®, NucleoSpin®, and NucleoBond® nucleic acid isolation/purification kits by Clontech Laboratories, Inc.
- the nucleic acid is isolated from a fixed biological sample, e.g., formalin-fixed, paraffin-embedded (FFPE) tissue.
- FFPE formalin-fixed, paraffin-embedded
- Genomic DNA and RNA from FFPE tissue may be isolated using commercially available kits—such as the AllPrep® DNA/RNA FFPE kit by Qiagen, Inc. (Germantown, Md.), the RecoverAll® Total Nucleic Acid Isolation kit for FFPE by Life Technologies, Inc. (Carlsbad, Calif.), and the NucleoSpin® FFPE kits by Clontech Laboratories, Inc. (Mountain View, Calif.).
- the sample may be subjected to shearing/fragmentation, e.g., to generate nucleic acids that are shorter in length as compared to precursor non-sheared nucleic acids (e.g., genomic DNA) in the original sample.
- shearing/fragmentation strategies include, but are not limited to, passing the sample one or more times through a micropipette tip or fine-gauge needle, nebulizing the sample, sonicating the sample (e.g., using a focused-ultrasonicator by Covaris, Inc.
- bead-mediated shearing e.g., using one or more DNA-shearing e.g., restriction, enzymes
- enzymatic shearing e.g., using one or more DNA-shearing e.g., restriction, enzymes
- chemical based fragmentation e.g., using divalent cations
- fragmentation buffer which may be used in combination with heat
- the nucleic acids generated by shearing/fragmentation of a starting nucleic acid sample has a length of from 50 to 10,000 nucleotides, from 100 to 5000 nucleotides, from 150 to 2500 nucleotides, from 200 to 1000 nucleotides, e.g., from 250 to 500 nucleotides in length.
- the nucleic acids generated by shearing/fragmentation of a starting nucleic acid sample has a length of from 10 to 20 nucleotides, from 20 to 30 nucleotides, from 30 to 40 nucleotides, from 40 to 50 nucleotides, from 50 to 60 nucleotides, from 60 to 70 nucleotides, from 70 to 80 nucleotides, from 80 to 90 nucleotides, from 90 to 100 nucleotides, from 100 to 150 nucleotides, from 150 to 200, from 200 to 250 nucleotides in length, or from 200 to 1000 nucleotides or even from 1000 to 10,000 nucleotides, for example, as appropriate for a sequencing platform in which one desires to sequence amplicons produced upon amplification of the one or more copy number stable genes and any other amplicons that may be present, e.g., amplicons produced from nucleic acids of interest present in the nucleic acid sample.
- the nucleic acid sample and the one or more amplification primers adapted to amplify a region of one or more copy number stable genes are combined in a reaction mixture under conditions sufficient to amplify the one or more copy number stable genes.
- condition sufficient to amplify the one or more copy number stable genes is meant reaction conditions that permit polymerase-mediated extension of a 3′ end of the one or more amplification primers.
- Achieving suitable reaction conditions may include selecting reaction mixture components, concentrations thereof, and a reaction temperature to create an environment in which a polymerase is active and the relevant nucleic acids in the reaction interact (e.g., hybridize) with one another in the desired manner. Suitable hybridization conditions are described in detail above.
- the reaction mixture may include buffer components that establish an appropriate pH, salt concentration (e.g., KCl concentration), metal cofactor concentration (e.g., Mg 2+ or Mn 2+ concentration), and the like, for the extension reaction to occur.
- salt concentration e.g., KCl concentration
- metal cofactor concentration e.g., Mg 2+ or Mn 2+ concentration
- nuclease inhibitors e.g., a DNase inhibitor and/or an RNase inhibitor
- additives for facilitating amplification/replication of GC rich sequences e.g., one or more additives for facilitating amplification/replication of GC rich sequences
- enzyme-stabilizing components e.g., DTT present at a final concentration ranging from 1 to 10 mM (e.g., 5 mM)
- any other reaction mixture components useful for facilitating polymerase-mediated extension reactions.
- the reaction mixture can have a pH suitable for the primer extension reaction.
- the pH of the reaction mixture ranges from 5 to 9, such as from 7 to 9.
- the reaction mixture includes a pH adjusting agent.
- pH adjusting agents of interest include, but are not limited to, sodium hydroxide, hydrochloric acid, phosphoric acid buffer solution, citric acid buffer solution, and the like.
- the pH of the reaction mixture can be adjusted to the desired range by adding an appropriate amount of the pH adjusting agent.
- the temperature range suitable for amplification may vary according to factors such as the particular polymerase employed, the melting temperatures of the one or more amplification primers employed, etc.
- the reaction mixture conditions include bringing the reaction mixture to a temperature ranging from 4° C. to 80° C., such as from 16° C. to 75° C., e.g., from 37° C. to 72° C.
- the methods of the present disclosure may include one or more steps in addition to the combining step described above.
- the methods may further include utilizing the amplified one or more copy number stable genes (and any other amplicons that may be present) in a downstream application/assay of interest.
- the amplified nucleic acids may be utilized directly (optionally after a purification step), or may be modified prior to being utilized in a downstream application/assay of interest.
- the methods further include adding a sequencing adapter to the amplified one or more copy number stable genes and any other amplicons that may be present. Such a step may be performed whether or not the amplicons already include one or more sequencing adapters (e.g., by virtue of the one or more amplification primers including one or more sequencing adapters as described above).
- Sequencing adapters that may be added include, e.g., one or more capture domains, one or more sequencing primer binding domains, one or more barcode domains, one or more barcode sequencing primer binding domains, one or more molecular identification domains, a complement of any such domains, or any combination thereof. Further details regarding sequencing adapters are described hereinabove.
- the methods include sequencing the amplified one or more copy number stable genes, and any other amplicons produced during the combining step, e.g., amplicons produced from one or more nucleic acids of interest as described above.
- amplification products may be sequenced directly (optionally after a purification step), or may be modified prior to being sequenced. Modifications prior to sequencing include, but are not limited to, the addition of one or more sequencing adapters as described above, and/or any other useful modifications for sequencing the amplicons on a sequencing platform of interest.
- the sequencing may be carried out on any suitable sequencing platform, including a Sanger sequencing platform, a next generation sequencing (NGS) platform (e.g., using a next generation sequencing protocol), or the like.
- NGS sequencing platforms of interest include, but are not limited to, a sequencing platform provided by Illumina® (e.g., the HiSegTM, MiSegTM and/or Genome AnalyzerTM sequencing systems); Ion TorrentTM (e.g., the Ion PGMTM and/or Ion ProtonTM sequencing systems); Pacific Biosciences (e.g., the PACBIO RS II sequencing system); Life TechnologiesTM (e.g., a SOLiD sequencing system); Roche (e.g., the 454 GS FLX+ and/or GS Junior sequencing systems); or any other sequencing platform of interest.
- Illumina® e.g., the HiSegTM, MiSegTM and/or Genome AnalyzerTM sequencing systems
- Ion TorrentTM e.g., the Ion P
- the methods further include determining the copy number of the one or more nucleic acids of interest present in the nucleic acid sample (e.g., the copy number of a nucleic acid of interest present in a genome from which the nucleic acid sample is derived. Such a determination may be based on, e.g., the number of sequencing reads corresponding to the one or more nucleic acids of interest present in the nucleic acid sample, and the number of sequencing reads corresponding to the one or more copy number stable genes.
- determining the copy number of the one or more nucleic acids of interest present in the nucleic acid sample includes determining a ratio of the number of sequencing reads corresponding to the one or more nucleic acids of interest present in the nucleic acid sample to the number of sequencing reads corresponding to the one or more copy number stable genes. Determining the copy number of the one or more nucleic acids of interest present in the nucleic acid sample may be based on a ratio.
- c the copy number
- r the number of sequencing reads
- NA the nucleic acid of interest
- CNSG the copy number stable gene
- the methods of the present disclosure find use in a variety of applications, including but not limited to, applications in which it is desirable to determine the copy number of one or more genomic regions (e.g., genes, intergenic regions, etc.) present in a source of nucleic acids (e.g., cells or tissues of interest, such as tumor cells or tissues of interest).
- Applications of interest include, e.g., research applications, clinical applications (e.g., clinical diagnostic applications), etc., and the methods may be employed in such applications to assess whether, e.g., a cell or tissue exhibits copy number variation (CNV) in one or more genomic regions of interest.
- the methods also find use in determining the nucleotide sequences of nucleic acids amplified from the nucleic acid sample and/or quantifying the amount of the one or more nucleic acids of interest present in the sample.
- the methods of the present disclosure which involve the amplification of one or more copy number stable genes—provide advantages over existing approaches in a number of respects.
- the methods of the present disclosure are advantageous in the context of nucleic acid sequencing for reasons including, but not limited to, the presence of internal control standards in sequencing libraries generated using the methods of the present disclosure, which internal control standards enable sample normalization and accurate copy number determination.
- the internal control standards are the amplicons produced from the one or more copy number stable genes using the method of the present disclosure.
- the amplicons produced from the one or more copy number stable genes may be used to normalize sequencing reads so that copy number information may be derived from sequencing data, obviating the need for matched samples (e.g., matched tumor/normal samples), pooled normal controls, universal control standards, and/or the like.
- matched samples e.g., matched tumor/normal samples
- pooled normal controls e.g., pooled normal controls
- universal control standards e.g., the number of patient samples that can be analyzed in a given sequencing run.
- the provision of internal copy number controls as provided by embodiments of the present disclosure therefore, increases sample throughput relative to current approaches.
- matched patient normal sample availability may be limited, rendering copy number determination impracticable if a pooled normal sample is not routinely used and/or available.
- use of pooled normal samples to normalize NGS reads may lead to erroneous copy number results if an abnormal copy number is present within the pool. Such issues are precluded according to embodiments of the subject methods.
- the methods provide sequencing read normalization required for NGS sequencing Copy Number Variation (CNV) determination that is built into each sample's sequencing library preparation reaction, obviating the requirement for additional control samples, each control sample requiring a separate library preparation reaction, sequencing run, etc.
- CNV NGS sequencing Copy Number Variation
- compositions of the present disclosure further include compositions.
- the compositions of the present disclosure find a variety of uses, including in some aspects, practicing the methods of the present disclosure.
- composition that includes a nucleic acid sample and one or more amplification primers adapted to amplify a region of one or more copy number stable genes.
- the composition may include any nucleic acid sample of interest and any suitable amplification primer(s), including any of the nucleic acid samples and amplification primers described above in the section relating to the methods of the present disclosure.
- the composition includes one or more amplification primers adapted to amplify one or more nucleic acids of interest present in the nucleic acid sample, in addition to the one or more amplification primers adapted to amplify a region of one or more copy number stable genes.
- the composition may include a pool (or “panel”) of primers for amplification of one or more nucleic acids of interest and one or more copy number stable genes.
- Panels of interest include, but are not limited to, sequencing panels adapted to amplify one or more genomic regions (e.g., genes) of interest in a tissue of interest (e.g., tumor tissue) and one or more copy number stable genes. Downstream sequencing of the copy number stable genes enables, e.g., determination of copy number and any variation thereof in the one or more genomic regions of interest.
- compositions of the present disclosure include, but are not limited to, a polymerase, dNTPs, a buffer component that establishes an appropriate pH, a salt (e.g., e.g., NaCl, KCl, or the like), a metal cofactor (e.g., Mg 2+ , Mn 2+ , or the like), a nuclease inhibitor (e.g., a DNase inhibitor and/or an RNase inhibitor), an additive for facilitating amplification/replication of GC rich sequences, an enzyme-stabilizing component (e.g., DTT), any other reaction mixture components (e.g., useful for facilitating polymerase-mediated extension reactions), a known amount of internal standard nucleic acids corresponding to one or more copy number stable genes, and any combination thereof.
- a salt e.g., e.g., NaCl, KCl, or the like
- a metal cofactor e.g., Mg 2+ , Mn 2+
- composition of the present disclosure includes the amplicons produced by the methods of the present disclosure.
- such compositions include the amplicons in purified form (e.g., substantially or completely separated from the amplification reaction mixture components).
- the amplicons may include a sequencing adapter provided during or after the amplification reaction as described above.
- compositions of the present disclosure may be present in a container.
- suitable containers include, but are not limited to, tubes, vials, plates (e.g., a 96- or other-well plate).
- compositions of the present disclosure may be present in a device.
- Devices of interest include, but are not limited to, an incubator, a thermocycler, a sequencing system (e.g., a Sanger sequencing system or a next generation sequencing system), a microfluidic device, or the like.
- nucleic acid sequencing systems find use in sequencing amplicons generated using the methods of the present disclosure.
- a sequencing system of the present disclosure includes a collection of nucleic acids.
- the collection of nucleic acids includes amplicons corresponding to nucleic acids of interest present in a nucleic acid sample, and amplicons corresponding to one or more copy number stable genes present in the nucleic acid sample.
- the sequencing system includes amplicons generated from any of the one or more amplification primers adapted to amplify a region of one or more copy number stable genes and any of the one or more amplification primers adapted to amplify one or more nucleic acids of interest as described above in the section relating to the methods of the present disclosure.
- the amplicons may include a sequencing adapter provided during the amplification reaction that produced the amplicons (e.g., provided according to embodiments of the subject methods) and/or after the amplification reaction (e.g., provided according to embodiments of the subject methods).
- a sequencing adapter provided during the amplification reaction that produced the amplicons (e.g., provided according to embodiments of the subject methods) and/or after the amplification reaction (e.g., provided according to embodiments of the subject methods).
- the sequencing system may be any sequencing system of interest, including a Sanger sequencing system, a next generation sequencing (NGS) system, or the like.
- the sequencing system is an NGS system.
- NGS systems of interest include, but are not limited to, a sequencing system provided by Illumina® (e.g., the HiSegTM, MiSegTM and/or Genome AnalyzerTM sequencing systems); Ion TorrentTM (e.g., the Ion PGMTM and/or Ion ProtonTM sequencing systems); Pacific Biosciences (e.g., the PACBIO RS II sequencing system); Life TechnologiesTM (e.g., a SOLiD sequencing system); Roche (e.g., the 454 GS FLX+ and/or GS Junior sequencing systems), or any other suitable NGS systems.
- Illumina® e.g., the HiSegTM, MiSegTM and/or Genome AnalyzerTM sequencing systems
- Ion TorrentTM e.g., the Ion PGMTM and/
- the collection of nucleic acids may be present in a component of the sequencing system.
- the collection of nucleic acids may be present in a sample preparation component of the sequencing system, e.g., a component of the sequencing system where nucleic acids of the collection are fragmented and/or sequencing adapters are added to the nucleic acids of the collection.
- the collection of nucleic acids may be present in a solid-phase amplification component of the sequencing system, where solid-phase amplification of the nucleic acids of the collection may occur.
- An example of such a solid-phase amplification component of a sequencing system is the flow cell of Illumina-based sequencing systems, where cluster generation occurs.
- a solid-phase amplification component of a sequencing system is the Ion OneTouchTM 2 component for producing templates suitable for sequencing on an Ion PGMTM system, Ion ProtonTM system, or other NGS system provided by Ion TorrentTM.
- the collection of nucleic acids may be present in any component of a sequencing system useful for utilizing the collection of nucleic acids to obtain the nucleic acid sequences thereof.
- the sequencing system is adapted to determine the copy number of the nucleic acids of interest present in the nucleic acid sample. In certain aspects, the determination is based on the number of sequencing reads corresponding to nucleic acids of interest in the nucleic acid sample, and the number of sequencing reads corresponding to the one or more copy number stable genes. In certain aspects, such a sequencing system is adapted to determine a ratio of the number of sequencing reads corresponding to nucleic acids of interest in the nucleic acid sample to the number of sequencing reads corresponding to the one or more copy number stable genes.
- the system may be further adapted to determine the copy number of the nucleic acids of interest present in the nucleic acid sample based on the ratio of the number of sequencing reads corresponding to nucleic acids of interest in the nucleic acid sample to the number of sequencing reads corresponding to the one or more copy number stable genes.
- the computer-readable medium includes instructions executable by the processor to, e.g., determine the copy number of the nucleic acids of interest present in the nucleic acid sample as described above, determine a ratio of the number of sequencing reads corresponding to nucleic acids of interest in the nucleic acid sample to the number of sequencing reads corresponding to the one or more copy number stable genes, and/or the like.
- Example formulas/algorithms which may be implemented by the sequencing systems of the present disclosure are described above in the section relating to the methods of the present disclosure.
- kits include one or more amplification primers adapted to amplify a region of one or more copy number stable genes present in a nucleic acid sample of interest, and a container (e.g., a tube).
- a container e.g., a tube
- the one or more amplification primers are present in the container.
- the subject kits may include any of the amplification primers adapted to amplify any of the copy number stable genes described above in relation to the methods of the present disclosure.
- kits include one or more amplification primers adapted to amplify one or more nucleic acids of interest present in the nucleic acid sample, in addition to the one or more amplification primers adapted to amplify a region of one or more copy number stable genes.
- a kit of the present disclosure may include a pool (or “panel”) of primers for amplification of one or more nucleic acids of interest and one or more copy number stable genes.
- Panels of interest include, but are not limited to, sequencing panels adapted to amplify one or more genomic regions (e.g., genes) of interest in a tissue of interest (e.g., tumor tissue) and one or more copy number stable genes. Downstream sequencing of the copy number stable genes enables, e.g., determination of copy number and any variation thereof in the one or more genomic regions of interest.
- kits include one or more of a polymerase, dNTPs, a buffer component that establishes an appropriate pH, a salt (e.g., e.g., NaCl, KCl, or the like), a metal cofactor (e.g., Mg 2+ , Mn 2+ , or the like), a nuclease inhibitor (e.g., a DNase inhibitor and/or an RNase inhibitor), an additive for facilitating amplification/replication of GC rich sequences, an enzyme-stabilizing component (e.g., DTT), internal standard nucleic acids corresponding to one or more copy number stable genes, and/or any other reaction mixture components, e.g., useful for facilitating polymerase-mediated extension reactions.
- a salt e.g., e.g., NaCl, KCl, or the like
- a metal cofactor e.g., Mg 2+ , Mn 2+ , or the like
- a nuclease inhibitor
- Components of the subject kits may be present in separate containers, or multiple components may be present in a single container.
- each of the two or more amplification primers may be present in separate containers, subsets of the two or more amplification primers may be present in separate containers, each of the two or more amplification primers may be present in a single container, etc.
- the one or more amplification primers may be provided in any suitable container.
- the amplification primers may be provided in a single tube (e.g., vial), in one or more wells of a plate (e.g., a 96-well plate, a 384-well plate, etc.), or the like.
- kits of the present disclosure may further include instructions for using the components of the kit, e.g., to practice the methods of the present disclosure.
- the kit may include instructions for using the one or more amplification primers adapted to amplify a region of one or more copy number stable genes to determine the copy number of one or more genes of interest present in the nucleic acid sample of interest.
- the instructions may be recorded on a suitable recording medium.
- the instructions may be printed on a substrate, such as paper or plastic, etc.
- the instructions may be present in the kits as a package insert, in the labeling of the container of the kit or components thereof (i.e., associated with the packaging or sub-packaging) etc.
- the instructions are present as an electronic storage data file present on a suitable computer readable storage medium, e.g., portable flash drive, DVD, CD-ROM, diskette, etc.
- the actual instructions are not present in the kit, but means for obtaining the instructions from a remote source, e.g. via the internet, are provided.
- An example of this embodiment is a kit that includes a web address where the instructions can be viewed and/or from which the instructions can be downloaded.
- the means for obtaining the instructions is recorded on a suitable substrate.
- amplification primers adapted to amplify 10 genes identified based upon their proximity to ohnologs were included in a sequencing panel for sequencing library preparation.
- Ohnologs and genes in close proximity to ohnologs have fewer copy number alterations than genes further away from ohnolog-rich regions. See, e.g., Makino et al. (2013) Nature Communications 4:2283.
- These ten additional genes were included in a custom TumorSeq DNA Panel and served as internal controls for CNV detection.
- the ten genes included in the panel for the purposes of internal control standards were: HNRNPR (Entrez Gene ID 10236), TCEB3 (Entrez Gene ID 6924), IL22RA1 (Entrez Gene ID 58985), RCAN3 (Entrez Gene ID 11123), GJB5 (Entrez Gene ID 2707), SLC25A44 (Entrez Gene ID 9673), MT3 (Entrez Gene ID 4504), MT1X (Entrez Gene ID 4501), NUP93 (Entrez Gene ID 9688) and RABL2B (Entrez Gene ID 11158). Sequencing data was collected using a custom NGS panel that included these ten genes. Data for these ten genes is shown in FIG. 1 .
- the ten genes identified to serve as internal controls were identified using both the reference mentioned and bioinformatics tools, using the following method: (1) regions on chromosomes lacking ClinVar and DGV entries for CNV were located using bioinformatics tools; (2) coordinates from supplementary Table 1 of Makino et al. (supra) were updated to reflect the current build of the human genome (hg19), using bioinformatics tools; (3) CNV-free regions identified in step (1) were cross-referenced with the table from step (2); (4) genes closest to ohnologs were identified; and (5) CNV status was confirmed using DGV (Database of Genomic Variation) entry data from Gene Cards. Copy number stable genes in the human genome identified using this approach are graphically illustrated in FIG. 2 .
Abstract
Description
c_NA/c_CNSG=r_NA/r_CNSG (Formula I)
- 1. A method of amplifying nucleic acids, comprising:
- combining:
- a nucleic acid sample; and
- one or more amplification primers adapted to amplify a region of one or more copy number stable genes, in a reaction mixture under conditions sufficient to amplify the one or more copy number stable genes.
- 2. The method according to
Clause 1, wherein the one or more amplification primers are adapted to amplify one or more copy number stable genes located in ohnolog-rich regions of a genome. - 3. The method according to
Clause 2, wherein the ohnolog-rich regions of the genome are regions comprising a proportion of ohnologs greater than 50% in a 2 Mb window. - 4. The method according to
Clause 2 orClause 3, wherein the one or more amplification primers are adapted to amplify one or more ohnologs. - 5. The method according to
Clause 2, wherein the one or more amplification primers are adapted to amplify a region of one or more copy number stable genes selected from the group consisting of: HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, RABL2B, and combinations thereof. - 6. The method according to Clause 5, wherein the one or more amplification primers are adapted to amplify a region of each of the copy number stable genes HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, and RABL2B.
- 7. The method according to any one of
Clauses 1 to 6, wherein the combining comprises combining a known amount of nucleic acids corresponding to one or more copy number stable genes with the nucleic acid sample and the one or more amplification primers adapted to amplify a region of one or more copy number stable genes. - 8. The method according to any one of
Clauses 1 to 7, wherein the nucleic acid sample comprises nucleic acids isolated from one or more cells of a cellular sample of interest. - 9. The method according to Clause 8, wherein the cellular sample of interest is a single cell.
- 10. The method according to Clause 8 or Clause 9 wherein the cellular sample of interest is a tumor sample.
- 11. The method according to any one of
Clauses 1 to 10, wherein the one or more amplification primers comprise a sequencing adapter. - 12. The method according to any one of
Clauses 1 to 11, wherein the one or more amplification primers are non-random primers. - 13. The method according to any one of
Clauses 1 to 12, further comprising adding a sequencing adapter to the amplified one or more copy number stable genes. - 14. The method according to any one of
Clauses 1 to 13, further comprising amplifying one or more nucleic acids of interest present in the nucleic acid sample. - 15. The method according to Clause 14, further comprising sequencing the amplified one or more copy number stable genes and the amplified one or more nucleic acids of interest.
- 16. The method according to Clause 15, wherein the sequencing is by a next generation sequencing protocol.
- 17. The method according to Clause 15 or Clause 16, further comprising determining the copy number of the one or more nucleic acids of interest present in the nucleic acid sample based on:
- the number of sequencing reads corresponding to the one or more nucleic acids of interest present in the nucleic acid sample; and
- the number of sequencing reads corresponding to the one or more copy number stable genes.
- 18. The method according to Clause 17, wherein determining the copy number of the one or more nucleic acids of interest present in the nucleic acid sample is based on a ratio of the number of sequencing reads corresponding to the one or more nucleic acids of interest present in the nucleic acid sample to the number of sequencing reads corresponding to the one or more copy number stable genes.
- 19. A composition, comprising:
- a nucleic acid sample; and
- one or more amplification primers adapted to amplify a region of one or more copy number stable genes.
- 20. The composition according to Clause 19, wherein the one or more amplification primers are adapted to amplify one or more copy number stable genes located in ohnolog-rich regions of a genome.
- 21. The composition according to Clause 20, wherein the ohnolog-rich regions of the genome are regions comprising a proportion of ohnologs greater than 50% in a 2 Mb window.
- 22. The composition according to any one of Clauses 19 to 21, wherein the one or more amplification primers are adapted to amplify one or more ohnologs.
- 23. The composition according to Clause 20, wherein the one or more amplification primers are adapted to amplify a region of one or more copy number stable genes selected from the group consisting of: HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, RABL2B, and combinations thereof.
- 24. The composition according to Clause 23, wherein the one or more amplification primers are adapted to amplify a region of each of the copy number stable genes HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, and RABL2B.
- 25. The composition according to any one of Clauses 19 to 24, further comprising a known amount of internal standard nucleic acids corresponding to one or more copy number stable genes.
- 26. The composition according to any one of Clauses 19 to 24, wherein the nucleic acid sample comprises nucleic acids isolated from one or more cells of a cellular sample of interest.
- 27. The composition according to Clause 26, wherein the cellular sample of interest is a single cell.
- 28. The composition according to Clause 26 or Clause 27, wherein the cellular sample of interest is a tumor sample.
- 29. The composition according to any one of Clauses 19 to 28, wherein the one or more amplification primers comprise a sequencing adapter.
- 30. The composition according to any one of Clauses 19 to 29, wherein the one or more amplification primers are non-random primers.
- 31. A nucleic acid sequencing system, comprising:
- a collection of nucleic acids comprising: amplicons corresponding to nucleic acids of interest present in a nucleic acid sample; and amplicons corresponding to one or more copy number stable genes.
- 32. The sequencing system according to Clause 31, wherein the one or more copy number stable genes are located in ohnolog-rich regions of a genome.
- 33. The sequencing system according to Clause 32, wherein the ohnolog-rich regions of the genome are regions comprising a proportion of ohnologs greater than 50% in a 2 Mb window.
- 34. The sequencing system according to Clause 31, wherein the one or more copy number stable genes are one or more ohnologs.
- 35. The sequencing system according to Clause 31, wherein the one or more copy number stable genes are selected from the group consisting of: HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, RABL2B, and combinations thereof.
- 36. The sequencing system according to Clause 35, wherein the one or more copy number stable genes are each of HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, and RABL2B.
- 37. The sequencing system according to any one of Clauses 31 to 36, wherein the collection of nucleic acids further comprises amplicons corresponding to a known amount of internal standard nucleic acids corresponding to one or more copy number stable genes.
- 38. The sequencing system according to any one of Clauses 31 to 36, wherein the nucleic acid sample comprises nucleic acids isolated from one or more cells of a cellular sample of interest.
- 39. The sequencing system according to Clause 38, wherein the cellular sample of interest is a single cell.
- 40. The sequencing system according to Clause 38 or Clause 39, wherein the cellular sample of interest is a tumor sample.
- 41. The sequencing system according to any one of Clauses 31 to 40, wherein the amplicons were amplified using non-random primers.
- 42. The sequencing system according to any one of Clauses 31 to 41, wherein the sequencing system is adapted to determine the copy number of the nucleic acids of interest present in the nucleic acid sample based on: the number of sequencing reads corresponding to nucleic acids of interest in the nucleic acid sample; and the number of sequencing reads corresponding to the one or more copy number stable genes.
- 43. The sequencing system according to Clause 42, wherein the sequencing system is adapted to determine the copy number of the nucleic acids of interest present in the nucleic acid sample based on a ratio of the number of sequencing reads corresponding to nucleic acids of interest in the nucleic acid sample to the number of sequencing reads corresponding to the one or more copy number stable genes.
- 44. The sequencing system according to any one of Clauses 31 to 43, wherein the sequencing system is a next generation sequencing system.
- 45. A kit comprising:
- one or more amplification primers adapted to amplify a region of one or more copy number stable genes present in a nucleic acid sample of interest; and
- a container.
- 46. The kit according to Clause 45, wherein the one or more amplification primers are adapted to amplify one or more copy number stable genes located in ohnolog-rich regions of a genome.
- 47. The kit according to Clause 46, wherein the ohnolog-rich regions of the genome are regions comprising a proportion of ohnologs greater than 50% in a 2 Mb window.
- 48. The kit according to any one of Clauses 45 to 47, wherein the one or more amplification primers are adapted to amplify one or more ohnologs.
- 49. The kit according to Clause 46, wherein the one or more amplification primers are adapted to amplify a region of one or more copy number stable genes selected from the group consisting of: HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, RABL2B, and combinations thereof.
- 50. The kit according to Clause 49, wherein the one or more amplification primers are adapted to amplify a region of each of the copy number stable genes HNRNPR, TCEB3, IL22RA1, RCAN3, GJB5, SLC25A44, MT3, MT1X, NUP93, and RABL2B.
- 51. The kit according to any one of Clauses 45 to 50, further comprising internal standard nucleic acids corresponding to one or more copy number stable genes.
- 52. The kit according to any one of Clauses 45 to 50, wherein the nucleic acid sample comprises nucleic acids isolated from one or more cells of a cellular sample of interest.
- 53. The kit according to Clause 52, wherein the cellular sample of interest is a single cell.
- 54. The kit according to Clause 52 or Clause 53, wherein the cellular sample of interest is a tumor sample.
- 55. The kit according to any one of Clauses 45 to 54, wherein the one or more amplification primers comprise a sequencing adapter.
- 56. The kit according to any one of Clauses 45 to 55, wherein the one or more amplification primers are non-random primers.
- 57. The kit according to any one of Clauses 45 to 56, further comprising instructions for using the one or more amplification primers adapted to amplify a region of one or more copy number stable genes to determine the copy number of one or more genes of interest present in the nucleic acid sample of interest.
Claims (20)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/563,939 US10767220B2 (en) | 2015-05-21 | 2016-05-17 | Methods of amplifying nucleic acids and compositions for practicing the same |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201562164871P | 2015-05-21 | 2015-05-21 | |
US15/563,939 US10767220B2 (en) | 2015-05-21 | 2016-05-17 | Methods of amplifying nucleic acids and compositions for practicing the same |
PCT/US2016/032919 WO2016187224A1 (en) | 2015-05-21 | 2016-05-17 | Methods of amplifying nucleic acids and compositions for practicing the same |
Publications (2)
Publication Number | Publication Date |
---|---|
US20180127817A1 US20180127817A1 (en) | 2018-05-10 |
US10767220B2 true US10767220B2 (en) | 2020-09-08 |
Family
ID=57320505
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/563,939 Active 2036-11-06 US10767220B2 (en) | 2015-05-21 | 2016-05-17 | Methods of amplifying nucleic acids and compositions for practicing the same |
Country Status (2)
Country | Link |
---|---|
US (1) | US10767220B2 (en) |
WO (1) | WO2016187224A1 (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150011396A1 (en) | 2012-07-09 | 2015-01-08 | Benjamin G. Schroeder | Methods for creating directional bisulfite-converted nucleic acid libraries for next generation sequencing |
CA2916660C (en) | 2013-06-25 | 2022-05-17 | Prognosys Biosciences, Inc. | Spatially encoded biological assays using a microfluidic device |
US10774374B2 (en) | 2015-04-10 | 2020-09-15 | Spatial Transcriptomics AB and Illumina, Inc. | Spatially distinguished, multiplex nucleic acid analysis of biological specimens |
EP3894591A2 (en) | 2018-12-10 | 2021-10-20 | 10X Genomics, Inc. | Imaging system hardware |
US20230242976A1 (en) * | 2018-12-10 | 2023-08-03 | 10X Genomics, Inc. | Imaging system hardware |
US11926867B2 (en) | 2019-01-06 | 2024-03-12 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
US11649485B2 (en) | 2019-01-06 | 2023-05-16 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
US11732299B2 (en) | 2020-01-21 | 2023-08-22 | 10X Genomics, Inc. | Spatial assays with perturbed cells |
US11898205B2 (en) | 2020-02-03 | 2024-02-13 | 10X Genomics, Inc. | Increasing capture efficiency of spatial assays |
US11891654B2 (en) | 2020-02-24 | 2024-02-06 | 10X Genomics, Inc. | Methods of making gene expression libraries |
CN115916999A (en) | 2020-04-22 | 2023-04-04 | 10X基因组学有限公司 | Methods for spatial analysis using targeted RNA depletion |
EP4153775A1 (en) | 2020-05-22 | 2023-03-29 | 10X Genomics, Inc. | Simultaneous spatio-temporal measurement of gene expression and cellular activity |
US11761038B1 (en) | 2020-07-06 | 2023-09-19 | 10X Genomics, Inc. | Methods for identifying a location of an RNA in a biological sample |
US11926822B1 (en) | 2020-09-23 | 2024-03-12 | 10X Genomics, Inc. | Three-dimensional spatial analysis |
US11827935B1 (en) | 2020-11-19 | 2023-11-28 | 10X Genomics, Inc. | Methods for spatial analysis using rolling circle amplification and detection probes |
EP4121555A1 (en) | 2020-12-21 | 2023-01-25 | 10X Genomics, Inc. | Methods, compositions, and systems for capturing probes and/or barcodes |
EP4196605A1 (en) | 2021-09-01 | 2023-06-21 | 10X Genomics, Inc. | Methods, compositions, and kits for blocking a capture probe on a spatial array |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120039887A1 (en) * | 2009-02-19 | 2012-02-16 | Cornell University | Compositions and methods for diagnosing prostate cancer based on detection of slc45a3-elk4 fusion transcript |
WO2012058689A2 (en) * | 2010-10-29 | 2012-05-03 | Pbd Biodiagnostics, Llc | Methods, kits and arrays for screening for, predicting and identifying donors for hematopoietic cell transplantation, and predicting risk of hematopoietic cell transplant (hct) to induce graft vs. host disease (gvhd) |
US20130338933A1 (en) | 2011-10-06 | 2013-12-19 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
US20140206543A1 (en) * | 2012-01-27 | 2014-07-24 | Peter Keith Rogan | Stable gene targets in breast cancer and use thereof for optimizing therapy |
US20140256571A1 (en) | 2013-03-06 | 2014-09-11 | Life Technologies Corporation | Systems and Methods for Determining Copy Number Variation |
WO2015157571A1 (en) * | 2014-04-09 | 2015-10-15 | Lineagen, Inc. | Genetic markers associated with chromosomal deletion and duplication syndromes |
-
2016
- 2016-05-17 US US15/563,939 patent/US10767220B2/en active Active
- 2016-05-17 WO PCT/US2016/032919 patent/WO2016187224A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120039887A1 (en) * | 2009-02-19 | 2012-02-16 | Cornell University | Compositions and methods for diagnosing prostate cancer based on detection of slc45a3-elk4 fusion transcript |
WO2012058689A2 (en) * | 2010-10-29 | 2012-05-03 | Pbd Biodiagnostics, Llc | Methods, kits and arrays for screening for, predicting and identifying donors for hematopoietic cell transplantation, and predicting risk of hematopoietic cell transplant (hct) to induce graft vs. host disease (gvhd) |
US20130338933A1 (en) | 2011-10-06 | 2013-12-19 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
US20140206543A1 (en) * | 2012-01-27 | 2014-07-24 | Peter Keith Rogan | Stable gene targets in breast cancer and use thereof for optimizing therapy |
US20140256571A1 (en) | 2013-03-06 | 2014-09-11 | Life Technologies Corporation | Systems and Methods for Determining Copy Number Variation |
WO2015157571A1 (en) * | 2014-04-09 | 2015-10-15 | Lineagen, Inc. | Genetic markers associated with chromosomal deletion and duplication syndromes |
Non-Patent Citations (5)
Title |
---|
Makino, et al., "Genome-wide deserts for copy number variation in vertebrates", Nature Communications, 4:2283, pp. 1-10, 2013. |
McLysaght et al. (PNAS, 2014, vol. 111, No. 1, p. 361-366, IDS reference) (Year: 2014). * |
McLysaght, et al., "Ohnologs are overrepresented in pathogenic copy number mutations", PNAS Jan. 7, 2014. 111 (1) 361-366. |
Molyneux et al. (Nature Genetics, 2014, 46(9):964-972) (Year: 2014). * |
Molyneux et al. (Nature Genetics, 2014, 46(9):964-973, IDS reference) (Year: 2014). * |
Also Published As
Publication number | Publication date |
---|---|
WO2016187224A1 (en) | 2016-11-24 |
US20180127817A1 (en) | 2018-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10767220B2 (en) | Methods of amplifying nucleic acids and compositions for practicing the same | |
US10711269B2 (en) | Method for making an asymmetrically-tagged sequencing library | |
EP3177740B1 (en) | Digital measurements from targeted sequencing | |
US20200048697A1 (en) | Compositions and methods for detection of genomic variance and DNA methylation status | |
WO2020056381A9 (en) | PROGRAMMABLE RNA-TEMPLATED SEQUENCING BY LIGATION (rSBL) | |
US11725230B2 (en) | Selective degradation of wild-type DNA and enrichment of mutant alleles using nuclease | |
US20120214160A1 (en) | Methods, compositions, and kits for detecting rare cells | |
US20180044731A1 (en) | Methods of Preparing Dual-Indexed DNA Libraries for Bisulfite Conversion Sequencing | |
US11319576B2 (en) | Methods of producing nucleic acid libraries and compositions and kits for practicing same | |
CA2931140C (en) | Error-free sequencing of dna | |
US20230242907A1 (en) | Methods of Producing Nucleic Acid Libraries and Compositions and Kits for Practicing Same | |
US20200277651A1 (en) | Nucleic Acid Preparation and Analysis | |
CN107406882B (en) | Multiplexing method for identification and quantification of minor alleles and polymorphisms | |
US20180051330A1 (en) | Methods of amplifying nucleic acids and compositions and kits for practicing the same | |
CN111349691B (en) | Composition, kit and detection method for EGFR gene deletion mutation detection | |
CN114787385A (en) | Methods and systems for detecting nucleic acid modifications | |
Morlighem et al. | DNA amplification techniques in pharmacogenomics | |
Walsh et al. | Molecular pathology in lung cancer: a guide to the techniques used in clinical practice | |
Jenner et al. | Development of a gLCR-based KRAS mutation detection approach and its comparison with other screening methods | |
JP2023103372A (en) | Improved nucleic acid target enrichment and related methods | |
CA3107052A1 (en) | Methods and composition for targeted genomic analysis | |
EP3837383A1 (en) | Reagents, mixtures, kits and methods for amplification of nucleic acids | |
Rooney | A genomic approach to the study of chemoresistance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
AS | Assignment |
Owner name: BECTON, DICKINSON AND COMPANY, NEW JERSEY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BORCHERT, KRISTEN MARY;TONG, FRANCES POYEN;BROWN, CHARLOTTE ANN;AND OTHERS;SIGNING DATES FROM 20160527 TO 20160706;REEL/FRAME:045746/0781 |
|
AS | Assignment |
Owner name: BECTON, DICKINSON AND COMPANY, NEW JERSEY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BORCHERT, KRISTEN MARY;TONG, FRANCES POYEN;BROWN, CHARLOTTE ANN;AND OTHERS;SIGNING DATES FROM 20160527 TO 20160706;REEL/FRAME:047343/0615 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: AWAITING TC RESP, ISSUE FEE PAYMENT VERIFIED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |