US20210189382A1 - Barcoding Nucleic Acids - Google Patents
- Publication number
- US20210189382A1 US17/184,048
- Authority
- US
- United States
- Prior art keywords
- loop
- stem
- barcode
- nucleic acid
- adaptors
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1065—Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6853—Nucleic acid amplification reactions using modified primers or templates
- C12Q1/6855—Ligating adaptors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2525/00—Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
- C12Q2525/10—Modifications characterised by
- C12Q2525/155—Modifications characterised by incorporating/generating a new priming site
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2525/00—Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
- C12Q2525/10—Modifications characterised by
- C12Q2525/191—Modifications characterised by incorporating an adaptor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2525/00—Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
- C12Q2525/30—Oligonucleotides characterised by their secondary structure
- C12Q2525/301—Hairpin oligonucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2563/00—Nucleic acid detection characterized by the use of physical, structural and functional properties
- C12Q2563/179—Nucleic acid detection characterized by the use of physical, structural and functional properties the label being a nucleic acid
Definitions
- the present invention relates generally to the fields of molecular biology and nucleic acid sequencing. More particularly, it concerns methods of barcoding nucleic acids.
- Barcodes can be used to identify nucleic acid molecules, for example, where sequencing can reveal a certain barcode coupled to a nucleic acid molecule of interest.
- a sequence-specific event can be used to identify a nucleic acid molecule, where at least a portion of the barcode is recognized in the sequence-specific event, e.g., at least a portion of the barcode can participate in a ligation or extension reaction.
- the barcode can therefore allow identification, selection or amplification of gDNA molecules that are coupled thereto.
- One method to couple a barcode to nucleic acid molecules of interest includes preparation of an Ion gDNA fragment library, as described by Life Technologies for the Ion Torrent System.
- fragments of gDNA are ligated to adaptors, where at least one end of each fragment of genomic DNA is ligated to an adaptor including a barcode.
- the ligated adaptors and gDNA fragments may be nick repaired, size selected, and amplified using PCR with primers directed to the adaptors to produce an amplified library.
- ligation adaptors including 16 different barcodes can be used to prepare 16 different gDNA samples, each with a unique barcode, such that either each sample can be amplified separately by PCR using the same PCR primers and then pooled (mixed together) or each sample can be pooled first and then simultaneously amplified using the same PCR primers.
- each gDNA sample can be identified by its attached unique barcode.
- the number of different ligation adaptors needed is equal to the number of barcodes. For example, the production of 256 sample libraries capable of being sequenced as a mixture would require 256 different ligation adaptors.
- Embodiments of the present invention provide methods of making dual barcoded nucleic acid molecules for sequencing. Having a first and a second barcode on the same end of a nucleic acid molecule may permit a sequencing read to begin with the second barcode, continue through the first barcode, and then proceed into the nucleic acid molecule. The second barcode, the first barcode, and the sequence of the nucleic acid molecule may therefore be obtained in a single read. Traditional methods of using dual sequencing barcodes, by contrast, require a sequencing read from each end of the nucleic acid molecule in order to read a single barcode back from the distal end.
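The single-read layout described above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that both barcodes have a fixed length of 6 bases and sit directly next to each other with no intervening adaptor sequence; the function name, the barcode lengths, and the example sequences are hypothetical and are not taken from the specification.

```python
from typing import NamedTuple


class ParsedRead(NamedTuple):
    second_barcode: str  # distal barcode, read first
    first_barcode: str   # proximal barcode, read second
    insert: str          # sequence of the nucleic acid molecule itself


def split_tandem_read(read: str, second_len: int = 6, first_len: int = 6) -> ParsedRead:
    """Split a read that starts at the distal (second) barcode.

    Assumes fixed-length barcodes that are directly adjacent (illustrative only).
    """
    if len(read) < second_len + first_len:
        raise ValueError("read is shorter than the two barcodes combined")
    second = read[:second_len]
    first = read[second_len:second_len + first_len]
    insert = read[second_len + first_len:]
    return ParsedRead(second, first, insert)


# Hypothetical read: second barcode, then first barcode, then insert.
example = split_tandem_read("AACCGG" + "TTGGCA" + "ACGTACGTAC")
print(example)
```

In a real library, adaptor or primer sequence would typically separate the two barcodes, so the slice offsets would need to account for it.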
- one embodiment of the present invention relates to a method of making a dual barcoded nucleic acid molecule comprising: coupling one strand of a stem-loop oligonucleotide to a nucleic acid molecule to form a first barcode coupled nucleic acid molecule, the stem-loop oligonucleotide including an intramolecular inverted repeat and a loop, the inverted repeat including a first barcode; displacing one strand of the stem-loop oligonucleotide from the first barcode coupled nucleic acid molecule by strand displacement or by nick translation polymerization to form a first barcoded nucleic acid molecule; annealing a primer to the first barcoded nucleic acid molecule, the primer including a first portion complementary to the first barcoded nucleic acid molecule and a second portion including a second barcode; and extending the annealed primer to form a dual barcoded nucleic acid molecule, the dual barcoded
- the first portion of the primer anneals to the first barcode or to a portion of the first barcode. In other aspects, the first portion of the primer does not anneal within the first barcode.
- the extending may be performed by using a polymerase, e.g., via polymerase chain reaction or PCR.
- the nucleic acid molecule may be genomic DNA, cDNA, amplified DNA, a nucleic acid library, or a fragment thereof.
- a method of preparing a nucleic acid molecule comprising: providing a double stranded nucleic acid molecule; and attaching one strand of a stem-loop oligonucleotide comprising an inverted repeat and a loop to the double stranded nucleic acid molecule to produce an oligonucleotide-attached nucleic acid molecule.
- the double stranded nucleic acid molecule may be a double stranded DNA molecule, in some embodiments.
- the attaching is further defined as attaching the oligonucleotide to the double stranded nucleic acid molecule under conditions to produce a non-covalent junction, such as a nick, a gap, or a 5′ flap structure, in the oligonucleotide-attached nucleic acid molecule.
- the attaching is further defined as ligating. Ligating may be defined as ligating the 3′ end of the stem-loop oligonucleotide adaptor to the 5′ end of the target nucleic acid molecule.
- the method may further comprise displacing one strand of the oligonucleotide from the oligonucleotide-attached nucleic acid molecule by strand displacement or by nick translation polymerization.
- at least part of the oligonucleotide-attached nucleic acid molecule is amplified, such as by polymerase chain reaction, RNA transcription, or strand displacement, for example.
- Methods of the invention may further comprise amplifying an oligonucleotide-attached nucleic acid molecule, wherein at least part of the stem-loop adaptor's intramolecular inverted repeat is excluded from the amplified oligonucleotide-attached nucleic acid molecule.
- Ligating embodiments may be further defined as comprising: generating ligatable ends on the double stranded nucleic acid molecule; generating a ligatable end on the stem-loop oligonucleotide; and ligating one strand of the ligatable end of the stem-loop oligonucleotide to one strand of an end of the nucleic acid molecule, thereby generating a non-covalent junction, such as a nick, a gap, or a 5′ flap structure, in the oligonucleotide-attached nucleic acid molecule.
- the methods comprise generating blunt ends on the nucleic acid molecule; generating a blunt end on the stem-loop oligonucleotide; and ligating one strand of the blunt end of the stem-loop oligonucleotide to one strand of a blunt end of the nucleic acid molecule, thereby generating a nick in the oligonucleotide-ligated nucleic acid molecule.
- the method may comprise coupling one strand of a stem-loop oligonucleotide adaptor to each end of the target nucleic acid molecule.
- the inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may comprise an identical sequence.
- coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule comprising terminal inverted repeats thereby allowing the molecule to form a stem loop.
- the inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may not comprise an identical sequence. In this aspect, coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule lacking terminal inverted repeats and therefore the molecule will not be able to form a stem loop.
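The difference between the two cases above can be checked computationally. The following Python sketch models one strand of an adaptor-flanked molecule and tests whether its 3′ end is the reverse complement of its 5′ end, which is the condition for terminal inverted repeats. The adaptor and insert sequences, the helper names, and the 8-base stem length are hypothetical choices made only for illustration.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_terminal_inverted_repeat(strand: str, stem_len: int) -> bool:
    """True if the last stem_len bases are the reverse complement of the first stem_len bases."""
    if stem_len <= 0 or len(strand) < 2 * stem_len:
        return False
    return strand[-stem_len:] == reverse_complement(strand[:stem_len])


# Hypothetical sequences, not taken from the specification.
insert = "ACGTTGCAAC"
adaptor_a = "GATCCGTA"
adaptor_b = "CTTGAGCA"

# Identical adaptor sequence at both ends: the single strand carries terminal inverted repeats.
same_ends = adaptor_a + insert + reverse_complement(adaptor_a)
# Different adaptor sequences at the two ends: no terminal inverted repeats.
different_ends = adaptor_a + insert + reverse_complement(adaptor_b)

print(has_terminal_inverted_repeat(same_ends, 8))
print(has_terminal_inverted_repeat(different_ends, 8))
```

The check is purely sequence-based; whether a stem loop actually forms also depends on stem length, temperature, and buffer conditions, which this sketch does not model.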
- the oligonucleotide-attached nucleic acid molecule comprises a nick having a 3′ hydroxy group, wherein there is polymerization from the 3′ hydroxy group of at least part of the oligonucleotide-attached nucleic acid molecule.
- Strand displacement or nick translation polymerization may be further defined as polymerization that ceases at a non-replicable base or region in the loop or in a region of the stem adjacent to the loop.
- the method further comprises the step of digesting the double stranded DNA molecule with an endonuclease to generate DNA fragments, wherein the oligonucleotide becomes ligated to one strand of the DNA fragment and wherein polymerization of an oligonucleotide-ligated DNA fragment excludes at least part of the stem-loop adaptor's intramolecular inverted repeat by subjecting the oligonucleotide-ligated DNA fragment to strand displacement or nick translation polymerization that halts at a base or sequence in the loop or in a region of the stem adjacent to the loop.
- the stem-loop oligonucleotide is further defined as comprising a cleavable base.
- the cleavable base is present in the loop of the oligonucleotide or in a sequence of the stem adjacent to the loop.
- the cleavable base or sequence may comprise an abasic site or sequence, hexaethylene glycol, and/or a bulky chemical moiety attached to the sugar-phosphate backbone or the base.
- the abasic site or sequence is introduced by one or more enzymes in the single solution.
- the loop of the stem-loop oligonucleotide comprises at least one deoxy-uridine.
- a 5′ end of the stem-loop oligonucleotide lacks a phosphate.
- Barcodes can be generated based on selecting a particular nucleic acid sequence.
- Illumina™ sequencing can utilize 6 bases to effectively generate 48 different barcodes.
- the Ion Torrent sequencer (e.g., the Ion Proton™ Sequencer or the Ion PGM™ sequencer) can utilize 6 bases to generate 16 barcodes.
- rules may be applied to the generation of barcodes that allow for separate barcodes to be correctly identified even if two errors occur during sequencing. Barcoding is described, e.g., in U.S. Pat. No. 7,902,122 and U.S. Pat. Publn. 2009/0098555.
- Barcode incorporation by primer extension may be performed using methods described in U.S. Pat. No. 5,935,793 or US 2010/0227329.
- a barcode may be incorporated into a nucleic acid via using ligation, which can then be followed by amplification; for example, methods described in U.S. Pat. Nos. 5,858,656, 6,261,782, U.S. Pat. Publn. 2011/0319290, or U.S. Pat. Publn. 2012/0028814 may be used with the present invention.
- one or more barcodes may be used, e.g., as described in U.S. Pat. Publn. 2007/0020640, U.S. Pat. Publn. 2009/0068645, U.S. Pat. Publn. 2010/0273219, U.S. Pat. Publn. 2011/0015096, or U.S. Pat. Publn. 2011/0257031.
- a second barcode may be incorporated into a fragment of a nucleic acid library, wherein the nucleic acid library was generated with an approach compatible with Illumina sequencing, such as a Nextera™ DNA sample prep kit; additional approaches to Illumina next-generation sequencing library preparation are described, e.g., in Oyola et al. (2012).
- a nucleic acid library is generated with a method compatible with a SOLiD™ or Ion Torrent sequencing method (e.g., a SOLiD® Fragment Library Construction Kit, a SOLiD® Mate-Paired Library Construction Kit, a SOLiD® ChIP-Seq Kit, a SOLiD® Total RNA-Seq Kit, a SOLiD® SAGE™ Kit, an Ambion® RNA-Seq Library Construction Kit, etc.). Additional next-generation sequencing methods, including various methods for library construction that may be used with embodiments of the present invention, are described, e.g., in Pareek (2011) and Thudi (2012).
- a kit housed in a suitable container comprises one or more compositions of the invention and/or one or more compositions suitable for at least one method of the invention.
- Additional embodiments of the invention include a library of DNA molecules prepared by the methods of the invention.
- FIG. 1 Schematic comparing use of a single set of barcodes versus the use of first and second sets of barcodes.
- FIG. 2 Illustration of the use of stem-loop adaptors to add the first barcode, further followed by PCR amplification to add the second barcode.
- FIG. 3 Illustration of adaptors that contain tandem dual barcodes but do not generate terminal inverted repeats.
- FIG. 4 Results of real-time PCR Library Amplification using tandem dual barcodes without inverted repeats.
- FIG. 5 Illustration of adaptors containing tandem dual barcodes that generate terminal inverted repeats.
- FIG. 6 Results of real-time PCR Library Amplification using tandem dual barcodes with inverted repeats.
- the present technology relates to barcoding of nucleic acid molecules.
- Barcodes, also described as tags, indexing sequences, or identifier codes, include specific sequences that are incorporated into a nucleic acid molecule for identification purposes.
- synthetic nucleic acid molecules can be joined with genomic DNA (gDNA) by ligation and/or primer extension.
- the present technology is directed towards nucleic acid molecules having multiple barcodes, in particular, sequential or tandem barcodes.
- An example of a tandem barcode includes a first barcode coupled to at least one end of a gDNA molecule by a ligation event (e.g., ligation to a synthetic stem-loop adaptor) followed by a second barcode that is coupled to the gDNA by primer extension (e.g., PCR), where the first barcode is proximal to the gDNA molecule (closer to the insert) and the second barcode is distal to the gDNA (further from the insert).
- fragments of genomic DNA can be ligated to adaptors having a first set of barcodes, for example, using the stem-loop adaptors and methods as described in U.S. Pat. No. 7,803,550.
- Adaptors having 16 different barcodes can be generated and may be used, e.g., with an Ion Torrent sequencing system (e.g., the Ion Proton™ Sequencer or the Ion PGM™ sequencer).
- the ligated adaptors and gDNA fragments having the first set of barcodes can then be subjected to a primer extension reaction or PCR using a primer having a second set of barcodes.
- the resulting nucleic acid molecules each have one barcode from the first set of barcodes adjacent to one barcode from the second set of barcodes on at least one end of the nucleic acid molecule.
- the exact number of barcodes may be determined based on the particular application; for example, in some embodiments, the second barcode may utilize six bases to generate, e.g., 16 additional barcodes. Nonetheless, depending on the application and/or sequencing method, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 or more bases may be utilized to generate the second barcode. In some embodiments, at least 2, at least 3, or 3-16 bases can be used to generate a second barcode.
- the second set of barcodes can include 16 different primers.
- the ligation adaptors having sixteen different barcodes can be amplified with 16 different primers (directed to the second set of barcodes) to produce 16 × 16 unique combinations of barcodes, which allows 256 samples to be pooled and multiplexed 256-fold.
- Use of only 16+16 oligonucleotides to achieve this level of multiplexing is a significant savings of cost and time in producing 256 useful libraries.
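- The oligonucleotide savings can be illustrated with a minimal sketch that pairs every member of a first barcode set with every member of a second set. The labels `A01`..`A16` and `P01`..`P16` are placeholders rather than real barcode sequences, and the function name is hypothetical.

```python
from itertools import product

def tandem_combinations(first_set, second_set):
    """Return every (first barcode, second barcode) pairing."""
    return list(product(first_set, second_set))

# Placeholder labels: 16 ligation adaptors and 16 barcoded primers.
first_set = [f"A{i:02d}" for i in range(1, 17)]
second_set = [f"P{i:02d}" for i in range(1, 17)]

pairs = tandem_combinations(first_set, second_set)
oligos_tandem = len(first_set) + len(second_set)  # oligos to synthesize with tandem barcodes
oligos_single = len(pairs)                        # adaptors needed if each sample had its own adaptor

print(len(pairs), oligos_tandem, oligos_single)  # 256 32 256
```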
- the first barcode and the second barcode may be on the same side of the gDNA (i.e., as part of the same adaptor) and can be sequenced in series with each other and with the gDNA in order to economize on sequencing time and cost.
- the first barcode may be directly attached to the gDNA, whereas the second barcode may be attached during amplification by PCR.
- the first set of barcodes can be used to either: a) tag all members of a gDNA sample with the same barcode or b) tag different members of the gDNA sample with different barcodes. For example, if ligation is performed using a single barcode, then all members of the gDNA will generally carry the same barcode. However, if the ligation adaptor is synthesized with a random or partially-random barcode, different molecules in the gDNA sample will have different barcodes.
- the barcode region contains 16 random bases
- 65,536 barcodes may be represented within the ligated nucleic acid library.
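- As a point of reference, a fully random barcode of n bases drawn from A, C, G and T has 4^n possible sequences. The short sketch below tabulates this count for a few lengths; under this assumption of four equally allowed bases per position, 65,536 equals 4^8 (equivalently 2^16), while 16 fully random bases give 4^16. The function name is hypothetical.

```python
def random_barcode_space(n_bases, alphabet_size=4):
    """Number of distinct sequences of n_bases over an alphabet of the given size."""
    return alphabet_size ** n_bases

for n in (6, 8, 16):
    print(n, random_barcode_space(n))
# 6 4096
# 8 65536
# 16 4294967296
```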
- the barcode may be used to distinguish among different members of the input gDNA library; for example, these methods may be used for independent counting of molecular duplicates in the gDNA library (most of which will have different barcodes) and duplicates created during PCR amplification (most of which will carry identical first barcodes).
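- A minimal sketch of this counting idea follows. It assumes each read has already been reduced to a pair of (insert start position, random first barcode), that reads sharing both values are PCR copies of one input molecule, and that reads sharing a position but carrying different barcodes are independent input molecules. Barcode collisions and sequencing errors are ignored, and the data are invented for illustration.

```python
from collections import Counter

def count_molecules(reads):
    """reads: iterable of (insert_start, first_barcode) tuples, one per read.

    Returns (distinct input molecules, reads that are PCR duplicates).
    """
    families = Counter(reads)
    unique_molecules = len(families)
    pcr_duplicates = sum(n - 1 for n in families.values())
    return unique_molecules, pcr_duplicates

# Illustrative reads only.
reads = [
    (1000, "ACGTACGT"),
    (1000, "ACGTACGT"),  # PCR copy of the read above
    (1000, "TTGACCAG"),  # same position, different barcode: separate molecule
    (2500, "GGATCCTA"),
    (2500, "GGATCCTA"),
    (2500, "GGATCCTA"),
]
print(count_molecules(reads))  # (3, 3)
```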
- Partially-random first barcodes can be used to give information about individual samples and about individual molecules.
- Amplification refers to any in vitro process for increasing the number of copies of a nucleotide sequence or sequences. Nucleic acid amplification results in the incorporation of nucleotides into DNA or RNA. As used herein, one amplification reaction may consist of many rounds of DNA replication. For example, one PCR reaction may consist of 30-100 “cycles” of denaturation and replication.
- Nucleotide is a term of art that refers to a base-sugar-phosphate combination. Nucleotides are the monomeric units of nucleic acid polymers, i.e., of DNA and RNA. The term includes ribonucleotide triphosphates, such as rATP, rCTP, rGTP, or rUTP, and deoxyribonucleotide triphosphates, such as dATP, dCTP, dUTP, dGTP, or dTTP.
- nucleoside is a base-sugar combination, i.e., a nucleotide lacking a phosphate. It is recognized in the art that there is a certain inter-changeability in usage of the terms nucleoside and nucleotide.
- the nucleotide deoxyuridine triphosphate, dUTP, is a deoxyribonucleoside triphosphate. After incorporation into DNA, it serves as a DNA monomer, formally being deoxyuridylate, i.e., dUMP or deoxyuridine monophosphate. One may say that one incorporates dUTP into DNA even though there is no dUTP moiety in the resultant DNA. Similarly, one may say that one incorporates deoxyuridine into DNA even though that is only a part of the substrate molecule.
- Oligonucleotide refers collectively and interchangeably to two terms of art, “oligonucleotide” and “polynucleotide.” Note that although oligonucleotide and polynucleotide are distinct terms of art, there is no exact dividing line between them and they are used interchangeably herein.
- the term “adaptor” may also be used interchangeably with the terms “oligonucleotide” and “polynucleotide.”
- Primer refers to a single-stranded oligonucleotide or a single-stranded polynucleotide that is extended by covalent addition of nucleotide monomers during amplification. Often, nucleic acid amplification is based on nucleic acid synthesis by a nucleic acid polymerase. Many such polymerases require the presence of a primer that can be extended to initiate nucleic acid synthesis.
- hairpin and “stem-loop oligonucleotide” as used herein refer to a structure formed by an oligonucleotide comprised of 5′ and 3′ terminal regions, which are intramolecular inverted repeats that form a double-stranded stem, and a non-self-complementary central region, which forms a single-stranded loop.
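- The definition above can be sketched as a simple check for terminal inverted repeats: the stem length is the number of bases at the 5′ end that are complementary, read inward, to the bases at the 3′ end. The function name, the minimum loop size, and the example sequence are assumptions made for illustration; only perfect Watson-Crick pairing is considered.

```python
PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}

def stem_length(oligo, min_loop=3):
    """Length of the perfect terminal inverted repeat of an oligo (5'->3' string).

    At least min_loop central bases are reserved for the single-stranded loop.
    """
    max_stem = max(0, (len(oligo) - min_loop) // 2)
    k = 0
    # Walk inward from both ends while the 5' base pairs with the 3' base.
    while k < max_stem and PAIRS.get(oligo[k]) == oligo[-1 - k]:
        k += 1
    return k

# Illustrative hairpin: 8-base stem, TTTT loop, reverse-complement stem.
hairpin = "GCATCGTA" + "TTTT" + "TACGATGC"
print(stem_length(hairpin))     # 8
print(stem_length("AAAAAAAA"))  # 0
```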
- Embodiments of the present invention may provide one or more benefits or advantages as follows. Having a first and a second barcode on the same end of a nucleic acid molecule may permit a sequencing read to begin with the second barcode, continue through the first barcode and then into the nucleic acid molecule. Identification of the second barcode, first barcode, and sequence of the nucleic acid molecule may therefore be obtained in a single read as opposed to having to provide a sequencing read from each end of the nucleic acid molecule in order to read the sequence of a single barcode back from a distal end of the nucleic acid molecule, as is the case in traditional methods of using dual sequencing barcodes.
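- The single-read layout described above can be sketched as a parser that walks one read from the second barcode, through the first barcode, into the insert. The fixed barcode lengths, the constant linker sequence between the two barcodes, and all sequences shown are hypothetical; a real read structure depends on the adaptor and primer design.

```python
def parse_tandem_read(read, second_len, linker, first_len):
    """Split a read into (second barcode, first barcode, insert).

    Assumed layout, 5'->3': second barcode, constant linker, first barcode, insert.
    Returns None if the linker is not found where expected or the read is too short.
    """
    second = read[:second_len]
    rest = read[second_len:]
    if len(second) < second_len or not rest.startswith(linker):
        return None
    rest = rest[len(linker):]
    if len(rest) < first_len:
        return None
    return second, rest[:first_len], rest[first_len:]

# Illustrative read assembled from made-up parts.
read = "TTAGGC" + "GATCGG" + "ACCTGA" + "GGCTTAACCGTA"
print(parse_tandem_read(read, 6, "GATCGG", 6))
# ('TTAGGC', 'ACCTGA', 'GGCTTAACCGTA')
```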
- first and second barcodes may be preferably added in two separate steps. This order of addition may significantly increase the possible combinations of encoded sample identity information.
- amplification of individual samples with individual PCR primers may reduce the chance of cross-contamination between barcodes due to priming and amplification artifacts as opposed to amplification of pooled samples with a universal PCR primer as currently practiced in some next generation sequencing platforms (FIG. 1).
- ligation of the first barcode to at least one end of the nucleic acid molecule may be performed using a short adaptor, e.g., a short stem-loop adaptor.
- short adaptors have a stem comprising about 14 to about 23 nucleotides whereas long adaptors have a stem of about 24 to about 40 nucleotides.
- Suppression refers to the selective exclusion of molecules less than a certain size flanked by terminal inverted repeats, due to their inefficient amplification when the primer(s) used for amplification correspond(s) to the entire repeat or a fraction of the repeat (Chenchik et al., 1996; Lukyanov et al., 1999; Siebert et al., 1995; Shagin et al., 1999).
- the reason for this lies in the equilibrium between productive PCR primer annealing and nonproductive self-annealing of the fragment's complementary ends.
- the shorter the insert the stronger the suppression effect and vice versa.
- the target nucleic acid is incubated with an exemplary mixture comprising a stem-loop oligonucleotide with 3′ recessed, 3′ protruding, or blunt end; a 3′ proofreading DNA polymerase (Klenow fragment of the DNA polymerase I, T4 DNA polymerase, etc.); T4 DNA ligase; ATP; and dNTPs.
- exemplary enzymatic reactions are taking place simultaneously: “polishing” of the DNA ends and the oligonucleotide double-stranded stem-region; ligation of the oligonucleotide 3′ end to the 5′ phosphate of the DNA leaving a nick between the 3′ end of DNA and the 5′ end of the oligonucleotide double-stranded stem-region; polymerase extension of the 3′ DNA end that propagates toward the end of the stem-loop oligonucleotide; and a strand-displacement reaction within the oligonucleotide stem region.
- This process results in a library of DNA fragments with inverted repeat adaptors at their ends that include the first barcode sequence.
- a nucleic acid molecule of interest can be a single nucleic acid molecule or a plurality of nucleic acid molecules. Also, a nucleic acid molecule of interest can be of biological or synthetic origin. Examples of nucleic acid molecules include genomic DNA, cDNA, RNA, amplified DNA, a pre-existing nucleic acid library, etc.
- a nucleic acid molecule of interest may be subjected to various treatments, such as repair treatments and fragmenting treatments. Fragmenting treatments include mechanical, sonic, chemical, enzymatic, degradation over time, etc. Repair treatments include nick repair via extension and/or ligation, polishing to create blunt ends, removal of damaged bases such as deaminated, derivatized, abasic, or crosslinked nucleotides, etc.
- a nucleic acid molecule of interest may also be subjected to chemical modification (e.g., bisulfite conversion, methylation/demethylation), extension, amplification (e.g., PCR, isothermal, etc.), etc.
- a first barcode or a first set of barcodes may be coupled to at least one end of the nucleic acid molecule of interest.
- the first barcode may be provided within a stem-loop adaptor, or a first set of barcodes may be provided as a population of stem-loop adaptors.
- a stem-loop adaptor may comprise a stem-loop adaptor as described by U.S. Pat. No. 7,803,550.
- a stem-loop adaptor may include a barcode within the stem portion of the stem-loop adaptor.
- the loop portion of a stem-loop adaptor may include a cleavable replication stop.
- a stem-loop adaptor including the barcode may be coupled to one end of a target nucleic acid molecule or to both ends of a target nucleic acid molecule.
- the intramolecular inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may comprise an identical sequence.
- coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule comprising terminal inverted repeats thereby allowing the molecule to form a stem loop.
- the intramolecular inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may not comprise an identical sequence. In this aspect, coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule lacking terminal inverted repeats and therefore the molecule will not be able to form a stem loop.
- a stem-loop adaptor including the barcode may be coupled to the nucleic acid molecule via ligation to the 5′ end of the nucleic acid molecule, for example, by blunt-end ligation. Ligating the stem-loop adaptor to one or both ends of a target nucleic acid molecule may result in nick formation. Said one or more nicks may be removed from the ligated stem-loop adaptor and the nucleic acid molecule.
- an extension reaction may extend the 3′ end of the nucleic acid molecule through the stem-loop adaptor where the loop portion is cleaved at the cleavable replication stop.
- a second barcode or a second set of barcodes may be coupled to the first barcode or the first set of barcodes that is/are coupled to the nucleic acid molecule(s).
- the first barcode may be located between the nucleic acid molecule and the second barcode.
- the second barcode may be provided within a primer, or a second set of barcodes may be provided as a population of primers.
- primer extension or PCR may be used to incorporate the second barcode.
- the primer may include a 3′ portion and a 5′ portion, where the 3′ portion may anneal to a portion of the first barcode and the 5′ portion comprises the second barcode.
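- The primer layout described above can be sketched by concatenating each second barcode (5′ portion) with a shared annealing region (3′ portion). The function name and all sequences are placeholders chosen for illustration, not sequences from the specification.

```python
def build_barcoded_primers(second_barcodes, annealing_region):
    """Map each second barcode to a primer written 5'->3': barcode + annealing region."""
    return {bc: bc + annealing_region for bc in second_barcodes}

# Placeholder sequences.
second_barcodes = ["TTAGGC", "CAGATC", "GCCAAT"]
annealing_region = "CCGATCTACCTGA"  # hypothetical 3' portion that anneals to the adaptor

primers = build_barcoded_primers(second_barcodes, annealing_region)
for barcode, primer in primers.items():
    print(barcode, primer)
```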
- FIG. 1 provides a schematic comparing the use of a single set of barcodes (e.g., Ion Torrent System) versus the use of first and second sets of barcodes in embodiments of the present technology.
- the primers are shown to bind to sequences outside the first barcode; however, in some embodiments the primer may bind to the first barcode or even to gDNA sequences.
- the ligation may be to a unique adaptor molecule (e.g., a stem loop) that is added to both ends of the gDNA, or two (or more) distinct adaptor molecules that have different sequences.
- FIG. 2 illustrates a specific embodiment of the invention that utilizes stem-loop adaptors and methods as described in U.S. Pat. No. 7,803,550 to add adaptors with the first bar code. The method is further followed by PCR amplification to add the second barcode.
- the present methods may be adapted for use with other next generation sequencing platforms and are not limited to use with the Ion Torrent platform.
- a pre-mix of 2 µL/sample Template Preparation Buffer ((6.5× ATP-free ligase buffer comprising: 325 mM Tris-HCl pH 7.6 @ 25° C., 65 mM MgCl2, 3.25 mM DTT) supplemented with dNTP mix (2.5 mM each dNTP)) and 1 µL/sample Template Preparation Enzyme (End Repair Mix, Enzymatics Cat # Y914-LC-L) was prepared in a separate tube and mixed by pipette.
- Library Amplification pre-mix of 4.25 µL/sample nuclease-free water, 3.75 µL/sample EvaGreen:FC (9:1), 50.5 µL/sample Library Amplification Buffer (comprising: 150 mM Tris-SO4, pH 8.5 @ 25° C., 120 mM TMAC, 0.75 mM MgCl2, 0.06% w/v Gelatin, supplemented with 0.375 µM of each PCR oligo), and 1.5 µL/sample Library Amplification Enzyme (KAPA HiFi DNA Polymerase (KK2102) at 1 U/µL) was prepared in a separate tube immediately prior to use.
- the final concentration of the reaction components was as follows: 100 mM Tris-SO4, pH 8.5 @ 25° C., 80 mM TMAC, 2.5 mM MgCl2, 0.04% w/v Gelatin, 1× EvaGreen, 1× FCD, 1.5 U KAPA HiFi DNA Polymerase, 0.25 µM each PCR oligo.
- the plates were centrifuged and then incubated in a real-time thermal cycler as follows: 1 cycle at 72° C. for 3 min; 1 cycle at 85° C. for 2 min; 1 cycle at 98° C. for 2 min; 4 cycles of 98° C. for 20 sec, 67° C. for 20 sec, 72° C. for 40 sec; and 4-21 cycles of 98° C. for 20 sec and 72° C. for 50 sec.
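- As a rough aid to reading the cycling program above, the sketch below sums the programmed hold times for a chosen number of final cycles (4 to 21). Ramp times between temperatures are ignored, which is an assumption, so actual run times will be longer. The function name is hypothetical.

```python
def program_hold_seconds(final_cycles):
    """Total programmed hold time in seconds for the stated cycling program."""
    if not 4 <= final_cycles <= 21:
        raise ValueError("final_cycles must be between 4 and 21")
    initial_holds = 3 * 60 + 2 * 60 + 2 * 60   # 72 C, 85 C, 98 C single holds
    three_step = 4 * (20 + 20 + 40)            # 4 cycles: 98 C / 67 C / 72 C
    two_step = final_cycles * (20 + 50)        # 4-21 cycles: 98 C / 72 C
    return initial_holds + three_step + two_step

print(program_hold_seconds(4))   # 1020
print(program_hold_seconds(21))  # 2210
```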
- the Tm of the Ion Universal Adaptor P1/A was computed using the Oligo Analyzer (IDT) at 0.25 µM oligo, 100 mM Na+, 2.5 mM Mg++, and 0.3 mM dNTPs to be 61° C.
- the experimental conditions were as described in Example 1, except that the oligonucleotide sequences indicated in Table 2 were used and a single universal stem-loop adaptor (2 µM in the library synthesis reaction) was used to attach terminal inverted repeats to both ends of the DNA fragments.
- the stem-loop adaptor containing sequences for generating terminal inverted repeats showed significant improvement of signal-to-noise ratio (FIG. 6) over the design containing no such sequences for generating inverted repeats described in Example 1 (FIG. 4).
Abstract
Description
- The present application is a continuation of U.S. application Ser. No. 16/183,107 filed Nov. 7, 2018; which application is a continuation of U.S. application Ser. No. 14/438,280 filed Apr. 24, 2015 and now issued as U.S. Pat. No. 10,155,942; which application is a National Stage Entry of International Application No. PCT/US2013/068468 filed Nov. 5, 2013; which application claims the priority benefit of U.S. Provisional Application No. 61/722,357 filed Nov. 5, 2012; the entire contents of which are incorporated herein by reference.
- The sequence listing that is contained in the file named “CLON-163_Seqlist_14438280”, which is 2 KB (as measured in Microsoft Windows®) and was created on Nov. 5, 2013, is filed herewith by electronic submission and is incorporated by reference herein.
- The present invention relates generally to the fields of molecular biology and nucleic acid sequencing. More particularly, it concerns methods of barcoding nucleic acids.
- Barcodes can be used to identify nucleic acid molecules, for example, where sequencing can reveal a certain barcode coupled to a nucleic acid molecule of interest. In some instances, a sequence-specific event can be used to identify a nucleic acid molecule, where at least a portion of the barcode is recognized in the sequence-specific event, e.g., at least a portion of the barcode can participate in a ligation or extension reaction. The barcode can therefore allow identification, selection or amplification of gDNA molecules that are coupled thereto.
- One method to couple a barcode to nucleic acid molecules of interest includes preparation of an Ion gDNA fragment library, as described by Life Technologies for the Ion Torrent System. In this method, fragments of gDNA are ligated to adaptors, where at least one end of each fragment of genomic DNA is ligated to an adaptor including a barcode. The ligated adaptors and gDNA fragments may be nick repaired, size selected, and amplified using PCR with primers directed to the adaptors to produce an amplified library. For example, ligation adaptors including 16 different barcodes can be used to prepare 16 different gDNA samples, each with a unique barcode, such that either each sample can be amplified separately by PCR using the same PCR primers and then pooled (mixed together) or each sample can be pooled first and then simultaneously amplified using the same PCR primers. As a result each gDNA sample can be identified by its attached unique barcode. However, the number of different ligation adaptors needed is equal to the number of barcodes. For example, the production of 256 sample libraries capable of being sequenced as a mixture would require 256 different ligation adapters.
- Embodiments of the present invention provide methods of making dual barcoded nucleic acid molecules for sequencing. Having a first and a second barcode on the same end of a nucleic acid molecule may permit a sequencing read to begin with the second barcode, continue through the first barcode and then into the nucleic acid molecule. Identification of the second barcode, first barcode, and sequence of the nucleic acid molecule may therefore be obtained in a single read as opposed to having to provide a sequencing read from each end of the nucleic acid molecule in order to read the sequence of a single barcode back from a distal end of the nucleic acid molecule, as is the case in traditional methods of using dual sequencing barcodes.
- As such, one embodiment of the present invention relates to a method of making a dual barcoded nucleic acid molecule comprising: coupling one strand of a stem-loop oligonucleotide to a nucleic acid molecule to form a first barcode coupled nucleic acid molecule, the stem-loop oligonucleotide including an intramolecular inverted repeat and a loop, the inverted repeat including a first barcode; displacing one strand of the stem-loop oligonucleotide from the first barcode coupled nucleic acid molecule by strand displacement or by nick translation polymerization to form a first barcoded nucleic acid molecule; annealing a primer to the first barcoded nucleic acid molecule, the primer including a first portion complementary to the first barcoded nucleic acid molecule and a second portion including a second barcode; and extending the annealed primer to form a dual barcoded nucleic acid molecule, the dual barcoded nucleic acid molecule including the second barcode, the first barcode, and at least a portion of the nucleic acid molecule. In some aspects, the first portion of the primer anneals to the first barcode or to a portion of the first barcode. In other aspects, the first portion of the primer does not anneal within the first barcode. The extending may be performed by using a polymerase, e.g., via polymerase chain reaction or PCR. The nucleic acid molecule may be genomic DNA, cDNA, amplified DNA, a nucleic acid library, or a fragment thereof.
- In one embodiment of the invention, there is a method of preparing a nucleic acid molecule, comprising: providing a double stranded nucleic acid molecule; and attaching one strand of a stem-loop oligonucleotide comprising an inverted repeat and a loop to the double stranded nucleic acid molecule to produce an oligonucleotide-attached nucleic acid molecule. The double stranded nucleic acid molecule may be a double stranded DNA molecule, in some embodiments. In specific embodiments, the attaching is further defined as attaching the oligonucleotide to the double stranded nucleic acid molecule under conditions to produce a non-covalent junction, such as a nick, a gap, or a 5′ flap structure, in the oligonucleotide-attached nucleic acid molecule. In particular aspects of the invention, the attaching is further defined as ligating. Ligating may be defined as ligating the 3′ end of the stem-loop oligonucleotide adaptor to the 5′ end of the target nucleic acid molecule. The method may further comprise displacing one strand of the oligonucleotide from the oligonucleotide-attached nucleic acid molecule by strand displacement or by nick translation polymerization. In a specific embodiment, at least part of the oligonucleotide-attached nucleic acid molecule is amplified, such as by polymerase chain reaction, RNA transcription, or strand displacement, for example. Methods of the invention may further comprise amplifying an oligonucleotide-attached nucleic acid molecule, wherein at least part of the stem-loop adaptor's intramolecular inverted repeat is excluded from the amplified oligonucleotide-attached nucleic acid molecule.
- Ligating embodiments may be further defined as comprising: generating ligatable ends on the double stranded nucleic acid molecule; generating a ligatable end on the stem-loop oligonucleotide; and ligating one strand of the ligatable end of the stem-loop oligonucleotide to one strand of an end of the nucleic acid molecule, thereby generating a non-covalent junction, such as a nick, a gap, or a 5′ flap structure, in the oligonucleotide-attached nucleic acid molecule. In further aspects, the methods comprise generating blunt ends on the nucleic acid molecule; generating a blunt end on the stem-loop oligonucleotide; and ligating one strand of the blunt end of the stem-loop oligonucleotide to one strand of a blunt end of the nucleic acid molecule, thereby generating a nick in the oligonucleotide-ligated nucleic acid molecule.
- In some aspects, the method may comprise coupling one strand of a stem-loop oligonucleotide adaptor to each end of the target nucleic acid molecule. In some aspects, the inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may comprise an identical sequence. In this aspect, coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule comprising terminal inverted repeats thereby allowing the molecule to form a stem loop. In other aspects, the inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may not comprise an identical sequence. In this aspect, coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule lacking terminal inverted repeats and therefore the molecule will not be able to form a stem loop.
- In additional embodiments, the oligonucleotide-attached nucleic acid molecule comprises a nick having a 3′ hydroxy group, wherein there is polymerization from the 3′ hydroxy group of at least part of the oligonucleotide-attached nucleic acid molecule.
- Strand displacement or nick translation polymerization may be further defined as polymerization that ceases at a non-replicable base or region in the loop or in a region of the stem adjacent to the loop.
- In a specific aspect of the invention, the method further comprises the step of digesting the double stranded DNA molecule with an endonuclease to generate DNA fragments, wherein the oligonucleotide becomes ligated to one strand of the DNA fragment and wherein polymerization of an oligonucleotide-ligated DNA fragment excludes at least part of the stem-loop adaptor's intramolecular inverted repeat by subjecting the oligonucleotide-ligated DNA fragment to strand displacement or nick translation polymerization that halts at a base or sequence in the loop or in a region of the stem adjacent to the loop.
- In some embodiments, the stem-loop oligonucleotide is further defined as comprising a cleavable base. In particular, in some cases the cleavable base is present in the loop of the oligonucleotide or in a sequence of the stem adjacent to the loop. The cleavable base or sequence may comprise an abasic site or sequence, hexaethylene glycol, and/or a bulky chemical moiety attached to the sugar-phosphate backbone or the base. In specific embodiments, the abasic site or sequence is introduced by one or more enzymes in the single solution. In a further specific embodiment, the loop of the stem-loop oligonucleotide comprises at least one deoxy-uridine.
- In specific aspects, a 5′ end of the stem-loop oligonucleotide lacks a phosphate.
- Bar codes, also referred to as “barcodes,” can be generated based on selecting a particular nucleic acid sequence. For example, Illumina™ sequencing can utilize 6 bases to effectively generate 48 different barcodes. The Ion Torrent sequencer (e.g., the Ion Proton™ Sequencer or the Ion PGM™ sequencer) can utilize 6 bases to generate 16 barcodes. In some embodiments, rules may be applied to the generation of bar codes that allow for separate barcodes to be correctly identified even if two errors occur during sequencing. Barcoding is described, e.g., in U.S. Pat. No. 7,902,122 and U.S. Pat. Publn. 2009/0098555. Barcode incorporation by primer extension, for example via PCR, may be performed using methods described in U.S. Pat. No. 5,935,793 or US 2010/0227329. In some embodiments, a barcode may be incorporated into a nucleic acid via ligation, which can then be followed by amplification; for example, methods described in U.S. Pat. Nos. 5,858,656, 6,261,782, U.S. Pat. Publn. 2011/0319290, or U.S. Pat. Publn. 2012/0028814 may be used with the present invention. In some embodiments, one or more bar codes may be used, e.g., as described in U.S. Pat. Publn. 2007/0020640, U.S. Pat. Publn. 2009/0068645, U.S. Pat. Publn. 2010/0273219, U.S. Pat. Publn. 2011/0015096, or U.S. Pat. Publn. 2011/0257031.
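- One common way to realize a rule of this kind, offered here as an assumption rather than as the rule used by any particular platform, is to require a minimum pairwise Hamming distance between barcodes: a set with minimum distance d can correct up to (d - 1) // 2 substitution errors by nearest-barcode decoding, so tolerating two errors requires d of at least 5. The barcode set below is a toy example, insertions and deletions are not handled, and all function names are hypothetical.

```python
from itertools import combinations

def hamming(a, b):
    """Number of mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))

def min_distance(barcodes):
    """Smallest pairwise Hamming distance within the barcode set."""
    return min(hamming(a, b) for a, b in combinations(barcodes, 2))

def decode(observed, barcodes):
    """Return the nearest barcode if it lies within the correctable radius, else None."""
    radius = (min_distance(barcodes) - 1) // 2
    best = min(barcodes, key=lambda b: hamming(observed, b))
    return best if hamming(observed, best) <= radius else None

# Toy barcode set with pairwise distance 6.
barcodes = ["ACACAC", "CGTGTG", "GTGTCA"]
print(min_distance(barcodes))        # 6
print(decode("ACACGT", barcodes))    # ACACAC (two substitutions corrected)
print(decode("ACATGT", barcodes))    # None (three substitutions, not assigned)
```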
- Although some embodiments incorporate a second bar code into a genomic library generated, e.g., via the methods described in U.S. Pat. No. 7,803,550, methods of the present invention may be used in combination with a wide variety of techniques for generating a nucleic acid library. For example, a second barcode may be incorporated into a fragment of a nucleic acid library, wherein the nucleic acid library was generated with an approach compatible with Illumina sequencing such as a Nextera™ DNA sample prep kit; additional approaches for preparing Illumina next-generation sequencing libraries are described, e.g., in Oyola et al. (2012). In other embodiments, a nucleic acid library is generated with a method compatible with a SOLiD™ or Ion Torrent sequencing method (e.g., a SOLiD® Fragment Library Construction Kit, a SOLiD® Mate-Paired Library Construction Kit, a SOLiD® ChIP-Seq Kit, a SOLiD® Total RNA-Seq Kit, a SOLiD® SAGE™ Kit, an Ambion® RNA-Seq Library Construction Kit, etc.). Additional next-generation sequencing methods, including various methods for library construction that may be used with embodiments of the present invention, are described, e.g., in Pareek (2011) and Thudi (2012).
- In an additional embodiment, there is a kit housed in a suitable container that comprises one or more compositions of the invention and/or comprises one or more compositions suitable for at least one method of the invention.
- Additional embodiments of the invention include a library of DNA molecules prepared by the methods of the invention.
- As used herein in the specification, “a” or “an” may mean one or more. As used herein in the claim(s), when used in conjunction with the word “comprising,” the words “a” or “an” may mean one or more than one.
- The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive, although the disclosure supports a definition that refers to only alternatives and “and/or.” As used herein “another” may mean at least a second or more.
- Throughout this application, the term “about” is used to indicate that a value includes the inherent variation of error for the device, the method being employed to determine the value, or the variation that exists among the study subjects.
- Other objects, features and advantages of the present invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
- The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
- FIG. 1. Schematic comparing use of a single set of barcodes versus the use of first and second sets of barcodes.
- FIG. 2. Illustration of the use of stem-loop adaptors to add the first barcode, further followed by PCR amplification to add the second barcode.
- FIG. 3. Illustration of adaptors that contain tandem dual barcodes but do not generate terminal inverted repeats.
- FIG. 4. Results of real-time PCR Library Amplification using tandem dual barcodes without inverted repeats.
- FIG. 5. Illustration of adaptors containing tandem dual barcodes that generate terminal inverted repeats.
- FIG. 6. Results of real-time PCR Library Amplification using tandem dual barcodes with inverted repeats.
- The present technology relates to barcoding of nucleic acid molecules. Barcodes, also described as tags, indexing sequences, or identifier codes, include specific sequences that are incorporated into a nucleic acid molecule for identification purposes. For example, synthetic nucleic acid molecules can be joined with genomic DNA (gDNA) by ligation and/or primer extension. The present technology is directed towards nucleic acid molecules having multiple barcodes, in particular, sequential or tandem barcodes. An example of a tandem barcode includes a first barcode coupled to at least one end of a gDNA molecule by a ligation event (e.g., ligation to a synthetic stem-loop adaptor) followed by a second barcode that is coupled to the gDNA by primer extension (e.g., PCR), where the first barcode is proximal to the gDNA molecule (closer to the insert) and the second barcode is distal to the gDNA (further from the insert). Methods of using stem loop adaptor ligation and primer extension or PCR to add additional sequences are described, e.g., in U.S. Pat. No. 7,803,550, which is incorporated by reference herein in its entirety. These methods may be used in embodiments of the present invention to add a first and/or second barcode to a nucleic acid molecule.
- Barcodes can be used to identify nucleic acid molecules, for example, where sequencing can reveal a certain barcode coupled to a nucleic acid molecule of interest. In some instances, a sequence-specific event can be used to identify a nucleic acid molecule, where at least a portion of the barcode is recognized in the sequence-specific event, e.g., at least a portion of the barcode can participate in a ligation or extension reaction. The barcode can therefore allow identification, selection or amplification of gDNA molecules that are coupled thereto.
- One method to couple a barcode to nucleic acid molecules of interest includes preparation of an Ion gDNA fragment library, as described by Life Technologies for the Ion Torrent System. In this method, fragments of gDNA are ligated to adaptors, where at least one end of each fragment of genomic DNA is ligated to an adaptor including a barcode. The ligated adaptors and gDNA fragments can be nick repaired, size selected, and amplified using PCR with primers directed to the adaptors to produce an amplified library. For example, ligation adaptors including 16 different barcodes can be used to prepare 16 different gDNA samples, each with a unique barcode, such that either each sample can be amplified separately by PCR using the same PCR primers and then pooled (mixed together), or the samples can be pooled first and then simultaneously amplified using the same PCR primers. As a result, each gDNA sample can be identified by its attached unique barcode. However, one problem with this approach is that the number of different ligation adaptors needed is equal to the number of barcodes. For example, the production of 256 sample libraries capable of being sequenced as a mixture would require 256 different ligation adaptors.
- To address this problem, fragments of genomic DNA can be ligated to adaptors having a first set of barcodes, for example, using the stem-loop adaptors and methods as described in U.S. Pat. No. 7,803,550. Adaptors having 16 different barcodes can be generated and may be used, e.g., with an Ion Torrent sequencing system (e.g., the Ion Proton™ Sequencer or the Ion PGM™ sequencer). The ligated adaptors and gDNA fragments having the first set of barcodes can then be subjected to a primer extension reaction or PCR using a primer having a second set of barcodes. The resulting nucleic acid molecules each have one barcode from the first set of barcodes adjacent to one barcode from the second set of barcodes on at least one end of the nucleic acid molecule. The exact number of barcodes may be determined based on the particular application; for example, in some embodiments, the second barcode may utilize six bases to generate, e.g., 16 additional barcodes. Nonetheless, depending on the application and/or
sequencing method - In some embodiments, the second set of barcodes can include 16 different primers. In this manner, the ligation adaptors having sixteen different barcodes (the first set of barcodes) can be amplified with 16 different primers (directed to the second set of barcodes) to produce 16×16 unique combinations of barcodes, which allows 256 samples to be pooled and multiplexed 256-fold. Use of only 16+16 oligonucleotides to achieve this level of multiplexing is a significant savings of cost and time in producing 256 useful libraries. Preferably, the first barcode and the second barcode may be on the same side of the gDNA (i.e., as part of the same adaptor) and can be sequenced in series with each other and with the gDNA in order to economize on sequencing time and cost.
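The combinatorial arithmetic described above can be illustrated with a minimal sketch. The labels `L01`..`L16` and `P01`..`P16` are hypothetical placeholders standing in for real barcode sequences, and `barcode_pairs` is an illustrative helper, not part of any described kit or software.

```python
from itertools import product


def barcode_pairs(first_set, second_set):
    """Return every (first barcode, second barcode) combination."""
    return list(product(first_set, second_set))


# Hypothetical placeholder labels; real barcodes would be nucleotide sequences.
first_set = [f"L{i:02d}" for i in range(1, 17)]   # 16 ligation adaptors
second_set = [f"P{i:02d}" for i in range(1, 17)]  # 16 PCR primers

pairs = barcode_pairs(first_set, second_set)
oligos_needed = len(first_set) + len(second_set)

print(len(pairs), "sample identities from", oligos_needed, "oligonucleotides")
```

With two sets of 16, the number of distinguishable samples grows as the product of the set sizes (256), while the number of oligonucleotides to synthesize grows only as their sum (32).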
- Additionally, the first barcode may be directly attached to the gDNA, whereas the second barcode may be attached during amplification by PCR. Thus, the first set of barcodes can be used to either: a) tag all members of a gDNA sample with the same barcode or b) tag different members of the gDNA sample with different barcodes. For example, if ligation is performed using a single barcode, then all members of the gDNA will generally carry the same barcode. However, if the ligation adaptor is synthesized with a random or partially-random barcode, different molecules in the gDNA sample will have different barcodes. In the extreme, if the barcode region contains 16 random bases, then 65,536 barcodes may be represented within the ligated nucleic acid library. The barcode may be used to distinguish among different members of the input gDNA library; for example, these methods may be used for independent counting of molecular duplicates in the gDNA library (most of which will have different barcodes) and duplicates created during PCR amplification (most of which will carry identical first barcodes). Partially-random first barcodes can be used to give information about individual samples and about individual molecules.
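As a rough illustration of random first barcodes, the sketch below computes the size of a fully random barcode space and separates distinct input molecules from PCR duplicates. For a barcode of n fully random bases the number of possible sequences is 4^n; for reference, 4^8 = 65,536 and 4^16 = 4,294,967,296. The function names are hypothetical, and identifying an insert by its exact sequence is a simplifying assumption (a real analysis would more likely use mapping coordinates and tolerate sequencing errors).

```python
from collections import Counter


def barcode_space(n_random_bases, alphabet_size=4):
    """Number of distinct barcodes for n fully random positions."""
    return alphabet_size ** n_random_bases


def count_molecules(reads):
    """Count distinct molecules and PCR duplicates.

    reads: iterable of (first_barcode, insert_sequence) tuples. Reads that
    share both values are treated as PCR copies of one input molecule.
    """
    families = Counter(reads)
    unique_molecules = len(families)
    pcr_duplicates = sum(families.values()) - unique_molecules
    return unique_molecules, pcr_duplicates


# Hypothetical toy reads: the first two are copies of the same molecule,
# the third has the same insert but a different first barcode.
reads = [
    ("ACGT", "insertA"),
    ("ACGT", "insertA"),
    ("TTAG", "insertA"),
    ("ACGT", "insertB"),
]
unique, duplicates = count_molecules(reads)
print(barcode_space(8), barcode_space(16), unique, duplicates)
```

In this toy input, two reads with the same insert but different first barcodes are counted as two molecular duplicates in the input, whereas two reads with identical first barcode and insert are counted as one molecule plus one PCR duplicate.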
- “Amplification,” as used herein, refers to any in vitro process for increasing the number of copies of a nucleotide sequence or sequences. Nucleic acid amplification results in the incorporation of nucleotides into DNA or RNA. As used herein, one amplification reaction may consist of many rounds of DNA replication. For example, one PCR reaction may consist of 30-100 “cycles” of denaturation and replication.
- “Nucleotide,” as used herein, is a term of art that refers to a base-sugar-phosphate combination. Nucleotides are the monomeric units of nucleic acid polymers, i.e., of DNA and RNA. The term includes ribonucleotide triphosphates, such as rATP, rCTP, rGTP, or rUTP, and deoxyribonucleotide triphosphates, such as dATP, dCTP, dUTP, dGTP, or dTTP.
- A “nucleoside” is a base-sugar combination, i.e., a nucleotide lacking a phosphate. It is recognized in the art that there is a certain inter-changeability in usage of the terms nucleoside and nucleotide. For example, the nucleotide deoxyuridine triphosphate, dUTP, is a deoxyribonucleoside triphosphate. After incorporation into DNA, it serves as a DNA monomer, formally being deoxyuridylate, i.e., dUMP or deoxyuridine monophosphate. One may say that one incorporates dUTP into DNA even though there is no dUTP moiety in the resultant DNA. Similarly, one may say that one incorporates deoxyuridine into DNA even though that is only a part of the substrate molecule.
- “Incorporating,” as used herein, means becoming part of a nucleic acid polymer.
- “Oligonucleotide,” as used herein, refers collectively and interchangeably to two terms of art, “oligonucleotide” and “polynucleotide.” Note that although oligonucleotide and polynucleotide are distinct terms of art, there is no exact dividing line between them and they are used interchangeably herein. The term “adaptor” may also be used interchangeably with the terms “oligonucleotide” and “polynucleotide.”
- “Primer” as used herein refers to a single-stranded oligonucleotide or a single-stranded polynucleotide that is extended by covalent addition of nucleotide monomers during amplification. Often, nucleic acid amplification is based on nucleic acid synthesis by a nucleic acid polymerase. Many such polymerases require the presence of a primer that can be extended to initiate nucleic acid synthesis.
- The terms “hairpin” and “stem-loop oligonucleotide” as used herein refer to a structure formed by an oligonucleotide comprised of 5′ and 3′ terminal regions, which are intramolecular inverted repeats that form a double-stranded stem, and a non-self-complementary central region, which forms a single-stranded loop.
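The stem-loop definition above can be checked computationally. The following is a minimal sketch, assuming perfect Watson-Crick pairing only and treating deoxyuridine as thymidine for pairing purposes; `stem_length` and the example oligonucleotide are hypothetical and are not taken from the tables of this specification.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def stem_length(oligo):
    """Length of the perfectly paired stem formed by the two termini.

    Counts how many bases, starting at the 5' end, pair in order with the
    bases starting at the 3' end. A mismatch or wobble pair ends the count.
    """
    seq = oligo.upper().replace("U", "T")  # treat dU as T for pairing
    rc = reverse_complement(seq)
    n = 0
    while n < len(seq) // 2 and seq[n] == rc[n]:
        n += 1
    return n


# Hypothetical hairpin: 6-nt stem, 4-nt loop, 6-nt complementary stem.
hairpin = "GCGCAT" + "TTTT" + "ATGCGC"
stem = stem_length(hairpin)
loop = len(hairpin) - 2 * stem
print(stem, loop)
```

Because the count stops at the first non-complementary position, an adaptor designed with a deliberate mismatch in its stem would report only the paired run preceding that mismatch; a scoring scheme that tolerates mismatches would be needed for such designs.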
- Embodiments of the present invention may provide one or more benefits or advantages as follows. Having a first and a second barcode on the same end of a nucleic acid molecule may permit a sequencing read to begin with the second barcode, continue through the first barcode and then into the nucleic acid molecule. Identification of the second barcode, first barcode, and sequence of the nucleic acid molecule may therefore be obtained in a single read as opposed to having to provide a sequencing read from each end of the nucleic acid molecule in order to read the sequence of a single barcode back from a distal end of the nucleic acid molecule, as is the case in traditional methods of using dual sequencing barcodes.
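The single-read layout described above (second barcode, then first barcode, then the nucleic acid molecule) can be sketched as a simple parser. This is an illustrative sketch only: `parse_read`, the optional `spacer` argument, and the four-base barcodes are hypothetical, and exact prefix matching is assumed with no allowance for sequencing errors.

```python
def parse_read(read, second_barcodes, first_barcodes, spacer=""):
    """Split a read laid out as: second barcode, spacer, first barcode, insert.

    Returns (second_barcode, first_barcode, insert), or None when either
    barcode is not recognized at the expected position.
    """
    for bc2 in second_barcodes:
        if not read.startswith(bc2 + spacer):
            continue
        rest = read[len(bc2) + len(spacer):]
        for bc1 in first_barcodes:
            if rest.startswith(bc1):
                return bc2, bc1, rest[len(bc1):]
    return None


# Hypothetical barcode sets and read.
second_barcodes = ["AACC", "TTGG"]
first_barcodes = ["GGTT", "CCAA"]
parsed = parse_read("AACCGGTTACGTACGT", second_barcodes, first_barcodes)
print(parsed)
```

Both barcodes and the start of the insert are recovered from one read, which is the property the paragraph above contrasts with reading a barcode back from the distal end.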
- Another advantage in certain embodiments is that the first and second barcodes may be preferably added in two separate steps. This order of addition may significantly increase the possible combinations of encoded sample identity information. In addition, amplification of individual samples with individual PCR primers may reduce the chance of cross-contamination between barcodes due to priming and amplification artifacts as opposed to amplification of pooled samples with a universal PCR primer as currently practiced in some next generation sequencing platforms (FIG. 1). In particular, ligation of the first barcode to at least one end of the nucleic acid molecule may be performed using a short adaptor, e.g., a short stem-loop adaptor. For example, short adaptors have a stem comprising about 14 to about 23 nucleotides whereas long adaptors have a stem of about 24 to about 40 nucleotides.
- Qualitative observations and quantitative experiments show that ligation of a single adaptor or two different adaptors designed to contain a common sequence proximal to the ligation site may have a beneficial effect on the ability to preferentially amplify molecules comprising target inserts of controlled size and discriminate against adaptor dimers carrying no insert or molecules comprising short inserts that have little or no information value. This phenomenon is referred to as suppression or suppression PCR. Suppression refers to the selective exclusion of molecules less than a certain size flanked by terminal inverted repeats, due to their inefficient amplification when the primer(s) used for amplification correspond(s) to the entire repeat or a fraction of the repeat (Chenchik et al., 1996; Lukyanov et al., 1999; Siebert et al., 1995; Shagin et al., 1999). The reason for this lies in the equilibrium between productive PCR primer annealing and nonproductive self-annealing of the fragment's complementary ends. At a fixed size of a flanking terminal inverted repeat, the shorter the insert, the stronger the suppression effect and vice versa. Likewise, at a fixed insert size, the longer the terminal inverted repeat, the stronger the suppression effect (Chenchik et al., 1996; Lukyanov et al., 1999; Siebert et al., 1995; Shagin et al., 1999).
- By virtue of attaching a terminal inverted repeat to both ends of a nucleic acid molecule by ligation and/or primer extension, one may achieve precise control over the efficiency of primer annealing and extension of target inserts of desired minimal size versus undesirable adaptor dimers or short insert byproducts as described by U.S. Pat. No. 7,803,550. Efficiency in attaching the first barcode via adaptor ligation may be utilized for preventing bias and preserving the representation of nucleic acid molecules in a sample. Conversely, coupling the second barcode via primer extension may efficiently employ a long primer including the second barcode.
- In this embodiment of the present invention, the target nucleic acid is incubated with an exemplary mixture comprising a stem-loop oligonucleotide with 3′ recessed, 3′ protruding, or blunt end; a 3′ proofreading DNA polymerase (Klenow fragment of the DNA polymerase I, T4 DNA polymerase, etc.); T4 DNA ligase; ATP; and dNTPs. Four exemplary enzymatic reactions are taking place simultaneously: “polishing” of the DNA ends and the oligonucleotide double-stranded stem-region; ligation of the oligonucleotide 3′ end to the 5′ phosphate of the DNA leaving a nick between the 3′ end of DNA and the 5′ end of the oligonucleotide double-stranded stem-region; polymerase extension of the 3′ DNA end that propagates toward the end of the stem-loop oligonucleotide; and a strand-displacement reaction within the oligonucleotide stem region. This process results in a library of DNA fragments with inverted repeat adaptors at their ends that include the first barcode sequence.
- The following outline provides further details relating to the methods and compositions of the present technology. All particular examples provided are understood to be non-limiting examples.
- A. Preparation of Nucleic Acid Molecules of Interest
- A nucleic acid molecule of interest can be a single nucleic acid molecule or a plurality of nucleic acid molecules. Also, a nucleic acid molecule of interest can be of biological or synthetic origin. Examples of nucleic acid molecules include genomic DNA, cDNA, RNA, amplified DNA, a pre-existing nucleic acid library, etc.
- A nucleic acid molecule of interest may be subjected to various treatments, such as repair treatments and fragmenting treatments. Fragmenting treatments include mechanical, sonic, chemical, enzymatic, degradation over time, etc. Repair treatments include nick repair via extension and/or ligation, polishing to create blunt ends, removal of damaged bases such as deaminated, derivatized, abasic, or crosslinked nucleotides, etc. A nucleic acid molecule of interest may also be subjected to chemical modification (e.g., bisulfite conversion, methylation/demethylation), extension, amplification (e.g., PCR, isothermal, etc.), etc.
- B. First Barcode Coupling
- A first barcode or a first set of barcodes may be coupled to at least one end of the nucleic acid molecule of interest. In some aspects, the first barcode may be provided within a stem-loop adaptor, or a first set of barcodes may be provided as a population of stem-loop adaptors. A stem-loop adaptor may comprise a stem-loop adaptor as described by U.S. Pat. No. 7,803,550. In some aspects, a stem-loop adaptor may include a barcode within the stem portion of the stem-loop adaptor. In some aspects, the loop portion of a stem-loop adaptor may include a cleavable replication stop.
- In some aspects, a stem-loop adaptor including the barcode may be coupled to one end of a target nucleic acid molecule or to both ends of a target nucleic acid molecule. In some aspects, the intramolecular inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may comprise an identical sequence. In this aspect, coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule comprising terminal inverted repeats thereby allowing the molecule to form a stem loop. In other aspects, the intramolecular inverted repeat of the stem-loop adaptors coupled to each end of a target nucleic acid molecule may not comprise an identical sequence. In this aspect, coupling of the stem-loop adaptors to each end of the target nucleic acid molecule will produce a nucleic acid molecule lacking terminal inverted repeats and therefore the molecule will not be able to form a stem loop.
- In some aspects, a stem-loop adaptor including the barcode may be coupled to the nucleic acid molecule via ligation to the 5′ end of the nucleic acid molecule, for example, by blunt-end ligation. Ligating the stem-loop adaptor to one or both ends of a target nucleic acid molecule may result in nick formation. Said one or more nicks may be removed from the ligated stem-loop adaptor and the nucleic acid molecule.
- In some aspects, an extension reaction may extend the 3′ end of the nucleic acid molecule through the stem-loop adaptor where the loop portion is cleaved at the cleavable replication stop.
- C. Second Barcode Coupling
- A second barcode or a second set of barcodes may be coupled to the first barcode or the first set of barcodes that is/are coupled to the nucleic acid molecule(s). In this manner, the first barcode may be located between the nucleic acid molecule and the second barcode. In some aspects, the second barcode may be provided within a primer, or a second set of barcodes may be provided as a population of primers. In some aspects, primer extension or PCR may be used to incorporate the second barcode. In some aspects, the primer may include a 3′ portion and a 5′ portion, where the 3′ portion may anneal to a portion of the first barcode and the 5′ portion comprises the second barcode.
- Additional information regarding the methods may be found in Ausubel et al. (2003) or Sambrook et al. (1989). As would be recognized by one of skill in the art, various parameters may be manipulated to optimize preparation of a nucleic acid of interest, primer extension, or PCR to incorporate a second barcode.
- The following examples are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention.
-
FIG. 1 provides a schematic comparing the use of a single set of barcodes (e.g., Ion Torrent System) versus the use of first and second sets of barcodes in embodiments of the present technology. The primers are shown to bind to sequences outside the first barcode; however, in some embodiments the primer may bind to the first barcode or even to gDNA sequences. In aspects of the present invention, the ligation may be to a unique adaptor molecule (e.g., a stem loop) that is added to both ends of the gDNA, or to two (or more) distinct adaptor molecules that have different sequences. For purposes of reducing background from adaptor dimers, if different adaptor molecules are used, they may preferably have a common sequence that will suppress PCR amplification of very short molecules, including adaptor dimers.
FIG. 2 illustrates a specific embodiment of the invention that utilizes stem-loop adaptors and methods as described in U.S. Pat. No. 7,803,550 to add adaptors with the first barcode. The method is further followed by PCR amplification to add the second barcode.
- The inventors sought to test Ion Torrent adaptors that contain tandem dual barcodes but that lack sequences to generate terminal inverted repeats. Stem-loop adaptors and PCR primers were designed as shown in FIG. 3 and Table 1. Of note, the present methods may be adapted for use with other next generation sequencing platforms and are not limited to use with the Ion Torrent platform.
TABLE 1 Oligonucleotide Sequences
Oligo Name | SEQ ID NO: | Sequence |
---|---|---|
Ion Adaptor P1 | 1 | ATCACUGACUGCCCATAUUUUUUTATGGGCAGTCGGTGAT |
Ion PCR Primer P1 | 2 | CCACTACGCCTCCGCTTTCCTCTCTATGGGCAGTCGGTGAT |
Ion Adaptor A | 3 | ATCCUGGAAUCCTCTTATCUUUUUUGATAAGAGGATTCCCGGAT |
Ion PCR Primer A | 4 | CCATCTCATCCCTGCGTGTCTCCGACTCAG CTAAGGTAAC GATAAGAGGATTCCCGGAT |
Underlining = first barcode; Underlining and bold = second barcode.
- Template Preparation. Ten microliters of each DNA sample (0.1 ng/μL Covaris-sheared human gDNA) was added to a PCR tube or well. For non-template controls (NTC), 10 μL of nuclease-free water was substituted for the DNA sample. A pre-mix of 2 μL/sample Template Preparation Buffer ((6.5× ATP-free ligase buffer comprising: 325 mM Tris-HCl pH 7.6 @ 25° C., 65 mM MgCl2, 3.25 mM DTT) supplemented with dNTP mix (2.5 mM each dNTP)) and 1 μL/sample Template Preparation Enzyme (End Repair Mix, Enzymatics Cat # Y914-LC-L) was prepared in a separate tube and mixed by pipette. Then, 3 μL of the pre-mix was added to the 10 μL DNA sample in the PCR tube or well and mixed 4-5 times with a pipette set to 8 μL. The final concentration of the reaction components was as follows: 50 mM Tris-HCl pH 7.6 @ 25° C., 10 mM MgCl2, 0.5 mM DTT, 385 μM dNTPs, 1× End Repair Enzymes. The PCR plate was centrifuged and incubated in a thermal cycler using the following conditions: 1 cycle at 22° C. for 25 min; 1 cycle at 55° C. for 20 min; hold at 22° C.
- Library Synthesis. Fresh Library Synthesis pre-mix of 1 μL/sample Library Synthesis Buffer (2× ATP-free ligase buffer comprising: 100 mM Tris-HCl pH 7.6 @ 25° C., 20 mM MgCl2, 1.0 mM DTT supplemented with 15 mM ATP and 15 μM each stem-loop adaptor oligo) and 1 μL/sample Library Synthesis Enzyme Mix (comprising: 1.2 U Uracil DNA Glycosylase (UDG, Enzymatics # G5010L) and 8 U T4 DNA Ligase (Enzymatics # L603-HC-L) per μL) was prepared in a separate tube and mixed by pipette. Then, 2 μL of the Library Synthesis pre-mix was added to each sample and mixed 4-5 times with a pipette set to 10 μL. The final concentration of the reaction components was as follows: 50 mM Tris-HCl pH 7.6 @ 25° C., 10 mM MgCl2, 0.5 mM DTT, 334 μM dNTPs, 1 mM ATP, 1.2 U Uracil DNA Glycosylase, 8 U T4 DNA Ligase, 1 μM each adaptor oligo. The plate was centrifuged and incubated in a thermal cycler using the following conditions: 1 cycle at 22° C. for 40 min; hold at 4° C.
- ThruPLEX-FD Library Amplification. Library Amplification pre-mix of 4.25 μL/sample nuclease-free water, 3.75 μL/sample EvaGreen:FC (9:1), 50.5 μL/sample Library Amplification Buffer (comprising: 150 mM Tris-SO4, pH 8.5 @ 25° C., 120 mM TMAC, 0.75 mM MgCl2, 0.06% w/v Gelatin, supplemented with 0.375 μM of each PCR oligo), and 1.5 μL/sample Library Amplification Enzyme (KAPA HiFi DNA Polymerase (KK2102) at 1 U/μL) was prepared in a separate tube immediately prior to use. Then, 60 μL of the Library Amplification pre-mix was added to each library and mixed 3-4 times with a pipette set to 60 μL. The final concentration of the reaction components was as follows: 100 mM Tris-SO4, pH 8.5 @ 25° C., 80 mM TMAC, 2.5 mM MgCl2, 0.04% w/v Gelatin, 1× EvaGreen, 1× FCD, 1.5 U KAPA HiFi DNA Polymerase, 0.25 μM each PCR oligo. The plates were centrifuged and then incubated in a real-time thermal cycler as follows: 1 cycle at 72° C. for 3 min; 1 cycle at 85° C. for 2 min; 1 cycle at 98° C. for 2 min; 4 cycles of 98° C. for 20 sec, 67° C. for 20 sec, 72° C. for 40 sec; and 4-21 cycles of 98° C. for 20 sec and 72° C. for 50 sec.
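The final concentrations stated in the three steps above can be cross-checked from the stock concentrations and volumes with a short dilution calculation. This is a minimal sketch under stated assumptions: volumes are taken as additive (13 μL after template preparation, 15 μL after library synthesis), the full 15 μL library is assumed to be carried into a 75 μL amplification reaction, and the stated 2.5 mM MgCl2 is assumed to include MgCl2 carried over from the library. The helper `final_conc` is hypothetical.

```python
def final_conc(contributions, total_volume):
    """Final concentration from (stock concentration, volume added) pairs."""
    return sum(conc * vol for conc, vol in contributions) / total_volume


# Template preparation: 2 uL of 6.5x buffer in 13 uL total (10 uL DNA + 3 uL pre-mix)
tris_tp = final_conc([(325.0, 2.0)], 13.0)    # mM Tris-HCl
mg_tp = final_conc([(65.0, 2.0)], 13.0)       # mM MgCl2
dntp_tp = final_conc([(2500.0, 2.0)], 13.0)   # uM each dNTP

# Library synthesis: 13 uL carried over plus 1 uL of 2x buffer, 15 uL total
mg_ls = final_conc([(mg_tp, 13.0), (20.0, 1.0)], 15.0)  # mM MgCl2
dntp_ls = final_conc([(dntp_tp, 13.0)], 15.0)           # uM each dNTP
atp_ls = final_conc([(15.0, 1.0)], 15.0)                # mM ATP
adaptor_ls = final_conc([(15.0, 1.0)], 15.0)            # uM each adaptor

# Amplification (assumed): 15 uL library plus 60 uL pre-mix, 75 uL total
mg_amp = final_conc([(mg_ls, 15.0), (0.75, 50.5)], 75.0)  # mM MgCl2
oligo_amp = final_conc([(0.375, 50.5)], 75.0)             # uM each PCR oligo

for name, value in [("Tris (template prep, mM)", tris_tp),
                    ("dNTP (template prep, uM)", dntp_tp),
                    ("dNTP (library synthesis, uM)", dntp_ls),
                    ("MgCl2 (amplification, mM)", mg_amp),
                    ("PCR oligo (amplification, uM)", oligo_amp)]:
    print(f"{name}: {value:.3g}")
```

Under these assumptions the computed values come out close to the stated ones (for example, roughly 385 μM and 334 μM dNTPs, and about 2.5 mM MgCl2 in the amplification reaction), with small differences attributable to rounding.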
- Conclusion. Significant amplification of adaptor dimers occurred (FIG. 4). This may be due to the lack of suppression in the construct; therefore, the inventors tested a version containing sequences to generate terminal inverted repeats and comprising the proximal inline barcode.
- The inventors sought to test Ion adaptors that contain tandem dual barcodes with terminal inverted repeats represented by the second barcode (proximal to the site of ligation). Stem-loop adaptors and PCR primers were designed as shown in FIG. 5 and Table 2.
TABLE 2 Oligonucleotide Sequence Generating Terminal Inverted Repeats
Oligo Name | SEQ ID NO: | Sequence |
---|---|---|
Ion PCR Oligo P1 | 5 | CCACTACGCCTCCGCTTTCCTCTCTATGGGCAGTCGGTGATAAGAGGATTCCCGGATTG |
Ion Universal Adaptor P1/A | 6 | CAATCCUGGAAUCCTCTTATCUUUUUUGATAAGAGGATTCCCGGATTG |
Ion PCR Primer A | 7 | CCATCTCATCCCTGCGTGTCTCCGACTCAG CTAAGGTAAC GATAAGAGGATTCCCGGATTG |
Underlining = first barcode; Underlining and bold = second barcode.
- The Tm of the Ion Universal Adaptor P1/A was computed using the Oligo Analyzer (IDT) at 0.25 μM oligo, 100 mM Na+, 2.5 mM Mg++, and 0.3 mM dNTPs to be 61° C.
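Reading the Table 2 sequences as plain strings, one can check that both PCR oligonucleotides end with the same sequence as the 3′ arm of the universal stem-loop adaptor, which is consistent with both ends of an amplified molecule sharing a common terminal sequence. This is only an illustrative sketch: taking everything after the last dU as the 3′ stem arm is an assumption about the adaptor layout, and `three_prime_arm` is a hypothetical helper.

```python
# Sequences copied from Table 2 (internal spaces removed).
adaptor = "CAATCCUGGAAUCCTCTTATCUUUUUUGATAAGAGGATTCCCGGATTG"  # SEQ ID NO: 6
primers = {
    "Ion PCR Oligo P1":
        "CCACTACGCCTCCGCTTTCCTCTCTATGGGCAGTCGGTGATAAGAGGATTCCCGGATTG",
    "Ion PCR Primer A":
        "CCATCTCATCCCTGCGTGTCTCCGACTCAGCTAAGGTAACGATAAGAGGATTCCCGGATTG",
}


def three_prime_arm(stem_loop):
    """Sequence following the last dU, taken here as the 3' stem arm."""
    return stem_loop[stem_loop.rindex("U") + 1:]


arm = three_prime_arm(adaptor)
shared = {name: seq.endswith(arm) for name, seq in primers.items()}

print(arm, len(arm))
for name, ok in shared.items():
    print(name, "ends with the adaptor 3' arm:", ok)
```

Both oligonucleotides share the 21-nucleotide 3′-terminal sequence of the adaptor arm, while differing in their 5′ portions.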
- The experimental conditions were as described in Example 1, except that the oligonucleotide sequences indicated in Table 2 were used and a single universal stem-loop adaptor (2 μM in the library synthesis reaction) was used to attach terminal inverted repeats to both ends of the DNA fragments. The stem-loop adaptor containing sequences for generating terminal inverted repeats showed a significant improvement in signal-to-noise ratio (FIG. 6) over the design containing no such sequences for generating inverted repeats described in Example 1 (FIG. 4).
- All of the methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
- The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference.
- U.S. Pat. No. 5,858,656
- U.S. Pat. No. 5,935,793
- U.S. Pat. No. 6,261,782
- U.S. Pat. No. 7,803,550
- U.S. Pat. No. 7,902,122
- U.S. Pat. Publn. No. 2007/0020640
- U.S. Pat. Publn. No. 2009/0068645
- U.S. Pat. Publn. No. 2010/0227329
- U.S. Pat. Publn. No. 2010/0273219
- U.S. Pat. Publn. No. 2011/0015096
- U.S. Pat. Publn. No. 2011/0257031
- U.S. Pat. Publn. No. 2011/0319290
- U.S. Pat. Publn. No. 2012/0028814
- Ausubel et al., In: Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 2003.
- Chenchik et al., Full-length cDNA cloning and determination of mRNA 5′ and 3′ ends by amplification of adaptor-ligated cDNA, Biotechniques, 21:526-534, 1996.
- Lukyanov et al., Selective suppression of polymerase chain reaction, Bioorganicheskaya Khimiya, 25:163-170, 1999.
- Oyola et al., Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes, BMC Genomics, 13:1, 2012.
- Pareek et al., Sequencing technologies and genome sequencing, J. Appl. Genet., 52(4):413-435, 2011.
- Sambrook et al., In: Molecular cloning: a laboratory manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.
- Shagin et al., Regulation of average length of complex PCR product, Nucleic Acids Research, 27, e23, 1999.
- Siebert et al., An Improved PCR Method for Walking in Uncloned Genomic DNA, Nucleic Acids Research, 23:1087-1088, 1995.
- Thudi et al., Current state-of-art of sequencing technologies for plant genomics research, Brief Funct. Genomics., 11(1):3-11, 2012.
Claims (21)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/184,048 US20210189382A1 (en) | 2012-11-05 | 2021-02-24 | Barcoding Nucleic Acids |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261722357P | 2012-11-05 | 2012-11-05 | |
PCT/US2013/068468 WO2014071361A1 (en) | 2012-11-05 | 2013-11-05 | Barcoding nucleic acids |
US201514438280A | 2015-04-24 | 2015-04-24 | |
US16/183,107 US10961529B2 (en) | 2012-11-05 | 2018-11-07 | Barcoding nucleic acids |
US17/184,048 US20210189382A1 (en) | 2012-11-05 | 2021-02-24 | Barcoding Nucleic Acids |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/183,107 Continuation US10961529B2 (en) | 2012-11-05 | 2018-11-07 | Barcoding nucleic acids |
Publications (1)
Publication Number | Publication Date |
---|---|
US20210189382A1 (en) | 2021-06-24 |
Family
ID=49724653
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/438,280 Active 2034-08-24 US10155942B2 (en) | 2012-11-05 | 2013-11-05 | Barcoding nucleic acids |
US16/183,107 Active US10961529B2 (en) | 2012-11-05 | 2018-11-07 | Barcoding nucleic acids |
US17/184,048 Abandoned US20210189382A1 (en) | 2012-11-05 | 2021-02-24 | Barcoding Nucleic Acids |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/438,280 Active 2034-08-24 US10155942B2 (en) | 2012-11-05 | 2013-11-05 | Barcoding nucleic acids |
US16/183,107 Active US10961529B2 (en) | 2012-11-05 | 2018-11-07 | Barcoding nucleic acids |
Country Status (8)
Country | Link |
---|---|
US (3) | US10155942B2 (en) |
EP (1) | EP2914745B1 (en) |
JP (1) | JP6454281B2 (en) |
CN (2) | CN104903466B (en) |
AU (1) | AU2013337280B2 (en) |
CA (1) | CA2889862C (en) |
HK (1) | HK1209162A1 (en) |
WO (1) | WO2014071361A1 (en) |
Families Citing this family (124)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8835358B2 (en) | 2009-12-15 | 2014-09-16 | Cellular Research, Inc. | Digital counting of individual molecules by stochastic attachment of diverse labels |
US10941396B2 (en) | 2012-02-27 | 2021-03-09 | Becton, Dickinson And Company | Compositions and kits for molecular counting |
CA2872017A1 (en) * | 2012-05-09 | 2013-11-14 | Apdn (B.V.I.) Inc. | Verification of physical encryption taggants using digital representatives and authentications thereof |
CN113528634A (en) | 2012-08-14 | 2021-10-22 | 10X基因组学有限公司 | Microcapsule compositions and methods |
US10400280B2 (en) | 2012-08-14 | 2019-09-03 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US10752949B2 (en) | 2012-08-14 | 2020-08-25 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US10323279B2 (en) | 2012-08-14 | 2019-06-18 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US9951386B2 (en) | 2014-06-26 | 2018-04-24 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US10273541B2 (en) | 2012-08-14 | 2019-04-30 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US11591637B2 (en) | 2012-08-14 | 2023-02-28 | 10X Genomics, Inc. | Compositions and methods for sample processing |
US10221442B2 (en) | 2012-08-14 | 2019-03-05 | 10X Genomics, Inc. | Compositions and methods for sample processing |
US9701998B2 (en) | 2012-12-14 | 2017-07-11 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
EP3567116A1 (en) | 2012-12-14 | 2019-11-13 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US10533221B2 (en) | 2012-12-14 | 2020-01-14 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
CA2900543C (en) | 2013-02-08 | 2023-01-31 | 10X Genomics, Inc. | Partitioning and processing of analytes and other species |
EP3988671A1 (en) * | 2013-02-20 | 2022-04-27 | Emory University | Compositions for sequencing nucleic acids in mixtures |
SG11201508193TA (en) | 2013-04-17 | 2015-11-27 | Agency Science Tech & Res | Method for generating extended sequence reads |
DK3013984T3 (en) | 2013-06-25 | 2023-06-06 | Prognosys Biosciences Inc | METHOD FOR DETERMINING SPATIAL PATTERNS IN BIOLOGICAL TARGETS IN A SAMPLE |
DK3039158T3 (en) | 2013-08-28 | 2019-03-04 | Becton Dickinson Co | MASSIVE PARALLEL SINGLE CELL CELL ANALYSIS |
US9582877B2 (en) | 2013-10-07 | 2017-02-28 | Cellular Research, Inc. | Methods and systems for digitally counting features on arrays |
KR102313982B1 (en) | 2014-03-11 | 2021-10-18 | 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 | High-throughput and highly multiplexed imaging with programmable nucleic acid probes |
CN106413896B (en) | 2014-04-10 | 2019-07-05 | 10X基因组学有限公司 | For encapsulating and dividing fluid means, system and method and its application of reagent |
EP3161160B1 (en) | 2014-06-26 | 2021-10-13 | 10X Genomics, Inc. | Methods of analyzing nucleic acids from individual cells or cell populations |
EP3169799B1 (en) * | 2014-07-15 | 2019-09-04 | Qiagen Sciences, LLC | Semi-random barcodes for nucleic acid analysis |
US10590483B2 (en) * | 2014-09-15 | 2020-03-17 | Abvitro Llc | High-throughput nucleotide library sequencing |
CN114807307A (en) | 2014-10-29 | 2022-07-29 | 10X 基因组学有限公司 | Methods and compositions for sequencing target nucleic acids |
US9975122B2 (en) | 2014-11-05 | 2018-05-22 | 10X Genomics, Inc. | Instrument systems for integrated sample processing |
SG11201705615UA (en) | 2015-01-12 | 2017-08-30 | 10X Genomics Inc | Processes and systems for preparing nucleic acid sequencing libraries and libraries prepared using same |
US11639522B2 (en) | 2015-01-30 | 2023-05-02 | President And Fellows Of Harvard College | Microscope-free imaging |
EP3766988B1 (en) | 2015-02-19 | 2024-02-14 | Becton, Dickinson and Company | High-throughput single-cell analysis combining proteomic and genomic information |
EP3262407B1 (en) | 2015-02-24 | 2023-08-30 | 10X Genomics, Inc. | Partition processing methods and systems |
WO2016138080A1 (en) | 2015-02-24 | 2016-09-01 | Trustees Of Boston University | Protection of barcodes during dna amplification using molecular hairpins |
JP2018505688A (en) | 2015-02-24 | 2018-03-01 | 10エックス ゲノミクス,インコーポレイテッド | Method for targeted nucleic acid sequence coverage (COVERAGE) |
US9727810B2 (en) | 2015-02-27 | 2017-08-08 | Cellular Research, Inc. | Spatially addressable molecular barcoding |
WO2016149418A1 (en) * | 2015-03-18 | 2016-09-22 | Cellular Research, Inc. | Methods and compositions for labeling targets and haplotype phasing |
EP3277843A2 (en) | 2015-03-30 | 2018-02-07 | Cellular Research, Inc. | Methods and compositions for combinatorial barcoding |
WO2016168351A1 (en) * | 2015-04-15 | 2016-10-20 | The Board Of Trustees Of The Leland Stanford Junior University | Robust quantification of single molecules in next-generation sequencing using non-random combinatorial oligonucleotide barcodes |
EP3286326A1 (en) * | 2015-04-23 | 2018-02-28 | Cellular Research, Inc. | Methods and compositions for whole transcriptome amplification |
WO2016195963A1 (en) * | 2015-05-29 | 2016-12-08 | Tsavachidou Dimitra | Methods for constructing consecutively connected copies of nucleic acid molecules |
WO2016196229A1 (en) | 2015-06-01 | 2016-12-08 | Cellular Research, Inc. | Methods for rna quantification |
WO2016195382A1 (en) * | 2015-06-01 | 2016-12-08 | 연세대학교 산학협력단 | Next-generation nucleotide sequencing using adaptor comprising bar code sequence |
GB2539675B (en) * | 2015-06-23 | 2017-11-22 | Cs Genetics Ltd | Libraries of multimeric barcoding reagents and kits thereof for labelling nucleic acids for sequencing |
ES2841077T3 (en) * | 2015-08-06 | 2021-07-07 | Hoffmann La Roche | Target Enrichment by Single Probe Primer Extension |
CA3000816A1 (en) * | 2015-09-11 | 2017-03-16 | The General Hospital Corporation | Full interrogation of nuclease dsbs and sequencing (find-seq) |
CN108026524A (en) | 2015-09-11 | 2018-05-11 | 赛卢拉研究公司 | Method and composition for nucleic acid library standardization |
EP3371309B1 (en) * | 2015-11-04 | 2023-07-05 | Atreca, Inc. | Combinatorial sets of nucleic acid barcodes for analysis of nucleic acids associated with single cells |
US10774370B2 (en) | 2015-12-04 | 2020-09-15 | 10X Genomics, Inc. | Methods and compositions for nucleic acid analysis |
WO2017120531A1 (en) | 2016-01-08 | 2017-07-13 | Bio-Rad Laboratories, Inc. | Multiple beads per droplet resolution |
ITUA20162640A1 (en) * | 2016-04-15 | 2017-10-15 | Menarini Silicon Biosystems Spa | METHOD AND KIT FOR THE GENERATION OF DNA LIBRARIES FOR MASSIVELY PARALLEL SEQUENCING |
US10822643B2 (en) | 2016-05-02 | 2020-11-03 | Cellular Research, Inc. | Accurate molecular barcoding |
WO2017197338A1 (en) | 2016-05-13 | 2017-11-16 | 10X Genomics, Inc. | Microfluidic systems and methods of use |
US20190292575A1 (en) * | 2016-05-24 | 2019-09-26 | The Translational Genomics Research Institute | Molecular tagging methods and sequencing libraries |
KR20170133270A (en) * | 2016-05-25 | 2017-12-05 | 주식회사 셀레믹스 | Method for preparing libraries for massively parallel sequencing using molecular barcoding and the use thereof |
US10301677B2 (en) | 2016-05-25 | 2019-05-28 | Cellular Research, Inc. | Normalization of nucleic acid libraries |
US11397882B2 (en) | 2016-05-26 | 2022-07-26 | Becton, Dickinson And Company | Molecular label counting adjustment methods |
US10640763B2 (en) | 2016-05-31 | 2020-05-05 | Cellular Research, Inc. | Molecular indexing of internal sequences |
US10202641B2 (en) | 2016-05-31 | 2019-02-12 | Cellular Research, Inc. | Error correction in amplification of samples |
US11708574B2 (en) | 2016-06-10 | 2023-07-25 | Myriad Women's Health, Inc. | Nucleic acid sequencing adapters and uses thereof |
WO2018045109A1 (en) * | 2016-08-30 | 2018-03-08 | Metabiotech Corporation | Methods and compositions for phased sequencing |
CN109923213B (en) | 2016-09-20 | 2023-02-28 | 哈佛学院院长及董事 | Molecular verification system |
ES2961743T3 (en) | 2016-09-26 | 2024-03-13 | Becton Dickinson Co | Measurement of protein expression using reagents with barcoded oligonucleotide sequences |
EP3518974A4 (en) | 2016-09-29 | 2020-05-27 | Myriad Women's Health, Inc. | Noninvasive prenatal screening using dynamic iterative depth optimization |
WO2018069484A2 (en) * | 2016-10-13 | 2018-04-19 | F. Hoffmann-La Roche Ag | Molecular detection and counting using nanopores |
DK3529357T3 (en) * | 2016-10-19 | 2022-04-25 | 10X Genomics Inc | Methods for bar coding nucleic acid molecules from individual cells |
WO2018081113A1 (en) | 2016-10-24 | 2018-05-03 | Sawaya Sterling | Concealing information present within nucleic acids |
EP3532635B1 (en) * | 2016-10-31 | 2021-06-09 | F. Hoffmann-La Roche AG | Barcoded circular library construction for identification of chimeric products |
JP7232180B2 (en) | 2016-11-08 | 2023-03-02 | ベクトン・ディキンソン・アンド・カンパニー | Methods of expression profile classification |
CN109906274B (en) | 2016-11-08 | 2023-08-25 | 贝克顿迪金森公司 | Methods for cell marker classification |
US10815525B2 (en) | 2016-12-22 | 2020-10-27 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US10550429B2 (en) | 2016-12-22 | 2020-02-04 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
US10011872B1 (en) | 2016-12-22 | 2018-07-03 | 10X Genomics, Inc. | Methods and systems for processing polynucleotides |
GB201622222D0 (en) | 2016-12-23 | 2017-02-08 | Cs Genetics Ltd | Reagents and methods for molecular barcoding of nucleic acids of single cells |
US10722880B2 (en) | 2017-01-13 | 2020-07-28 | Cellular Research, Inc. | Hydrophilic coating of fluidic channels |
EP4310183A3 (en) | 2017-01-30 | 2024-02-21 | 10X Genomics, Inc. | Methods and systems for droplet-based single cell barcoding |
WO2018144216A1 (en) * | 2017-01-31 | 2018-08-09 | Counsyl, Inc. | Methods and compositions for enrichment of target polynucleotides |
WO2018144217A1 (en) | 2017-01-31 | 2018-08-09 | Counsyl, Inc. | Methods and compositions for enrichment of target polynucleotides |
US11319583B2 (en) | 2017-02-01 | 2022-05-03 | Becton, Dickinson And Company | Selective amplification using blocking oligonucleotides |
WO2018148289A2 (en) * | 2017-02-08 | 2018-08-16 | Integrated Dna Technologies, Inc. | Duplex adapters and duplex sequencing |
EP3360962A1 (en) * | 2017-02-14 | 2018-08-15 | Technische Universität Dortmund | Synthesis of dna-encoded libraries by micellar catalysis |
WO2018175907A1 (en) | 2017-03-24 | 2018-09-27 | Counsyl, Inc. | Copy number variant caller |
US10844372B2 (en) | 2017-05-26 | 2020-11-24 | 10X Genomics, Inc. | Single cell analysis of transposase accessible chromatin |
SG11201901822QA (en) | 2017-05-26 | 2019-03-28 | 10X Genomics Inc | Single cell analysis of transposase accessible chromatin |
US10676779B2 (en) | 2017-06-05 | 2020-06-09 | Becton, Dickinson And Company | Sample indexing for single cells |
US10633651B2 (en) | 2017-07-10 | 2020-04-28 | Agilent Technologies, Inc. | Assay methods and compositions for detecting contamination of nucleic acid identifiers |
US11505826B2 (en) | 2017-07-12 | 2022-11-22 | Agilent Technologies, Inc. | Sequencing method for genomic rearrangement detection |
EP3431611A1 (en) | 2017-07-21 | 2019-01-23 | Menarini Silicon Biosystems S.p.A. | Improved method and kit for the generation of dna libraries for massively parallel sequencing |
JP2020532967A (en) | 2017-08-23 | 2020-11-19 | ザ ジェネラル ホスピタル コーポレイション | Genetically engineered CRISPR-Cas9 nuclease with altered PAM specificity |
EP3694993A4 (en) | 2017-10-11 | 2021-10-13 | The General Hospital Corporation | Methods for detecting site-specific and spurious genomic deamination induced by base editing technologies |
SG11201913654QA (en) | 2017-11-15 | 2020-01-30 | 10X Genomics Inc | Functionalized gel beads |
US10829815B2 (en) | 2017-11-17 | 2020-11-10 | 10X Genomics, Inc. | Methods and systems for associating physical and genetic properties of biological particles |
EP3728636A1 (en) | 2017-12-19 | 2020-10-28 | Becton, Dickinson and Company | Particles associated with oligonucleotides |
CN112189055A (en) * | 2018-03-22 | 2021-01-05 | 哈佛学院院长及董事 | Methods and compositions for molecular authentication |
GB201804641D0 (en) * | 2018-03-22 | 2018-05-09 | Inivata Ltd | Methods of sequencing nucleic acids and error correction of sequence reads |
CN112262218A (en) | 2018-04-06 | 2021-01-22 | 10X基因组学有限公司 | System and method for quality control in single cell processing |
EP3781585A4 (en) | 2018-04-17 | 2022-01-26 | The General Hospital Corporation | Sensitive in vitro assays for substrate preferences and sites of nucleic acid binding, modifying, and cleaving agents |
JP7358388B2 (en) * | 2018-05-03 | 2023-10-10 | ベクトン・ディキンソン・アンド・カンパニー | Molecular barcoding at opposite transcript ends |
JP7407128B2 (en) | 2018-05-03 | 2023-12-28 | ベクトン・ディキンソン・アンド・カンパニー | High-throughput multi-omics sample analysis |
US20210163927A1 (en) * | 2018-06-15 | 2021-06-03 | Roche Sequencing Solutions, Inc. | Generation of double-stranded dna templates for single molecule sequencing |
DK3604525T3 (en) | 2018-08-02 | 2021-05-31 | Univ Dresden Tech | A method of providing a DNA-encoded library, a DNA-encoded library, and a method of decoding a DNA-encoded library |
EP3861134A1 (en) | 2018-10-01 | 2021-08-11 | Becton, Dickinson and Company | Determining 5' transcript sequences |
WO2020097315A1 (en) | 2018-11-08 | 2020-05-14 | Cellular Research, Inc. | Whole transcriptome analysis of single cells using random priming |
SG11202105824RA (en) | 2018-12-10 | 2021-06-29 | 10X Genomics Inc | Imaging system hardware |
WO2020123384A1 (en) | 2018-12-13 | 2020-06-18 | Cellular Research, Inc. | Selective extension in single cell whole transcriptome analysis |
EP3666904A1 (en) | 2018-12-14 | 2020-06-17 | Lexogen GmbH | Nucleic acid amplification and identification method |
US11926867B2 (en) | 2019-01-06 | 2024-03-12 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
WO2020150356A1 (en) | 2019-01-16 | 2020-07-23 | Becton, Dickinson And Company | Polymerase chain reaction normalization through primer titration |
WO2020154307A1 (en) * | 2019-01-22 | 2020-07-30 | Singular Genomics Systems, Inc. | Polynucleotide barcodes for multiplexed proteomics |
EP3914728B1 (en) | 2019-01-23 | 2023-04-05 | Becton, Dickinson and Company | Oligonucleotides associated with antibodies |
EP4004231A1 (en) | 2019-07-22 | 2022-06-01 | Becton, Dickinson and Company | Single cell chromatin immunoprecipitation sequencing assay |
WO2021092386A1 (en) | 2019-11-08 | 2021-05-14 | Becton Dickinson And Company | Using random priming to obtain full-length v(d)j information for immune repertoire sequencing |
CN115244184A (en) | 2020-01-13 | 2022-10-25 | 贝克顿迪金森公司 | Methods and compositions for quantifying protein and RNA |
US11898205B2 (en) | 2020-02-03 | 2024-02-13 | 10X Genomics, Inc. | Increasing capture efficiency of spatial assays |
US11891654B2 (en) | 2020-02-24 | 2024-02-06 | 10X Genomics, Inc. | Methods of making gene expression libraries |
EP4150118A1 (en) | 2020-05-14 | 2023-03-22 | Becton Dickinson and Company | Primers for immune repertoire profiling |
EP4153775A1 (en) | 2020-05-22 | 2023-03-29 | 10X Genomics, Inc. | Simultaneous spatio-temporal measurement of gene expression and cellular activity |
US11761038B1 (en) | 2020-07-06 | 2023-09-19 | 10X Genomics, Inc. | Methods for identifying a location of an RNA in a biological sample |
US11932901B2 (en) | 2020-07-13 | 2024-03-19 | Becton, Dickinson And Company | Target enrichment using nucleic acid probes for scRNAseq |
US11926822B1 (en) | 2020-09-23 | 2024-03-12 | 10X Genomics, Inc. | Three-dimensional spatial analysis |
US11827935B1 (en) | 2020-11-19 | 2023-11-28 | 10X Genomics, Inc. | Methods for spatial analysis using rolling circle amplification and detection probes |
CN116635533A (en) | 2020-11-20 | 2023-08-22 | 贝克顿迪金森公司 | Profiling of high and low expressed proteins |
EP4121555A1 (en) | 2020-12-21 | 2023-01-25 | 10X Genomics, Inc. | Methods, compositions, and systems for capturing probes and/or barcodes |
WO2022181858A1 (en) * | 2021-02-26 | 2022-09-01 | 지니너스 주식회사 | Composition for improving molecular barcoding efficiency and use thereof |
KR20220122095A (en) | 2021-02-26 | 2022-09-02 | 지니너스 주식회사 | Composition for improving molecular barcoding efficiency and use thereof |
WO2023034489A1 (en) | 2021-09-01 | 2023-03-09 | 10X Genomics, Inc. | Methods, compositions, and kits for blocking a capture probe on a spatial array |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070031857A1 (en) * | 2005-08-02 | 2007-02-08 | Rubicon Genomics, Inc. | Compositions and methods for processing and amplification of DNA, including using multiple enzymes in a single reaction |
US20120015821A1 (en) * | 2009-09-09 | 2012-01-19 | Life Technologies Corporation | Methods of Generating Gene Specific Libraries |
US20120244525A1 (en) * | 2010-07-19 | 2012-09-27 | New England Biolabs, Inc. | Oligonucleotide Adapters: Compositions and Methods of Use |
US20160208322A1 (en) * | 2011-05-20 | 2016-07-21 | Fluidigm Corporation | Nucleic acid encoding reactions |
Family Cites Families (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2036946C (en) | 1990-04-06 | 2001-10-16 | Kenneth V. Deugau | Indexing linkers |
US5935793A (en) | 1996-09-27 | 1999-08-10 | The Chinese University Of Hong Kong | Parallel polynucleotide sequencing method using tagged primers |
AU778438B2 (en) | 1999-04-06 | 2004-12-02 | Yale University | Fixed address analysis of sequence tags |
US20040209299A1 (en) * | 2003-03-07 | 2004-10-21 | Rubicon Genomics, Inc. | In vitro DNA immortalization and whole genome amplification using libraries generated from randomly fragmented DNA |
GB0400584D0 (en) * | 2004-01-12 | 2004-02-11 | Solexa Ltd | Nucleic acid characterisation |
GB0422551D0 (en) | 2004-10-11 | 2004-11-10 | Univ Liverpool | Labelling and sequencing of nucleic acids |
US7393665B2 (en) | 2005-02-10 | 2008-07-01 | Population Genetics Technologies Ltd | Methods and compositions for tagging and identifying polynucleotides |
US20070020640A1 (en) | 2005-07-21 | 2007-01-25 | Mccloskey Megan L | Molecular encoding of nucleic acid templates for PCR and other forms of sequence analysis |
WO2007018602A1 (en) | 2005-08-02 | 2007-02-15 | Rubicon Genomics, Inc. | Isolation of cpg islands by thermal segregation and enzymatic selection-amplification method |
US7537897B2 (en) | 2006-01-23 | 2009-05-26 | Population Genetics Technologies, Ltd. | Molecular counting |
KR100823684B1 (en) | 2006-12-06 | 2008-04-21 | 한국전자통신연구원 | Method for detecting a biological target material using barcode dna |
CA2697640C (en) | 2007-09-21 | 2016-06-21 | Katholieke Universiteit Leuven | Tools and methods for genetic tests using next generation sequencing |
US8268564B2 (en) | 2007-09-26 | 2012-09-18 | President And Fellows Of Harvard College | Methods and applications for stitched DNA barcodes |
US8586310B2 (en) | 2008-09-05 | 2013-11-19 | Washington University | Method for multiplexed nucleic acid patch polymerase chain reaction |
CN102203273A (en) | 2008-09-09 | 2011-09-28 | 生命技术公司 | Methods of generating gene specific libraries |
US8383345B2 (en) | 2008-09-12 | 2013-02-26 | University Of Washington | Sequence tag directed subassembly of short sequencing reads into long sequencing reads |
US9080211B2 (en) * | 2008-10-24 | 2015-07-14 | Epicentre Technologies Corporation | Transposon end compositions and methods for modifying nucleic acids |
US20110301042A1 (en) | 2008-11-11 | 2011-12-08 | Helicos Biosciences Corporation | Methods of sample encoding for multiplex analysis of samples by single molecule sequencing |
KR101829182B1 (en) | 2009-04-02 | 2018-03-29 | 플루이다임 코포레이션 | Multi-primer amplification method for barcoding of target nucleic acids |
US8481699B2 (en) | 2009-07-14 | 2013-07-09 | Academia Sinica | Multiplex barcoded Paired-End ditag (mbPED) library construction for ultra high throughput sequencing |
US9315857B2 (en) | 2009-12-15 | 2016-04-19 | Cellular Research, Inc. | Digital counting of individual molecules by stochastic attachment of diverse label-tags |
US8835358B2 (en) | 2009-12-15 | 2014-09-16 | Cellular Research, Inc. | Digital counting of individual molecules by stochastic attachment of diverse labels |
US20110257031A1 (en) | 2010-02-12 | 2011-10-20 | Life Technologies Corporation | Nucleic acid, biomolecule and polymer identifier codes |
WO2011140510A2 (en) | 2010-05-06 | 2011-11-10 | Bioo Scientific Corporation | Oligonucleotide ligation, barcoding and methods and compositions for improving data quality and throughput using massively parallel sequencing |
EP2580378A4 (en) | 2010-06-08 | 2014-01-01 | Nugen Technologies Inc | Methods and composition for multiplex sequencing |
EP2619327B1 (en) | 2010-09-21 | 2014-10-22 | Population Genetics Technologies LTD. | Increasing confidence of allele calls with molecular counting |
US8865404B2 (en) * | 2010-11-05 | 2014-10-21 | President And Fellows Of Harvard College | Methods for sequencing nucleic acid molecules |
WO2012103154A1 (en) * | 2011-01-24 | 2012-08-02 | Nugen Technologies, Inc. | Stem-loop composite rna-dna adaptor-primers: compositions and methods for library generation, amplification and other downstream manipulations |
EP3736281A1 (en) | 2011-02-18 | 2020-11-11 | Bio-Rad Laboratories, Inc. | Compositions and methods for molecular labeling |
US9260753B2 (en) | 2011-03-24 | 2016-02-16 | President And Fellows Of Harvard College | Single cell nucleic acid detection and analysis |
US9476095B2 (en) | 2011-04-15 | 2016-10-25 | The Johns Hopkins University | Safe sequencing system |
GB201113214D0 (en) | 2011-07-29 | 2011-09-14 | Univ East Anglia | Analysing sequencing bias |
GB2496016B (en) | 2011-09-09 | 2016-03-16 | Univ Leland Stanford Junior | Methods for obtaining a sequence |
CN108611398A (en) * | 2012-01-13 | 2018-10-02 | Data生物有限公司 | Genotyping by next-generation sequencing |
PL2828218T3 (en) | 2012-03-20 | 2021-01-11 | University Of Washington Through Its Center For Commercialization | Methods of lowering the error rate of massively parallel dna sequencing using duplex consensus sequencing |
CA2872141C (en) | 2012-05-31 | 2016-01-19 | Board Of Regents, The University Of Texas System | Method for accurate sequencing of dna |
US20150011396A1 (en) * | 2012-07-09 | 2015-01-08 | Benjamin G. Schroeder | Methods for creating directional bisulfite-converted nucleic acid libraries for next generation sequencing |

2013
- 2013-11-05 CN CN201380069090.3A (CN104903466B): Active
- 2013-11-05 CA CA2889862A (CA2889862C): Active
- 2013-11-05 EP EP13801915.3A (EP2914745B1): Active
- 2013-11-05 US US14/438,280 (US10155942B2): Active
- 2013-11-05 CN CN201610959042.7A (CN107090491A): Pending
- 2013-11-05 AU AU2013337280A (AU2013337280B2): Active
- 2013-11-05 WO PCT/US2013/068468 (WO2014071361A1): Application Filing
- 2013-11-05 JP JP2015540858A (JP6454281B2): Active

2015
- 2015-10-08 HK HK15109828.2A (HK1209162A1): status unknown

2018
- 2018-11-07 US US16/183,107 (US10961529B2): Active

2021
- 2021-02-24 US US17/184,048 (US20210189382A1): Abandoned

Also Published As
Publication number | Publication date |
---|---|
JP6454281B2 (en) | 2019-01-16 |
US20190153434A1 (en) | 2019-05-23 |
CN104903466B (en) | 2016-11-23 |
JP2015533296A (en) | 2015-11-24 |
CA2889862C (en) | 2021-02-16 |
WO2014071361A1 (en) | 2014-05-08 |
CA2889862A1 (en) | 2014-05-08 |
AU2013337280A1 (en) | 2015-05-14 |
US10961529B2 (en) | 2021-03-30 |
EP2914745A1 (en) | 2015-09-09 |
US20150284712A1 (en) | 2015-10-08 |
CN104903466A (en) | 2015-09-09 |
CN107090491A (en) | 2017-08-25 |
HK1209162A1 (en) | 2016-03-24 |
US10155942B2 (en) | 2018-12-18 |
AU2013337280B2 (en) | 2018-11-08 |
EP2914745B1 (en) | 2017-09-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210189382A1 (en) | | Barcoding Nucleic Acids |
US10301660B2 (en) | | Methods and compositions for repair of DNA ends by multiple enzymatic activities |
CN110191961B (en) | | Method for preparing asymmetrically tagged sequencing library |
US9255291B2 (en) | | Oligonucleotide ligation methods for improving data quality and throughput using massively parallel sequencing |
CN111849965B (en) | | Polynucleotide adapter design for reduced bias |
CN109844137B (en) | | Barcoded circular library construction for identification of chimeric products |
US20230159984A1 (en) | | Gene target region enrichment method and kit |
EP4004232A1 (en) | | Methods and compositions for high throughput sample preparation using double unique dual indexing |
EP3330386A1 (en) | | Preparation of adapter-ligated amplicons |
JP2021514651A (en) | | Preparation of single-stranded circular DNA template for single molecule sequencing |
WO2016135300A1 (en) | | Efficiency improving methods for gene library generation |
CN111164219A (en) | | Circularization method for single molecule sequencing sample preparation |
US11015192B2 (en) | | Method for generating a stranded RNA library |
JP2020512845A (en) | | Methods, compositions, and kits for preparing nucleic acid libraries |
JP2023513606A (en) | | Methods and Materials for Assessing Nucleic Acids |
JP2019520839A (en) | | Method for generating a single stranded circular DNA library for single molecule sequencing |
WO2018009677A1 (en) | | Fast target enrichment by multiplexed relay pcr with modified bubble primers |
WO2019023243A1 (en) | | Methods and compositions for selecting and amplifying dna targets in a single reaction mixture |
KR20230028450A (en) | | Inclusive enrichment of amplicons |
AU2021354916A1 (en) | | Method and means for generating transcribed nucleic acids |
CN113564235A (en) | | DNA sequencing method and kit |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AS | Assignment | Owner name: RUBICON GENOMICS, INC., MICHIGAN. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: KURIHARA, TAKAO; KAMBEROV, EMMANUEL; TESMER, TIM; AND OTHERS; SIGNING DATES FROM 20150716 TO 20150720; REEL/FRAME: 055409/0739. Owner name: TAKARA BIO USA, INC., CALIFORNIA. Free format text: MERGER; ASSIGNOR: RUBICON GENOMICS, INC.; REEL/FRAME: 055409/0755. Effective date: 20170331 |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: FINAL REJECTION MAILED |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: NON FINAL ACTION MAILED |
| | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |