WO2015089339A2 - Compositions, methods and kits for dna fragmentation and tagmentation - Google Patents

Compositions, methods and kits for dna fragmentation and tagmentation Download PDF

Info

Publication number
WO2015089339A2
WO2015089339A2 PCT/US2014/069855 US2014069855W WO2015089339A2 WO 2015089339 A2 WO2015089339 A2 WO 2015089339A2 US 2014069855 W US2014069855 W US 2014069855W WO 2015089339 A2 WO2015089339 A2 WO 2015089339A2
Authority
WO
WIPO (PCT)
Prior art keywords
double
recognition end
end sequence
oligonucleotide
stranded portion
Prior art date
Application number
PCT/US2014/069855
Other languages
French (fr)
Other versions
WO2015089339A3 (en
Inventor
Changping Shi
Original Assignee
Changping Shi
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changping Shi filed Critical Changping Shi
Publication of WO2015089339A2 publication Critical patent/WO2015089339A2/en
Publication of WO2015089339A3 publication Critical patent/WO2015089339A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/66General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2525/00Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
    • C12Q2525/10Modifications characterised by
    • C12Q2525/15Modifications characterised by incorporating a consensus or conserved sequence
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2535/00Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
    • C12Q2535/122Massive parallel sequencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2563/00Nucleic acid detection characterized by the use of physical, structural and functional properties
    • C12Q2563/185Nucleic acid dedicated to use as a hidden marker/bar code, e.g. inclusion of nucleic acids to mark art objects or animals

Definitions

  • the disclosed embodiments relate to compositions, methods and kits for generating DNA libraries from target DNA using transpososome complexes. At least some of the DNA fragments of such libraries are tagged at both ends, wherein the tag sequence at the 5 ' end of a fragment has a different nucleotide sequence than the tag present in the 3 ' end of that fragment.
  • the disclosed DNA libraries are suitable for next generation sequencing application or other methods that rely on the use of tagged DNA fragments derived from the target genetic material.
  • DNA fragment libraries comprising a collection or population of DNA fragments generated from target DNA in a sample, wherein the combination of all of the DNA fragments in the library exhibits sequences that are qualitatively and/or quantitatively representative of the sequence of the target DNA from which the DNA fragments were generated.
  • next generation sequencing methods require fragmentation of genomic DNA (gDNA) or double-stranded cDNA (prepared from RNA) into smaller single- stranded DNA (ssDNA) fragments and addition of tags to at least one strand or preferably both fragment strands.
  • gDNA genomic DNA
  • ssDNA single- stranded DNA
  • Fragmentation of gDNA is one of the main steps in preparing samples for high- throughput sequencing.
  • current methods for generating DNA fragment libraries for next generation sequencing comprise fragmenting the target DNA that one desires to sequence (e.g. target DNA comprising genomic DNA or double-stranded cDNA after reverse transcription of RNA) using a sonicator, nebulizer, or a nuclease (for example, DNase I), and joining (e.g., by ligation) oligonucleotides consisting of adapters or tags to the 5' and 3' ends of the fragments.
  • a sonicator e.g. target DNA comprising genomic DNA or double-stranded cDNA after reverse transcription of RNA
  • a nuclease for example, DNase I
  • joining e.g., by ligation
  • transposase complexes comprise a dimer of modified transposase Tn5 and a pair of Tn-5 binding double- stranded DNA (dsDNA) oligonucleotides containing a 19 base pair (bp) transposase-binding sequence (mosaic end, ME), or inverted repeat sequence (IR).
  • dsDNA double- stranded DNA
  • bp 19 base pair transposase-binding sequence
  • IR inverted repeat sequence
  • transpososome creates only one DNA cleavage per complex. Therefore, unlike with DNase I, the degree of DNA fragmentation is more easily controlled during transposase fragmentation.
  • compositions, methods and kits for making creating DNA libraries of tagged fragments typically obtained from gDNA or double stranded cDNA.
  • compositions for generating fragmented, tagged DNA libraries comprise at least one first oligonucleotide comprising at least one double- stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; at least one second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; and a transposase.
  • compositions of the current teachings further comprise at least one third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; at least one fourth oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one fourth recognition end sequence;
  • methods for generating a library of tagged DNA fragments comprise combining (a) a first oligonucleotide comprising at least one double- stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; b) a second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; (c) at least one transposase; and (d) at least one target DNA sequence; and incubating the combination under conditions suitable for generating a DNA library wherein each DNA fragment comprises at least two different recognition end sequences, at least two sequences complimentary to two different recognition end sequences, or at least one recognition end sequence and at least one sequence that is complimentary to a different recognition end sequence.
  • the target DNA is gDNA or double-stranded cDNA.
  • Some method embodiments further comprise combining (a) a third
  • the first oligonucleotide, the second oligonucleotide and the at least one transposase are incubated under conditions suitable a first transposome to form prior to adding the at least one target DNA sequence.
  • the third oligonucleotide, the fourth oligonucleotide, and the at least one transposase are incubated under conditions suitable for a second transposome to form prior to adding the target DNA.
  • the at least one first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both
  • the at least one second recognition end sequence comprises SEQ ID NO:2, SEQ ID NO:4, or both
  • the at least one first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both
  • the at least one second recognition end sequence comprises SEQ ID NO:2, SEQ ID NO:4, or both.
  • the at least one first recognition end sequence comprises SEQ ID NO: l; the at least one second recognition end sequence comprises SEQ ID NO:2; the at least one third recognition end sequence comprises SEQ ID NO:3; and the at least one fourth recognition end sequence comprises SEQ ID NO:4.
  • Certain method embodiments further comprise amplifying at least one tagged DNA fragment using primers and the polymerase chain reaction (PCR).
  • the PCR amplification comprises using at least one first primer and at least one second primer, wherein the 3' end portion of the first primer is complementary to the 3' end of the at least one first recognition end sequence of at least one DNA fragment and wherein the 3' end portion of the second primer comprises the sequence of at least a portion of the 5' end of the at least one first recognition end sequence of the at least one DNA fragment.
  • both the first primer and the second primer comprise 3' sequence that are the same as or are complimentary to at least a portion of the first recognition end sequence, the second recognition end sequence or both recognition end sequences.
  • the first primer comprises SEQ ID NO:7 and the second primer comprises SEQ ID NO:8.
  • the first primer, the second primer or both further comprise sequencing tags comprising at least one of: Roche 454 sequencing tags, IlluminaTM SOLEXATM sequencing tags, Applied Biosystems' SOLIDTM or Ion Torrent sequencing tags, the Pacific Biosciences' SMRTTM sequencing tags, Pollonator Polony sequencing tags, or Complete
  • Kits for generating tagged libraries of fragmented DNA are also disclosed.
  • kits comprise: at least one first oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; and at least one second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence.
  • Certain kit embodiments further comprise at least one third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; and at least one fourth
  • the first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both
  • the second recognition end sequences comprises SEQ ID NO:2, SEQ ID NO:4, or both.
  • the first recognition end sequence comprises SEQ ID NO: l; the second recognition end sequences comprises SEQ ID NO:2; the third recognition end sequence comprises SED ID NO:3; and the fourth recognition end sequence comprises SEQ ID NO:4.
  • kits of the current teachings further comprise at least one adaptor and at least one oligonucleotide primer.
  • the at least one adaptor comprises SEQ ID NO:5, SEQ ID NO:6, or both
  • the at least one oligonucleotide primer comprises SEQ ID NO;7, SEQ ID NO:8, or both
  • the at least one adaptor comprises SEQ ID NO:5, SEQ ID NO:6, or both
  • the at least one oligonucleotide primer comprises SEQ ID NO;7, SEQ ID NO:8, or both.
  • FIG. 1 graphically depicts a disclosed method for generating DNA fragments form target DNA using a transposase (Tn5) and two different recognition end sequences (El and E2), sometimes referred to as transposon end sequences.
  • Tn5 transposase
  • El and E2 two different recognition end sequences
  • transposome complex comprising the transposase and the two different recognition end sequences is formed.
  • the transposome is combined with the target DNA and an in vitro transposition reaction occurs, resulting in DNA fragments with different recognition end sequences, El and E2, on each end (sometimes referred to as di-tagged dsDNA).
  • FIG. 2 graphically depicts a method for generating a DNA library using the exemplary tagged DNA fragments form FIG 1.
  • First and second amplification primers each comprising a different bar code sequence that the other primer, are annealed to the adaptor linked di-tagged DNA fragments.
  • the fragments are then PCR amplified using DNA polymerase to generate a DNA library comprising bar coded (tagged) fragments.
  • FIG. 3 A shows an agarose gel of electrophoresed lambda DNA that was fragmented and tagged using an engineered transposase and two different recognition end sequences corresponding to that transposase according to a disclosed method.
  • the fragments were amplified using one primer pair (Lane #1).
  • Lane 2 shows the results obtained with a NEXTERATM kit (Illumina) using a single ME and 4 amplification primers. .
  • FIG. 3B is a fragment size distribution printout of the tagged, fragmented DNA library shown in FIG. 3 A lane 1 obtained using an Agilent Bioanalyzer DNA 1000 kit.
  • FIG. 4 is an agarose gel of tagged DNA libraries generated according to Example 1. The first and last lanes shown molecular weight markers. Lane two (9948) is from a DNA library prepared from human gDNA. Lane 3 is a DNA library prepared from bacteriophage Lambda gDNA. Lane 4 is a DNA library prepared from Helicobacter pylori gDNA, and Lane 5 is a DNA library prepared from E. coli gDNA.
  • compositions, methods and kits for generating compositions related to tissue engineered constructs including but not limited to, constructs designed to add volume at and/or below an incision or insertion site when certain constructs of the present teachings is implanted.
  • constructs designed according to certain disclosed tissue engineering methods that mimic the appearance or a nipple or a NAC when implanted.
  • methods for fabricating multilayered constructs, for fabricating constructs in a hydrogel scaffolds and methods comprising bioprinting one or more cell type in or on a construct layer, or a hydrogel or other scaffold.
  • an adaptor refers to a non-target nucleic acid sequence, generally DNA, that provides a means of addressing a nucleic acid fragment to which it is joined.
  • an adaptor comprises a nucleotide sequence that permits identification, recognition, and/or molecular or biochemical manipulation of the DNA to which the adaptor is attached, for example, by providing a site for annealing an oligonucleotide, such as a primer for extension by a DNA polymerase, or an oligonucleotide for binding to a bead or other support, or for a ligation reaction.
  • one or more adaptors are incorporated in components of the fragmented DNA libraries of the current teachings.
  • adaptors are used for polymerase-mediated amplification.
  • oligo or “oligonucleotide” is a short piece of DNA or RNA that is typically single-stranded, but for purposes of the current teachings, an oligo may be double stranded or may be partially single stranded and partially double stranded.
  • recognition end sequence refers to the nucleotide sequence within the oligonucleotides of the current teachings (e.g., the first oligonucleotide, the second
  • oligonucleotide and so forth to which a transposase specifically binds to during transposome formation.
  • exemplary recognition end sequences of the current teachings comprise SEQ ID NOs: l-4.
  • transposase is an enzyme that is involved in the transposition of DNA segments in the genome, for example, the movement of transposable elements during DNA mutagenesis.
  • the term transposase as used herein may, in certain embodiments, further comprise related enzymes known as integrases.
  • Repetitive transposable elements have been identified in the mammalian genome, among other places, and some repetitive elements are able to move within the genome (transposons and retrotransposons). Typically, DNA transposons move from one genomic location to another by a mechanism that is sometimes referred to as a "cut-and-paste" mechanism.
  • transposases include the DDE transposases (for example, the transposases of the maize Ac transposon, the Drosophila P element, bacteriophage Mu, Tn5 and TnlO, Mariner, IS 10, and IS50), and the synthetic Sleeping Beauty (SB) transposase.
  • DDE transposases for example, the transposases of the maize Ac transposon, the Drosophila P element, bacteriophage Mu, Tn5 and TnlO, Mariner, IS 10, and IS50
  • SB Sleeping Beauty
  • the current teachings provide new methods for preparing DNA libraries from gDNA or double-stranded cDNA targets.
  • the DNA fragments in such libraries comprise two different recognition end sequences per DNA fragment created.
  • Certain embodiments further comprise using two independently tagged primers for amplification of the DNA fragments.
  • random DNA fragments are generated each comprising two different recognition end sequences without regard to the sequences of the target DNA.
  • an aqueous composition comprising a transposase (shown as Tn5 in this example) and two different recognition end sequences (shown as El and E2) are incubated under conditions appropriate to form a transpososome, sometimes referred to as a transposome complex.
  • the transpososome is combined with target DNA and under conditions suitable for in vitro transposition, the transpososomes bind the target DNA, transposition occurs (FIG. 1, middle panel), and DNA fragments comprising different recognition sequences on their ends are generated (FIG. 1, lower panel).
  • the oligonucleotides comprising the recognition end sequences may further comprise an adaptor portion.
  • the transposition results in fragmented target DNA, wherein at least some of the fragments comprise a different adaptor and recognition end sequence on each end, for example, as depicted in the top panel of FIG. 2.
  • tagged DNA libraries suitable for high-throughput sequencing are created by combining di-tagged DNA fragments with amplification primers and DNA polymerase under conditions suitable for primer hybridization and PCR amplification to occur, for example, as depicted in the middle and bottom panels of FIG. 2.
  • compositions, methods, and kits may comprise any naturally available transposase or genetically engineered transposase capable of binding with the disclosed oligonucleotides to form a transpososome capable of in vitro transposition.
  • compositions, methods, and kits comprise a first oligonucleotide comprising a first recognition end sequence and a second oligonucleotide comprising a second recognition end sequence, where the first recognition end sequence and the second recognition end sequence are different.
  • the first oligo comprises a first recognition end sequence and a first adaptor sequence and the second oligo comprises a second recognition end sequence and a second adaptor sequence, wherein the first recognition end sequence and the first adaptor are different from the second recognition end sequence and the second adaptor.
  • composition, method and kit embodiments further comprise a third oligonucleotide comprising a third recognition end sequence and a fourth oligonucleotide comprising a fourth recognition end sequence, where the third and fourth recognition end sequences.
  • a third oligonucleotide, a fourth oligonucleotide or both further comprise an adaptor portion that is upstream of the recognition end sequence.
  • the third oligonucleotide comprises a third recognition end sequence and a third adaptor sequence and the fourth oligonucleotide comprises a fourth recognition end sequence and a fourth adaptor sequence, wherein the third recognition end sequence and the third adaptor are different from the fourth recognition end sequence and the fourth adaptor.
  • the first and third adaptor sequences are the same or different and the second and fourth adaptor sequences are the same or different.
  • the fragments in the DNA library comprise (a) a first subpopulation of fragments comprising a first recognition end sequence on one end and a second recognition end sequence on the other end and (b) a second subpopulation of fragments comprising a third recognition end sequence on one end and a fourth second recognition end sequence on the other end.
  • the number of di- tagged fragments in the first subpopulation and the second subpopulation are approximately equal.
  • the recognition end sequence can be a naturally occurring sequence for the transposase of choice, or it can be an engineered sequence designed for a particular transposase and providing additional or alternative functions for the oligonucleotide or DNA fragment where it is incorporated or transposed.
  • the recognition end sequence comprise: 5'- AGATCTGATCAAGAGACAG-3 * (SEQ ID NO: l); 5 * - CTGTCTCTTGATCAGATCT-3 * (SEQ ID NO:2); 5 * - ACTTGTGTATAAGAGTCAG-3 * (SEQ ID NO: 3); 5'-
  • the first recognition end sequence and the second recognition end sequence are double stranded DNA sequence comprising a sequence for one strand of SEQ ID NO: l, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4.
  • Certain embodiments comprise: (1) a first composition comprising a first
  • oligonucleotide comprising a first recognition end sequence comprising SEQ ID NO: l, a second oligonucleotide comprising a second recognition end sequence comprising SED ID NO:2 and a transposase; and (2) a second composition comprising a third oligonucleotide comprising a third recognition end sequence comprising SEQ ID NO:3, a fourth oligonucleotide comprising a fourth recognition end sequence comprising SED ID NO:4 and a transposase.
  • first and second compositions are incubated under conditions suitable for transposome formation to occur.
  • the first and/or the second compositions comprising transpososome complexes are combined with target DNA.
  • the first composition, the second composition, and the target DNA are combined simultaneously or the components are combined in any order.
  • the combination is incubated under conditions suitable for transposition reactions to occur, resulting in fragmentation of the target with each fragment comprising a different end sequence than the corresponding end of the fragment.
  • the oligonucleotides comprising the recognition end sequences further comprise adaptor sequences
  • transposition results in DNA fragments with ends comprising different end sequences and adaptors (for example, as depicted in FIG. 2).
  • the first oligonucleotide, the second oligonucleotide, the transposase, and the target DNA are combined at the same time.
  • the first oligonucleotide, the second oligonucleotide, and the transposase are combined to make a complex prior to combining the complex with the target DNA.
  • Any number of additional substances can be included in the disclosed compositions and methods. Those in the art will appreciate that the identity, number, and amount of the various additional components will typically be dictated by the application for the composition or the specific optimal activity requirements for a particular transposase employed.
  • DNA fragments comprising adaptors are combined with pairs of primers (for example but not limited to SEQ ID NO: 7 and SEQ ID NO: 8) under conditions suitable for primer hybridization.
  • the primers are suitable for use as amplification or sequencing primers.
  • the primers used for PCR amplification include 3' regions comprising sequences that are part of the recognition end sequence(s).
  • primers include 5' sequences that do not include sequences that are part of the recognition end sequence(s).
  • one or both primers include sequences (sometimes referred to as tags) that allow amplicons generated using such primers to hybridize to a solid support for sequencing.
  • the tags provide priming sites for sequencing using a DNA polymerase. In some embodiments, the tags also provide sites for capturing the fragments onto a surface, such as a bead (e.g., prior to emulsion PCR amplification).
  • the DNA fragment libraries used as templates for next generation sequencing comprise 5'- and 3'- tagged DNA fragments or "di-tagged DNA fragments," i.e., the fragments comprise a different recognition end sequence in the 5 ' portion of the fragment than the recognition end sequence that is found in the 3 ' portion of that DNA fragment.
  • transpososomes formation of transpososomes (transposase-transposon complexes).
  • a first composition comprising lmicroliter ( ⁇ ) lx TE buffer (lOmM Tris pH8.0, ImM EDTA), 5 ⁇ lOOuM first oligonucleotide comprising a first recognition end sequence (SEQ ID NO: l) and 5 ⁇ lOOuM second oligonucleotide comprising a second recognition end sequence (SEQ ID NO:2) was incubated at 90°C for 5 minutes. The temperature was decreased to 25° C, then maintained at 25°C for 2 hours.
  • a second composition comprising ⁇ ⁇ lx TE buffer (lOmM Tris pH8.0, ImM EDTA), 5 ⁇ , lOOuM third oligonucleotide comprising a third recognition end sequence (SEQ ID NO:3) and 5 ⁇ lOOuM fourth oligonucleotide comprising a fourth recognition end sequence (SEQ ID NO:4) was incubated at 90°C for 5 minutes. The temperature was decreased to 25° C in a step-wise manner, then maintained at 25°C for 2 hours.
  • composition comprising the first and second oligonucleotides, and ⁇ , recombinant transposase Tn5 (2ug ⁇ L; obtained from a Tn5 gene mutated to prevent expression of the inhibitor protein and possess enhance the transposition activity) were combined and incubated at room temperature for 1 hour.
  • Transpososome 2 preparation Nine ⁇ , lx TE buffer, ⁇ of the second composition comprising the third and fourth oligonucleotides, and ⁇ ⁇ recombinant transposase Tn5 (2ug ⁇ L) were combined and incubated at room temperature for 1 hour. The two transposome preparations were mixed together, then stored at -20°C.
  • genomic DNA 9948 human genomic DNA 9948 (MCLAB), bacteriophage lambda DNA (Takara), Helicobacter pylori (H. pylori) DNA (Coriell) and E. coli DNA (MCLAB) were adjusted to final volumes of 10 ⁇ using nuclease-free ⁇ 20 in four PCR tubes, one for each DNA target. To each tube was added 10 ⁇ of 5x reaction buffer (45mM Tris Acetate pH7.5, 20mM Magnesium Acetate, 40mM Potassium Acetate, 20mM Potassium
  • the resulting tagmented DNA was mixed with 180 ⁇ ⁇ Binding Buffer (Zymo DNA Clean & Concentration Kit), transferred to a column (Zymo), then centrifuged at 10,000 rpm in an 5415C Centrifuge (Eppendorf) for 1 minute, then washed twice with 300 ⁇ , Wash Buffer (Zymo). Fourteen ⁇ of DNA Resuspension Buffer (Zymo) was added directly to the column matrix, the column was incubated 2 minutes at room temperature, then the elute was collected by centrifuging the column at 10,000 rpm for 1 minute.
  • Binding Buffer Zymo DNA Clean & Concentration Kit
  • the purified tagmented DNA was amplified via 9-cycle PCR program with addition of extra sequences: index 1 (i7) and index 2 (i5) (Nextera Index Kit, Illumina) for sequencing, as well as common adapters (P5 and P7; SEQ ID NO:7 and SEQ ID N0:8, respectively) for cluster generation and sequencing.
  • the PCR mixture included 25 ⁇ , Amplification MasterMix (MCLAB 1-5TM 2X High-Fidelity Master Mix) 5 ⁇ PCR PRimerMix (lOuM P5 and P7), 4 ⁇ each index 1 and index 2, and the tagmented DNA.
  • the PCR program was: 1) 72°C 3 minutes, 2) 98°C 3 minutes, 3) 9 cycles of: 98°C 10 seconds, 65°C 30 seconds, 72°C 3 minutes, then 4) hold at 10°C.

Landscapes

  • Genetics & Genomics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Organic Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Zoology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Compositions, methods, and kits for preparing DNA libraries are provided. In some embodiments, at least some of the fragments in such libraries comprise a different recognition end sequence in the 5 portion of the fragment than the recognition end sequence in the 3 portion of that DNA fragment.

Description

COMPOSITIONS, METHODS AND KITS FOR DNA
FRAGMENTATION AND TAGMENTATION
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This patent application claims priority under 35 U.S. C. § 119(e) to U.S. Provisional Patent Application No. 61/963,665, entitled "Ultra-Low Input System with Two Different End Recognition Sequences for DNA Fragmentation and Tagmentation" filed December 11, 2013, which is incorporated herein by reference in its entirety.
FIELD
[0002] The disclosed embodiments relate to compositions, methods and kits for generating DNA libraries from target DNA using transpososome complexes. At least some of the DNA fragments of such libraries are tagged at both ends, wherein the tag sequence at the 5 ' end of a fragment has a different nucleotide sequence than the tag present in the 3 ' end of that fragment. The disclosed DNA libraries are suitable for next generation sequencing application or other methods that rely on the use of tagged DNA fragments derived from the target genetic material.
BACKGROUND
[0003] Traditionally, Sanger dideoxy sequencing has been the preferred DNA sequencing method, although it typically requires extensive cloning or amplification of the target DNA. Preparing cloned or amplified DNA template is usually laborious, time-consuming, and expensive. Recently, next generation sequencing platforms have become available
commercially, for example, the SOLEXA™ (Illumina), 454 FLX™ or 454 TITANIUM™ (Roche) and the SOLID™ or Ion Torrent DNA Sequencer (Life Technologies/ Applied
Biosystems) systems.
[0004] Although the chemistry by which sequence information is generated may vary among different next generation sequencing platforms, all of the systems generate sequence data by simultaneously sequencing very large numbers of sequencing templates. So, rather than requiring genomic libraries of DNA clones in E. coli for Sanger sequencing, for example, it is now desirable to have in vitro systems for generating DNA fragment libraries comprising a collection or population of DNA fragments generated from target DNA in a sample, wherein the combination of all of the DNA fragments in the library exhibits sequences that are qualitatively and/or quantitatively representative of the sequence of the target DNA from which the DNA fragments were generated. In some cases, it is necessary to generate DNA fragment libraries comprising multiple genomic DNA fragment libraries, each of which is labeled with a different address tag, index or bar code to permit identification of the source of each fragment.
[0005] In general, these next generation sequencing methods require fragmentation of genomic DNA (gDNA) or double-stranded cDNA (prepared from RNA) into smaller single- stranded DNA (ssDNA) fragments and addition of tags to at least one strand or preferably both fragment strands.
[0006] Fragmentation of gDNA is one of the main steps in preparing samples for high- throughput sequencing. In general, current methods for generating DNA fragment libraries for next generation sequencing comprise fragmenting the target DNA that one desires to sequence (e.g. target DNA comprising genomic DNA or double-stranded cDNA after reverse transcription of RNA) using a sonicator, nebulizer, or a nuclease (for example, DNase I), and joining (e.g., by ligation) oligonucleotides consisting of adapters or tags to the 5' and 3' ends of the fragments. These traditional methods are laborious, time-consuming, and often result in DNA fragmentation that is either insufficient or too extensive. In either case, the yield of DNA fragments of useful size is low.
[0007] This difficulty has been overcome by controlled DNA fragmentation using oligonucleotide-transposase complexes, sometimes referred to as "transpososomes". Such complexes comprise a dimer of modified transposase Tn5 and a pair of Tn-5 binding double- stranded DNA (dsDNA) oligonucleotides containing a 19 base pair (bp) transposase-binding sequence (mosaic end, ME), or inverted repeat sequence (IR). Unlike DNase, a single molecule of which can generate numerous breaks in a target DNA, the transposase complex or
"transpososome" creates only one DNA cleavage per complex. Therefore, unlike with DNase I, the degree of DNA fragmentation is more easily controlled during transposase fragmentation.
SUMMARY
[0008] The present teachings are directed to compositions, methods and kits for making creating DNA libraries of tagged fragments, typically obtained from gDNA or double stranded cDNA.
[0009] According to certain embodiments, compositions for generating fragmented, tagged DNA libraries comprise at least one first oligonucleotide comprising at least one double- stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; at least one second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; and a transposase. Certain compositions of the current teachings further comprise at least one third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; at least one fourth oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one fourth recognition end sequence;
[0010] According to certain embodiments, methods for generating a library of tagged DNA fragments comprise combining (a) a first oligonucleotide comprising at least one double- stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; b) a second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; (c) at least one transposase; and (d) at least one target DNA sequence; and incubating the combination under conditions suitable for generating a DNA library wherein each DNA fragment comprises at least two different recognition end sequences, at least two sequences complimentary to two different recognition end sequences, or at least one recognition end sequence and at least one sequence that is complimentary to a different recognition end sequence. In certain embodiments, the target DNA is gDNA or double-stranded cDNA.
[0011] Some method embodiments further comprise combining (a) a third
oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; b) a fourth oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one fourth recognition end sequence; (c) at least one transposase. According to certain embodiments, the first oligonucleotide, the second oligonucleotide and the at least one transposase are incubated under conditions suitable a first transposome to form prior to adding the at least one target DNA sequence. In certain embodiments, the third oligonucleotide, the fourth oligonucleotide, and the at least one transposase are incubated under conditions suitable for a second transposome to form prior to adding the target DNA. [0012] In certain embodiments, (a) the at least one first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both; (b) the at least one second recognition end sequence comprises SEQ ID NO:2, SEQ ID NO:4, or both; or (c) (i) the at least one first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both, and (ii) the at least one second recognition end sequence comprises SEQ ID NO:2, SEQ ID NO:4, or both. In certain embodiments, the at least one first recognition end sequence comprises SEQ ID NO: l; the at least one second recognition end sequence comprises SEQ ID NO:2; the at least one third recognition end sequence comprises SEQ ID NO:3; and the at least one fourth recognition end sequence comprises SEQ ID NO:4.
[0013] Certain method embodiments further comprise amplifying at least one tagged DNA fragment using primers and the polymerase chain reaction (PCR). In certain embodiments, the PCR amplification comprises using at least one first primer and at least one second primer, wherein the 3' end portion of the first primer is complementary to the 3' end of the at least one first recognition end sequence of at least one DNA fragment and wherein the 3' end portion of the second primer comprises the sequence of at least a portion of the 5' end of the at least one first recognition end sequence of the at least one DNA fragment.
[0014] In certain embodiments, both the first primer and the second primer comprise 3' sequence that are the same as or are complimentary to at least a portion of the first recognition end sequence, the second recognition end sequence or both recognition end sequences. In certain embodiments, the first primer comprises SEQ ID NO:7 and the second primer comprises SEQ ID NO:8. In certain embodiments, the first primer, the second primer or both, further comprise sequencing tags comprising at least one of: Roche 454 sequencing tags, Illumina™ SOLEXA™ sequencing tags, Applied Biosystems' SOLID™ or Ion Torrent sequencing tags, the Pacific Biosciences' SMRT™ sequencing tags, Pollonator Polony sequencing tags, or Complete
Genomics sequencing tags.
[0015] Kits for generating tagged libraries of fragmented DNA are also disclosed.
According to certain embodiments, kits comprise: at least one first oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; and at least one second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence. Certain kit embodiments further comprise at least one third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; and at least one fourth
oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one fourth recognition end sequence. In some kit embodiments, (a) the first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both, and (b) the second recognition end sequences comprises SEQ ID NO:2, SEQ ID NO:4, or both. In some kit embodiments, (a) the first recognition end sequence comprises SEQ ID NO: l; the second recognition end sequences comprises SEQ ID NO:2; the third recognition end sequence comprises SED ID NO:3; and the fourth recognition end sequence comprises SEQ ID NO:4.
[0016] Certain kits of the current teachings further comprise at least one adaptor and at least one oligonucleotide primer. In some embodiments, (a) the at least one adaptor comprises SEQ ID NO:5, SEQ ID NO:6, or both, (b) the at least one oligonucleotide primer comprises SEQ ID NO;7, SEQ ID NO:8, or both, or (c) (i) the at least one adaptor comprises SEQ ID NO:5, SEQ ID NO:6, or both, (ii) the at least one oligonucleotide primer comprises SEQ ID NO;7, SEQ ID NO:8, or both.
[0017] These and other features of the present teachings are set forth herein.
BRIEF DESCRIPTION OF THE DRAWINGS
[0018] The skilled artisan will understand that the drawings, described below, are for illustration purposes only. These figures are not intended to limit the scope of the present teachings in any way.
[0019] FIG. 1 graphically depicts a disclosed method for generating DNA fragments form target DNA using a transposase (Tn5) and two different recognition end sequences (El and E2), sometimes referred to as transposon end sequences. First, a transposome complex comprising the transposase and the two different recognition end sequences is formed. Next, the transposome is combined with the target DNA and an in vitro transposition reaction occurs, resulting in DNA fragments with different recognition end sequences, El and E2, on each end (sometimes referred to as di-tagged dsDNA).
[0020] FIG. 2 graphically depicts a method for generating a DNA library using the exemplary tagged DNA fragments form FIG 1. First and second amplification primers, each comprising a different bar code sequence that the other primer, are annealed to the adaptor linked di-tagged DNA fragments. The fragments are then PCR amplified using DNA polymerase to generate a DNA library comprising bar coded (tagged) fragments.
[0021] FIG. 3 A shows an agarose gel of electrophoresed lambda DNA that was fragmented and tagged using an engineered transposase and two different recognition end sequences corresponding to that transposase according to a disclosed method. The fragments were amplified using one primer pair (Lane #1). Lane 2 shows the results obtained with a NEXTERA™ kit (Illumina) using a single ME and 4 amplification primers. .
[0022] FIG. 3B is a fragment size distribution printout of the tagged, fragmented DNA library shown in FIG. 3 A lane 1 obtained using an Agilent Bioanalyzer DNA 1000 kit.
[0023] FIG. 4 is an agarose gel of tagged DNA libraries generated according to Example 1. The first and last lanes shown molecular weight markers. Lane two (9948) is from a DNA library prepared from human gDNA. Lane 3 is a DNA library prepared from bacteriophage Lambda gDNA. Lane 4 is a DNA library prepared from Helicobacter pylori gDNA, and Lane 5 is a DNA library prepared from E. coli gDNA.
DETAILED DESCRIPTION
[0024] It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not intended to limit the scope of the current teachings.
[0025] The instant application provides compositions, methods and kits for generating compositions related to tissue engineered constructs, including but not limited to, constructs designed to add volume at and/or below an incision or insertion site when certain constructs of the present teachings is implanted. For example, constructs designed according to certain disclosed tissue engineering methods that mimic the appearance or a nipple or a NAC when implanted. Also disclosed are methods for fabricating multilayered constructs, for fabricating constructs in a hydrogel scaffolds, and methods comprising bioprinting one or more cell type in or on a construct layer, or a hydrogel or other scaffold.
[0026] The term "adaptor" refers to a non-target nucleic acid sequence, generally DNA, that provides a means of addressing a nucleic acid fragment to which it is joined. In certain embodiments, an adaptor comprises a nucleotide sequence that permits identification, recognition, and/or molecular or biochemical manipulation of the DNA to which the adaptor is attached, for example, by providing a site for annealing an oligonucleotide, such as a primer for extension by a DNA polymerase, or an oligonucleotide for binding to a bead or other support, or for a ligation reaction. According to certain disclosed embodiments, one or more adaptors are incorporated in components of the fragmented DNA libraries of the current teachings. In certain embodiments, adaptors are used for polymerase-mediated amplification.
[0027] An "oligo" or "oligonucleotide" is a short piece of DNA or RNA that is typically single-stranded, but for purposes of the current teachings, an oligo may be double stranded or may be partially single stranded and partially double stranded.
[0028] The term "recognition end sequence" refers to the nucleotide sequence within the oligonucleotides of the current teachings (e.g., the first oligonucleotide, the second
oligonucleotide, and so forth) to which a transposase specifically binds to during transposome formation. Exemplary recognition end sequences of the current teachings comprise SEQ ID NOs: l-4.
[0029] A "transposase" is an enzyme that is involved in the transposition of DNA segments in the genome, for example, the movement of transposable elements during DNA mutagenesis. The term transposase as used herein may, in certain embodiments, further comprise related enzymes known as integrases. Repetitive transposable elements have been identified in the mammalian genome, among other places, and some repetitive elements are able to move within the genome (transposons and retrotransposons). Typically, DNA transposons move from one genomic location to another by a mechanism that is sometimes referred to as a "cut-and-paste" mechanism. Exemplary transposases include the DDE transposases (for example, the transposases of the maize Ac transposon, the Drosophila P element, bacteriophage Mu, Tn5 and TnlO, Mariner, IS 10, and IS50), and the synthetic Sleeping Beauty (SB) transposase.
[0030] In this application, the use of the singular includes the plural unless specifically stated otherwise. The use of "comprise", "contain", and "include", or modifications of those root words, for example but not limited to, "comprises", "contained", and "including", are not intended to be limiting. The term "and/or" means that the terms before and after can be taken together or separately. For illustration purposes, but not as a limitation, "X, Y and/or Z" can mean "X" or "Y" or "Z" or "X and Y" or "X and Z" or "Y and Z", of "X and Y and Z.". [0031] The section headings used herein are for organizational purposes only and are not to be construed as limiting the described subject matter in any way. All literature and similar materials cited in this application, including patents, patent applications, articles, books, and treatises are expressly incorporated by reference in their entirety for any purpose. In the event that one or more of the incorporated literature and similar materials defines or uses a term in such a way that it contradicts that term's definition in this application, this application controls. While the present teachings are described in conjunction with various embodiments, it is not intended that the present teachings be limited to such embodiments. On the contrary, the present teachings encompass various alternatives, modifications, and equivalents, as will be appreciated by those of skill in the art.
Exemplary Embodiments
[0032] The following examples provide compositions, methods and kits for generating DNA libraries comprising fragments comprising different recognition end sequences on their respective ends, and in some embodiments, adaptor tags on one or both ends.
[0033] The current teachings provide new methods for preparing DNA libraries from gDNA or double-stranded cDNA targets. The DNA fragments in such libraries comprise two different recognition end sequences per DNA fragment created. Certain embodiments further comprise using two independently tagged primers for amplification of the DNA fragments. According to certain disclosed methods, random DNA fragments are generated each comprising two different recognition end sequences without regard to the sequences of the target DNA.
[0034] In one exemplary embodiment, depicted in FIG. 1, an aqueous composition comprising a transposase (shown as Tn5 in this example) and two different recognition end sequences (shown as El and E2) are incubated under conditions appropriate to form a transpososome, sometimes referred to as a transposome complex. The transpososome is combined with target DNA and under conditions suitable for in vitro transposition, the transpososomes bind the target DNA, transposition occurs (FIG. 1, middle panel), and DNA fragments comprising different recognition sequences on their ends are generated (FIG. 1, lower panel).
[0035] In certain embodiments, the oligonucleotides comprising the recognition end sequences may further comprise an adaptor portion. In some embodiments, the transposition results in fragmented target DNA, wherein at least some of the fragments comprise a different adaptor and recognition end sequence on each end, for example, as depicted in the top panel of FIG. 2. In some embodiments, tagged DNA libraries suitable for high-throughput sequencing are created by combining di-tagged DNA fragments with amplification primers and DNA polymerase under conditions suitable for primer hybridization and PCR amplification to occur, for example, as depicted in the middle and bottom panels of FIG. 2.
[0036] The disclosed compositions, methods, and kits may comprise any naturally available transposase or genetically engineered transposase capable of binding with the disclosed oligonucleotides to form a transpososome capable of in vitro transposition.
[0037] Certain disclosed compositions, methods, and kits comprise a first oligonucleotide comprising a first recognition end sequence and a second oligonucleotide comprising a second recognition end sequence, where the first recognition end sequence and the second recognition end sequence are different. In some embodiments, a first oligonucleotide, a second
oligonucleotide or both, further comprise an adaptor portion that is upstream of the recognition end sequence. In some embodiments, the first oligo comprises a first recognition end sequence and a first adaptor sequence and the second oligo comprises a second recognition end sequence and a second adaptor sequence, wherein the first recognition end sequence and the first adaptor are different from the second recognition end sequence and the second adaptor.
[0038] Certain composition, method and kit embodiments further comprise a third oligonucleotide comprising a third recognition end sequence and a fourth oligonucleotide comprising a fourth recognition end sequence, where the third and fourth recognition end sequences. In some embodiments, a third oligonucleotide, a fourth oligonucleotide or both, further comprise an adaptor portion that is upstream of the recognition end sequence. In some embodiments, the third oligonucleotide comprises a third recognition end sequence and a third adaptor sequence and the fourth oligonucleotide comprises a fourth recognition end sequence and a fourth adaptor sequence, wherein the third recognition end sequence and the third adaptor are different from the fourth recognition end sequence and the fourth adaptor. In some embodiments the first and third adaptor sequences are the same or different and the second and fourth adaptor sequences are the same or different. In certain embodiments, the fragments in the DNA library comprise (a) a first subpopulation of fragments comprising a first recognition end sequence on one end and a second recognition end sequence on the other end and (b) a second subpopulation of fragments comprising a third recognition end sequence on one end and a fourth second recognition end sequence on the other end. In some embodiments, the number of di- tagged fragments in the first subpopulation and the second subpopulation are approximately equal.
[0039] The recognition end sequence can be a naturally occurring sequence for the transposase of choice, or it can be an engineered sequence designed for a particular transposase and providing additional or alternative functions for the oligonucleotide or DNA fragment where it is incorporated or transposed.
[0040] In certain exemplary embodiments, the recognition end sequence comprise: 5'- AGATCTGATCAAGAGACAG-3* (SEQ ID NO: l); 5*- CTGTCTCTTGATCAGATCT-3* (SEQ ID NO:2); 5*- ACTTGTGTATAAGAGTCAG-3* (SEQ ID NO: 3); 5'-
CTGACTCTTATAC AC AAGT-3 ' (SEQ ID NO:4). In some embodiments, the first recognition end sequence and the second recognition end sequence are double stranded DNA sequence comprising a sequence for one strand of SEQ ID NO: l, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4. Certain embodiments comprise: (1) a first composition comprising a first
oligonucleotide comprising a first recognition end sequence comprising SEQ ID NO: l, a second oligonucleotide comprising a second recognition end sequence comprising SED ID NO:2 and a transposase; and (2) a second composition comprising a third oligonucleotide comprising a third recognition end sequence comprising SEQ ID NO:3, a fourth oligonucleotide comprising a fourth recognition end sequence comprising SED ID NO:4 and a transposase. In some embodiments, such first and second compositions are incubated under conditions suitable for transposome formation to occur. In some embodiments, the first and/or the second compositions comprising transpososome complexes are combined with target DNA. In some embodiments, the first composition, the second composition, and the target DNA are combined simultaneously or the components are combined in any order. The combination is incubated under conditions suitable for transposition reactions to occur, resulting in fragmentation of the target with each fragment comprising a different end sequence than the corresponding end of the fragment. In some embodiments, where the oligonucleotides comprising the recognition end sequences further comprise adaptor sequences, transposition results in DNA fragments with ends comprising different end sequences and adaptors (for example, as depicted in FIG. 2).
[0041] In some embodiments, the first oligonucleotide, the second oligonucleotide, the transposase, and the target DNA are combined at the same time. In other embodiments, the first oligonucleotide, the second oligonucleotide, and the transposase are combined to make a complex prior to combining the complex with the target DNA. Any number of additional substances can be included in the disclosed compositions and methods. Those in the art will appreciate that the identity, number, and amount of the various additional components will typically be dictated by the application for the composition or the specific optimal activity requirements for a particular transposase employed.
[0042] In certain embodiments, DNA fragments comprising adaptors are combined with pairs of primers (for example but not limited to SEQ ID NO: 7 and SEQ ID NO: 8) under conditions suitable for primer hybridization. According to various embodiments, the primers are suitable for use as amplification or sequencing primers. In certain embodiments, the primers used for PCR amplification include 3' regions comprising sequences that are part of the recognition end sequence(s). In some embodiments, primers include 5' sequences that do not include sequences that are part of the recognition end sequence(s). In some embodiments, one or both primers include sequences (sometimes referred to as tags) that allow amplicons generated using such primers to hybridize to a solid support for sequencing.
[0043] In some embodiments, the tags provide priming sites for sequencing using a DNA polymerase. In some embodiments, the tags also provide sites for capturing the fragments onto a surface, such as a bead (e.g., prior to emulsion PCR amplification). In some embodiments, the DNA fragment libraries used as templates for next generation sequencing comprise 5'- and 3'- tagged DNA fragments or "di-tagged DNA fragments," i.e., the fragments comprise a different recognition end sequence in the 5 ' portion of the fragment than the recognition end sequence that is found in the 3 ' portion of that DNA fragment.
Example 1
Formation of transpososomes (transposase-transposon complexes).
[0044] A first composition comprising lmicroliter (μΕ) lx TE buffer (lOmM Tris pH8.0, ImM EDTA), 5μί lOOuM first oligonucleotide comprising a first recognition end sequence (SEQ ID NO: l) and 5μί lOOuM second oligonucleotide comprising a second recognition end sequence (SEQ ID NO:2) was incubated at 90°C for 5 minutes. The temperature was decreased to 25° C, then maintained at 25°C for 2 hours.
[0045] A second composition comprising Ι μΕ lx TE buffer (lOmM Tris pH8.0, ImM EDTA), 5μΙ, lOOuM third oligonucleotide comprising a third recognition end sequence (SEQ ID NO:3) and 5μί lOOuM fourth oligonucleotide comprising a fourth recognition end sequence (SEQ ID NO:4) was incubated at 90°C for 5 minutes. The temperature was decreased to 25° C in a step-wise manner, then maintained at 25°C for 2 hours.
[0046] Transpososome 1 preparation. Nine lx TE buffer, ΙμΕ of the first
composition comprising the first and second oligonucleotides, and ΙΟμΙ, recombinant transposase Tn5 (2ug^L; obtained from a Tn5 gene mutated to prevent expression of the inhibitor protein and possess enhance the transposition activity) were combined and incubated at room temperature for 1 hour.
[0047] Transpososome 2 preparation. Nine μΐ, lx TE buffer, ΙμΕ of the second composition comprising the third and fourth oligonucleotides, and ΙΟμΙ^ recombinant transposase Tn5 (2ug^L) were combined and incubated at room temperature for 1 hour. The two transposome preparations were mixed together, then stored at -20°C.
Example 2
Generation of tagged DNA fragment libraries from gDNA.
[0048] Fifty nanogram of four genomic DNAs (human genomic DNA 9948 (MCLAB), bacteriophage lambda DNA (Takara), Helicobacter pylori (H. pylori) DNA (Coriell) and E. coli DNA (MCLAB) were adjusted to final volumes of 10 μί using nuclease-free Η20 in four PCR tubes, one for each DNA target. To each tube was added 10 μί of 5x reaction buffer (45mM Tris Acetate pH7.5, 20mM Magnesium Acetate, 40mM Potassium Acetate, 20mM Potassium
Glutamate and 5% N,N-Dimethylformaide), 5 μΐ^ of the mixture of transposome 1 and 2 (see Example 1), and 25 μΐ, nuclease-free H20, and the tubes were mixed gently. The tubes were then placed into a thermocycler (MJ Research PTC-200 Thermal Cycler) programmed to run: 55°C for 5 minutes, then hold at 10°C.
[0049] The resulting tagmented DNA was mixed with 180 μΐ^ Binding Buffer (Zymo DNA Clean & Concentration Kit), transferred to a column (Zymo), then centrifuged at 10,000 rpm in an 5415C Centrifuge (Eppendorf) for 1 minute, then washed twice with 300 μΐ, Wash Buffer (Zymo). Fourteen μΕ of DNA Resuspension Buffer (Zymo) was added directly to the column matrix, the column was incubated 2 minutes at room temperature, then the elute was collected by centrifuging the column at 10,000 rpm for 1 minute.
[0050] The purified tagmented DNA was amplified via 9-cycle PCR program with addition of extra sequences: index 1 (i7) and index 2 (i5) (Nextera Index Kit, Illumina) for sequencing, as well as common adapters (P5 and P7; SEQ ID NO:7 and SEQ ID N0:8, respectively) for cluster generation and sequencing.
[0051] The PCR mixture included 25μΙ, Amplification MasterMix (MCLAB 1-5™ 2X High-Fidelity Master Mix) 5μΕ PCR PRimerMix (lOuM P5 and P7), 4μΕ each index 1 and index 2, and the tagmented DNA. The PCR program was: 1) 72°C 3 minutes, 2) 98°C 3 minutes, 3) 9 cycles of: 98°C 10 seconds, 65°C 30 seconds, 72°C 3 minutes, then 4) hold at 10°C.
[0052] Fifteen μΕ of each of the four PCR products were analyzed on a 2% agarose gel As seen in FIG. 4, the resulting amplification products were fragments of about 500 bp.
[0053] Although the disclosed teachings have been described with reference to various applications, methods, and compositions, it will be appreciated that various changes and modifications may be made without departing from the teachings herein. The foregoing examples are provided to better illustrate the present teachings and are not intended to limit the scope of the teachings herein. Furthermore, various presently unforeseen or unanticipated alternatives, modifications, variations or improvements therein may be subsequently made by those skilled in the art which are also intended to be encompassed by the following claims. Certain aspects of the present teachings may be further understood in light of the following claims.

Claims

What is claimed is:
1. A composition for generating a DNA library comprising: a first oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; a second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; and a transposase.
2. The composition of claim 1 , wherein the first recognition end sequence, the second recognition end sequence, or both, are modified.
3. The composition of claim 1, wherein the transposase is a modified transposase that has been genetically engineered to alter at least one wild-type transposase activity.
4. The composition of claim 1, further comprising: a third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; and a fourth oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one fourth recognition end sequence.
5. A method for generating a DNA library comprising: combining (a) a first oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; (b) a second oligonucleotide comprising at least one double-stranded portion, wherein the double- stranded portion comprises at least one second recognition end sequence; (c) at least one transposase; and (d) at least one target DNA; and incubating the combination under conditions suitable for generating a DNA library comprising DNA fragments, wherein each DNA fragment comprises at least two different recognition end sequences, at least two sequences complimentary to two different recognition end sequences, or at least one recognition end sequence and at least one sequence that is complimentary to a second recognition end sequence.
6. The method of claim 5, wherein the (a) a first oligonucleotide comprising at least one double- stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; (b) a second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; and the (c) at least one transposase are combined, prior to adding the target DNA.
7. The method of claim 5, wherein the at least one target DNA sequence is genomic DNA (gDNA).
8. The method of claim 5, wherein (a) the first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both; (b) the second recognition end sequence comprises SEQ ID NO:2, SEQ ID NO:4, or both; or (c) (i) the first recognition end sequence comprises SEQ ID NO: l, SEQ ID NO:3, or both, and (ii) the second recognition end sequence comprises SEQ ID NO:2, SEQ ID NO:4, or both.
9. The method of claim 5, further comprising amplifying the DNA fragments, wherein the amplifying comprises PCR amplification.
10. The method of claim 9, wherein the PCR amplification comprises using at least one first oligonucleotide primer and at least one second oligonucleotide primer, wherein the 3' end portion of the first oligonucleotide primer is complementary to the 3' end of the at least one first recognition end sequence of at least one DNA fragment and wherein the 3' end portion of the second oligonucleotide comprises the sequence of at least a portion of the 5' end of the at least one first recognition end sequence of the at least one DNA fragment.
11. A method for generating a DNA library comprising:
(a) combining a first oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; a second oligonucleotide comprising at least one double-stranded portion, wherein the double- stranded portion comprises at least one second recognition end sequence; and at least one transposase; and incubating under conditions suitable to form a first transposase complex;
(b) combining a third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; a fourth oligonucleotide comprising at least one double-stranded portion, wherein the double- stranded portion comprises at least one fourth recognition end sequence; and at least one transposase; and incubating under conditions suitable to form a second transposase complex;
(c) combining the first transposase complex, the second transposome complex and the target DNA, wherein the combining occurs in any order; and
(d) incubating the combination under conditions suitable for generating a DNA library comprising DNA fragments, wherein each DNA fragment comprises at least two different recognition end sequences, at least two sequences complimentary to two different recognition end sequences, or at least one recognition end sequence and at least one sequence that is complimentary to a second recognition end sequence.
12. The method of claim 11, further comprising amplifying the DNA fragments, wherein the amplifying comprises PCR amplification.
13. The method of claim 12, comprising a first primer and a second primer, wherein the first primer, the second primer, or both, comprise a sequence at or near the 3 ' end of the primer that is the same as or complimentary to at least a portion of the first recognition end sequence, the second recognition end sequence, the third recognition end sequence, the fourth recognition end sequences, or combinations thereof.
14. The method of 13, wherein the first oligonucleotide primer comprises SEQ ID NO:7 and the second oligonucleotide primer comprises SEQ ID NO:8.
15. The method of claim 13, wherein the first oligonucleotide primer, the second
oligonucleotide primer or both, further comprise sequencing tags comprising at least one of: Roche 454 sequencing tags, Illumina™ SOLEXA™ sequencing tags, Applied Biosystems' SOLID™ or Ion Torrent sequencing tags, the Pacific Biosciences' SMRT™ sequencing tags, Pollonator Polony sequencing tags, or Complete Genomics sequencing tags.
16. A kit comprising: a first oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one first recognition end sequence; a second oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one second recognition end sequence; a third oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one third recognition end sequence; a fourth oligonucleotide comprising at least one double-stranded portion, wherein the double-stranded portion comprises at least one fourth recognition end sequence; and at least one transposase.
17. The kit of claim 16, wherein (a) the first recognition end sequence comprises SEQ ID NO: l; (b) the second recognition end sequences comprises SEQ ID NO:2; (c) the third recognition end sequence comprises SEQ ID NO:3; and (d) the fourth recognition end sequence comprises SEQ ID NO:4.
18. The kit of claim 16, further comprising at least one adaptor and at least one primer.
19. The kit of claim 18, wherein (a) the at least one adaptor comprises SEQ ID NO:5, SEQ ID NO:6, or both, (b) the at least one primer comprises SEQ ID NO:7, SEQ ID NO:8, or both, or (c) (i) the at least one adaptor comprises SEQ ID NO:5, SEQ ID NO:6, or both, (ii) the at least one oligonucleotide primer comprises SEQ ID NO:7, SEQ ID NO:8, or both.
PCT/US2014/069855 2013-12-11 2014-12-11 Compositions, methods and kits for dna fragmentation and tagmentation WO2015089339A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201361963665P 2013-12-11 2013-12-11
US61/963,665 2013-12-11

Publications (2)

Publication Number Publication Date
WO2015089339A2 true WO2015089339A2 (en) 2015-06-18
WO2015089339A3 WO2015089339A3 (en) 2016-01-21

Family

ID=53371969

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/069855 WO2015089339A2 (en) 2013-12-11 2014-12-11 Compositions, methods and kits for dna fragmentation and tagmentation

Country Status (1)

Country Link
WO (1) WO2015089339A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180016571A1 (en) * 2014-11-05 2018-01-18 Illumina, Inc. Transposase compositions for reduction of insertion bias
WO2024003332A1 (en) 2022-06-30 2024-01-04 F. Hoffmann-La Roche Ag Controlling for tagmentation sequencing library insert size using archaeal histone-like proteins

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180016571A1 (en) * 2014-11-05 2018-01-18 Illumina, Inc. Transposase compositions for reduction of insertion bias
US10815478B2 (en) * 2014-11-05 2020-10-27 Illumina, Inc. Method of sequential tagmentation with transposase compositions for reduction of insertion bias
AU2015343034B2 (en) * 2014-11-05 2021-12-16 Illumina, Inc. Transposase compositions for reduction of insertion bias
WO2024003332A1 (en) 2022-06-30 2024-01-04 F. Hoffmann-La Roche Ag Controlling for tagmentation sequencing library insert size using archaeal histone-like proteins

Also Published As

Publication number Publication date
WO2015089339A3 (en) 2016-01-21

Similar Documents

Publication Publication Date Title
US20210071171A1 (en) Compositions and methods for targeted nucleic acid sequence enrichment and high efficiency library generation
CN106062209B (en) Synthetic long read DNA sequencing
US9255291B2 (en) Oligonucleotide ligation methods for improving data quality and throughput using massively parallel sequencing
US9469874B2 (en) Long-range barcode labeling-sequencing
US7601499B2 (en) Paired end sequencing
US20060127920A1 (en) Polynucleotide synthesis
US8921076B2 (en) Method for genome complexity reduction and polymorphism detection
WO2006127423A2 (en) Methods of producing polynucleotide libraries using scarless ligation
JP2013518598A (en) Isothermal amplification of nucleic acids using primers containing randomized sequences and specific primers and uses thereof
CA3163353A1 (en) Controlled strand-displacement for paired-end sequencing
JP2012223203A (en) Paired end sequencing
US20140336058A1 (en) Method and kit for characterizing rna in a composition
US20170267998A1 (en) Methods of synthesizing polynucleotides
WO2015089339A2 (en) Compositions, methods and kits for dna fragmentation and tagmentation
EP2152901B1 (en) Method for introducing common and/or individual sequence elements in a target nucleic acid molecule
US20230183792A1 (en) Methods for the multiplexed isothermal amplification of nucleic acid sequences
CN110573627A (en) Methods and compositions for producing target nucleic acid molecules
US20240076803A1 (en) Method for Library Preparation in Next Generation Sequencing by Enzymatic DNA Fragmentation
RU2798952C2 (en) Obtaining a nucleic acid library using electrophoresis
JP4793842B2 (en) DNA amplification method using random RNA primer
Khezri et al. An Efficient Approach for Two Distal Point Site-Directed Mutagenesis from Randomly Ligated PCR Products
WO2024112758A1 (en) High-throughput amplification of targeted nucleic acid sequences
AU2023215324A1 (en) Methods selectively depleting nucleic acid using rnase h
JPWO2021081423A5 (en)
JP2005218301A (en) Method for base sequencing of nucleic acid

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14869766

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14869766

Country of ref document: EP

Kind code of ref document: A2