WO2016170147A1 - Efficiency improving ligation methods - Google Patents

Efficiency improving ligation methods Download PDF

Info

Publication number
WO2016170147A1
WO2016170147A1 PCT/EP2016/059068 EP2016059068W WO2016170147A1 WO 2016170147 A1 WO2016170147 A1 WO 2016170147A1 EP 2016059068 W EP2016059068 W EP 2016059068W WO 2016170147 A1 WO2016170147 A1 WO 2016170147A1
Authority
WO
WIPO (PCT)
Prior art keywords
dna
chloride
dsdna
tetrahydropyrimidine
thp
Prior art date
Application number
PCT/EP2016/059068
Other languages
French (fr)
Inventor
Nan Fang
Katja HEITZ
Original Assignee
Qiagen Gmbh
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qiagen Gmbh filed Critical Qiagen Gmbh
Publication of WO2016170147A1 publication Critical patent/WO2016170147A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay

Definitions

  • the present invention provides new methods and kits to improve the efficiency of ligation reactions, in particular in molecular biology applications, such as the next generation sequencing (NGS) library construction methods and gene cloning.
  • NGS next generation sequencing
  • the ligation step is critical in adding sequencing platform-specific adapters to the DNA fragments that are to be sequenced. Said improvement is achieved by the addition of agents, which modulate the melting temperature of dsDNA.
  • Double-stranded nucleic acids containing blunt ends or cohesive (sticky) ends with an overhang of one or more nucleotides can be joined by means of intermolecular or intramolecular ligation reactions.
  • Examples for the methods for ligating at a specific site are DNA ligation reactions of cohesive ends of DNA fragments, which have been cleaved by a restriction enzyme, or of blunt ends of DNA fragments. Such ligation reactions are commonly used in molecular biology applications, such as next-generation sequencing and gene cloning.
  • NGS Next-generation sequencing
  • technologies include e.g. de novo genome sequencing, transcriptome sequencing and epigenomics, as well as genetic screening for the identification of rare genetic variants and for efficient detection of either inherited or somatic mutations in cancer genes.
  • the preparation step involves random fragmentation of the genomic DNA and addition of adapter sequences to the fragment ends.
  • the commonly used method to generate platform-specific NGS libraries uses multi-step enzymatic reaction protocols to ligate adapters to the DNA fragments to be analyzed.
  • DNA fragments are generated with mechanical, chemical, or enzymatic
  • the end-repair step requires at least two enzymes: (a) a polynucleotide kinase, normally the T4 Polynucleotide Kinase (PNK) that phosphorylates the 5'-terminus of the double stranded DNA fragments; and (b) an enzyme or enzymes with polymerase and exonuclease activities that make the ends of the DNA fragments blunt by either fill-in or trimming reactions, such as e.g. T4 DNA Polymerase.
  • PNK Polynucleotide Kinase
  • sequencing adapters that have an overhang formed by thymidine nucleotides, i.e. a T- overhang.
  • an A-overhang is added to the 3'-terminus of the end-repaired PCR product, e.g. by Klenow Fragment exo-, the large fragment of the DNA polymerase I having 5'- 3' polymerase activity, but lacking both 3'- 5' exonuclease activity and 5'- 3' exonuclease activity.
  • the A-addition step can also be facilitated with enzymes having terminal nucleotide transferase activity, such as the Taq polymerase.
  • the sequencing adapter can be ligated to the DNA by a ligase, such as the T4 DNA Ligase.
  • a ligase such as the T4 DNA Ligase.
  • the A-addition step is not required and blunt-ended adapters are ligated by a T4 DNA ligase directly to the end-repaired DNA fragments.
  • the low efficiency of library generation methods can be a draw-back if a sequencing library needs to be constructed from a small amount of input DNA, such as an input amount lower than 1 ng. Most current library construction methods cannot deliver high adaptor-ligation efficiency with such low input amount; therefore, only a small percentage of the input DNA is converted into sequencing library. This could lead to either lack of sufficient library for sequencing, or the library complexity is low and cannot faithfully represent the sequence in the original sample.
  • the present invention relates to agents, which modulate the melting temperature (Tm) of dsDNA in the dsDNA ligation reactions to improve the efficiency and specificity of ligation reactions in molecular biology applications, such as gene library generation and gene cloning.
  • Tm melting temperature
  • Such ligation reactions may be intermolecular or intramolecular.
  • One aspect of the present invention refers to methods of generating dsDNA, wherein the method comprises ligating a first and a second dsDNA, both optionally having one or two single-stranded end(s), in the presence of a DNA ligase and an agent, which modulates the melting temperature of dsDNA.
  • the present invention refers to a method for ligating a first and a second ds DNA, wherein both the first and the second dsDNAs comprise two ssDNA regions, whereby each of the ssDNA region ends of the first dsDNA ligates with each of the complementary ss region ends of the second dsDNA to provide ligated circular dsDNA in the presence of a an agent, which modifies the melting temperature of dsDNA.
  • the first or the second DNA is capable of conferring the ability to auto-replicate within competent cells.
  • the invention relates to a method for generation of a sequencing library, wherein the method comprises the steps of:
  • step (iv) ligating the DNA fragments, optionally having the terminal adenine, with sequencing adapters wherein preferably the adapters have a terminal thymidine if the fragments have a terminal adenine.
  • said method further comprises step (v), wherein the ligated fragments of step (iv) are purified and size-selected for sequencing.
  • said method further comprises step (vi), wherein the adapter- ligated fragments are amplified and the amplification product is optionally purified prior to sequencing.
  • step (v) or (vi) are subjected to sequencing.
  • a further aspect of the invention refers to a kit comprising:
  • the agent which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • the agent which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC),
  • piperazinium chloride tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
  • TEAC tetramethylpiperazinium chloride
  • TMANO trimethylamine N-oxide
  • 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B) 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
  • the agent which modulates the melting temperature of dsDNA is in a ligation buffer.
  • the ligase and the agent which modulates the melting temperature of dsDNA are in separate containers.
  • the above kit further comprises:
  • the polynucleotide kinase enzyme is the T4 Polynucleotide Kinase (PNK) and the enzyme with polymerase and exonuclease activity is the T4 DNA Polymerase; and/or wherein the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
  • PNK Polynucleotide Kinase
  • the enzyme with polymerase and exonuclease activity is the T4 DNA Polymerase
  • the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
  • the agent which modulates the melting temperature of dsDNA, is an organic compound or a biochemical substance.
  • the organic compound is selected from tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), and Trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy- 3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and
  • Triton®X-100 Triton®X-100, and mixtures thereof.
  • the organic compound is selected from tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), and
  • TMANO Trimethylamine N-oxide
  • A 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine
  • THP(B) 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine
  • B 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine
  • the agent which modulates the melting temperature of dsDNA, increases the melting temperature of dsDNA.
  • the compound increasing the melting temperature of dsDNA is tetramethylammonium chloride (TMAC) or tetramethylpiperazinium chloride.
  • TMAC tetramethylammonium chloride
  • the DNA ligase in any of the above ligations is a T3 DNA ligase or a T4 DNA ligase. In other embodiments, the ligase is a T7 DNA ligase or an
  • each of the first and the second dsDNA have one or two single- stranded DNA (ssDNA) region end(s), wherein this/these ssDNA region end(s) is/are less than 20 nucleotides (nt) in length.
  • ssDNA single- stranded DNA
  • FIG. 1 TMAC improves ligation efficiency.
  • the Agilent Bioanalyzer graphs below show the size distribution and quantity of the sequencing libraries generated with either standard ligation condition (Fig. 1A, "Control”) or additional TMAC in the Ligation reaction (Fig. 1 B, "TMAC in Ligation”).
  • the addition of TMAC in the ligation reaction can significantly improve the yield of the library, as determined by the peak heights at about 500 bp, which is the expected size of adapter-ligated sequencing libraries.
  • Figure 2 TMAC improves ligation efficiency. The diagram below shows the
  • high stringency refers to conditions, under which ability of nucleic acids with certain mismatched bases to hybridize is reduced or completely eliminated. Higher stringency conditions result in a higher ratio of the amount of hybridization of sequences with no mismatches when compared to the amount of hybridization of sequences with one or more mismatches. Preferably, under conditions of "high stringency" only exact matches of bases of nucleotides will anneal and stay together.
  • agent which modulates the melting temperature of dsDNA
  • Tm-modulating agent refers to any organic molecule or biochemical substance, such as an amino acid, oligopeptide (2 to 10 amino acids), polypeptide (at least 10 amino acids to about 100 amino acids), protein, lipid, carbohydrate, or a mixture thereof, which increases or which decreases the melting temperature of dsDNA.
  • a compound increases the melting temperature of dsDNA.
  • melting temperature of double-stranded nucleic acids is the temperature, at which half of the DNA strands are in the random coil or single-stranded (ssDNA) state, and half of the DNA strands are in a double-stranded state. Tm depends on the length of the DNA molecule and its specific nucleotide sequence, in particular, the guanine (G) and cytosine (C) content.
  • the double-stranded nucleic acids refer to dsDNA, dsRNA or RNA:DNA hybrids.
  • the melting temperature also depends on the ionic strength of the solution. One may calculate the melting temperature Tm of any given DNA hybrid as shown:
  • the equation melting temperature used above refers to the melting temperature that was measured under standard conditions (about 0.8M NaCI, neutral pH (about pH 7.0)).
  • module and “modify” are used herein as synonyms and are used in the present invention to refer to an increase of the melting temperature of an adenine- thymidine-bond (A:T-bond) or to a decrease of the melting temperature of a guanine- cytosine bond (G:C-bond), which results in an overall increase of the melting
  • a melting temperature can be measured assessing dissociation- characteristics of double-stranded DNA during heating.
  • next generation sequencing and “high-throughput sequencing” are used as synonyms.
  • library refers to a large number of nucleic acid fragments, here the collection of DNA fragments for sequencing analysis.
  • the libraries referred to herein are generated by fragmentation of a sample to be analyzed, end-repairing, optionally addition of a terminal adenine, and ligation of fragments into adapters.
  • the purified DNA fragments are amplified or enriched before they are sequenced.
  • high ligation specificity means that only the desired DNA, such as insert and vector and not the insert or the vector by themselves, or end-repaired DNA fragments and adapters, not DNA fragments or adapters by themselves, are ligated together. The specificity itself can be measured by methods known to the skilled person, such as PCR.
  • the term "about" when used together with a numerical value is intended to encompass a deviation of 20%, preferably 10%, more preferably 5%, even more preferably of 2%, and most preferably of 1 % from that value.
  • a numerical value e.g., a pH value or a percentage value
  • next generation sequencing and “high-throughput sequencing” are used as synonyms.
  • restriction endonucleases are well-known compounds as is the method of their preparation; see for example Roberts, Critical Reviews in Biochemistry, November 1976, pages 123-164.
  • Representative restriction endonucleases which may be employed in the method of the invention include, but are not restricted to: Alu I, Ava I, Ava II, Bal I, Bam HI, Bel I, Bgl I, Bst E II, Eco R I, Hae II, Hae III, Hinc II, Hind II, Hind III, Hinf I, Hha I, Hpa I, Hpa II, Hph I, Hin 389I, Kpn II, Pst I, Rru I, Sau 3A, Sal I, Sma I, Sst I, Sst II, Tac I, Taq I, Xba I, Xho I and the like, many of which are commercially available (e.g. NEB, Promega, Life Technologies, and Thermo Scientific).
  • fragment size means that half of the fragments have a longer length and half of the fragments have a shorter length.
  • comprising is to be construed as encompassing both
  • nt is an abbreviation of “nucleotides”.
  • T4 Polynucleotide Kinase refers to an enzyme that catalyzes the transfer and exchange of P, from the ⁇ position of ATP to the 5 ' -hydroxyl terminus of polynucleotides (double-and single-stranded DNA and RNA) and nucleoside 3 ' -monophosphates.
  • T4 DNA Polymerase refers to an enzyme that catalyzes the synthesis of DNA in the 5 ' ⁇ 3 ' direction and requires the presence of template and primer. This enzyme has a 3 ' ⁇ 5 ' exonuclease activity which is much more active than that found in DNA
  • T4 DNA Polymerase does not exhibit 5 ' ⁇ 3 ' exonuclease activity.
  • Klenow fragment exo- or “Klenow fragment (3 ' ⁇ 5 ' exo-)" refers to an N-terminal truncation of DNA Polymerase I which retains polymerase activity, but has lost the 5 ' ⁇ 3 ' exonuclease activity and the 3 ' ⁇ 5 ' exonuclease activity.
  • Taq polymerase refers to a highly thermostable DNA polymerase from the thermophilic bacterium Thermus aquaticus. The enzyme catalyzes 5' ⁇ 3' synthesis of DNA, has no detectable 3' ⁇ 5' exonuclease (proofreading) activity and possesses low 5' ⁇ 3' exonuclease activity. In addition, Taq DNA Polymerase exhibits deoxynucleotidyl transferase activity, which is often applied in the addition of additional adenines at the 3'- end of PCR products to generate 3 ' adenine overhangs.
  • T3 DNA ligase refers to an ATP-dependent dsDNA ligase from bacteriophage T3. It catalyzes the formation of a phosphodiester bond between adjacent 5 ' phosphate and 3 ' hydroxyl groups of duplex DNA. The enzyme joins both cohesive (sticky) and blunt ends.
  • T4 DNA Ligase refers to an enzyme that catalyzes the formation of a phosphodiester bond between juxtaposed 5' phosphate and 3' hydroxyl termini in double-stranded DNA or RNA. This enzyme joins both blunt end and cohesive (sticky) ends.
  • T7 DNA Ligase is an ATP-dependent ligase from bacteriophage T7. This enzyme joins cohesive (sticky) ends and it is suitable for nick sealing. Blunt-end ligation does not occur in the presence of T7 ligase.
  • Ampligase® refers to a DNA Ligase that catalyzes NAD-dependent ligation of adjacent 3 ' -hydroxylated and 5 ' -phosphorylated termini in duplex DNA structures that are stable at high temperatures.
  • the half-life of Ampligase® is 48 hours at 65°C and more than 1 hour at 95°C. In most cases, the upper limit on reaction temperatures with Ampligase® is determined by the Tm of the DNA substrate. Under conditions of maximal hybridization stringency, nonspecific ligation is nearly eliminated.
  • ligation buffer refers to a conventional buffer for DNA ligation known to the skilled person.
  • the ligation buffer can comprise, for example, 50 mM Tris-HCI, 10 mM MgCI 2 , 1 mM ATP, 10 mM DTT, and a pH of 7.5 at 25 °C.
  • PCR refers to polymerase chain reaction, which is a standard method in molecular biology for DNA amplification.
  • qPCR refers to quantitative real-time PCR, a method used to amplify and simultaneously detect the amount of amplified target DNA molecule fragments. The process involves PCR to amplify one or more specific sequences in a DNA sample. At the same time, a detectable probe, typically a fluorescent probe, is included in the reaction mixture to provide real-time quantification.
  • Two commonly used fluorescent probes for quantification of real-time PCR products are: (1 ) non-sequence-specific fluorescent dyes (e.g., SYBR® Green) that intercalate into double-stranded DNA molecules in a sequence non-specific manner, and (2) sequence-specific DNA probes (e.g., oligonucleotides labeled with fluorescent reporters) that permit detection only after hybridization with the DNA targets or after incorporation into PCR products.
  • non-sequence-specific fluorescent dyes e.g., SYBR® Green
  • sequence-specific DNA probes e.g., oligonucleotides labeled with fluorescent reporters
  • DNA in the present invention relates to any one of viral DNA, prokaryotic DNA, archaeal DNA, and eukaryotic DNA.
  • the DNA may also be obtained from any one of viral RNA, and mRNA from prokaryotes, archaea, and eukaryotes by generating complementary DNA (cDNA) by using a reverse transcriptase.
  • the melting temperature can be measured experimentally by assessing dissociation- characteristics of double-stranded DNA during heating.
  • the energy required to break the base-base hydrogen bonding between two strands of DNA is dependent on their length, GC content and their complementarity. With increased heating of a reaction-mixture that contains double-stranded DNA sequences, the amount of double-stranded DNA decreases. The difference between the denaturing conditions of a double-stranded DNA with a completely complementary sequence and the conditions of a double-stranded DNA with an almost complementary but not completely
  • complementary sequence can be detected as the difference in absorbance change when such double-stranded DNA is denatured into a single-stranded DNA (melting curve) (for example, see I. V. Razlutuskii, L. S. Shlyakhtenko and Yu. L. lyubchenko: Nucleic Acids Research, Vol. 15, No. 16, pp. 6665-6676 (1987)).
  • Agents that modify the melting temperature can be identified by measuring the melting curve of a DNA sample under (i) standard conditions and under (ii) identical conditions as in (i) including a Tm-modulating agent, and by comparing the measured melting curve data as described above for double-stranded DNA with a completely complementary sequence and the conditions of a double-stranded DNA with an almost complementary but not completely complementary sequence.
  • the dissociation can be visualized by UV spectroscopy, or by fluorescence
  • a fluorescent dye such as SYBR® Green I, YO-PRO-I®, or ethidium bromide.
  • SYBR green fluoresces with 1000-fold higher intensity, when it intercalates into the minor groove of dsDNA.
  • dual hybridization probes may be used, whereby said probes attach to single-stranded DNA.
  • Two probes are designed to hybridize to adjacent sequences of the target DNA.
  • the probes are labeled with a pair of dyes that allow for fluorescence resonance energy transfer (FRET).
  • FRET fluorescence resonance energy transfer
  • the donor dye is e.g. attached to the 3' end of the first probe, while the acceptor dye is attached to the 5' end of the second probe.
  • the agent, which modulates the melting temperature in dsDNA is an organic compound or a biochemical substance.
  • Compounds that have been identified to modulate the melting temperature of dsDNA and thereby improving efficacy and specificity of DNA ligation include, but are not restricted to compounds such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), and trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetramethylpiperazinium chloride
  • TMANO trimethyl
  • the organic compound increases the melting temperature of dsDNA.
  • Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR 4 + .
  • R may be an alkyl, cycloalkyl or an aryl group.
  • such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or
  • the present invention refers to ligation methods, in particular to gene cloning methods and methods of generating sequencing libraries.
  • the method referred herein is characterized in that the ligation step efficiency and specificity is increased by applying an agent to a ligation reaction, which modulates the melting temperature of dsDNA.
  • a ligation step is a critical step in both gene cloning and in generating next generation sequencing libraries.
  • the ligation is carried out in the presence of an agent or more than one agent, which increase(s) the melting temperature of dsDNA.
  • the invention relates to a method of generating a sequencing library, wherein the method comprises the steps of:
  • the ligation is carried out under high stringency conditions.
  • said method further comprises step (v), wherein the ligated fragments of step (iv) are purified and size-selected for sequencing.
  • said method further comprises step (vi), wherein the adapter- ligated fragments are amplified and the amplification product is optionally purified prior to sequencing.
  • the library fragments of step (v) or step (vi) are subsequently sequenced by using sequencing platforms known to the person skilled in the art, such as lllumina® (Solexa) and Ion Torrent: Proton / PGM by Life Technologies/Thermo Fisher Scientific or other suitable high-throughput sequencing platforms.
  • the size of the DNA fragment length is a key factor for gene library construction and for sequencing.
  • Typical median lengths of DNA fragments for NGS libraries are between about 150 bps and about 1000 bps, preferably between about 150 bps and about 600 bps, more preferably between about 200 bps and about 500 bps. Most preferably, the median length is about 200 bps, about 300 bps, or about 500 bps.
  • the preferred amount of DNA starting material for generating a NGS sequencing library and for subsequent sequence analysis ranges from about 1 pg to about 1 ⁇ g, preferably from about 10 pg to about 1 ⁇ g, and more preferably about 10 pg to about 1 ng.
  • the amount of starting material is preferably about 1 pg to about 1 ⁇ g, preferably from about 10 pg to about 1 ⁇ g, and more preferably about 10 pg to about 1 ng.
  • the fragmentation step is mechanical.
  • the mechanical fragmentation is among others achieved by ultrasonic acoustic shearing, nebulization forces, sonication, hydrodynamic shearing (e.g. in French pressure cells or by needle shearing).
  • specific median fragment length sizes of DNA can be prepared e.g. by ultrasonic acoustic shearing, such as Adaptive Focused Acoustics (AFA)TM by using a Covaris® instrument, according to the manufacturer's instructions.
  • the fragmentation of DNA step is chemical. Chemical shear may also be employed for the breakup of long RNA fragments.
  • the fragmentation step is enzymatic.
  • said enzymatic fragmentation is achieved by digestion of DNA by an endonuclease.
  • an endonuclease Such endonucleases are described in more detail in the Definitions section.
  • the fragmentation may also be carried out by employing a transposase known to the person skilled in the art.
  • said fragmentation step may be inactivated by heat.
  • Step (ii) the end-repair step, is carried out by an enzyme or two enzymes with (a) polynucleotide kinase activity (PNK) and (b) an enzyme with polymerase and exonuclease activities, whereby the exonuclease activity makes the ends of the DNA blunt by fill-in or trimming reactions.
  • the enzymes of step (ii) comprise a T4 Polynucleotide Kinase (PNK) and a T4 DNA Polymerase.
  • Step (iii), the A-addition step is carried out by an enzyme, which generates an adenine docking site for adapters that have a thymidine overhang (T-overhang).
  • the enzyme of step (iii) is a Taq polymerase or Klenow Fragment exo-, the large fragment of the DNA polymerase I having 5'- 3' polymerase activity but lacking both 3'- 5' exonuclease activity and 5'- 3' exonuclease activity.
  • the enzyme of step (iii) is a thermostable polymerase, preferably a Taq polymerase.
  • Step (iv) the ligation step for gene library generation, joins either blunt or cohesive (sticky) ends of DNA fragments with either blunt or cohesive (sticky) ends of adapter molecules. Successful ligation of cohesive (sticky) ends requires complementary sequences.
  • the length of the nucleotide sequences of the ssDNA region ends of dsDNA for the ligation methods in gene library generation referred to above is less than 20 nt or less than 12 nt, preferably the sequence length is less than 10 nt or less than 8 nt, more preferably 1-6 nt or 1-5 nt.
  • the ssDNA length is 1 nt.
  • ssDNA region of one DNA comprises a terminal (3 ' ) adenine (A) and a the complementary ssDNA of the other DNA comprises a terminal (3 ' ) thymidine (T).
  • the terminal ssDNA regions are (3 ' ) cytosine (C) and the complementary terminal (3 ' ) guanine (G).
  • a fragment comprising terminal, i.e. 3' adenine overhangs serves as a docking site for the sequencing adapters, which comprise a complementary terminal, i.e. 3' thymidine overhang.
  • the sequencing adapters which comprise a complementary terminal, i.e. 3' thymidine overhang.
  • Tm modulator modulates the melting temperature of dsDNA
  • Tm modulators can include any organic molecule or biochemical substance that induces a change to the melting temperature of the DNA.
  • the Tm of the A-T bond is increased, so that the ligation through the A/T overhang is used e.g. by the library construction protocols for lllumina sequencing instruments.
  • the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4- carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B), and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B) and mixtures thereof
  • Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR 4 + .
  • R may be an alkyl, cycloalkyl or an aryl group.
  • quaternary ammonium salts are tetramethylammonium chloride (TMAC) or tetramethylpiperazinium chloride.
  • the concentration of the above compounds in the above methods is about:
  • 50 mM-5 M preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M TMAC;
  • TEAC tetraethylammonium chloride
  • 50 mM-5 M preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M piperazinium chloride;
  • 50 mM-5 M preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetramethylpiperazinium chloride;
  • TMANO trimethylamine N-oxide
  • non-ionic detergents 0.1-1 % non-ionic detergents.
  • the concentration thereof is 50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, and even more preferably 1 M-3.3 M.
  • Test PCR reactions (under identical conditions, but with or without the Tm-modulating agent) may be performed to determine the optimum concentration of each of the above mentioned additives, whereby optimal concentration is the concentration, at which the dsDNA has the highest Tm.
  • the ligation enzyme referred to above in particular the enzyme of step (iv) is a T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, an Ampligase®, or an E. coli DNA- ligase, whereby the T7 DNA ligase, the Ampligase® and the E. coli DNA-ligase only ligate cohesive (sticky) DNA.
  • cohesive (sticky) end ligation such as AT-ligation is envisioned, it is preferable to use T7 DNA ligase or an Ampligase®.
  • step (iv) comprises T4 DNA ligase when blunt ends are to be ligated.
  • the ligation step is carried out at 4-50°C, depending on the optimal temperature for the ligase ' s activity.
  • the preferred ligase temperature is 4-25°C.
  • the ligation temperature is adapted according to the Tm of the DNA substrate to be ligated.
  • said fragments are purified and size-selected on e.g. silicon containing surface of a binding matrix in the presence of a salt, preferably a chaotropic salt.
  • a salt preferably a chaotropic salt.
  • the size of DNA molecules that bind to the binding matrix can be controlled e.g. by the salt concentration or the pH value of the binding mixture.
  • Such purification is e.g. described in WO 2014/122288 A1. Suitable columns for such a size selection method include the GeneReadTM Size Selection Kit.
  • a further DNA size selection method includes agarose gel electrophoresis. The purified fragments may be used directly for subsequent sequencing.
  • the purified fragments may be amplified for library enrichment by PCR- based methods known to the person skilled in the art, or by capture-by-hybridization, i.e. on-array or in-solution hybrid capture; or by capture-by-circularization, i.e. molecular inversion probe-based methods.
  • library enrichment is carried out by PCR amplification.
  • the ligation reaction in gene library generation is characterized in that the first dsDNA used in such ligation reactions comprises ssDNA regions at both of its termini, which may or may not be identical. Preferably, such terminal ssDNA regions are identical.
  • each of the terminal ssDNA regions comprises a terminal adenine.
  • Each of the termini hybridizes under high stringency conditions with a complementary ssDNA region of a second dsDNA, respectively.
  • a second dsDNA is a sequence adaptor. More preferably, such a sequence adaptor comprises a terminal thymidine.
  • a ligation method for gene cloning of two dsDNA is disclosed, each comprising two ssDNA regions, whereby the two ssDNA regions in one dsDNA may be identical or non-identical.
  • the terminal ssDNA regions of the first and the second dsDNA hybridize under high stringency conditions in the presence of Tm-modulating agent.
  • the ssDNA ends of the dsDNA to be ligated are complementary.
  • the ligation is carried out in the presence of Tm-modulating agent, which increases the melting temperature of dsDNA.
  • each of the ssDNA region ends of the first dsDNA ligates with each of the ss region ends of the second dsDNA to provide ligated circular dsDNA in the presence of an agent, which modulates the melting temperature of dsDNA, preferably, such an agent increases the melting temperature of dsDNA.
  • the first DNA or the second DNA is capable of conferring the ability to auto-replicate within competent cells.
  • a ligation yield increase may also be assessed by methods known to the skilled person, such as agarose gel electrophoresis.
  • the nucleotide sequence length of the DNA for ligation reactions, in particular gene cloning, more particular in vitro gene cloning is not restricted, as long as it agrees with the objective of this invention and accomplishes the functional effects of the invention.
  • the appropriate scope of the aforementioned length can be understood by a person skilled in the art in the field of molecular biology.
  • the ratio of DNAs to be ligated is not restricted, and may be any, as long as they are within a range that does not adversely affect the correct ligation of each end.
  • Other ratios of a vector and a gene to be inserted are 1 :2, 1 :5, 1 : 10, and 1 :20. More preferably, such a ratio is 1 :5.
  • the vector DNA is preferably a DNA that can be introduced into a suitable competent cell, wherein it can auto-replicate.
  • Such vectors are selected according to the competent cells into which the ligate is introduced.
  • the commercially available vectors or plasmids can be used.
  • Such vectors include, but are not restricted to pBR322, pQE series (N-terminus vectors: pQE-9, pQE-30, pQE31 , pQE-32, and pQE-40; C-terminus vectors: pQE16, pQE60, pQE-70 (Qiagen), and pUC series (for example, pUC18, pSP64, pGEM-3, pBluescript).
  • yeast such vectors include, but are not restricted to Yep24, Ylp5.
  • vectors When using Bacillus, such vectors include, but are not restricted to pHY300 and PLK. Insect cell expression vectors include, but are not restricted to Easy Xpress plX3.0 and pIX 4.0 (Qiagen). Vectors for E.coli, insect cell, and mammalian cell expression include, but are not restricted to pQE Trisystem vectors (Qiagen).
  • Tm modulator modulates the melting temperature of dsDNA
  • Tm modulators can include any organic molecule or biochemical substance that induces a change to the melting temperature of the DNA.
  • the Tm of the A-T bond is increased, so that the ligation through the A/T overhang is used e.g. by the library construction protocols for lllumina sequencing instruments.
  • the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride,
  • TMAC tetramethylammonium chloride
  • piperazinium chloride tetramethylpiperazinium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B) 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B), and mixtures thereof.
  • the organic compound increases the melting temperature of dsDNA.
  • Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR 4 + .
  • R may be an alkyl, cycloalkyl or an aryl group.
  • such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or
  • 50 mM-5 M preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M TMAC;
  • TEAC tetraethylammonium chloride
  • 50 mM-5 M preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M piperazinium chloride; 50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetramethylpiperazinium chloride;
  • TMANO trimethylamine N-oxide
  • non-ionic detergents 0.1-1 % non-ionic detergents.
  • the tetramethylammonium or tetramethylpiperazinium chloride concentration is 50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, and even more preferably 1 M-3.3 M.
  • Test PCR reactions (under identical conditions, but with or without the Tm-modulating agent) may be performed to determine the optimum concentration of each of the above mentioned additives, whereby optimal concentration is the concentration, at which the dsDNA has the highest Tm.
  • the DNA ligation enzyme referred to above is a T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, an Ampligase®, or an E. coli DNA-ligase, whereby the T7 DNA ligase, the Ampligase® and the E. coli DNA-ligase only ligate cohesive (sticky) DNA.
  • cohesive (sticky) end ligation such as AT-ligation is envisioned
  • Ampligase® is preferred, as its exceptional thermostability permits very high hybridization stringency and ligation specificity.
  • T4 DNA ligase is preferred when blunt ends are to be ligated.
  • the ligation step is carried out at 4-50°C, depending on the optimal temperature for the ligase ' s activity.
  • the preferred ligase temperature is 4-25°C.
  • the ligation temperature is adapted according to the Tm of the DNA substrate to be ligated. Kits
  • kits comprising:
  • the Tm-modulating agent is an organic compound or a biochemical substance.
  • the organic compound increases the melting temperature of dsDNA.
  • Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR 4 + .
  • R may be hydrogen, an alkyl, cycloalkyl or an aryl group.
  • such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or tetramethylpiperazinium chloride.
  • the agent which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • the agent, which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
  • TMAC tetramethylammonium chloride
  • TEAC tetraethylammonium chloride
  • TMANO trimethylamine N-oxide
  • 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B) 2-methyl-4-carbox
  • the agent which modulates the melting temperature of dsDNA is in a ligation buffer.
  • the ligase and the agent which modulates the melting temperature of dsDNA are in separate containers.
  • the polynucleotide kinase enzyme is the T4 Polynucleotide Kinase (PNK) and the enzyme with polymerase and exonuclease activity is the T4 DNA Polymerase and/or the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
  • PNK Polynucleotide Kinase
  • the enzyme with polymerase and exonuclease activity is the T4 DNA Polymerase and/or the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
  • the DNA ligase is a T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, an Ampligase®, or an E. coli DNA-ligase, whereby the T7 DNA ligase, Ampligase® and the E. coli DNA- ligase only ligate cohesive (sticky) DNA. Therefore, more preferably, step (iv) comprises T4 DNA ligase when blunt ends are to be ligated. For cohesive (sticky) end ligation in step (iv), T7 DNA ligase or Ampligase® is preferred. For cohesive (sticky) end ligation under high stringency conditions Ampligase® is preferred as its exceptional thermostability permits high hybridization stringency and ligation specificity.
  • the concentration of the above compounds in the above kits is about:
  • 50 mM-5 M preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M TMAC;
  • TEAC tetraethylammonium chloride
  • 50 mM-5 M preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetramethylpiperazinium chloride;
  • trimethlamine N-oxide TMANO
  • the tetramethylammonium or tetramethylpiperazinium chloride concentration is 50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, and even more preferably 1 M-3.3 M.
  • kits are used for gene cloning.
  • kits are used for generating a sequencing library.
  • EXAMPLES gDNA from E.coli DH10B is sheared to an average fragment size of 300 bp (Covaris S220 Focused-ultrasonicator, Covaris), and 10 pg of sheared DNA is used for each library construction test.
  • GeneReadTM DNA Library Prep I Core Kit, GeneReadTM DNA I Amp Kit, GeneReadTM Adapter I Set 12-Plex (72), and GeneReadTM Size Selection Kit are used according to manufacturer's instructions with the following modifications: 0.5 U of the Taq polymerase (QIAGEN) and 0.5 mM of dATP (QIAGEN) are added to the end-repair reaction; the temperature profile for end-repair reaction is 30 minutes at 25°C, and 30 minutes at 72°C, where the 72°C step was used to both inactivate end-repair enzymes and utilize the terminal transferase activity of the Taq enzyme to add an adenine to the 3' of the DNA fragments. The separate A-addition step using Klenow fragment (3'- 5' exo-) is therefore removed from the protocol.
  • the above amplified product of the test and control samples was qualitatively analyzed by using Agilent Bioanalyzer and High Sensitivity DNA Analysis Kit (Agilent).
  • the above amplified product of the test and control samples was quantitatively analyzed by using qPCR method (QuantiFast Sybr Green Kit, QIAGEN).
  • Table 1 Zero Coverage Regions in sequencing for the library generated with or without TMAC in ligation reaction.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Immunology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

The present invention provides new methods and kits to improve the efficiency of ligation reactions, in particular in molecular biology applications, such as the next generation sequencing (NGS) library construction methods and gene cloning. In next-generation sequencing methods, the ligation step is critical in adding sequencing platform-specific adapters to the DNA fragments that are to be sequenced. Said improvement is achieved by the addition of agents, which modulate the melting temperature of dsDNA.

Description

EFFICIENCY IMPROVING LIGATION METHODS
FIELD OF THE INVENTION
The present invention provides new methods and kits to improve the efficiency of ligation reactions, in particular in molecular biology applications, such as the next generation sequencing (NGS) library construction methods and gene cloning. In next-generation sequencing methods, the ligation step is critical in adding sequencing platform-specific adapters to the DNA fragments that are to be sequenced. Said improvement is achieved by the addition of agents, which modulate the melting temperature of dsDNA.
BACKGROUND OF THE INVENTION
Double-stranded nucleic acids containing blunt ends or cohesive (sticky) ends with an overhang of one or more nucleotides can be joined by means of intermolecular or intramolecular ligation reactions. Examples for the methods for ligating at a specific site are DNA ligation reactions of cohesive ends of DNA fragments, which have been cleaved by a restriction enzyme, or of blunt ends of DNA fragments. Such ligation reactions are commonly used in molecular biology applications, such as next-generation sequencing and gene cloning.
Next-generation sequencing (NGS), also known as high-throughput sequencing allows to acquire genome-wide data using highly parallel sequencing approaches for molecular biology applications, in vitro clinical diagnostics, or for forensics. Such applications include e.g. de novo genome sequencing, transcriptome sequencing and epigenomics, as well as genetic screening for the identification of rare genetic variants and for efficient detection of either inherited or somatic mutations in cancer genes.
Hence, several sequencing platforms have been developed, which allow for low-cost, high-throughput sequencing. Such platforms include lllumina® (Solexa), and Ion torrent: Proton / PGM by Life Technologies/ Thermo Fisher Scientific. NGS technologies, NGS platforms and common applications/fields for NGS technologies are e.g. reviewed in Voelkerding et al. (Clinical Chemistry 55:4 641-658, 2009) and Metzker (Nature
Reviews/Genetics Volume 1 1 , January 2010, pages 31-46).
Three main steps exist in NGS on most current platforms: preparation of the sample for high-throughput sequencing, immobilization on a suitable surface, and the actual sequencing. The preparation step involves random fragmentation of the genomic DNA and addition of adapter sequences to the fragment ends. The commonly used method to generate platform-specific NGS libraries uses multi-step enzymatic reaction protocols to ligate adapters to the DNA fragments to be analyzed.
First, DNA fragments are generated with mechanical, chemical, or enzymatic
fragmentation or by target-specific PCR. Subsequently, the DNA fragments are end- repaired. The end-repair step requires at least two enzymes: (a) a polynucleotide kinase, normally the T4 Polynucleotide Kinase (PNK) that phosphorylates the 5'-terminus of the double stranded DNA fragments; and (b) an enzyme or enzymes with polymerase and exonuclease activities that make the ends of the DNA fragments blunt by either fill-in or trimming reactions, such as e.g. T4 DNA Polymerase. After the end-repair step, for sequencing on platforms, such as those provided by lllumina®, a so-called A-addition step is required, which generates a terminal adenine as a docking site for the
sequencing adapters that have an overhang formed by thymidine nucleotides, i.e. a T- overhang. In this step, an A-overhang is added to the 3'-terminus of the end-repaired PCR product, e.g. by Klenow Fragment exo-, the large fragment of the DNA polymerase I having 5'- 3' polymerase activity, but lacking both 3'- 5' exonuclease activity and 5'- 3' exonuclease activity. Alternatively, the A-addition step can also be facilitated with enzymes having terminal nucleotide transferase activity, such as the Taq polymerase. Following the A-addition step, the sequencing adapter can be ligated to the DNA by a ligase, such as the T4 DNA Ligase. For other sequencing platforms, such as Ion Torrent PGM/ Proton by Life Technologies®, the A-addition step is not required and blunt-ended adapters are ligated by a T4 DNA ligase directly to the end-repaired DNA fragments. The low efficiency of library generation methods can be a draw-back if a sequencing library needs to be constructed from a small amount of input DNA, such as an input amount lower than 1 ng. Most current library construction methods cannot deliver high adaptor-ligation efficiency with such low input amount; therefore, only a small percentage of the input DNA is converted into sequencing library. This could lead to either lack of sufficient library for sequencing, or the library complexity is low and cannot faithfully represent the sequence in the original sample.
Thus, there is a need in the art for sample preparation methods for a NGS library protocol generation and ligation preparation for gene cloning, especially when small amounts of DNA are to be analyzed.
SUMMARY OF THE INVENTION
The present invention relates to agents, which modulate the melting temperature (Tm) of dsDNA in the dsDNA ligation reactions to improve the efficiency and specificity of ligation reactions in molecular biology applications, such as gene library generation and gene cloning. Such ligation reactions may be intermolecular or intramolecular.
In particular, such an improvement is disclosed herein for gene library generation, where the presence of an agent that modulates the melting temperature of dsDNA leads to an enhanced library and specificity. In the context of sequencing library construction, high ligation specificity means that end-repaired DNA fragments and adapters, not DNA fragments or adapters by themselves, are ligated together. This is necessary to prevent sequencing artefacts that can arise from DNA or adapter dimers and concatemers. Surprisingly, the addition of Tm-modulating compounds significantly enhance both the overall yield and the specificity of the ligation product, and hence the overall yield and specificity of a generated NGS library. Similarly, in the presence of said agents in gene cloning, the insert gene DNA and the vector DNA are ligated together, not the insert gene DNA and the vector DNA by themselves. The overall yield increase is at least 70% in the presence of a Tm-modulating agent.
One aspect of the present invention refers to methods of generating dsDNA, wherein the method comprises ligating a first and a second dsDNA, both optionally having one or two single-stranded end(s), in the presence of a DNA ligase and an agent, which modulates the melting temperature of dsDNA.
In some embodiments, the present invention refers to a method for ligating a first and a second ds DNA, wherein both the first and the second dsDNAs comprise two ssDNA regions, whereby each of the ssDNA region ends of the first dsDNA ligates with each of the complementary ss region ends of the second dsDNA to provide ligated circular dsDNA in the presence of a an agent, which modifies the melting temperature of dsDNA. Herein it is preferred that the first or the second DNA is capable of conferring the ability to auto-replicate within competent cells. In other embodiments, the invention relates to a method for generation of a sequencing library, wherein the method comprises the steps of:
(i) providing DNA fragments;
(ii) end-repairing the DNA fragments by a polynucleotide kinase enzyme and an enzyme with polymerase and exonuclease activities to obtain blunt-ended, 5' phosphorylated DNA fragments;
(iii) optionally adding a terminal adenine to the end of the end-repaired DNA fragments by a deoxynucleotidyl transferase enzyme; and
(iv) ligating the DNA fragments, optionally having the terminal adenine, with sequencing adapters wherein preferably the adapters have a terminal thymidine if the fragments have a terminal adenine. In some embodiments, said method further comprises step (v), wherein the ligated fragments of step (iv) are purified and size-selected for sequencing.
In some embodiments, said method further comprises step (vi), wherein the adapter- ligated fragments are amplified and the amplification product is optionally purified prior to sequencing.
In another embodiment, the fragments of step (v) or (vi) are subjected to sequencing. A further aspect of the invention refers to a kit comprising:
(i) a DNA ligase; and
(ii) an agent, which modulates the melting temperature of dsDNA.
In another embodiment, the agent which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
In a preferred embodiment, the agent which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC),
piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
In some embodiments, the agent which modulates the melting temperature of dsDNA is in a ligation buffer.
In some embodiments, the ligase and the agent which modulates the melting temperature of dsDNA are in separate containers. In some embodiments, the above kit further comprises:
(i) a polynucleotide kinase and a DNA polymerase; and
(ii) optionally a deoxynucleotidyl transferase.
In the methods or kits referenced above, the polynucleotide kinase enzyme is the T4 Polynucleotide Kinase (PNK) and the enzyme with polymerase and exonuclease activity is the T4 DNA Polymerase; and/or wherein the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
In the methods or kits referenced above, the agent, which modulates the melting temperature of dsDNA, is an organic compound or a biochemical substance.
In the methods or kits referenced above, the organic compound is selected from tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), and Trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy- 3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and
Triton®X-100, and mixtures thereof.
In another embodiment, in the methods or kits referenced above, the organic compound is selected from tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), and
Trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
Preferably, the agent, which modulates the melting temperature of dsDNA, increases the melting temperature of dsDNA.
More preferably, the compound increasing the melting temperature of dsDNA is tetramethylammonium chloride (TMAC) or tetramethylpiperazinium chloride. In some embodiments, the DNA ligase in any of the above ligations is a T3 DNA ligase or a T4 DNA ligase. In other embodiments, the ligase is a T7 DNA ligase or an
Ampligase®.
In some embodiments of any one of the above methods, each of the first and the second dsDNA have one or two single- stranded DNA (ssDNA) region end(s), wherein this/these ssDNA region end(s) is/are less than 20 nucleotides (nt) in length.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 : TMAC improves ligation efficiency. The Agilent Bioanalyzer graphs below show the size distribution and quantity of the sequencing libraries generated with either standard ligation condition (Fig. 1A, "Control") or additional TMAC in the Ligation reaction (Fig. 1 B, "TMAC in Ligation"). The addition of TMAC in the ligation reaction can significantly improve the yield of the library, as determined by the peak heights at about 500 bp, which is the expected size of adapter-ligated sequencing libraries. Figure 2: TMAC improves ligation efficiency. The diagram below shows the
concentrations of the sequencing library, as determined by qPCR. "Control" refers to a standard ligation condition. "TMAC in Ligation", additional TMAC in the ligation reaction.
DETAILED DESCRIPTION OF THE INVENTION Definitions
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art (e.g., in cell culture, molecular genetics, nucleic acid chemistry, hybridization techniques and biochemistry). In practicing the present invention, many conventional techniques in molecular biology, microbiology, and recombinant DNA may be used. These techniques are well known and are explained in, for example, Current Protocols in Molecular Biology, Volumes 1, 11, and III, 1997 (F. M. Ausubel ed.); Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; DNA Cloning: A Practical Approach, Volumes I and II, 1985 (D. N. Glover ed.); Oligonucleotide Synthesis, 1984 (M. L. Gait ed.); Nucleic Acid Hybridization, 1985, (Hames and Higgins); Transcription and Translation, 1984 (Hames and Higgins eds.); Animal Cell Culture, 1986 (R. I. Freshney ed.); Immobilized Cells and Enzymes, 1986 (IRL Press); Perbal, 1984, A Practical Guide to Molecular Cloning; the series, Methods In Enzymology (Academic Press, Inc.); Gene Transfer Vectors for Mammalian Cells, 1987 (J. H. Miller and M. P. Calos eds., Cold Spring Harbor Laboratory); and Methods in Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., respectively). The term "high stringency" refers to conditions, under which ability of nucleic acids with certain mismatched bases to hybridize is reduced or completely eliminated. Higher stringency conditions result in a higher ratio of the amount of hybridization of sequences with no mismatches when compared to the amount of hybridization of sequences with one or more mismatches. Preferably, under conditions of "high stringency" only exact matches of bases of nucleotides will anneal and stay together.
The term "agent, which modulates the melting temperature of dsDNA" or "Tm-modulating agent" refers to any organic molecule or biochemical substance, such as an amino acid, oligopeptide (2 to 10 amino acids), polypeptide (at least 10 amino acids to about 100 amino acids), protein, lipid, carbohydrate, or a mixture thereof, which increases or which decreases the melting temperature of dsDNA. Preferably, such a compound increases the melting temperature of dsDNA.
The term "melting temperature of double-stranded nucleic acids" is the temperature, at which half of the DNA strands are in the random coil or single-stranded (ssDNA) state, and half of the DNA strands are in a double-stranded state. Tm depends on the length of the DNA molecule and its specific nucleotide sequence, in particular, the guanine (G) and cytosine (C) content. In this context, the double-stranded nucleic acids refer to dsDNA, dsRNA or RNA:DNA hybrids. The melting temperature also depends on the ionic strength of the solution. One may calculate the melting temperature Tm of any given DNA hybrid as shown:
Tm = 81.5°C + 0.41 (%G + %C) - 550/n n = probe length (number of nucleotides). The equation melting temperature used above refers to the melting temperature that was measured under standard conditions (about 0.8M NaCI, neutral pH (about pH 7.0)).
The term "modulate", and "modify" are used herein as synonyms and are used in the present invention to refer to an increase of the melting temperature of an adenine- thymidine-bond (A:T-bond) or to a decrease of the melting temperature of a guanine- cytosine bond (G:C-bond), which results in an overall increase of the melting
temperature of dsDNA or an overall decrease of the melting temperature of dsDNA, respectively. Experimentally, a melting temperature can be measured assessing dissociation- characteristics of double-stranded DNA during heating.
The terms "next generation sequencing" and "high-throughput sequencing" are used as synonyms.
The term "library" refers to a large number of nucleic acid fragments, here the collection of DNA fragments for sequencing analysis. The libraries referred to herein are generated by fragmentation of a sample to be analyzed, end-repairing, optionally addition of a terminal adenine, and ligation of fragments into adapters. Optionally, the purified DNA fragments are amplified or enriched before they are sequenced. The term "high ligation specificity" means that only the desired DNA, such as insert and vector and not the insert or the vector by themselves, or end-repaired DNA fragments and adapters, not DNA fragments or adapters by themselves, are ligated together. The specificity itself can be measured by methods known to the skilled person, such as PCR.
As used herein, the term "about" when used together with a numerical value (e.g., a pH value or a percentage value) is intended to encompass a deviation of 20%, preferably 10%, more preferably 5%, even more preferably of 2%, and most preferably of 1 % from that value. When used together with a numerical value it is at the same time to be understood as individually disclosing that exact numerical value as a preferred embodiment in accordance with the present invention.
The terms "next generation sequencing" and "high-throughput sequencing" are used as synonyms.
The term "restriction endonuclease" is used herein in its commonly accepted sense as a site specific endodeoxyribonuclease and isoschizomers thereof. Restriction
endonucleases are well-known compounds as is the method of their preparation; see for example Roberts, Critical Reviews in Biochemistry, November 1976, pages 123-164. Representative restriction endonucleases which may be employed in the method of the invention include, but are not restricted to: Alu I, Ava I, Ava II, Bal I, Bam HI, Bel I, Bgl I, Bst E II, Eco R I, Hae II, Hae III, Hinc II, Hind II, Hind III, Hinf I, Hha I, Hpa I, Hpa II, Hph I, Hin 389I, Kpn II, Pst I, Rru I, Sau 3A, Sal I, Sma I, Sst I, Sst II, Tac I, Taq I, Xba I, Xho I and the like, many of which are commercially available (e.g. NEB, Promega, Life Technologies, and Thermo Scientific). Other restriction endonucleases which may be employed and their preparation are listed in e.g. Roberts, pages 127-130.
The term "median fragment size" means that half of the fragments have a longer length and half of the fragments have a shorter length. As used herein, the term "comprising" is to be construed as encompassing both
"including" and "consisting of, both meanings being specifically intended, and hence individually disclosed embodiments in accordance with the present invention. "nt" is an abbreviation of "nucleotides".
"bp" is an abbreviation of "base pair".
"T4 Polynucleotide Kinase" refers to an enzyme that catalyzes the transfer and exchange of P, from the γ position of ATP to the 5'-hydroxyl terminus of polynucleotides (double-and single-stranded DNA and RNA) and nucleoside 3'-monophosphates.
"T4 DNA Polymerase" refers to an enzyme that catalyzes the synthesis of DNA in the 5'→3' direction and requires the presence of template and primer. This enzyme has a 3'→5' exonuclease activity which is much more active than that found in DNA
Polymerase I (E. coli). T4 DNA Polymerase does not exhibit 5'→3' exonuclease activity.
"Klenow fragment exo-" or "Klenow fragment (3'→5' exo-)" refers to an N-terminal truncation of DNA Polymerase I which retains polymerase activity, but has lost the 5'→3' exonuclease activity and the 3'→5' exonuclease activity.
"Taq polymerase" refers to a highly thermostable DNA polymerase from the thermophilic bacterium Thermus aquaticus. The enzyme catalyzes 5'→3' synthesis of DNA, has no detectable 3'→5' exonuclease (proofreading) activity and possesses low 5'→3' exonuclease activity. In addition, Taq DNA Polymerase exhibits deoxynucleotidyl transferase activity, which is often applied in the addition of additional adenines at the 3'- end of PCR products to generate 3' adenine overhangs. The terms "deoxynucleotidyl transfer", "terminal nucleotide addition" and "terminal nucleotide transfer" are used herein as synonyms. "T3 DNA ligase" refers to an ATP-dependent dsDNA ligase from bacteriophage T3. It catalyzes the formation of a phosphodiester bond between adjacent 5' phosphate and 3' hydroxyl groups of duplex DNA. The enzyme joins both cohesive (sticky) and blunt ends. "T4 DNA Ligase" refers to an enzyme that catalyzes the formation of a phosphodiester bond between juxtaposed 5' phosphate and 3' hydroxyl termini in double-stranded DNA or RNA. This enzyme joins both blunt end and cohesive (sticky) ends.
"T7 DNA Ligase" is an ATP-dependent ligase from bacteriophage T7. This enzyme joins cohesive (sticky) ends and it is suitable for nick sealing. Blunt-end ligation does not occur in the presence of T7 ligase.
"Ampligase®" refers to a DNA Ligase that catalyzes NAD-dependent ligation of adjacent 3'-hydroxylated and 5'-phosphorylated termini in duplex DNA structures that are stable at high temperatures. The enzyme that catalyzes the formation of a phosphodiester bond between juxtaposed 5' phosphate and 3' hydroxyl termini in double-stranded DNA or RNA. This enzyme joins cohesive (sticky) ends. The half-life of Ampligase® is 48 hours at 65°C and more than 1 hour at 95°C. In most cases, the upper limit on reaction temperatures with Ampligase® is determined by the Tm of the DNA substrate. Under conditions of maximal hybridization stringency, nonspecific ligation is nearly eliminated.
The term "ligation buffer" refers to a conventional buffer for DNA ligation known to the skilled person. The ligation buffer can comprise, for example, 50 mM Tris-HCI, 10 mM MgCI2, 1 mM ATP, 10 mM DTT, and a pH of 7.5 at 25 °C.
The term "PCR" refers to polymerase chain reaction, which is a standard method in molecular biology for DNA amplification.
The term "qPCR" refers to quantitative real-time PCR, a method used to amplify and simultaneously detect the amount of amplified target DNA molecule fragments. The process involves PCR to amplify one or more specific sequences in a DNA sample. At the same time, a detectable probe, typically a fluorescent probe, is included in the reaction mixture to provide real-time quantification. Two commonly used fluorescent probes for quantification of real-time PCR products are: (1 ) non-sequence-specific fluorescent dyes (e.g., SYBR® Green) that intercalate into double-stranded DNA molecules in a sequence non-specific manner, and (2) sequence-specific DNA probes (e.g., oligonucleotides labeled with fluorescent reporters) that permit detection only after hybridization with the DNA targets or after incorporation into PCR products.
The term "DNA" in the present invention relates to any one of viral DNA, prokaryotic DNA, archaeal DNA, and eukaryotic DNA. The DNA may also be obtained from any one of viral RNA, and mRNA from prokaryotes, archaea, and eukaryotes by generating complementary DNA (cDNA) by using a reverse transcriptase.
Agents that modify the melting temperature of dsDNA The melting temperature can be measured experimentally by assessing dissociation- characteristics of double-stranded DNA during heating.
The energy required to break the base-base hydrogen bonding between two strands of DNA is dependent on their length, GC content and their complementarity. With increased heating of a reaction-mixture that contains double-stranded DNA sequences, the amount of double-stranded DNA decreases. The difference between the denaturing conditions of a double-stranded DNA with a completely complementary sequence and the conditions of a double-stranded DNA with an almost complementary but not completely
complementary sequence can be detected as the difference in absorbance change when such double-stranded DNA is denatured into a single-stranded DNA (melting curve) (for example, see I. V. Razlutuskii, L. S. Shlyakhtenko and Yu. L. lyubchenko: Nucleic Acids Research, Vol. 15, No. 16, pp. 6665-6676 (1987)).
Agents that modify the melting temperature can be identified by measuring the melting curve of a DNA sample under (i) standard conditions and under (ii) identical conditions as in (i) including a Tm-modulating agent, and by comparing the measured melting curve data as described above for double-stranded DNA with a completely complementary sequence and the conditions of a double-stranded DNA with an almost complementary but not completely complementary sequence.
The dissociation can be visualized by UV spectroscopy, or by fluorescence
measurements, where a fluorescent dye is used for readout, such as SYBR® Green I, YO-PRO-I®, or ethidium bromide.
For example, SYBR green fluoresces with 1000-fold higher intensity, when it intercalates into the minor groove of dsDNA.
Alternatively, dual hybridization probes may be used, whereby said probes attach to single-stranded DNA. Two probes are designed to hybridize to adjacent sequences of the target DNA. The probes are labeled with a pair of dyes that allow for fluorescence resonance energy transfer (FRET). The donor dye is e.g. attached to the 3' end of the first probe, while the acceptor dye is attached to the 5' end of the second probe.
The agent, which modulates the melting temperature in dsDNA is an organic compound or a biochemical substance. Compounds that have been identified to modulate the melting temperature of dsDNA and thereby improving efficacy and specificity of DNA ligation include, but are not restricted to compounds such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), and trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
Preferably, the organic compound increases the melting temperature of dsDNA. Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR4 +. R may be an alkyl, cycloalkyl or an aryl group. Preferably, such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or
tetramethylpiperazinium chloride. Methods
The present invention refers to ligation methods, in particular to gene cloning methods and methods of generating sequencing libraries.
In particular, the method referred herein is characterized in that the ligation step efficiency and specificity is increased by applying an agent to a ligation reaction, which modulates the melting temperature of dsDNA. Such a ligation step is a critical step in both gene cloning and in generating next generation sequencing libraries. In preferred embodiments, the ligation is carried out in the presence of an agent or more than one agent, which increase(s) the melting temperature of dsDNA.
Next generation sequencing
In some embodiments, the invention relates to a method of generating a sequencing library, wherein the method comprises the steps of:
(i) providing DNA fragments;
(ii) end-repairing the DNA fragments by a polynucleotide kinase enzyme and an enzyme with polymerase and exonuclease activities, preferably DNA polymerase, to obtain blunt- ended, 5' phosphorylated DNA fragments;
(iii) optionally adding a terminal adenine to the end of the end-repaired DNA fragments by a deoxynucleotidyl transferase enzyme; and
(iv) ligating the DNA fragments, optionally having the terminal adenine base, with sequencing adapters wherein preferably the adapters have a terminal thymidine if the fragments have a terminal adenine.
In preferred embodiments, the ligation is carried out under high stringency conditions. In some embodiments, said method further comprises step (v), wherein the ligated fragments of step (iv) are purified and size-selected for sequencing. In some embodiments, said method further comprises step (vi), wherein the adapter- ligated fragments are amplified and the amplification product is optionally purified prior to sequencing. The library fragments of step (v) or step (vi) are subsequently sequenced by using sequencing platforms known to the person skilled in the art, such as lllumina® (Solexa) and Ion Torrent: Proton / PGM by Life Technologies/Thermo Fisher Scientific or other suitable high-throughput sequencing platforms.
The size of the DNA fragment length is a key factor for gene library construction and for sequencing. Typical median lengths of DNA fragments for NGS libraries are between about 150 bps and about 1000 bps, preferably between about 150 bps and about 600 bps, more preferably between about 200 bps and about 500 bps. Most preferably, the median length is about 200 bps, about 300 bps, or about 500 bps.
The preferred amount of DNA starting material for generating a NGS sequencing library and for subsequent sequence analysis ranges from about 1 pg to about 1 μg, preferably from about 10 pg to about 1 μg, and more preferably about 10 pg to about 1 ng. For genomic DNA analysis, the amount of starting material is preferably about 1 pg to about 1 μg, preferably from about 10 pg to about 1 μg, and more preferably about 10 pg to about 1 ng.
In some embodiments, the fragmentation step is mechanical. Preferably, the mechanical fragmentation is among others achieved by ultrasonic acoustic shearing, nebulization forces, sonication, hydrodynamic shearing (e.g. in French pressure cells or by needle shearing). More preferably, specific median fragment length sizes of DNA can be prepared e.g. by ultrasonic acoustic shearing, such as Adaptive Focused Acoustics (AFA)™ by using a Covaris® instrument, according to the manufacturer's instructions. In some embodiments, the fragmentation of DNA step is chemical. Chemical shear may also be employed for the breakup of long RNA fragments. This is typically performed through heat digestion of RNA with a divalent metal cation (magnesium or zinc). The length of the RNA (1 15 bp-350 bp) can be adjusted by increasing or decreasing the time of incubation. In some embodiments, the fragmentation step is enzymatic. Preferably, said enzymatic fragmentation is achieved by digestion of DNA by an endonuclease. Such endonucleases are described in more detail in the Definitions section. Preferably, the fragmentation may also be carried out by employing a transposase known to the person skilled in the art. When applying enzymes for the fragmentation reaction, said fragmentation step may be inactivated by heat.
Step (ii), the end-repair step, is carried out by an enzyme or two enzymes with (a) polynucleotide kinase activity (PNK) and (b) an enzyme with polymerase and exonuclease activities, whereby the exonuclease activity makes the ends of the DNA blunt by fill-in or trimming reactions. Preferably, the enzymes of step (ii) comprise a T4 Polynucleotide Kinase (PNK) and a T4 DNA Polymerase.
Step (iii), the A-addition step, is carried out by an enzyme, which generates an adenine docking site for adapters that have a thymidine overhang (T-overhang). Preferably, the enzyme of step (iii) is a Taq polymerase or Klenow Fragment exo-, the large fragment of the DNA polymerase I having 5'- 3' polymerase activity but lacking both 3'- 5' exonuclease activity and 5'- 3' exonuclease activity. In some preferred embodiments, the enzyme of step (iii) is a thermostable polymerase, preferably a Taq polymerase.
Step (iv), the ligation step for gene library generation, joins either blunt or cohesive (sticky) ends of DNA fragments with either blunt or cohesive (sticky) ends of adapter molecules. Successful ligation of cohesive (sticky) ends requires complementary sequences.
In some embodiments, the length of the nucleotide sequences of the ssDNA region ends of dsDNA for the ligation methods in gene library generation referred to above is less than 20 nt or less than 12 nt, preferably the sequence length is less than 10 nt or less than 8 nt, more preferably 1-6 nt or 1-5 nt. In preferred embodiments, the ssDNA length is 1 nt. In embodiments, where the ssDNA region is 1 nt, ssDNA region of one DNA comprises a terminal (3') adenine (A) and a the complementary ssDNA of the other DNA comprises a terminal (3') thymidine (T). Alternatively the terminal ssDNA regions are (3') cytosine (C) and the complementary terminal (3') guanine (G).
In preferred embodiments, a fragment comprising terminal, i.e. 3' adenine overhangs serves as a docking site for the sequencing adapters, which comprise a complementary terminal, i.e. 3' thymidine overhang. By using such TA cloning it is not necessary to design a specific pair of primers for each DNA fragment to be analyzed. The same primers can be used for amplification of different templates provided that each template is modified by addition of the same universal primer-binding sequences to its 5' and 3' ends. The adapter sequence can therefore be any DNA fragment of interest, as long as it has a 3' thymidine overhang. The efficiency of the ligation step regarding specificity and library yield is improved by the addition of a molecule, which modulates the melting temperature of dsDNA (Tm modulator). Such Tm-modulators can include any organic molecule or biochemical substance that induces a change to the melting temperature of the DNA. In preferred embodiments, the Tm of the A-T bond is increased, so that the ligation through the A/T overhang is used e.g. by the library construction protocols for lllumina sequencing instruments.
In the methods referenced above, the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof. Preferably, in the methods referenced above, the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4- carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B), and mixtures thereof. Preferably, the organic compound increases the melting temperature of dsDNA. Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR4 +. R may be an alkyl, cycloalkyl or an aryl group. Preferably, such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or tetramethylpiperazinium chloride.
The concentration of the above compounds in the above methods is about:
50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M TMAC;
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetraethylammonium chloride (TEAC);
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M piperazinium chloride;
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetramethylpiperazinium chloride;
>50mM trimethylamine N-oxide (TMANO);
0.5-3.5 M 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine (THP(A));
0.5-4 M 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine (THP(B));
0.1-1 % non-ionic detergents.
In embodiments, where TMAC or tetramethylpiperazinium chloride is the modulating agent, the concentration thereof is 50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, and even more preferably 1 M-3.3 M. Test PCR reactions (under identical conditions, but with or without the Tm-modulating agent) may be performed to determine the optimum concentration of each of the above mentioned additives, whereby optimal concentration is the concentration, at which the dsDNA has the highest Tm.
Preferably, the ligation enzyme referred to above, in particular the enzyme of step (iv) is a T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, an Ampligase®, or an E. coli DNA- ligase, whereby the T7 DNA ligase, the Ampligase® and the E. coli DNA-ligase only ligate cohesive (sticky) DNA. In embodiments, where cohesive (sticky) end ligation, such as AT-ligation is envisioned, it is preferable to use T7 DNA ligase or an Ampligase®. For cohesive (sticky) end ligation under high stringency conditions Ampligase® is preferred, as its exceptional thermostability reduces the hybridization of mismatched base pairs. Preferably, step (iv) comprises T4 DNA ligase when blunt ends are to be ligated.
The ligation step is carried out at 4-50°C, depending on the optimal temperature for the ligase's activity. For T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, and E.coli DNA ligase, the preferred ligase temperature is 4-25°C. In embodiments, where Ampligase® is used, the ligation temperature is adapted according to the Tm of the DNA substrate to be ligated.
In gene library generation methods, after generating ligated fragments, said fragments are purified and size-selected on e.g. silicon containing surface of a binding matrix in the presence of a salt, preferably a chaotropic salt. The size of DNA molecules that bind to the binding matrix can be controlled e.g. by the salt concentration or the pH value of the binding mixture. Such purification is e.g. described in WO 2014/122288 A1. Suitable columns for such a size selection method include the GeneRead™ Size Selection Kit. A further DNA size selection method includes agarose gel electrophoresis. The purified fragments may be used directly for subsequent sequencing. Alternatively, prior to the sequencing step, the purified fragments may be amplified for library enrichment by PCR- based methods known to the person skilled in the art, or by capture-by-hybridization, i.e. on-array or in-solution hybrid capture; or by capture-by-circularization, i.e. molecular inversion probe-based methods. Preferably, library enrichment is carried out by PCR amplification. In some embodiments, the ligation reaction in gene library generation is characterized in that the first dsDNA used in such ligation reactions comprises ssDNA regions at both of its termini, which may or may not be identical. Preferably, such terminal ssDNA regions are identical. More preferably, each of the terminal ssDNA regions comprises a terminal adenine. Each of the termini hybridizes under high stringency conditions with a complementary ssDNA region of a second dsDNA, respectively. Preferably, such a second dsDNA is a sequence adaptor. More preferably, such a sequence adaptor comprises a terminal thymidine. Gene Cloning
In some embodiments, a ligation method for gene cloning of two dsDNA (intermolecular) is disclosed, each comprising two ssDNA regions, whereby the two ssDNA regions in one dsDNA may be identical or non-identical. The terminal ssDNA regions of the first and the second dsDNA hybridize under high stringency conditions in the presence of Tm-modulating agent. Preferably, the ssDNA ends of the dsDNA to be ligated are complementary. In more preferred embodiments, the ligation is carried out in the presence of Tm-modulating agent, which increases the melting temperature of dsDNA. In gene cloning methods, each of the ssDNA region ends of the first dsDNA ligates with each of the ss region ends of the second dsDNA to provide ligated circular dsDNA in the presence of an agent, which modulates the melting temperature of dsDNA, preferably, such an agent increases the melting temperature of dsDNA. In preferred embodiments, the first DNA or the second DNA is capable of conferring the ability to auto-replicate within competent cells. The use of ligating nucleic acids in the presence or an agent, which modulates the melting temperature of DNA results in an increased number of transformed host cells after transformation with the ligated molecules with chemically transformed host cells or with host cells transformed by electroporation. A ligation yield increase may also be assessed by methods known to the skilled person, such as agarose gel electrophoresis. The nucleotide sequence length of the DNA for ligation reactions, in particular gene cloning, more particular in vitro gene cloning is not restricted, as long as it agrees with the objective of this invention and accomplishes the functional effects of the invention. The appropriate scope of the aforementioned length can be understood by a person skilled in the art in the field of molecular biology.
The ratio of DNAs to be ligated is not restricted, and may be any, as long as they are within a range that does not adversely affect the correct ligation of each end. In embodiments for cloning of a specific gene, it is preferable to use the DNA to be ligated in a concentration that is equimolar to the DNA comprising the whole or partial gene. Other ratios of a vector and a gene to be inserted are 1 :2, 1 :5, 1 : 10, and 1 :20. More preferably, such a ratio is 1 :5.
When ligating two dsDNA, such as vector DNA and insert gene DNA, the vector DNA is preferably a DNA that can be introduced into a suitable competent cell, wherein it can auto-replicate.
Such vectors are selected according to the competent cells into which the ligate is introduced. For example, for E.coli, the commercially available vectors or plasmids can be used. Such vectors include, but are not restricted to pBR322, pQE series (N-terminus vectors: pQE-9, pQE-30, pQE31 , pQE-32, and pQE-40; C-terminus vectors: pQE16, pQE60, pQE-70 (Qiagen), and pUC series (for example, pUC18, pSP64, pGEM-3, pBluescript). When using yeast as said cells, such vectors include, but are not restricted to Yep24, Ylp5. When using Bacillus, such vectors include, but are not restricted to pHY300 and PLK. Insect cell expression vectors include, but are not restricted to Easy Xpress plX3.0 and pIX 4.0 (Qiagen). Vectors for E.coli, insect cell, and mammalian cell expression include, but are not restricted to pQE Trisystem vectors (Qiagen).
The efficiency of the ligation step regarding specificity and library yield is improved by the addition of a molecule, which modulates the melting temperature of dsDNA (Tm modulator). Such Tm-modulators can include any organic molecule or biochemical substance that induces a change to the melting temperature of the DNA. In preferred embodiments, the Tm of the A-T bond is increased, so that the ligation through the A/T overhang is used e.g. by the library construction protocols for lllumina sequencing instruments. The organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
Preferably, in the methods referenced above, the organic compound is selected from any one of the following: quaternary ammonium salts, such as tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride,
tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4- carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B), and mixtures thereof.
Preferably, the organic compound increases the melting temperature of dsDNA. Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR4 +. R may be an alkyl, cycloalkyl or an aryl group. Preferably, such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or
tetramethylpiperazinium chloride. The concentration of the above compounds in the above methods is about:
50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M TMAC;
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetraethylammonium chloride (TEAC);
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M piperazinium chloride; 50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetramethylpiperazinium chloride;
>50mM trimethylamine N-oxide (TMANO);
0.5-3.5 M 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine (THP(A));
0.5-4 M 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine (THP(B));
0.1-1 % non-ionic detergents.
In embodiments, where TMAC or tetramethylpiperazinium chloride is the modulating agent, the tetramethylammonium or tetramethylpiperazinium chloride concentration is 50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, and even more preferably 1 M-3.3 M.
Test PCR reactions (under identical conditions, but with or without the Tm-modulating agent) may be performed to determine the optimum concentration of each of the above mentioned additives, whereby optimal concentration is the concentration, at which the dsDNA has the highest Tm.
Preferably, the DNA ligation enzyme referred to above is a T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, an Ampligase®, or an E. coli DNA-ligase, whereby the T7 DNA ligase, the Ampligase® and the E. coli DNA-ligase only ligate cohesive (sticky) DNA. In embodiments, where cohesive (sticky) end ligation, such as AT-ligation is envisioned, it is preferable to use T7 DNA ligase or Ampligase®. For cohesive (sticky) end ligation under high stringency conditions Ampligase® is preferred, as its exceptional thermostability permits very high hybridization stringency and ligation specificity. T4 DNA ligase is preferred when blunt ends are to be ligated.
The ligation step is carried out at 4-50°C, depending on the optimal temperature for the ligase's activity. For T3 DNA ligase, T4 DNA ligase, T7 DNA ligase and E.coli DNA ligase, the preferred ligase temperature is 4-25°C. In embodiments, where Ampligase® is used, the ligation temperature is adapted according to the Tm of the DNA substrate to be ligated. Kits
Another aspect of the invention relates to kits comprising:
(i) a DNA ligase; and
(ii) an agent, which modulates the melting temperature of dsDNA.
In the kits referenced above, the Tm-modulating agent is an organic compound or a biochemical substance. Preferably, the organic compound increases the melting temperature of dsDNA. Such compounds include, but are not restricted to quaternary ammonium salts having the structure NR4 +. R may be hydrogen, an alkyl, cycloalkyl or an aryl group. Preferably, such quaternary ammonium salts are tetramethylammonium chloride (TMAC) or tetramethylpiperazinium chloride.
In some embodiments, the agent which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
In a preferred embodiment, the agent, which modulates the melting temperature of dsDNA, is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof. In some embodiments, said invention relates to kits additionally comprising:
(i) a polynucleotide kinase and an enzyme, with polymerase and exonuclease activities, preferably DNA polymerase; and (ii) optionally a deoxynucleotidyl transferase;
In some embodiments, the agent which modulates the melting temperature of dsDNA is in a ligation buffer.
In a preferred embodiment, in the disclosed kits, the ligase and the agent which modulates the melting temperature of dsDNA are in separate containers.
In preferred embodiments of the kits, the polynucleotide kinase enzyme is the T4 Polynucleotide Kinase (PNK) and the enzyme with polymerase and exonuclease activity is the T4 DNA Polymerase and/or the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
The DNA ligase is a T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, an Ampligase®, or an E. coli DNA-ligase, whereby the T7 DNA ligase, Ampligase® and the E. coli DNA- ligase only ligate cohesive (sticky) DNA. Therefore, more preferably, step (iv) comprises T4 DNA ligase when blunt ends are to be ligated. For cohesive (sticky) end ligation in step (iv), T7 DNA ligase or Ampligase® is preferred. For cohesive (sticky) end ligation under high stringency conditions Ampligase® is preferred as its exceptional thermostability permits high hybridization stringency and ligation specificity.
The concentration of the above compounds in the above kits is about:
50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M TMAC;
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetraethylammonium chloride (TEAC);
50 mM-5 M, preferably 100 mM-3.5 M. more preferably 500 mM-3.3 M, even more preferably 1 M-3.3 M tetramethylpiperazinium chloride;
>50mM trimethlamine N-oxide (TMANO);
0.5-3.5 M 2-methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine (THP(A));
0.5-4 M 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine (THP(B));
0.1-1 % non-ionic detergents. In embodiments, where TMAC is the modulating agent, the tetramethylammonium or tetramethylpiperazinium chloride concentration is 50 mM-5 M, preferably 100 mM-3.5 M, more preferably 500 mM-3.3 M, and even more preferably 1 M-3.3 M.
In preferred embodiments, such kits are used for gene cloning.
In other embodiments, such kits are used for generating a sequencing library.
EXAMPLES gDNA from E.coli DH10B is sheared to an average fragment size of 300 bp (Covaris S220 Focused-ultrasonicator, Covaris), and 10 pg of sheared DNA is used for each library construction test. GeneRead™ DNA Library Prep I Core Kit, GeneRead™ DNA I Amp Kit, GeneRead™ Adapter I Set 12-Plex (72), and GeneRead™ Size Selection Kit (all from QIAGEN) are used according to manufacturer's instructions with the following modifications: 0.5 U of the Taq polymerase (QIAGEN) and 0.5 mM of dATP (QIAGEN) are added to the end-repair reaction; the temperature profile for end-repair reaction is 30 minutes at 25°C, and 30 minutes at 72°C, where the 72°C step was used to both inactivate end-repair enzymes and utilize the terminal transferase activity of the Taq enzyme to add an adenine to the 3' of the DNA fragments. The separate A-addition step using Klenow fragment (3'- 5' exo-) is therefore removed from the protocol.
0.05 μΜ of sequencing adapter was used in the ligation steps.
Following the ligation step, the library was first purified with the GeneRead Size
Selection Kit (QIAGEN), then amplified for 22 cycles with adapter-specific primers in PCR (GeneRead DNA I Amp Kit, QIAGEN), and purified again with GeneRead Size Selection Kit (QIAGEN). The final sequencing libraries were qualified with Agilent Bioanalyzer High Sensitivity DNA Analysis Kit (Agilent) and quantified with the qPCR method (QuantiFast Sybr Green Kit, QIAGEN). The sequencing libraries were also sequenced on a miSeq instrument (lllumina) using a dual-layer 300 nt Flow Cell and lllumina® miSeq Reagent Kit V2 (300). Sequencing data were analyzed with CLC Genomics Workbench software (QIAGEN).
As a proof of principle, either standard ligation condition was used or TMAC was added at the final concentration of 0.1 mM in the ligation step. As shown in Figure 1 and Figure 2, both Agilent Bioanalyzer and qPCR results demonstrated that the addition of the TMAC into the ligation reaction ("TMAC in Ligation") could significantly increase the library yield compared to the ligation reaction without additional TMAC ("Control"). Furthermore, sequencing data showed that the addition of the TMAC in the sequencing significantly reduces the number of regions with zero coverage. This demonstrated that the addition of TMAC in the ligation reaction enables a higher percentage of the genomic DNA regions to be ligated to the sequencing adaptors and sequenced.
Example 1 :
The above amplified product of the test and control samples was qualitatively analyzed by using Agilent Bioanalyzer and High Sensitivity DNA Analysis Kit (Agilent).
Example 2:
The above amplified product of the test and control samples was quantitatively analyzed by using qPCR method (QuantiFast Sybr Green Kit, QIAGEN).
Example 3:
The sequencing libraries constructed with or without TMAC were sequenced. Sequence data were mapped to the E.coli DH10B. The number of Zero Coverage Regions, which reflected the drop-out of genomic regions during library construction and in turn sequencing, was significantly reduced by the addition of TMAC in the ligation (from 127 Zero Coverage Regions No TMAC TMAC
Count 127 28
Minimum length 1 1
Maximum length 115 82
Mean length 23,05 24,11
Standard deviation 20,9 21,19
Total length 2.927 675
Table 1 : Zero Coverage Regions in sequencing for the library generated with or without TMAC in ligation reaction.

Claims

A method of generating double-stranded DNA (dsDNA), wherein the method comprises ligating a first and a second dsDNA, both optionally having one or two single-stranded end(s), in the presence of a DNA ligase and an agent, which modulates the melting temperature of dsDNA, wherein the agent is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and Triton®X-100, and mixtures thereof.
The method of claim 1 , wherein the agent, which modulates the melting temperature of dsDNA is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2- methyl-4-carboxy-5-hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4- carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
The method of claims 1 or 2, wherein the method comprises the steps of:
(i) providing DNA fragments;
(ii) end-repairing the DNA fragments using a polynucleotide kinase and an enzyme with polymerase and exonuclease activities to obtain end-repaired DNA fragments;
(iii) optionally adding a terminal adenine to the end of the end-repaired DNA fragments using a deoxynucleotidyl transferase enzyme; and
(iv) ligating the DNA fragments, optionally having the terminal adenine, with sequencing adapters, wherein preferably the adapters have a terminal thymidine if the fragments have a terminal adenine. The method of claim 3, further comprising step (v), wherein the ligated fragments of step (iv) are purified and size-selected.
The method of claim 4, further comprising step (vi), wherein the adapter-ligated fragments are amplified and the amplification product is optionally purified.
A kit comprising:
(i) a DNA ligase; and
(ii) an agent, which modulates the melting temperature of dsDNA, wherein the agent is selected from any one of tetramethylammonium chloride (TMAC), piperazinium chloride, tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5- hydroxy-3,4,5,6-tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3, 4,5,6- tetrahydropyrimidine THP(B), non-ionic detergents, such as NP-40, and
Triton®X-100, and mixtures thereof.
The kit of claim 6, wherein the agent is selected from any one of
tetramethylammonium chloride (TMAC), piperazinium chloride,
tetramethylpiperazinium chloride, tetraethylammonium chloride (TEAC), trimethylamine N-oxide (TMANO), 2-methyl-4-carboxy-5-hydroxy-3, 4,5,6- tetrahydropyrimidine THP(A), 2-methyl-4-carboxy-3,4,5,6-tetrahydropyrimidine THP(B), and mixtures thereof.
The kit of claims 6 or 7, wherein the ligase and the agent are in separate containers.
The kit of claims 6 to 8, wherein the kit further comprises:
(i) a polynucleotide kinase and an enzyme with polymerase and exonuclease activities; and
(ii) optionally a deoxynucleotidyl transferase.
The method of any one of claims 3-5, or the kit of claim 9, wherein (i) the polynucleotide kinase is the T4 Polynucleotide Kinase (PNK);
(ii) the enzyme with polymerase and exonuclease activity is the T4 DNA
Polymerase; and/or
(iii) the deoxynucleotidyl transferase enzyme is a Taq polymerase or a Klenow Fragment exo-.
1 1 . The method of claims 1 or 2, wherein each of dsDNAs comprises two single- stranded DNA ends, whereby each of the ss region ends of the first dsDNA ligates with each of the complementary ssDNA region ends of the second dsDNA, respectively, to provide ligated circular dsDNA, wherein preferably the first or the second DNA is capable of conferring the ability to auto-replicate within competent cells.
12. The method of any one of claims 1-5 and 10-1 1 , or the kit of any one of claims 6-9, wherein the agent is tetramethyl ammonium chloride (TMAC) or tetramethylpiperazinium chloride.
13. The method of any one of claims 1-5 and 10-12, or the kit of any one of claims 6-9, wherein the DNA ligase is a T3 DNA ligase or a T4 DNA ligase.
14. The method of any one of claims 1-5 and 10-12, or the kit of any one of claims 6-9, wherein the DNA ligase is a T7 DNA ligase or an Ampligase®.
15. The method of any one of claims 1-5 and 10-14, wherein each of the one or two single-stranded DNA (ssDNA) region end(s), is/are less than 20 nucleotides (nt) in length.
PCT/EP2016/059068 2015-04-22 2016-04-22 Efficiency improving ligation methods WO2016170147A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP15164711.2 2015-04-22
EP15164711 2015-04-22

Publications (1)

Publication Number Publication Date
WO2016170147A1 true WO2016170147A1 (en) 2016-10-27

Family

ID=52997954

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2016/059068 WO2016170147A1 (en) 2015-04-22 2016-04-22 Efficiency improving ligation methods

Country Status (1)

Country Link
WO (1) WO2016170147A1 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018183918A1 (en) * 2017-03-30 2018-10-04 Grail, Inc. Enhanced ligation in sequencing library preparation
WO2020023493A1 (en) * 2018-07-24 2020-01-30 Ripple Biosolutions Llc Methods and composition for targeted genomic analysis
WO2020150656A1 (en) 2017-08-07 2020-07-23 The Johns Hopkins University Methods for assessing and treating cancer
US10982351B2 (en) 2016-12-23 2021-04-20 Grail, Inc. Methods for high efficiency library preparation using double-stranded adapters
US11118222B2 (en) 2017-03-31 2021-09-14 Grail, Inc. Higher target capture efficiency using probe extension
US11180803B2 (en) 2011-04-15 2021-11-23 The Johns Hopkins University Safe sequencing system
EP3839046A4 (en) * 2018-08-07 2021-11-24 GeneMind Biosciences Company Limited Method for ligating nucleic acid fragments, method for constructing sequencing library, and use
US11286531B2 (en) 2015-08-11 2022-03-29 The Johns Hopkins University Assaying ovarian cyst fluid
US11525163B2 (en) 2012-10-29 2022-12-13 The Johns Hopkins University Papanicolaou test for ovarian and endometrial cancers
US11680293B1 (en) 2022-04-21 2023-06-20 Paragon Genomics, Inc. Methods and compositions for amplifying DNA and generating DNA sequencing results from target-enriched DNA molecules
US12091715B2 (en) 2022-04-21 2024-09-17 Paragon Genomics, Inc. Methods and compositions for reducing base errors of massive parallel sequencing using triseq sequencing

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999004041A9 (en) * 1997-07-17 1999-04-15 Tm Technologies Inc Methods and compositions for modulating melting temperatures of nucleic acids
US20040209299A1 (en) * 2003-03-07 2004-10-21 Rubicon Genomics, Inc. In vitro DNA immortalization and whole genome amplification using libraries generated from randomly fragmented DNA
WO2011156529A2 (en) * 2010-06-08 2011-12-15 Nugen Technologies, Inc. Methods and composition for multiplex sequencing
WO2013022504A1 (en) * 2011-05-06 2013-02-14 New England Biolabs, Inc. Ligation enhancement
WO2014122288A1 (en) 2013-02-08 2014-08-14 Qiagen Gmbh Method for separating dna by size

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999004041A9 (en) * 1997-07-17 1999-04-15 Tm Technologies Inc Methods and compositions for modulating melting temperatures of nucleic acids
US20040209299A1 (en) * 2003-03-07 2004-10-21 Rubicon Genomics, Inc. In vitro DNA immortalization and whole genome amplification using libraries generated from randomly fragmented DNA
WO2011156529A2 (en) * 2010-06-08 2011-12-15 Nugen Technologies, Inc. Methods and composition for multiplex sequencing
WO2013022504A1 (en) * 2011-05-06 2013-02-14 New England Biolabs, Inc. Ligation enhancement
WO2014122288A1 (en) 2013-02-08 2014-08-14 Qiagen Gmbh Method for separating dna by size

Non-Patent Citations (20)

* Cited by examiner, † Cited by third party
Title
"Animal Cell Culture", 1986
"Current Protocols in Molecular Biology", vol. I, II, I, 1997
"Gene Transfer Vectors for Mammalian Cells", 1987, COLD SPRING HARBOR LABORATORY
"Immobilized Cells and Enzymes", 1986, IRL PRESS
"Methods in Enzymology", vol. 154, 155
"Oligonucleotide Synthesis", vol. I, II, 1984, article "DNA Cloning: A Practical Approach"
"Transcription and Translation", 1984
ANONYMOUS: "Fast-Link(TM) DNA Ligation and Screening Kits", 1 June 2012 (2012-06-01), XP055216393, Retrieved from the Internet <URL:http://www.epibio.com/docs/default-source/protocols/fast-link-dna-ligation-and-screening-kits.pdf?sfvrsn=6> [retrieved on 20150928] *
ANONYMOUS: "Ligation (molecular biology)", 14 April 2015 (2015-04-14), XP055284400, Retrieved from the Internet <URL:https://en.wikipedia.org/w/index.php?title=Special:Book&bookcmd=download&collection_id=68a742704d90a5e213c373da761300ff1d6f4902&writer=rdf2latex&return_to=Ligation+(molecular+biology)> [retrieved on 20160629] *
ANONYMOUS: "Ligation (molecular biology)", 14 April 2015 (2015-04-14), XP055284401, Retrieved from the Internet <URL:https://en.wikipedia.org/w/index.php?title=Ligation_(molecular_biology)&oldid=656484983> [retrieved on 20160629] *
HAMES; HIGGINS, NUCLEIC ACID HYBRIDIZATION, 1985
HONG-KHANH NGUYEN ET AL: "Smoothing of the thermal stability of DNA dupexes by using modified nucleosides and chaotropic agents", NUCLEIC ACIDS RESEARCH, INFORMATION RETRIEVAL LTD, vol. 27, no. 6, 1 January 1999 (1999-01-01), pages 1492 - 1498, XP002245640, ISSN: 0305-1048, DOI: 10.1093/NAR/27.6.1492 *
I. V. RAZLUTUSKII; L. S. SHLYAKHTENKO; YU. L. LYUBCHENKO, NUCLEIC ACIDS RESEARCH, vol. 15, no. 16, 1987, pages 6665 - 6676
METZKER, NATURE REVIEWS/GENETICS, vol. 11, January 2010 (2010-01-01), pages 31 - 46
PERBAL: "A Practical Guide to Molecular Cloning; the series, Methods In Enzymology", 1984, ACADEMIC PRESS, INC.
ROBERTS, CRITICAL REVIEWS IN BIOCHEMISTRY, November 1976 (1976-11-01), pages 123 - 164
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS, COLD SPRING HARBOR
SAMUEL O OYOLA ET AL: "Optimizing illumina next-generation sequencing library preparation for extremely at-biased genomes", BMC GENOMICS, BIOMED CENTRAL LTD, LONDON, UK, vol. 13, no. 1, 3 January 2012 (2012-01-03), pages 1, XP021117820, ISSN: 1471-2164, DOI: 10.1186/1471-2164-13-1 *
VOELKERDING ET AL., CLINICAL CHEMISTRY, vol. 55, no. 4, 2009, pages 641 - 658
WOOD W I ET AL: "BASE COMPOSITION-INDEPENDENT HYBRIDIZATION IN TETRAMETHYLAMMONIUM CHLORIDE: A METHOD FOR OLIGONUCLEOTIDE SCREENING OF HIGHLY COMPLEX GENE LIBRARIES", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, US, vol. 82, no. 3, 1 March 1985 (1985-03-01), pages 1585 - 1588, XP002027508, ISSN: 0027-8424, DOI: 10.1073/PNAS.82.6.1585 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12006544B2 (en) 2011-04-15 2024-06-11 The Johns Hopkins University Safe sequencing system
US11773440B2 (en) 2011-04-15 2023-10-03 The Johns Hopkins University Safe sequencing system
US11459611B2 (en) 2011-04-15 2022-10-04 The Johns Hopkins University Safe sequencing system
US11453913B2 (en) 2011-04-15 2022-09-27 The Johns Hopkins University Safe sequencing system
US11180803B2 (en) 2011-04-15 2021-11-23 The Johns Hopkins University Safe sequencing system
US11525163B2 (en) 2012-10-29 2022-12-13 The Johns Hopkins University Papanicolaou test for ovarian and endometrial cancers
US11286531B2 (en) 2015-08-11 2022-03-29 The Johns Hopkins University Assaying ovarian cyst fluid
US10982351B2 (en) 2016-12-23 2021-04-20 Grail, Inc. Methods for high efficiency library preparation using double-stranded adapters
US11274344B2 (en) 2017-03-30 2022-03-15 Grail, Inc. Enhanced ligation in sequencing library preparation
WO2018183918A1 (en) * 2017-03-30 2018-10-04 Grail, Inc. Enhanced ligation in sequencing library preparation
US20180291445A1 (en) * 2017-03-30 2018-10-11 Grail, Inc. Enhanced ligation in sequencing library preparation
US11118222B2 (en) 2017-03-31 2021-09-14 Grail, Inc. Higher target capture efficiency using probe extension
WO2020150656A1 (en) 2017-08-07 2020-07-23 The Johns Hopkins University Methods for assessing and treating cancer
WO2020023493A1 (en) * 2018-07-24 2020-01-30 Ripple Biosolutions Llc Methods and composition for targeted genomic analysis
EP3839046A4 (en) * 2018-08-07 2021-11-24 GeneMind Biosciences Company Limited Method for ligating nucleic acid fragments, method for constructing sequencing library, and use
US11680293B1 (en) 2022-04-21 2023-06-20 Paragon Genomics, Inc. Methods and compositions for amplifying DNA and generating DNA sequencing results from target-enriched DNA molecules
US12091715B2 (en) 2022-04-21 2024-09-17 Paragon Genomics, Inc. Methods and compositions for reducing base errors of massive parallel sequencing using triseq sequencing

Similar Documents

Publication Publication Date Title
WO2016170147A1 (en) Efficiency improving ligation methods
US20210047638A1 (en) Methods for Preparing a Next Generation Sequencing (NGS) Library from a Ribonucleic Acid (RNA) Sample and Compositions for Practicing the Same
EP3821011B1 (en) Transposome enabled dna/rna-sequencing (ted rna-seq)
EP3102691B1 (en) Method for controlled dna fragmentation
CN107109401B (en) Polynucleotide enrichment Using CRISPR-CAS System
EP3036359B1 (en) Next-generation sequencing libraries
EP2304054B1 (en) Isothermal nucleic acid amplification
EP3475449B1 (en) Uses of a cell-free nucleic acid standards
WO2018013558A1 (en) Compositions and methods for detecting nucleic acid regions
WO2016135300A1 (en) Efficiency improving methods for gene library generation
US20110195457A1 (en) Isothermal amplification of nucleic acid using primers comprising a randomized sequence and specific primers and uses thereof
US10648031B2 (en) Preparation of adapter-ligated amplicons
WO2012003374A2 (en) Targeted sequencing library preparation by genomic dna circularization
WO2007123742A2 (en) Methods and compositions for increasing the fidelity of multiplex nucleic acid assembly
CA3024546A1 (en) Method of amplifying circular dna
WO2022261281A1 (en) Target initiation and amplification of long dna with nuclease and replisome enzymes
US20240182881A1 (en) Altered cytidine deaminases and methods of use
WO2024069581A1 (en) Helicase-cytidine deaminase complexes and methods of use
EP2126133B1 (en) Multiply-primed amplification of circular nucleic acid sequences
Takahashi et al. Cell-free cloning using multiply-primed rolling circle amplification with modified RNA primers
Ali et al. A rapid method for creating gene mutations and fusions using PCR-ligation-PCR mutagenesis
WO2018073323A1 (en) Method for complete, uniform and specific amplification of ultra-low amounts of input dna
US20240076803A1 (en) Method for Library Preparation in Next Generation Sequencing by Enzymatic DNA Fragmentation
US20240124921A1 (en) Detection of analytes using targeted epigenetic assays, proximity-induced tagmentation, strand invasion, restriction, or ligation
US20230340449A1 (en) Thermostable ligase with reduced sequence bias

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16721730

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16721730

Country of ref document: EP

Kind code of ref document: A1