EP1224304A1 - Methods of manipulating and sequencing nucleic acid molecules using transposition and recombination - Google Patents

Methods of manipulating and sequencing nucleic acid molecules using transposition and recombination

Info

Publication number
EP1224304A1
EP1224304A1 EP00976633A EP00976633A EP1224304A1 EP 1224304 A1 EP1224304 A1 EP 1224304A1 EP 00976633 A EP00976633 A EP 00976633A EP 00976633 A EP00976633 A EP 00976633A EP 1224304 A1 EP1224304 A1 EP 1224304A1
Authority
EP
European Patent Office
Prior art keywords
nucleic acid
recombination
sequence
acid molecule
integration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP00976633A
Other languages
German (de)
French (fr)
Other versions
EP1224304A4 (en
Inventor
Michael A. Brasch
James L. Hartley
Gary F. Temple
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Life Technologies Corp
Original Assignee
Invitrogen Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Invitrogen Corp filed Critical Invitrogen Corp
Publication of EP1224304A1 publication Critical patent/EP1224304A1/en
Publication of EP1224304A4 publication Critical patent/EP1224304A4/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/66General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease

Definitions

  • the present invention relates generally to recombinant DNA technology. More specifically, the present invention relates generally to compositions, kits and methods for use in the construction and manipulation of nucleic acid molecules.
  • the methods of the present invention involve the use of in vitro or in vivo integration and recombination events to construct and/or select desired nucleic acid molecules which may further be manipulated by any number of molecular biology techniques, including sequencing, amplification and mutagenesis.
  • Site-specific recombinases are proteins that are present in many organisms (e.g. viruses and bacteria) and have been characterized as having both endonuclease and ligase properties. These recombinases (along with associated proteins in some cases) recognize specific sequences of bases in DNA and exchange the DNA segments flanking those segments. The recombinases and associated proteins are collectively referred to as "recombination proteins" (see, e.g., Landy, A., Current Opinion in Biotechnology 3:699-707 (1993)).
  • Transposons are mobile genetic elements. Transposons are structurally vanable, being descnbed as simple or compound, but typically encode a transposition catalyzing enzyme, termed a transposase, flanked by DNA sequences organized in inverted onentations. For a more thorough discussion of the charactenstics of transposons, one may consult Mobile Genetic Elements, D.
  • Transposons have been used to insert DNA into target DNA sequences. As a general rule, the insertion of transposons into target DNA is a random event.
  • Transposon Tn7 can integrate itself into a specific site in the E. coli genome as one part of its life cycle (Stellwagen, A.E., and Craig, N L. Trends in Biochemical Sciences 23, 486- 490, 1998 specifically incorporated herein by reference). This site specific insertion has been used in vivo to manipulate the baculovirus genome (Lucklow et al. (J. Virol 67:4566-4579 (1993) specifically incorporated herein by reference). The site specificity of Tn7 is atypical of transposable elements whose hallmark is movement to random positions in acceptor DNA molecules.
  • transposition will be used to refer to random or quasi-random movement, unless otherwise specified, whereas recombination will be used to refer to site specific recombination events.
  • site specific insertion of Tn7 into the ⁇ ttTn 7 site would be referred to as a recombination event while the random insertion of Tn7 would be referred to as a transposition event.
  • the intramolecular transposition reaction was favored and was successfully used to generate a set of nested deletions in the target DNA.
  • Devine, etal disclose the construction of artificial transposons for the insertion of DNA segments into recipient DNA molecules in vitro.
  • the system makes use of the insertion-catalyzing enzyme of yeast TY1 virus-like particles as a source of transposase activity.
  • the DNA segment of interest is cloned, using standard methods, between the ends of the transposon- like element TY1.
  • the resulting element integrates randomly into a second target DNA molecule.
  • a key feature of the recombination reactions mediated by the above-noted recombination proteins are recognition sequences, often termed "recombination sites," on the DNA molecules participating in the recombination reactions. These recombination sites are discrete sections or segments of DNA on the participating nucleic acid molecules that are recognized and bound by the recombination proteins during recombination.
  • the recombination site for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech. 5:521-527 (1994).
  • recognition sequences include the attB, attP, attL, and ⁇ ttR sequences which are recognized by the recombination protein 1 Int.
  • ⁇ ttB is an approximately 25 base pair sequence containing two 9 base pair core-type Int binding sites and a 7 base pair overlap region
  • ⁇ ttP is an approximately 240 base pair sequence containing core-type Int binding sites and arm-type Int binding sites as well as sites for auxiliary proteins integration host factor (IHF), FIS and excisionase (Xis). See Landy, Curr. Opin. Biotech. 3:699-707 (1993).
  • This method uses the DNA-synthesizing activity of DNA polymerases which, when combined with mixtures of reaction-terminating dideoxynucleoside triphosphates (Sanger, F., et al., Proc. Natl. Acad. Sci. USA 74:5463-5467, 1977) and a short primer (either of which may be detectably labeled), gives rise to a series of newly synthesized DNA fragments specifically terminated at one of the four dideoxy bases. These fragments are then resolved by gel electrophoresis and the sequence determined as described for Maxam and Gilbert sequencing above. By carrying out four separate reactions (one with each ddNTP), the sequences of even fairly complex DNA molecules may rapidly be determined (Sanger, F., et al, Nature 265:678-695, 1977; Barnes, W., Meth.
  • Amplification-based methods of nucleic acid sequencing utilize the thermostable polymerases and temperature cycling used in PCR applications in combination with a single primer and ddNTPs resulting in the synthesis of multiple dideoxy-terminated oligonucleotides from each template in contrast to the single oligonucleotide produced in standard Sanger sequencing.
  • the resultant linear DNA molecule is then incubated with an exonuclease that removes bases from the end of the insert.
  • an exonuclease that removes bases from the end of the insert.
  • the number of bases removed from the insert can be vaned, resulting in a senes of DNAs containing progressively less of the insert.
  • a collection of clones can be isolated having new sequence adjacent to the pnming site in the vector thus permitting the entire insert to be sequenced using a pnmer that hybndizes to the vector sequence adjacent to the site of digestion.
  • transposons have been used to insert small DNA molecules of known sequence into larger DNA molecules of unknown sequence.
  • the known sequence can be used as the a pnmer recognition site and the DNA sequence of the larger DNA molecule adjacent to the inserted transposon can be determined using standard sequencing methods.
  • Strathmann, et al (Proc. Natl. Acad. Sci USA, 88: 1247-1250, 1990) descnbe one such system utilizing an in vivo insertion of gd transposon into target DNA
  • the DNA of interest is cloned into a "mimplasmid" to bias the insertion of the transposon into the target DNA rather than the vector DNA
  • DNA sequence of the target is determined using pnmers that hybridize to sequence in the PART
  • the present invention generally concerns nucleic acid molecules (DNA or RNA) compnsing at least one integration sequence and at least one recombination site, wherein the recombination site(s) may be located within and/or outside (e.g. adjacent to) the integration sequences.
  • integration sequences may include any nucleic acid molecules which, through recombination or integration, becomes a part of the nucleic acid molecule of interest. Examples of integration sequences include, but are not limited to, transposons, insertion sequences, integrating viruses, homing introns, or other integrating elements, or various combinations thereof.
  • the integrating sequences of the present invention may be insertion sequences or transposons or derivatives thereof.
  • At least two recombination sites are contained in the nucleic acid molecule outside the integration sequence and preferably flanking both sides of the integration sequence. In another aspect, at least two recombination sites (which may be the same or different) are contained within the integration sequence.
  • the present invention specifically provides for nucleic acid molecules (preferably a vector) comprising a target nucleic acid sequence flanked by recombination sites and at least one integration sequence inserted into the target sequence.
  • the recombination site(s), in accordance with the invention, may be used to exchange sequences with the molecule of interest, delete sequences from the molecule of interest, incorporate sequences into the molecule of interest, or otherwise identify, manipulate, analyze and/or select the molecule of interest.
  • various strategies utilizing homologous recombination can provide an alternative to transposons for integrating DNA segments of interest into a target sequence. These can be accomplished in vivo or in vitro. Yu et al (Proc Natl Acad Sci U S A 2000 May 23;97(l l):5978-83) have shown that
  • DNA segments containing homology to a target seqeunce can be efficiently integrated into a predetermined DNA sequence.
  • Such approaches can be used to integrate recombination sites, selectable markers, functional elements into a defined locus of a target sequence.
  • in vitro heteroduplex formation and repair reactions have been used for inserting genes and other DNA segments into target sequences (Volkov AA et al., Nucl. Acids Res. 1999 Sep 15;27(18):el8).
  • Oligonucleotides defining complete or partial homology flanking a recombination site can thus be used to generate populations of target sequences containing directed, partially directed or random insertions of recombination sites.
  • Recombination sites for use in the invention may be any recognition sequence on a nucleic acid molecule which participates in a recombination reaction by recombination proteins. In those embodiments of the present invention utilizing more than one recombination site, such recombination sites may be the same or different and may recombine with each other or may not recombine or not substantially recombine with each other. Recombination sites contemplated by the invention also include mutants, derivatives or variants of wild-type or naturally occurring recombination sites.
  • Preferred recombination site modifications include those that enhance recombination, such enhancement selected from the group consisting of substantially (i) favoring integrative recombination; (ii) favoring excisive recombination; (iii) relieving the requirement for host factors; (iv) increasing the efficiency of co-integrate or product formation; and (v) increasing the specificity of co-integrate and/or product formation.
  • Preferred modifications include those that enhance recombination specificity, those that permit the recombination site or portion thereof (or a nucleic acid molecule comprising the recombination site or portion thereof) to act as a primer site for amplification (e.g.
  • Preferred recombination sites used in accordance with the invention include att sites, FRT sites, and lox sites, or mutants, derivatives, fragments, portions and variants thereof (or combinations thereof)- Recombination sites contemplated by the invention also include portions of such recombination sites.
  • the integration sequences of the invention may comprise one or a number of elements and/or functional sequences and/or sites (or combinations thereof) including one or more sequences which are complementary to one or more sequencing or amplification pnmers of interest (e.g., sequencing primer sites or amplification primer sites), one or more selectable markers (e.g , toxic genes, antibiotic resistance genes, etc.), one or more transcnption or translation sites or signals, one or more transcription or translation termination sites, one or more ongins of replication, one or more recombination sites (or portions thereof), etc.
  • the integration sequence may compnse one or more recombination sites (or portions thereof) and one or more selectable markers.
  • integration sequences may be used to incorporate one or more recombination sites (or portions thereof) or other sites or sequences of interest into any nucleic acid molecule.
  • Integration sequences may be introduced in accordance with the invention by in vivo or in vitro installation.
  • the methods of the invention may utilize one or more integration sequences which may be the same or different. The use of different integration sequences with different functional sites or signals is thus contemplated by the invention.
  • the present invention also provides a method of inserting an integration sequence into a target nucleic acid sequence compnsing incubating a target sequence of interest flanked by recombination sites with at least one integration sequence under conditions sufficient to cause at least one of said integration sequences to integrate or insert in said target sequence and optionally selecting for said target sequence containing said at least one integration sequences.
  • target sequences are preferably contained by a vector and preferred integration sequences are one or more transposons. Selection of target sequences containing at least one integration sequence may preferably be accomplished by the use of the recombination sites which flank the target sequence of interest
  • recombinational cloning is used to transfer and select target sequences containing integration sequences.
  • such a method preferably compnses:
  • the first and/or second nucleic acid molecules are vectors.
  • the selection of said second nucleic acid molecule can be accomplished by using one or more selectable markers contained by the integration sequence and/or the target sequence.
  • One or more selectable markers contained by the second nucleic acid molecule may also be utilized in the selection scheme according to the invention.
  • negative selection may also be used to select against second nucleic acid molecules not containing the target sequence of interest.
  • recombinational cloning is used to transfer target sequences containing at least one integration sequence into a vector.
  • selectable markers contained by the vector and by the integration sequence are used in combination to select the desired product vector containing the target sequence/integration sequence.
  • undesired products for example, vectors containing the target sequence without an inserted integration sequence are selected against.
  • the selected target sequences containing integration sequences are used for further manipulation of the target sequence.
  • the invention allows random insertions of desired sequences by random integration of integration sequences which may be used to manipulate or analyze the target sequence.
  • random insertion in the target sequence of sequencing primer sites contained by the integration sequence allows sequencing of various portions or all of the target sequence.
  • portions of sequence information from the target can be used to determine the entire nucleic acid sequence of the target by analyzing and comparing the sequence overlap of such partial sequences.
  • random insertion in the target sequence of amplification primer sites contained by the integration sequence allows amplification of portions or all of the target sequence
  • rando insertion of transcriptional or regulatory sequences contained by the integration sequence allows expression of proteins or polypeptides from vanous portions or all of the target sequence
  • random insertion of genes or portions of genes allows the creation of a population of gene fusions for the target sequence of interest.
  • random insertion of recombination sites (or portions thereof) contained by the integration sequence allows creation of a population of deletion mutants of the target sequence of interest.
  • the deleted portion of the target sequence may be cloned
  • the present invention relates to a method of manipulating or analyzing (e.g., sequencing, amplification, deletion, mutation, expression analysis etc ) all or a portion of the target nucleic acid molecule compnsing:
  • sequencing methods of the invention may compnse: (a) hybndizing a pnmer to a first nucleic acid molecule; (b) contacting said molecule with one or more nucleotides and one or more terminating agents;
  • step (c) incubating the mixture of step (b) under conditions sufficient to synthesize a population of nucleic acid molecules complementary to all or a portion of said first nucleic acid molecule, wherein said synthesized molecules are shorter in length than said first molecule and said synthesized molecules comprise a terminating agent at their 3' termini;
  • the present invention also provides for a method of making deletions in a nucleic acid molecule of interest comprising contacting the nucleic acid molecule which comprises at least a first recombination site with an integration sequence which comprises at least a second recombination site under conditions such that at least one of said integration sequences is inserted into said nucleic acid molecule, and causing at least said first and said second recombination sites to recombine, thereby resulting in a deletion of at least a portion of said nucleic acid molecule.
  • the deleted portion of the target nucleic acid molecule may be cloned.
  • a new recombination site will be created at the point of deletion.
  • the nucleic acid molecule of interest may be a vector which comprises a target sequence.
  • the target sequence and/or vector sequence may comprise said first recombination site and the integration sequence, in some embodiments a transposon, comprises the second recombination site.
  • the target sequence may first be inserted into a vector containing at least a first recombination site.
  • the first and second recombination sites may be incorporated in the target sequence and/or vector by one or more integration sequences.
  • a population of deletion mutants may be made by allowing recombination to occur between recombination sites.
  • Other deletions of different sizes and at different positions may be accomplished by including additional recombination sites at different positions within the target sequence and/or vector of interest.
  • a third, fourth, and/or fifth recombination site may be inserted at different positions within the target or vector sequence (for example by additional integration sequences containing such different recombination sites). Causing recombination between such sites allows generation of further deletions of the target or vector sequence.
  • deletions may be done in a target or vector sequence sequentially by first causing recombination between the first and second recombination sites to create a first deletion and a new recombination site (e.g., a third recombination site) at the point of deletion, inserting a fourth recombination site in the target or vector sequence (preferably by insertion of an integration sequence containing one or more recombination sites), and causing recombination between said third and fourth recombination sites to create a second deletion and creating a new recombination site (e.g., a fifth recombination site) at the point of deletion.
  • This process may be repeated any number of times to generate any number of deletions in the target and/or vector sequence of interest.
  • the present invention provides a method for replacing or exchanging sequences in a nucleic acid molecule of interest.
  • the method comprises contacting the nucleic acid molecule which comprises at least a first recombination site with an integration sequence which comprises at least a second recombination site under conditions such that at least one of said integration sequences is inserted into said nucleic acid molecule, and causing replacement of one or more sequences in said molecule which are flanked by said first and said second recombination sites with at least a second nucleic acid molecule flanked by recombination sites.
  • the target sequence and the second nucleic acid molecule encode peptides, polypeptides or proteins and the recombination event places the encoded peptides, polypeptides or proteins in the same reading frame.
  • Such second molecule may contain one or more genes or portions of genes.
  • the nucleic acid molecule of interest for making such replacement is a vector which compnses a target sequence.
  • the target sequence and/or vector sequence compnses said first recombination site and the integration sequence (preferably a transposon) compnses the second recombination site.
  • the target sequence may first be inserted into a vector containing at least a first recombination site.
  • the first and second recombination sites may be incorporated in the target sequence and/or vector by one or more integration sequences. After insertion of the integration sequence into one or more positions within the target sequence, a population of fusions may be made by allowing a molecule flanked by said first and second recombination sites to be replaced with a population of second nucleic acid molecules flanked by recombination sites.
  • one or more recombination sites may be added to nucleic acid molecules of interest by a method which compnses.
  • the one or more nucleic acid molecules are contacted with the one or more integration sequences in vitro.
  • the recombination sites may be used to transfer nucleic acid molecules which are flanked by such recombination sites
  • random insertion of integration sequences containing recombination sites or portions thereof allows inco oration of a number of recombination sites (or portions thereof) into the molecule of interest.
  • Use of such recombination sites, through recombinational cloning, provides a method for transferring portions of the molecule which are flanked by recombination sites into one or more vectors. For example, one or a number of molecules of interest flanked by a first and second recombination site
  • a population of molecules may be produced by transferring a number of molecules of interest into one or more vectors.
  • the invention provides for the construction of a library which may be representative of all or a portion of the starting genetic material.
  • a library may be prepared from cDNA, genomic or chromosomal genetic material using the invention.
  • the recombination sites which are incorporated in the nucleic acid molecules of interest may be recombined directly without the need to transfer to a separate nucleic acid molecule or vector.
  • the molecule flanked by recombination sites can circularize upon recombination of the recombination sites.
  • the circular molecule contains a new recombination site at the point of recircularization.
  • the circularized molecule contains at least one origin of replication so that the molecule may replicate autonomously in a host cell or function as a vector is a host cell.
  • the circularized molecule may also contain one or more selectable markers.
  • one or more origins of replication and/or selectable markers are provided by one or more integration sequences.
  • the molecule upon recombination, the molecule preferably will comprise at least one recombination site, at least one selectable marker, a nucleic acid molecule of interest and an origin of replication.
  • the invention provides a method by which recombination sites may be used to create one or a population of vectors comprising portions of the original nucleic acid molecule of interest.
  • the invention allows for efficient preparation of libraries of starting genetic material such as cDNA, genomic or chromosomal DNA.
  • the invention provides a method by which a linear nucleic acid molecule may be circularized by recombining at least a first and second recombination site within the molecule to be circularized.
  • the first and second recombination sites are located at or near the termini of the linear molecule.
  • the recombination sites are incorporated at or near the termini of the linear molecule by ligation of adapters (which comprise at least one recombination site or portion thereof) to one or both termini of the molecule and/or by amplifying the linear molecule with primers which comprise a recombination site or a portion thereof.
  • DNA segments comprising a covalently linked topoisomerase can be used to join linkers (for example, which comprise at least one recombination site or a portion thereof) or other DNA segments to the ends of other linear DNA segments (Shuman, S., J. Biol. Chem. 269:32678 (1994)).
  • a combination of addition of an adapter and amplification with a primer may be used to incorporate recombination sites into the termini of the molecule.
  • a linear molecule can be created which contains a first recombination site at or near the first terminus of the linear molecule and a second recombination site at or near the second terminus of the linear molecule.
  • recombination of these recombination sites provides a circular molecule.
  • the circular molecule contains a new recombination site at the point of recircularization.
  • the circular molecule comprises an ongin of replication and/or at least one selectable marker
  • one or more integration sequences which contain one or more functional sites such as ongins of replication, selectable markers, transcnptional signals, etc. may be integrated into the linear or circulanzed molecule to provide functional sequences to such molecule.
  • the integration sequences (which are preferably transposons) incorporate an on gin of replication and optionally at least one selectable marker into such linear or circular molecules.
  • kits for carrying out the methods of the invention may compnse one or more nucleic acid molecules of the invention such as integration sequences and/or vectors of the invention
  • Such kits may optionally compnse one or more additional components selected from the group consisting of one or more nucleotides, one or more polymerases and or reverse transcnptases, one or more suitable buffers, one or more pnmers and one or more terminating agents (such as one or more di deoxynucleoti des)
  • compositions, methods and kits of the invention are preferably prepared and earned out using a phage-lambda site-specific recombination system and most preferably with the GATEWAYTM recombinational cloning technology available from Invitrogen Corporation, Life Technologies Division (Rockville, MD).
  • Figure 1 is a schematic representation of a recombination reaction of the present invention
  • Figure 2 is a schematic representation of the insertion of a transposon into a target nucleic acid molecule and/or a vector nucleic acid molecule.
  • Figure 3 is a schematic representation of how the present invention can be used to select for target nucleic acid molecules comprising an insertion sequence by performing a recombinational cloning step after performing a transposition reaction.
  • Figure 4A is a schematic representation of the cloning of genomic DNA using transposons containing recombination site(s).
  • Figure 4B is a schematic representation of the cloning of genomic DNA using transposons containing recombination sites that are oriented so as to allow productive and non-productive recombination reactions.
  • Figure 5 is a schematic representation of a transposon designed to transfer a selectable marker by recombination.
  • Figure 6 is a schematic representation of the cloning of genomic DNA using a transposon comprising a toxic gene.
  • Figure 7 is a schematic representation of the cloning of genomic DNA using a transposon comprising an origin of replication and a transposon containing a selectable marker.
  • Figure 8A is a schematic representation of the construction of subclones using the compositions and methods of the present invention.
  • Figure 8B is a schematic representation of the replacement of a portion of a target sequence using the compositions and methods of the present invention.
  • Figure 9 is a schematic representation of the construction of subclones using an insertion sequence containing an origin of replication according to the methods of the present invention.
  • Figure 10 is a schematic representation of the construction of gene targeting vectors from PCR products using the compositions and methods of the present invention.
  • Figure 11 is a schematic representation of the construction of deletions in a target DNA molecule using the compositions and methods of the present invention.
  • Figure 12 is a schematic representation of the cloning of a deleted portion of a target molecule using the compositions and methods of the present invention.
  • Figure 13 is a schematic representation of the generation of populations of nucleic acid molecules attached to a solid substrate using the compositions and methods of the present invention.
  • recombination sites are indicated by RS and the recombination sites are distinguished by numerical subscripts, selectable markers are indicated by SM and a numerical subscript.
  • the reaction product of two compatible recombination sites is designated RS and a subscript indicating the two sites which were recombined.
  • amplification is any in vitro method for increasing a number of copies of a nucleotide sequence with the use of one or more polypeptides having polymerase activity (e.g., one or more nucleic acid polymerases or one or more reverse transcriptases).
  • Nucleic acid amplification results in the incorporation of nucleotides into a DNA and/or RNA molecule or primer thereby forming a new nucleic acid molecule complementary to a template.
  • the formed nucleic acid molecule and its template can be used as templates to synthesize additional nucleic acid molecules.
  • one amplification reaction may consist of many rounds of nucleic acid replication.
  • DNA amplification reactions include, for example, polymerase chain reaction (PCR).
  • Gene As used herein, a gene is a nucleic acid sequence that contains information necessary for expression of a polypeptide or protein. It includes the promoter and the structural gene as well as other sequences involved in expression of the protein.
  • a host is any prokaryotic or eukaryotic organism that is a recipient of a replicable expression vector, cloning vector or any nucleic acid molecule.
  • the nucleic acid molecule may contain, but is not limited to, a structural gene, a transcriptional regulatory sequence (such as a promoter, enhancer, repressor, and the like) and/or an origin of replication (ori).
  • a transcriptional regulatory sequence such as a promoter, enhancer, repressor, and the like
  • ori origin of replication
  • the terms "host,” “host cell,” “recombinant host” and “recombinant host cell” may be used interchangeably. For examples of such hosts, see Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1982).
  • hybridization and hybridizing refer to base pairing of two complementary single-stranded nucleic acid molecules (RNA and/or DNA) to give a double stranded molecule.
  • RNA and/or DNA complementary single-stranded nucleic acid molecules
  • hybridization and hybridizing refer to base pairing of two complementary single-stranded nucleic acid molecules (RNA and/or DNA) to give a double stranded molecule.
  • two nucleic acid molecules may be hybridized, although the base pairing is not completely complementary. Accordingly, mismatched bases do not prevent hybridization of two nucleic acid molecules provided that appropriate conditions, well known in the art, are used.
  • hybridization is said to be under "stringent conditions.”
  • stringent conditions as used herein is meant overnight incubation at 42 °C in a solution comprising: 50% formamide, 5x SSC ( 150 mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1 x SSC at about 65 °C. .
  • Incorporating As used herein, incorporating means becoming a part of a nucleic acid (e.g., DNA) molecule or primer.
  • an insert is a desired nucleic acid segment that is a part of a larger nucleic acid molecule.
  • An insert may be a target nucleic acid molecule in accordance with the invention.
  • an insert donor is one of the two parental nucleic acid molecules (e.g. RNA or DNA) of the present invention which carries the Insert.
  • the Insert Donor molecule comprises the Insert flanked on both sides with recombination sites.
  • the Insert Donor can be linear or circular.
  • the Insert Donor is a circular DNA molecule and further comprises a cloning vector sequence outside of the recombination signals (see Figure 1). When a population of Inserts or population of nucleic acid segments are used to make the Insert Donor, a population of Insert Donors result and may be used in accordance with the invention.
  • an integration sequence is any nucleotide sequence that is capable of inserting randomly into a target nucleic acid molecule. Integration sequences are also known in the art as mobile genetic elements. Any integration sequence known to those of ordinary skill in the art may be used to practice the present invention, including but not limited to transposons (transposable elements), integrating viruses (e.g., retroviruses), IS elements, retrotransposons, conjugative transposons, P elements of Drosophila, bacterial virulence factors, or mobile genetic elements for eukaryotic organisms such as mariner, Tc 1 and Sleeping Beauty. Other mobile genetic elements known to those skilled in the art may also be used in accordance with the present invention.
  • a library is a collection of nucleic acid molecules (circular or linear).
  • a library may comprise a plurality (i.e., two or more) of nucleic acid molecules, which may or may not be from a common source organism, organ, tissue, or cell.
  • a library is representative of all or a portion or a significant portion of the nucleic acid content of an organism (a "genomic" library), or a set of nucleic acid molecules representative of all or a portion or a significant portion of the expressed nucleic acid molecules (a cDNA library) in a cell, tissue, organ or organism.
  • a library may include a target DNA molecule containing insertions at various places within the target.
  • a library may also comprise random sequences made by de novo synthesis, mutagenesis of one or more sequences and the like. Such libraries may or may not be contained in one or more vectors.
  • nucleotide As used herein, a nucleotide is a base-sugar-phosphate combination. Nucleotides are monomeric units of a nucleic acid molecule (DNA and RNA). The term nucleotide includes ribonucleoside triphosphates ATP,
  • UTP, CTG, GTP and deoxyribonucleoside triphosphates such as dATP, dCTP, dlTP, dUTP, dGTP, dTTP, or derivatives thereof.
  • Such derivatives include, for example, [ ⁇ SjdATP, 7-deaza-dGTP and 7-deaza-dATP.
  • nucleotide as used herein also refers to dideoxyribonucleoside triphosphates (ddNTPs) and their derivatives. Illustrated examples of dideoxyribonucleoside triphosphates include, but are not limited to, ddATP, ddCTP, ddGTP, ddlTP, and ddTTP.
  • a "nucleotide” may be unlabeled or detectably labeled by well known techniques.
  • Detectable labels include, for example, radioactive isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels and enzyme labels.
  • Oligonucleotide As used herein, an oligonucleotide is a synthetic or natural molecule comprising a covalently linked sequence of nucleotides which are joined by a phosphodiester bond between the 3' position of the pentose of one nucleotide and the 5' position of the pentose of the adjacent nucleotide.
  • Primer As used herein, a primer is a single stranded or double stranded oligonucleotide that is extended by covalent bonding of nucleotide monomers during amplification or polymerization of a nucleic acid molecule (e.g. a DNA molecule).
  • the primer may be a sequencing primer (for example, a universal sequencing primer).
  • the primer may comprise a recombination site or portion thereof.
  • Product As used herein, a product is one the desired daughter molecules comprising the A and D sequences which is produced after the second recombination event during the recombinational cloning process (see Figure 1). The Product contains the nucleic acid which was to be cloned or subcloned. In accordance with the invention, when a population of Insert Donors are used, the resulting population of Product molecules will contain all or a portion of the population of Inserts of the Insert Donors and preferably will contain a representative population of the original molecules of the Insert Donors.
  • a promoter is an example of a transcriptional regulatory sequence, and is specifically a DNA sequence generally described as the 5'-region of a gene located proximal to the start codon. The transcription of an adjacent DNA segment is initiated at the promoter region. A repressible promoter's rate of transcription decreases in response to a repressing agent. An inducible promoter's rate of transcription increases in response to an inducing agent. A constitutive promoter's rate of transcription is not specifically regulated, though it can vary under the influence of general metabolic conditions.
  • a recognition sequence is a particular sequence to which a protein, chemical compound, DNA, or RNA molecule (e.g., restriction endonuclease, a modification methylase, or a recombinase) recognizes and binds.
  • a recognition sequence will usually refer to a recombination site.
  • the recognition sequence for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 of Sauer, B., Current Opinion in Biotechnology 5:521-527 (1994).
  • recognition sequences are the ⁇ ttB, ⁇ ttP, ttL, and ⁇ ttR sequences which are recognized by the recombinase enzyme 1 Integrase.
  • ⁇ ttB is an approximately 25 base pair sequence containing two 9 base pair core-type Int binding sites and a 7 base pair overlap region.
  • ⁇ ttP is an approximately 240 base pair sequence containing core- type Int binding sites and arm-type Int binding sites as well as sites for auxiliary proteins integration host factor (EHF), FIS and excisionase (Xis). See Landy, Current Opinion in Biotechnology 3:699-707 (1993). Such sites may also be engineered according to the present invention to enhance production of products in the methods of the invention.
  • such engineered sites lack the PI or HI domains to make the recombination reactions irreversible (e.g., ⁇ ttR or ⁇ ttP)
  • such sites may be designated ⁇ ttR' or ⁇ ttP' to show that the domains of these sites have been modified in some way.
  • Recombination proteins include excisive or integrative proteins, enzymes, co-factors or associated proteins that are involved in recombination reactions involving one or more recombination sites, which may be wild-type proteins (See Landy, Current Opinion in Biotechnology 3:699-707 (1993)), or mutants, derivatives, fragments, and variants thereof.
  • Recombination site is a recognition sequence on a nucleic acid molecule participating in an integration/recombination reaction by recombination proteins. Recombination sites are discrete sections or segments of nucleic acid on the participating nucleic acid molecules that are recognized and bound by a site-specific recombination protein during the initial stages of integration or recombination.
  • the recombination site for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech.
  • recognition sequences include the ⁇ ttB, ⁇ ttP, ⁇ ttL, and ⁇ ttR sequences described herein, and mutants, fragments, variants and derivatives thereof, which are recognized by the recombination protein 1 Int and by the auxiliary proteins integration host factor (IHF), FIS and excisionase (Xis). See Landy, Curr. Opin. Biotech. 3:699-707 (1993).
  • Recombinational Cloning is a method, such as that described in U.S. Patent No. 5,888,732 (the contents of which are fully incorporated herein by reference), whereby segments of nucleic acid molecules or populations of such molecules are exchanged, inserted, replaced, substituted or modified, in vitro or in vivo.
  • cloning method is an in vitro method.
  • Repression cassette is a nucleic acid segment that contains a repressor or a Selectable marker present in the subcloning vector.
  • selectable marker is a nucleic acid segment that allows one to select for or against a molecule (e.g., a replicon) or a cell that contains it, often under particular conditions. These markers can encode an activity, such as, but not limited to, production of RNA. peptide, or protein, or can provide a binding site for RNA, peptides, proteins, inorganic and organic compounds or compositions and the like. Examples of selectable markers include but are not limited to: (1) DNA segments that encode products which provide resistance against otherwise toxic compounds (e.g., antibiotics); (2) DNA segments that encode products which are otherwise lacking in the recipient cell
  • DNA segments that encode products which suppress the activity of a gene product e.g., tRNA genes, auxotrophic markers
  • DNA segments that encode products which can be readily identified e.g., phenotypic markers such as b- galactosidase, green fluorescent protein (GFP), and cell surface proteins
  • DNA segments that bind products which are otherwise detrimental to cell survival and/or function e.g., antisense oligonucleotides
  • DNA segments that bind products that modify a substrate e.g. restriction endonucleases
  • DNA segments that can be used to isolate or identify a desired molecule e.g.
  • selection scheme is any method which allows selection, enrichment, or identification of a desired product(s) or molecule(s) from a mixture. In some preferred embodiments, the selection scheme results in selection of or enrichment for only one or more desired products or molecules. As defined herein, selecting for a DNA molecule includes
  • the selection schemes may take one of three forms, which will be discussed in terms of
  • Figure 1 The first, exemplified herein with a selectable marker and a repressor therefor, selects for molecules having segment D and lacking segment C.
  • the second selects against molecules having segment C and for molecules having segment D.
  • Possible embodiments of the second form would have a DNA segment carrying a gene toxic to cells into which the in vitro reaction products are to be introduced.
  • a toxic gene can be a DNA that is expressed as a toxic gene product (a toxic protein or RNA), or can be toxic in and of itself.
  • toxic gene is understood to carry its classical definition of "heritable trait”.
  • toxic gene products include, but are not limited to, restriction endonucleases (e.g., Dpn ⁇ ), thymidine kinase (TK) genes, apoptosis-related genes (e.g.
  • ASK1 or members of the bcl-2/ced-9 family retroviral genes including those of the human immunodeficiency virus (HIV), defensins such as NP-1, inverted repeats or paired palindromic DNA sequences, bacteriophage lytic genes such as those from fX174 or bacteriophage T4; antibiotic sensitivity genes such as rpsL, antimicrobial sensitivity genes such as pheS, plasmid killer genes, eukaryotic transcriptional vector genes that produce a gene product toxic to host cells, such as GATA-1, and genes that kill hosts in the absence of a suppressing function, e.g., kicB, ccdB, fX174 E (Liu, Q. et al, Curr. Biol. 8: 1300-1309 (1998)), and other genes that negatively affect replicon stability and/or replication.
  • a toxic gene can alternatively be selectable in vitro, e.g., a restriction site.
  • segment D carries a selectable marker.
  • the toxic gene would eliminate transformants harboring the Vector Donor, Cointegrate, and Byproduct molecules, while the selectable marker can be used to select for cells containing the Product and against cells harboring only the Insert Donor.
  • the third form selects for cells that have both segments A and D in cis on the same molecule, but not for cells that have both segments in trans on different molecules.
  • the recombinational event can link a promoter with a structural nucleic acid molecule (e.g., a gene), can link two fragments of a structural nucleic acid molecule, or can link nucleic acid molecules that encode a heterodimeric gene product needed for survival, or can link portions of a replicon.
  • a structural nucleic acid molecule e.g., a gene
  • Site-specific recombinase is a type of recombinase which typically has at least the following four activities (or combinations thereof): (1) recognition of one or two specific nucleic acid sequences; (2) cleavage of said sequence or sequences; (3) topoisomerase activity involved in strand exchange; and (4) ligase activity to reseal the cleaved strands of nucleic acid.
  • Conservative site-specific recombination is distinguished from homologous recombination and transposition by a high degree of specificity for both partners.
  • the strand exchange mechanism involves the cleavage and rejoining of specific DNA sequences in the absence of DNA synthesis (Landy, A. (1989) Ann. Rev. Biochem. 58:913-949).
  • Structural gene As used herein, a structural gene refers to a nucleic acid sequence that is transcribed into messenger RNA that is then translated into a sequence of amino acids characteristic of a specific polypeptide.
  • a subcloning vector is a cloning vector comprising a circular or linear nucleic acid molecule which includes preferably an appropriate replicon.
  • the subcloning vector (segment D in Figure 1) can also contain functional and/or regulatory elements that are desired to be incorporated into the final product to act upon or with the cloned DNA Insert (segment A in Figure 1).
  • the subcloning vector can also contain a selectable marker.
  • Target nucleic acid molecule is a nucleic acid segment of interest (preferably DNA) which is to be acted upon using the present invention.
  • a template is a double stranded or single stranded nucleic acid molecule which is to be amplified, synthesized or sequenced.
  • denaturation of its strands to form a first and a second strand is preferably performed before these molecules may be amplified, synthesized or sequenced, or the double stranded molecule may be used directly as a template.
  • a primer complementary to at least a portion of the template is hybridized under appropriate conditions and one or more polypeptides having polymerase activity (e.g.
  • DNA polymerases and/or reverse transcriptases may then synthesize a molecule complementary to all or a portion of the template.
  • one or more transcnptional regulatory sequences e.g , one or more promoters
  • the newly synthesized molecule may be of equal or shorter length compared to the onginal template. Mismatch incorporation or strand slippage dunng the synthesis or extension of the newly synthesized molecule may result in one or a number of mismatched base pairs. Thus, the synthesized molecule need not be exactly complementary to the template.
  • a population of nucleic acid templates may be used dunng synthesis or amplification to produce a population of nucleic acid molecules typically representative of the onginal template population.
  • transcnptional regulatory sequence is a functional stretch of nucleotides contained on a nucleic acid molecule, in any configuration or geometry, that acts to regulate the transcnption of one or more structural genes into messenger RNA.
  • transcnptional regulatory sequences include, but are not limited to, promoters, enhancers, repressors, and the like. "Transcnption regulatory sequence"
  • transcnption sites and “transcnption signals” may be used interchangeably.
  • a vector is a nucleic acid molecule (preferably DNA) that provides a useful biological or biochemical property to an Insert.
  • a vector can have one or more restnction endonuclease recognition sites at which the sequences can be cut in a determinable fashion without loss of an essential biological function of the vector, and into which a nucleic acid fragment can be spliced in order to bnng about its replication and cloning.
  • Vectors can further pro ⁇ ide pnmer sites, e.g , for PCR, transcnptional and/or translational initiation and/or regulation sites, recombinational signals, rep cons, selectable markers, etc
  • methods of inserting a desired nucleic acid fragment which do not require the use of homologous recombination, transpositions or restriction enzymes (such as, but not limited to, UDG cloning of PCR fragments (U.S. Patent No. 5,334,575, entirely incorporated herein by reference), T: A cloning, and the like) can also be applied to clone a fragment into a cloning vector to be used according to the present invention.
  • the cloning vector can further contain one or more selectable markers suitable for use in the identification of cells transformed with the cloning vector.
  • a Vector Donor is one of the two parental nucleic acid molecules (e.g., RNA or DNA) of the present invention which carries the segments comprising the vector which is to become part of the desired Product.
  • the Vector Donor comprises a subcloning vector D (or it can be called the cloning vector if the Insert Donor does not already contain a cloning vector) and a segment C flanked by recombination sites (see Figure 1). Segments C and/or D can contain elements that contribute to selection for the desired Product daughter molecule, as described above for selection schemes.
  • the recombination signals can be the same or different, and can be acted upon by the same or different recombinases.
  • the Vector Donor can be linear or circular.
  • the present invention relates to the construction of nucleic acid molecules (RNA or DNA) by inserting at least one integration sequence (e.g., a transposon) into a target nucleic acid molecule and subsequently transferring the modified target nucleic acid molecule to a vector using recombinational cloning.
  • integration sequence e.g., a transposon
  • recombinational cloning allows efficient selection and identification of molecules (particularly vectors) containing the target sequence comprising all or a portion of the integration sequence.
  • sites or sequences of interest can be inserted within the target sequence which allows for further manipulation of the target nucleic acid molecule.
  • Integration sequences of the invention to be introduced into the target nucleic acid molecules may comprise any number or combinations of functional sequences such as primer sites (e.g., sequences for which a primer such as a sequencing primer or amplification primer may hybridize to initiate nucleic acid synthesis, amplification or sequencing), transcription or translation signals or regulatory sequences such as promoters, ribosomal binding sites, translation effecting sequences such as Kozak and Shine-Delgarno sequences, start codons, origins of replication, termination signals such as stop codons, recombination sites (or portions thereof), selectable markers, and genes or portions of genes to create protein fusion (e.g., N-terminal or carboxy terminal) such as GST, GUS, GFP, and combinations thereof.
  • the molecules may be manipulated in a variety of ways including sequencing or amplification of all or a portion of the target sequence
  • the present invention also relates to cloning nucleic acid molecules (e.g. , genomic DNA or cDNA) by inserting recombination site -containing integration sequences into the molecule(s) and performing recombinational cloning or causing recombination of the inserted recombination sites.
  • nucleic acid molecules e.g. , genomic DNA or cDNA
  • one or more integration sequences comprising at least one recombination site may be inserted within the molecule of interest to allow recombinational cloning or cloning of such molecules or portions thereof.
  • the integration sequences may also comprise other functional sequences of interest (such as primer sites, transcription and translation signals, termination signals, selectable markers, origins of replication, etc. noted above) to allow further manipulation of the molecule obtained by this method of the invention.
  • Recombination sites for use in the invention may be any recognition sequence which participates in a recombination reaction. Such recombination sites may be the same or different and may be wild-type or naturally occurring recombination sites or modified or mutant recombination sites.
  • recombination sites for use in the invention include, but are not limited to, phage- lambda recombination sites (such as ⁇ ttP, ⁇ ttB, ⁇ ttL, and ⁇ ttR and mutants or derivatives thereof) and recombination sites from other bacteriophage such as PI, phi80, P22, P2, 186, P4 and PI (including lox sites such as loxP and / ⁇ *P511). Corresponding recombination proteins for these systems may be used in accordance with the invention with the indicated recombination sites.
  • recombination sites and recombination proteins for use in the invention include the FLP/FRT system from Saccharomyces cerevisiae, the resolvase family (e.g., gd, Tn3 resolvase, Hin, Gin and Cin), and IS231 and other Bacillus thuringiensis transposable elements.
  • Preferred recombination proteins and mutant or modified recombination sites for use in the invention include those described in U.S. Patent No. 5,888,732, co-pending U.S. Application No. 09/438,358 (filed November 12, 1991) and co-pending U.S. Application No. 09/517,466 (filed March 2, 2000), as well as those associated with the GATEWAYTM Cloning Technology available from Invitrogen Corporation, Life Technologies Division (Rockville, MD).
  • the integration sequence may be a transposon (transposable element). Any transposon sequence known to those skilled in the art may be suitable for use in the present invention.
  • the transposons suitable for use in the present invention include, but are not limited to, TnJ family transposons, TnJ, TnA, gd, TnlOOO, Tn5, Tni 721, Tn7, Tn9, Tn70 and derivatives and mutants thereof.
  • the integration sequence may be an integrating virus.
  • the integrating virus may be a lambdoid phage.
  • Lambdoid phages are seen to include, but are not limited to, coliphages such as 1, 21, 434, f80 and HK022 as well as Salmonella phages such as P22.
  • the integrating virus may be a phage not related to 1, such as Mu-1, P2 and P4.
  • Other integrating viruses known to those skilled in the art may be used in the practice of the present invention.
  • the integration sequence may be an IS element such as ISI, IS2, IS4, IS5, and derivatives and mutants thereof.
  • the integration sequence may be a retrovirus, retrotransposons, conjugative transposons, P elements of Drosophila, bacterial virulence factors, or mobile genetic elements for eukaryotic organisms such as mariner, Tel and Sleeping Beauty. Other mobile genetic elements known to those skilled in the art may also be used in accordance with the present invention.
  • An origin of replication is a nucleotide sequence in a nucleic acid molecule at which replication of the nucleic acid molecule is initiated.
  • the phrase origin of replication is seen to include the definable origin of replication as well as one or more adjoining controlling elements necessary for the replication of the nucleic acid molecule.
  • This combination of definable starting point of DNA synthesis during replication and the adjacent controlling element or elements may also be termed a replicon.
  • Replicons suitable for use in the present invention include, but are not limited to, the pMBl replicon, the pl5A replicon, the pSClOl replicon, the ColEl replicon, the R6K replicon, the
  • the replicons suitable for the practice of the present invention are not limited to those replicons functional in E. coli.
  • Replicons functional in other organisms include, but are not limited to, the PS 10 replicon, the pCTTI replicon, the pWV02 replicon, the pF3A replicon and the plP404 replicon.
  • Replicons suitable for use in eukaryotic cells including but not limited to insect cells, yeast cells, mammalian cells, amphibian cells or any of the host cells described below may be used in conjunction with the present invention.
  • the invention also relates to host cells comprising one or more of the nucleic acid molecules or vectors of the invention, particularly those nucleic acid molecules and vectors described in detail herein.
  • Representative host cells that may be used according to this aspect of the invention include, but are not limited to, bacterial cells, yeast cells, insect cells, plant cells and animal cells.
  • Preferred bacterial host cells include Escherichia spp. cells (particularly E. coli cells and most particularly E. coli strains DH10B, Stbl2, DH5a, DB3, DB3.1 (preferably E. coli LIBRARY EFFICIENCY® DB3.1TM Competent Cells; Invitrogen Corporation, Life Technologies Division, Rockville, MD), DB4 and DB5 (see
  • E. coli W strains such as those described in United States provisional patent application 60/139,889 filed June 22, 1999, Bacillus spp. cells (particularly B. subtilis and B. megaterium cells), Streptomyces spp. cells, Erwinia spp. cells, Klebsiella spp. cells, Serratia spp. cells (particularly S. marcessans cells), Pseudomonas spp. cells (particularly P. aeruginosa cells), and Salmonella spp. cells (particularly S. typhimurium and S. typhi cells).
  • Preferred animal host cells include insect cells (most particularly Drosophila melanogaster cells, Spodoptera frugiperda Sf9 and Sf21 cells and Trichoplusa High-Five cells), nematode cells (particularly C. elegans cells), avian cells, amphibian cells (particularly Xenopus laevis cells), reptilian cells, and mammalian cells (most particularly CHO, COS, VERO, BHK and human cells).
  • Preferred yeast host cells include Saccharomyces cerevisiae cells and Pichia pastoris cells. These and other suitable host cells are available commercially, for example from Invitrogen Corporation, Life Technologies Division (Rockville, Maryland), Amencan Type Culture Collection (Manassas, Virginia), and Agncultural Research Culture Collection (NRRL, Peona, Illinois)
  • nucleic acid molecules and/or vectors of the invention may be introduced into host cells using well known techniques of infection, transduction, transfection, and transformation.
  • the nucleic acid molecules and/or vectors of the invention may be introduced alone or in conjunction with other the nucleic acid molecules and/or vectors.
  • the nucleic acid molecules and/or vectors of the invention may be introduced into host cells as a precipitate, such as a calcium phosphate precipitate, or in a complex with a lipid Electroporation also may be used to introduce the nucleic acid molecules and/or vectors of the invention into a host.
  • such molecules may be introduced into chemically competent cells.
  • the chemically competent cells are E coli cells, particularly E coli W cells If the vector is a virus, it may be packaged in vitro or introduced into a packaging cell and the packaged virus may be transduced into cells.
  • Polymerases for use in the invention include but are not limited to polymerases (DNA and RNA polymerases), and reverse transcriptases.
  • DNA polymerases include, but are not limited to, Thermus thermophilus (Tth) DNA polymerase, Thermus aquaticus (Taq) DNA polymerase, Thermotoga neopolitana (Tne) DNA polymerase, Thermotoga maritima (Tma) DNA polymerase, Thermococcus litoralis (Tli or VENTTM) DNA polymerase, Pyrococcus furiosus (Pfu) DNA polymerase, DEEPVENTTM DNA polymerase, Pyrococcus woosii (Pwo) DNA polymerase, Pyrococcus sp KOD2 (KOD) DNA polymerase, Bacillus sterothermophilus (Bst) DNA polymerase, Bacillus caldophilus (Bca) DNA polymerase, Sulfolobus (Sac) DNA
  • RNA polymerases such as T3, T5 and SP6 and mutants, variants and derivatives thereof may also be used in accordance with the invention.
  • the nucleic acid polymerases used in the present invention may be mesophilic or thermophilic, and are preferably thermophilic.
  • Preferred mesophilic DNA polymerases include Pol I family of DNA polymerases (and their respective Klenow fragments) any of which may be isolated from organism such as E. coli, H. influenzae, D. radiodurans, H. pylori, C. aurantiacus, R. prowazekii, T.pallidum, Synechocystis sp., B. subtilis, L. lactis, S. pneumoniae, M. tuberculosis, M. leprae, M.
  • smegmatis Bacteriophage L5, phi-C31 , T7, T3, T5, SP01, SP02, mitochondrial from 5. cerevisiae MIP-1, and eukaryotic C. elegans, and D. melanogaster (Astatke, M. et al., 1998, J. Mol Biol. 278, 147-165), pol III type DNA polymerase isolated for any sources, and mutants, denvatives or vanants thereof, and the like.
  • thermostable DNA polymerases that may be used in the methods and compositions of the invention include Taq, Tne, Tma, Pfu, KOD, Tfl, Tth, Stoffel fragment, VENTTM and DEEPVENTTM DNA polymerases, and mutants, vanants and denvatives thereof (U.S. Patent No. 5,436,149; U.S. Patent 4,889,818; U.S. Patent 4,965,188; U.S.
  • Reverse transcnptases for use in this invention include any enzyme having reverse transcnptase activity.
  • Such enzymes include, but are not limited to, retro viral reverse transcnptase, retrotransposon reverse transcnptase, hepatitis B reverse transcnptase, cauliflower mosaic virus reverse transcnptase, bactenal reverse transcnptase, Tth DNA polymerase, Taq DNA polymerase (Saiki, R.K., et al., Science 239:487-491 (1988); U.S. Patent Nos.
  • Tne DNA polymerase WO 96/10640 and WO 97/09451
  • Tma DNA polymerase U. S. Patent No. 5,374,553
  • mutants, vanants or denvatives thereof see, e.g., WO 97/09451 and WO 98/47912.
  • Preferred enzymes for use in the invention include those that have reduced, substantially reduced or eliminated
  • RNase H activity By an enzyme “substantially reduced in RNase H activity” is meant that the enzyme has less than about 20%, more preferably less than about 15%, 10% or 5%, and most preferably less than about 2%, of the RNase H activity of the corresponding wildtype or RNase H + enzyme such as wildtype Moloney Munne Leukemia Virus (M-MLV), Avian Myeloblastosis Virus (AMV) or Rous Sarcoma Virus (RSV) reverse transcnptases.
  • M-MLV Moloney Munne Leukemia Virus
  • AMV Avian Myeloblastosis Virus
  • RSV Rous Sarcoma Virus reverse transcnptases.
  • the RNase H activity of any enzyme may be determined by a vanety of assays, such as those described, for example, in U.S. Patent No.
  • polypeptides for use in the invention include, but are not limited to, M-MLV H " reverse transcriptase, RS V H “ reverse transcriptase, AMV H “ reverse transcriptase, RAV (rous-associated virus) H “ reverse transcriptase, MAV (myeloblastosis-associated virus) H “ reverse transcriptase and HIV H “ reverse transcriptase. (See U.S. Patent No.
  • Enzymes having reverse transcriptase activity for use in the invention may be obtained commercially, for example from Invitrogen Corporation, Life Technologies Division (Rockville, Maryland), Pharmacia (Piscataway, New Jersey), Sigma (Saint Louis, Missouri) or Boehringer Mannheim Biochemicals (Indianapolis, Indiana).
  • polymerases or reverse transcriptases having polymerase acti vity may be isolated from their natural viral or bacterial sources according to standard procedures for isolating and purifying natural proteins that are well-known to one of ordinary skill in the art (see, e.g., Houts, G.E., et al., J. Virol. 29:517 (1979)).
  • polymerases/reverse transcriptases may be prepared by recombinant DNA techniques that are familiar to one of ordinary skill in the art (see, e.g., Kotewicz, M.L., et al., Nucl. Acids Res. 16:265 (1988); U.S. Patent No. 5,244,797; WO
  • the present invention may be used in combination with any method involving the synthesis of nucleic acid molecules, such as DNA (including cDNA) and RNA molecules.
  • nucleic acid molecules such as DNA (including cDNA) and RNA molecules.
  • methods include, but are not limited to, nucleic acid synthesis methods, nucleic acid amplification methods and nucleic acid sequencing methods.
  • Nucleic acid synthesis methods may comprise one or more steps.
  • the invention provides a method for synthesizing a nucleic acid molecule comprising (a) mixing a nucleic acid template (e.g., a target molecule comprising an integration sequence) with one or more primers and one or more enzymes having polymerase or reverse transcriptase activity to form a mixture; and (b) incubating the mixture under conditions sufficient to make a first nucleic acid molecule complementary to all or a portion of the template.
  • the nucleic acid template may be a DNA molecule such as a cDNA molecule or library, or an RNA molecule such as a mRNA molecule. Conditions sufficient to allow synthesis such as pH, temperature, ionic strength, and incubation times may be optimized by those skilled in the art.
  • the target or template nucleic acid molecules or libraries may be prepared from nucleic acid molecules obtained from natural sources, such as a variety of cells, tissues, organs or organisms.
  • Cells that may be used as sources of nucleic acid molecules may be prokaryotic (bacterial cells, including those of species of the genera Escherichia, Bacillus, Serratia, Salmonella, Staphylococcus, Streptococcus, Clostridium, Chlamydia, Neisseria, Treponema, Mycoplasma, Borrelia, Legionella, Pseudomonas,
  • fungi especially yeast's
  • plants protozoans and other parasites
  • animals including insects (particularly Drosophila spp. cells), nematodes (particularly Caenorhabditis elegans cells), and mammals (particularly human cells)).
  • insects particularly Drosophila spp. cells
  • nematodes particularly Caenorhabditis elegans cells
  • mammals particularly human cells
  • Nucleic acid amplification methods may include the use of one or more polypeptides having reverse transcriptase activity, in methods generally known in the art as one-step (e.g., one-step RT-PCR) or two-step (e.g., two-step RT-PCR) reverse transcr ⁇ ptase-amplification reactions.
  • one-step e.g., one-step RT-PCR
  • two-step e.g., two-step RT-PCR
  • Amplification methods according to the invention may comprise one or more steps.
  • the invention provides a method for amplifying a nucleic acid molecule comprising (a) mixing one or more enzymes with polymerase activity with one or more nucleic acid templates (e.g., a target molecule comprising an integration sequence); and (b) incubating the mixture under conditions sufficient to allow the enzyme with polymerase activity to amplify one or more nucleic acid molecules complementary" to all or a portion of the templates.
  • the invention also provides nucleic acid molecules amplified by such methods.
  • amplification methods which may be used in accordance with the present invention include PCR (U.S. Patent Nos. 4,683.195 and 4,683,202), Strand Displacement Amplification (SDA; U.S. Patent No. 5,455,166; EP 0 684 315), and Nucleic Acid Sequence-Based Amplification (NASB A; U.S. Patent No. 5,409,818; EP 0 329 822).
  • these amplification methods comprise: (a) mixing one or more enzymes with polymerase activity with the nucleic acid sample in the presence of one or more primer sequences, and (b) amplifying the nucleic acid sample to generate a collection of amplified nucleic acid fragments, preferably by PCR or equivalent automated amplification technique.
  • the amplified or synthesized nucleic acid fragments may be isolated for further use or characterization. This step is usually accomplished by separation of the amplified or synthesized nucleic acid fragments by size or by any physical or biochemical means including gel electrophoresis, capillary electrophoresis, chromatography (including sizing, affinity and immunochromatography), density gradient centrifugation and immunoadsorption. Separation of nucleic acid fragments by gel electrophoresis is particularly preferred, as it provides a rapid and highly reproducible means of sensitive separation of a multitude of nucleic acid fragments, and permits direct, simultaneous comparison of the fragments in several samples of nucleic acids.
  • the invention is also directed to isolated nucleic acid molecules produced by the amplification or synthesis methods of the invention.
  • one or more of the amplified or synthesized nucleic acid fragments are removed from the gel which was used for identification (see above), according to standard techniques such as electroelution or physical excision.
  • the isolated unique nucleic acid fragments may then be inserted into standard vectors, including expression vectors, suitable for transfection or transformation of a variety of prokaryotic (bacterial) or eukaryotic (yeast, plant or animal including human and other mammalian) cells.
  • nucleic acid molecules produced by the methods of the invention may be further charactenzed, for example by sequencing (1 e , determining the nucleotide sequence of the nucleic acid fragments), by methods descnbed below and others that are standard in the art (see, e g., U.S Patent Nos 4,962,022 and 5,498,523, which are directed to methods of DNA sequencing) Nucleic acid sequencing methods according to the invention may compnse one or more steps.
  • the invention may be combined with a method for sequencing a nucleic acid molecule compnsing (a) mixing an enzyme with polymerase activity with a nucleic acid molecule to be sequenced, one or more pnmers, one or more nucleotides, and one or more terminating agents (such as a dideoxynucleotides) to form a mixture, (b) incubating the mixture under conditions sufficient to synthesize a population of molecules complementary to all or a portion of the molecule to be sequenced; and (c) separating the population to determine the nucleotide sequence of all or a portion of the molecule to be sequenced
  • Nucleic acid sequencing techniques which may be employed include dideoxy sequencing methods such as those disclosed in U S. Patent Nos 4.962,022 and 5,498,523.
  • kits which may be used in conjunction with the invention Kits according to this aspect of the invention may compnse one or more containers, which may contain one or more components selected from the group consisting of one or more nucleic acid molecules or vectors of the invention, one or more polymerases, one or more reverse transcriptases, one or more insertion-catalyzing enzymes, one or more recombination proteins (or other enzymes for carrying out the methods of the invention), one or more buffers, one or more detergents, one or more restnction endonucleases, one or more nucleotides, one or more terminating agents (e.g , ddNTPs), one or more transfection reagents, pyrophosphatase, and the like.
  • the kits of the invention may also comprise instructions for carrying out methods of the invention.
  • a target molecule is cloned into a first vector suitable for recombinational cloning as described according to the methods and procedures of the GATEWAYTM Cloning System (see U.S. Patent No. 5,888,732, U.S. Patent Appl. Nos. 09/438,358 and 09/517,466, and the instruction manual entitled GATEWAYTM Cloning Technology (Versions 1 and 2), all of which are incorporated by reference herein in their entireties).
  • the target DNA molecule is inserted into an appropriate vector such that the target molecule is flanked by recombination sites. In some embodiments, the recombination sites are not capable of recombining with each other.
  • the target-containing first vector is contacted with a solution containing an integration sequence such as a transposon, the appropriate cofactors such as buffer salts, ions and the like and an enzyme that catalyzes the insertion of the integration sequence into the target DNA molecule.
  • an integration sequence such as a transposon
  • the appropriate cofactors such as buffer salts, ions and the like
  • an enzyme that catalyzes the insertion of the integration sequence into the target DNA molecule.
  • the transposon could be inserted into the target DNA in an in vivo reaction such as the conjugal transfer of a plasmid to insert a gd based transposon described by Strathmann, et al. (Proceedings of the National Academy of Sciences, USA, 88: 1247-1250, 1991, specifically incorporated herein by reference).
  • the DNA sequence of the transposon will include terminal sequences that serve as substrates for the insertion-catalyzing enzyme and the enzyme will catalyze the insertion of the transposon into the target DNA molecule.
  • the insertion-catalyzing enzyme will also catalyze the insertion of the transposon into the vector as well.
  • the result of the transposition reaction will be a population of molecules having transposons inserted in various places in the vector and the target DNA as shown in Figure 2.
  • the target DNA sequence is flanked by two recombination sites (RS, and RS 2 ).
  • the integration sequence is shown as comprising a selectable marker (SM2) and a primer binding sequence at each end.
  • SM2 selectable marker
  • SM2 primer binding sequence
  • Transposons suitable for use in the present invention may comprise one or more selectable markers.
  • the transposons of the present invention may comprise a toxic gene.
  • the toxic gene may be a suicide gene, i. e. be lethal to susceptible organisms whenever the gene is expressed or the toxic gene may be conditionally lethal, i. e., be lethal to a susceptible organism only when the gene is expressed and some additional factor is present.
  • transposons suitable for use in the sequencing methods of the present invention may comprise one or more sequences suitable for binding a primer.
  • a primer may be used to determine the sequence of the target DNA molecule adjacent to the transposon or may be used for other purposes such as PCR.
  • Suitable sequences may be of any length as long as the primer: DNA duplex formed upon incubation of the primer with the DNA to be sequenced or amplified is sufficiently stable to permit the subsequent reaction, . e. sequencing or PCR, to be conducted.
  • the actual nucleotide sequence of the primer binding site is not critical as long as it is known. The selection of suitable primer binding sequences and the determination of the appropriate reaction conditions for subsequent reactions are routine tasks for those of ordinary skill in the art.
  • Transposons suitable for insertion into DNA target molecules in order to clone portions of the target may comprise one or more recombination sites or portions thereof.
  • transposons of the present invention will contain two recombination sites which may be the same or different. The two sites may be in opposite orientation to each other.
  • Transposons suitable for cloning applications may comprise an origin of replication.
  • the origin of replication may be selected to be compatible with the origin of replication in one or more of the vectors used in the practice of the present invention. This will permit the nucleic acid molecules comprising the origin of replication derived from the transposon to be stably maintained in cells that also contain the vector.
  • the origin of replication may be selected so as to be incompatible with the origin of replication in the vector. This will facilitate segregation of the vector and the transposon containing nucleic acid molecule.
  • the sequences and characteristics of origins of replication are well known to those skilled in the art. Examples of suitable origins of replication may be found in Current Protocols in Molecular Biology, Ausubel, et al.
  • origins of replication may direct the replication of nucleic acid molecules containing them in a variety of organisms.
  • the origin of replication may function in prokaryotic host cells such as those previously discussed.
  • the origin of replication may function in eukaryotic host cells.
  • Transposons suitable for use in the present invention may contain a DNA sequence that includes one or more sites that serve as a substrate for one or more restriction enzymes.
  • the transposons used in the present invention may comprise a site that serves as a substrate for a restriction enzyme that cuts infrequently, a so called "rare cutter.”
  • the Vector Donor may also provide one or more sites for a rare cutter.
  • the Vector Donor may be provided with two rare cutter sites which may be the same or different and which are adjacent to the recombination sites.
  • a transposon of the present invention may comprise more than one of the features discussed above.
  • a transposon may comprise an origin of replication in addition to recombination sites and may further comprise one or more primer binding sequences, selectable markers and/or suicide genes. Other useful combinations of features will be readily apparent to those skilled in the art and are within the scope of the present invention.
  • the molar ratio of transposon to target- containing first vector in the transposition reaction will range from about 25: 1 to about 1:25. In preferred embodiments, the molar ratio will range from about 10:1 to about 1:10. The molar ratio may be varied in order to ensure that one transposon is inserted into the DNA target.
  • the size of the first vector is large compared to the target, it may be desirable to have a higher ratio of transposon: vector to bias the reaction in favor of multiple insertions into each target-containing first vector in order to obtain an insertion into the target DNA.
  • the size of the target DNA is large compared to the vector, it may be desirable to reduce the transposomvector ratio.
  • a typical in vitro transposition reaction may contain transposon, target- containing first vector, ions, buffering agents and the like. Suitable reaction conditions may be about 100-500 ng of transposon and about 1 mg of target- containing first vector.
  • the reaction may contain a divalent metal ion in a concentration from about 0.5 mM to about 250 mM.
  • MgCl 2 may be source of the divalent metal ion and may be present in a concentration from about 1 mM to about 50 mM, more preferably from about 5 mM to about 20 mM.
  • the reaction solution may also contain a buffering agent in a concentration from about 1 mM to about 100 mM, more preferably from about 5 mM to about 50 mM and most preferably from about 10 mM to about 25 mM.
  • a suitable buffering agent is Tris.
  • the reaction solution may also contain a reducing agent such as b-mercaptoethanol (b-ME), dithiothreitol (DTT) or dithioerythritol (DTE) at a concentration from about 0.1 mM to about 5 mM, preferably at about 1 mM.
  • the pH of the reaction solution may be from about 6.5 to about 8.5, preferably about 7.5.
  • the reaction solution may contain monovalent cations in a concentration from about 1 mM to about 100 mM, preferably from about 5 mM to about 25 mM, most preferably at about 10 mM.
  • Suitable sources of monovalent cations include KC1 and NaCl.
  • a suitable set of reaction conditions is 15 mM MgCl 2 , 10 mM Tris-HCl, pH 7.5, lO mM KCl, ImM DTT and sufficient insertion-catalyzing enzyme activity to catalyze the insertion reaction.
  • Suitable reaction conditions will vary depending upon the source of the integration sequence/insertion-catalyzing enzyme pair. Those skilled in the art will appreciate that the various insertion-catalyzing enzymes known have optimal activity under conditions specific to each enzyme.
  • the determination and optimization of the reaction conditions for a given enzyme may be accomplished by routine experimentation by those skilled in the art.
  • the reaction conditions may be varied based upon the size of the transposon and vector, and the activity of the insertion-catalyzing enzyme preparation.
  • the transposition reaction may be carried out in the presence of reagents that increase the effective concentration of the nucleic acid species present in the reaction.
  • a suitable reagent of this kind is polyethylene glycol (PEG).
  • a suitable PEG is PEG 8000.
  • the reaction mixture may be incubated at an appropriate temperature, for example, from about 20 °C to about 37 °C, for a suitable period of time, for example, from about 15 minutes to about 16 hours.
  • the optimum temperature and incubation period for a given transposon, target and insertion-catalyzing enzyme preparation can be determined by routine experimentation by one of ordinary skill in the art.
  • the DNA After incubation of the transposition reaction, the DNA may be used as is or may be purified by means known to those skilled in the art. When used without purification, the insertion-catalyzing enzyme may be inactivated, for example, by heating at 65 °C for 20 minutes. Suitable methods for purification of the DNA from the transposition reaction include phenol/chloroform extraction and ethanol precipitation, extraction using silica, for example the CONCERTTM system available from Invitrogen Corporation, Life Technologies Division, Rockville, MD, or any other purification scheme used by those skilled in the art.
  • the transposition reaction When the transposition reaction is sufficiently efficient, enough molecules of the first vector comprising the transposon-containing target DNA molecule will be made to serve as a substrate for the subsequent recombination reaction. In other instances, it may be necessary to transform competent host organisms with the molecules made in the transposition reaction and grow the transformed organisms to amplify the reaction products.
  • the transformed organisms may be grown in the presence of a suitable selection agent, such as antibiotic, to ensure the presence of the selectable marker present on the transposon in the growing organisms. Amplification steps are routine in the art and the skilled artisan can select suitable organisms and transformation conditions and isolate the amplified reaction products without the use of undue experimentation.
  • a transposon-containing target DNA molecule in a first vector can be transferred to a second vector using recombinational cloning.
  • the products of the insertion reaction discussed in the previous example can be mixed with a second vector termed a Vector Donor.
  • the Vector Donor comprises recombination sites indicated as RS 3 and RS in Figure 3 which recombination sites are compatible with the recombination sites present in the first vector.
  • the Target DNA molecule is transfened to the second vector.
  • the Vector Donor comprises a toxic gene between recombination sites in addition to a selectable marker outside the recombination sites (SM 3 ).
  • SM 3 selectable marker outside the recombination sites
  • the first and the second vector are incubated in a suitable buffer.
  • the reaction conditions may be optimized for the particular vectors and recombination proteins used.
  • the reaction solution may contain a buffering agent at a concentration capable of maintaining the desired pH.
  • the concentration of the buffering agent may be from about lmM to about 100 mM. Preferably from about 10 mM to about 50 mM.
  • a suitable buffering agent is Tris.
  • the pH of the reaction solution may be varied depending upon the pH optimum of the recombination enzymes used.
  • the pH of the reaction solution will be from about 6. 5 to about 8.5, more preferably from about 7.0 to 8.0 and most preferably 7.5.
  • the reaction solution may contain monovalent cations in a concentration from about 1 mM to about 100 mM, preferably from about 5 mM to about 50 mM and most preferably from about 20 mM to about 35 mM.
  • a suitable source of monovalent cation is NaCl.
  • the reaction solution may also contain spermidine in a concentration from about 0.1 mM to about 10 mM, preferably from about 1 mM to about 5 mM.
  • the reaction solution may also contain bovine serum albumin (BSA) at a concentration from about 50 mg/mL to about 5 mg/mL, preferably from ablut 100 mg/mL to about 1 mg/mL most preferably at about 500 mg/mL.
  • BSA bovine serum albumin
  • the reaction solution may also contain a chelating agent at a concentration of from about 0.1 mM to about 10 mM, preferably at about 1 mM to about 5 mM.
  • One suitable set of reaction conditions is 50 mM Tris'HCL, pH 7.5, 33 mM NaCl, 5 mM spermidine'HCl and 500 mg/mL bovine serum albumin.
  • the reaction conditions may include 25 mMTris ⁇ Cl, pH7.5, 22 mM NaCl, 5 mM EDTA, 5 mM spermidine»HCl and 1 mg/mL BSA.
  • the reaction mixture is incubated at about 25°C for about 60 minutes and then incubated with a protease, for example proteinase K, for ten minutes to inactivate the recombination proteins.
  • a protease for example proteinase K
  • An increase in the efficiency of the recombination reaction is realized by linearizing the vectors prior to the recombination reaction. This may be accomplished by digestion with a suitable restriction enzyme.
  • topoisomerase I may be added to the recombination reaction.
  • the reaction mixture may be used to transform a competent host organism.
  • the transformed host may be grown in the presence of suitable selection agents to ensure the presence of the desired reaction product.
  • the growth medium for the transformed host may comprise two antibiotics in those embodiments where the transposon codes for resistance to one of the antibiotics and the second vector codes for resistance to the other antibiotic.
  • the transposon carries a selectable marker SM 2 while the Vector Donor carries SM 3 .
  • the first vector may code for resistance to yet a third antibiotic, i. e. SM,.
  • the growth conditions will also select for the absence of the toxic gene.
  • any organism capable of growing under these conditions will contain both the selectable marker from the transposon and the selectable marker from the second vector and will not contain the toxic gene. These molecules will be the result of recombination between the first vector and the second vector and resolution of the cointegrate intermediate.
  • the product molecule will contain the target DNA containing an insertion and flanked by recombination sites that are the product of the recombination of the sites in the vector donor with the original flanking sites depicted as RS 1+3 and RS 2+4 .
  • the product molecule would contain the target nucleic acid flanked on one end by either ⁇ ttBl or ⁇ ttPl and flanked on the other end by either ⁇ ttB2 or ⁇ ttP2 depending upon the orientation of the sites with respect to the target sequence.
  • the product molecule may contain the target nucleic acid flanked by distinct mutant ⁇ ttB sites.
  • the mixture may be used i7i vitro to further manipulate the target:
  • Oligonucleotides to the vector into which the transposon-containing DNA segment has been transferred can be used in conjunction with oligonucleotides complementary to the transposon to generate a population of amplicons extending from the vector to the site of transposon insertion. These segments can be cloned (for example, if the oligonucleotides contain recombination sites, or if the vector is charged with topoisomerase) and further manipulated or sequenced.
  • the target DNA may have a sequence that results in the expression of one or more biological activities of interest when introduced into an appropriate host cell.
  • introduction of the target DNA sequence may result in the expression of a particular enzymatic activity.
  • the sequence of transposon-containing target DNA molecules may be determined by contacting the target DNA molecule with a primer that binds to a portion of the transposon sequence and then performing any suitable sequencing protocol known to those skilled in the art. It is important to note that, for sequencing applications, the present invention overcomes the obstacle presented by insertion of a transposon into the vector sequence instead of, or in addition to, insertion into the target DNA.
  • Figure 2 depicts only a single insertion into a target-containing vector molecule; however, those skilled in the art will appreciate that multiple insertions are also possible.
  • the recombination step that moves the target DNA into a second vector after completion of the transposition reaction effectively eliminates the concern over sequencing the vector since the first vector sequence is not recovered from the recombination reaction.
  • This is in contrast to the prior art where insertions into the vector would make it necessary to repeatedly sequence the vector or perform tedious screening procedures to eliminate clones in which the transposon inserted into the vector.
  • the resulting molecules could not be used in the prior art methods since the presence of two primer binding sites in the same molecule to be sequenced would generate an unintelligible mixture of products. Since the present methods remove the transposon containing vector portion of the starting DNA molecule, more molecules that can be sequenced can be recovered from a given transposition reaction.
  • the methods of the present invention can be used to clone segments of large DNA molecules such as genomic DNA as shown in Figure 4A.
  • the methods of the present invention permit cloning of segments of any larger DNA molecule.
  • genomic DNA any large DNA molecule can be cloned using these methods.
  • the large DNA molecule might be a YAC, BAC or any isolated chromosome or portions thereof
  • Genomic DNA is isolated from the organism of interest and is contacted with a transposon comprising one or more recombination sites and an insertion- catalyzing enzyme under conditions causing the integration of the transposon into the genomic DNA.
  • the genomic DNA is then contacted with a Vector Donor having recombination sites compatible with the recombination sites in the transposons ( Figure 4A).
  • the recombination sites in the transposon and the Vector Donor may be oriented so that the transposon alone cannot productively react with the Vector Donor ( Figure 4B).
  • the reaction mixture can be used to transform competent host cells.
  • the transformed host cells are grown under condition that select for the presence of the selectable marker on the Vector Donor and against the presence of the toxic gene.
  • the transposon can be modified so that the recombination event transfers the selectable marker with the genomic DNA to the Vector Donor. This configuration of the transposon is shown in Figure 5.
  • Transposons suitable for embodiments involving the cloning of genomic DNA may comprise two recombination sites.
  • the recombination sites will have the same sequence and will be in opposite orientation, i. e., inverted repeats.
  • a schematic representation of cloning of genomic DNA using this embodiment is shown in Figure 6.
  • the transposon may comprise a DNA sequence coding for a toxic gene.
  • Transposons of this type will be useful in preventing recombinational cloning of the transposon or of genomic fragments that have an additional transposon located between the transposons that provided the recombination sites used for cloning.
  • the recombination sites may have different sequences and be in opposite orientation.
  • the genomic DNA is contacted with a Vector Donor molecule having recombination sites compatible with those in the transposon and the appropriate recombination proteins under conditions that result in a recombination between the recombination sites on the transposon and the recombination sites on the Vector Donor. Transformation and screening may be carried out as described above.
  • the transposons used in genomic cloning may comprise an origin of replication.
  • a transposon comprising one or more recombination site and further comprising an origin of replication is inserted into the genomic DNA.
  • a recombination site present on a transposon may recombine with a recombination site present on an adjacent transposon resulting in the excision of the fragment between the two recombination sites. Since the excised molecule is a circular molecule having an origin of replication, the excised molecule is capable of being stably maintained in a host cell.
  • the transposons of the present invention may optionally comprise one or more selectable markers.
  • one population may comprise a recombination site and an origin of replication while the other transposon may comprise a selectable marker and a recombination site.
  • the recombination between the recombination sites present on two adjacent transposons produces a DNA molecule that contains an origin of replication and a selectable marker in addition to the DNA of interest.
  • Such a molecule may be transformed into an appropriate host cell line a selected for using one or more of the selectable markers. This is shown schematically in Figure 7.
  • the ratio of the concentration of the genomic DNA and the concentration of the transposon present in the integration reaction may be varied so as to control the size of the genomic DNA fragments transferred into the Vector Donor. By increasing the concentration of transposons, the average size of the genomic DNA fragment may be decreased.
  • Target DNA containing a transposon may be used to construct clones containing less than the entire sequence of the target DNA. Such smaller clones are generally termed subclones.
  • a transposon may be inserted into a target DNA that contains or is flanked by recombination sites, to produce the molecule shown at the top of Figure 8 A.
  • the transposon may contain one or more recombination sites that are different from the recombination sites in the target molecule, and may in addition contain one or more selectable markers. This molecule is then contacted with one or two Vector Donors that contain recombination sites that will recombine with sites on the transposon and the target.
  • the vector containing the target DNA may be provided with additional recombination sites, while the Vector Donor(s) contain recombination sites that recombine with these additional sites.
  • a recombination is conducted and then the nucleic acid produced in the recombination reaction is inserted into host cells. By plating portions of the transformation reaction on various selective media, the desired subclones can be isolated as shown in Figure 8A.
  • segments of the target DNA may be replaced.
  • the segment of the target DNA flanked by RS, and RS 2 can be exchanged with a replacement sequence.
  • the replacement sequence may be of a different size than segment replaced.
  • exchange of a large segment of the target DNA with a small replacement sequences results in a deletion of a part of the target sequence.
  • the replacement sequence introduce into the target DNA any desired characteristic including, but not limited to, the expression of a desired biological activity.
  • a transposon comprising a recombination site, origin of replication and a selectable marker is integrated into a target molecule.
  • the recombination site present on the transposon is selected so as to be compatible with a recombination site present on the vector comprising the target DNA molecule.
  • a recombination is conducted in the absence of a vector donor. The result is the excision of the DNA between the recombination site present in the transposon and the recombination site present in the vector. Since the excised portion of the target DNA comprises an origin of replication and a selectable marker, the excised portion can be inserted into a host cell and will be stably maintained. The result is to subclone the excised portion of the target DNA. This is schematically shown in Figure 9.
  • the methods of the present invention can be used to clone PCR fragments.
  • Primers containing recombination sequences (or portions thereof) are used to amplify a target DNA sequence (see United States provisional patent application number 60/065,930 filed October 24, 1997 and United States patent application serial number 09/177,387).
  • the PCR primers may have a sequence that permits the generation of ligatable ends, for example, by including recognition sequence for a restriction enzyme.
  • the resultant linear fragment flanked by recombination sites (or ligatable ends) is reacted with a transposon containing a selectable marker and an origin of replication. After integration of the transposon, a recombination reaction (or ligation reaction) is conducted.
  • the result is a circular molecule having an origin of replication and a selectable marker.
  • the molecule may be circularized first, followed by integration of the transposon.
  • the circular molecule may be transformed into a competent host cell and maintained. This method will be particularly useful for the construction of gene targeting vectors.
  • the transposon may comprise a selectable marker that confers resistance to neomycin and cells comprising the selectable marker may be selected with G-418.
  • FIG. 10 A schematic representation of this method is shown in Figure 10
  • a target DNA molecule is amplified using pnmers containing recombination sites indicated by RS , and RS 2
  • An integration sequence is inserted into the amplification product which is then circulanzed by a recombination event
  • the amplification product containing the integration sequence may be reacted with another nucleic acid molecule having recombination sites compatible with those in the amplification product
  • Example 6 Construction of Deletions in a Target DNA Molecule
  • a vector comprising a target DNA molecule flanked by two different, non-interacting recombination sites is contacted with a transposon and an insertion-catalyzing enzyme under conditions causing the insertion of the transposon into the target DNA molecule or into the vector or into both
  • the transposon is constructed to contain a recombination site compatible with one of the recombination sites flanking the target DNA molecule as well as a sequencing pnmer binding site
  • the transposon may contain a sequence coding for a selectable marker and a sequence coding for a toxic gene distnubbed as shown in Figure 11
  • a recombination reaction may be earned out between the recombination site present on the transposon and the compatible recombination site present on the vector With reference to Figure 11, this would be a recombination between RS 3 and RS 2
  • the recombination reaction mixture is used to transform competent host cells that are susceptible to the toxic gene and the transformed host cells are spread on plates containing suitable reagents for selection using the selectable marker present on the transposon and the selectable marker present on the vector Insertion of the transposon into the vector sequence oi insertion of the transposon into the target DNA so that the recombination site in the transposon is in an inverse onentation with regard to the cognate recombination site in the vector results in a molecule that retains the toxic gene and, thus, will not produce colonies upon transformation.
  • the transposon When the transposon is inserted into the target DNA so that the recombination site in the transposon has the same orientation as the recombination site on the vector, a portion of the target DNA is deleted as well as the portion of the transposon containing the toxic gene. The resulting deleted plasmid will produce colonies upon transformation.
  • Plasmids may be recovered from positive colonies and the size of the recovered plasmids may be determined by gel electrophoresis in order to assay how much of the target DNA was deleted.
  • the plasmids may be analyzed by restriction mapping using conventional techniques.
  • the sequence that is deleted may be recovered, as shown in
  • FIG. 12 An insertion element containing one or more recombination sites is inserted into the target region of a molecule that contains a recombination site.
  • a Vector Donor When contacted with a Vector Donor, the region between the recombination site on the insertion element and the recombination site on the target molecule is transferred to the Vector Donor, resulting in the cloning of the deleted portion of the original target.
  • the methods of the present invention can further be used to generate populations of molecules attached to solid substrates.
  • This approach can be utilized to segregate members of the population, to provide nucleic acid molecules that may serve as templates for amplification or that may be used as substrates for further addition and manipulation of DNA segments, or in systems such as in vitro transcription/translation and as templates for probe generation.
  • a target DNA is reacted with a transposon that contains at least one recombination site.
  • the target DNA and the transposon are linear, although other configurations and structures (e.g. , circular, supercoiled, hairpin, etc.) of these molecules may also be used.
  • Random (or directed) mtegration of the transposon containing the recombination site generates a population of molecules each containing a recombination site This population can be further reacted with a recombination site that is immobilized on a solid substrate such that the recombination reaction generates covalent linkage of the target DNA with the immobilized recombination site
  • Each feature of the immobilization substrate thereby contains a member of the population.
  • individual feature can further be used as substrates for amplification using oligonucleotides complementary to the transposon and the end of the target DNA
  • sequencing several members from the population using the transposon as a mobile pnmer site the entirety of a large DNA segment can be determined
  • amphcons generated from the members on the feature can be used for the generation of probes, expression of segments of proteins, localization of domains (DNA or protein), etc.
  • members of each population can be cloned using a vector containing a recombination site and an end compatible with the end of the target DNA, or following amplification

Abstract

The present invention relates generally to methods, kits and compositions for use in manipulating nucleic acid molecules, particularly cloning, sequencing, amplifying and mutating such molecules. In particular, the invention relates to use of recombination sites and recombinational cloning to manipulate, select and analyze nucleic acid molecules of interest.

Description

Methods of Manipulating and Sequencing Nucleic Acid Molecules Using Transposition and Recombination
BACKGROUND OF THE INVENTION
Field of the Invention
The present invention relates generally to recombinant DNA technology. More specifically, the present invention relates generally to compositions, kits and methods for use in the construction and manipulation of nucleic acid molecules. The methods of the present invention involve the use of in vitro or in vivo integration and recombination events to construct and/or select desired nucleic acid molecules which may further be manipulated by any number of molecular biology techniques, including sequencing, amplification and mutagenesis.
Related Art
Site-specific Recombinases
Site-specific recombinases are proteins that are present in many organisms (e.g. viruses and bacteria) and have been characterized as having both endonuclease and ligase properties. These recombinases (along with associated proteins in some cases) recognize specific sequences of bases in DNA and exchange the DNA segments flanking those segments. The recombinases and associated proteins are collectively referred to as "recombination proteins" (see, e.g., Landy, A., Current Opinion in Biotechnology 3:699-707 (1993)).
Numerous recombination systems from various organisms have been described. See, e.g., Hoess, et al., Nucleic Acids Research 14(6):2287 (1986); Abremski, et ah, J. Biol. Chem. 261(1):391 (1986); Campbell, J. Bacteriol. 174(23):7495 (1992); Qian, et al, J. Biol. Chem. 267(11):7794 (1992); Araki, et al, J. Mol. Biol. 225(1 ):25 (1992); Maeser and Kahnmann, Mol. Gen. Genet.
230: 170-176) (1991); Esposito, et al, Nucl. Acids Res. 25(18):3605 (1997). Many of these belong to the integrase family of recombinases (Argos, et al, EMBO J. 5:433-440 (1986); Voziyanov, et al, Nucl. Acids Res. 27:930 (1999)). Perhaps the best studied of these are the Integrase/αtt system from bactenophage λ (Landy, A. Current Opinions in Genetics and Dev el. 3:699-707 (1993)), the Cre/loxP system from bactenophage PI (Hoess and Abremski (1990) In Nucleic Acids and Molecular Biology, vol.4. Eds.: Eckstein andLilley, Ber n-
Heidelberg: Spnnger- Verlag; pp. 90-109), and the FLP/FRT system from the Saccharomyces cerevisiae 2 μ circle plasmid (Broach, et al., Cell 29:227-234 (1982)).
Transposons
Transposons are mobile genetic elements. Transposons are structurally vanable, being descnbed as simple or compound, but typically encode a transposition catalyzing enzyme, termed a transposase, flanked by DNA sequences organized in inverted onentations. For a more thorough discussion of the charactenstics of transposons, one may consult Mobile Genetic Elements, D.
J. Sherratt, Ed., Oxford University Press (1995) and Mobile DNA, D. E. Berg and M. M. Howe, Eds., Amencan Society for Microbiology (1989), Washington, DC both of which are specifically incorporated herein by reference.
Transposons have been used to insert DNA into target DNA sequences. As a general rule, the insertion of transposons into target DNA is a random event.
One exception to this rule is the insertion of transposon Tn7. Transposon Tn7 can integrate itself into a specific site in the E. coli genome as one part of its life cycle (Stellwagen, A.E., and Craig, N L. Trends in Biochemical Sciences 23, 486- 490, 1998 specifically incorporated herein by reference). This site specific insertion has been used in vivo to manipulate the baculovirus genome (Lucklow et al. (J. Virol 67:4566-4579 (1993) specifically incorporated herein by reference). The site specificity of Tn7 is atypical of transposable elements whose hallmark is movement to random positions in acceptor DNA molecules. For the purposes of this application, transposition will be used to refer to random or quasi-random movement, unless otherwise specified, whereas recombination will be used to refer to site specific recombination events. Thus, the site specific insertion of Tn7 into the αttTn 7 site would be referred to as a recombination event while the random insertion of Tn7 would be referred to as a transposition event. York, et al. (Nucleic Acids Research, 26(8): 1927-1933, (1998)) disclose an in vitro method for the generation of nested deletions based upon an intramolecular transposition within a plasmid event using Tn5 A vector containing a kanamycin resistance gene flanked by two 19 base pair Tn5 transposase recognition sequences and a target DNA sequence was incubated in vitro in the presence of punfied transposase protein. Under the conditions of low
DNA concentration employed, the intramolecular transposition reaction was favored and was successfully used to generate a set of nested deletions in the target DNA. The authors suggested that this system might be used to generate C- terminal truncations in a protein encoded by the target DNA by the inclusion of stop signals in all three reading frames adjacent to the recognition sequences. In addition, the authors suggested that the inclusion of a His tag and kinase region might be used to generate N-terminal deletion proteins for further analysis.
Devine, etal, (Nucleic Acids Research, 22.3765-3772 (1994) and United States Patents Nos. 5,677,170 and 5,843,772, all of which are specifically incorporated herein by reference) disclose the construction of artificial transposons for the insertion of DNA segments into recipient DNA molecules in vitro. The system makes use of the insertion-catalyzing enzyme of yeast TY1 virus-like particles as a source of transposase activity. The DNA segment of interest is cloned, using standard methods, between the ends of the transposon- like element TY1. In the presence of the TY1 insertion-catalyzing enzyme, the resulting element integrates randomly into a second target DNA molecule.
Recombination Sites
A key feature of the recombination reactions mediated by the above-noted recombination proteins are recognition sequences, often termed "recombination sites," on the DNA molecules participating in the recombination reactions. These recombination sites are discrete sections or segments of DNA on the participating nucleic acid molecules that are recognized and bound by the recombination proteins during recombination. For example, the recombination site for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech. 5:521-527 (1994). Other examples of recognition sequences include the attB, attP, attL, and αttR sequences which are recognized by the recombination protein 1 Int. αttB is an approximately 25 base pair sequence containing two 9 base pair core-type Int binding sites and a 7 base pair overlap region, while αttP is an approximately 240 base pair sequence containing core-type Int binding sites and arm-type Int binding sites as well as sites for auxiliary proteins integration host factor (IHF), FIS and excisionase (Xis). See Landy, Curr. Opin. Biotech. 3:699-707 (1993).
Nucleic Acid Sequencing
Historically, two primary techniques have been used to sequence nucleic acids. In the first method, termed "Maxam and Gilbert sequencing" after its co- developers (Maxam, A.M. and Gilbert, W., Proc. Natl. Acad. Sci. USA 74:560- 564, 1977), DNA is radiolabeled, divided into four samples and treated with chemicals that selectively damage specific nucleotide bases in the DNA and cleave the molecule at the sites of damage. By separating the resultant fragments into discrete bands by gel electrophoresis and exposing the gel to X-ray film, the sequence of the original DNA molecule can be read from the film. This technique has been used to determine the sequences of certain complex DNA molecules, including the primate virus S V40 (Fiers, W., et al, Nature 273: 113- 120, 1978; Reddy, V.B., et al, Science 200:494-502, 1978) and the bacterial plasmid pBR322 (Sutcliffe, G., Cold Spring Harbor Symp. Quant. Biol.43:77-90, 1979). An alternative technique for sequencing, named "Sanger sequencing" after its developer (Sanger, F., and Coulson, A.R., J. Mol. Biol.94:444-448, 1975), has also been traditionally used. This method uses the DNA-synthesizing activity of DNA polymerases which, when combined with mixtures of reaction-terminating dideoxynucleoside triphosphates (Sanger, F., et al., Proc. Natl. Acad. Sci. USA 74:5463-5467, 1977) and a short primer (either of which may be detectably labeled), gives rise to a series of newly synthesized DNA fragments specifically terminated at one of the four dideoxy bases. These fragments are then resolved by gel electrophoresis and the sequence determined as described for Maxam and Gilbert sequencing above. By carrying out four separate reactions (one with each ddNTP), the sequences of even fairly complex DNA molecules may rapidly be determined (Sanger, F., et al, Nature 265:678-695, 1977; Barnes, W., Meth.
Enzymol. 152:538-556, 1987).
Despite their use for a number of years, however, both Maxam/Gilbert and Sanger sequencing are often time-consuming, expensive, and prone to errors in sequence determination. More recently, the determination of the nucleotide sequences of nucleic acid molecules has been performed using amplification- based methods. Probably the most commonly used of such methods rely on the use of the Polymerase Chain Reaction (PCR) described by Mullis and colleagues (see U.S. Patent Nos. 4,683,195 and 4,683,202), particularly using thermostable enzymes such as DNA polymerases that retain activity at the relatively high temperatures used in automated PCR methodologies (see Saiki, R.K., et al,
Science 239:487-491 (1988); U.S. Patent Nos. 4,889,818 and 4,965,188). Amplification-based methods of nucleic acid sequencing, particularly automated methods of dideoxy sequencing such as "cycle sequencing," utilize the thermostable polymerases and temperature cycling used in PCR applications in combination with a single primer and ddNTPs resulting in the synthesis of multiple dideoxy-terminated oligonucleotides from each template in contrast to the single oligonucleotide produced in standard Sanger sequencing. In addition to the increase in sensitivity provided by the synthesis of multiple oligonucleotides per template, use of higher denaturation temperatures in automated sequencing also improves sequencing efficiency (i.e., fewer isincorporations occur) and allows the sequencing of templates that are GC-nch or contain significant secondary structure
The key requirement of both the standard Sanger method of sequencing and amplification-based techniques is knowledge of the DNA sequence at the site to which the sequencing pnmer hybndizes. While it is possible to sequence small fragments in known vectors using pnmer sites in the vector adjacent to the fragment of interest, the sequencing of larger fragments is somewhat more problematic.
One possible method to circumvent this problem is to synthesize new pnmers having sequences complementary to the sequence determined in the initial sequencing reactions This technique is frequently referred to as "walking" the gene of interest.
An alternative to walking the gene is to create a set of nested deletions in the DNA molecule of interest (see Hemkoff, Gene 28(3)-351-9, 1984). The vector containing the insert is cleaved at one junction of the insert and the vector.
The resultant linear DNA molecule is then incubated with an exonuclease that removes bases from the end of the insert. By varying the incubation time, the number of bases removed from the insert can be vaned, resulting in a senes of DNAs containing progressively less of the insert. After ligation and transformation of the nuclease treated DNAs, a collection of clones can be isolated having new sequence adjacent to the pnming site in the vector thus permitting the entire insert to be sequenced using a pnmer that hybndizes to the vector sequence adjacent to the site of digestion.
In a recently developed technique, transposons have been used to insert small DNA molecules of known sequence into larger DNA molecules of unknown sequence. The known sequence can be used as the a pnmer recognition site and the DNA sequence of the larger DNA molecule adjacent to the inserted transposon can be determined using standard sequencing methods. Strathmann, et al, (Proc. Natl. Acad. Sci USA, 88: 1247-1250, 1990) descnbe one such system utilizing an in vivo insertion of gd transposon into target DNA The DNA of interest is cloned into a "mimplasmid" to bias the insertion of the transposon into the target DNA rather than the vector DNA
An in vitro transposon insertion system for sequencing applications was descnbed by Devine, et al. in United States patent no 5,728,551 which is specifically incorporated herein by reference. Artificial transposons refened to as "pnmer island" artificial transposons (PARTs) are reacted with a vector containing a target DNA in the presence of a transposase. The resultant population is screened to identify molecules containing a PART in the target DNA and the location of the PART in the target is mapped. A population of vectors with PARTs spaced appropnately in the target DNA is selected and the
DNA sequence of the target is determined using pnmers that hybridize to sequence in the PART
While it is possible to insert a transposon into a target DNA molecule, sequencing methods based on this technique suffer from a significant limitation. The random nature of the insertion of the transposon into the target DNA- containing vector result m frequent insertions of the transposon into the vector as well As a result, current methods require a tedious sorting procedure (for example by restnction mapping) to identify clones containing the appropnate insertions into the target DNA, or accept repeated sequencing of the vector. Both methods add considerably to the effort and expense of sequencing projects
Accordingly, there exists a need in the art for an alternative sequencing system that overcomes the limitations of the methods of the pnor art and provides for more rapid, efficient, and economical determinations of the nucleotide sequences of nucleic acid molecules This need and others is met by the present invention.
BRIEF SUMMARY OF THE INVENTION
The present invention generally concerns nucleic acid molecules (DNA or RNA) compnsing at least one integration sequence and at least one recombination site, wherein the recombination site(s) may be located within and/or outside (e.g. adjacent to) the integration sequences. In accordance with the invention, integration sequences may include any nucleic acid molecules which, through recombination or integration, becomes a part of the nucleic acid molecule of interest. Examples of integration sequences include, but are not limited to, transposons, insertion sequences, integrating viruses, homing introns, or other integrating elements, or various combinations thereof. In some preferred embodiments, the integrating sequences of the present invention may be insertion sequences or transposons or derivatives thereof. In one aspect, at least two recombination sites (which may be the same or different) are contained in the nucleic acid molecule outside the integration sequence and preferably flanking both sides of the integration sequence. In another aspect, at least two recombination sites (which may be the same or different) are contained within the integration sequence. The present invention specifically provides for nucleic acid molecules (preferably a vector) comprising a target nucleic acid sequence flanked by recombination sites and at least one integration sequence inserted into the target sequence. The recombination site(s), in accordance with the invention, may be used to exchange sequences with the molecule of interest, delete sequences from the molecule of interest, incorporate sequences into the molecule of interest, or otherwise identify, manipulate, analyze and/or select the molecule of interest.
In another aspect, various strategies utilizing homologous recombination can provide an alternative to transposons for integrating DNA segments of interest into a target sequence. These can be accomplished in vivo or in vitro. Yu et al (Proc Natl Acad Sci U S A 2000 May 23;97(l l):5978-83) have shown that
DNA segments containing homology to a target seqeunce can be efficiently integrated into a predetermined DNA sequence. Such approaches can be used to integrate recombination sites, selectable markers, functional elements into a defined locus of a target sequence. Similarly several reports of using in vitro heteroduplex formation and repair reactions have been used for inserting genes and other DNA segments into target sequences (Volkov AA et al., Nucl. Acids Res. 1999 Sep 15;27(18):el8). Oligonucleotides defining complete or partial homology flanking a recombination site can thus be used to generate populations of target sequences containing directed, partially directed or random insertions of recombination sites.
Recombination sites for use in the invention may be any recognition sequence on a nucleic acid molecule which participates in a recombination reaction by recombination proteins. In those embodiments of the present invention utilizing more than one recombination site, such recombination sites may be the same or different and may recombine with each other or may not recombine or not substantially recombine with each other. Recombination sites contemplated by the invention also include mutants, derivatives or variants of wild-type or naturally occurring recombination sites. Preferred recombination site modifications include those that enhance recombination, such enhancement selected from the group consisting of substantially (i) favoring integrative recombination; (ii) favoring excisive recombination; (iii) relieving the requirement for host factors; (iv) increasing the efficiency of co-integrate or product formation; and (v) increasing the specificity of co-integrate and/or product formation. Preferred modifications include those that enhance recombination specificity, those that permit the recombination site or portion thereof (or a nucleic acid molecule comprising the recombination site or portion thereof) to act as a primer site for amplification (e.g. , via PCR), those that remove one or more stop codons, and/or those that avoid hairpin formation. Preferred recombination sites used in accordance with the invention include att sites, FRT sites, and lox sites, or mutants, derivatives, fragments, portions and variants thereof (or combinations thereof)- Recombination sites contemplated by the invention also include portions of such recombination sites.
The integration sequences of the invention may comprise one or a number of elements and/or functional sequences and/or sites (or combinations thereof) including one or more sequences which are complementary to one or more sequencing or amplification pnmers of interest (e.g., sequencing primer sites or amplification primer sites), one or more selectable markers (e.g , toxic genes, antibiotic resistance genes, etc.), one or more transcnption or translation sites or signals, one or more transcription or translation termination sites, one or more ongins of replication, one or more recombination sites (or portions thereof), etc.
In one embodiment, the integration sequence may compnse one or more recombination sites (or portions thereof) and one or more selectable markers. Thus, according to the invention, integration sequences may be used to incorporate one or more recombination sites (or portions thereof) or other sites or sequences of interest into any nucleic acid molecule. Integration sequences may be introduced in accordance with the invention by in vivo or in vitro installation. The methods of the invention may utilize one or more integration sequences which may be the same or different. The use of different integration sequences with different functional sites or signals is thus contemplated by the invention.
The present invention also provides a method of inserting an integration sequence into a target nucleic acid sequence compnsing incubating a target sequence of interest flanked by recombination sites with at least one integration sequence under conditions sufficient to cause at least one of said integration sequences to integrate or insert in said target sequence and optionally selecting for said target sequence containing said at least one integration sequences. According to the invention, such target sequences are preferably contained by a vector and preferred integration sequences are one or more transposons. Selection of target sequences containing at least one integration sequence may preferably be accomplished by the use of the recombination sites which flank the target sequence of interest In a preferred aspect, recombinational cloning is used to transfer and select target sequences containing integration sequences. In accordance with the invention, such a method preferably compnses:
(a) transfernng target sequences flanked by recombination sites or portions thereof and containing at least one integration sequence or a portion thereof from a first nucleic acid molecule to a second nucleic acid molecule; and
(b) selecting said second nucleic acid molecule containing said target sequence flanked by recombination sites or portions thereof. In a preferred aspect, the first and/or second nucleic acid molecules are vectors. For example, the selection of said second nucleic acid molecule can be accomplished by using one or more selectable markers contained by the integration sequence and/or the target sequence. One or more selectable markers contained by the second nucleic acid molecule may also be utilized in the selection scheme according to the invention. Alternatively, or in addition, negative selection may also be used to select against second nucleic acid molecules not containing the target sequence of interest. In a preferred aspect, recombinational cloning is used to transfer target sequences containing at least one integration sequence into a vector. Preferably, selectable markers contained by the vector and by the integration sequence are used in combination to select the desired product vector containing the target sequence/integration sequence. In this way, undesired products, for example, vectors containing the target sequence without an inserted integration sequence are selected against.
In a further aspect of the invention, the selected target sequences containing integration sequences are used for further manipulation of the target sequence. In such aspect, the invention allows random insertions of desired sequences by random integration of integration sequences which may be used to manipulate or analyze the target sequence. For example, random insertion in the target sequence of sequencing primer sites contained by the integration sequence allows sequencing of various portions or all of the target sequence. In one aspect, portions of sequence information from the target can be used to determine the entire nucleic acid sequence of the target by analyzing and comparing the sequence overlap of such partial sequences. Alternatively, random insertion in the target sequence of amplification primer sites contained by the integration sequence allows amplification of portions or all of the target sequence, while rando insertion of transcriptional or regulatory sequences contained by the integration sequence allows expression of proteins or polypeptides from vanous portions or all of the target sequence Likewise, random insertion of genes or portions of genes (such as GUS, GST, GFP etc.) allows the creation of a population of gene fusions for the target sequence of interest. Additionally, random insertion of recombination sites (or portions thereof) contained by the integration sequence allows creation of a population of deletion mutants of the target sequence of interest. Optionally, the deleted portion of the target sequence may be cloned Thus, the present invention relates to a method of manipulating or analyzing (e.g., sequencing, amplification, deletion, mutation, expression analysis etc ) all or a portion of the target nucleic acid molecule compnsing:
(a) selecting for target sequences which are flanked by recombination sites or portions thereof and which contain at least one integration sequence or a portion thereof, and (b) manipulating or analyzing (e.g., sequencing, amplifying, mutating, expression analysis, etc ) at least a portion of said target sequence containing said integration sequence. In a preferred aspect, such manipulation or analysis is initiated at or accomplished by one or more sites contained within the integration sequence Sequencing steps, according to the invention, may compnse
(a) mixing a nucleic acid molecule to be sequenced with one or more pnmers, one or more nucleotides and one or more termination agents to form a mixture,
(b) incubating said mixture under conditions sufficient to synthesize a population of molecules complementary to all or a portion of said molecule to be sequenced, and
(c) separating said population to determine the nucleotide sequence of all or a portion of said molecule to be sequenced
More specifically, sequencing methods of the invention may compnse: (a) hybndizing a pnmer to a first nucleic acid molecule; (b) contacting said molecule with one or more nucleotides and one or more terminating agents;
(c) incubating the mixture of step (b) under conditions sufficient to synthesize a population of nucleic acid molecules complementary to all or a portion of said first nucleic acid molecule, wherein said synthesized molecules are shorter in length than said first molecule and said synthesized molecules comprise a terminating agent at their 3' termini; and
(d) separating said synthesized molecules by size so that at least a part of the nucleotide sequence of said first molecule can be determined.
The present invention also provides for a method of making deletions in a nucleic acid molecule of interest comprising contacting the nucleic acid molecule which comprises at least a first recombination site with an integration sequence which comprises at least a second recombination site under conditions such that at least one of said integration sequences is inserted into said nucleic acid molecule, and causing at least said first and said second recombination sites to recombine, thereby resulting in a deletion of at least a portion of said nucleic acid molecule. In some embodiments, the deleted portion of the target nucleic acid molecule may be cloned. In a preferred aspect, a new recombination site will be created at the point of deletion. For example, recombination between an αttP and αttB may create either an αttL or αttR site at the point of deletion. Such new recombination sites may then be used for further manipulation of the target or vector sequence containing such new recombination site(s). In a preferred aspect, the nucleic acid molecule of interest may be a vector which comprises a target sequence. In this aspect, the target sequence and/or vector sequence may comprise said first recombination site and the integration sequence, in some embodiments a transposon, comprises the second recombination site. In this aspect, the target sequence may first be inserted into a vector containing at least a first recombination site. In another aspect, the first and second recombination sites may be incorporated in the target sequence and/or vector by one or more integration sequences. After insertion of the integration sequence(s) into one or more positions within the target sequence, a population of deletion mutants may be made by allowing recombination to occur between recombination sites. Other deletions of different sizes and at different positions may be accomplished by including additional recombination sites at different positions within the target sequence and/or vector of interest. Thus, a third, fourth, and/or fifth recombination site may be inserted at different positions within the target or vector sequence (for example by additional integration sequences containing such different recombination sites). Causing recombination between such sites allows generation of further deletions of the target or vector sequence. For example, deletions may be done in a target or vector sequence sequentially by first causing recombination between the first and second recombination sites to create a first deletion and a new recombination site (e.g., a third recombination site) at the point of deletion, inserting a fourth recombination site in the target or vector sequence (preferably by insertion of an integration sequence containing one or more recombination sites), and causing recombination between said third and fourth recombination sites to create a second deletion and creating a new recombination site (e.g., a fifth recombination site) at the point of deletion. This process may be repeated any number of times to generate any number of deletions in the target and/or vector sequence of interest.
The present invention provides a method for replacing or exchanging sequences in a nucleic acid molecule of interest. The method comprises contacting the nucleic acid molecule which comprises at least a first recombination site with an integration sequence which comprises at least a second recombination site under conditions such that at least one of said integration sequences is inserted into said nucleic acid molecule, and causing replacement of one or more sequences in said molecule which are flanked by said first and said second recombination sites with at least a second nucleic acid molecule flanked by recombination sites. In some embodiments, the target sequence and the second nucleic acid molecule encode peptides, polypeptides or proteins and the recombination event places the encoded peptides, polypeptides or proteins in the same reading frame. Such second molecule may contain one or more genes or portions of genes. In a preferred aspect, the nucleic acid molecule of interest for making such replacement is a vector which compnses a target sequence. In this aspect, the target sequence and/or vector sequence compnses said first recombination site and the integration sequence (preferably a transposon) compnses the second recombination site. In this aspect, the target sequence may first be inserted into a vector containing at least a first recombination site. In another aspect, the first and second recombination sites may be incorporated in the target sequence and/or vector by one or more integration sequences. After insertion of the integration sequence into one or more positions within the target sequence, a population of fusions may be made by allowing a molecule flanked by said first and second recombination sites to be replaced with a population of second nucleic acid molecules flanked by recombination sites.
In another embodiment of the invention, one or more recombination sites may be added to nucleic acid molecules of interest by a method which compnses.
(a) contacting one or more nucleic acid molecules with one or more integration sequences which compnse one or more recombination sites or portions thereof, and
(b) incubating said mixture under conditions sufficient to incorporate said recombination site containing integration sequences into said nucleic acid molecules.
In some preferred embodiments, the one or more nucleic acid molecules are contacted with the one or more integration sequences in vitro.
Once such one or more recombination sites (and/or portions thereof) are incorporated in the nucleic acid molecules of interest, the recombination sites may be used to transfer nucleic acid molecules which are flanked by such recombination sites Thus, according to the invention, random insertion of integration sequences containing recombination sites or portions thereof allows inco oration of a number of recombination sites (or portions thereof) into the molecule of interest. Use of such recombination sites, through recombinational cloning, provides a method for transferring portions of the molecule which are flanked by recombination sites into one or more vectors. For example, one or a number of molecules of interest flanked by a first and second recombination site
(which preferably do not recombine with each other) is mixed with a vector comprising a third and fourth recombination site (which preferably do not recombine with each other) under conditions sufficient to allow the first recombination site to recombine with the third recombination site, and the second recombination site to recombine with the fourth recombination site. The desired product, comprising the vector and the nucleic acid molecule flanked by recombination sites may then be selected in accordance with the invention. In a preferred aspect, a population of molecules may be produced by transferring a number of molecules of interest into one or more vectors. Thus, the invention provides for the construction of a library which may be representative of all or a portion of the starting genetic material. In a preferred aspect, such a library may be prepared from cDNA, genomic or chromosomal genetic material using the invention.
In another aspect, the recombination sites which are incorporated in the nucleic acid molecules of interest may be recombined directly without the need to transfer to a separate nucleic acid molecule or vector. Thus, the molecule flanked by recombination sites can circularize upon recombination of the recombination sites. Preferably, the circular molecule contains a new recombination site at the point of recircularization. Thus, by recombining a first recombination site and a second recombination site located within the nucleic acid molecule of interest, a new circularized molecule can be created which comprises the nucleic acid molecule which was originally flanked by recombination sites. In a preferred aspect, the circularized molecule contains at least one origin of replication so that the molecule may replicate autonomously in a host cell or function as a vector is a host cell. The circularized molecule may also contain one or more selectable markers. In one aspect, one or more origins of replication and/or selectable markers are provided by one or more integration sequences. Thus, upon recombination, the molecule preferably will comprise at least one recombination site, at least one selectable marker, a nucleic acid molecule of interest and an origin of replication. Thus, the invention provides a method by which recombination sites may be used to create one or a population of vectors comprising portions of the original nucleic acid molecule of interest. In this way, the invention allows for efficient preparation of libraries of starting genetic material such as cDNA, genomic or chromosomal DNA. In a related aspect, the invention provides a method by which a linear nucleic acid molecule may be circularized by recombining at least a first and second recombination site within the molecule to be circularized. Preferably, the first and second recombination sites are located at or near the termini of the linear molecule. In a preferred aspect, the recombination sites are incorporated at or near the termini of the linear molecule by ligation of adapters (which comprise at least one recombination site or portion thereof) to one or both termini of the molecule and/or by amplifying the linear molecule with primers which comprise a recombination site or a portion thereof. Alternatively, DNA segments comprising a covalently linked topoisomerase can be used to join linkers (for example, which comprise at least one recombination site or a portion thereof) or other DNA segments to the ends of other linear DNA segments (Shuman, S., J. Biol. Chem. 269:32678 (1994)). In another aspect, a combination of addition of an adapter and amplification with a primer may be used to incorporate recombination sites into the termini of the molecule. In this way, a linear molecule can be created which contains a first recombination site at or near the first terminus of the linear molecule and a second recombination site at or near the second terminus of the linear molecule. In accordance with the invention, recombination of these recombination sites provides a circular molecule. Preferably, the circular molecule contains a new recombination site at the point of recircularization. In a preferred aspect, the circular molecule comprises an ongin of replication and/or at least one selectable marker In one aspect, one or more integration sequences which contain one or more functional sites such as ongins of replication, selectable markers, transcnptional signals, etc. may be integrated into the linear or circulanzed molecule to provide functional sequences to such molecule. In a another aspect, the integration sequences (which are preferably transposons) incorporate an on gin of replication and optionally at least one selectable marker into such linear or circular molecules.
The present invention also relates to kits for carrying out the methods of the invention, and particularly for use in amplifying and sequencing nucleic acid, creating deletions, creating mutations, and inserting recombination sites into a nucleic acid molecule of interest These kits may compnse one or more nucleic acid molecules of the invention such as integration sequences and/or vectors of the invention Such kits may optionally compnse one or more additional components selected from the group consisting of one or more nucleotides, one or more polymerases and or reverse transcnptases, one or more suitable buffers, one or more pnmers and one or more terminating agents (such as one or more di deoxynucleoti des)
The compositions, methods and kits of the invention are preferably prepared and earned out using a phage-lambda site-specific recombination system and most preferably with the GATEWAY™ recombinational cloning technology available from Invitrogen Corporation, Life Technologies Division (Rockville, MD).
Other preferred embodiments of the present invention will be apparent to one of ordinary skill in light of what is known in the art, in light of the following drawings and descnption of the invention, and in light of the claims
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a schematic representation of a recombination reaction of the present invention Figure 2 is a schematic representation of the insertion of a transposon into a target nucleic acid molecule and/or a vector nucleic acid molecule.
Figure 3 is a schematic representation of how the present invention can be used to select for target nucleic acid molecules comprising an insertion sequence by performing a recombinational cloning step after performing a transposition reaction.
Figure 4A is a schematic representation of the cloning of genomic DNA using transposons containing recombination site(s).
Figure 4B is a schematic representation of the cloning of genomic DNA using transposons containing recombination sites that are oriented so as to allow productive and non-productive recombination reactions.
Figure 5 is a schematic representation of a transposon designed to transfer a selectable marker by recombination.
Figure 6 is a schematic representation of the cloning of genomic DNA using a transposon comprising a toxic gene.
Figure 7 is a schematic representation of the cloning of genomic DNA using a transposon comprising an origin of replication and a transposon containing a selectable marker.
Figure 8A is a schematic representation of the construction of subclones using the compositions and methods of the present invention.
Figure 8B is a schematic representation of the replacement of a portion of a target sequence using the compositions and methods of the present invention.
Figure 9 is a schematic representation of the construction of subclones using an insertion sequence containing an origin of replication according to the methods of the present invention.
Figure 10 is a schematic representation of the construction of gene targeting vectors from PCR products using the compositions and methods of the present invention. Figure 11 is a schematic representation of the construction of deletions in a target DNA molecule using the compositions and methods of the present invention.
Figure 12 is a schematic representation of the cloning of a deleted portion of a target molecule using the compositions and methods of the present invention.
Figure 13 is a schematic representation of the generation of populations of nucleic acid molecules attached to a solid substrate using the compositions and methods of the present invention.
In the figures, recombination sites are indicated by RS and the recombination sites are distinguished by numerical subscripts, selectable markers are indicated by SM and a numerical subscript. The reaction product of two compatible recombination sites is designated RS and a subscript indicating the two sites which were recombined.
DETAILED DESCRIPTION OF THE INVENTION
Definitions
In the description that follows, a number of terms used in molecular biology are utilized extensively. In order to provide a clear and consistent understanding of the specification and claims, including the scope to be given such terms, the following definitions are provided.
Amplification: As used herein, amplification is any in vitro method for increasing a number of copies of a nucleotide sequence with the use of one or more polypeptides having polymerase activity (e.g., one or more nucleic acid polymerases or one or more reverse transcriptases). Nucleic acid amplification results in the incorporation of nucleotides into a DNA and/or RNA molecule or primer thereby forming a new nucleic acid molecule complementary to a template. The formed nucleic acid molecule and its template can be used as templates to synthesize additional nucleic acid molecules. As used herein, one amplification reaction may consist of many rounds of nucleic acid replication. DNA amplification reactions include, for example, polymerase chain reaction (PCR). One PCR reaction may consist of 5 to 100 cycles of denaturation and synthesis of a DNA molecule. Gene: As used herein, a gene is a nucleic acid sequence that contains information necessary for expression of a polypeptide or protein. It includes the promoter and the structural gene as well as other sequences involved in expression of the protein.
Host: As used herein, a host is any prokaryotic or eukaryotic organism that is a recipient of a replicable expression vector, cloning vector or any nucleic acid molecule. The nucleic acid molecule may contain, but is not limited to, a structural gene, a transcriptional regulatory sequence (such as a promoter, enhancer, repressor, and the like) and/or an origin of replication (ori). As used herein, the terms "host," "host cell," "recombinant host" and "recombinant host cell" may be used interchangeably. For examples of such hosts, see Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1982).
Hybridization: As used herein, the terms hybridization and hybridizing refer to base pairing of two complementary single-stranded nucleic acid molecules (RNA and/or DNA) to give a double stranded molecule. As used herein, two nucleic acid molecules may be hybridized, although the base pairing is not completely complementary. Accordingly, mismatched bases do not prevent hybridization of two nucleic acid molecules provided that appropriate conditions, well known in the art, are used. In some aspects, hybridization is said to be under "stringent conditions." By "stringent conditions" as used herein is meant overnight incubation at 42 °C in a solution comprising: 50% formamide, 5x SSC ( 150 mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1 x SSC at about 65 °C. . Incorporating: As used herein, incorporating means becoming a part of a nucleic acid (e.g., DNA) molecule or primer.
Insert: As used herein, an insert is a desired nucleic acid segment that is a part of a larger nucleic acid molecule. An insert may be a target nucleic acid molecule in accordance with the invention.
Insert Donor: As used herein, an insert donor is one of the two parental nucleic acid molecules (e.g. RNA or DNA) of the present invention which carries the Insert. The Insert Donor molecule comprises the Insert flanked on both sides with recombination sites. The Insert Donor can be linear or circular. In one embodiment of the invention, the Insert Donor is a circular DNA molecule and further comprises a cloning vector sequence outside of the recombination signals (see Figure 1). When a population of Inserts or population of nucleic acid segments are used to make the Insert Donor, a population of Insert Donors result and may be used in accordance with the invention. Integration sequence: As used herein, an integration sequence is any nucleotide sequence that is capable of inserting randomly into a target nucleic acid molecule. Integration sequences are also known in the art as mobile genetic elements. Any integration sequence known to those of ordinary skill in the art may be used to practice the present invention, including but not limited to transposons (transposable elements), integrating viruses (e.g., retroviruses), IS elements, retrotransposons, conjugative transposons, P elements of Drosophila, bacterial virulence factors, or mobile genetic elements for eukaryotic organisms such as mariner, Tc 1 and Sleeping Beauty. Other mobile genetic elements known to those skilled in the art may also be used in accordance with the present invention.
Library: As used herein, a library is a collection of nucleic acid molecules (circular or linear). In one embodiment, a library may comprise a plurality (i.e., two or more) of nucleic acid molecules, which may or may not be from a common source organism, organ, tissue, or cell. In another embodiment, a library is representative of all or a portion or a significant portion of the nucleic acid content of an organism (a "genomic" library), or a set of nucleic acid molecules representative of all or a portion or a significant portion of the expressed nucleic acid molecules (a cDNA library) in a cell, tissue, organ or organism. In other embodiments, a library may include a target DNA molecule containing insertions at various places within the target. A library may also comprise random sequences made by de novo synthesis, mutagenesis of one or more sequences and the like. Such libraries may or may not be contained in one or more vectors.
Nucleotide: As used herein, a nucleotide is a base-sugar-phosphate combination. Nucleotides are monomeric units of a nucleic acid molecule (DNA and RNA). The term nucleotide includes ribonucleoside triphosphates ATP,
UTP, CTG, GTP and deoxyribonucleoside triphosphates such as dATP, dCTP, dlTP, dUTP, dGTP, dTTP, or derivatives thereof. Such derivatives include, for example, [αSjdATP, 7-deaza-dGTP and 7-deaza-dATP. The term nucleotide as used herein also refers to dideoxyribonucleoside triphosphates (ddNTPs) and their derivatives. Illustrated examples of dideoxyribonucleoside triphosphates include, but are not limited to, ddATP, ddCTP, ddGTP, ddlTP, and ddTTP. According to the present invention, a "nucleotide" may be unlabeled or detectably labeled by well known techniques. Detectable labels include, for example, radioactive isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels and enzyme labels.
Oligonucleotide: As used herein, an oligonucleotide is a synthetic or natural molecule comprising a covalently linked sequence of nucleotides which are joined by a phosphodiester bond between the 3' position of the pentose of one nucleotide and the 5' position of the pentose of the adjacent nucleotide. Primer: As used herein, a primer is a single stranded or double stranded oligonucleotide that is extended by covalent bonding of nucleotide monomers during amplification or polymerization of a nucleic acid molecule (e.g. a DNA molecule). In one aspect, the primer may be a sequencing primer (for example, a universal sequencing primer). In another aspect, the primer may comprise a recombination site or portion thereof. Product: As used herein, a product is one the desired daughter molecules comprising the A and D sequences which is produced after the second recombination event during the recombinational cloning process (see Figure 1). The Product contains the nucleic acid which was to be cloned or subcloned. In accordance with the invention, when a population of Insert Donors are used, the resulting population of Product molecules will contain all or a portion of the population of Inserts of the Insert Donors and preferably will contain a representative population of the original molecules of the Insert Donors.
Promoter: As used herein, a promoter is an example of a transcriptional regulatory sequence, and is specifically a DNA sequence generally described as the 5'-region of a gene located proximal to the start codon. The transcription of an adjacent DNA segment is initiated at the promoter region. A repressible promoter's rate of transcription decreases in response to a repressing agent. An inducible promoter's rate of transcription increases in response to an inducing agent. A constitutive promoter's rate of transcription is not specifically regulated, though it can vary under the influence of general metabolic conditions.
Recognition sequence: As used herein, a recognition sequence is a particular sequence to which a protein, chemical compound, DNA, or RNA molecule (e.g., restriction endonuclease, a modification methylase, or a recombinase) recognizes and binds. In the present invention, a recognition sequence will usually refer to a recombination site. For example, the recognition sequence for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 of Sauer, B., Current Opinion in Biotechnology 5:521-527 (1994). Other examples of recognition sequences are the αttB, αttP, ttL, and αttR sequences which are recognized by the recombinase enzyme 1 Integrase. αttB is an approximately 25 base pair sequence containing two 9 base pair core-type Int binding sites and a 7 base pair overlap region. αttP is an approximately 240 base pair sequence containing core- type Int binding sites and arm-type Int binding sites as well as sites for auxiliary proteins integration host factor (EHF), FIS and excisionase (Xis). See Landy, Current Opinion in Biotechnology 3:699-707 (1993). Such sites may also be engineered according to the present invention to enhance production of products in the methods of the invention. When such engineered sites lack the PI or HI domains to make the recombination reactions irreversible (e.g., αttR or αttP), such sites may be designated αttR' or αttP' to show that the domains of these sites have been modified in some way.
Recombination proteins: As used herein, recombination proteins include excisive or integrative proteins, enzymes, co-factors or associated proteins that are involved in recombination reactions involving one or more recombination sites, which may be wild-type proteins (See Landy, Current Opinion in Biotechnology 3:699-707 (1993)), or mutants, derivatives, fragments, and variants thereof.
Recombination site: A used herein, a recombination site is a recognition sequence on a nucleic acid molecule participating in an integration/recombination reaction by recombination proteins. Recombination sites are discrete sections or segments of nucleic acid on the participating nucleic acid molecules that are recognized and bound by a site-specific recombination protein during the initial stages of integration or recombination. For example, the recombination site for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech. 5:521- 527 (1994). Other examples of recognition sequences include the αttB, αttP, αttL, and αttR sequences described herein, and mutants, fragments, variants and derivatives thereof, which are recognized by the recombination protein 1 Int and by the auxiliary proteins integration host factor (IHF), FIS and excisionase (Xis). See Landy, Curr. Opin. Biotech. 3:699-707 (1993).
Recombinational Cloning: As used herein, recombinational cloning is a method, such as that described in U.S. Patent No. 5,888,732 (the contents of which are fully incorporated herein by reference), whereby segments of nucleic acid molecules or populations of such molecules are exchanged, inserted, replaced, substituted or modified, in vitro or in vivo. Preferably, such cloning method is an in vitro method.
Repression cassette: As used herein, repression cassette is a nucleic acid segment that contains a repressor or a Selectable marker present in the subcloning vector.
Selectable marker: As used herein, selectable marker is a nucleic acid segment that allows one to select for or against a molecule (e.g., a replicon) or a cell that contains it, often under particular conditions. These markers can encode an activity, such as, but not limited to, production of RNA. peptide, or protein, or can provide a binding site for RNA, peptides, proteins, inorganic and organic compounds or compositions and the like. Examples of selectable markers include but are not limited to: (1) DNA segments that encode products which provide resistance against otherwise toxic compounds (e.g., antibiotics); (2) DNA segments that encode products which are otherwise lacking in the recipient cell
(e.g., tRNA genes, auxotrophic markers); (3) DNA segments that encode products which suppress the activity of a gene product; (4) DNA segments that encode products which can be readily identified (e.g., phenotypic markers such as b- galactosidase, green fluorescent protein (GFP), and cell surface proteins); (5) DNA segments that bind products which are otherwise detrimental to cell survival and/or function; (6) DNA segments that otherwise inhibit the activity of any of the DNA segments described in Nos. 1-5 above (e.g., antisense oligonucleotides); (7) DNA segments that bind products that modify a substrate (e.g. restriction endonucleases); (8) DNA segments that can be used to isolate or identify a desired molecule (e.g. specific protein binding sites); (9) DNA segments that encode a specific nucleotide sequence which can be otherwise nonfunctional (e.g., for PCR amplification of subpopulations of molecules); (10) DNA segments, which when absent, directly or indirectly confer resistance or sensitivity to particular compounds; and/or (11) DNA segments that encode products which are toxic in recipient cells. Selection scheme: As used herein, selection scheme is any method which allows selection, enrichment, or identification of a desired product(s) or molecule(s) from a mixture. In some preferred embodiments, the selection scheme results in selection of or enrichment for only one or more desired products or molecules. As defined herein, selecting for a DNA molecule includes
(a) selecting or enriching for the presence of the desired DNA molecule, and (b) selecting or enriching against the presence of DNA molecules that are not the desired DNA molecule.
In one embodiment, the selection schemes (which can be carried out in reverse) may take one of three forms, which will be discussed in terms of
Figure 1. The first, exemplified herein with a selectable marker and a repressor therefor, selects for molecules having segment D and lacking segment C. The second selects against molecules having segment C and for molecules having segment D. Possible embodiments of the second form would have a DNA segment carrying a gene toxic to cells into which the in vitro reaction products are to be introduced. A toxic gene can be a DNA that is expressed as a toxic gene product (a toxic protein or RNA), or can be toxic in and of itself. (In the latter case, the toxic gene is understood to carry its classical definition of "heritable trait".) Examples of such toxic gene products are well known in the art, and include, but are not limited to, restriction endonucleases (e.g., Dpnϊ), thymidine kinase (TK) genes, apoptosis-related genes (e.g. ASK1 or members of the bcl-2/ced-9 family), retroviral genes including those of the human immunodeficiency virus (HIV), defensins such as NP-1, inverted repeats or paired palindromic DNA sequences, bacteriophage lytic genes such as those from fX174 or bacteriophage T4; antibiotic sensitivity genes such as rpsL, antimicrobial sensitivity genes such as pheS, plasmid killer genes, eukaryotic transcriptional vector genes that produce a gene product toxic to host cells, such as GATA-1, and genes that kill hosts in the absence of a suppressing function, e.g., kicB, ccdB, fX174 E (Liu, Q. et al, Curr. Biol. 8: 1300-1309 (1998)), and other genes that negatively affect replicon stability and/or replication. A toxic gene can alternatively be selectable in vitro, e.g., a restriction site.
Many genes coding for restriction endonucleases operably linked to inducible promoters are known, and may be used in the present invention. See, e.g. U.S. Patent Nos. 4,960,707 (Dpnl and Dpnll); 5,000,333, 5,082,784 and
5,192,675 (Kpnl); 5,147,800 (NgoAIH andNgoAI); 5,179,015 (FspIandHaelH): 5,200,333 (Haeϋ andTaql); 5,248,605 (HpaH); 5,312,746 (Clal); 5,231,021 and 5,304,480 (Xhol and XhoH); 5,334,526 (Alul); 5,470,740 (Nsil); 5,534,428 (Sstl/Sacl); 5,202,248 (Ncol); 5,139,942 (Ndel); and 5,098,839 (Pad). See also Wilson, G.G., Nucl. Acids Res. 19:2539-2566 (1991); and Lunnen, K.D., et al,
Gene 74:25-32 (1988).
In the second form, segment D carries a selectable marker. The toxic gene would eliminate transformants harboring the Vector Donor, Cointegrate, and Byproduct molecules, while the selectable marker can be used to select for cells containing the Product and against cells harboring only the Insert Donor.
The third form selects for cells that have both segments A and D in cis on the same molecule, but not for cells that have both segments in trans on different molecules. This could be embodied by a selectable marker that is split into two inactive fragments, one each on segments A and D. The fragments are so arranged relative to the recombination sites that when the segments are brought together by the recombination event, they reconstitute a functional selectable marker. For example, the recombinational event can link a promoter with a structural nucleic acid molecule (e.g., a gene), can link two fragments of a structural nucleic acid molecule, or can link nucleic acid molecules that encode a heterodimeric gene product needed for survival, or can link portions of a replicon.
Site-specific recombinase: As used herein, a site specific recombinase is a type of recombinase which typically has at least the following four activities (or combinations thereof): (1) recognition of one or two specific nucleic acid sequences; (2) cleavage of said sequence or sequences; (3) topoisomerase activity involved in strand exchange; and (4) ligase activity to reseal the cleaved strands of nucleic acid. See Sauer, B., Current Opinions in Biotechnology 5:521-527 (1994). Conservative site-specific recombination is distinguished from homologous recombination and transposition by a high degree of specificity for both partners. The strand exchange mechanism involves the cleavage and rejoining of specific DNA sequences in the absence of DNA synthesis (Landy, A. (1989) Ann. Rev. Biochem. 58:913-949).
Structural gene: As used herein, a structural gene refers to a nucleic acid sequence that is transcribed into messenger RNA that is then translated into a sequence of amino acids characteristic of a specific polypeptide.
Subcloning vector: As used herein, a subcloning vector is a cloning vector comprising a circular or linear nucleic acid molecule which includes preferably an appropriate replicon. In the present invention, the subcloning vector (segment D in Figure 1) can also contain functional and/or regulatory elements that are desired to be incorporated into the final product to act upon or with the cloned DNA Insert (segment A in Figure 1). The subcloning vector can also contain a selectable marker.
Target nucleic acid molecule: As used herein, target nucleic acid molecule is a nucleic acid segment of interest (preferably DNA) which is to be acted upon using the present invention.
Template: As used herein, a template is a double stranded or single stranded nucleic acid molecule which is to be amplified, synthesized or sequenced. In the case of a double-stranded DNA molecule, denaturation of its strands to form a first and a second strand is preferably performed before these molecules may be amplified, synthesized or sequenced, or the double stranded molecule may be used directly as a template. For single stranded templates, a primer complementary to at least a portion of the template is hybridized under appropriate conditions and one or more polypeptides having polymerase activity (e.g. DNA polymerases and/or reverse transcriptases) may then synthesize a molecule complementary to all or a portion of the template. Alternatively, for double stranded templates, one or more transcnptional regulatory sequences (e.g , one or more promoters) may be used in combination with one or more polymerases to make nucleic acid molecules complementary to all or a portion of the template. The newly synthesized molecule, according to the invention, may be of equal or shorter length compared to the onginal template. Mismatch incorporation or strand slippage dunng the synthesis or extension of the newly synthesized molecule may result in one or a number of mismatched base pairs. Thus, the synthesized molecule need not be exactly complementary to the template. Additionally, a population of nucleic acid templates may be used dunng synthesis or amplification to produce a population of nucleic acid molecules typically representative of the onginal template population.
Transcnptional regulatory sequence As used herein, transcnptional regulatory sequence is a functional stretch of nucleotides contained on a nucleic acid molecule, in any configuration or geometry, that acts to regulate the transcnption of one or more structural genes into messenger RNA. Examples of transcnptional regulatory sequences include, but are not limited to, promoters, enhancers, repressors, and the like. "Transcnption regulatory sequence",
"transcnption sites" and "transcnption signals" may be used interchangeably.
Vector: As used herein, a vector is a nucleic acid molecule (preferably DNA) that provides a useful biological or biochemical property to an Insert.
Examples include plasmids, phages, autonomously replicating sequences (ARS), centromeres, and other sequences which are able to replicate or be replicated in vitro or in a host cell, or to convey a desired nucleic acid segment to a desired location within a host cell. A vector can have one or more restnction endonuclease recognition sites at which the sequences can be cut in a determinable fashion without loss of an essential biological function of the vector, and into which a nucleic acid fragment can be spliced in order to bnng about its replication and cloning. Vectors can further pro\ ide pnmer sites, e.g , for PCR, transcnptional and/or translational initiation and/or regulation sites, recombinational signals, rep cons, selectable markers, etc Clearly, methods of inserting a desired nucleic acid fragment which do not require the use of homologous recombination, transpositions or restriction enzymes (such as, but not limited to, UDG cloning of PCR fragments (U.S. Patent No. 5,334,575, entirely incorporated herein by reference), T: A cloning, and the like) can also be applied to clone a fragment into a cloning vector to be used according to the present invention. The cloning vector can further contain one or more selectable markers suitable for use in the identification of cells transformed with the cloning vector.
Vector Donor: As used herein, a Vector Donor is one of the two parental nucleic acid molecules (e.g., RNA or DNA) of the present invention which carries the segments comprising the vector which is to become part of the desired Product. The Vector Donor comprises a subcloning vector D (or it can be called the cloning vector if the Insert Donor does not already contain a cloning vector) and a segment C flanked by recombination sites (see Figure 1). Segments C and/or D can contain elements that contribute to selection for the desired Product daughter molecule, as described above for selection schemes. The recombination signals can be the same or different, and can be acted upon by the same or different recombinases. In addition, the Vector Donor can be linear or circular.
Other terms used in the fields of recombinant DNA technology and molecular and cell biology as used herein will be generally understood by one of ordinary skill in the applicable arts.
Overview
The present invention relates to the construction of nucleic acid molecules (RNA or DNA) by inserting at least one integration sequence (e.g., a transposon) into a target nucleic acid molecule and subsequently transferring the modified target nucleic acid molecule to a vector using recombinational cloning. In accordance with the invention, recombinational cloning allows efficient selection and identification of molecules (particularly vectors) containing the target sequence comprising all or a portion of the integration sequence. Thus, sites or sequences of interest (contained by the integration sequence) can be inserted within the target sequence which allows for further manipulation of the target nucleic acid molecule. Integration sequences of the invention to be introduced into the target nucleic acid molecules may comprise any number or combinations of functional sequences such as primer sites (e.g., sequences for which a primer such as a sequencing primer or amplification primer may hybridize to initiate nucleic acid synthesis, amplification or sequencing), transcription or translation signals or regulatory sequences such as promoters, ribosomal binding sites, translation effecting sequences such as Kozak and Shine-Delgarno sequences, start codons, origins of replication, termination signals such as stop codons, recombination sites (or portions thereof), selectable markers, and genes or portions of genes to create protein fusion (e.g., N-terminal or carboxy terminal) such as GST, GUS, GFP, and combinations thereof. After insertion of such sequences of interest, the molecules may be manipulated in a variety of ways including sequencing or amplification of all or a portion of the target sequence
(i.e., by using at least one or the primer sites introduced by the integration sequence), mutation of the target sequence (i.e., by insertion, deletion or substitution of target sequences), and protein expression from the target sequence or portions thereof (i.e., by insertion of translation and/or transcription signals). The present invention also relates to cloning nucleic acid molecules (e.g. , genomic DNA or cDNA) by inserting recombination site -containing integration sequences into the molecule(s) and performing recombinational cloning or causing recombination of the inserted recombination sites. Thus, one or more integration sequences comprising at least one recombination site may be inserted within the molecule of interest to allow recombinational cloning or cloning of such molecules or portions thereof. In this aspect, the integration sequences may also comprise other functional sequences of interest (such as primer sites, transcription and translation signals, termination signals, selectable markers, origins of replication, etc. noted above) to allow further manipulation of the molecule obtained by this method of the invention. Recombination sites for use in the invention may be any recognition sequence which participates in a recombination reaction. Such recombination sites may be the same or different and may be wild-type or naturally occurring recombination sites or modified or mutant recombination sites. Examples of recombination sites for use in the invention include, but are not limited to, phage- lambda recombination sites (such as αttP, αttB, αttL, and αttR and mutants or derivatives thereof) and recombination sites from other bacteriophage such as PI, phi80, P22, P2, 186, P4 and PI (including lox sites such as loxP and /α*P511). Corresponding recombination proteins for these systems may be used in accordance with the invention with the indicated recombination sites. Other systems providing recombination sites and recombination proteins for use in the invention include the FLP/FRT system from Saccharomyces cerevisiae, the resolvase family (e.g., gd, Tn3 resolvase, Hin, Gin and Cin), and IS231 and other Bacillus thuringiensis transposable elements. Preferred recombination proteins and mutant or modified recombination sites for use in the invention include those described in U.S. Patent No. 5,888,732, co-pending U.S. Application No. 09/438,358 (filed November 12, 1991) and co-pending U.S. Application No. 09/517,466 (filed March 2, 2000), as well as those associated with the GATEWAY™ Cloning Technology available from Invitrogen Corporation, Life Technologies Division (Rockville, MD).
Integration Sequences
Any integration sequence known to those skilled in the art may be used to practice the present invention. Integration sequences are also known in the art as mobile genetic elements. In some preferred embodiments, the integration sequence may be a transposon (transposable element). Any transposon sequence known to those skilled in the art may be suitable for use in the present invention. In some preferred embodiments, the transposons suitable for use in the present invention include, but are not limited to, TnJ family transposons, TnJ, TnA, gd, TnlOOO, Tn5, Tni 721, Tn7, Tn9, Tn70 and derivatives and mutants thereof. In other preferred embodiments, the integration sequence may be an integrating virus. In some preferred embodiments, the integrating virus may be a lambdoid phage. Lambdoid phages are seen to include, but are not limited to, coliphages such as 1, 21, 434, f80 and HK022 as well as Salmonella phages such as P22. In other preferred embodiments, the integrating virus may be a phage not related to 1, such as Mu-1, P2 and P4. Other integrating viruses known to those skilled in the art may be used in the practice of the present invention.
In additional preferred embodiments, the integration sequence may be an IS element such as ISI, IS2, IS4, IS5, and derivatives and mutants thereof. In other embodiments the integration sequence may be a retrovirus, retrotransposons, conjugative transposons, P elements of Drosophila, bacterial virulence factors, or mobile genetic elements for eukaryotic organisms such as mariner, Tel and Sleeping Beauty. Other mobile genetic elements known to those skilled in the art may also be used in accordance with the present invention.
Origins of Replication
An origin of replication (ori) is a nucleotide sequence in a nucleic acid molecule at which replication of the nucleic acid molecule is initiated. As used herein, the phrase origin of replication is seen to include the definable origin of replication as well as one or more adjoining controlling elements necessary for the replication of the nucleic acid molecule. This combination of definable starting point of DNA synthesis during replication and the adjacent controlling element or elements may also be termed a replicon. Replicons suitable for use in the present invention include, but are not limited to, the pMBl replicon, the pl5A replicon, the pSClOl replicon, the ColEl replicon, the R6K replicon, the
F replicon, the PI replicon, the Rtsl replicon, the pColV-K30 replicon, the ldv replicon, the pIP522 replicon, theRl 162/RSF1010 replicon, the RK2 replicon, the pSa replicon and the RA1 replicon. The replicons suitable for the practice of the present invention are not limited to those replicons functional in E. coli. Replicons functional in other organisms include, but are not limited to, the PS 10 replicon, the pCTTI replicon, the pWV02 replicon, the pF3A replicon and the plP404 replicon. Replicons suitable for use in eukaryotic cells, including but not limited to insect cells, yeast cells, mammalian cells, amphibian cells or any of the host cells described below may be used in conjunction with the present invention.
Host Cells
The invention also relates to host cells comprising one or more of the nucleic acid molecules or vectors of the invention, particularly those nucleic acid molecules and vectors described in detail herein. Representative host cells that may be used according to this aspect of the invention include, but are not limited to, bacterial cells, yeast cells, insect cells, plant cells and animal cells. Preferred bacterial host cells include Escherichia spp. cells (particularly E. coli cells and most particularly E. coli strains DH10B, Stbl2, DH5a, DB3, DB3.1 (preferably E. coli LIBRARY EFFICIENCY® DB3.1™ Competent Cells; Invitrogen Corporation, Life Technologies Division, Rockville, MD), DB4 and DB5 (see
U.S. Application No. 518,188, filed on March 2, 2000, the disclosure of which is incorporated by reference herein in its entirety), E. coli W strains such as those described in United States provisional patent application 60/139,889 filed June 22, 1999, Bacillus spp. cells (particularly B. subtilis and B. megaterium cells), Streptomyces spp. cells, Erwinia spp. cells, Klebsiella spp. cells, Serratia spp. cells (particularly S. marcessans cells), Pseudomonas spp. cells (particularly P. aeruginosa cells), and Salmonella spp. cells (particularly S. typhimurium and S. typhi cells). Preferred animal host cells include insect cells (most particularly Drosophila melanogaster cells, Spodoptera frugiperda Sf9 and Sf21 cells and Trichoplusa High-Five cells), nematode cells (particularly C. elegans cells), avian cells, amphibian cells (particularly Xenopus laevis cells), reptilian cells, and mammalian cells (most particularly CHO, COS, VERO, BHK and human cells). Preferred yeast host cells include Saccharomyces cerevisiae cells and Pichia pastoris cells. These and other suitable host cells are available commercially, for example from Invitrogen Corporation, Life Technologies Division (Rockville, Maryland), Amencan Type Culture Collection (Manassas, Virginia), and Agncultural Research Culture Collection (NRRL, Peona, Illinois)
Methods for introducing the nucleic acid molecules and/or vectors of the invention into the host cells descnbed herein, to produce host cells compnsing one or more of the nucleic acid molecules and/or vectors of the invention, will be familiar to those of ordinary skill m the art. For instance, the nucleic acid molecules and/or vectors of the invention may be introduced into host cells using well known techniques of infection, transduction, transfection, and transformation. The nucleic acid molecules and/or vectors of the invention may be introduced alone or in conjunction with other the nucleic acid molecules and/or vectors. Alternatively, the nucleic acid molecules and/or vectors of the invention may be introduced into host cells as a precipitate, such as a calcium phosphate precipitate, or in a complex with a lipid Electroporation also may be used to introduce the nucleic acid molecules and/or vectors of the invention into a host. Likewise, such molecules may be introduced into chemically competent cells. In some preferred embodiments, the chemically competent cells are E coli cells, particularly E coli W cells If the vector is a virus, it may be packaged in vitro or introduced into a packaging cell and the packaged virus may be transduced into cells. Hence, a wide vanety of techniques suitable for introducing the nucleic acid molecules and/or vectors of the invention into cells in accordance with this aspect of the invention are well known and routine to those of skill in the art. Such techniques are reviewed at length, for example, in Sambrook, J., et al, Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spnng Harbor, NY: Cold Spnng Harbor Laboratory Press, pp. 16.30-16.55 (1989), Watson, J.D., et al, Recombinant DNA, 2nd Ed , New York- W.H.
Freeman and Co., pp. 213-234 (1992), and Winnacker, E -L , From Genes to Clones, New York: VCH Publishers (1987), which are illustrative of the many laboratory manuals that detail these techniques and which are incorporated by reference herein in their entireties for their relevant disclosures Polymerases
Polymerases for use in the invention include but are not limited to polymerases (DNA and RNA polymerases), and reverse transcriptases. DNA polymerases include, but are not limited to, Thermus thermophilus (Tth) DNA polymerase, Thermus aquaticus (Taq) DNA polymerase, Thermotoga neopolitana (Tne) DNA polymerase, Thermotoga maritima (Tma) DNA polymerase, Thermococcus litoralis (Tli or VENT™) DNA polymerase, Pyrococcus furiosus (Pfu) DNA polymerase, DEEPVENT™ DNA polymerase, Pyrococcus woosii (Pwo) DNA polymerase, Pyrococcus sp KOD2 (KOD) DNA polymerase, Bacillus sterothermophilus (Bst) DNA polymerase, Bacillus caldophilus (Bca) DNA polymerase, Sulfolobus (Sac) DNA polymerase, Thermoplasma acidophilum (7αc)DNA polymerase, Thermus flavus (Tfl/Tub) DNA polymerase, Thermus ruber (Tru) DNA polymerase, Thermus brockianus (DYNAZYME™) DNA polymerase, Methanobacterium thermoautotrophicum (Mth) DNA polymerase, mycobacterium DNA polymerase
(Mtb, Mlep), E. coli pol I DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, and generally pol I type DNA polymerases and mutants, variants and derivatives thereof. RNA polymerases such as T3, T5 and SP6 and mutants, variants and derivatives thereof may also be used in accordance with the invention.
The nucleic acid polymerases used in the present invention may be mesophilic or thermophilic, and are preferably thermophilic. Preferred mesophilic DNA polymerases include Pol I family of DNA polymerases (and their respective Klenow fragments) any of which may be isolated from organism such as E. coli, H. influenzae, D. radiodurans, H. pylori, C. aurantiacus, R. prowazekii, T.pallidum, Synechocystis sp., B. subtilis, L. lactis, S. pneumoniae, M. tuberculosis, M. leprae, M. smegmatis, Bacteriophage L5, phi-C31 , T7, T3, T5, SP01, SP02, mitochondrial from 5. cerevisiae MIP-1, and eukaryotic C. elegans, and D. melanogaster (Astatke, M. et al., 1998, J. Mol Biol. 278, 147-165), pol III type DNA polymerase isolated for any sources, and mutants, denvatives or vanants thereof, and the like. Preferred thermostable DNA polymerases that may be used in the methods and compositions of the invention include Taq, Tne, Tma, Pfu, KOD, Tfl, Tth, Stoffel fragment, VENT™ and DEEPVENT™ DNA polymerases, and mutants, vanants and denvatives thereof (U.S. Patent No. 5,436,149; U.S. Patent 4,889,818; U.S. Patent 4,965,188; U.S.
Patent 5,079,352; U.S. Patent 5,614,365; U.S. Patent 5,374,553; U.S. Patent 5,270,179; U.S. Patent 5,047,342; U.S. Patent No.5,512,462; WO 92/06188; WO 92/06200; WO 96/10640; WO 97/09451 ; Barnes, W.M., Gene 112:29-35 (1992); Lawyer, F.C., et al., PCR Meth. Appl. 2:275-287 (1993); Flaman, J.-M, et al., Nucl. Acids Res. 22(15):3259-3260 (1994))
Reverse transcnptases for use in this invention include any enzyme having reverse transcnptase activity. Such enzymes include, but are not limited to, retro viral reverse transcnptase, retrotransposon reverse transcnptase, hepatitis B reverse transcnptase, cauliflower mosaic virus reverse transcnptase, bactenal reverse transcnptase, Tth DNA polymerase, Taq DNA polymerase (Saiki, R.K., et al., Science 239:487-491 (1988); U.S. Patent Nos. 4,889,818 and 4,965,188), Tne DNA polymerase (WO 96/10640 and WO 97/09451), Tma DNA polymerase (U. S. Patent No. 5,374,553) and mutants, vanants or denvatives thereof (see, e.g., WO 97/09451 and WO 98/47912). Preferred enzymes for use in the invention include those that have reduced, substantially reduced or eliminated
RNase H activity. By an enzyme "substantially reduced in RNase H activity" is meant that the enzyme has less than about 20%, more preferably less than about 15%, 10% or 5%, and most preferably less than about 2%, of the RNase H activity of the corresponding wildtype or RNase H+ enzyme such as wildtype Moloney Munne Leukemia Virus (M-MLV), Avian Myeloblastosis Virus (AMV) or Rous Sarcoma Virus (RSV) reverse transcnptases. The RNase H activity of any enzyme may be determined by a vanety of assays, such as those described, for example, in U.S. Patent No. 5,244,797, in Kotewicz, M L., et al., Nucl. Acids Res. 16:265 (1988) and in Gerard, G.F., et al., FOCUS 14(5):91 (1992), the disclosures of all of which are fully incorporated herein by reference. Particularly preferred polypeptides for use in the invention include, but are not limited to, M-MLV H" reverse transcriptase, RS V H" reverse transcriptase, AMV H" reverse transcriptase, RAV (rous-associated virus) H" reverse transcriptase, MAV (myeloblastosis-associated virus) H" reverse transcriptase and HIV H" reverse transcriptase. (See U.S. Patent No. 5,244,797 and WO 98/47912). It will be understood by one of ordinary skill, however, that any enzyme capable of producing a DNA molecule from a ribonucleic acid molecule (i.e., having reverse transcriptase activity) may be equivalently used in the compositions, methods and kits of the invention. The enzymes having polymerase activity for use in the invention may be obtained commercially, for example from Invitrogen Corporation, Life Technologies Division (Rockville, Maryland), Perkin-Elmer (Branchburg, New Jersey), New England BioLabs (Beverly, Massachusetts) or Boehringer Mannheim Biochemicals (Indianapolis, Indiana). Enzymes having reverse transcriptase activity for use in the invention may be obtained commercially, for example from Invitrogen Corporation, Life Technologies Division (Rockville, Maryland), Pharmacia (Piscataway, New Jersey), Sigma (Saint Louis, Missouri) or Boehringer Mannheim Biochemicals (Indianapolis, Indiana). Alternatively, polymerases or reverse transcriptases having polymerase acti vity may be isolated from their natural viral or bacterial sources according to standard procedures for isolating and purifying natural proteins that are well-known to one of ordinary skill in the art (see, e.g., Houts, G.E., et al., J. Virol. 29:517 (1979)). In addition, such polymerases/reverse transcriptases may be prepared by recombinant DNA techniques that are familiar to one of ordinary skill in the art (see, e.g., Kotewicz, M.L., et al., Nucl. Acids Res. 16:265 (1988); U.S. Patent No. 5,244,797; WO
98/47912; Soltis, D.A., and Skalka, A.M., Proc. Natl. Acad. Sci. USA 85:3372-3376 (1988)). Examples of enzymes having polymerase activity and reverse transcriptase activity may include any of those described in the present application. Methods of Nucleic Acid Synthesis, Amplification and Sequencing
The present invention may be used in combination with any method involving the synthesis of nucleic acid molecules, such as DNA (including cDNA) and RNA molecules. Such methods include, but are not limited to, nucleic acid synthesis methods, nucleic acid amplification methods and nucleic acid sequencing methods.
Nucleic acid synthesis methods according to this aspect of the invention may comprise one or more steps. For example, the invention provides a method for synthesizing a nucleic acid molecule comprising (a) mixing a nucleic acid template (e.g., a target molecule comprising an integration sequence) with one or more primers and one or more enzymes having polymerase or reverse transcriptase activity to form a mixture; and (b) incubating the mixture under conditions sufficient to make a first nucleic acid molecule complementary to all or a portion of the template. According to this aspect of the invention, the nucleic acid template may be a DNA molecule such as a cDNA molecule or library, or an RNA molecule such as a mRNA molecule. Conditions sufficient to allow synthesis such as pH, temperature, ionic strength, and incubation times may be optimized by those skilled in the art.
In accordance with the invention, the target or template nucleic acid molecules or libraries may be prepared from nucleic acid molecules obtained from natural sources, such as a variety of cells, tissues, organs or organisms. Cells that may be used as sources of nucleic acid molecules may be prokaryotic (bacterial cells, including those of species of the genera Escherichia, Bacillus, Serratia, Salmonella, Staphylococcus, Streptococcus, Clostridium, Chlamydia, Neisseria, Treponema, Mycoplasma, Borrelia, Legionella, Pseudomonas,
Mycobacterium, Helicobacter, Erwinia, Agrobacterium, Rhizobium, and Streptomyces) or eukaryotic (including fungi (especially yeast's), plants, protozoans and other parasites, and animals including insects (particularly Drosophila spp. cells), nematodes (particularly Caenorhabditis elegans cells), and mammals (particularly human cells)). Of course, other techniques of nucleic acid synthesis which may be advantageously used will be readily apparent to one of ordinary skill in the art. In other aspects of the invention, the invention may be used in combination with methods for amplifying or sequencing nucleic acid molecules. Nucleic acid amplification methods according to this aspect of the invention may include the use of one or more polypeptides having reverse transcriptase activity, in methods generally known in the art as one-step (e.g., one-step RT-PCR) or two-step (e.g., two-step RT-PCR) reverse transcrϊptase-amplification reactions. For amplification of long nucleic acid molecules (i.e., greater than about 3-5 Kb in length), a combination of DNA polymerases may be used, as described in WO
98/06736 and WO 95/16028.
Amplification methods according to the invention may comprise one or more steps. For example, the invention provides a method for amplifying a nucleic acid molecule comprising (a) mixing one or more enzymes with polymerase activity with one or more nucleic acid templates (e.g., a target molecule comprising an integration sequence); and (b) incubating the mixture under conditions sufficient to allow the enzyme with polymerase activity to amplify one or more nucleic acid molecules complementary" to all or a portion of the templates. The invention also provides nucleic acid molecules amplified by such methods.
General methods for amplification and analysis of nucleic acid molecules or fragments are well-known to one of ordinary skill in the art (see, e.g., U.S. Pat. Nos. 4,683,195; 4,683,202; and 4,800,159; Innis, M.A.. et al., eds., PCR Protocols: A Guide to Methods and Applications, San Diego, California: Academic Press, Inc. (1990); Griffin, H.G., and Griffin, A.M., eds., PCR
Technology: Current Innovations, Boca Raton, Florida: CRC Press (1994)). For example, amplification methods which may be used in accordance with the present invention include PCR (U.S. Patent Nos. 4,683.195 and 4,683,202), Strand Displacement Amplification (SDA; U.S. Patent No. 5,455,166; EP 0 684 315), and Nucleic Acid Sequence-Based Amplification (NASB A; U.S. Patent No. 5,409,818; EP 0 329 822).
Typically, these amplification methods comprise: (a) mixing one or more enzymes with polymerase activity with the nucleic acid sample in the presence of one or more primer sequences, and (b) amplifying the nucleic acid sample to generate a collection of amplified nucleic acid fragments, preferably by PCR or equivalent automated amplification technique.
Following amplification or synthesis by the methods of the present invention, the amplified or synthesized nucleic acid fragments may be isolated for further use or characterization. This step is usually accomplished by separation of the amplified or synthesized nucleic acid fragments by size or by any physical or biochemical means including gel electrophoresis, capillary electrophoresis, chromatography (including sizing, affinity and immunochromatography), density gradient centrifugation and immunoadsorption. Separation of nucleic acid fragments by gel electrophoresis is particularly preferred, as it provides a rapid and highly reproducible means of sensitive separation of a multitude of nucleic acid fragments, and permits direct, simultaneous comparison of the fragments in several samples of nucleic acids. One can extend this approach, in another preferred embodiment, to isolate and characterize these fragments or any nucleic acid fragment amplified or synthesized by the methods of the invention. Thus, the invention is also directed to isolated nucleic acid molecules produced by the amplification or synthesis methods of the invention.
In this embodiment, one or more of the amplified or synthesized nucleic acid fragments are removed from the gel which was used for identification (see above), according to standard techniques such as electroelution or physical excision. The isolated unique nucleic acid fragments may then be inserted into standard vectors, including expression vectors, suitable for transfection or transformation of a variety of prokaryotic (bacterial) or eukaryotic (yeast, plant or animal including human and other mammalian) cells. Alternatively, nucleic acid molecules produced by the methods of the invention may be further charactenzed, for example by sequencing (1 e , determining the nucleotide sequence of the nucleic acid fragments), by methods descnbed below and others that are standard in the art (see, e g., U.S Patent Nos 4,962,022 and 5,498,523, which are directed to methods of DNA sequencing) Nucleic acid sequencing methods according to the invention may compnse one or more steps. For example, the invention may be combined with a method for sequencing a nucleic acid molecule compnsing (a) mixing an enzyme with polymerase activity with a nucleic acid molecule to be sequenced, one or more pnmers, one or more nucleotides, and one or more terminating agents (such as a dideoxynucleotides) to form a mixture, (b) incubating the mixture under conditions sufficient to synthesize a population of molecules complementary to all or a portion of the molecule to be sequenced; and (c) separating the population to determine the nucleotide sequence of all or a portion of the molecule to be sequenced Nucleic acid sequencing techniques which may be employed include dideoxy sequencing methods such as those disclosed in U S. Patent Nos 4.962,022 and 5,498,523.
Kits In another aspect, the invention provides kits which may be used in conjunction with the invention Kits according to this aspect of the invention may compnse one or more containers, which may contain one or more components selected from the group consisting of one or more nucleic acid molecules or vectors of the invention, one or more polymerases, one or more reverse transcriptases, one or more insertion-catalyzing enzymes, one or more recombination proteins (or other enzymes for carrying out the methods of the invention), one or more buffers, one or more detergents, one or more restnction endonucleases, one or more nucleotides, one or more terminating agents (e.g , ddNTPs), one or more transfection reagents, pyrophosphatase, and the like. The kits of the invention may also comprise instructions for carrying out methods of the invention.
It will be understood by one of ordinary skill in the relevant arts that other suitable modifications and adaptations to the methods and applications described herein are readily apparent from the description of the invention contained herein in view of information known to the ordinarily skilled artisan, and may be made without departing from the scope of the invention or any embodiment thereof. Having now described the present invention in detail, the same will be more clearly understood by reference to the following examples, which are included herewith for purposes of illustration only and are not intended to be limiting of the invention.
Examples
Example 1: Construction of a Transposon-containing Target DNA Molecule
A target molecule is cloned into a first vector suitable for recombinational cloning as described according to the methods and procedures of the GATEWAY™ Cloning System (see U.S. Patent No. 5,888,732, U.S. Patent Appl. Nos. 09/438,358 and 09/517,466, and the instruction manual entitled GATEWAY™ Cloning Technology (Versions 1 and 2), all of which are incorporated by reference herein in their entireties). Briefly, the target DNA molecule is inserted into an appropriate vector such that the target molecule is flanked by recombination sites. In some embodiments, the recombination sites are not capable of recombining with each other. The target-containing first vector is contacted with a solution containing an integration sequence such as a transposon, the appropriate cofactors such as buffer salts, ions and the like and an enzyme that catalyzes the insertion of the integration sequence into the target DNA molecule. Alternatively, the transposon could be inserted into the target DNA in an in vivo reaction such as the conjugal transfer of a plasmid to insert a gd based transposon described by Strathmann, et al. (Proceedings of the National Academy of Sciences, USA, 88: 1247-1250, 1991, specifically incorporated herein by reference). Although the present examples will be directed to in vitro insertion of a transposon into the target DNA, those skilled in the art will appreciate that a corresponding reaction could be carried out in vitro using methods known to those skilled in the art. Such corresponding methods are deemed to be within the scope of the present invention. The DNA sequence of the transposon will include terminal sequences that serve as substrates for the insertion-catalyzing enzyme and the enzyme will catalyze the insertion of the transposon into the target DNA molecule. As discussed above, the insertion-catalyzing enzyme will also catalyze the insertion of the transposon into the vector as well. The result of the transposition reaction will be a population of molecules having transposons inserted in various places in the vector and the target DNA as shown in Figure 2. The target DNA sequence is flanked by two recombination sites (RS, and RS2). The integration sequence is shown as comprising a selectable marker (SM2) and a primer binding sequence at each end. Those skilled in the art will appreciate that modifications of these features and inclusion of additional features are within the scope of the present invention. As the insertion reaction is random, the integration sequence can insert into both the target and the vector as shown.
Transposons suitable for use in the present invention may comprise one or more selectable markers. In some embodiments, the transposons of the present invention may comprise a toxic gene. The toxic gene may be a suicide gene, i. e. be lethal to susceptible organisms whenever the gene is expressed or the toxic gene may be conditionally lethal, i. e., be lethal to a susceptible organism only when the gene is expressed and some additional factor is present. In addition, transposons suitable for use in the sequencing methods of the present invention may comprise one or more sequences suitable for binding a primer. A primer may be used to determine the sequence of the target DNA molecule adjacent to the transposon or may be used for other purposes such as PCR. Suitable sequences may be of any length as long as the primer: DNA duplex formed upon incubation of the primer with the DNA to be sequenced or amplified is sufficiently stable to permit the subsequent reaction, . e. sequencing or PCR, to be conducted. The actual nucleotide sequence of the primer binding site is not critical as long as it is known. The selection of suitable primer binding sequences and the determination of the appropriate reaction conditions for subsequent reactions are routine tasks for those of ordinary skill in the art.
Transposons suitable for insertion into DNA target molecules in order to clone portions of the target may comprise one or more recombination sites or portions thereof. In some preferred embodiments, transposons of the present invention will contain two recombination sites which may be the same or different. The two sites may be in opposite orientation to each other.
Transposons suitable for cloning applications may comprise an origin of replication. In some embodiments, the origin of replication may be selected to be compatible with the origin of replication in one or more of the vectors used in the practice of the present invention. This will permit the nucleic acid molecules comprising the origin of replication derived from the transposon to be stably maintained in cells that also contain the vector. In other embodiments, the origin of replication may be selected so as to be incompatible with the origin of replication in the vector. This will facilitate segregation of the vector and the transposon containing nucleic acid molecule. The sequences and characteristics of origins of replication are well known to those skilled in the art. Examples of suitable origins of replication may be found in Current Protocols in Molecular Biology, Ausubel, et al. Eds., John Wiley and Sons, 1994, which is specifically incorporated herein by reference. Other suitable origins of replication are known to those skilled in the art and are within the scope of the present invention. The origins of replication used in the present invention may direct the replication of nucleic acid molecules containing them in a variety of organisms. In some embodiments, the origin of replication may function in prokaryotic host cells such as those previously discussed. In other embodiments, the origin of replication may function in eukaryotic host cells. Transposons suitable for use in the present invention may contain a DNA sequence that includes one or more sites that serve as a substrate for one or more restriction enzymes. In some preferred embodiments, the transposons used in the present invention may comprise a site that serves as a substrate for a restriction enzyme that cuts infrequently, a so called "rare cutter." In some embodiments of the present invention, the Vector Donor may also provide one or more sites for a rare cutter. In some embodiments, the Vector Donor may be provided with two rare cutter sites which may be the same or different and which are adjacent to the recombination sites. A transposon of the present invention may comprise more than one of the features discussed above. For example, a transposon may comprise an origin of replication in addition to recombination sites and may further comprise one or more primer binding sequences, selectable markers and/or suicide genes. Other useful combinations of features will be readily apparent to those skilled in the art and are within the scope of the present invention.
In some preferred embodiments, the molar ratio of transposon to target- containing first vector in the transposition reaction will range from about 25: 1 to about 1:25. In preferred embodiments, the molar ratio will range from about 10:1 to about 1:10. The molar ratio may be varied in order to ensure that one transposon is inserted into the DNA target. When the size of the first vector is large compared to the target, it may be desirable to have a higher ratio of transposon: vector to bias the reaction in favor of multiple insertions into each target-containing first vector in order to obtain an insertion into the target DNA. Conversely, when the size of the target DNA is large compared to the vector, it may be desirable to reduce the transposomvector ratio.
A typical in vitro transposition reaction may contain transposon, target- containing first vector, ions, buffering agents and the like. Suitable reaction conditions may be about 100-500 ng of transposon and about 1 mg of target- containing first vector. The reaction may contain a divalent metal ion in a concentration from about 0.5 mM to about 250 mM. In preferred embodiments, MgCl2 may be source of the divalent metal ion and may be present in a concentration from about 1 mM to about 50 mM, more preferably from about 5 mM to about 20 mM. The reaction solution may also contain a buffering agent in a concentration from about 1 mM to about 100 mM, more preferably from about 5 mM to about 50 mM and most preferably from about 10 mM to about 25 mM. A suitable buffering agent is Tris. The reaction solution may also contain a reducing agent such as b-mercaptoethanol (b-ME), dithiothreitol (DTT) or dithioerythritol (DTE) at a concentration from about 0.1 mM to about 5 mM, preferably at about 1 mM. The pH of the reaction solution may be from about 6.5 to about 8.5, preferably about 7.5. The reaction solution may contain monovalent cations in a concentration from about 1 mM to about 100 mM, preferably from about 5 mM to about 25 mM, most preferably at about 10 mM. Suitable sources of monovalent cations include KC1 and NaCl. A suitable set of reaction conditions is 15 mM MgCl2, 10 mM Tris-HCl, pH 7.5, lO mM KCl, ImM DTT and sufficient insertion-catalyzing enzyme activity to catalyze the insertion reaction. Suitable reaction conditions will vary depending upon the source of the integration sequence/insertion-catalyzing enzyme pair. Those skilled in the art will appreciate that the various insertion-catalyzing enzymes known have optimal activity under conditions specific to each enzyme. The determination and optimization of the reaction conditions for a given enzyme may be accomplished by routine experimentation by those skilled in the art. The reaction conditions may be varied based upon the size of the transposon and vector, and the activity of the insertion-catalyzing enzyme preparation. In some embodiments, the transposition reaction may be carried out in the presence of reagents that increase the effective concentration of the nucleic acid species present in the reaction. A suitable reagent of this kind is polyethylene glycol (PEG). A suitable PEG is PEG 8000. The reaction mixture may be incubated at an appropriate temperature, for example, from about 20 °C to about 37 °C, for a suitable period of time, for example, from about 15 minutes to about 16 hours. The optimum temperature and incubation period for a given transposon, target and insertion-catalyzing enzyme preparation can be determined by routine experimentation by one of ordinary skill in the art.
After incubation of the transposition reaction, the DNA may be used as is or may be purified by means known to those skilled in the art. When used without purification, the insertion-catalyzing enzyme may be inactivated, for example, by heating at 65 °C for 20 minutes. Suitable methods for purification of the DNA from the transposition reaction include phenol/chloroform extraction and ethanol precipitation, extraction using silica, for example the CONCERT™ system available from Invitrogen Corporation, Life Technologies Division, Rockville, MD, or any other purification scheme used by those skilled in the art.
When the transposition reaction is sufficiently efficient, enough molecules of the first vector comprising the transposon-containing target DNA molecule will be made to serve as a substrate for the subsequent recombination reaction. In other instances, it may be necessary to transform competent host organisms with the molecules made in the transposition reaction and grow the transformed organisms to amplify the reaction products. The transformed organisms may be grown in the presence of a suitable selection agent, such as antibiotic, to ensure the presence of the selectable marker present on the transposon in the growing organisms. Amplification steps are routine in the art and the skilled artisan can select suitable organisms and transformation conditions and isolate the amplified reaction products without the use of undue experimentation.
Example 2: Recombination of a Transposon-containing Target Molecule with a Vector Donor
A transposon-containing target DNA molecule in a first vector can be transferred to a second vector using recombinational cloning. As shown in Figure 3, the products of the insertion reaction discussed in the previous example can be mixed with a second vector termed a Vector Donor. The Vector Donor comprises recombination sites indicated as RS3 and RS in Figure 3 which recombination sites are compatible with the recombination sites present in the first vector. When the mixture is contacted with suitable recombination proteins, the target DNA molecule is transfened to the second vector. In the embodiment shown in Figure 3, the Vector Donor comprises a toxic gene between recombination sites in addition to a selectable marker outside the recombination sites (SM3). The preparation of suitable Vector Donor molecules is described in
U.S. Patent No. 5,888,732 issued to Hartley, et al., and according to the instruction manual entitled GATEWAY™ Cloning Technology (Versions 1 and 2) available from Invitrogen Corporation, Life Technologies Division (Rockville, MD). The first and the second vector are incubated in a suitable buffer. The reaction conditions may be optimized for the particular vectors and recombination proteins used. The reaction solution may contain a buffering agent at a concentration capable of maintaining the desired pH. The concentration of the buffering agent may be from about lmM to about 100 mM. Preferably from about 10 mM to about 50 mM. A suitable buffering agent is Tris. The pH of the reaction solution may be varied depending upon the pH optimum of the recombination enzymes used. In preferred embodiments, the pH of the reaction solution will be from about 6. 5 to about 8.5, more preferably from about 7.0 to 8.0 and most preferably 7.5. The reaction solution may contain monovalent cations in a concentration from about 1 mM to about 100 mM, preferably from about 5 mM to about 50 mM and most preferably from about 20 mM to about 35 mM. A suitable source of monovalent cation is NaCl. The reaction solution may also contain spermidine in a concentration from about 0.1 mM to about 10 mM, preferably from about 1 mM to about 5 mM. The reaction solution may also contain bovine serum albumin (BSA) at a concentration from about 50 mg/mL to about 5 mg/mL, preferably from ablut 100 mg/mL to about 1 mg/mL most preferably at about 500 mg/mL. The reaction solution may also contain a chelating agent at a concentration of from about 0.1 mM to about 10 mM, preferably at about 1 mM to about 5 mM. One suitable set of reaction conditions is 50 mM Tris'HCL, pH 7.5, 33 mM NaCl, 5 mM spermidine'HCl and 500 mg/mL bovine serum albumin. When the recombination sites are αttL and αttR derivatives, the reaction conditions may include 25 mMTrisΗCl, pH7.5, 22 mM NaCl, 5 mM EDTA, 5 mM spermidine»HCl and 1 mg/mL BSA. The reaction mixture is incubated at about 25°C for about 60 minutes and then incubated with a protease, for example proteinase K, for ten minutes to inactivate the recombination proteins. An increase in the efficiency of the recombination reaction is realized by linearizing the vectors prior to the recombination reaction. This may be accomplished by digestion with a suitable restriction enzyme. Alternatively, topoisomerase I may be added to the recombination reaction. After the recombination reaction, the reaction mixture may be used to transform a competent host organism. The transformed host may be grown in the presence of suitable selection agents to ensure the presence of the desired reaction product. For example, the growth medium for the transformed host may comprise two antibiotics in those embodiments where the transposon codes for resistance to one of the antibiotics and the second vector codes for resistance to the other antibiotic. In the embodiment shown in Figure 3, the transposon carries a selectable marker SM2 while the Vector Donor carries SM3. In this scenario, the first vector may code for resistance to yet a third antibiotic, i. e. SM,. The growth conditions will also select for the absence of the toxic gene. Any organism capable of growing under these conditions will contain both the selectable marker from the transposon and the selectable marker from the second vector and will not contain the toxic gene. These molecules will be the result of recombination between the first vector and the second vector and resolution of the cointegrate intermediate. As depicted in Figure 3, the product molecule will contain the target DNA containing an insertion and flanked by recombination sites that are the product of the recombination of the sites in the vector donor with the original flanking sites depicted as RS1+3 and RS2+4. For example, if the original flanking sites were αttLl and αttL2 and the sites in the Vector donor were αttR 1 and αttR2, the product molecule would contain the target nucleic acid flanked on one end by either αttBl or αttPl and flanked on the other end by either αttB2 or αttP2 depending upon the orientation of the sites with respect to the target sequence. In certain preferred embodiments, the product molecule may contain the target nucleic acid flanked by distinct mutant αttB sites.
Alternatively, after the recombination reaction, the mixture may be used i7i vitro to further manipulate the target: Oligonucleotides to the vector into which the transposon-containing DNA segment has been transferred can be used in conjunction with oligonucleotides complementary to the transposon to generate a population of amplicons extending from the vector to the site of transposon insertion. These segments can be cloned (for example, if the oligonucleotides contain recombination sites, or if the vector is charged with topoisomerase) and further manipulated or sequenced. In another such aspect, prior to amplification, individual members of the population can also be separated, amplified, and the amplification product(s) sequenced directly, thereby eliminating the need to clone and propagate the DNA segments. In some embodiments, the target DNA may have a sequence that results in the expression of one or more biological activities of interest when introduced into an appropriate host cell. For example, introduction of the target DNA sequence may result in the expression of a particular enzymatic activity. In these embodiments, it may be desirable to screen the host cells transformed with the recombination reaction mixture for the absence of the biological activity of interest thus identifying clones in which the transposon has inserted into the sequence necessary for expression of the biological activity. This provides information about the location of the sequence encoding the activity within the larger target DNA sequence. This will be particularly useful when the target sequence is large, for example, in the case of the target sequence being a cosmid,
BAC, YAC or genomic fragment.
The sequence of transposon-containing target DNA molecules may be determined by contacting the target DNA molecule with a primer that binds to a portion of the transposon sequence and then performing any suitable sequencing protocol known to those skilled in the art. It is important to note that, for sequencing applications, the present invention overcomes the obstacle presented by insertion of a transposon into the vector sequence instead of, or in addition to, insertion into the target DNA. For simplicity, Figure 2 depicts only a single insertion into a target-containing vector molecule; however, those skilled in the art will appreciate that multiple insertions are also possible. The recombination step that moves the target DNA into a second vector after completion of the transposition reaction, effectively eliminates the concern over sequencing the vector since the first vector sequence is not recovered from the recombination reaction. This is in contrast to the prior art where insertions into the vector would make it necessary to repeatedly sequence the vector or perform tedious screening procedures to eliminate clones in which the transposon inserted into the vector. In those cases where a transposon inserts into the vector and the target sequence, the resulting molecules could not be used in the prior art methods since the presence of two primer binding sites in the same molecule to be sequenced would generate an unintelligible mixture of products. Since the present methods remove the transposon containing vector portion of the starting DNA molecule, more molecules that can be sequenced can be recovered from a given transposition reaction.
Example 3: Manipulation of Large Nucleic Acid Molecules Using Insertion and Recombination
The methods of the present invention can be used to clone segments of large DNA molecules such as genomic DNA as shown in Figure 4A. In addition to genomic DNA, the methods of the present invention permit cloning of segments of any larger DNA molecule. Thus, while this embodiment of the present invention is exemplified with genomic DNA, those skilled in the art will appreciate that segments from any large DNA molecule can be cloned using these methods. For example, the large DNA molecule might be a YAC, BAC or any isolated chromosome or portions thereof Genomic DNA is isolated from the organism of interest and is contacted with a transposon comprising one or more recombination sites and an insertion- catalyzing enzyme under conditions causing the integration of the transposon into the genomic DNA. The genomic DNA is then contacted with a Vector Donor having recombination sites compatible with the recombination sites in the transposons (Figure 4A). Alternatively, the recombination sites in the transposon and the Vector Donor may be oriented so that the transposon alone cannot productively react with the Vector Donor (Figure 4B). After incubation in the presence of suitable recombination proteins, the reaction mixture can be used to transform competent host cells. The transformed host cells are grown under condition that select for the presence of the selectable marker on the Vector Donor and against the presence of the toxic gene. In some embodiments, the transposon can be modified so that the recombination event transfers the selectable marker with the genomic DNA to the Vector Donor. This configuration of the transposon is shown in Figure 5.
Transposons suitable for embodiments involving the cloning of genomic DNA may comprise two recombination sites. In some preferred embodiments, the recombination sites will have the same sequence and will be in opposite orientation, i. e., inverted repeats. A schematic representation of cloning of genomic DNA using this embodiment is shown in Figure 6. In some embodiments, the transposon may comprise a DNA sequence coding for a toxic gene. Transposons of this type will be useful in preventing recombinational cloning of the transposon or of genomic fragments that have an additional transposon located between the transposons that provided the recombination sites used for cloning. In other preferred embodiments, the recombination sites may have different sequences and be in opposite orientation. After insertion of a transposon, the genomic DNA is contacted with a Vector Donor molecule having recombination sites compatible with those in the transposon and the appropriate recombination proteins under conditions that result in a recombination between the recombination sites on the transposon and the recombination sites on the Vector Donor. Transformation and screening may be carried out as described above. In some embodiments, it may be desirable to include on the Vector Donor one or more additional recombination sites that have a different specificity from those used to recombine the transposon with the Vector Donor (Figure 6). These additional sites may be used for further manipulations of the cloned DNA. For example, it may be desirable to move the cloned DNA into a different vector which may be accomplished using the additional recombination sites.
In some preferred embodiments, the transposons used in genomic cloning may comprise an origin of replication. A transposon comprising one or more recombination site and further comprising an origin of replication is inserted into the genomic DNA. A recombination site present on a transposon may recombine with a recombination site present on an adjacent transposon resulting in the excision of the fragment between the two recombination sites. Since the excised molecule is a circular molecule having an origin of replication, the excised molecule is capable of being stably maintained in a host cell. In order to facilitate the selection of excised molecules, the transposons of the present invention may optionally comprise one or more selectable markers. In some embodiments of this type, it may be desirable to integrate two distinct populations of transposons into the genomic DNA. In a preferred embodiment, one population may comprise a recombination site and an origin of replication while the other transposon may comprise a selectable marker and a recombination site. The recombination between the recombination sites present on two adjacent transposons produces a DNA molecule that contains an origin of replication and a selectable marker in addition to the DNA of interest. Such a molecule may be transformed into an appropriate host cell line a selected for using one or more of the selectable markers. This is shown schematically in Figure 7.
The ratio of the concentration of the genomic DNA and the concentration of the transposon present in the integration reaction may be varied so as to control the size of the genomic DNA fragments transferred into the Vector Donor. By increasing the concentration of transposons, the average size of the genomic DNA fragment may be decreased.
Example 4: Construction of Subclones Using Transposition and Recombination
Target DNA containing a transposon may be used to construct clones containing less than the entire sequence of the target DNA. Such smaller clones are generally termed subclones. A transposon may be inserted into a target DNA that contains or is flanked by recombination sites, to produce the molecule shown at the top of Figure 8 A. The transposon may contain one or more recombination sites that are different from the recombination sites in the target molecule, and may in addition contain one or more selectable markers. This molecule is then contacted with one or two Vector Donors that contain recombination sites that will recombine with sites on the transposon and the target. In some embodiments, the vector containing the target DNA may be provided with additional recombination sites, while the Vector Donor(s) contain recombination sites that recombine with these additional sites. A recombination is conducted and then the nucleic acid produced in the recombination reaction is inserted into host cells. By plating portions of the transformation reaction on various selective media, the desired subclones can be isolated as shown in Figure 8A.
In some embodiments, such as those shown in Figure 8B, segments of the target DNA may be replaced. For example, the segment of the target DNA flanked by RS, and RS2 can be exchanged with a replacement sequence. The replacement sequence may be of a different size than segment replaced. Thus, exchange of a large segment of the target DNA with a small replacement sequences results in a deletion of a part of the target sequence. The replacement sequence introduce into the target DNA any desired characteristic including, but not limited to, the expression of a desired biological activity. In some embodiments of the invention, a transposon comprising a recombination site, origin of replication and a selectable marker is integrated into a target molecule. The recombination site present on the transposon is selected so as to be compatible with a recombination site present on the vector comprising the target DNA molecule. After insertion of the transposon, a recombination is conducted in the absence of a vector donor. The result is the excision of the DNA between the recombination site present in the transposon and the recombination site present in the vector. Since the excised portion of the target DNA comprises an origin of replication and a selectable marker, the excised portion can be inserted into a host cell and will be stably maintained. The result is to subclone the excised portion of the target DNA. This is schematically shown in Figure 9.
Example 5: Cloning of PCR Fragments Using Transposition and Recombination
The methods of the present invention can be used to clone PCR fragments. Primers containing recombination sequences (or portions thereof) are used to amplify a target DNA sequence (see United States provisional patent application number 60/065,930 filed October 24, 1997 and United States patent application serial number 09/177,387). Alternatively, the PCR primers may have a sequence that permits the generation of ligatable ends, for example, by including recognition sequence for a restriction enzyme. The resultant linear fragment flanked by recombination sites (or ligatable ends) is reacted with a transposon containing a selectable marker and an origin of replication. After integration of the transposon, a recombination reaction (or ligation reaction) is conducted. The result is a circular molecule having an origin of replication and a selectable marker. Alternatively, the molecule may be circularized first, followed by integration of the transposon. The circular molecule may be transformed into a competent host cell and maintained. This method will be particularly useful for the construction of gene targeting vectors. In some embodiments of this type, the transposon may comprise a selectable marker that confers resistance to neomycin and cells comprising the selectable marker may be selected with G-418. A schematic representation of this method is shown in Figure 10 In the embodiment shown in Figure 10, a target DNA molecule is amplified using pnmers containing recombination sites indicated by RS , and RS2 An integration sequence is inserted into the amplification product which is then circulanzed by a recombination event In other embodiments, the amplification product containing the integration sequence may be reacted with another nucleic acid molecule having recombination sites compatible with those in the amplification product
Example 6: Construction of Deletions in a Target DNA Molecule A vector comprising a target DNA molecule flanked by two different, non-interacting recombination sites is contacted with a transposon and an insertion-catalyzing enzyme under conditions causing the insertion of the transposon into the target DNA molecule or into the vector or into both The transposon is constructed to contain a recombination site compatible with one of the recombination sites flanking the target DNA molecule as well as a sequencing pnmer binding site In addition, the transposon may contain a sequence coding for a selectable marker and a sequence coding for a toxic gene distnbuted as shown in Figure 11
After insertion of the transposon into the vector compnsing the target DNA molecule, a recombination reaction may be earned out between the recombination site present on the transposon and the compatible recombination site present on the vector With reference to Figure 11, this would be a recombination between RS3 and RS2 The recombination reaction mixture is used to transform competent host cells that are susceptible to the toxic gene and the transformed host cells are spread on plates containing suitable reagents for selection using the selectable marker present on the transposon and the selectable marker present on the vector Insertion of the transposon into the vector sequence oi insertion of the transposon into the target DNA so that the recombination site in the transposon is in an inverse onentation with regard to the cognate recombination site in the vector results in a molecule that retains the toxic gene and, thus, will not produce colonies upon transformation. When the transposon is inserted into the target DNA so that the recombination site in the transposon has the same orientation as the recombination site on the vector, a portion of the target DNA is deleted as well as the portion of the transposon containing the toxic gene. The resulting deleted plasmid will produce colonies upon transformation.
Plasmids may be recovered from positive colonies and the size of the recovered plasmids may be determined by gel electrophoresis in order to assay how much of the target DNA was deleted. Optionally, the plasmids may be analyzed by restriction mapping using conventional techniques. Alternatively, the sequence that is deleted may be recovered, as shown in
Figure 12. An insertion element containing one or more recombination sites is inserted into the target region of a molecule that contains a recombination site. When contacted with a Vector Donor, the region between the recombination site on the insertion element and the recombination site on the target molecule is transferred to the Vector Donor, resulting in the cloning of the deleted portion of the original target.
Example 7: Generation of Populations of Nucleic Acid Molecules on Solid Supports
The methods of the present invention can further be used to generate populations of molecules attached to solid substrates. This approach can be utilized to segregate members of the population, to provide nucleic acid molecules that may serve as templates for amplification or that may be used as substrates for further addition and manipulation of DNA segments, or in systems such as in vitro transcription/translation and as templates for probe generation. In one such aspect, depicted schematically in Figure 13, a target DNA is reacted with a transposon that contains at least one recombination site. In one preferred embodiment of this aspect of the invention, the target DNA and the transposon are linear, although other configurations and structures (e.g. , circular, supercoiled, hairpin, etc.) of these molecules may also be used. Random (or directed) mtegration of the transposon containing the recombination site generates a population of molecules each containing a recombination site This population can be further reacted with a recombination site that is immobilized on a solid substrate such that the recombination reaction generates covalent linkage of the target DNA with the immobilized recombination site Each feature of the immobilization substrate thereby contains a member of the population.
There are numerous applications for such immobilized populations: for example, individual feature can further be used as substrates for amplification using oligonucleotides complementary to the transposon and the end of the target DNA By sequencing several members from the population using the transposon as a mobile pnmer site, the entirety of a large DNA segment can be determined Similarly, amphcons generated from the members on the feature can be used for the generation of probes, expression of segments of proteins, localization of domains (DNA or protein), etc. It should be noted that if desired, members of each population can be cloned using a vector containing a recombination site and an end compatible with the end of the target DNA, or following amplification
Having descnbed the present invention in some detail by way of illustration and example for purposes of clanty of understanding, it will be obvious to one of ordinary skill in the art that the same can be performed by modifying or changing the invention within a wide and equivalent range of conditions, formulations and other parameters without affecting the scope of the invention or any specific embodiment thereof, and that such modifications or changes are intended to be encompassed within the scope of the appended claims. All publications, patents and patent applications mentioned in this specification are indicative of the level of skill of those skilled in the art to which this invention pertains, and are herein incorporated by reference to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference

Claims

WHAT IS CLAIMED IS:
1. A integration sequence comprising at least one recombination site or portion thereof.
2. The integration sequence of claim 1, wherein said integration sequence further comprises at least one element selected from the group consisting of one or more primer sites, one or more transcription or translation signals or regulatory sequences, one or more termination signals, one or more origins of replication, one or more selectable markers and one or more genes or portions of genes.
3. A target nucleic acid sequence which is flanked by at least a first and at least a second recombination site, wherein said target nucleic acid sequence comprises at least one integration sequence.
4. The target nucleic acid sequence of claim 3, wherein said integration sequence further comprises at least one element selected from the group consisting of one or more primer sites, one or more transcription or translation signals or regulatory sequences, one or more termination signals, one or more recombination sites or portions thereof, one or more origins of replication, one or more selectable markers, and one or more genes or portions of genes.
5. A method for selecting a target nucleic acid molecule comprising at least one integration sequence comprising: incubating a target sequence of interest flanked by recombination sites with at least one integration sequence under conditions sufficient to cause at least one of said integration sequences to integrate in said target sequence; and selecting for said target sequence.
6. The method of claim 5, wherein said selection comprises transferring said target sequence into a vector.
7. A method for selecting a target nucleic acid molecule, comprising: transferring a target sequence flanked by recombination sites and comprising at least one integration sequence from a first nucleic acid molecule to a second nucleic acid molecule; and selecting for said second nucleic acid molecule comprising said target sequence flanked by recombination sites.
8. A method of determining the sequence of a nucleic acid molecule comprising: transferring a target sequence flanked by recombination sites and containing at least one integration sequence from a first nucleic acid molecule to a second nucleic acid molecule; and determining the sequence of at least a portion of said target sequence.
9. The method according to claim 8, wherein said integration sequence contains at least one primer site.
10. The method according to claim 8, wherein said transfer is accomplished by recombinational cloning.
11. The method according to claim 8, wherein said transfer is preformed in vitro or in vivo.
12. A method of making one or more deletions in a nucleic acid molecule comprising: contacting a nucleic acid molecule which comprises at least a first recombination site with an integration sequence, the integration sequence comprising at least a second recombination site under conditions such that at least one of said integration sequences is inserted into said nucleic acid molecule; and causing at least said first and said second recombination sites to recombine, thereby resulting in a deletion of at least a portion of said nucleic acid molecule.
13. A method for making one or more deletions in a nucleic acid molecule comprising: obtaining said nucleic acid molecule which comprises at least a first and second recombination site; and causing said first and said second recombination sites to recombine, thereby resulting in a deletion of at least a portion of said nucleic acid molecule.
14. A method of cloning a nucleic acid molecule or a population of nucleic acid molecules comprising: inserting one or more integration sequences comprising at least one recombination site into at least one nucleic acid molecule; and transferring one or more nucleic acid molecules flanked by recombination sites by recombinational cloning into one or more vectors.
15. The method of claim 14, wherein said nucleic acid molecule is genomic, chromosomal or cDNA.
16. A method for cloning a nucleic acid molecule or a population of nucleic acid molecules comprising: inserting one or more integration sequences comprising at least one recombination site into at least one nucleic acid molecule thereby resulting in said nucleic acid molecule comprising at least a first and a second recombination site; and causing said at least first and second recombination sites to recombine.
17. The method of claim 16, wherein said recombination of said first and second recombination sites results in a circular molecule.
18. The method of claim 16, wherein said first and second recombination sites are separated by at least a portion of said nucleic acid molecule.
19. The method of claim 16, wherein said integration sequence comprises at least one element selected from the group consisting of one or more primer sites, one or more transcription or translation signals or regulatory sequences, one or more termination signals, one or more origins of replication, one or more selectable markers, and one or more genes or portions of genes.
20. The method of claim 16, wherein said integration sequence comprises one or more origins of replication and/or one or more selectable markers.
21. A method of circularizing a linear nucleic acid molecule comprising: obtaining a linear nucleic acid molecule comprising at least a first and second recombination site; and causing said first and second recombination site to recombine.
22. The method of claim 21, wherein said recombination sites are located at or near each terminus of said linear nucleic acid molecule.
23. The method of claim 21, wherein said first and/or second recombination sites are added to said linear nucleic acid molecule by amplification with one or more pnmers compnsing at least one recombination site or portion thereof.
24. The method of claim 21, wherein said first and/or second recombination sites are added to said linear nucleic acid molecule by adding one or more adapters compnsing at least one recombination site or portion thereof.
25. The method of claim 21, further compnsing incubating said linear nucleic acid molecule with at least one integration sequence under conditions sufficient to cause at least one of said integration sequences to insert in said linear nucleic acid molecule.
26. The method of claim 21, further compnsing incubating said circulanzed nucleic acid molecule with at least one integration sequence under conditions sufficient to insert at least one of said integration sequences in said circulanzed molecule.
27. The method of claim 16, wherein said nucleic acid molecule is genomic, chromosomal or cDNA
28. The method of claim 21, wherein said nucleic acid molecule is genomic, chromosomal or cDNA.
29. A method according to claim 13, wherein said first and said second recombination sites recombine in vitro.
EP00976633A 1999-10-25 2000-10-25 Methods of manipulating and sequencing nucleic acid molecules using transposition and recombination Withdrawn EP1224304A4 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16140399P 1999-10-25 1999-10-25
US161403P 1999-10-25
PCT/US2000/029355 WO2001031039A1 (en) 1999-10-25 2000-10-25 Methods of manipulating and sequencing nucleic acid molecules using transposition and recombination

Publications (2)

Publication Number Publication Date
EP1224304A1 true EP1224304A1 (en) 2002-07-24
EP1224304A4 EP1224304A4 (en) 2005-04-06

Family

ID=22581047

Family Applications (1)

Application Number Title Priority Date Filing Date
EP00976633A Withdrawn EP1224304A4 (en) 1999-10-25 2000-10-25 Methods of manipulating and sequencing nucleic acid molecules using transposition and recombination

Country Status (8)

Country Link
US (1) US20070128725A1 (en)
EP (1) EP1224304A4 (en)
JP (1) JP4731078B2 (en)
CN (2) CN1399684A (en)
AU (1) AU784522B2 (en)
CA (1) CA2388907A1 (en)
NZ (2) NZ529476A (en)
WO (1) WO2001031039A1 (en)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7670823B1 (en) 1999-03-02 2010-03-02 Life Technologies Corp. Compositions for use in recombinational cloning of nucleic acids
CN101125873A (en) * 1997-10-24 2008-02-20 茵维特罗根公司 Recombinational cloning using nucleic acids having recombination sites
CA2327387C (en) 1998-05-07 2012-07-03 Gx Biosystems A/S Cytotoxin-based biological containment
US6551828B1 (en) * 2000-06-28 2003-04-22 Protemation, Inc. Compositions and methods for generating expression vectors through site-specific recombination
US7198924B2 (en) 2000-12-11 2007-04-03 Invitrogen Corporation Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites
WO2002046372A1 (en) * 2000-12-08 2002-06-13 Invitrogen Corporation Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites
WO2002053576A1 (en) 2001-01-05 2002-07-11 The General Hospital Corporation Viral delivery system for infectious transfer of large genomic dna inserts
EP1362110B1 (en) 2001-02-23 2012-09-12 Université Libre de Bruxelles Method for the selection of recombinant clones comprising a sequence encoding an antidote protein to a toxic molecule
FR2824844B1 (en) * 2001-05-18 2003-09-19 Genoway DUAL SELECTION CLONING PROCESS AND VECTORS THEREFOR
WO2002095055A2 (en) * 2001-05-21 2002-11-28 Invitrogen Corporation Compositions and methods for use in isolation of nucleic acid molecules
US9309518B2 (en) 2002-09-03 2016-04-12 Universite Libre De Bruxelles Reversible, parallel and multitask cloning method and kit
JP4546828B2 (en) 2002-09-03 2010-09-22 ユニヴェルシテ リブル ドゥ ブリュッセル Cloning methods and kits that are reversible, parallel, and capable of performing multiple tasks
CN1263860C (en) * 2002-09-30 2006-07-12 华南农业大学 Construction method of multigene carrier and its application
WO2005028615A2 (en) * 2003-06-26 2005-03-31 Invitrogen Corporation Methods and compositions for detecting promoter activity and expressing fusion proteins
EP1697534B1 (en) 2003-12-01 2010-06-02 Life Technologies Corporation Nucleic acid molecules containing recombination sites and methods of using the same
JP2006141320A (en) * 2004-11-22 2006-06-08 Invitrogen Japan Kk Method for cloning plural nucleic acid fragments
BR122021015417B1 (en) 2013-08-19 2022-11-08 Syngulon Sa A GENETICALLY ENGINEERED MICROBIAL CELL COMPRISING A NUCLEIC ACID ENCODING A BACTERIOCIN
CN103938277B (en) * 2014-04-18 2016-05-25 中国科学院北京基因组研究所 Taking trace amount DNA as basis two generation sequencing library construction method
CN108728477B (en) * 2017-04-24 2022-02-22 华东理工大学 Efficient transposition mutation system and construction method
US11932672B2 (en) 2017-12-19 2024-03-19 Syngulon S.A. Fermentation process
CN114891787B (en) * 2022-05-09 2023-06-27 珠海圣美生物诊断技术有限公司 Random probe, preparation method and application

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2670502A1 (en) * 1990-12-13 1992-06-19 Eurolysine "MULTISITE" INTEGRATION CASSETTE IN THE GENOME OF A YEAST, PROCESSED YEAST AND PROCESS FOR THE PREPARATION OF A PRODUCT OF INTEREST BY A YEAST THUS PROCESSED.
WO1994020604A2 (en) * 1993-03-05 1994-09-15 Crop Genetics International Corporation Agricultural-chemical-producing endosymbiotic microorganisms and method of preparing and using same
US5451513A (en) * 1990-05-01 1995-09-19 The State University of New Jersey Rutgers Method for stably transforming plastids of multicellular plants
WO2000052027A1 (en) * 1999-03-02 2000-09-08 Invitrogen Corporation Compositions and methods for use in recombinational cloning of nucleic acids

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5286632A (en) * 1991-01-09 1994-02-15 Jones Douglas H Method for in vivo recombination and mutagenesis
EP1229113A3 (en) * 1995-06-07 2002-11-27 Invitrogen Corporation Recombinational cloning using engineered recombination sites
NZ504214A (en) * 1997-10-24 2003-06-30 Invitrogen Corp Recombination cloning using nucleic acids having recombination sites
US7007180B2 (en) * 2000-01-13 2006-02-28 Access Co., Ltd. System and method for switching a computer system to a first or second power saving mode based on whether or not there exists a timer-expiration-waiting event in an event queue

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5451513A (en) * 1990-05-01 1995-09-19 The State University of New Jersey Rutgers Method for stably transforming plastids of multicellular plants
FR2670502A1 (en) * 1990-12-13 1992-06-19 Eurolysine "MULTISITE" INTEGRATION CASSETTE IN THE GENOME OF A YEAST, PROCESSED YEAST AND PROCESS FOR THE PREPARATION OF A PRODUCT OF INTEREST BY A YEAST THUS PROCESSED.
WO1994020604A2 (en) * 1993-03-05 1994-09-15 Crop Genetics International Corporation Agricultural-chemical-producing endosymbiotic microorganisms and method of preparing and using same
WO2000052027A1 (en) * 1999-03-02 2000-09-08 Invitrogen Corporation Compositions and methods for use in recombinational cloning of nucleic acids

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
See also references of WO0131039A1 *
SNAITH M R ET AL: "Multiple cloning sites carrying loxP and FRT recognition sites for the Cre and Flp site-specific recombinases" GENE, ELSEVIER BIOMEDICAL PRESS. AMSTERDAM, NL, vol. 166, no. 1, 1995, pages 173-174, XP004043130 ISSN: 0378-1119 *

Also Published As

Publication number Publication date
US20070128725A1 (en) 2007-06-07
CN1818070A (en) 2006-08-16
CA2388907A1 (en) 2001-05-03
AU1437801A (en) 2001-05-08
AU784522B2 (en) 2006-04-27
NZ518503A (en) 2004-02-27
JP2003512075A (en) 2003-04-02
NZ529476A (en) 2005-03-24
CN1399684A (en) 2003-02-26
WO2001031039A1 (en) 2001-05-03
JP4731078B2 (en) 2011-07-20
EP1224304A4 (en) 2005-04-06

Similar Documents

Publication Publication Date Title
US20070128725A1 (en) Methods of manipulating and sequencing nucleic acid molecules using transposition and recombination
US7351578B2 (en) Use of multiple recombination sites with unique specificity in recombinational cloning
EP0937098B1 (en) Recombinational cloning in vitro using engineered recombination sites
EP1250453B1 (en) Use of multiple recombination sites with unique specificity in recombinational cloning
US6720140B1 (en) Recombinational cloning using engineered recombination sites
US20130323794A1 (en) Recombinational cloning using engineered recombination sites
US20070196838A1 (en) Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites
EP2105496B1 (en) Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites
US20030186233A1 (en) Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites
US7198924B2 (en) Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites
WO1996040724A9 (en) Recombinational cloning using engineered recombination sites
EP1697534B1 (en) Nucleic acid molecules containing recombination sites and methods of using the same
US20040040053A1 (en) Method for the preparation of nucleic acid
WO2004005482A2 (en) Methods and compositions for the production, identification and purification of fusion proteins
NZ516384A (en) Composition comprising a nucleic acid molecule
AU1006201A (en) Recombinational cloning using engineered recombination sites

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20020502

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL PAYMENT 20020502;LT PAYMENT 20020502;LV PAYMENT 20020502;MK PAYMENT 20020502;RO PAYMENT 20020502;SI PAYMENT 20020502

A4 Supplementary search report drawn up and despatched

Effective date: 20050221

17Q First examination report despatched

Effective date: 20071214

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20080625