EP1100885A1 - Schnelle subklonierung mittels ortsspezifischer rekombination - Google Patents

Schnelle subklonierung mittels ortsspezifischer rekombination

Info

Publication number
EP1100885A1
EP1100885A1 EP98937141A EP98937141A EP1100885A1 EP 1100885 A1 EP1100885 A1 EP 1100885A1 EP 98937141 A EP98937141 A EP 98937141A EP 98937141 A EP98937141 A EP 98937141A EP 1100885 A1 EP1100885 A1 EP 1100885A1
Authority
EP
European Patent Office
Prior art keywords
nucleic acid
site
sequence
specific recombinase
gene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP98937141A
Other languages
English (en)
French (fr)
Other versions
EP1100885A4 (de
Inventor
Stephen J. Elledge
Qinghua Liu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baylor College of Medicine
Original Assignee
Baylor College of Medicine
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/122,384 external-priority patent/US6828093B1/en
Application filed by Baylor College of Medicine filed Critical Baylor College of Medicine
Publication of EP1100885A1 publication Critical patent/EP1100885A1/de
Publication of EP1100885A4 publication Critical patent/EP1100885A4/de
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • C12N15/905Stable introduction of foreign DNA into chromosome using homologous recombination in yeast
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/66General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease

Definitions

  • the invention relates to recombinant DNA technology.
  • the invention relates to compositions, including vectors, and methods for the rapid subcloning of nucleic acid sequences in vivo and in vitro.
  • This process typically involves isolation or cloning of the gene encoding the protein of interest followed by transfer of the coding region into an expression vector that contains elements (e.g., promoters) which direct the expression of the desired protein in the heterologous host cell.
  • the most commonly used means of transferring or subcloning a coding region into an expression vector involves the in vitro use of restriction endonucleases and DNA ligases.
  • Restriction endonucleases are enzymes which generally recognize and cleave a specific DNA sequence in a double-stranded DNA molecule. Restriction enzymes are used to excise the coding region from the cloning vector and the excised DNA fragment is then joined using DNA ligase to a suitably cleaved expression vector in such a manner that a functional protein may be expressed.
  • the ability to transfer the desired coding region to an expression vector is often limited by the availability or suitability of restriction enzyme recognition sites. Often multiple restriction enzymes must be employed for the removal of the desired coding region and the reaction conditions used for each enzyme may differ such that it is necessary to perform the excision reactions in separate steps. In addition, it may be necessary to remove a particular enzyme used in an initial restriction enzyme reaction prior to completing all restriction enzyme digestions; this requires a time-consuming purification of the subcloning intermediate. Ideal methods for the subcloning of DNA molecules would permit the rapid transfer of the target DNA molecule from one vector to another in vitro or in vivo without the need to rely upon restriction enzyme digestions.
  • the present invention provides reagents and methods which comprise a system for the rapid subcloning ' of nucleic acid sequences in vivo and in vitro without the need to use restriction enzymes.
  • the present invention provides a method for the recombination of nucleic acid constructs, comprising: providing a first nucleic acid construct comprising, in operable order, an origin of replication, a first sequence-specific recombinase target site, and a nucleic acid of interest, a second nucleic acid construct comprising, in operable order, an origin of replication, a regulatory element and a second sequence-specific recombinase target site adjacent to and downstream from the regulatory element, and a site-specific recombinase; contacting the first and the second nucleic acid constructs with the site-specific recombinase under conditions such that the first and second nucleic acid constructs are recombined to form a third nucleic acid construct, wherein the nucleic acid of interest is operably linked to the regulatory element.
  • the present invention contemplates the use of any type of regulatory element.
  • the regulatory element comprises a promoter element, a fusion peptide (e.g., an affinity domain), or an epitope tag.
  • the nucleic acid of interest comprises a gene.
  • the first nucleic acid construct further comprises a selectable marker.
  • the second nucleic acid construct further comprises a selectable marker.
  • the present invention also contemplates that the first and second nucleic acid constructs both comprise selectable markers. In preferred embodiments the selectable markers of the first and second nucleic acid constructs are different from one another.
  • the first nucleic acid construct further comprises a prokaryotic termination sequence.
  • Prokaryotic termination sequences include, but are not limited to the T7 termination sequence.
  • the first nucleic acid construct further comprises a eukaryotic polyadenylation sequence.
  • Polyadenylation sequences include, but are not limited to, the bovine growth hormone polyadenylation sequence, the simian virus 40 polyadenylation sequence, and the Herpes Simplex virus thymidine kinase polyadenylation sequence.
  • the first nucleic acid construct further comprises a conditional origin of replication.
  • the first and second sequence- specific recombinase target sites are selected from the group consisting of loxP, loxP2, / xP3, / xP23, /oxP511 , lox , loxC2, loxL, loxR, loxAS6, loxAl ll, frt, dif, loxH and atl.
  • the present invention contemplates that the first and second sequence-specific recombinase target sites may comprise the same sequence or may comprise different sequences.
  • the first nucleic acid construct further comprises a polylinker.
  • the present invention contemplates that the recombination methods can be used in vitro and in vivo.
  • the site-specific recombinase is provided by a host cell expressing the site-specific recombinase.
  • the contacting of the first and the second nucleic acid constructs with the site-specific recombinase comprises introducing the first and said second nucleic acid constructs into a host cell under conditions such that the third nucleic acid construct is capable of replicating in the host cell.
  • the present invention further provides methods for precise transfer of nucleic acid molecules by recombination.
  • the first nucleic acid construct further comprises a third sequence-specific recombinase target site and said second nucleic acid constructs further comprises a fourth sequence-specific recombinase target site.
  • the first sequence-specific recombinase and the third sequence-specific recombinase in the first nucleic acid construct are located on opposite sides of the nucleic acid of interest. It is contemplated that the first and third sequence-specific recombinase target sites are contiguous with, adjacent to, or distant from the nucleic acid of interest.
  • the third and fourth sequence-specific recombinase target sites are selected from the group consisting of RS sites and Res sites, although other target sites are contemplated by the present invention.
  • the first nucleic acid construct further comprises a third sequence- specific recombinase target site and the second nucleic acid constructs further comprises a fourth sequence-specific recombinase target site, wherein the method further comprises providing a second site-specific recombinase and the step of contacting the third nucleic acid construct with the second site-specific recombinase under conditions such that the third nucleic acid construct is recombined to form a fourth and a fifth nucleic acid construct.
  • the present invention also provides a recombined nucleic acid construct prepared according to any of the above methods.
  • the present invention further provides a method for the recombination of nucleic acid constructs, comprising: providing a vector, a linear nucleic acid molecule comprising a sequence complementary to at least a portion of said vector, and an E. coli host cell, wherein said host cell comprises an endogenous recombination system, a loss of function rec mutation, a suppressor, and a loss of function endogenous restriction modification system mutation; and introducing the vector and the linear nucleic acid molecule into the host cell under conditions such that the linear nucleic acid molecule and the vector are recombined to form a recombinant nucleic acid construct.
  • the loss of function rec mutation is selected from the group consisting of recBC and recD.
  • the suppressor comprises sbc.
  • the loss of function endogenous restriction modification system mutation comprises hsdR.
  • the present invention further provides a method for generating a nucleic acid fusion on the 3 * end of the nucleic acid of interest in the first nucleic acid construct from above, comprising: providing a tagged linear nucleic acid sample comprising a tag to be added to the 3' end of the nucleic acid of interest, and a sequence complementary to a region of the first nucleic acid construct that is 3' of the nucleic acid of interest; and a host cell capable of endogenous homologous recombination of complementary nucleic acid molecules; and introducing the tagged linear nucleic acid sample and the first nucleic acid construct into the host cell under conditions such that the tagged linear nucleic acid sample and the first nucleic acid construct are recombined to form a tagged nucleic acid construct.
  • the present invention further provides a method for the cloning of nucleic acid libraries, comprising: providing a plurality of first nucleic acid constructs comprising, in operable order, an origin of replication, a first sequence-specific recombinase target site, and a nucleic acid member from a nucleic acid library, a plurality of second nucleic acid construct comprising, in operable order, an origin of replication, a regulatory element and a second sequence-specific recombinase target site adjacent to and downstream from the regulatory element, and a site-specific recombinase; contacting the plurality of first and second nucleic acid constructs with the site-specific recombinase under conditions such that the plurality of first and second nucleic acid constructs are recombined to form a plurality of third nucleic acid constructs, wherein the nucleic acid members from the nucfeic ⁇ 1 ⁇ c ⁇ 'd library are operably linked to the regulatory elements.
  • the present invention further
  • the present invention also provides a method for the directional cloning of a nucleic acid molecule, comprising: providing first and second portions of a regulatory element, a first nucleic acid molecule comp ⁇ sing the first portion of the regulatory element; and a second nucleic acid molecule comprising the second portion of the regulatory element; and combining the first and the second nucleic acid molecules to produce a third nucleic acid molecule under conditions whereby an intact regulatory element is produced from the combination of the first and the second portions of the regulatory element, wherein the presence of the intact regulatory element in the third nucleic acid molecule indicates a direction of cloning of the first nucleic acid molecule with respect to the second nucleic acid molecule.
  • the present invention also provides a method for the directional cloning of a nucleic acid molecule, comprising providing: the nucleic acid molecule to be cloned, a first primer comprising sequence complementary to the nucleic acid molecule, a second primer comprising sequence complementary to the nucleic acid molecule and sequence corresponding to a first portion of a lacO site, amplification means, and a target nucleic acid molecule comprising a second portion of the lacO site; amplifying the nucleic acid molecule with the first and second primers to produce a modified nucleic acid molecule comprising the first portion of a lacO site; and ligating the modified nucleic acid molecule into the target nucleic acid such that, when cloned in the desired direction, an intact lacO site is produced.
  • the method further comprises the step of detecting the intact lacO site.
  • the target nucleic acid molecule comprises pUNI-30.
  • the present invention further provides a method for regulated recombination in host cells that constitutively express a recombinase, comprising: providing a host cell expressing a recombinase, a first nucleic acid construct comprising an origin of replication, a first site- specific recombinase site, a second site-specific recombinase site that differs in sequence from the first site-specific recombinase site such that the recombinase will not initiate recombination between the first and second site-specific recombinase sites, and a selectable marker gene between the first and second site-specific recombinase sites, and a second nucleic acid construct comprising an origin of replication, a third site-specific recombinase target site, and a fourth site-specific
  • the method further comprises the step of selecting for a desired recombinant nucleic acid molecule using the selectable marker.
  • the first nucleic acid construct is a Univector.
  • the second nucleic acid construct is a Univector.
  • the present invention also provides, a nucleic acid construct comprising, in operable order: a conditional origin of replication; a sequence-specific recombinase target site having a 5' and a 3 ' end; and a unique restriction enzyme site, said restriction enzyme site located adjacent to the 3' end of the sequence-specific recombinase target site.
  • the construct further comprises a prokaryotic termination sequence.
  • the construct further comprises a eukaryotic polyadenylation sequence.
  • the present invention contemplates the use of any prokaryotic termination sequence and any eukaryotic polyadenylation sequence.
  • the construct further comprises one or more selectable marker genes.
  • Selectable marker genes include, but are not limited to the kanamycin resistance gene, the ampicillin resistance gene, the tetracycline resistance gene, the chloramphenicol resistance gene, the streptomycin resistance gene, the sir A gene, and the sacB gene.
  • the sequence- specific recombinase target site is selected from the group consisting of /oxP, / xP2, /oxP3, /oxP23, /oxP51 1 , loxB, loxC2, loxL, loxR, /ox ⁇ 86, lox ⁇ ll, frt, dif, loxU and att.
  • the construct further comprises a gene of interest inserted into the unique restriction enzyme site.
  • the construct has the nucleotide sequence set forth in SEQ ID NOT.
  • the construct further comprises a second sequence-specific recombinase target site.
  • the second sequence-specific recombinase target site is selected from the group consisting of RS site and a Res site.
  • the construct further comprises a polylinker.
  • the present invention further provides a nucleic acid construct comprising in 5' to 3' operable order: an origin of replication; a promoter element having a 5 " and a 3 end; and a sequence-specific recombinase target site having a 5' and a 3' end.
  • the construct further comprises a selectable marker gene.
  • the present invention also provides a nucleic acid construct comprising in operable order: a promoter element having a 5' and a 3' end; a first sequence-specific recombinase target site having a 5' and a 3' end, wherein the 3' end of the promoter element is located upstream of the 5' end of the sequence-specific recombinase target site; ' a gene of interest joined to the 3' end of the sequence-specific recombinase target site such that a functional translational reading frame is created; a conditional origin of replication; a first selectable marker gene; a second sequence-specific recombinase target site; and an origin of replication.
  • the construct further comprises a second selectable marker gene.
  • the present invention also provides a method for the recombination of nucleic acid constructs, comprising: providing a first nucleic acid construct comprising a loxH site, a second nucleic acid construct comprising a loxH site; and a site-specific recombinase; and contacting the first and the second nucleic acid constructs with the site-specific recombinase under conditions such that the first and second nucleic acid constructs are recombined.
  • the present invention also provides a recombined nucleic acid construct prepared according to the above method.
  • Figure 1 provides a schematic illustrating certain elements of the pUNI vectors and the Univector Fusion System.
  • Figure 2A provides a schematic map of the pUNI-10 vector; the locations of selected restriction enzyme sites are indicated and unique sites are indicated by the use of bold type.
  • Figure 2B shows the DNA sequence of the lox? site and the polylinkers contained within pUNI-10 (Le., nucleotides 401-530 of SEQ ID NOT).
  • Figure 3A shows the oligonucleotides (SEQ ID NOS:4 and 5) which were annealed to insert a loxP site into the polylinker of pGEX-2TKcs to create pGst-/ox.
  • Figure 3B provides a schematic map of pGEX-2TKcs which includes an enlargement of the multiple cloning site (MCS).
  • Figure 4A shows the oligonucleotides (SEQ ID NOS:6 and 7) which were annealed to insert a lox? site into the polylinker of pVL1392 to create pVL1392-/t>x.
  • Figure 4B provides a schematic map of pVL1392 which includes an enlargement of the multiple cloning site (MCS); the ampicillin resistance gene (Ap R ) and the tac promoter (P tac ) are indicated.
  • Figure 5 A shows the oligonucleotides (SEQ ID NOS: 8 and 9) which were annealed to insert a lox? site into the polylinker of pGAP24 to create pGAP24-/ox.
  • Figure 5B provides a schematic map of pGAP24 which includes an enlargement of the multiple cloning site (MCS); the ampicillin resistance gene (Ap R ), the GAP promoter (P GAP ), the origin from the 2 ⁇ m circle (2 ⁇ ) and the TRPl gene, encoding N-(5'-phosphoribosyl)- anthranilate synthetase, (TRPl) are indicated.
  • MCS multiple cloning site
  • Amicillin resistance gene Ap R
  • P GAP GAP promoter
  • TRPl N-(5'-phosphoribosyl)- anthranilate synthetase
  • Figure 6A shows the oligonucleotides (SEQ ID NOS: 8 and 9) which were annealed to insert a lox? site into the polylinker of pGAL14 to create pGAL14-/ x.
  • Figure 6B provides a schematic map of pGAL 14 which includes an enlargement of the multiple cloning site (MCS); the ampicillin resistance gene (Ap R ), the GAL promoter (P GAL ), the yeast centromeric sequences (CEN), yeast autonomous replication sequences (ARS) and the TRPl gene (TRPl) are indicated.
  • MCS multiple cloning site
  • Ap R ampicillin resistance gene
  • P GAL GAL promoter
  • CEN yeast centromeric sequences
  • ARS yeast autonomous replication sequences
  • TRPl TRPl
  • Figure 7 shows a Coomassie blue-stained SDS-PAGE gel showing the purification of Gst-Cre from E. coli cells containing pQL123.
  • Figure 8 provides a schematic showing the strategy employed for the in vitro recombination of a pUNI vector ("pA,” pUNI-5) with a pHOST vector ("pB.” pQL103) to create a fused construct ("pAB"). The relevant markers on each construct are indicated, as are selected restriction enzyme sites.
  • Figure 9A provides a schematic showing the starting constructs (pU I-Skpl and pGst- lox) and the predicted fusion construct (pGst-Skpl) generated by an in vitro fusion reaction.
  • Figure 9B provides an ethidium bromide-stained gel showing the separation of restriction fragments generated by the digestion of pUNI-Skpl, pGst-/ x and pGst-Skpl .
  • Figure 10A shows a Coomassie blue-stained SDS-PAGE gel showing the expression of the Gst-Skpl protein from E. coli cells containing pGst-Skpl .
  • Figure 10B shows a Western blot of an SDS-PAGE gel containing extracts prepared from E. coli cells containing pGst-Skpl which was probed using an anti-Skpl antibody.
  • Figure 1 1 shows a Western blot of an SDS-PAGE gel containing extracts prepared from E. coli cells (QLB4) containing either a conventionally constructed Gst-Skpl plasmid or pGst-Skpl (produced by an in vitro fusion reaction).
  • Figure 12 provides a schematic illustrating the in vivo gene trap method for the recombination of /ox-containing vectors in a host cell constitutively expressing the Cre protein.
  • Figure 13 provides the nucleotide sequence of the wild-type lox? site (SEQ ID
  • Figure 14 shows a schematic for one embodiment of Cre-mediated plasmid fusion.
  • Figure 15 shows data demonstrating the efficiency of Gst-Cre recombinase activity as measured by UPS.
  • Figure 16 shows the protein expression of UPS generated fusion proteins containing lox? following separation by SDS-PAGE and (A) staining with Coomassie blue, and (B) immunoblotting with anti-Skpl antibodies.
  • Figure 17 shows a comparison of expression levels between loxP and / ⁇ xH containing constructs.
  • Figure 18 shows the expression of UPS-derived baculovirus expression constructs in insect cells.
  • Figure 19 shows immunblotting with anti-HA antibodies of Hela cells expressing Myc- tagged F-box protein under the control of the CMV promoter.
  • Figure 20 shows a schematic representation of the POT reaction.
  • Figure 21 shows restriction digestion assays of sample that underwent POT with SKP1 replacing the E gene in pAS2-E.
  • Figure 22 shows a schematic of a method for directional subcloning of nucleic acid samples into a Univector.
  • Figure 23 provides a schematic map of the pUNI-10, pUNI-20, and pUNI-30 vectors.
  • Figure 24 shows a schematic of a method for producing a tagged recombinant protein.
  • Figure 25 shows a schematic of a gap repair scheme for modification of the 3 * end of coding regions using homologous recombination.
  • conditional origin of replication refers to an origin of replication that requires the presence of a functional trans-acting factor (e.g., a replication factor) in a prokaryotic host cell.
  • Conditional origins of replication include, but are not limited to, temperature-sensitive replicons such as rep pSC101 ,s .
  • the term "origin of replication” refers to that is functional in a broad range of prokaryotic host cells (i.e., a normal or non-conditional origin of replication such as the ColEl origin and its derivatives).
  • sequence-specific recombinase and “site-specific recombinase” refer to enzymes that recognize and bind to a short nucleic acid site or sequence and catalyze the recombination of nucleic acid in relation to these sites.
  • sequence-specific recombinase target site and "site-specific recombinase target site” refer to a short nucleic acid site or sequence which is recognized by a sequence- or site-specific recombinase and which become the crossover regions during the site-specific recombination event.
  • sequence-specific recombinase target sites include, but are not limited to, lox sites, frt sites, att sites and dif sites.
  • /ox site refers to a nucleotide sequence at which the product of the ere gene of bacteriophage PI, Cre recombinase, can catalyze a site-specific recombination.
  • lox sites are known to the art including the naturally occurring loxP (the sequence found in the PI genome), loxB, loxL and loxR (these are found in the E. coli chromosome) as well as a number of mutant or variant lox sites such as ZoxP51 1 , /ox ⁇ 86, /ox ⁇ 1 17, loxC2, loxP2, loxP3, loxP23, loxS, and loxH.
  • 'yrt site refers to a nucleotide sequence at which the product of the FLP gene of the yeast 2 ⁇ m plasmid, FLP recombinase, can catalyze a site-specific recombination.
  • the term "unique restriction enzyme site” indicates that the recognition sequence for a given restriction enzyme appears once within a nucleic acid molecule.
  • the EcoRI site is a unique restriction enzyme site within the plasmid pUNI-10 (S ⁇ Q ID NOT).
  • a restriction enzyme site is said to be located "adjacent to the 3 ' end of a sequence- specific recombinase target site” if the restriction enzyme recognition site is located downstream of the 3' end of the sequence-specific recombinase target site.
  • the adjacent restriction enzyme site may, but need not, be contiguous with the last or 3 ' nucleotide comprising the sequence-specific recombinase target site.
  • the EcoRI site of pUNI-10 is located adjacent (within 3 nucleotides) to the 3' end of the loxP site (see Figure 2B); the Xhol. Ndel, and Ncol sites are also adjacent (i.e., within about 10-150 nucleotides) to the loxP site but these sites are not contiguous with the 3' end of the loxP site in pU ⁇ I-10.
  • polylinker or “multiple cloning site” refe ⁇ a cluster of restriction enzyme sites on a nucleic acid construct which are utilized for the insertion and/or excision of nucleic acid sequences such as the coding region of a gene, lox sites, etc.
  • prokaryotic termination sequence refers to a nucleic acid sequence which is recognized by the RNA polymerase of a prokaryotic host cell and results in the termination of transcription.
  • Prokaryotic termination sequences commonly comprise a GC-rich region that has a twofold symmetry followed by an AT-rich sequence (Stryer, supra).
  • a commonly used prokaryotic termination sequence is the T7 termination sequence.
  • termination sequences are known to the art and may be employed in the nucleic acid constructs of the present invention including, but not limited to, the iNT , T u , T L2 , T u , T R , T R2 , T b termination signals derived from the bacteriophage lambda [Lambda II, Hendrix et al. Eds., supra] and termination signals derived from bacterial genes such as the tip gene of E. coli [Stryer, supra].
  • eukaryotic polyadenylation sequence (also referred to as a "poly A site” or “poly A sequence”) as used herein denotes a DNA sequence which directs both the termination and polyadenylation of the nascent RNA transcript. Efficient polyadenylation of the recombinant transcript is desirable as transcripts lacking a poly A tail are unstable and are rapidly degraded.
  • the poly A signal utilized in an expression vector may be "heterologous” or "endogenous.” An endogenous poly A signal is one that is found naturally at the 3' end of the coding region of a given gene in the genome. A heterologous poly A signal is one which is isolated from one gene and placed 3' of another gene.
  • a commonly used heterologous poly A signal is the SV40 poly A signal.
  • the SV40 poly A signal is contained on a 237 bp BamWIIBcll restriction fragment and directs both termination and polyadenylation [J. Sambrook, supra, at 16.6-16.7]; numerous vectors contain the SV40 poly A signal [e.g., pCEP4, pREP4, pEBVHis (Invitrogen)].
  • Another commonly used heterologous poly A signal is derived from the bovine growth hormone (BGH) gene; the BGH poly A signal is available on a number of commercially available vectors [e.g., pcDNA3.1, pZeoSV2. pSecTag (Invitrogen)].
  • the poly A signal from the Herpes simplex virus thymidine kinase (HSV tk) gene is also often used as a poly A signal on expression vectors.
  • Vectors containing the HSV tk poly A signal include the pBK-CMV, pBK-RSV, and pOP13CAT vectors from Stratagene.
  • selectable marker refers to the use of a gene which encodes an enzymatic activity that confers the ability to grow in medium lacking what would otherwise be an essential nutrient (e.g., the TRPl gene in yeast cells).
  • a selectable marker may confer resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed.
  • a selectable marker may be used to confer a particular phenotype upon a host cell. When a host cell must express a selectable marker to grow in selective medium, the marker is said to be a positive selectable marker (e.g., antibiotic resistance genes which confer the ability to grow in the presence of the appropriate antibiotic).
  • Selectable markers can also be used to select against host cells containing a particular gene (e.g., the sacB gene which, if expressed, kills the bacterial host cells grown in medium containing 5% sucrose and the ⁇ X174 E gene). Selectable markers used in this manner are referred to as negative selectable markers or counter-selectable markers.
  • the term "vector" is used in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another.
  • nucleic acid construct includes circular nucleic acid constructs such as plasmid constructs, phagemid constructs, cosmid vectors, etc. as well as linear nucleic acid constructs (e.g.. ⁇ phage constructs and PCR products).
  • the nucleic acid construct may comprise expression signals such as a promoter and/or an enhancer (in such a case it is referred to as an expression vector).
  • expression vector refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism.
  • Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences.
  • Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.
  • operable combination refers to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced.
  • the term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
  • transformation and “transfection” as used herein refer to the introduction of foreign DNA into prokaryotic or eukaryotic cells. Transformation of prokaryotic cells may be accomplished by a variety of means known to the art including the treatment of host cells with CaCl 2 to make competent cells, electroporation, etc. Transfection of eukaryotic cells may be accomplished by a variety of means known to the art including calcium phosphate- DNA co-precipitation, DEAE-dextran-mediated transfection, 3 ⁇ T bfehe-mediated transfection, electroporation, micro injection, liposome fusion, lipofection, protoplast fusion, retroviral infection, and biolistics, among other means.
  • restriction endonucleases and “restriction enzymes” refer to bacterial enzymes, each of which cut double-stranded DNA at or near a specific nucleotide sequence.
  • recombinant DNA molecule refers to a DNA molecule that comprises segments of DNA joined together by means of molecular biological techniques.
  • recombinant protein or “recombinant polypeptide” as used herein refers to a protein molecule that is expressed from a recombinant DNA molecule.
  • DNA molecules are said to have "5' ends” and "3' ends” because mononucleotides are reacted to make oligonucleotides in a manner such that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its neighbor in one direction via a phosphodiester linkage. Therefore, an end of an oligonucleotides is referred to as the "5' end” if its 5' phosphate is not linked to the 3' oxygen of a mononucleotide pentose ring and as the "3 end” if its 3' oxygen is not linked to a 5' phosphate of a subsequent mononucleotide pentose ring.
  • nucleic acid sequence even if internal to a larger oligonucleotide, also may be said to have 5' and 3' ends.
  • discrete elements are referred to as being "upstream" or 5 " of the
  • downstream or 3' elements This terminology reflects the fact that transcription proceeds in a 5 ' to 3 " fashion along the DNA strand.
  • the promoter and enhancer elements that direct transcription of a linked gene are generally located 5' or upstream of the coding region. However, enhancer elements can exert their effect even when located 3" of the promoter element and the coding region. Transcription termination and polyadenylation signals are located 3 " or downstream of the coding region.
  • the 3' end of a promoter element is said to be located upstream of the 5' end of a sequence-specific recombinase target site when (moving in a 5' to 3" direction along the nucleic acid molecule) the 3' terminus of a promoter element (the transcription start site is taken as the 3' end of a promoter element) precedes the 5' end of the sequence-specific recombinase target site.
  • the 3' end of the promoter element may be located adjacent (generally within about 0 to 500 bp) to the 5' end of the sequence-specific recombinase target site. Such an arrangement is used when the pHOST vector is not intended to permit the expression of a translational fusion with the gene of ref ⁇ donated fay a pUNI vector.
  • the 3' end of the promoter element is located upstream of both the sequences encoding the amino-terminus of a fusion protein and the 5' end of the sequence-specific recombinase target site.
  • the 5' end of the sequence-specific recombinase target site is located within the coding region of the fusion protein (e.g., located downstream of both the promoter element and the sequences encoding the affinity domain, such as Gst).
  • an oligonucleotide having a nucleotide sequence encoding a gene refers to a nucleic acid sequence comprising the coding region of a gene or, in other words, the nucleic acid sequence that encodes a gene product.
  • the coding region may be present in either a cDNA, genomic DNA, or RNA form.
  • the oligonucleotide may be single-stranded (i.e., the sense strand) or double-stranded.
  • Suitable control elements such as enhancers/promoters, splice junctions, polyadenylation signals, etc. may be placed in close proximity to the coding region of the gene if needed to permit proper initiation of transcription and/or correct processing of the primary RNA transcript.
  • the coding region utilized in the vectors of the present invention may contain endogenous enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, etc. or a combination of both endogenous and exogenous control elements.
  • regulatory element refers to a genetic element that controls some aspect of the expression of nucleic acid sequences.
  • a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region.
  • Other regulatory elements are splicing signals, polyadenylation signals, termination signals, etc. (defined infra).
  • Promoters and enhancers consist of short arrays of DNA sequences that interact specifically with cellular proteins involved in transcription [Maniatis, T. et al.. Science 236:1237 (1987)]. Promoter and enhancer elements have been isolated from a variety of eukaryotic sources including genes in yeast, insect, and mammalian cells and viruses (analogous control elements, i.e., promoters, are also found in prokaryotes). The selection of a particular promoter and enhancer depends on what cell type is to be used to express the protein of interest.
  • eukaryotic promoters and enhancers have a broad host range while others are functional in a limited subset of cell types [for review, see Voss, S.D. et al.. Trends Biochem. Sci.. 1 1 :287 (1986) and Maniatis, T. et al., supra (1987)].
  • the SV40 early gene enhancer is very active in a wide variety of cell types from many mammalian species and has been widely used for the expression of proteins in mammalian cells [Dijkema, R. et al, EMBO J. 4:761 (1985)].
  • promoter/enhancer elements active in a broad range of mammalian cell types are those from the human elongation factor l ⁇ gene [Uetsuki, T. et al, J. Biol. Chem., 264:5791 (1989), Kim, D.W. et al, Gene 91 :217 (1990) and Mizushima, S. and Nagata, S., Nuc. Acids. Res., 18:5322 (1990)] and the long terminal repeats of the Rous sarcoma virus [Gorman, CM. et al, Proc. Natl. Acad. Sci.
  • promoter/enhancer denotes a segment of DNA that contains sequences capable of providing both promoter and enhancer functions (i.e., the functions provided by a promoter element and an enhancer element, see above for a discussion of these functions).
  • promoter/promoter may be "endogenous” or “exogenous” or “heterologous.”
  • An "endogenous” enhancer/promoter is one which is naturally linked with a given gene in the genome.
  • an “exogenous” or “heterologous” enhancer/promoter is one which is placed in juxtaposition to a gene by means of genetic manipulation (i.e., molecular biological techniques) such that transcription of that gene is directed by the linked enhancer/promoter.
  • the presence of "splicing signals" on an expression vector often results in higher levels of expression of the recombinant transcript. Splicing signals mediate the removal of introns from the primary RNA transcript and consist of a splice donor and acceptor site [Sambrook, J. et al, Molecular Cloning: A Laboratory Manual, 2nd ed.. Cold Spring Harbor Laboratory Press, New York (1989) pp. 16.7-16.8].
  • a commonly used splice donor and acceptor site is the splice junction from the 16S RNA of SV40.
  • Eukaryotic expression vectors may also contain "viral replicons" or "viral origins of replication.”
  • Viral replicons are viral DNA sequences that allow for the extrachromosomal replication of a vector in a host cell expressing the appropriate replication factors.
  • Vectors that contain either the SV40 or polyoma virus origin of replication replicate to high copy number (up to 10 4 copies/cell) in cells that express the appropriate viral T antigen.
  • Vectors that contain the replicons from bovine papillomavirus or Epstein-Barr virus replicate extrachromosomally at low copy number (-100 copies/cell).
  • nucleic acid molecule encod ⁇ Bg ⁇ ' ⁇ A sequence encoding and “DNA encoding” refer to the order or sequence of deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes for the amino acid sequence.
  • the term “gene” means the deoxyribonucleotide sequences comprising the coding region of a structural gene and the including sequences located adjacent to the coding region on both the 5' and 3' ends such that the gene corresponds to the length of the full-length mRNA.
  • the sequences that are located 5' of the coding region and which are present on the mRNA are referred to as 5' non-translated sequences.
  • the sequences that are located 3 ' or downstream of the coding region and which are present on the mRNA are referred to as 3' non-translated sequences.
  • the term “gene” encompasses both cDNA and genomic forms of a gene.
  • a genomic form or clone of a gene contains the coding region interrupted with non-coding sequences termed "introns" or "intervening regions” or “intervening sequences.” Introns are segments of a gene that are transcribed into nuclear
  • RNA introns may contain regulatory elements such as enhancers. Introns are removed or “spliced out” from the nuclear or primary transcript. Introns therefore are absent in the messenger RNA (mRNA) transcript.
  • mRNA messenger RNA
  • the mRNA functions during translation to specify the sequence or order of amino acids in a nascent polypeptide.
  • genomic forms of a gene may also include sequences located on both the 5' and 3' end of the sequences that are present on the RNA transcript.
  • flanking sequences or regions are located 5' or 3' to the non-translated sequences present on the mRNA transcript.
  • the 5' flanking region may contain regulatory sequences such as promoters and enhancers that control or influence the transcription of the gene.
  • the 3' flanking region may contain sequences that direct the termination of transcription, post-transcriptional cleavage, and polyadenylation.
  • the term "purified” or “to purify” refers to the removal of contaminants from a sample.
  • recombinant Cre polypeptides are expressed in bacterial host cells (e.g., as a Gst-Cre fusion protein) and the Cre polypeptides are purified by the removal of at least a portion of the host cell proteins; the percent of recombinant Cre polypeptides is thereby increased in the sample.
  • native protein is used herein to indicate that a protein does not contain amino acid residues encoded by vector sequences; that is the native protein contains only those amino acids found in the protein as it occurs in nature.
  • a native protein may be produced by recombinant means or may be isolated from a naturally occurring source.
  • portion when in reference to a protein (as in “a portion of a given protein”) refers to fragments of that protein.
  • the fragments may range in size from four amino acid residues to the entire amino acid sequence minus one amino acid.
  • fusion protein refers to a chimeric protein containing the protein of interest (e.g., the Cre protein) joined to an exogenous protein fragment (e.g., the fusion partner which consists of non-Cre protein sequences).
  • the fusion partner may enhance solubility of the protein of interest as expressed in a host cell, may provide an affinity tag to allow purification of the recombinant fusion protein from the host cell or culture supernatant, or both, among other desired characteristics.
  • the fusion protein may be removed from the protein of interest by a variety of enzymatic or chemical means known to the art.
  • the present invention provides compositions and methods that comprise a system for the rapid subcloning of nucleic acid sequences in vivo and in vitro without the need to use restriction enzymes.
  • This system is referred to as the Univector Fusion System or Univector Plasmid-fusion System (UPS).
  • the UPS employs site-specific recombination to catalyze plasmid fusion between a Univector (i.e., a plasmid containing a gene of interest) and host vectors containing regulatory information.
  • plasmid fusion events are genetically selected and result in placement of the gene of interest under the control of novel regulatory elements.
  • UPS UPS-related method of the present invention allows for the precise transfer of coding sequences alone from a Univector into a host vector.
  • UPS further provides means for the subcloning of entire nucleic acid libraries and the directional cloning of linear nucleic acid molecules (e.g., PCR products).
  • the UPS offers many advantages over previously available technologies for the manipulation of genes. For example, for a routine analysis of a new gene, it may be desirable to express it in bacteria as a glutathione-S-transf erase (Gst) or polyhistidine fusion for purification and antibody production, to fuse it to the DNA-binding domain of GAL4 or lexA for two hybrid analysis, to express it from the T7 promoter to allow generation of a riboprobe or mRNA for in vitro transcription and translation, and express it in baculovirus, all in the course of a single study.
  • Gst glutathione-S-transf erase
  • polyhistidine fusion for purification and antibody production
  • T7 promoter to allow generation of a riboprobe or mRNA for in vitro transcription and translation
  • baculovirus all in the course of a single study.
  • the availability of whole genome sequences now provides the opportunity to analyze large sets of genes for both genetic and biochemical properties.
  • the need to perform parallel processing of large gene sets exponentially amplifies the current defects associated with conventional cloning methods.
  • the methods and compositions of the present invention provide a series of recombination-based approaches that significantly reduce the time and effort involved in generating multiple transcriptional and translational fusions for gene analysis and cDNA library construction.
  • the present invention provides a system whereby a gene can be placed under the control of any of a variety of promoters or fused in frame to other proteins or peptides without the use of restriction enzymes. As discussed above, the
  • UPS uses site-specific recombination to fuse two plasmids at a unique sequence adjacent to both a regulatory region and the 5' end of the gene or interest, thereby placing the gene under new regulation.
  • This system together with the other methods and compositions of the present invention discussed herein, provide a multifaceted approach for the rapid and efficient generation and manipulation of recombinant DNA, thus making possible parallel processing of whole genome sets of coding sequences.
  • the basis of the UPS is a vector termed the "Univector” or the "pUNI vector” into which sequences encoding a gene of interest (cDNA or genomic) are inserted.
  • the pUNI vector has a sequence-specific recombinase target site, such as a loxP site, preceding the insertion site for the gene of interest, a selectable marker gene (this feature is optional) and a conditional origin of replication that is active only in host cells expressing the requisite transacting replication factor (this feature is optional).
  • the pUNI vectors are designed to contain a gene of interest but lack a promoter for the expression of the gene of interest.
  • the gene of interest may be cloned directly into the pUNI vector (i.e., the pUNI vector may be used as a cloning vector, particularly for the cloning of cDNA libraries) or a previously cloned gene of interest may be inserted (i.e., subcloned) into the pUNI vector.
  • a sequence-specific recombinase e.g., Cre recombinase
  • a precise fusion of the pUNI vector into a second vector containing another sequence-specific recombinase target site is catalyzed.
  • the second vector referred to generically as a "pHOST" vector, is a vector (e.g., expression vector) that contains the sequence- specific recombinase target site downstream of regulatory element (e.g., a promoter) contained within the pHOST vector.
  • each vector e.g., the pUNI vector and the pHOST vector
  • the two vectors are stably fused in a manner that places the gene of interest under the control of the regulatory element contained within the pHOST vector.
  • this fusion event also occurs in a manner that retains the proper translational reading frame of the gene of interest.
  • the fusion or recombination event can be selected for by selecting for the ability of host cells, which do not express a trans-acting replication factor required for replication of a conditional origin contained on the pUNI vector, to acquire a selectable phenotype conferred by the selectable marker gene (if present) on the pUNI vector.
  • the pUNI vector cannot replicate in cells that do not express the trans-acting replication factor and therefore, unless the pUNI vector has integrated into the second vector that contains a non-conditional origin of replication, pUNI will be lost from the host cell.
  • the Univector Fusion System allows any number of expression or fusion constructs containing the gene of interest present on the pUNI vector to be made rapidly (e.g.. within a single day). Using conventional cloning or subcloning techniques which employ restriction enzyme digestion(s), the production of a single expression vector containing a gene of interest can take several days (i.e., for the design and construction of each expression vector). In contrast, with the methods and compositions of the present invention, once a battery of expression vectors modified to contain the appropriate sequence-specific recombinase target site is made, a gene of interest can be transferred to any number of expression vectors in an afternoon using the Univector Fusion System. For example, Figure 1 provides a schematic illustrating the straightforward recombination methods of the pUNI vectors and the Univector Fusion System.
  • the present invention further provides methods and compositions for directional subcloning of PCR fragments and other nucleic acid molecules into Univectors or other vectors and methods and compositions for generation of epitope tags and other fusions at the 3 * end of open reading frames using homologous recombination.
  • UPS can be used to fuse any coding region of interest either with a specific promoter to gain novel transcriptional regulation, with another coding sequence to produce a fusion protein with novel properties (e.g., an epitope tag for immunological detection or a DNA binding domain or transcriptional activation domain for two hybrid analysis), or with any other desired regulatory element.
  • the UPS eliminates the need for restriction enzymes, DNA ligases, and many in vitro manipulations required for subcloning.
  • the pUNI vector comprises a conditional origin of replication.
  • Conditional origins of replication are origins that require the presence or expression of a trans-acting factor in the host cell for replication.
  • a variety of conditional origins of replication functional in prokaryotic host ' s ⁇ g ⁇ E:- ' coli)-ar ⁇ k ii fl f ⁇ , the art.
  • the present invention is illustrated with, but not limited by, the use of the R6K ⁇ origin, oriR, from the plasmid R6K.
  • the R6K ⁇ origin requires a trans-acting factor, the ⁇ protein supplied by the pir gene [Metcalf et ⁇ l (1996) Plasmid 35: 1].
  • coli strains containing the pir gene will support replication of R6K ⁇ origins to medium copy number.
  • a strain containing a mutant allele of pir, pir- ⁇ ⁇ 6, will allow an even higher copy number of constructs containing the R6K ⁇ origin (i.e., 15 copies per cell for the wild type versus 250 copies per cell for the mutant).
  • This property may be useful when potentially toxic genes are manipulated, although the chances of expression of a toxic gene are low because, in preferred embodiments of the present invention, the Univector either contains no promoter or contains a promoter driving the neo gene which is transcribed in the opposite direction from the gene of interest.
  • E. coli strains that express the pir or pir- ⁇ ⁇ 6 gene product include BW18815 (ATCC 47079; this strain contains the /? r- 1 16 gene), BW19094 (ATCC 47080; this strain contains the pri gene), BW20978 (this strain contains the pir-l 16 gene), BW20979 (this strain contains the pir gene), BW21037 (this strain contains the pir- ⁇ 16 gene) and BW21038 (this strain contains the pir gene) (Metcalf et ⁇ l., supra).
  • conditional origins of replication suitable for use on the pUNI vectors of the present invention include, but are not limited to: 1 ) the RK2 oriV from the plasmid RK2 (ATCC 37125).
  • the RK2 oriV requires a trans-acting protein encoded by the trfA gene [Ayres et al. ( 1993) J. Mol. Biol. 230: 174]; 2) the bacteriophage PI ori which requires the repA protein for replication [Pal et al. (1986) J. Mol. Biol.
  • replication-thermosensitive plasmids such pSU739 or pSU300 which contain a thermosensitive replicon derived from plasmid pSClOl, rep pSC101 ts which comprises oriV [Mendiola and de la Cruz (1989) Mol. Microbiol. 3:979 and Francia and Lobo (1996) J. Bad. 178:894].
  • pSU739 and pSU300 are stably maintained in E. coli strain DH5 (Gibco BRL) at a growth temperature of 30°C (42°C is non-permissive for replication of this replicon).
  • Other conditional origins of replication including other temperature sensitive replicons, are known to the art and may be employed in the vectors and methods of the present invention.
  • Site-specific recombinases are enzymes that recognize a specific DNA site or sequence (referred to herein generically as a "sequence-specific recombinase target site") and catalyze the recombination of DNA in relation to these sites.
  • Site-specific recombinases are employed for the recombination of DNA in both prokaryotes and eukaryotes. Examples of site-specific recombination include, but are not limited to:
  • chromosomal rearrangements that occur in Salmonella typhimurium during phase variation, inversion of the FLP sequence during the replication of the yeast 2 ⁇ m circle, and in the rearrangement of immunoglobulin and T cell receptor genes in vertebrates, 2) integration of bacteriophages into the chromosome of prokaryotic host cells to form a lysogen, and 3) transposition of mobile genetic elements (e.g., transposons) in both prokaryotes and eukaryotes.
  • site-specific recombinase refers to enzymes that recognize short DNA sequences that become the crossover regions during the recombination event and includes recombinases. transposases, and integrases.
  • the present invention is illustrated with, but not limited by, the use of vectors containing lox sites (e.g., loxP sites) and the recombination of these vectors using the Cre recombinase of bacteriophage PL
  • the Cre protein catalyzes recombination of DNA between two loxP sites and is involved in the resolution of PI dimers generated by replication of circular lysogens [Sternberg et al. (1981) Cold Spring Harbor Symp. Quant. Biol. 45:297].
  • Cre can function in vitro and in vivo in many organisms including, but not limited to, bacteria, fungi, and mammals [Abremski et al. (1983) Cell 32:1301 ; Sauer (1987) Mol. Cell. Biol. 7:2087; and Orban et al. (1992) Proc. Natl. Acad. Sci. 89:6861].
  • a schematic for one embodiment of Cre-mediated plasmid fusion is shown in Figure 14.
  • the Univector pUNI
  • pHOST represents the recipient vector that contains the appropriate transcriptional and/or translational regulatory sequences that will eventually control the expression of the gene of interest.
  • a recombinant expression construct is made through Cre-/oxP-mediated site-specific recombination that fuses these two plasmids. This in vitro reaction generates a dimeric recombinant plasmid in which the gene of interest from pUNI is placed downstream of the promoter present on the host vector.
  • the recombinant plasmid in Figure 14 can be selected in a pir bacterial strain by selecting Kn r .
  • the loxP sites may be present on the same DNA molecule or they may be present on different DNA molecules; the DNA molecules may be linear or circular or a combination of both.
  • the loxP site consists of a double-stranded 34 bp sequence (SEQ ID NO: 12) which comprises two 13 bp inverted repeat sequences separated by an 8 bp spacer region [Hoess et al. (1982) Proc. Natl. Acad. Sci. USA 79:3398 and U.S. Patent No. 4,959,317. the disclosure of which is herein incorporated by reference].
  • the internal spacer sequence of the loxP site is asymmetrical and thus, two loxP sites can exhibit directionality relative to one another [Hoess et al.
  • Cre excises the DNA between these two sites leaving a single loxP site on the DNA molecule [Abremski et al. ( 1983) Cell 32: 1301 ]. If two loxP sites are in opposite orientation on a single DNA molecule, Cre inverts the DNA sequence between these two sites rather than removing the sequence. Two circular DNA molecules each containing a single loxP site will recombine with one another to form a mixture of monomer, dimer, trimer, etc. circles. The concentration of the DNA circles in the reaction can be used to favor the formation of monomer (lower concentration) or multimeric circles (higher concentration).
  • Circular DNA molecules having a single loxP site will recombine with a linear molecule having a single lox? site to produce a larger linear molecule.
  • Cre interacts with a linear molecule containing two directly repeating loxP sites to produce a circle containing the sequences between the loxP sites and a single loxP site and a linear molecule containing a single loxP site at the site of the deletion.
  • the Cre protein has been purified to homogeneity [Abremski et al. ( 1984) J. Mol.
  • Cre gene has been cloned and expressed in a variety of host cells [Abremski et al. (1983), supra].
  • Purified Cre protein is available from a number of suppliers (e.g., Novagen and New England Nuclear/DuPont). The Cre protein also recognizes a number of variant or mutant lox sites (variant relative to the loxP sequence), including the loxB, loxL and loxR sites which are found in the E. coli chromosome [Hoess et al. (1982), supra].
  • lox sites include /oxP51 1 [5"- ATAACTTCGTATAGTATACATTATACGAAGTTAT-3' (SEQ ID NO: 16); spacer region underlined; Hoess et al. (1986), supra], and /oxC2 [5'-ACAACTTCGTATA ATGTATGCTATACGAAGTTAT-3 ' (SEQ ID NO:17); spacer region underlined; U.S. Patent
  • Cre catalyzes the cleavage of the lox site within the spacer region and creates a six base-pair staggered cut [Hoess and Abremski (1985) J. Mol. Biol. 181 :351].
  • the two 13 bp inverted repeat domains of the lox site represent binding sites for the Cre protein. If two lox sites differ in their spacer regions in such a manner that the overhanging ends of the cleaved DNA cannot reanneal with one another, Cre cannot efficiently catalyze a recombination event using the two different lox sites.
  • Cre cannot recombine (at least not efficiently) a loxP site and a /oxP51 1 site; these two lox sites differ in the spacer region.
  • Two lox sites which differ due to variations in the binding sites i.e.. the 13 bp inverted repeats
  • Cre may be recombined by Cre provided that Cre can bind to each of the variant binding sites.
  • the efficiency of the reaction between two different lox sites (varying in the binding sites) may be less efficient that between two lox sites having the same sequence (the efficiency will depend on the degree and the location of the variations in the binding sites).
  • the /oxC2 site can be efficiently recombined with the lox?
  • site as these two lox sites differ by a single nucleotide in the left binding site.
  • site-specific recombinases may be employed in the methods of the present invention in place of the Cre recombinase.
  • Alternative site-specific recombinases include, but are not limited to:
  • the FLP recombinase of the 2 ⁇ plasmid of Saccharomyces cerevisiae [Cox (1983) Proc. Natl. Acad. Sci. USA 80:4223] which recognizes the frt site.
  • the frt site comprises two 13 bp inverted repeats separated by an 8 bp spacer [5'-GAAGTTCCTATTCTCTAGAAAGTATAGGAACTTC-3 ' (SEQ ID NO: 18); spacer underlined].
  • the FLP gene has been cloned and expressed in E. coli (Cox, supra) and in mammalian cells (PCT International Patent Application PCT/US92/01899, Publication No.: WO 92/15694.
  • pHOST vectors are generally expression vectors (e.g., plasmids) which have been modified by the insertion of a sequence- specific recombinase target site (e.g., a lox site).
  • the pHOST can comprise any regulatory sequence desired for manipulation of nucleic acids.
  • the pHOST vector may encode a protein domain such as an affinity domain including, but not limited to, glutathione-S-transferase (Gst), maltose binding protein (MBP). a portion of staphylococcal protein A (SPA), a polyhistidine tract, etc.
  • Gst glutathione-S-transferase
  • MBP maltose binding protein
  • SPA staphylococcal protein A
  • the affinity domain may be located at either the amino- or carboxy-terminus of the fusion protein.
  • a sequence-specific recombinase target site is placed after (i.e., downstream of) the start of transcription in the host vector.
  • sequence-specific recombinase target site in the correct reading frame such that: 1) an open reading frame is maintained through the sequence-specific recombinase target site on pHOST, and 2) the open reading frame in the sequence-specific recombinase target site on pHOST is in frame with the open reading frame found on the sequence-specific recombinase target site contained within the pUNI vector.
  • the oligonucleotide comprising the sequence-specific recombinase target site on pHOST is designed to avoid the introduction of in-frame stop codons.
  • the gene of interest contained within the pUNI vector is cloned in a particular reading frame so as to facilitate the creation of the desired fusion protein.
  • pHOST vectors The modification of several expression vectors is provided in the examples below to illustrate the creation of suitable pHOST vectors.
  • approximately 40 pHOST vectors have been generated, including GST expression vectors, yeast GAL1 expression vectors, mammalian CMV expression vectors, and baculovirus expression vectors. In each case, expression was at or near the levels achieved by conventional cloning.
  • a general strategy for generating any pHOST of interest involves the generation of a linker containing the desired sequence- specific recombinase target site (e.g., a lox site such as loxP or loxH) by annealing two complementary oligonucleotides.
  • the annealed oligonucleotides form a linker having sticky ends that are compatible with ends generated by restriction enzymes whose sites are conveniently located in the parental expression vector (e.g., within a polylinker of the parental expression vector).
  • any vector can be easily adapted for use with the UPS method.
  • the fusion of a pUNI vector and a pHOST vector may be accomplished in vitro using a purified preparation of a site-specific recombinase (e.g., Cre recombinase).
  • a site-specific recombinase e.g., Cre recombinase
  • the pUNI vector and the pHOST vector are placed in reaction vessel (e.g. , a microcentrifuge tube) in a buffer compatible with the site-specific recombinase to be used.
  • reaction buffer may comprise 50 mM Tris-HCl (pH 7.5), 10 mM MgCl 2 , 30 mM NaCl and 1 mg/ml BSA.
  • the reaction buffer may comprise 50 mM Tris-HCl (pH 7.4), 10 mM MgCL, 100 ⁇ g/ml BSA [Gronostajski and Sadowski, supra].
  • the concentration of the pUNI vector and the pHOST vector may vary between 100 ng to 1.0 ⁇ g of each vector per 20 ⁇ l reaction volume with about 0.1 ⁇ g of each nucleic acid construct (0.2 ⁇ g total) per 20 ⁇ il reaction being preferred.
  • the concentration of the site-specific recombinase may be titered under a standard set of reaction conditions to find the optimal concentration of enzyme to be used as described in Example 4.
  • the host cell employed will not express the transacting factor required for replication of the conditional origin of replication contained within the pUNI vector (or alternatively the host cell will be grown at a temperature which is non- permissive for replication of a temperature sensitive replicon contained within the pUNI vector).
  • the host cells will be grown under conditions that select for the presence of the selectable marker contained within the pUNI vector (e.g., growth in the presence of kanamycin when the pUNI vector contains a kanamycin resistance gene).
  • Plasmid or non- chromosomal DNA is isolated from host cells which display the desired phenotype and subjected to restriction enzyme digestion to confirm that the desired fusion event has occurred. e) Recombination in Prokaryotic Host Cells
  • the fusion of a pUNI vector and a pHOST vector may be accomplished in vivo using a host cell that expresses the appropriate site-specific recombinase (e.g., Cre recombinase).
  • a host cell that expresses the appropriate site-specific recombinase (e.g., Cre recombinase).
  • the host cell may express the recombinase as part of its genomre or may be supplied with means for expressing the recombinase (e.g., a recombinase expression vector).
  • a recombinase expression vector e.g., a recombinase expression vector.
  • the host cell employed lack the ability to express the trans-acting factor required for replication of the conditional origin of replication (or alternatively the host cell will be grown at a temperature which is non-permissive for replication of a temperature sensitive replicon contained within the pUNI vector).
  • the pUNI vector and the pHOST vector are cotransformed into the host cell using a variety of methods known to the art (e.g., transformation of cells made competent by treatment with CaCL, electroporation, etc.).
  • the cotransformed host cells are grown under conditions that select for the presence of the selectable marker contained within the pUNI vector (e.g., growth in the presence of kanamycin when the pUNI vector contains the kanamycin resistance gene).
  • Plasmid or non-chromosomal DNA is isolated from host cells which display the desired phenotype and subjected to restriction enzyme digestion to confirm that the desired fusion event has occurred.
  • UPS results in the fusion of two plasmids and is suitable for the vast majority of expression needs.
  • the size of the recombinant molecule is limiting (e.g., in the generation of retrovirus or adeno-associated viral [AAV] expression constructs)
  • a second recombination event is utilized. In some embodiments of the present invention, this second recombination is catalyzed by the R recombinase [Araki et al. (1992) J. Mol. Biol.
  • a standard UPS method is utilized to generate a dimer containing the entire pUNI and pHOST vectors, followed by a reaction with the second recombinase that excises the unwanted portions of the Univector.
  • host cells or reaction conditions can be applied that allow both recombination reactions to occur in a single step (See Example 9).
  • Cells containing the desired recombinant product can be selected for by using selectable markers, and/or conditional origins of replication.
  • UPS greatly facilitates the generation of fusion proteins at the N-terminus of the protein of interest
  • it is often necessary to modify proteins on the C-terminus e.g.. to add an epitope tag.
  • the present invention takes advantage of E. coifs endogenous homologous recombination system. It has been shown [Winans et al.
  • the present invention provides means to overcome these problems and to provide for effective cloning and recombination (e.g., with the UPS).
  • the present invention provides BUNIO, a recBCsbcBhsdR strain expressing pir- 1 16.
  • the hsdR mutation prevents restriction of nucleic acid (e.g.. PCR amplified DNA) by the endogenous restriction modification system of E. coli.
  • this system was tested using a 3xMYC epitope tag and the SKP gene in pUNI-10 as the recipient.
  • pML74 which is pUNI-Amp containing a triple (3x) MYC epitope tag followed by a stop codon, was used as template DNA for PCR amplification with two primers, A and B.
  • Primer A (SEQ ID NO:30) is 71 nt long, the first 50 nt of which correspond to the last 50 nt of the SKPl coding region and the last 21 nt. the 3 * end of the primer, correspond to the first 21 nt of the DNA encoding the 3xMYC tag.
  • the reading frames of SKPl and the 3xMYC tag are in register.
  • Primer B (SEQ ID NO:31 ) is 22 nt long and recognizes a site on pML74 common to pUNI vectors that begins 367 bp from the polylinker region. Amplification using primers A and B and pML74 as a template generated a fragment of DNA with 50 bp homology to the Univector. This amplification product was co-transformed with BamHl-Sacl -cleaved pUNI-SKPl into BUN10 cells and Kn r transformants were selected and analyzed by restriction mapping. Homologous recombination events are selected because they allow the recircularization of the linearized vector. A schematic representation of this method is provided in Figure 25.
  • any nucleic acid sample with sufficient sequence complementarity can be used.
  • the sample to be inserted could be artificially synthesized or prepared by any other means.
  • the recombination event can be designed to occur at any desired location on any desired recipient vector (i.e.. is not limited to the production of 3 " gene fusions).
  • the present invention provides a method for directional subcloning into vectors (e.g. , pUNI derivatives) that relies upon the generation of a reconstituted regulatory element from two partial sites located on the fragment to be cloned and the recipient vector, respectively.
  • a linear nucleic acid molecule to be inserted into a vector can be designed with a portion of a promoter at its 3 " or 5 * ends.
  • the recipient vector is then designed with the remainder of the promoter, arranged such that, when the cloned fragment is inserted in the desired direction, an intact promoter is reconstituted and provides a means of detecting the successful directional cloning event.
  • reconstituted regulatory elements can be employed to achieve detectable directional cloning.
  • reconstituted regulatory elements that find use with the present invention include, but are not limited to, promoters, repressors. operators, enhancers, enzyme recognitions sites, selectable markers, and conditional origins of replication, among others.
  • the reconstituted regulatory element may comprise a negative selection capability, such that fragments cloned in an undesired
  • Figure 22A shows a schematic representation of normal conditions in the absence of inducer (left diagram) where lacR is bound to the lac operator sites in front of lacZ and represses transcription. In the presence of high copy number plasmid containing the lacO sequence (right diagram), LacR repressors are titrated out by binding to plasmid borne lacO sites and the endogenous lacZ gene is expressed.
  • FIG. 22B shows this strategy, whereby primer A (5') and B (3') are used to amplify the gene of interest.
  • the 5' end of primer B contains a half lacO site which subsequently becomes the 3 -end of the PCR fragment indicated in the Figure.
  • the PCR fragment was Hgated into Eco47III-cleaved pUNI-30 and transformed into BUN10, a Lac" E. coli strain, and Kn r colonies were selected on plates containing X-gal. Plasmids containing SKPl in the proper orientation were identified by their dark blue color (shown by arrows in Figure 22C). Reclosure of the vector without insert as well as the presence of the PCR fragment in the incorrect orientation result in the production of white or pale blue colonies. Ten out of 10 dark blue colonies contained SKPl in the correct orientation. In particularly preferred embodiments, phosphorylated PCR primers are used. In most embodiments in which Taq polymerase is used, the material is preferably treated briefly with T4 polymerase and dNTPs to remove the 3' overhangs generated.
  • the Univector Fusion System permits the rapid exchange of an entire cDNA library to a variety of expression vectors. This capability to essentially transform one library into many libraries is one of the most significant advances made possible by the UPS methods provided by the present invention.
  • the high efficiency of the in vitro UPS reaction i.e., a minimum of 16.8% coupled with the extremely high efficiency of modern transformation methods makes possible the conversion of whole cDNA libraries constructed in the Univector into expression libraries without loss of representation.
  • single cDNA libraries will be converted into any of a number of different expression libraries such as those used in the two hybrid systems [Durfee et al. (1993) Gene. & Dev. 7:55; and Aronheim et al. (1997) Mol. Cell. Biol.
  • the present invention provides methods such that libraries made for one purpose will no longer need to be remade from scratch when needed in a different context; clones isolated from these libraries are easily converted back into simple Univector plasmids compatible with other pHOST vectors for future analysis.
  • the cDNA library is generated using a pUNI vector as the cloning vector (a pUNI library).
  • the entire library may then be transferred (using either an in vitro or an in vivo recombination reaction) into any expression vector modified to contain a sequence-specific recombinase target site (e.g., a lox site) (i.e., into a pHOST vector).
  • a sequence-specific recombinase target site e.g., a lox site
  • Example 10 provides an illustration of such capabilities using methods of the present invention.
  • sequences contained within a pUNI library can be used to recombine with linear ⁇ constructs (which can then be used to isolate specific genes by complementation of appropriate host cell such as E. coli or S. cerevisiae mutant cells).
  • UPS is compatible with the ⁇ YES series of lambda cloning vectors that use cre-lox recombination to convert phage clones into plasmids.
  • These vectors are capable of making extremely large cDNA libraries (i.e., greater than 10 8 recombinants per 100 ng of cDNA) and. unlike plasmid libraries, can be propagated with minimal loss of representation.
  • the in vivo gene trap method a variation of the Univector Fusion System, can be used to transfer linear DNA fragments that lack a selectable marker, such as a PCR product, into a variety of expression vectors.
  • °C degrees Centigrade
  • g gravitational field
  • vol volume
  • DNA deoxyribonucleic acid
  • RNA ribonucleic acid
  • kdal or kD kilodaltons
  • OD optical density
  • EDTA ethylene diamine tetra-acetic acid
  • coli Esscherichia coli: SDS (sodium dodecyl sulfate): PAGE (polyacrylamide gel electrophoresis); ts (temperature sensitive); p (plasmid); LB (Luria- Bertani medium: per liter: 10 g Bacto-tryptone, 5 g yeast extract, 10 g NaCl, pH to 7.5 with NaOH); ml (milliliter); ⁇ l (microliter); M (Molar); mM (millimolar); ⁇ M (microMolar); g (gram); ⁇ g (microgram); ng (nanogram); U (units), mU (milliunits); min. (minutes); sec.
  • Univector constructs are provided.
  • the map for several Univectors is shown in Figure 23, showing pUNI-10, pUNI-20, and pUNI-30.
  • nucleotide positions (in parentheses) of unique restriction enzyme cleavage sites are shown.
  • Functional sequences are shown as filled boxes and are labeled inside of the circle. Boxes with arrows are genes transcribed in the direction of the arrow.
  • Below each map is the sequence of the polylinker region displayed as coding triplets in frame with the open reading frame of loxP. Unique restriction enzyme cleavage sites are in bold.
  • Univectors include a loxP site placed adjacent to the 5' end of a polylinker for insertion of cDNAs.
  • loxP has a single open reading frame that is in frame with the ATG of the Ndel and Ncol sites of the polylinker. This facilitates the subsequent generation of protein fusions as noted below.
  • Following the polylinker are bacterial and eukaryotic transcriptional terminators to facilitate 3' end formation of transcripts.
  • the Univectors also comprise a conditional origin or replication derived from R6K ⁇ that allows their propagation only in bacterial hosts expressing the pir gene originally from R6K ⁇ [Metcalf et al ( 1994) Gene 138: 1].
  • the Univectors also have the neo gene from Tn5 for selection in bacteria (e.g., selection of recombinant products of UPS is achieved by selecting for kanamycin resistance after transformation into a pir " strain because the neo gene on the pUNI can only be propagated when covalently linked to an origin or replication that is functional in a pir background).
  • pUNI-20 contains additional site specific recombination sites, such as RS. that facilitate precise ORF transfer (POT), as described below.
  • One Univector construct the pUNI-10 vector, contains a loxP site, a kanamycin resistance gene (Kn R ) and the R6K ⁇ conditional origin of replication (OriR R6Ky ).
  • Kn R kanamycin resistance gene
  • OriR R6Ky conditional origin of replication
  • OriR R6K ⁇ is functional only in E. coli strains expressing the FI replication protein (i.e.. the product of the pir gene).
  • a gene of interest is placed within pUNI-10 (either as a result of constructing a library in pUNI-10 or by subcloning a previously cloned gene of interest). Once the gene of interest is contained within pUNI-10. Any number of plasmid expression constructs containing this gene of interest can be constructed rapidly (e.g. within a single day).
  • the expression constructs will contain an antibiotic resistance gene other than kanamycin (e.g., ampicillin).
  • coli cells that do not express the IT protein to grow in the presence of kanamycin cannot replicate in E. coli cells that do not express the FI protein unless pUNI has fused or integrated into another plasmid that contains a normal (i.e., not a conditional) origin of replication (e.g., the Col ⁇ l origin). In this case, pUNI will be replicated (as part of the fusion plasmid) and kanamycin resistance will be conferred on the host cell.
  • a normal origin of replication e.g., the Col ⁇ l origin
  • Figure 2A provides a schematic map of the pUNI-10 vector; the locations of selected restriction enzyme sites are indicated (with the exception of Notl, all sites shown are unique).
  • Figure 2B shows the D ⁇ A sequence of the loxP site and the polylinkers contained within pU ⁇ I-10 (i.e., nucleotides 401-530 of S ⁇ Q ID ⁇ O: l).
  • Nucleotides 1-400 of pUNI-10 contain the conditional origin of replication from R6K ⁇ (OriR R6K ⁇ ); the OriR R6Kv was derived from the plasmid R6K (ATCC 37120) [Metcalf et al.
  • nucleotides 401-414 comprise a Notl-Kpnl polylinker that facilitates the exchange of lox sites
  • pUNI-10 contains a wild-type loxP site (as discussed above, pUNI vectors containing modified lox sites may be employed).
  • Nucleotides 415-448 comprise the wild-type loxP site;
  • nucleotides 449-527 comprise a polylinker used for the insertion of the gene of interest (genomic or cDNA sequences).
  • Nucleotides 528-750 contain the polyA addition sequence from bovine growth hormone (BGH) (the BGH polyA sequence is available on a number of commercially available vectors including pcDNA3.1 (Invitrogen)); the BGH polyA sequence provides a 3' end for transcripts expressed in mammalian and other eukaryotic cells.
  • BGH bovine growth hormone
  • the art is aware of other eukaryotic polyA sequences that may be used in place of the BGH polyA sequence (e.g., the SV40 poly A sequence, the TK polyA sequence, etc.).
  • Nucleotides 751 -890 contain the T7 terminator sequence which is used to terminate transcription in prokaryotic hosts (numerous prokaryotic termination signals are known to the art and may be employed in place of the T7 terminator sequence).
  • Nucleotides 890-895 comprise an EcoRV restriction enzyme recognition site and nucleotides 896-2220 comprise the kanamycin resistance gene (Kan or Kn R ) from Tn5 which provides a positive selectable marker.
  • Kn R gene found on pUNI-10 was modified using site-directed mutagenesis to remove the naturally occurring Ncol site such that pU ⁇ I-10 contains a unique Ncol site in the polylinker region located at nucleotides 449-527.
  • pU ⁇ I vectors need not contain a Kn R gene (modified or wild-type); other selectable genes may be used in place of the Kn R gene (e.g., ampicillin resistance gene, tetracycline resistance gene, zeocinTM resistance gene. etc.).
  • the pU ⁇ I vector need not contain a selectable marker, although the use of a selectable marker is preferred. When a selectable marker is present on the pU ⁇ I vector, this marker is preferably a different selectable marker than that present on the pHOST vector.
  • the nucleotide sequence of pU ⁇ I-10 is provided in S ⁇ Q ID ⁇ O:l .
  • pHOST plasmids Host plasmids used in the Univector plasmid fusion system are referred to as pHOST plasmids.
  • pHOST plasmids or vectors are generally expression vectors that have been modified by the insertion of a site-specific recombination site, such as a lox site. The presence of the lox site on the pHOST plasmid permits the rapid subcloning or insertion of the gene interest contained within a pUNI vector to generate an expression vector capable of expressing the gene of interest.
  • the pHOST vector may encode a protein domain such as an affinity domain including, but not limited to, glutathione-S-transferase (Gst).
  • MBP maltose binding protein
  • SPA staphylococcal protein A
  • polyhistidine tract etc.
  • affinity domains A variety of commercially available expression vectors encoding such affinity domains are known to the art.
  • the pHOST plasmid contains a vector-encoded affinity domain
  • a fusion protein comprising the vector-encoded affinity domain and the protein of interest is generated when the pUNI and pHOST vectors are recombined.
  • the host vector features include the Col El origin of replication and the bla gene for propagation and selection in bacteria, a loxP site for plasmid fusions and a specific promoter residing upstream of, and adjacent to, the lox? site.
  • Host vectors may also comprise sequences responsible for propagation, selection, and maintenance in organisms other than E. coli.
  • a lox site is placed after (i.e., downstream of) the start of transcription in the host vector. This is easily accomplished using synthetic oligonucleotides comprising the desired lox site. In designing the oligonucleotide comprising the lox site, care is taken to avoid introducing an ATG or start codon that might initiate translation inappropriately.
  • the modification of several expression vectors is provided below to illustrate the creation of suitable pHOST vectors.
  • the general strategy involved the generation of a linker containing a lox site by annealing two complementary oligonucleotides.
  • the annealed oligonucleotides form a linker having sticky ends that are compatible with ends generated by restriction enzymes whose sites are conveniently located in the parental expression vector (e.g., within the polylinker of the parental expression vector).
  • pGEX-2TKcs is an expression vector active in E. coli cells which is designed for inducible, intracellular expression of genes or gene fragments as fusions with Gst.
  • 2TKcs contains the IPTG-inducible tac promoter (P lac ) and was derived from pGEX-2TK (Pharmacia Biotech) as follows.
  • a linker containing a loxP site was generated by annealing the following oligonucleotides: 5'-CATGGCTATAACTTCGTATAGCATACATTATACGAA GTTATG-3' (SEQ ID NO:4) and 5'-GATCCATAACTTCGTATAATGTATGCTATACGAAGTTAT AGC-3 * (SEQ ID NO:5).
  • these two oligonucleotides form a double-stranded linker having a 5' end compatible with an Ncol sticky end and a 3' end compatible with a BamHl sticky end (Figure 3A).
  • pGEX-2TKcs was digested with Ncol and BamHl ( Figure 3B) and the annealed loxP linker was inserted to form pGst-/ox.
  • pVL1392 Baculovirus Expression Vector pVL1392 is an expression vector that contains the polyhedrin promoter which is active in insect cells (Pharmingen).
  • a linker containing a loxP site was generated by annealing the following oligonucleotides: 5'-GGCCGGACGTCATAACTTCGTATAGCATACATTATAC GAAGTTATG-3' (SEQ ID ⁇ O:6) and 5'-GATCCATAACTTCGTATAATGTATGCTATAC GAAGTTATGACGTCC-3 " (SEQ ID NO:7).
  • pGAP24 is an expression vector that is based on the yeast 2 ⁇ m circle and contains the constitutive GAP (glyceraldehyde 3-phosphate dehydrogenase) promoter (P (I ⁇
  • GAP glycose promoter
  • a linker containing a loxP site was generated by annealing the following oligonucleotides: 5'-TCGAGACGTCATAACTTCGTATAGCATACATTAT ACGAAGTTATGC-3' (SEQ ID ⁇ O:8) and 5'-GGCCGCATAACTTCGTATAATGT ATGCTATACGAAGTTATGACGTC-3' (SEQ ID NO:9). When annealed, these two oligonucleotides form a double-stranded linker having a 5' end compatible with a Xhol sticky end and a 3' end compatible with a Notl sticky end (Figure 5A).
  • pGAP24 was digested with Xhol and Notl ( Figure 5B) and the annealed loxP linker was inserted to form pGAP24-/ox.
  • pGAL14 is a yeast centromeric expression vector that contains the GAL promoter
  • P GAL which is induced by the presence of galactose in the medium, and the TRPl gene.
  • a linker containing a loxP site was generated by annealing together the oligonucleotides listed in SEQ ID ⁇ OS:8 and 9. When annealed, these two oligonucleotides form a double-stranded linker having a 5' end compatible with a Xhol sticky end and a 3' end compatible with a Notl sticky end ( Figure 6A).
  • pGAL14 was digested with Xhol and Notl ( Figure 6B) and the annealed loxP linker was inserted to form pGAL14-/ox.
  • the cre gene was inserted into a Gst expression vector such that a fusion protein comprising Gst at the amino-terminal end and Cre recombinase at the carboxy-terminal end was produced.
  • the Gst-Cre fusion protein was purified by chromatography using Glutathione Sepharose 4B (Pharmacia). Purified Gst-Cre can be stored at -80"C. -20°C, or 4°C for several months without significant loss of activity.
  • a plasmid expressing a GST-cre fusion protein was constructed, pQL123.
  • the cre gene was isolated by polymerase chain reaction (PCR) amplification using the plasmid pBS39 (U.S. Patent 4,959,317).
  • U.S. Patent ⁇ os. 4,683,195, 4,683,202 and 4,965,188 describe PCR methodology and are incorporated herein by reference.
  • the primers used in the PCR were designed to introduce an Ncol site at the first ATG in the cre open reading frame.
  • PCR product was cloned into a TA cloning vector (pCRII.l ; Invitrogen) and then was subcloned as an NcoI-EcoRI fragment into pG ⁇ X-2TKcs (Example
  • the ligation products were used to transform DH5 ⁇ cells and the desired recombinant was isolated and used to transform BL21(DE3) cells (Invitrogen).
  • the nucleotide sequence of the Gst-Cre coding region within pQL 123 is listed in SEQ ID ⁇ O: 10.
  • the amino acid sequence of the fusion protein expressed by pQL123 is listed in SEQ ID NOT L
  • BL21(DE3) cells containing the pQL123 plasmid were grown at 37°C in LB containing 100 ⁇ g/ml ampicillin until the OD 6 ⁇ reached 0.6. Expression of the fusion protein was then induced by the addition of IPTG to a final concentration of 0.4 mM and the cells were allowed to grow overnight at 25°C. Following induction, the bacterial cells were pelleted by centrifugation at 5.000 x g at 4°C and the supernatant was discarded. A cell lysate was prepared as follows.
  • Cells harvested from 0.5 liter of culture were suspended in 35 ml of a solution containing 20 mM Tris-HCl, pH 8.0, 0.1 M NaCl, 1 mM EDTA, 0.5% Nonidet P-40, 5 ⁇ g/ml of each of leupeptin. antipain. aprotinin and 1 mM PMSF at 4°C.
  • the cells were incubated ice and disrupted by sonication (3 x 15 sec bursts) using a sonicator (Ultrasonic Heat Systems Model 200R) at full power.
  • the lysate was then clarified by centrifugation at 12.000 rpm using a SS34 rotor (Sorvall).
  • the Gst-Cre fusion protein was affinity purified from the cell lysate by chromatography on Glutathione Sepharose 4B (Pharmacia) according to the manufacturer's instructions. The protein concentration of Gst-Cre was determined by Bradford analysis (BioRad).
  • Gst-Cre retained high recombinase activity as measured by UPS. The efficiency of this reaction reached up to 16.8% as shown in Figure 15, similar to that for native Cre (Abremski et al, supra).
  • the indicated amounts of Gst-Cre were incubated with pUNI-10 and pQL103 plasmid DNA as described below. Percentage of recombinants were calculated by measuring the ratio of total kanamycin resistant transformants (fusion events between pUNI-10 and pQL103) relative to total ampicillin resistant transformants (pQL103 alone and pUNI-10-pQL103 fusions).
  • FIG. 8 provides a schematic showing the strategy employed for in vitro recombination.
  • pA represents a generic pUNI vector that contains a loxP site, a kanamycin resistance gene and the conditional R6K origin that is only functional in E. coli strains expressing the IT protein (e.g., E. coli strains BW18815, BW19094, BW20978. BW20979, BW21037. BW21038).
  • pB represents a generic pHOST vector that contains a loxP site, an ampicillin resistance gene and a Col El origin of replication.
  • pAB represents the fused plasmid which results from the Cre-mediated fusion of pA and pB.
  • pUNI-5 (a pUNI vector which differs from pUNI-10 only in that pUNI-5 retains the Ncol site in the Kn R gene and contains a different polylinker) was employed as pA and pQL103, an ampicillin-resistant plasmid containing a loxP site and the CoIEl origin, was employed as pB.
  • pA and pQL103 an ampicillin-resistant plasmid containing a loxP site and the CoIEl origin
  • the amount of purified Gst-Cre was varied from 0 to 1.0 ⁇ g.
  • the reactions were incubated at 37°C for 20 minutes and then the reactions were placed at 70°C for 5 min. to inactivate the Gst-Cre protein.
  • Five microliters of each reaction mixture were used directly to transform competent DH5 ⁇ cells (CaCl 2 treated).
  • the transformed cells were plated onto LB/Amp ( 100 ⁇ g/ml amp) and LB/Kan (40 ⁇ g/ml kan) plates and the number of ampicillin resistant (Ap R ) and kanamycin-resistant (Kn R ) colonies were counted.
  • the results are summarized in Table 1.
  • Example 4 it was demonstrated that the Univector Plasmid Fusion System can be used to rapidly fuse two plasmid constructs together in vitro.
  • a series of expression plasmids were made by UPS and tested for expression in several contexts.
  • Example 2 using the in vitro reaction described in Example 4; 0.2 ⁇ g of Gst-Cre was used per 20 ⁇ l reaction.
  • the resulting plasmid fusions were termed pGAP24-Skpl and pGAL14- Skpl .
  • pGAP24-Skpl and pGAL14-Skpl were then transformed into the temperature sensitive (ts) skpl-11 mutant yeast strain Y555 (Bai et al, supra) and the transformed yeast cells were plated onto SC-tryptophan plates (to select for the expression of the selectable marker TRPl) and incubated at either a permissive (25°C) or non-permissive temperature (37°C).
  • the plates which received yeast cells transformed with pGAL14-Skpl contained galactose.
  • the ability of the transformed cells to grow at the non-permissive temperature is dependent upon the expression of the wild-type skpl gene encoded by a properly fused pUNI-Skpl /expression vector construct.
  • the yeast SKPl genomic clone contained in a URA3 CEN vector was used to transform the ts skpl -11 mutant yeast strain Y555 and the transformed cells were also plated at 25°C and 37°C.
  • an expression vector e.g., pRS414 or pRS415; Bai et al, supra
  • the same selectable marker i.e., TRPl
  • TRPl selectable marker
  • FIG. 9A provides a schematic showing the starting constructs and the predicted fusion construct. Five microliters of the fusion reaction mixture was used transform DH5 ⁇ cells as described in Example 4. The transformed cells were plated onto LB/Amp/Kan plates and plasmid D ⁇ A was isolated from individual Ap R Kn R colonies.
  • plasmid D ⁇ As were digested with Pstl followed by electrophoresis on agarose gels to examine the structure of the fused plasmids.
  • a representative ethidium bromide- stained gel is shown in Figure 9B.
  • lane "M” contains DNA size markers
  • lanes pUNI-Skpl and pGst-/ox contain the starting plasmids digested with .vtl
  • lanes 1-12 contain plasmid DNA from individual Ap R Kn R colonies digested with Pstl.
  • Lanes marked with an "*" indicate that these colonies contained a trimeric fusion plasmid that resulted from the fusion of two Gst-/ox plasmids and one pUNI-Skpl plasmid.
  • the sizes of the two Pstl fragments which result from the fusion of pUNI-Skpl and pGst-/ox in kb are indicated (5.8 and 2.0 kb).
  • the results shown in Figure 9B demonstrate that the in vitro fusion reaction resulted in the production of the desired fused construct with high efficiency (about 83% of the plasmids in the Ap R Kn R colonies comprised the fusion of one pUNI-Skpl vector with one pGst-/ox vector).
  • lane "M” contains protein molecular weight markers (size in kd is indicated).
  • Lanes marked “C” contain extracts prepared from E. coli containing a GST-SKP1 construct made by conventional cloning (i.e., the SKPl cDNA was excised using restriction enzymes and inserted into pGEX-2TKcs (Example 2)).
  • Lanes 1-3 contain extracts from Ap R Kn R cells transformed with in vitro fusion reaction mixtures. Extracts prepared from uninduc ⁇ d cells and IPTG induced cells are indicated by "-” and "+", respectively.
  • the arrowheads indicate the location of the Gst-Skpl fusion proteins.
  • the Gst-Skpl fusion product generated from the pGST-S J*/ fusion construct contains 15 additional amino acids which are located between the Gst domain and the Skpl protein sequences relative to the Gst- Skpl fusion protein expressed from the conventionally constructed GST-SKPI plasmid (the additional 15 amino acids are encoded by the linker comprising the lox? site; see Figure 3).
  • the lane designations are the same as described for Figure 10A. This Western blot confirms that the bands indicated by the arrowheads in Figure 10A represent Gst-Skpl fusion proteins.
  • Univector Fusion System can be used to recombine two plasmids, one containing a gene of interest but no promoter (this vector may optionally contain expression signals such as termination signals and/or polyadenylation signals) and the other containing a promoter and optionally other expression signals (e.g.. splicing signals, translation initiation codons) (and optionally sequences encoding an affinity domain) but lacking a gene of interest, in vitro in such a manner that the proper translational reading frame is maintained permitting the expression of a functional protein from the fused plasmids in the host cell.
  • this vector may optionally contain expression signals such as termination signals and/or polyadenylation signals
  • other expression signals e.g. splicing signals, translation initiation codons
  • the S. cerevisiae SKPl ORF (Bai et al., supra) in pUNI-10 was fused to the pGST- lox host vector pHB2-GST by UPS to create a bacterial Gst-lox-Skpl fusion protein expressed under the control of the E. coli tac promoter.
  • the SKPl ORF was placed under the control of the S. cerevisiae GALl promoter both by conventional means and by UPS.
  • UPS the relative expression level of the UPS-derived plasmid was slightly lower.
  • This reduction in expression might be explained by the ability of loxP RNA to form a 13 bp stem-loop, as secondary structures formed within the 5' UTR of an mRNA can interfere with the initiation of translation [Kozak (1989) Mol Cell. Biol. 9:5134], although an understanding of the mechanism is not required to practice the present invention, and the present invention is not limited to any particular mechanistic explanation.
  • a series of lox sites were made containing mutations designed to reduce the stability of the stem-loop, as described in Example 8.
  • Protein extracts were prepared from Y80 cells grown in SC-ura plus galactose containing the following plasmids: vector alone (lane 1), pMH176 (GAL-MYC3-RNR4) made by conventional cloning lacking a lox sequence (lane 2), UPS-derived GAL-lox-MYC3-RNR4 constructs with either loxP (lane 3) or loxH (lane 4) present between the GALl promoter and the MYC3-RNR4 gene, vector alone (lane 5), and UPS-derived GALl-MYC3-lox-RAD53 construct (lane 6).
  • the recipient vector for RAD53 was pHY314-MYC3.
  • FIG. 18 shows the expression of the UPS-derived baculovirus expression constructs in insect cells. UPS reactions were performed between pUNI-10-RAD53 clones and baculovirus expression vectors in pVL1392 backbones engineered to contain lox sites and epitope tags.
  • Host insect expression vectors used were pHHOO-GST, pHI100-MYC3, and pHI100-HA3 and the resulting fusion plasmids were crossed onto Baculogold (Pharmingen) by standard methods.
  • GST affinity purified protein from lysates from 1 million cells infected with baculovirus expressing either GST-
  • RAD53 made by conventional cloning (lane 1) or UPS (lane 2) were fractionated on a SDS- PAGE and Coomassie stained.
  • Western blots of protein prepared from cells infected with the baculoviruses containing vector alone (lane 3), UPS-derived MYC3-lox-RAD53 (lane 4), vector alone (lane 5), or UPS-derived HA3-lox-RAD53 (lane 6) were probed with anti-Myc (lanes 3-4) or anti-HA (lane 5-6) monoclonal antibodies.
  • the present invention demonstrated expression of a Myc-tagged F-box protein under the control of the CMV promoter when transfected into Hela cells as shown in Figure 19.
  • This figure shows immunoblotting of whole cell lysates with anti-HA antibodies.
  • the cells used were Hela cells transfected by the calcium phosphate method with the CMV expression vectors pHM200-HA3 or pHM200-HA3-F3, expressing an HA-tagged F-box protein.
  • over 200 UPS derived constructs have been made and tested, showing expression success rates indistinguishable from those of conventional cloning methods.
  • An E. coli strain containing a cre gene under the control of an inducible promoter termed the QLB4 strain, was constructed as follows.
  • the cre gene was placed under the transcriptional control of the inducible lac promoter by inserting the cre ORF into a derivative of pNN402 [Elledge et al. (1991) Proc. Natl. Acad. Sci USA 88: 1731 ]; pNN402 was modified to contain a lac promoter.
  • This construct was then crossed onto lambda phage (e g-, ⁇ gtl l) using conventional techniques.
  • the recombinant lambda phage carrying the lac- cre gene was integrated into the chromosome of E. coli strain JM107 to generate the QLB4 strain.
  • Cre recombinase was induced by growing QLB4 cells at 37°C until an OD ⁇ 0 of 0.6 was reached. The culture was then split into 2 parts and IPTG was added to one part to a final concentration of 0.4 mM.
  • the BNN132 strain ATCC 47059;
  • Figure 1 1 shows a Western blot containing extracts prepared from (shown left to right) BNN123 cells grown in the absence of IPTG (“C”) and QLB4 cells grown in the absence (“QLB4 -”) and presence of IPTG (“QLB4 +”), respectively.
  • C IPTG
  • QLB4 + IPTG
  • Cre protein was expressed constitutively at very low levels in both BNN132 cells and QLB4 cells in the absence of IPTG.
  • a pUNI vector (Kn R ) and a pHOST vector (Ap R ) were cotransformed into QLB4 cells and the transformed cells were grown on plates containing kanamycin to select for the presence of the pUNI-pHOST fusion plasmid.
  • Plasmid DNA was isolated from individual kanamycin-resistant colonies and subjected to restriction enzyme digestion to examine the structure of the plasmid DNA. This analysis revealed that multiple isoforms of the plasmid fusion product were present in the plasmid DNA isolated from any single kanamycin-resistant colony.
  • Cre recombinase leads to multiple fusion events between the pUNI and pHOST vectors resulting in the production of multimeric forms (i.e., trimer, tetramer, etc.) of the fused plasmid (the desired fused plasmid is a dimer formed by fusion of pUNI and pHOST).
  • the multimeric plasmid fusion products would be expected to be unstable due to the fact that the Cre protein is constitutively expressed in QLB4 cells.
  • cre can be more tightly controlled as described below.
  • the pUNI and pHOST vectors can be modified as described in Example 7 and these modified vectors can be fused using a host cell that constitutively expresses the Cre protein.
  • Cre recombinase can be more tightly controlled by a variety of means.
  • the expression of the cre gene can be made conditional when expressing cre under the control of the lac promoter by growing the host cells in medium containing glucose. The presence of 0.2% glucose in the growth medium virtually shuts down transcription from the lac promoter.
  • the lac promoter can be modified to insert additional operator (o) sites which bind the lac repressor.
  • Other tightly controlled promoters are known to the art (e.g., the T7 promoter which requires the expression of T7 RNA polymerase; these promoters are available on the pET vectors (Novagen)) and may be employed to control the expression of the cre gene.
  • Cre expression can be tightly controlled by placing the cre gene on a plasmid containing a temperature-sensitive (ts) replicon (e.g., rep pSC101 ts ).
  • ts temperature-sensitive replicon
  • Cre will be expressed during the transformation of the host cell (because the host cell containing the ts plasmid containing the cre gene was maintained at the permissive temperature) but will be absent following recombination of the pUNI and pHOST vectors when the host cell is grown at a temperature non-permissive for replication of the ts replicon.
  • Cre-/oxP-mediated plasmid fusion can occur in vivo, although the reverse reaction, resolution of heterodimers, might decrease its utility.
  • the pUNI construct is modified such that two different lox sites flank the kanamycin resistance gene (the modified pUNI construct is termed pUNI-D).
  • the two lox sites differ in their spacer regions by one or two nucleotides and for the sake of discussion the two different lox sites are referred to as " oxA” and "loxB” (e.g., loxP and / ⁇ xP511 ; oxB" is used in this discussion to distinguish it from the first lox site termed "lox A” and does not indicate the use of the loxB sequence found in the E. coli chromosome).
  • the pHOST construct is modified such that one loxA site and one loxB site flank the selectable marker gene (the modified pHOST construct is termed pHOST-D).
  • pHOST contains the sacB gene as the selectable marker (a negative selectable marker).
  • the presence of the sacB gene on pHOST-D provides a means of counter-selection as cells expressing the sacB gene are killed when the cell is grown in medium containing 5% sucrose [Gay et al. (1985) J. Bacteriol. 164:918 and (1983) J. Bacteriol. 153:1424].
  • Figure 12 provides a schematic showing the strategy for in vivo recombination in a Cre-expressing host cell (e.g., QLB4 cells) using the pUNI-D and pHOST-D constructs. Arrows are used to indicate the direction of transcription of various genes or gene segments in Figure 12.
  • Ap R ampicillin resistance gene
  • Kn R kanamycin resistance gene
  • Ori non-conditional plasmid origin of replication
  • Figure 12 illustrates that the second lox site (loxB) in pUNI-D (relative to the design of the pUNI-10 vector) is inserted between the kanamycin resistance gene and the R6K ⁇ conditional origin of replication.
  • a commercially available expression vector containing the desired promoter (and optionally enhancer) is modified as described in Example 2 to insert the loxA site downstream of the promoter.
  • a commercially available expression vector be employed as the art is well aware of methods for the generation of expression vectors. Sequences encoding the sacB gene [Gay et al. (1983) J. Bacteriol. 153:1424; GenBank Accession Nos. X02730 and KOI 987] and the second lox site (loxB) are inserted downstream of the first lox site (lox A).
  • the pUNI-D and pHOST-D constructs are cotransformed into QLB4 cells (Example 6) and the transformed cells are plated onto LB/Ap/Kn plates containing 5% sucrose to select for the desired recombinant.
  • Figure 12 illustrates the recombination events that will occur in the presence of Cre in the QLB4 cells.
  • First pUNI-D and pHOST-D will fuse to form two dimers in which two possible double cross-over events can occur. These two double cross- over events are diagrammed in Fig 12.
  • the double cross-over events will result in the exchange of the DNA segments that are flanked by lox A and loxB to produce the plasmids labelled "A" and "B.” All plasmids that contain the sacB gene (the pHOST-D, the fused plasmids and plasmid B) will be selected against by the presence of sucrose in the growth medium.
  • the pUNI-D construct will not be able to replicate in QLB4 cells as these cells do not express the ⁇ protein required for replication of the R6K ⁇ origin.
  • the only construct that will be maintained in QLB4 cells selected on LB/Kn containing sucrose is the desired plasmid A in which the gene of interest from pUNI-D has been placed under the transcriptional control of the promoter located on pHOST-D.
  • pUNI-10 was modified to place a second lox site, comprising the /oxP51 1 sequence (SEQ ID NO: 16) between the kanamycin resistance gene and the R6K ⁇ conditional origin of replication to create pUNI-10-D.
  • a second lox site, comprising the /oxP51 1 site was inserted onto a /oxP-containing expression plasmid (i.e.. a pHOST vector) to create a pHOST-D vector.
  • a /oxP-containing expression plasmid i.e.. a pHOST vector
  • the in vivo gene trap method can be used to recombine a gene of interest carried on a pUNI-D vector into an expression vector using host cells that constitutively express the Cre protein.
  • the in vivo gene trap method provides a means to transfer a gene of interest contained on a linear DNA molecule (e.g., a PCR product) that lacks a selectable marker into an expression vector(s).
  • the desired PCR product is amplified using two primers, each of which encode a different lox site (a "loxA" and "loxB" site such as a loxP and /oxP51 1 site).
  • a pUNI vector is constructed that contains (5' to 3') a lox A site, a counter-selectable marker such as the s ⁇ cB gene and a loxB site (i.e., the two different lox sites flank the counter- selectable marker).
  • This pUNI vector also contains a conditional origin of replication and an antibiotic resistance gene as described above and in Example 1.
  • the PCR product (loxA- amplified sequence-/oxB) is recombined with the modified pUNI vector (which comprises /oxA-counter-selectable marker-/oxB) to create a pUNI vector containing the PCR product which now lacks the counter-selectable marker.
  • This recombination event is selected for by growing the host cells in medium that kills the host if the counter-selectable gene is expressed.
  • the PCR product in the pUNI vector (containing 2 lox sites) can then be placed under the control of the desired promoter element by recombining the pUNI/PCR product construct with the appropriate pHOST-D vector.
  • the pUNI and pHOST constructs employed in the Univector Plasmid Fusion System were designed such that plasmid fusion resulted in the introduction of a lox site between the promoter and the gene of interest.
  • Z,oxP sites consist of two 13 bp inverted repeats separated by an 8 bp spacer region [Hoess et ⁇ l. (1982) Proc. N ⁇ tl Ac ⁇ d. Sci. USA 79:3398 and U.S.
  • Transcripts of the gene of interest produced from a pUNI-pHOST fusion construct comprising a loxP site may have two 13 nucleotide perfect inverted repeats within the 5' untranslated region (UTR) that have the potential to form a stem-loop structure (this will occur in those cases where pHOST does not encode an affinity domain at the amino-terminus of the fusion protein).
  • UTR 5' untranslated region
  • the ribosome scanning mechanism is the most commonly used mechanism for initiation of translation in eukaryotes (e.g., yeast and mammalian cells).
  • the ribosome binds to the 5" cap structure of the mRNA transcript and scans downstream along the 5 " UTR searching for the first ATG or translation start codon.
  • a stem-loop structure formed by the presence of a loxP sequence on the 5' UTR of the mRNA encoding the protein of interest would block or reduce the efficiency of ribosome scanning and thus the translation initiation step could be impaired.
  • stem-loop structures in the 5' UTR of particular mRNAs reduce the efficiency of translation in eukaryotes [see, e.g., Donahue et al. (1988) Mol. Cell. Biol.
  • Example 5 shows that a GST-SKP1 fusion construct produced using the Univector Fusion System (i.e., the construct contains a loxP site between the sequences encoding the Gst and Skpl domains) produced the same level of fusion protein as did a conventional construct encoding a Gst-Skpl fusion protein which lacks the lox? sequence. Therefore, concerns over the presence of a stem-loop structure caused by the presence of a lox sequence in a transcript encoded by a pUNI-pHOST fusion construct are limited to those constructs that do not generate fusion proteins.
  • pUNI-pHOST fusion constructs comprising lox sequences that comprise perfect 13 bp inverted repeats (e.g., lox?)
  • pUNI and pHOST constructs containing mutated lox? sequences are employed.
  • the mutated lox? sequences comprise point mutations that create mismatches between the two 13 bp inverted repeat sequences within the loxP site that disrupt the formation of or reduce the stability of a stem loop structure.
  • two modified loxP sites were designed that have mismatches at different positions in the inverted repeats located within a lox? site.
  • the 13 bp inverted repeats are binding sites for the Cre protein; thus, each lox?
  • the wild-type lox? site has two binding sites for Cre. For the purpose of discussion, these two binding sites are referred to as L and R (left and right).
  • the wild-type lox? site is designed L(0)-R(0) wherein "0" indicates the absence of a mutation (i.e., the wild-type sequence).
  • Two derivatives of the wild-type lox? sequence were designed and termed /oxP2 and /oxP3.
  • the sequence of /oxP2 (SEQ ID NO: 13), /oxP3 (SEQ ID NO:14), as well as the wild-type lox? sequence (SEQ ID NO:12) are shown in Figure 13.
  • Z,oxP2 is placed on the pUNI-10 construct (in place of the wild-type lox?
  • Lox?2 has repeats designated L(3,6)-R(0) which indicates that the third and sixth nucleotides of the left repeat are mutated; thus, a mismatch is introduced at the third and sixth positions between the L and R repeats of the /oxP2 site.
  • Lox?3 has repeats designated L(0)- R(9) which indicates that the ninth nucleotide on the right repeat sequence is mutated to introduce a mismatch at the ninth position between the L and R repeats of the /oxP3 site.
  • Fusion between the /oxP2 site on the pUNI construct and the /oxP3 site on the pHOST construct will generate a hybrid /oxP23 site [L(3,6)-R(9)] located between the promoter and the gene of interest and a wild-type loxP site [L(0)-R(0)] at the distal junction.
  • the /oxP23 site (SEQ ID NO: 15) in the 5' UTR will have three mismatches distributed at positions 3, 6 and 9 between the 13 nucleotide inverted repeats which are expected to strongly destabilize the formation of the stem-loop structure.
  • sequences suitable for disruption of the stem-loop structure will be apparent to those skilled in the art; therefore, the present invention is not limited to the use of the /oxP2 and /oxP3 sequences for the purpose of disrupting stem-loop formation on the 5' UTR of transcripts produced from pUNI- pHOST fusion constructs.
  • Univector Fusion system may be tested by placing one member of the pair on a pUNI vector and the other member on a pHOST construct.
  • the two modified vectors are then recombined in vitro as described in Example 4 and the fusion reaction mixture is used to transform E. coli cells and the transformed cells are plated on selective medium (e.g.. on LB/Amp and LB/Kan plates) in order to determine the efficiency of recombination between the two mutated lox sites (Example 4).
  • the efficiency of recombination between the two mutated lox sites is compared to the efficiency of recombination between two wild-type lox? sites.
  • Any pair of two different mutant lox sites that recombines at a rate that is about 5% or greater than that observed using two lox? sites is a useful pair of mutated lox sites for use in avoiding the formation of a stem-loop structure on the 5' UTR of the mRNA transcribed from the pUNI/pHOST fusion construct.
  • a strategy as described above was employed to determine if the reduced expression observed with the SKPl ORF under control of the GALl promoter as described in Example 5 could be improved with mutated lox sites.
  • a series of lox sites designed to reduce the stability of the stem-loop were employed. These, together with a control scrambled site, loxS, were placed between the GALl promoter and the lacZ reporter gene and ⁇ -galactosidase expression was measured. Mutations that decreased stem-loop stability tended to express better and one mutant, /oxP Lj69 , did not display any inhibitory effects. This mutant also retained 25% of the wild-type recombination efficiency and has been designated lbxH ⁇ i. * ⁇ for host).
  • oligonucleotides used to generate the loxH site are based on the loxH sequence 5'-ATTACCTCATATAGCATACATTATACGAAGTTAT-3' (SEQ ID NO:32). LoxH was further tested by using it to place MYC-RNR4 under GALl control and showed no translational interference, as shown in Figure 17 (compare lanes 2, 3, and 4). Z,oxH " s 25% recombinational efficiency is well within the range useful for UPS-mediated plasmid constructions. Thus, it is recommended that loxH be used in pHOST recipient vectors intended for transcriptional fusions to maximize expression, while lox? should be used for all other applications because of its higher recombination efficiency.
  • frt site like lox sites, contains two 13 bp inverted repeats separated by an 8 bp spacer region.
  • the present invention provides a second recombination event that allows a resolution of the UPS generated heterodimer.
  • a schematic representation of the POT reaction is shown in Figure
  • a R-recombination site. RS. was placed after the cloning site in pUNI (i.e., pUNI-20) such that any gene inserted into pUNI-20 would be flanked on the 5 ' side by lox? and on the 3' side by RS, although the present invention contemplates the use of any other second recombination system (e.g., the Res system).
  • Host recipient vectors must also contain lox and RS elements in the correct order.
  • the initial fusion event is catalyzed by Cre by UPS.
  • the second reaction can be catalyzed in vitro by incubation with purified R-recombinase (Araki et al, supra) or in vivo by transformation into a strain (e.g., BUN 15) expressing the R-recombinase under tac control on a Ts replication plasmid (e.g., pML66) that is lost when cells are plated at 42°C.
  • POT works efficiently as a two step reaction in vivo or in vitro. Efficient resolution in vivo without a selection for the second recombination event requires incubation in LB plus IPTG after transformation prior to plating on selective media.
  • the efficiency of recovering plasmids that have undergone POT can be greatly enhanced through the use of a recipient vector in which a counter-selectable marker is placed between the lox? and RS sites.
  • the present invention utilized the ⁇ X174 E gene which is toxic when expressed in E. coli unless the host cell lacks the slyD gene [Maratea et al. (1985) Gene 40:39].
  • pAS2-E a two hybrid bait vector derived from pAS2 [Durfee et al. (1994) Gene. & Dev. 7:555] which contains in a 5' to 3 ' order lox?, E under control of the tac promoter, and an RS site, was fused with pUNI-20.
  • BUN 15 is slyLY, pAS2-E alone cannot survive in it because of toxicity due to E expression. However, when pAS2-E is fused to pUNI-20 derivatives, it can transform that strain because subsequent R-dependent site-specific recombination between RS sites will eliminate both the pUNI backbone and E. This results in the replacement of E with the corresponding region from pUNI.
  • BUN 15 cells were grown overnight in LB containing spectinomycin (50 ⁇ g/ml) at 30°C.
  • BUN15 cells were diluted 1 to 100 in fresh media LB/Spec media containing 0.3 mM IPTG and grown to OD of 0.5.
  • Electrocompetent cells were prepared as recommended (Biorad). Forty ⁇ l of competent cells were used in each transformation. After the electrotransformation, cells were incubated in LB plus IPTG for 1 -8 hr for recovery before being plated on LB/Amp/IPTG ImM and incubated at 42°C.
  • LB plates or liquid media were supplemented with either kanamycin (40 ⁇ g/ml) or ampicillin (100 ⁇ g/ml).
  • IPTG isopropyl ⁇ -D- thiogalactoside
  • X-Gal Sigma
  • Yeast growth media and plates were made according to Rose el al. [Rose et al. (1990) Laboratory course manual for methods in yeast genetics, Cold Spring Harbor. New York, Cold Spring Harbor Laboratory Press].
  • Restriction endonucleases large (klenow) fragment of E. coli DNA polymerase I, T4 polynucleotide kinase, T4 DNA polymerase.
  • T4 DNA ligase were purchased from New England Biolabs. Drugs were purchased from Sigma if not otherwise specified.
  • E.coli BW23474 [ ⁇ / ⁇ c-169, robAl, creC510, hsdR5 lA, uidA(AMluI): ⁇ ir-l 16. endA, recAJ] and BW23473 [ ⁇ / ⁇ c-169, robAl, creC510, hs !R5l4, uidA(AMlul): ⁇ ir . endA. recAJ]
  • BUN10 [hisG4 thr-1 leuB ⁇ 1 lacYl kdgK5l A(gpl-proA)62 rpsL31 I.sx33 s ⁇ tpE44 recB21 recC22 sbcA23 hsdR: :cat-pir-l 1 ⁇ 5(Cm R )] was used for homologous recombination experiments.
  • BUN 13 which has cre under the control of the lac promoter is JM107 lysogenized with ⁇ LC (aadA lac-cre).
  • BUN15 is XL1 blue containing pML66(/c/c-R,
  • E. coli JM107 or DH5 were the transformation recipients for all other plasmid construction, including those made by UPS.
  • E. coli BL21 was used as the host for bacterial expression studies.
  • CXI (ara leu pvrE gal trp his argG rpsL thi-1 supE lad- slyDl) was used for propagation of E expression clones.
  • the PCR product was cloned into pCR 1M II (Invitrogen) and subcloned as a NcoI-EcoRI fragment into NcoI-EcoRI digested pGEX-2Tkcs to create pQL123.
  • the pHOST plasmid pQL103 was made by deleting one lox? site from pSE1086, which contains a /zo/-/oxP-Not/-/oxP-SalI cassette, by digestion with Notl and Sail, filling in the ends with klenow and religation.
  • the 590 bp NcoI-BamHI fragment containing the S. cerevisiae SKPl ORF was subcloned from pCB149 into NcoI-BamHI-cut pU ⁇ I-10 to create pQL130(pU ⁇ I-SKPl).
  • a second subclone of SKPl is pML73 which contains the same 5 " end of SKPl but an additional 800 bp of genomic DNA to the next BamHl site at the 3' end cloned into pUNI- 20.
  • pML73 was used for the POT experiments.
  • An oligo linker containing lox? and flanked by Ncol and BamHl overhangs was made by annealing two oligos 5"-CATGGCTATAAC TTCGTATAGCATACATTATACGAAGTTATG-3' (SEQ ID NO:22) and 5'-
  • pQL269 lac-cre aadA on a Ts pSCl Ol ori
  • EcoRI-PvuII fragment from pQLl 14 containing aadA and the lac-cre gene fusion was ligated to a Bgll (made blunt by T4 polymerase)-Ecoi?7 fragment from pINT-ts [Hasan et al ( 1994) Gene 150:51] containing the Ts replication origin and transformants were screened for Sp R and Ts growth at 42°C.
  • a plasmid with those properties was designated pQL269.
  • pML66 was constructed by ligating the EcoRI-Sall (blunt) fragment containing the tac promoter driving the R recombinase from pNN115 (Araki et al, supra) into EcoRI-Pstl (blunt) cleaved pQL269.
  • This spectinomycin resistant plasmid expresses R protein in the presence of IPTG and is lost from cells grown at 42°C because of a temperature sensitive replication mutation.
  • pUNI-Amp was made by placing the bla gene from pUC19 in place of the neo gene on pUNI-20 by generating a PCR product of bla and ligating that into Mlul-Nhel (blunt) cleaved pUNI-20.
  • the subcloning of the triple MYC tag into pUNI-Amp was accomplished by PCR amplification of the 3xMYC tag present of pJBN48 by the primers MZL154, 5'- AAATTTCTCGAGGCTCTGAGCAAAAGCTCAT-3' (S ⁇ Q ID NO:28) and MZL155,
  • pAS2- ⁇ was constructed by first placing a synthetic lox? site between the Ncol-Sall sites of pAS2 to make pAS2-/ox, and then generating a E-containing fragment with the following features: 5' Xhol site, tac promoter driving E, Spel site 3 * and ligated the Xhol-
  • Hindlll (made blunt by T4 DNA polymerase) fragment containing the spectinomycin resistance gene aadA from pDPT270 [Taylor and Cohen (1979) J. Bacteriol 137:92] was subcloned into BamHI-Sphl (made blunt by T4 DNA polymerase digested pSE356) to create pQL102, replacing neo with aadA.
  • BamHI-Sphl made blunt by T4 DNA polymerase digested pSE356
  • the present invention provides methods for the subcloning of nucleic acid molecules that permit the rapid transfer of a target nucleic acid sequence (e.g., a gene of interest) from nucleic acid molecule to another in vitro or in vivo without the need to rely upon restriction enzyme digestions.
  • a target nucleic acid sequence e.g., a gene of interest

Landscapes

  • Genetics & Genomics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Organic Chemistry (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Mycology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
EP98937141A 1998-07-24 1998-07-27 Schnelle subklonierung mittels ortsspezifischer rekombination Ceased EP1100885A4 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US122384 1980-02-19
US09/122,384 US6828093B1 (en) 1997-02-28 1998-07-24 Rapid subcloning using site-specific recombination
PCT/US1998/015481 WO2000005355A1 (en) 1998-07-24 1998-07-27 Rapid subcloning using site-specific recombination

Publications (2)

Publication Number Publication Date
EP1100885A1 true EP1100885A1 (de) 2001-05-23
EP1100885A4 EP1100885A4 (de) 2001-12-12

Family

ID=22402395

Family Applications (1)

Application Number Title Priority Date Filing Date
EP98937141A Ceased EP1100885A4 (de) 1998-07-24 1998-07-27 Schnelle subklonierung mittels ortsspezifischer rekombination

Country Status (5)

Country Link
EP (1) EP1100885A4 (de)
JP (1) JP2002522013A (de)
AU (1) AU8592298A (de)
CA (1) CA2336520A1 (de)
WO (1) WO2000005355A1 (de)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1194540A1 (de) 1999-07-14 2002-04-10 Clontech Laboratories Inc. Rekombinase-basierende methoden zur herstellung von expressionsvektoren und zusammensetzungen für ihre verwendung
US7517688B2 (en) 2000-02-25 2009-04-14 Stratagene California Method for ligating nucleic acids and molecular cloning
CA2401632A1 (en) 2000-02-25 2001-08-30 Stratagene Method for ligating nucleic acids and molecular cloning
ES2350248T3 (es) * 2001-10-01 2011-01-20 Deutsches Krebsforschungszentrum Stiftung des öffentlichen Rechts Procedimiento de producción de bibliotecas de proteínas y de selección de proteínas a partir de las mismas.
FR2831185A1 (fr) * 2001-10-19 2003-04-25 Genoplante Valor Procede et kit de conversion de vecteurs hybrides de clonage en phase solide et procede de clonage de sequences nucleotidiques d'interet en faisant application
US7160702B2 (en) * 2002-07-26 2007-01-09 Shuwei Yang Methods and nucleic acid vectors for rapid expression and screening of CDNA clones
ATE443128T1 (de) 2004-10-22 2009-10-15 Novozymes As Stabile genomische integration mehrerer polynukleotidkopien
AU2022334711A1 (en) * 2021-08-23 2024-04-04 Bioverativ Therapeutics Inc. Baculovirus expression system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0220009A2 (de) * 1985-10-07 1987-04-29 E.I. Du Pont De Nemours And Company Stellenspezifische Rekombination von DNS in Hefe
WO1991009957A1 (en) * 1989-12-22 1991-07-11 E.I. Du Pont De Nemours And Company Site-specific recombination of dna in plant cells
WO1993015191A1 (en) * 1992-01-24 1993-08-05 Life Technologies, Inc. Modulation of enzyme activities in the in vivo cloning of dna
WO1996004393A2 (en) * 1994-08-01 1996-02-15 Delta And Pine Land Company Control of plant gene expression
US5851808A (en) * 1997-02-28 1998-12-22 Baylor College Of Medicine Rapid subcloning using site-specific recombination
WO1999010488A1 (en) * 1997-08-28 1999-03-04 The Salk Institute For Biological Studies Site-specific recombination in eukaryotes and constructs useful therefor
WO1999021977A1 (en) * 1997-10-24 1999-05-06 Life Technologies, Inc. Recombinational cloning using nucleic acids having recombination sites

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5658772A (en) * 1989-12-22 1997-08-19 E. I. Du Pont De Nemours And Company Site-specific recombination of DNA in plant cells
US5801030A (en) * 1995-09-01 1998-09-01 Genvec, Inc. Methods and vectors for site-specific recombination

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0220009A2 (de) * 1985-10-07 1987-04-29 E.I. Du Pont De Nemours And Company Stellenspezifische Rekombination von DNS in Hefe
WO1991009957A1 (en) * 1989-12-22 1991-07-11 E.I. Du Pont De Nemours And Company Site-specific recombination of dna in plant cells
WO1993015191A1 (en) * 1992-01-24 1993-08-05 Life Technologies, Inc. Modulation of enzyme activities in the in vivo cloning of dna
WO1996004393A2 (en) * 1994-08-01 1996-02-15 Delta And Pine Land Company Control of plant gene expression
US5851808A (en) * 1997-02-28 1998-12-22 Baylor College Of Medicine Rapid subcloning using site-specific recombination
WO1999010488A1 (en) * 1997-08-28 1999-03-04 The Salk Institute For Biological Studies Site-specific recombination in eukaryotes and constructs useful therefor
WO1999021977A1 (en) * 1997-10-24 1999-05-06 Life Technologies, Inc. Recombinational cloning using nucleic acids having recombination sites

Non-Patent Citations (14)

* Cited by examiner, † Cited by third party
Title
AKAGI K ET AL: "CRE-MEDIATED SOMATIC SITE-SPECIFIC RECOMBINATION IN MICE" NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, SURREY, GB, vol. 25, no. 9, 1997, pages 1766-1773, XP001004602 ISSN: 0305-1048 *
ALADJEM MIRIT I ET AL: "Positive of FLP-mediated unequal sister chromatid exchange products in mammalian cells." MOLECULAR AND CELLULAR BIOLOGY, vol. 17, no. 2, 1997, pages 857-861, XP002180200 ISSN: 0270-7306 *
ANGRAND PIERRE-OLIVIER ET AL: "Inducible expression based on regulated recombination: A single vector strategy for stable expression in cultured cells." NUCLEIC ACIDS RESEARCH, vol. 26, no. 13, 1 July 1998 (1998-07-01), pages 3263-3269, XP001021788 ISSN: 0305-1048 *
AUBRECHT ET AL GENE vol. 172, 1996, pages 227 - 231, XP000404274 *
BOUHASSIRA ERIC E ET AL: "Transcriptional behavior of LCR enhancer elements integrated at the same chromosomal locus by recombinase-mediated cassette exchange." BLOOD, vol. 90, no. 9, 1997, pages 3332-3344, XP002143981 ISSN: 0006-4971 *
ELLEDGE S J ET AL: "LAMBDAYES A MULTIFUNCTIONAL COMPLEMENTARY DNA EXPRESSION VECTOR FOR THE ISOLATION OF GENES BY COMPLEMENTATION OF YEAST AND ESCHERICHIA-COLI MUTATIONS" PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES, vol. 88, no. 5, 1991, pages 1731-1735, XP002916290 1991 ISSN: 0027-8424 *
HALL RUTH M ET AL: "Mobile gene cassettes and integrons: Capture and spread of genes by site-specific recombination." MOLECULAR MICROBIOLOGY, vol. 15, no. 4, 1995, pages 593-600, XP001028961 ISSN: 0950-382X *
KILBY N J ET AL: "SITE-SPECIFIC RECOMBINASES: TOOLS FOR GENOME ENGINEERING" TRENDS IN GENETICS, ELSEVIER SCIENCE PUBLISHERS B.V. AMSTERDAM, NL, vol. 9, no. 12, 1993, pages 413-421, XP002916291 ISSN: 0168-9525 *
LIU QINGHUA ET AL: "The univector plasmid-fusion system, a method for rapid construction of recombinant DNA without restriction enzymes." CURRENT BIOLOGY, vol. 8, no. 24, 3 December 1998 (1998-12-03), pages 1300-1309, XP002155212 ISSN: 0960-9822 *
ODELL J T ET AL: "SEED-SPECIFIC GENE ACTIVATION MEDIATED BY THE CRE/LOX SITE-SPECIFIC RECOMBINATION SYSTEM" PLANT PHYSIOLOGY, AMERICAN SOCIETY OF PLANT PHYSIOLOGISTS, ROCKVILLE, MD, US, vol. 106, no. 2, 1 October 1994 (1994-10-01), pages 447-458, XP002912196 ISSN: 0032-0889 *
QIN M ET AL: "CRE RECOMBINASE-MEDIATED SITE-SPECIFIC RECOMBINATION BETWEEN PLANT CHROMOSOMES" PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, NATIONAL ACADEMY OF SCIENCE. WASHINGTON, US, vol. 91, no. 5, 1 March 1994 (1994-03-01), pages 1706-1710, XP000615549 ISSN: 0027-8424 *
SAUER B ET AL: "CONSTRUCTION OF ISOGENIC CELL LINES EXPRESSING HUMAN AND RAT ANGIOTENSIN II AT1 RECEPTORS BY CRE-MEDIATED SITE-SPECIFIC RECOMBINATION" METHODS: A COMPANION TO METHODS IN ENZYMOLOGY, ACADEMIC PRESS INC., NEW YORK, NY, US, vol. 4, no. 2, 1992, pages 143-149, XP000613888 ISSN: 1046-2023 *
SCHLAKE THOMAS ET AL: "Use of Mutated FLP Recognition Target (FRT) Sites for the Exchange of Expression Cassettes at Defined Chromosomal Loci." BIOCHEMISTRY, vol. 33, no. 43, 1994, pages 12746-12751, XP000616165 ISSN: 0006-2960 *
See also references of WO0005355A1 *

Also Published As

Publication number Publication date
JP2002522013A (ja) 2002-07-23
EP1100885A4 (de) 2001-12-12
AU8592298A (en) 2000-02-14
WO2000005355A1 (en) 2000-02-03
CA2336520A1 (en) 2000-02-03

Similar Documents

Publication Publication Date Title
US6828093B1 (en) Rapid subcloning using site-specific recombination
US7736851B2 (en) DNA cloning method
US8129598B2 (en) DNA recombination in eukaryotic cells by the bacteriophage PHIC31 recombination system
EP1234022B1 (de) Dna-rekombination in eukaryotischen zellen mittels des rekombinationszentrums vom bakteriophag phic31
US5958728A (en) Methods for producing polypeptides in mutants of bacillus cells
US6140129A (en) Chromosomal targeting in bacteria using FLP recombinase
Liu et al. [32] Rapid construction of recombinant DNA by the univector plasmid-fusion system
CN110997909A (zh) 改进的转座酶多肽及其用途
US6936470B2 (en) Rapid and enzymeless cloning of nucleic acid fragments
EP1100885A1 (de) Schnelle subklonierung mittels ortsspezifischer rekombination
Brecht et al. Genome engineering of Toxoplasma gondii using the site-specific recombinase Cre
US8647841B2 (en) Artificial chromosome vector
US20010044150A1 (en) Method for transformation of animal cells
Gottfried et al. Site-specific recombination in Arabidopsis plants promoted by the Integrase protein of coliphage HK022
JP2004518419A (ja) 基質連結型指向性進化(SLiDE)
JP7472167B2 (ja) 安定的な標的組み込み
AU2004201968A1 (en) Rapid subcloning using site-specific recombination
WO2006041096A1 (ja) 黄色ブドウ球菌バクテリオファージφmr11由来インテグラーゼ、及びその認識配列を利用した部位特異的遺伝子組換え方法
Thumasnn et al. CELLS BY THE BACTERIOPHAGE PHIC31 RECOMBINATION SYSTEM
JP4581838B2 (ja) Dna断片連結方法及びキット
Brennan et al. FOREIGN PATENT DOCUMENTS

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20010214

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

RIN1 Information on inventor provided before grant (corrected)

Inventor name: LIU, QINGHUA

Inventor name: ELLEDGE, STEPHEN, J.,DR.

A4 Supplementary search report drawn up and despatched

Effective date: 20011031

AK Designated contracting states

Kind code of ref document: A4

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

17Q First examination report despatched

Effective date: 20040730

APBN Date of receipt of notice of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA2E

APAF Appeal reference modified

Free format text: ORIGINAL CODE: EPIDOSCREFNE

APBT Appeal procedure closed

Free format text: ORIGINAL CODE: EPIDOSNNOA9E

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20070413