WO2002016642A1 - Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn - Google Patents

Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn Download PDF

Info

Publication number
WO2002016642A1
WO2002016642A1 PCT/US2001/025788 US0125788W WO0216642A1 WO 2002016642 A1 WO2002016642 A1 WO 2002016642A1 US 0125788 W US0125788 W US 0125788W WO 0216642 A1 WO0216642 A1 WO 0216642A1
Authority
WO
WIPO (PCT)
Prior art keywords
polynucleotides
deletions
library
polynucleotide
dna
Prior art date
Application number
PCT/US2001/025788
Other languages
English (en)
Other versions
WO2002016642A9 (fr
Inventor
Vaughn Smider
Original Assignee
Integrigen, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Integrigen, Inc. filed Critical Integrigen, Inc.
Priority to JP2002522312A priority Critical patent/JP2004507247A/ja
Priority to EP01965982A priority patent/EP1311709A4/fr
Priority to AU2001286528A priority patent/AU2001286528A1/en
Priority to CA002419961A priority patent/CA2419961A1/fr
Publication of WO2002016642A1 publication Critical patent/WO2002016642A1/fr
Publication of WO2002016642A9 publication Critical patent/WO2002016642A9/fr

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • C12N15/1027Mutagenizing nucleic acids by DNA shuffling, e.g. RSR, STEP, RPR
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids

Definitions

  • the invention relates to directed evolution, which encompasses methods that can be applied to genetic engineering and protein engineering. Directed evolution is used to evolve gene sequences with the goal of improving or altering gene or protein function. Directed evolution can be applied to many areas including, but not limited to, pharmaceutical development, bioremediation, bioleaching, and the chemical industry.
  • a major barrier to current molecular evolution is the inability to efficiently search sequence space for a given protein.
  • of critical importance is the ability to generate and identify sequences that differ at more than one residue, wherein those differences may have an additive effect upon protein function.
  • This additive effect may be described at amino acid interdependence.
  • a protein with a single mutation at residue i may not have any detectable increase in function unless a concomitant mutation at is also present.
  • all possible two-mutant variants of the target sequence should be sampled and tested for improved function.
  • N the number of R-mutant variants of a protein of length N is given by:
  • R is the number of mutant variants, and 20 denotes the number of amino acids possible at each position.
  • R is the number of mutant variants, and 20 denotes the number of amino acids possible at each position.
  • a critical value is the length of the protein (i.e. the number of R-mutant variants depends on the length of the protein).
  • the length of any given protein may not be as critical for its function as the arrangement of amino acid residues in three dimensional space.
  • a hypothetical concept of "catalytic task space” has been proposed to account for this principle (Kauffman, 1993).
  • Alterations in amino acid residues without altering protein length, N may not affect the three dimensional structure of the protein in the same ways that increasing or decreasing N would. Alternatively, changing N may not change the biological function of the protein at all.
  • N is the initial length of the protein and D is the number of positions where a deletion occurs.
  • D is the number of positions where a deletion occurs.
  • SEX S SCD SCA where amino acids are mutated to other residues (S R ), are deleted (S CD ), or added (S C A) hi various combinations and permutations.
  • S R residues
  • S CD deleted
  • S C A added
  • coli and Salmonella are highly related bacterial species, however the differences in their genetic content are due almost entirely to genetic swapping events as opposed to single residue changes. Additionally, swapping events where exchanges of large chunks of DNA create genes are thought to have occurred several times in pathways such as the clotting cascade, as well as to create novel transcription cassettes through transposition [Bell, (1997); Patthy, (1999)]. [11 ] A well known example of molecular evolution occurring naturally is that which underlies the production of antibodies in the immune system. In mammalian pre- lymphocytes natural molecular evolution occurs successfully on a daily basis. Antibodies are capable of binding a bewildering array of different antigens, yet have similar amino acid sequences and secondary structures.
  • Antibody genes are arranged in the germline as gene segments ( Figure 2). During lymphocyte maturation these segments (named variable or "V”, diversity or “D”, and junctional or “J") are juxtaposed to one another in the process termed V(D)J recombination to create a functional antibody or T-cell receptor gene. Multiple V, D, and J segments allow a substantial amount of diversity, and hence different antigen binding specificities, to be created in the final repertoire of lymphocytes. The diversity created by this mechanism is referred to as combinatorial diversity. Another type of diversity is also created during V(D) J recombination, which is as important as combinatorial diversity [Davis & Bjorkman, Nature 334: 395-402 (1988)].
  • junctional diversity is created when nucleotides are lost or gained at the joints of the gene segments. Importantly, these joints encode the regions of the antibody molecule which contact antigen, so this type of diversity is critical to creating a diverse, but functional immune system.
  • the two types of diversity utilized by the immune system might be characterized in the following way with regards to the practice of molecular evolution.
  • Generation of combinatorial diversity in immunoglobulin genes allow a sampling of the total sequence space by providing multiple functional V, D, and J gene segments, each member of which is slightly different in sequence but still homologous to other members of the family of segments.
  • the combinatorial rearrangement of V, D, and J gene segments functions as a "domain swapping" event in order to generate novel antibody genes.
  • Generation of junctional diversity allows a greater local sampling of sequence space at the critical residues for contacting antigen through the mechanism of adding or deleting random nucleotides at the ends of the DNA that are to be ligated.
  • Error-prone PCR uses low-fidelity polymerization conditions to introduce a low level of point mutations randomly over a long sequence. Error prone PCR can also be used to mutagenize a mixture of fragments of unknown sequence. Error-prone PCR can randomly mutate genes by altering the concentrations of respective dNTP's in the presence of dITP [Caldwell & Joyce, (1992); Leung & Miyamoto, Nucleic Acids Res 17: 1177-95 (1989); Spee et al, Nucleic Acids Res 21: 777-8 (1993)].
  • oligonucleotide-directed mutagenesis a short sequence is replaced with a synthetically mutagenized oligonucleotide. This approach does not generate combinations of distant mutations and is thus not significantly combinatorial.
  • the limited library size relative to the vast sequence length means that many rounds of selection are unavoidable for protein optimization.
  • Mutagenesis with synthetic oligonucleotides requires sequencing of individual clones after each selection round followed by grouping into families, arbitrarily choosing a single family, and reducing it to a consensus motif, which is resynthesized and reinserted into a single gene followed by additional selection. This process constitutes a statistical bottleneck, it is labor intensive and not practical for many rounds of mutagenesis.
  • Cassette mutagenesis is another method for creating libraries of mutant proteins [Bock et al., U.S. Patent # 5,830,720 (1995); Christou & McCabe, U.S. Patent # 5,830,728 (1998); Hill et al., Methods Enzymol 155: 558-68 (1987); Miller et al., U.S. Patent # 5,830,740 (1998); Shiraishi & Snimura, Gene 64: 313-9 (1988); Stemmer & Crameri, U.S. Patent # 5,830,721 (1998)].
  • Cassette mutagenesis typically replaces a sequence block length of a template with a partially randomized sequence. The maximum information content that can be obtained is thus limited statistically to the number of random sequences in the randomized portion of the cassette.
  • Zaccolo and Gherardi (1999) describe a method of random mutagenesis utilizing pyrimidine and purine nucleoside analogs [Zaccolo & Gherardi, JMol Biol 285: 775-83 (1999)]. This method was successful in achieving substitution mutations which rendered -lactamase with an increased catalytic rate against the cephalosporin cefotaxime. Crea describes a "walk through” method, wherein a predetermined amino acid is introduced into a targeted sequence at pre-selected positions [Crea, U.S. Patent # 5,798,208 (1998)].
  • Lietz describes a method whereby oligonucleotides with random sequences may be combined with PCR to induce insertions and deletions. Enhancement of function by this technique has not been shown, and the capacity to overmutagenize (i.e. make too many insertions or deletions per polynucleotide) is substantial in this method [Lietz, U.S. Patent # 6,251,604 (2001)]. [22] The technique most often used to evolve proteins in vitro is known as
  • DNA Shuffling In this method, a library of gene modifications is created by fragmenting homologous sequences of a gene, allowing the fragments to randomly anneal to one another, and filling in the overhangs with polymerase. A full length gene library is then reconstructed with polymerase chain reaction (PCR).
  • PCR polymerase chain reaction
  • the utility of this method occurs at the step of annealing, whereby homologous sequences may anneal to one another, producing sequences with attributes of both starting sequences. In effect, the method affects recombination between two or more genes that are homologous, but that contain significant differences at several positions.
  • StEP staggered extension ' process
  • full-length genes are assembled directly in the presence of the template(s).
  • the StEP consists of repeated cycles of denaturation followed by extremely abbreviated annealing/extension steps. In each cycle the extended fragments can anneal to different templates based on complementarity and extend a little further to create “recombinant cassettes.” Due to this template switching, most of the polynucleotides contain sequences from different parental genes (i.e. are novel recombinants). This process is repeated until full-length genes form.
  • fragmentation of the initial DNA can be accomplished by premature termination of the polymerase in an extension reaction by inducing adduct formation in the target gene [Short, U.S. Patent # 5,965,408 (1999)].
  • a library is created by inducing incremental truncations in each of two homologs to produce a library of fusion genes, each of which contains domains donated from each homolog [Ostermeier et al., Nat. Biotechnol 17: 1205-9 (1999)].
  • the advantage of this approach is that significant homology amongst the starting sequences is not required since the annealing step of previous methods is omitted. It is unclear, however, whether this modified technique actually will lead to generation of improved gene function after selection techniques are applied to the library.
  • the present invention involves the directed molecular evolution of nucleic acid sequences by:
  • step (b) optionally subjecting the pool of polynucleotides in step (a) to a selection procedure capable of identifying polynucleotides encoding for a desired function or feature. Steps (a) and (b) can optionally be repeated. Libraries produced by the methods of the invention are also described and contemplated. [27] Uniquely, the present invention allows a sampling of sequence space which will include sequences that significantly affect secondary protein structure, thus increasing the probability of identifying altered or improved function in an evolved gene. Further, the present invention allows a sampling of sequence space which cannot be sampled by other current technologies. Moreover, libraries of polynucleotides created with the present invention cannot be obtained utilizing other current technologies.
  • One method of the invention generates a library of polynucleotide sequences having nucleotide deletions at differing positions in a sequence of a genetic element comprising the steps of:
  • step (b) subjecting said polynucleotides from step (a) to a process which removes at least one nucleotide from one of said DNA ends of said polynucleotides producing a library of deletion polynucleotide sequences, said library comprising multiple deletion polynucleotide sequences with deletions at different random positions.
  • polynucleotides from step (b) may be subjected to a process that co valently joins the 3' and 5' ends to one another and the library of polynucleotides may be further subjected to a process that selects for a function of interest.
  • the library of deletion polynucleotides may comprise more than two or more, for example, deletions of at least 10, 20 or 30 or more or even 50 to 100 individual polynucleotides each having a random deletion at a different position from the others may be obtained. The number of deletions made will depend upon the starting material and the goal of the technician. In some embodiments, the library of deletion polynucleotides comprises very short deletions of at least 1, 2, 3, 4, or 5 individual nucleotides or more. In different embodiments, the library may comprise larger deletions of 50-100 or more nucleotides. In another emmbodiment, the composition of multiple copies of circular polynucleotides is free of naturally-occurring homologs to the genetic element. Further, steps (a) and (b) may optionally be repeated. Another optional method includes a process for inserting nucleotides at the position of deletion in step (b).
  • compositions comprising a library of multiple (preferably more than two, more preferably more than 5, most preferably more than 10) linear polynucleotides each having a different 3' and a 5' end, but each linear polynucleotide being identical to the others if circularized are described and contemplated.
  • compositions comprising a library of at least 2 (preferably more than 5, more preferably more than 10) deletion polynucleotides each differing from the other only by having a different random deletion are also described and contemplated.
  • deletion polynucleotides further comprise at least one nucleotide inserted at the position of deletion.
  • Another method of the invention generates a library of polynucleotide sequences having nucleotide additions at random positions in a genetic element comprising the steps of:
  • step (a) subjecting a composition of multiple copies of circular polynucleotides with the genetic element to random cleavage to obtain multiple linear polynucleotides each polynucleotide having at least one 3' and 5' end; and (b) subjecting said polynucleotides from step (a) to a process which adds at least one nucleotide to one of said ends of said polynucleotides producing a library of addition polynucleotide sequences, said library comprising multiple addition sequences with additions at different random positions.
  • the addition polynucleotides from step (b) may be subjected to a process that co valently joins said 3' and 5' DNA ends to one another.
  • the library of polynucleotides maybe subjected to a process that selects for a function of interest.
  • cleavage preferably occurs with the use of an endonuclease, preferably SI.
  • This method permits the library of addition polynucleotides to comprise any number of different polynucleotides, for example, at least 5, 10, 20 or 30 individual polynucleotides each having a random addition of nucleotides at a different position from the others.
  • the composition of multiple copies of circular polynucleotides is free of naturally-occurring homologs to the genetic element.
  • steps (a) and (b) of the method may be repeated.
  • Another option includes a process for deleting nucleotides at the point of addition in step (b).
  • nucleotides may be added in step (b) depending upon the starting molecule and the goal of the technician, for example, 1-3, 3-50, or 50-100 or more nucleotides may be added in step (b).
  • Substantially pure compositions comprising a library of at least 2 (preferably at least 5, most preferably at least 10) addition polynucleotides each differing from the other only by having a different random addition are contemplated.
  • the present invention surprisingly provides a method to make short deletions at the end of a polynucleotide, producing a population of polynucleotides with short deletions (from 1 to 100), preferably from 1 to 35, most preferably 1 to 10 at the end.
  • a DNA end having such deletions can then be co valently joined with other DNA ends, producing a library of polynucleotides containing deletions at a specific internal position.
  • the two ends to be ligated will be present on the same DNA molecule, such that the resulting ligation product comprises circular polynucleotides.
  • Such methods and compositions are important in the areas of protein engineering and directed evolution.
  • FIG. 1 is a diagram of the process of DNA shuffling, an earlier method of choice for molecular evolution. Homologs of a gene of interest are fragmented, subjected to denaturation and reannealing such that the single-strand fragments from the homologs can prime one another in an extension reaction. Amplification of the full length gene then produces a library of hybrid genes. A genetic screen is then applied to select an altered or improved gene.
  • FIG. 2 is a diagram of the immunoglobulin heavy chain locus illustrating the process of V(D)J recombination which produces combinatorial diversity, and DNA end-joining which produces junctional diversity.
  • FIG 3. is a diagram illustrating an example of a method which produces nucleotide deletions and insertions at random positions in a polynucleotide.
  • a target gene is cleaved to produce a pool of genes each of which are fragmented at random positions in the gene. Residues can be deleted (left), or inserted (right) at the DNA ends to produce libraries containing deletions, insertions, or deletions and insertions at random positions.
  • FIG 4 is a diagram illustrating the random cleavage of a polynucleotide.
  • the DNA plasmid pLacZi (Clontech, Palo Alto, CA) was either uncleaved (lane 2), cleaved with the single cutting restriction enzyme Cla I (lane 3), or increasing concentrations of SI nuclease (lanes 4-7). Lanes 1 and 8 are lambda Hind III DNA markers.
  • the pLacZi plasmid is uncut (lane 2), cleaved with Cla I (lane 3), or SI nuclease (lane 4).
  • FIG. 5 is a diagram illustrating an example of a method which produces short nucleotide deletions at a DNA end. Exonuclease III deletes nucleotides from the ends of a fluorescently labeled 232 bp DNA fragment in a salt dependent reaction. As salt is increased the number of deletions decreases.
  • FIG 6 is a diagram illustrating the deletion of nucleotides in the LacZ gene.
  • the plasmid pLacZi was cleaved with Cla I, treated with exonuclease III as described in FIG. 5, re-ligated, electroporated into E. coli, and plated on plates containing the colorimetric lactose analog X-Gal. Clones with either a blue or white color were picked, grown in LB, and DNA prepared. Plasmid was subjected to PCR with primers flanking the Cla I site, where one primer was fluorescently labeled.
  • the PCR product was run on a 6% denaturing acrylamide gel in an ABI 373 DNA sequencer and analyzed with Genescan software (Perkin Elmer, Foster City, CA).
  • the top panel shows PCR with the wild-type LacZ gene, producing a 312 bp fragment.
  • Clones 1-6 had variable short deletions present.
  • Clones 1 and 6 had blue phenotypes and 2-5 had white phenotypes.
  • FIG. 7 is a 1.5% agarose gel showing 3 clones containing an insertion in pLacZi.
  • CHO cell cDNA was fragmented with DNase I, ligated into the Cla I site of pLacZi, electroporated into E. coli, and plated on X-Gal plates.
  • Three clones were analyzed by PCR of plasmid DNA using primers flanking the Cla I site. Lanes labeled 1-3 are clones containing different sized insertions, and lane 4 is pLacZi.
  • the DNA in the first and last lanes are ⁇ X174/Hae III DNA markers with their sizes in basepairs indicated at the right.
  • Swapping events constitute a major driver in the evolution of macromolecules. Swapping events may include nucleotide insertions, deletions, or replacements. A swapping event may occur by means of homologous recombination, but may also occur by non-homologous means as they do in V(D)J recombination and the DNA- end joining mechanism used by antibody gene segments [Smider & Chu, Se . Immun. 9: 189-97 (1997)]. Current technologies for molecular evolution do not provide a generally applicable non-homologous means for gene swapping. [42] Applications of the current invention include producing novel genetic elements with improved or altered function. These genetic elements can have significant commercial value. For instance, the genetic element may enhance production of a protein pharmaceutical.
  • the genetic element may encode a protein pharmaceutical such as a monoclonal antibody, or an enzyme used to treat a disease. Further, the genetic element may encode an enzyme important in industrial processes such as chemical manufacturing, or may be used in a product such as laundry detergent (i.e. proteases, lipases, or esterases). Further, the genetic element may have important uses in agriculture, such as to provide a means for pathogen resistance, or to allow production of novel nutrients by a plant species. Additionally, the genetic element may be used in microorganisms to produce novel products for human use, such as novel antibiotics, pigments or other small molecules. As can be seen, the modification of genetic elements in order to improve or alter their function has a myriad of applications in several diverse industries.
  • base refers to a component of nucleic acid consisting of either adenine, guanine, thymine, cytosine, or uracil. Additionally, “purine” refers to either adenine or guanine, and “pyrimidine” refers to either thymine, cytosine, or uracil. [45] The term “nucleoside” refers to a molecule comprising the covalent linkage of a pyrimidine or purine to a pentose ring (such as ribose or deoxyribose).
  • nucleotide refers to the phosphate ester of a nucleoside.
  • polynucleotide(s) refers to a molecule containing at least one 5' hydroxyl of one nucleotide covalently linked to one 3' hydroxyl of at least one other nucleotide through a bond such as a phosphodiester bond.
  • a polynucleotide is necessarily composed of "positions" containing "residues” as defined below.
  • position refers to the location of a given residue in the polynucleotide or polypeptide chain.
  • position in a polynucleotide sequence is defined as the location of a nucleotide in the polynucleotide chain in reference to at least one other nucleotide. For instance in the simple polynucleotide TG, the T is in position 1 (in reference to itself) and G is in position 2 (in reference to the T in position 1). Often it is convention to label the furthest 5' nucleotide as a reference and label it as position 1.
  • positions located 5' from the ATG would be given a negative position (such as -11, -35, etc.) and positions located 3' to the ATG would be given positive positions.
  • positions located 5' from the ATG would be given a negative position (such as -11, -35, etc.) and positions located 3' to the ATG would be given positive positions.
  • sequence ATG means that the base adenine occupies a position which immediately precedes thymine, and thymine occupies a position which immediately precedes guanine.
  • a "specific position” refers to a position in a polynucleotide between at least two nucleotides whose sequence and composition is known.
  • a “genetic element” means a sequence of polynucleotide encoding a function.
  • a “genetic element” may encode a polypeptide sequence, may encode a promoter function, an enhancer function, a transcription start or stop site, or RNA splice sites and the like.
  • Genetic elements may be operatively linked to other genetic elements, for example a promoter may be operatively linked to a genetic element encoding a protein to allow expression of a protein in a given cell type.
  • the term "gene” and "gene of interest” refer to a polynucleotide capable of encoding a polypeptide.
  • sequence swapping in reference to a polynucleotide means either: 1) the occurrence of a deletion of at least two residues occupying consecutive positions in a polynucleotide, or 2) the occurrence of an addition of at least two residues occupying consecutive positions into a polynucleotide, or 3) the replacement of at least two residues occupying consecutive positions in a polynucleotide with other residues.
  • nucleotide deletions as applied to a polynucleotide means that a polynucleotide has had one or more specific residues removed from one or more positions in the polynucleotide chain when the resulting polynucleotide is compared to the parental, wild-type, or other reference sequence.
  • nucleotide insertions or “nucleotide additions” means that a polynucleotide has had specific residues added to the polynucleotide chain, such that at least one of the original residues now occupies a new position in the polynucleotide when compared to the parental, wild-type, or other reference sequence.
  • library of polynucleotide sequences refers to a mixture of polynucleotides, wherein at least one of the sequences differs from at least one other sequence in the mixture by sequence composition or length, for example, where at least one position is occupied by a different nucleotide when the two sequences are compared or at least one nucleotide position is absent in one sequence when compared with the other sequence.
  • DNA refers to deoxyribonucleic acid. It will be understood by those of skill in the art that where manipulations are described herein that relate to DNA they will also apply to RNA.
  • DNA ends or ends refers to the position in a DNA strand wherein a phosphodiester bond is broken. In a single-stranded DNA end a nucleotide is only covalently linked with one other nucleotide.
  • a "double-stranded DNA or RNA end” refers to the position in a double-stranded DNA or RNA molecule wherein the molecule is no longer double-stranded. Generally DNA ends are recognizable to those skilled in the art. Double- stranded DNA ends are characterized as blunt, having a 5' overhang, a 3' overhang, or a hairpin structure. A DNA end may or may not contain a 5' phosphate group.
  • cleavage refers to the breakage of a bond between two nucleotides, such as a phosphodiester bond.
  • circular polynucleotide refers to a polynucleotide wherein no double-stranded DNA ends are present.
  • a circular polynucleotide may be single-stranded or double-stranded.
  • a circular polynucleotide may, however, contain single-stranded DNA ends.
  • a circular polynucleotide will be present if single-stranded DNA ends exist but hydrogen bonding keeps the two strands of the double-stranded molecule hybridized to one another such that a double-stranded DNA end is not created by the presence of two single- stranded ends in proximity to one another.
  • Such a circular double-stranded polynucleotide is often referred to as "nicked”.
  • linear polynucleotide is a polynucleotide which contains at least one, but most often two DNA ends.
  • a linear polynucleotide may be either single- stranded or double-stranded.
  • random or “random position” as applied to a polynucleotide refers to a process by which any of the specific residue positions may be selected. Random as used here does not mean that all points or point of cleavage or nucleotides or positions are selected or chosen with equal frequency. Rather random focuses on the unpredictable nature of the process, i.e. the worker cannot predict a priori where an event will occur or what position any base will have.
  • a polynucleotide with a length of N may have any or all of its positions (i.e. 1, 2,...N) affected by a manipulation.
  • a polynucleotide necessarily must have covalent bonds (such as phosphodiester bonds) cleaved, thereafter which residues are deleted or added (i.e. the total number of positions is decreased or increased, respectively).
  • insertions at random positions in a polynucleotide of length N, it is meant that any or all of the N (in a circular polynucleotide) or N-l (in a linear polynucleotide) covalent linkages between nucleotides (i.e. phosphodiester bonds) are broken, and at least one new nucleotide (i.e. a new position) is added at the end prior to re-ligation.
  • the final length of the polynucleotide (N, or the number of positions) necessarily increases.
  • substantially pure means an object species is the predominant species present (i.e., on a molar basis it is more abundant than any other individual macromolecular species in the composition), and preferably a substantially purified fraction is a composition wherein the object species comprises at least about 50 percent (on a molar basis) of all macromolecular species present. Generally, a substantially pure composition will comprise more than about 80 to 90 percent of all macromolecular species present in the composition. Most preferably, the object species is purified to essential homogeneity (contaminant species cannot be detected in the composition by conventional detection methods) wherein the composition consists essentially of a single macromolecular species. Solvent species, small molecules ( ⁇ 500 Daltons), and elemental ion species are not considered macromolecular species.
  • homologous or “homeologous” means that one single- stranded nucleic acid sequence may hybridize to a complementary single-stranded nucleic acid sequence.
  • the degree of hybridization may depend on a number of factors including the amount of identity between the sequences and the hybridization conditions such as temperature and salt concentration as discussed later.
  • the region of identity is greater than about 5 bp, more preferably the region of identity is greater than 10 bp.
  • “homologs” are nucleic acid molecules that are not identical but are capable of hybridizing to one another under physiological conditions. Double-stranded homologs are capable of hybridizing to one another following denaturation.
  • heterologous means that one single-stranded nucleic acid sequence is unable to hybridize to another single-stranded nucleic acid sequence or its complement.
  • areas of heterology means that nucleic acid fragments or polynucleotides have areas or regions in the sequence which are unable to hybridize to another nucleic acid or polynucleotide. Such regions or areas are, for example, areas of mutations.
  • nucleic acid sequences have the same sequence or a complementary sequence.
  • areas of identity means that regions or areas of a nucleic acid fragment or polynucleotide are identical or complementary to another polynucleotide or nucleic acid fragment.
  • amplification means that the number of copies of a nucleic acid fragment is increased.
  • wild-type means that the nucleic acid fragment does not comprise any mutations.
  • a wild-type protein means that the protein will be active at a comparable level of activity found in nature and typically will comprise the amino acid sequence found in nature.
  • wild type or parental sequence can indicate a starting or reference sequence prior to a manipulation of the sequence.
  • related polynucleotides means that regions or areas of the polynucleotides are identical and regions or areas of the polynucleotides are heterologous.
  • chimeric polynucleotide means that the polynucleotide comprises nucleotide regions which are wild-type and regions which are mutated. It may also mean that the polynucleotide comprises wild-type regions from one polynucleotide and wild- type regions from another related polynucleotide.
  • the term "population” as used herein means a collection of components such as polynucleotides, nucleic acid fragments or proteins.
  • a “mixed population” means a collection of components which belong to the same family of nucleic acids or proteins (i.e. are related) but which differ in their sequence (i.e. are not identical) and hence in their biological activity.
  • a “library” necessarily implies a population wherein at least two of the components is different in some aspect (chemical composition, length, etc.)
  • the term "specific nucleic acid fragment” means a nucleic acid fragment having certain end points and having a certain nucleic acid sequence.
  • nucleic acid fragments wherein one nucleic acid fragment has the identical sequence as a portion of the second nucleic acid fragment but different ends comprise two different specific nucleic acid fragments.
  • Two nucleic acid fragments with identical sequences but different 5' or 3' ends comprise two different specific nucleic acid fragments.
  • the term "mutations" means changes in the sequence of a wild-type nucleic acid sequence or changes in the sequence of a peptide. Such mutations may be point mutations such as transitions or transversions. The mutations may be deletions, insertions or duplications.
  • the left-hand direction is the amino terminal direction and the right-hand direction is the carboxy-terminal direction, in accordance with standard usage and convention.
  • the left-hand end of single-stranded polynucleotide sequences is the 5' end; the left-hand direction of double-stranded polynucleotide sequences is referred to as the 5' direction.
  • RNA transcripts The direction of 5' to 3' addition of nascent RNA transcripts is referred to as the transcription direction; sequence regions on the DNA strand having the same sequence as the RNA and which are 5' to the 5' end of the RNA transcript are referred to as "upstream sequences"; sequence regions on the DNA strand having the same sequence as the RNA and which are 3' to the 3' end of the coding RNA transcript are referred to as "downstream sequences".
  • naturally-occurring refers to the fact that an object can be found in nature.
  • a polypeptide or polynucleotide sequence that is present in an organism (including viruses) that can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory is naturally-occurring.
  • naturally-occurring refers to an object as present in a non-pathological (undiseased) individual, such as would be typical for the species.
  • physiological conditions refers to temperature, pH, ionic strength, viscosity, and like biochemical parameters which are compatible with a viable organism, and/or which typically exist intracellularly in a viable cultured yeast cell or mammalian cell.
  • the intracellular conditions in a yeast cell grown under typical laboratory culture conditions are physiological conditions.
  • Suitable in vitro reaction conditions for in vitro transcription cocktails are generally physiological conditions.
  • in vitro physiological conditions comprise 50-200 mM NaCl or KC1, pH 6.5-8.5, 20-45°C.
  • aqueous conditions may be selected by the practitioner according to conventional methods.
  • buffered aqueous conditions may be applicable: 10-250 mM NaCl, 5-50 mM Tris HC1, pH 5-8, with optional addition of divalent cation(s) and/or metal chelators and/or nonionic detergents and/or membrane fractions and/or antifoam agents and/or scintillants.
  • linker refers to a molecule or group of molecules that connects two molecules, such as a DNA binding protein and a random peptide, and serves to place the two molecules in a preferred configuration, e.g., so that the random peptide can bind to a receptor with minimal steric hindrance from the DNA binding protein.
  • operably linked refers to a linkage of polynucleotide elements in a functional relationship.
  • a nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence.
  • a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the coding sequence.
  • Operably linked means that the DNA sequences being linked are typically contiguous and, where necessary to join two protein coding regions, contiguous and in reading frame.
  • the present invention provides a method to create libraries of polynucleotides containing either nucleotide deletions, insertions or combinations of deletions and insertions at random positions.
  • this invention provides a means to "swap" genetic elements without the need for homology or amplification techniques.
  • the swapping of genetic elements is known to be a driving force in evolution of macromolecules, cells, and organisms [Ostermeier & Benkovic, Adv Protein Chem 55: 29-77 (2000)].
  • Current techniques, such as PCR based gene shuffling, do not allow significant swapping of genetic elements independent of homology.
  • the invention provides a method to create a population of polynucleotides, with members of the population differing from one another by the presence of deletions at a single random position.
  • One method of the invention for example, comprises the steps of:
  • step (b) subjecting said polynucleotides from step (a) to a process which removes at least one nucleotide from one end of the ends of said polynucleotides;
  • step (c) optionally subjecting said polynucleotides from step (b) to a process which covalently joins said ends to one another, producing a library of polynucleotides which contains at least one polynucleotide that differs from the others by a deletion at one position.
  • the invention provides a population of polynucleotides, with members of the population differing from one another by the presence of deletions at a single random position. It is contemplated that deletions will allow removal of detrimental or unwanted functions of a genetic element. These functions might include protease sites, ion binding domains, DNA binding sequences for inhibitory transcription factors, immunogenic domains of proteins and the like.
  • the invention provides a method, for example, to generate polynucleotides wherein the polynucleotides contain deletions at more than one position.
  • One method comprises the steps of: (a) cleavage of a composition of multiple copies of polynucleotides at random positions to create two ends; (b) subjecting said polynucleotides from step (a) to a process which removes at least one nucleotide from one end of said ends of said polynucleotides; and
  • step (c) optionally, subjecting said polynucleotides from step (b) to a process which covalently joins said ends to one another, producing a library of polynucleotides which contains at least one polynucleotide that differs from the others by a deletion at one position.
  • a function of interest may then be selected for if desired (step(d)). Further, if desired, steps (a) to (c) or steps (a) to (d) may be repeated from 1 to 50 times or more.
  • the invention provides a population of polynucleotides wherein the polynucleotides contain deletions at more than one position. It is contemplated that deletions at multiple positions will allow removal of multiple detrimental or unwanted functions of a genetic element. These functions might include any combination of protease sites, ion binding domains, DNA binding sequences for inhibitory transcription factors, immunogenic domains of proteins or other functions of interest as will be well appreciated by those of skill in the art.
  • the invention provides a method to create a population of polynucleotides, with members of the population differing from one another by the presence of insertions at a single random position.
  • One method comprises the steps of:
  • the invention provides a population of polynucleotides, with members of the population differing from one another by the presence of insertions at a single random position. This embodiment of the invention will allow novel fusion of genetic elements to occur.
  • a toxin could be fused to a targeting molecule (like an antibody), enzyme modules in important metabolic pathways (such as polyketide synthetases) could be fused in new ways, or new functions like binding domains (i.e. nucleic acid binding domains, small molecule or ion binding domains, protease sites, or other post-translational modification modules) could be incorporated into existing genetic elements.
  • a targeting molecule like an antibody
  • enzyme modules in important metabolic pathways such as polyketide synthetases
  • binding domains i.e. nucleic acid binding domains, small molecule or ion binding domains, protease sites, or other post-translational modification modules
  • the invention provides a method to generate polynucleotides wherein the polynucleotides contain insertions at more than one position.
  • One method comprises the steps of:
  • step (b) subjecting said polynucleotides from step (a) to a process which inserts at least one nucleotide to at least one end of said DNA ends of said polynucleotides;
  • step (c) optionally, subjecting said polynucleotides from step (b) to a process which covalently joins said DNA ends to one another, producing a library of polynucleotides which contains at least one polynucleotide that differs from the others by an insertion at one position;
  • Steps (a)-(b), (a)-(c) or (a)-(d) may be repeated from 1 to 50 times or more.
  • the invention provides a population of polynucleotides wherein the polynucleotides contain insertions at more than one position. It is contemplated that this embodiment of the invention will allow multiple novel fusions of genetic elements to occur. For example, the following could be fused to a gene of interest in a combinatorial fashion: a toxin could be fused to a targeting molecule (like an antibody), enzyme modules in important metabolic pathways (such as polyketide synthetases) could be fused in new ways, or new functions like multiple binding domains (i.e. nucleic acid binding domains, ion binding domains, protease sites, or other post-translational modification modules) could be incorporated into existing genetic elements.
  • a targeting molecule like an antibody
  • enzyme modules in important metabolic pathways such as polyketide synthetases
  • the invention provides a method to create a population of polynucleotides, with members of the population differing from one another by the presence of deletions and insertions at a single random position. This method comprises the steps of:
  • step (a) cleavage of a composition of multiple copies of polynucleotides at random positions to create two ends; (b) subjecting said polynucleotides from step (a) to a process which removes at least one nucleotide from one end of said ends of said polynucleotides;
  • step (c) subjecting said polynucleotides from step (b) to a process which inserts at least one nucleotide to at least one end of said DNA ends of said polynucleotides from step (b);
  • step (d) optionally subjecting said polynucleotides from step (c) to a process which covalently joins said DNA ends to one another, producing a library of polynucleotides which contains at least one polynucleotide that differs from the others by a deletion and insertion at one position.
  • the invention provides a population of polynucleotides, with members differing from one another by a combination of deletions and insertions at a single random position. It is contemplated that this embodiment will allow for new heterologous domains to replace domains in the gene of interest. In this regard, new functions, such as ligand binding or enzymatic catalysis could be conferred upon a genetic element. Also, native function could be enhanced utilizing this embodiment.
  • the invention provides a method to generate polynucleotides wherein the polynucleotides contain insertions and deletions at more than one position.
  • deletions may occur at different positions than insertions, or deletions and insertions can occur at the same position. Further, deletions and/or insertions can occur at multiple positions. This method comprises the steps of:
  • step (b) subjecting said polynucleotides from step (a) to a process which removes at least one nucleotide from one end of said ends of said polynucleotides;
  • step (c) optionally subjecting said polynucleotides from step (b) to a process which inserts at least one nucleotide to at least one end of said ends of said polynucleotides; (d) optionally subjecting said polynucleotides from step (c) to a process which covalently joins said ends to one another, producing a library of polynucleotides which contains at least one polynucleotide that differs from the others by a deletion and insertion at one position; (e) optionally selecting for a function of interest; and optionally repeating any of steps (a) to (d) from 1 to 50 times or more.
  • the invention provides a population of polynucleotides wherein the polynucleotides contain insertions and deletions at more than one position. It is contemplated that this embodiment of the invention will allow for classical directed evolution, wherein multiple rounds of insertions at random positions, deletions at random positions, and combinations of insertions and deletions, are produced with the genetic element being optionally subjected to selection between each round. This embodiment allows for the improvement or alteration of the function of a genetic element.
  • the present invention can be applied to any polynucleotide of interest to the researcher.
  • the polynucleotide can be nucleic acid, i.e. RNA or DNA. Often the polynucleotide will be DNA consisting of genetic elements or one or more genes of interest.
  • the starting material may be obtained through natural sources, or may be polynucleotides which have been synthesized in a laboratory (e.g. gene synthesis), or may be polynucleotides derived from natural sources which have been manipulated in a laboratory.
  • the nucleic acid may be obtained from any source, for example, from plasmids such a pBR322, from cloned DNA or RNA or from natural DNA or RNA from any source including bacteria, yeast, viruses and higher organisms such as plants or animals. DNA or RNA may be extracted from blood or tissue material.
  • the template polynucleotide may be obtained by amplification using the polynucleotide chain reaction (PCR) [Mullis, U.S. Patent # 4,683,202 (1987); Mullis et al., U.S. Patent # 4,683,195 (1987)].
  • PCR polynucleotide chain reaction
  • the polynucleotide may be present in a vector present in a cell and sufficient nucleic acid may be obtained by culturing the cell and extracting the nucleic acid from the cell by methods known in the art.
  • the choice of vector depends on the size of the polynucleotide sequence and the host cell to be employed in the methods of this invention.
  • the templates may be plasmids, phages, cosmids, phagemids, viruses (e.g., retroviruses, parainfluenzavirus, herpesviruses, reoviruses, paramyxoviruses, and the like), or selected portions thereof (e.g., coat protein, spike glycoprotein, capsid protein).
  • viruses e.g., retroviruses, parainfluenzavirus, herpesviruses, reoviruses, paramyxoviruses, and the like
  • selected portions thereof e.g., coat protein, spike glycoprotein, capsid protein.
  • cosmids, phagemids, YACs, and BACs are preferred where the specific nucleic acid sequence to be mutated is larger because these vectors are able to stably propagate large nucleic acid fragment
  • the specific nucleic acid sequence is cloned into a vector it can be clonally amplified by inserting each vector into a host cell and allowing the host cell to amplify the vector. This is referred to as clonal amplification because while the absolute number of nucleic acid sequences increases, the number of mutants does not increase.
  • the polynucleotide may be double-stranded or single-stranded, but more preferably is double- stranded. Further, the polynucleotide may be linear or circular, but in a preferred embodiment the polynucleotide is circular. Polynucleotides in circular form may be prepared by preparation of plasmid DNA from organisms such as bacteria, yeast, plants, or mammalian cells by techniques well known to those skilled in the art [Maniatis et al., (1989)]. The number of different specific nucleic acid fragments in the reaction vessel will be at least about 100, preferably at least about 500, and more preferably at least about 1000. [95] The starting material (i.e.
  • the polynucleotide while in substantially pure form, can also be present without homologs or related sequences.
  • the polynucleotides in the initial vessel may all be identical, although they may also be related, unrelated or heterologous.
  • performance of the present invention will be unaffected by the sequence of the starting material.
  • the sequence of the starting material may be known or unknown.
  • all that is required is a method to detect the function of the polynucleotide (such as a screening assay).
  • a nucleic acid fragment may be cleaved by a number of different methods.
  • the nucleic acid fragment may be digested with a nuclease such as DNAse I, SI nuclease, PI or mung bean nuclease, or RNAse, which are readily available.
  • Other enzymes such as RAGl and RAG2, topoisomerases, and integrases are capable of cleaving polynucleotides.
  • the nucleic acid may be randomly sheared by the method of sonication or by passage through a tube having a small orifice.
  • the use of radiation such as gamma radiation or ultraviolet radiation is also capable of cleaving polynucleotides.
  • Chemical agents such as bleomycin or methyl methanesulfonate (MMS) can also cleave polynucleotides.
  • the present invention provides a means to cleave a polynucleotide such that cleavage occurs only at one position per polynucleotide in the reaction vessel.
  • the present invention provides a means for a near random cleavage of a polynucleotide (i.e. cleavage at several different positions in different molecules). Cleavage can be double-stranded or single-stranded (i.e. produce single-stranded ends or double-stranded ends).
  • Examples of enzymes which can cleave polynucleotides include DNase I, SI nuclease, PI nuclease, as well as topoisomerases, transposons, and integrases. Cleavage can occur transiently with enzymes such as topoisomerases, transposons, and integrases. These enzymes may cleave the polynucleotide once, or more than once.
  • SI nuclease can be used to cleave double or single-stranded polynucleotides in a generally random fashion. In a preferred embodiment, with circular double-stranded DNA, SI nuclease will cleave the polynucleotide only once, producing two DNA ends (FIG 4).
  • the nucleic acid may also be partially digested with one or more restriction enzymes which cleave DNA at a high frequency (i.e. at several positions within a polynucleotide), such that certain polynucleotides are cleaved only once, and that the resulting population contains polynucleotides cleaved one time, but with different polynucleotides cleaved at different positions.
  • the cleavage with a restriction enzyme may not be entirely random, but if the genetic element of interest has enough specific restriction sites at different positions, the cleavage pattern may be useful enough to generate substantial diversity.
  • single cleavage of a polynucleotide can be accomplished through other alternative mechanisms which normally cleave polynucleotides several times.
  • a polynucleotide can be randomly sheared by the method of sonication or by passage through a tube having a small orifice.
  • radiation such as gamma radiation or ultraviolet radiation is also capable of cleaving polynucleotides. If any of these modalities is carefully titrated and a means of purification is utilized, the singly cleaved molecules can be obtained in substantially pure form (i.e. singly cleaved molecules can be purified away from uncleaved or multiply cleaved molecules).
  • cleavage and rejoining steps may be coupled.
  • the DNA ends are linked, or are in physical proximity to one another, following cleavage. This is in order to prevent the re- ligation of the wrong ends to one another following deletional or insertional events.
  • One mechanism to keep the ends linked is through the use of a circular polynucleotide as a starting material.
  • the ends are linked by the intervening polynucleotide chain.
  • the re-ligation will be an intramolecular event as opposed to intermolecular, and will proceed with greater efficiency.
  • Other mechanisms to keep the ends in proximity is through a protein bridge, such as through chromatin (i.e. histones, or other DNA binding proteins), or through enzymes which couple cleavage with, rejoining, such as transposons, integrases, or topoisomerases.
  • ends could conceivably be left in proximity to one another through the linkage of opposite ends (the non-cleaved ends) to solid supports.
  • Cleavage of a circular polynucleotide consisting of supercoiled plasmid DNA can be accomplished by incubating from 0.1 to 100 ⁇ g, preferably from 1 to 10 ⁇ g with a nuclease such as SI nuclease.
  • the nuclease can be present in amounts from 0.1 to 1000 units, but preferably from 1 to 100 units in a reaction of 10 ⁇ l.
  • the temperature of the reaction can occur from between 0 and 100°C, but preferably between 4 and 50 °C.
  • the reaction time can vary from 30 seconds to 1 hour, but preferably is between about 1 and 30 minutes.
  • the degree of linearization can be measured by analyzing the plasmid DNA on an agarose gel as in FIG. 4.
  • the linear DNA should preferably be purified from the uncut DNA by any of a number of methods well known to those skilled in the art. Such methods include utilization of agarose gel purification kits (Qiagen, Valencia, CA), HPLC, column chromatography and the like.
  • Nucleotide deletions can be generated at a DNA end by a variety of means.
  • an exonuclease such as exonuclease III
  • exonuclease III can be used to remove nucleotides in a 3' to 5' direction from a DNA end.
  • the resulting DNA end then contains a 5' overhang which can be removed by digestion of the DNA with a single-stranded endonuclease such as PI nuclease, SI nuclease, or mung bean nuclease.
  • Bal 31 nuclease is an enzyme which possesses 5' to 3' as well as 3' to 5' nucleolytic activity and can be used to delete nucleotides from a DNA end.
  • polymerases like DNA polymerase I from E. coli, Klenow fragment, and Taq polymerase contain exonuclease activity and could conceivably be used to make deletions from a DNA end.
  • Cell extracts from all organisms contain DNA repair enzymes which can act to delete nucleotides, thus unpure cell extract could conceivably be used as a source for exonuclease activity.
  • Other nucleases which may not have exonuclease activity under certain conditions may be capable of producing deletions at a DNA end under other conditions.
  • SI nuclease can produce short deletions when used at high enzyme concentrations.
  • the conditions of the deletion reaction are set such that the number of individual deletions occurring at each DNA end may be well controlled.
  • altering the salt concentration, altering the pH, altering the temperature, or altering any of the other biochemical parameters of the reaction can change the activity of the nuclease enzyme such that more or less deletions will occur depending on the intent of the investigator (for instance decreasing temperature or increasing salt may lower the processivity of the exonuclease and cause fewer deletions).
  • Figure 5 shows altering conditions allowing differing numbers of deletions to occur on a DNA end. In some cases large deletions might be warranted (i.e. to completely remove a large domain in a genetic element), in other cases small deletions might be preferable (i.e.
  • deletions could be obtained numbering from 1 to 1000, more preferably they would be from 1 to 100. In certain instances, as described, the deletions may number from 1 to 10.
  • the polynucleotide is necessarily cleaved at a random position, as described above.
  • nucleotides Prior to insertion, nucleotides may be deleted from the DNA ends produced during the cleavage event. Alternatively, the DNA ends formed by the cleavage reaction can be used as substrates to which new nucleotides or polynucleotides are added.
  • nucleotides can be added by chemical coupling.
  • a polymerase such as terminal deoxynucleotidyl transferase can be utilized to add nucleotides in a semirandom fashion to a DNA end [Gauss & Lieber, Mol Cell Biol 1996 16: 258-69 (1996)].
  • the cleavage step may be coupled to the insertion event, as can be the case when employing transposons or integrases to the insertion event.
  • a ligase such as E. coli ligase or phage T4 ligase can be utilized to covalently couple a new polynucleotide to the parent polynucleotide.
  • the polynucleotide is a genetic element or a fragment of a genetic element.
  • a genetic element predisposes the resulting polynucleotide to have function since genetic elements are functional in some way by definition.
  • the genetic element may be a gene, the regulatory element of a gene, or a genetic element encoding a useful domain.
  • the genetic element may be a library of genetic elements such as a cDNA library or genomic DNA library.
  • Fragments of a genetic element can be produced by digesting the polynucleotide with a nuclease, such as DNAse I, SI nuclease, PI or mung bean nuclease, or RNAse.
  • a nuclease such as DNAse I, SI nuclease, PI or mung bean nuclease, or RNAse.
  • Other enzymes such as restriction enzymes and topoisomerases, can also cleave polynucleotides into fragments.
  • the polynucleotides may be randomly sheared by the method of sonication or by passage through a tube having a small orifice.
  • radiation such as gamma radiation or ultraviolet radiation is also capable of cleaving polynucleotides into fragments.
  • Chemical agents, such as bleomycin or MMS can also cleave polynucleotides into fragments.
  • a parent polynucleotide cleaved at a random position with a population of genetic elements or fragments of genetic elements, and a ligase such .as T4 DNA ligase, under the appropriate salt, buffer, and temperature conditions, will allow covalent coupling of the genetic elements with the parent polynucleotide at the position of the original cleavage event.
  • a mixture of polynucleotides is produced comprising an insertion at a random position within the parent polynucleotide.
  • the content (i.e. the sequence) of each insertion may be identical if the genetic elements or fragments of genetic elements are identical, or different if the fragments of genetic elements were non- identical.
  • DNA ends may be rejoined covalently by incubating the DNA ends with an enzyme like a DNA ligase which will form phosphodiester bonds between nucleotides at the DNA end.
  • ligases include E. coli DNA ligase, phage T4 DNA ligase, or human DNA ligases. These enzymes can be used under conditions well known to those skilled in the art to ligate DNA. Other enzymes are also capable of creating covalent linkages (like phosphodiester bonds) between nucleotides at DNA ends. Such enzymes are topoisomerases, transposons, integrases, and other recombination enzymes.
  • DNA ends can be joined such as the utilization of an oligonucleotide whose sequence can hybridize to sequences on either end (i.e. both the 5' and 3' ends) to "bridge" the ends with hydrogen bonds.
  • the intervening sequence on the opposite strand could be filled in with a polymerase, such as E. coli polymerase, Klenow fragment, phage T4 polymerase, or Taq polymerase.
  • Nicks could then be repaired by a DNA ligase as described above.
  • Cellular extracts also contain ligase activities and cell or nuclear extracts could be used to rejoin DNA ends.
  • DNA molecules could be introduced into intact cells and the cell's machinery could rejoin DNA ends by homologous or non-homologous means.
  • the invention provides a population of polynucleotides, with members of the population differing from one another by the presence of deletions at a single random position.
  • Such single deletion libraries can contain at least 2 molecules, but preferably 100 molecules, and most preferably at least about 1000 molecules.
  • Deletion libraries should contain at least one molecule that differs from at least one other molecule by the deletion of at least one nucleotide at one random position. The number of deletions at each position could be from 1 to 1000, but should be at least one. It is contemplated that deletions will allow removal of detrimental or unwanted functions of a genetic element. These functions might include protease sites, ion binding domains, DNA binding sequences for inhibitory transcription factors, immunogenic domains of proteins and the like.
  • the invention provides a population of polynucleotides wherein the polynucleotides contain deletions at more than one position.
  • a library should contain at least 2 molecules, but preferably 100 molecules, and most preferably at least about 1000 molecules.
  • These multiple deletion libraries should contain at least one molecule that differs from at least one other molecule by the deletion of at least one nucleotide at more than one random position. It is contemplated that deletions at multiple positions will allow removal of multiple detrimental or unwanted functions of a genetic element. These functions might include any combination of multiple protease sites, ion binding domains, DNA binding sequences for inhibitory transcription factors, immunogenic domains of proteins and the like. Insertions
  • Insertion libraries can contain at least 2 molecules, but preferably 100 molecules, and most preferably at least about 1000 molecules. Insertion libraries should contain at least one molecule that differs from at least one other molecule by the insertion of at least one nucleotide at one random position. The number of insertions at each position could be from 1 to 10,000, but preferably will be at least one.
  • a toxin could be fused to a targeting molecule (like an antibody), enzyme modules in important metabolic pathways (such as polyketide synthetases) could be fused in new ways, or a new function like binding domains (i.e. nucleic acid binding domains, ion binding domains, protease sites, or other post-translational modification modules) could be incorporated into existing genetic elements.
  • a targeting molecule like an antibody
  • enzyme modules in important metabolic pathways such as polyketide synthetases
  • binding domains i.e. nucleic acid binding domains, ion binding domains, protease sites, or other post-translational modification modules
  • the invention provides a population of polynucleotides wherein the polynucleotides contain insertions at more than one position.
  • a library should contain at least 2 molecules, but preferably 100 molecules, and most preferably at least about 1000 molecules.
  • These multiple insertion libraries should contain at least one molecule that differs from at least one other molecule by the insertion of at least one nucleotide at more than one random position. It is contemplated that this embodiment of the invention will allow novel fusion of genetic elements to occur. It is contemplated that this embodiment of the invention will allow multiple novel fusions of genetic elements to occur.
  • a toxin could be fused to a targeting molecule (like an antibody)
  • enzyme modules in important metabolic pathways such as polyketide synthetases
  • binding domains i.e. nucleic acid binding domains, ion binding domains, protease sites, or other post-translational modification modules
  • the invention provides a population of polynucleotides, with members differing from one another by a combination of deletions and insertions at a single random position.
  • a library should contain at least 2 molecules, but preferably 100 molecules, and most preferably at least about 1000 molecules.
  • These combination libraries should contain at least one molecule that differs from at least one other molecule by the insertion of one nucleotide and the deletion of at least one nucleotide at one random position. It is contemplated that this embodiment will allow for heterologous domains to replace domains in the gene of interest.
  • new functions such as ligand binding or enzymatic catalysis could be conferred upon a genetic element.
  • native function could be enhanced utilizing this embodiment.
  • the invention provides a population of polynucleotides wherein the polynucleotides contain insertions and deletions at more than one position.
  • a library should contain at least 2 molecules, but preferably 100 molecules, and most preferably at least about 1000 molecules.
  • combination libraries should contain at least one molecule that differs from at least one other molecule by the insertion of at least one nucleotide at one random position and the deletion of at least one nucleotide at one random position.
  • This embodiment of the invention will allow for classical directed evolution, wherein multiple rounds of insertions at random positions, deletions at random positions, and combinations of insertions and deletions, are produced with the gene of interest being optionally subjected to selection between each round. This embodiment allows for the improvement or alteration of function of a genetic element.
  • composition of such libraries can be determined by mechanisms well known to those in the art.
  • the library can be analyzed by agarose or acrylamide gel electrophoresis and size can be compared to the parental sequence.
  • Other methods like HPLC, mass spectrometry, column chromatography can be used to identify size differences between polynucleotides.
  • the most definitive method to determining the composition of a library is to subject representative polynucleotides within the composition to sequencing, a method well known to those skilled in the art. Comparison of sequences of representative clones would allow one to determine if deletions or insertions occurred at random positions in different molecules in the library.
  • the resulting library could be ligated into an expression vector for use as a vehicle to express the resulting variants contained within the library.
  • the nature of the expression vector is described below in the "screening" section.
  • the library In testing a library of polynucleotides for a function of interest, the library should be inserted in an appropriate expression vector.
  • the library can be constructed in an expression vector (i.e. the library comprises an expression vector).
  • the vector used for cloning is not critical provided that it will accept a DNA fragment of the desired size. If expression of the DNA fragment is desired, the cloning vehicle should further comprise transcription and translation signals next to the site of insertion of the DNA fragment to allow expression of the DNA fragment in the host cell.
  • preferred vectors include the pUC series and the pBR series of plasmids.
  • the resulting bacterial population will include a number of recombinant DNA fragments having random mutations. This mixed population may be tested to identify the desired recombinant nucleic acid fragment. The method of selection will depend on the DNA fragment desired.
  • the choice of vector depends on the size of the polynucleotide sequence and the host cell to be employed in the methods of this invention.
  • the templates may be plasmids, phages, cosmids, phagemids, viruses (e.g., retroviruses, parainfluenzavirus, herpesviruses, reoviruses, paramyxoviruses, and the like), or selected portions thereof (e.g., coat protein, spike glycoprotein, capsid protein).
  • viruses e.g., retroviruses, parainfluenzavirus, herpesviruses, reoviruses, paramyxoviruses, and the like
  • selected portions thereof e.g., coat protein, spike glycoprotein, capsid protein.
  • cosmids, phagemids, YACs, and BACs are preferred where the specific nucleic acid sequence is larger because these vectors are able to stably propagate large nucleic acid fragments.
  • the proteins expressed by each of the DNA fragments in the population or library may be tested for their ability to bind to the ligand by methods known in the art (i.e. panning, affinity chromatography). If a DNA fragment which encodes for a protein with increased drug resistance is desired, the proteins expressed by each of the DNA fragments in the population or library may be tested for their ability to confer drug resistance to the host organism.
  • methods known in the art i.e. panning, affinity chromatography
  • the proteins expressed by each of the DNA fragments in the population or library may be tested for their ability to confer drug resistance to the host organism.
  • One skilled in the art given knowledge of the desired protein, could readily test the population to identify DNA fragments which confer the desired properties onto the protein.
  • positive polypeptide variants means resulting polypeptide variants possessing functional properties which has been improved in comparison to the polypeptides producible from the corresponding input DNA sequences. Examples, of such improved properties can be as different as, for example, enhanced or lowered biological activity, increased wash performance, thermostability, • oxidation stability, substrate specificity, antibiotic resistance or others that may be of interest. [124] Consequently, the screening method to be used for identifying positive variants depend on which property of the polypeptide in question it is desired to change, and in what direction the change is desired.
  • the recombinant DNA molecules are cloned into the phage DNA at a site which results in the transcription of a fusion protein, a portion of which is encoded by the recombinant DNA molecule.
  • the phage containing the recombinant nucleic acid molecule undergoes replication and transcription in the cell.
  • the leader sequence of the fusion protein directs the transport of the fusion protein to the tip of the phage particle.
  • the fusion protein which is partially encoded by the recombinant DNA molecule is displayed on the phage particle for detection and selection by the methods described above.
  • the deletion generating mechanism can be applied to any polynucleotide of interest to the researcher.
  • the polynucleotide can be nucleic acid, i.e. RNA or DNA. Often the polynucleotide will be DNA consisting of genetic elements or one or more genes of interest.
  • the starting material may be obtained through natural sources, or may be polynucleotides which have been synthesized in a laboratory (e.g. gene synthesis), or may be polynucleotides derived from natural sources which have been manipulated in a laboratory.
  • Genbank http://www.ncbi.nlm.nih.gov: 80/Genbank/index.html
  • Genbank available commercially
  • Incyte Palo Alto
  • Clontech Palo Alto
  • CA Invitrogen, Carlsbad, CA
  • the nucleic acid may be obtained from any source, for example, from plasmids such a pBR322, from cloned DNA or RNA or from natural DNA or RNA from any source including bacteria, yeast, viruses and higher organisms such as plants or animals.
  • DNA or RNA may be extracted from blood or tissue material.
  • the template polynucleotide may be obtained by amplification using the polynucleotide chain reaction (PCR) [Mullis, U.S. Patent # 4,683,202 (1987); Mullis et al., U.S. Patent # 4,683,195 (1987)].
  • PCR polynucleotide chain reaction
  • the polynucleotide may be present in a vector present in a cell and sufficient nucleic acid may be obtained by culturing the cell and extracting the nucleic acid from the cell by methods known in the art.
  • Nucleotide deletions can be generated at a DNA end by a variety of means.
  • an exonuclease such as exonuclease III
  • exonuclease III can be used to remove nucleotides in a 3' to 5' direction from a DNA end.
  • the resulting DNA end contains a 5' overhang which can be removed by digestion of the DNA with a single-stranded endonuclease such as PI nuclease, SI nuclease, or mung bean nuclease.
  • Other exonucleases could also be used in the present invention.
  • Bal 31 nuclease is an enzyme which possesses 5' to 3' as well as 3' to 5' nucleolytic activity and can be used to delete nucleotides from a DNA end.
  • Exonuclease T can remove nucleotides in a 3' to 5' direction.
  • Exonuclease 7 can remove nucleotides in a 5' to 3' direction, and can act at single-stranded ends such as nicks or gaps.
  • Exonuclease I catalyzes the removal of nucleotides from single-stranded DNA in the 3' to 5 ' direction.
  • Lambda exonuclease is a highly processive enzyme that acts in the 5 ' to 3 ' direction, catalyzing the removal of 5 ' mononucleotides from duplex DNA.
  • RecJ is a single- stranded DNA specific exonuclease that catalyzes the removal of deoxynucleotide monophosphates from DNA in the 5 ' to 3 ' direction.
  • polymerases like DNA polymerase I from e. coli, Klenow fragment, and Taq polymerase contain exonuclease activity and could conceivably be used to make deletions from a DNA end.
  • Cell extracts from all organisms contain DNA repair enzymes which can act to delete nucleotides, thus unpure cell extract could conceivably be used as a source for exonuclease activity.
  • Other nucleases which may not have exonuclease activity under certain conditions may be capable of producing deletions at a DNA end under other conditions.
  • SI nuclease can produce short deletions when used at high enzyme concentrations.
  • mild denaturation of a DNA molecule such that the DNA ends become "frayed" will allow deletions to occur upon application of a single-stranded endonuclease, such as SI, PI, or mung-bean nuclease.
  • the conditions of the deletion reaction are set such that the number of individual deletions occurring at each DNA end may be well controlled.
  • altering the salt concentration and the temperature, altering the pH, or altering any of the other biochemical parameters of the reaction can change the activity of the nuclease enzyme such that more or less deletions will occur depending on the intent of the investigator.
  • Salts used in the reaction may be any salt. Examples of salts include sodium chloride, sodium acetate, potassium chloride, or potassium acetate.
  • the salt is either sodium chloride or potassium chloride.
  • Salt concentrations can range from 10 mM to 1.0 M, but preferably is between 50 mM and 500 mM. Temperature of the reaction can also vary in the present invention. The temperature can range from 0°C to 30°C, but preferably is between 0°C and 24°C.
  • Figure 5 shows altering conditions allowing differing numbers of deletions to occur on a DNA end. In some cases large deletions might be warranted (i.e. to completely remove a large domain in a genetic element), in other cases small deletions might be preferable (i.e. to remove a single amino acid, or a few amino acids such as those that comprise a protease site).
  • the resulting population of polynucleotides contain variable amounts of deletions at the ends of the starting sequence. Generally deletions could be obtained numbering from 1 to 1000, more preferably they would be from 1 to 100. In a preferred embodiment, the deletions may number from 1 to 30 or even 1 to 10.
  • DNA ends may be rejoined by incubating the DNA ends with an enzyme like a DNA ligase which will form phosphodiester bonds between nucleotides at the DNA end.
  • ligases include E. coli DNA ligase, phage T4 DNA ligase, or human DNA ligases. These enzymes can be used under conditions well known to those skilled in the art to ligate DNA.
  • enzymes are also capable of creating covalent linkages (like phosphodiester bonds) between nucleotides at DNA ends.
  • Such enzymes are topoisomerases, transposons, integrases, and other recombination enzymes.
  • Other mechanisms can be used to join DNA ends such as the utilization of an oligonucleotide whose sequence can hybridize to sequences on either end (i.e. both the 5' and 3' ends) to "bridge" the ends with hydrogen bonds.
  • the intervening sequence on the opposite strand could be filled in with a polymerase, such as e.coli polymerase, Klenow fragment, phage T4 polymerase, or Taq polymerase.
  • Nicks could then be repaired by a DNA ligase as described above.
  • Cellular extracts also contain ligase activities and cell or nuclear extracts could be used to rejoin DNA ends.
  • DNA molecules could be introduced into intact cells and the cell's machinery could rejoin DNA ends by homologous or non- homologous means.
  • the current invention provides for a composition of polynucleotides, wherein members of the population differ from one another by the presence of deletions at one or both ends of the polynucleotide.
  • the number of deletions may range from 1 to 100 at each end, but more preferable is from 1 to 30.
  • the current invention provides for a composition of polynucleotides differing from one another by short deletions at a specific internal position (i.e. not at an end).
  • This composition is obtained by joining the composition of polynucleotides with deletions at the ends to other DNA ends, such that the deletion now occurs internally. Often the two ends to be ligated will be present on the same DNA molecule, such that the resulting ligation product is a circular polynucleotide.
  • the number of deletions may range from 1 to 100 at each end, but more preferable is from 1 to 30.
  • the plasmid pLacZi (Clontech, Palo Alto, CA) was used to illustrate the mechanism by which a polynucleotide can be cleaved at random positions.
  • the plasmid was propagated in DH10B E. coli cells (Invitrogen, Carlsbad, CA) and plasmid was prepared by Qiagen maxiprep columns (Qiagen, Valencia, CA).
  • Plasmid DNA at 200 ng/ ⁇ l was incubated with 0.4, 2.0, 10, or 50 units of SI nuclease (Promega, Madison, WT) in IX SI buffer (50 mM sodium acetate pH 4.5, 280 mM NaCl, 4.5 mM ZnSO 4 ) for 10 minutes at room temperature. The reaction was stopped by the addition of EDTA to 0.025 M and heated to 70°C for 10 minutes. Protein was removed by twice extracting with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, precipitated with sodium acetate and resuspended in water.
  • the SI /Cla I cleaved plasmid is seen as a smear, showing that SI is cleaving in several different positions in the plasmid. If SI cleaved at only one position, then the Sl/Cla I cleaved plasmid would migrate as two bands; if SI cleaved at two positions, then the Sl/Cla I plasmid would migrate as three bands, and so on.
  • the importance of this example is that a polynucleotide is able to be cleaved once (i.e. linearization of a circle), and only once, at different positions.
  • Example 2 Deletions at a site in LacZ
  • Nucleotide deletions have been made for structural analysis of genes, and for nucleotide sequence analysis. Generally these deletions are large, in the range of well over 100 nucleotides. Under normal conditions, for example, exonuclease III removes over 100 bases per minute [Sambrook et al., (1989)]. The ability to create small deletions, however, would be useful to alter small domains in proteins or remove deleterious functions. In order to make small deletions at the end of a polynucleotide, exonuclease III was utilized under various conditions of salt ( Figure 5) and temperature.
  • a fluorescently labeled 232 base pair PCR product from pLacZi was exposed to 100 mM, 150 mM, and 200 mM NaCl in the presence of 10 U exonuclease III (New England Biolabs, Beverly, MA) in 10 ⁇ l of 66 mM Tris-Cl (pH 7.4), 0.66 mM MgCl 2 at 15°C in a 5 minute reaction.
  • the reaction was stopped by the addition of EDTA to 0.025 M, and extracted once with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, and precipitated with sodium acetate.
  • DNA was resuspended in 20 ⁇ l deionized formamide, and 0.5 ⁇ l was run on a 6% polyacrylamide denaturing gel in ABI 373 sequencer (Perkin-Elmer, Foster City, CA) set to the genescan setting according to the manufacturers recommendation.
  • the Cla I site in pLacZi exists in the coding region of the LacZ gene. This site was utilized to make short deletions within the gene itself, which could then be analyzed further by PCR to determine the extent to which deletions were made. Additionally, plasmids containing deletions were selected on LB agar plates containing 40 ⁇ g/ml X-Gal to determine the functionality of the LacZ gene. The pLacZi plasmid (10 ⁇ g) was linearized with Cla I in 200 ⁇ l, then incubated with 20 U of SI nuclease in 400 ⁇ l to remove the 2 bp 5' overhangs.
  • the linearized plasmid was concentrated and filtered through an ultrafree MC membrane (30 kD cutoff, Millipore, Bedford, MA), then brought to a volume of 400 ⁇ l in IX calf intestinal phosphatase buffer containing 100 U of calf intestinal phosphatase (New England Biolabs, Beverly, MA) and incubated for 45 minutes at room temperature. Plasmid was extracted with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, precipitated with sodium acetate, and resuspended in water.
  • the plasmid was then incubated with exonuclease III as described in example 1, in the presence of either 100 mM, 150 mM or 200 mM NaCl for 5 minutes at 15°C in a 10 ⁇ l reaction.
  • plasmid was not incubated with exonuclease III, to test for the frequency of religation of the dephosphorylated plasmid in the absence of deletions.
  • a mix containing SI nuclease 50 U in IX SI buffer was added. This mix was further incubated at room temperature for 15 minutes. The reaction was stopped by the addition of EDTA to 0.025 M and heated to 70°C for 10 minutes.
  • the DNA was then extracted once with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, precipitated with sodium acetate and resuspended in 10 ⁇ l of IX ligase buffer containing 1.0 U of T4 DNA ligase (Invitrogen, Carlsbad, CA). Ligation reactions were incubated at 15°C for 12 hours. Electroporation of E. coli strain DH10B (Invitrogen, Carlsbad, CA) was accomplished with 1.0 ⁇ l of ligation mix. Cells were plated on LB agar plates containing 40 ⁇ g/ml X-Gal and 100 ⁇ g/ml ampicillin and incubated overnight at 30°C. Table 1 illustrates the results of the plating experiment.
  • Clone 5 contains a 7 basepair out of frame deletion (PCR product of 305 bases) and has a white phenotype.
  • Clone 6 has a 3 basepair deletion (PCR product of 309 bases) and has a blue phenotype. Although it may be thought that shorter deletions would lead to less severe phenotype, this experiment illustrates that this is not necessarily the case.
  • Clone 1 contains a deletion encompassing 7 amino acids but retains function whereas clones 3 and 4 contain in frame shorter deletions but do not retain function. Furthermore, this example illustrates the ability of deletional technology to search functional sequence space.
  • Insertions of random DNA in the LacZ gene was accomplished by employing DNase I to fragment cDNA derived from CHO cells, followed by ligation of these fragments into linearized pLacZi. Since cDNA is by definition functional, it is contemplated that the use of cDNA will optimize the likelihood of obtaining functional proteins.
  • CHO cell cDNA (5 ⁇ g ) was fragmented with 0.001 units of DNase I in a buffer containing 40 mM Tris-Cl pH 7.4 and 10.0 mM MgCl for 5 minutes at room temperature. The reaction was stopped by the addition of EDTA to 0.025 M and heated to 70°C in the presence of 10 ⁇ g of protease K.
  • DNA was extracted with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, and precipitated with sodium acetate.
  • Plasmid linearized with Cla I or SI nuclease were dephosphorylated as described above, then again extracted with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, and precipitated with sodium acetate.
  • linearized, dephosphorylated plasmid was incubated with 1 ng of cDNA fragments in the presence of T4 DNA ligase (1.0 U) in a reaction volume of 10 ml at 15°C for 12 hours.
  • T4 DNA ligase 1.0 U
  • linearized plasmid was incubated with ligase in the absence of cDNA fragments, and cDNA fragments were incubated with ligase in the absence of linearized vector.
  • DH10B E. coli were then electroporated with 1.0 ⁇ l of each ligation mix.
  • the lac operon is a model system by which genetic elements are easily studied.
  • the enzyme ⁇ -galactosidase is encoded by the LacZ gene, but is normally only produced when lactose is present in the environment. Control of enzyme levels is accomplished at the level of transcription.
  • the lac repressor protein binds to the operator sequence upstream from the ATG start site of LacZ, and inhibits transcription by RNA polymerase. In the presence of lactose, however, the repressor is removed from the operator and transcription can proceed.
  • the mechanism of promotor activation is through the binding of lactose, the inducer, to the lac repressor and causing an allosteric change that causes its affinity for the operator to decrease dramatically.
  • LacZ transcription can be assessed by plating E. coli on the colorimetric substrate X-Gal, which causes colonies to turn blue when hydrolyzed by ⁇ -galactosidase.
  • the operator can be de- repressed by utilizing the lactose analog IPTG, which is non-hydrolizable, and strongly induces LacZ transcription by binding the lac repressor.
  • the pBluescript II KS+ plasmid was linearized with SI nuclease, gel purified, dephosphorylated, and subjected to exonuclease III digestion as described in examples 1 and 2.
  • Linearized plasmid at 20 ng/ ⁇ l was incubated with 10 U exonuclease III in 66 mM Tris-Cl pH 7.4, 0.66 mM MgCl 2 buffer at 15°C for 5 minutes, followed by addition of IX SI solution containing 50 mM sodium acetate pH 4.5, 280 mM NaCl, 4.5 mM ZnSO 4 and 10 U SI nuclease, and incubation for 15 minutes at room temperature. The reaction was stopped by adding EDTA to 0.025 M, and extraction with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), once with an equal volume of ether, and precipitated with sodium acetate.

Abstract

L'invention concerne des méthodes illustrées par la figure destinées à l'évolution dirigée, dans lesquelles des éléments génétiques sont clivés de façon aléatoire pour permettre la délétion ou l'addition de polynucléotides ou l'un et l'autre afin de créer une banque d'éléments génétiques libérés avec des additions ou des délétions. L'invention concerne également des populations de banques correspondantes. Ces processus permettent un échantillonnage significatif de l'espace de séquence nécessaire à une évolution dirigée de gènes. En outre, l'invention concerne des méthodes permettant d'effectuer de très petites délétions de nucléotides dans des éléments génétiques étudiés.
PCT/US2001/025788 2000-08-18 2001-08-17 Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn WO2002016642A1 (fr)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2002522312A JP2004507247A (ja) 2000-08-18 2001-08-17 Dna末端改変を使用する指向された分子進化のための方法および組成物
EP01965982A EP1311709A4 (fr) 2000-08-18 2001-08-17 Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn
AU2001286528A AU2001286528A1 (en) 2000-08-18 2001-08-17 Methods and compositions for directed molecular evolution using DNA-end modification
CA002419961A CA2419961A1 (fr) 2000-08-18 2001-08-17 Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US22647700P 2000-08-18 2000-08-18
US60/226,477 2000-08-18

Publications (2)

Publication Number Publication Date
WO2002016642A1 true WO2002016642A1 (fr) 2002-02-28
WO2002016642A9 WO2002016642A9 (fr) 2003-03-27

Family

ID=22849054

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/025788 WO2002016642A1 (fr) 2000-08-18 2001-08-17 Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn

Country Status (5)

Country Link
EP (1) EP1311709A4 (fr)
JP (1) JP2004507247A (fr)
AU (1) AU2001286528A1 (fr)
CA (1) CA2419961A1 (fr)
WO (1) WO2002016642A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013540439A (ja) * 2010-09-24 2013-11-07 コバリス,インコーポレイテッド 核酸を断片化する方法及び装置

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2516343C2 (ru) * 2008-04-23 2014-05-20 ДАНИСКО ЮЭс ИНК. Варианты изопренсинтазы, применяемые для улучшения продуцирования изопрена микроорганизмами

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
GREEN C. ET AL.: "Targeted deletions of sequences from closed circular DNA", PROC. NATL. ACAD. SCI. USA, vol. 77, no. 5, May 1980 (1980-05-01), pages 2455 - 2459, XP002905142 *
See also references of EP1311709A4 *
WALDECK W. ET AL.: "Random cleavage of superhelical SV40 DNA by S1 nuclease", BIOCHIMICA ET BIOPHYSICA ACTA, vol. 425, March 1976 (1976-03-01), pages 157 - 167, XP002905143 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013540439A (ja) * 2010-09-24 2013-11-07 コバリス,インコーポレイテッド 核酸を断片化する方法及び装置

Also Published As

Publication number Publication date
EP1311709A4 (fr) 2006-08-02
AU2001286528A1 (en) 2002-03-04
EP1311709A1 (fr) 2003-05-21
JP2004507247A (ja) 2004-03-11
WO2002016642A9 (fr) 2003-03-27
CA2419961A1 (fr) 2002-02-28

Similar Documents

Publication Publication Date Title
US7833759B2 (en) Method of increasing complementarity in a heteroduplex
Ostermeier et al. A combinatorial approach to hybrid enzymes independent of DNA homology
AU724698B2 (en) Recombination of polynucleotide sequences using random or defined primers
Jessberger et al. Repair of deletions and double-strand gaps by homologous recombination in a mammalian in vitro system
EP2009102A2 (fr) Mutagenèse aléatoire et amplification d'acide nucléique
JP6552969B2 (ja) 定方向進化のためのライブラリーの作製方法
AU2002314712A1 (en) A method of increasing complementarity in a heteroduplex polynucleotide
US7820413B2 (en) Incrementally truncated nucleic acids and methods of making same
AU2003267008B2 (en) Method for the selective combinatorial randomization of polynucleotides
EP1311709A1 (fr) Methodes et compositions pour l'evolution moleculaire dirigee utilisant une modification des extremites de l'adn
US20030036641A1 (en) Methods for homology-driven reassembly of nucleic acid sequences
US20030129709A1 (en) Method for site-directed mutagenesis of nucleic acid molecules using a single primer
WO2002081643A2 (fr) Elaboration d'echantillotheques de polynucleotides et identification d'elements de l'echantillotheque presentant les caracteristiques attendues
US20030092023A1 (en) Method of shuffling polynucleotides using templates
Pera et al. Hybrid enzymes
KR20000015984A (ko) 무작위 프라이머 또는 일정 프라이머를 이용한폴리뉴클레오티드 서열의 재조합 방법
Berta Analysis of specific cis-acting DNA sequences of the Himar1 mariner transposon
ZA200306203B (en) A method of increasing complementarity in a heteroduplex polynucleotide.
KR20000022394A (ko) 부위특이적변이도입방법
AU2002307170A1 (en) Methods for the preparation of polynucleotide librairies and identification of library members having desired characteristics
AU7257300A (en) Recombination of polynucleotide sequences using random or defined primers

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2419961

Country of ref document: CA

Ref document number: 2002522312

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2001286528

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2001965982

Country of ref document: EP

COP Corrected version of pamphlet

Free format text: PAGES 1/7-7/7, DRAWINGS, REPLACED BY NEW PAGES 1/7-7/7; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

WWP Wipo information: published in national office

Ref document number: 2001965982

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 2001965982

Country of ref document: EP