EP1109889A1 - Transformation, selektion und screening von sequenzdurchmischten polynukleotiden zur entwicklung und optimierung von pflanzenphänotypen. - Google Patents

Transformation, selektion und screening von sequenzdurchmischten polynukleotiden zur entwicklung und optimierung von pflanzenphänotypen.

Info

Publication number
EP1109889A1
EP1109889A1 EP99943983A EP99943983A EP1109889A1 EP 1109889 A1 EP1109889 A1 EP 1109889A1 EP 99943983 A EP99943983 A EP 99943983A EP 99943983 A EP99943983 A EP 99943983A EP 1109889 A1 EP1109889 A1 EP 1109889A1
Authority
EP
European Patent Office
Prior art keywords
sequence
plant
library
protoplasts
shuffled
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP99943983A
Other languages
English (en)
French (fr)
Inventor
Willem P. C. Stemmer
Venkiteswaran Subramanian
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Maxygen Inc
Original Assignee
Maxygen Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Maxygen Inc filed Critical Maxygen Inc
Publication of EP1109889A1 publication Critical patent/EP1109889A1/de
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1058Directional evolution of libraries, e.g. evolution of libraries is achieved by mutagenesis and screening or selection of mixed population of organisms
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1079Screening libraries by altering the phenotype or phenotypic trait of the host
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8271Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
    • C12N15/8274Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for herbicide resistance
    • C12N15/8275Glyphosate
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1085Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
    • C12N9/10923-Phosphoshikimate 1-carboxyvinyltransferase (2.5.1.19), i.e. 5-enolpyruvylshikimate-3-phosphate synthase

Definitions

  • the invention relates to methods and compositions for generating, modifying, adapting, and optimizing polynucleotide sequences that confer detectable phenotypic properties on plant species, agronomically-important microorganisms, genetic constructs/vectors, and related aspects.
  • GENETIC ENGINEERING OF AGRICULTURAL ORGANISMS Genetic engineering of agricultural organisms dates back thousands of years to the dawn of agriculture. Agricultural organisms having phenotypic traits that were deemed desirable have been selected, including taste, high yield, caloric value, ease of propagation, resistance to pests and disease, and appearance. Classical breeding methods to select for germplasm encoding desirable agricultural traits had been a standard practice of the world's farmers long before Gregor Mendel and others identified the basic rules of segregation and selection.
  • sequence diversity available is limited by the natural genetic variability within the existing specimen gene pool, although crude mutagenic approaches have been used to add to the natural variability in the gene pool.
  • the present invention provides a composition comprising a population of protoplast library members, wherein said protoplast library members each comprise a plant cell protoplast harboring intracellularly one or a subset of a library of hetero logous polynucleotide sequences, each of which is operably linked to an expression sequence, or, if the heterologous polynucleotide sequence is a transcriptional regulatory sequence, operably linked to a reporter gene sequence.
  • the library of heterologous polynucleotide sequences comprise at least 10, usually at least 100, and typically at least 1,000 species of distinct heterologous polynucleotide sequences which, in certain embodiments, may share 70 to 99 percent sequence identity or more, and/or may differ by only one or several nucleotide differences, and/or may share less than 70 sequence identity, or a combination thereof.
  • the heterologous polynucleotide sequence is xenogenic; however in some embodiments the heterologous polynucleotides may be derived from genetic sequences from the genome of the same plant species from which the plant cell protoplast was produced, but said heterologous polynucleotides are not naturally- occurring sequences in said genome and comprise at least one mutation or recombination not present in the genome of the plant cell protoplast.
  • the heterologous sequence is substantially identical to a naturally-occurring gene sequence in the genome of a species of plant, algae, dinoflagellate, bacterium, archaebacterium, cyanobacterium, plant pathogen (insect, nematode, virus, fungus), which is substantially or completely absent in the genome of the plant species from which the plant cell protoplast was produced.
  • the protoplast library members comprise an expression library of cloned heterologous polynucleotides, such as an expression cDNA library, transformed by suitable means into said plant cell protoplasts.
  • the protoplast library members contain heterologous polynucleotides which are sequence-shuffled variants of at least two parental polynucleotide species, which typically share at least 70 percent sequence identity or which contain site-specific recombination sequences, or compatible restriction sites which can be used for cassette shuffling, or a combination thereof.
  • the invention provides a plant cell protoplast library comprising a plurality of library members, wherein each library member comprises a plant protoplast containing an intracellular polynucleotide comprising a distinct species of heterologous polynucleotide sequence operably linked to an expression sequence
  • a transcriptional regulatory sequence functional in the protoplast cell or progeny thereof e.g., a transcriptional regulatory sequence functional in the protoplast cell or progeny thereof
  • a replication sequence e.g., a plant origin of replication, a bacterial origin of replication (e.g., for use as a shuttle vector for transferring materials from bacteria to plants), an Agrobacterium Ti plasmid origin of replication, a viral replicon (e.g., for a plant virus), or the like).
  • the invention provides a library of transformed plant protoplasts, or progeny thereof, wherein each transformed protoplast harbors at least one distinct species of heterologous polynucleotide sequence operably linked to an Agrobacterium Ti plasmid in expressible form such that substantially each species of heterologous polynucleotide sequence is transcribed and translated in the host plant cell protoplast or progeny thereof.
  • the heterologous sequences cloned into the Ti plasmid are cDNA sequences obtained from an organism distinct from the phylogenetic species of the plant cell protoplast.
  • the heterologous sequences are mutated variants of one or more heterologous parental sequences and/or of one or more sequences present in the genome of the phylogenetic species of the plant cell protoplast; such mutation(s) can be introduced by any suitable method, including but not limited to error-prone PCR, site-directed mutagenesis, oligonucleotide-spiking, or other methods known in the art.
  • the invention also provides a method for obtaining a desired polynucleotide sequence, comprising selecting, from a population of protoplast library members or their clonal progeny, wherein said protoplast library members each comprise a plant cell protoplast harboring intracellularly one or a subset of a library of heterologous polynucleotide sequences, a subpopulation of said library members which express a predetermined phenotype.
  • the step of selecting comprises assaying a detectable biochemical phenotype in library members and segregating into a subpopulation those library members which exhibit said detectable biochemical phenotype; typically, the heterologous polynucleotide sequences are recovered from the selected subpopulation.
  • heterologous sequences can be used directly for a variety of uses, can be subjected to one or more subsequent rounds of transformation and selection, and/or can be mutagenized and/or sequence- shuffled and subjected to a subsequent round of transformation and selection, or combinations thereof.
  • the present invention provides a method for rapid evolution of polynucleotide sequences conferring a desired or predetermined phenotype to at least one plant species, algal species, or cyanobacterium.
  • the method comprises:(l) transferring a first population of sequence-shuffled polynucleotides comprising a genetic sequence (e.g., a coding sequence, transcriptional or translational regulatory sequence, RNA stability-regulating sequence, etc.) into a plurality of plant cells to produce a first population of transformed plant cells wherein the sequence-shuffled polynucleotides are expressible (either as a coding sequence or as a functional non-coding sequence), either constitutively or conditionally, to confer a phenotype to the transformed plant cell, and optionally to its clonal progeny, (2) selecting, from the first population of transformed plant cells, and/or optionally from clonal progeny thereof, a plurality of genotypes present in said first population of transformed plant
  • a genetic sequence
  • Cycles of sequence shuffling, transfer into host cells, and selection typically are repeated iteratively until at least one genetic sequence possesses a satisfactory capacity to produce the desired phenotype; usually from 2 to 1000 cycles of iterative shuffling, transfer into host cells, and selection, with a common range of from 5 to 50 cycles.
  • sequences are recombined recursively prior to selection (either in vitro, or in vivo, e.g., following protoplast fusion), thereby increasing the diversity available for selection.
  • At least one cycle of the method comprises transfer into plant cells, such as protoplasts, of shuffled polynucleotides having the genetic sequence(s) to be evolved to confer a desired phenotype, and often at least one cycle comprises selection in plant cell culture, such as by a metabolic assay of cultured plant cells generated from a protoplast transformation, or other selection methodology applicable to plant cell cultures.
  • the evolved polynucleotide specie(s) often are transferred into a host organism by any suitable method for transferring the evolved gene into germplasm of a plant species, such as, for example and not limitation, a plant cell protoplast competent for regeneration of an adult organism, which generally may be capable of sexual reproduction and/or asexual propagation by any art-known propagation method.
  • a variation of the method comprises transfer of the evolved genetic sequence into adult plants or plant parts (e.g., a leaf or root) by abrasive transfer (applying the transgene to an abraded surface, with or without an excipient such as LipofectinTM) or biolistics.
  • a variation of the method includes the further step of genetically crossing (e.g., by conventional breeding, protoplast fusion, or recombinant molecular methods) an adult plant harboring an evolved polynucleotide of the invention with a second (or multiple) individual plants, typically of the same species.
  • An aspect of the invention provides a method for obtaining polynucleotide sequences conferring a desired phenotypic trait to a plant cell, although the method is general and can be used in conjunction with an algal cell or cyanobacterium for certain desired applications.
  • An embodiment of the method comprises transferring into a population of plant cell protoplasts a plurality of library members, wherein library members each comprise a sequence-shuffled polynucleotide obtained by shuffling a plurality of species of a genetic sequence, and selecting from the resultant population of transformed plant cell protoplasts at least one plant cell, or clonal progeny thereof, exhibiting the desired phenotype.
  • the plurality of species of the genetic sequence that is shuffled are obtained by mutagenesis of one or more starting ("parental") genetic sequence(s), and/or may be obtained from a plurality of parental genetic sequences from nonisogenic individuals of the same or different species (e.g., allogenic - as distinct alleles of a gene locus, or xenogenic - obtained from a plurality of different organismal species and sharing sufficient nucleotide sequence homology for shuffling, or a combination thereof), or alternative sources as is described in commonly-assigned PCT patent applications published as WO98/13487 and WO98/13485 or other related informational publications cited herein.
  • the invention provides a method for identifying polynucleotide sequences encoding a predetermined phenotype for a plant cell, the method comprises: (1) transforming a plurality of species of sequence-shuffled polynucleotides into protoplasts of plant cells which are clonal progeny of a predetermined non-regenerating plant cell line, and (2) selecting transformed non- regenerable protoplasts or their clonal progeny by segregating individual transformants or pools thereof which express a predetermined phenotype and recovering at least one polynucleotide sequence of a sequence-shuffled polynucleotide.
  • the method comprises the further step of culturing the transformed protoplasts on a semisolid medium in growth conditions to form a population of microcalli, wherein substantially each microcallus comprises the clonal progeny of a transformed protoplast; the microcalli or portions thereof are then subjected to selection for the desired phenotype(s).
  • the sequence- shuffled polynucleotides comprise a selectable marker gene and the semisolid medium and/or growth conditions first select for transformants expressing the selectable marker gene which are capable of growth into microcalli whereas untransformed protoplasts and their progeny are relatively less capable of growth into microcalli.
  • the semisolid protoplast growth medium is M2 and contains an agent which selects for cells expressing a marker gene encoding antibiotic resistance or herbicide resistance.
  • the transformed protoplasts are propagated as suspensions of callus cells wherein the clonal progeny of individual transformants are propagated in discrete culture vessels; in a specific embodiment the culture vessels are individual wells of a multiwell culture plate.
  • the invention provides a method for isolating novel genetic sequences which confer a predetermined phenotype to a plant cell or plant when expressed therein, the method comprising screening a population of microcalli generated by transforming a population of plant protoplasts with a plurality of library members, wherein library members comprise a sequence-shuffled genetic sequence in expressible form.
  • the screening comprises performing a biochemical assay on the microcalli or portion thereof.
  • the screening comprises performing a biochemical assay for detecting an enzyme activity; in one variation, the enzyme activity screened for can also be detected in at least one naturally occurring species of the Kingdom Plantae and is encoded by a naturally- occurring plant genome.
  • the screening comprises obtaining a cellular sample of each microcallus (or pool) and performing an assay on the cellular sample which utilizes destructive testing of the cellular sample for obtaining readout of the assay.
  • the clonal progeny of the transformed protoplasts are propagated as suspension cultures in liquid protoplast growth medium in discrete culture vessels.
  • the protoplasts used for transformation are obtained from a plant cell line that is predetermined to be non- regenerating, such that adult plants can not be formed under conventional protoplast regeneration conditions.
  • sequence-shuffled polynucleotides can be transformed into protoplasts as naked DNA, as part or all of a genome of a plant virus (encapsidated or as naked nucleic acid), as a lipid-polynucleotide complex, as polynucleo tide-coated microprojectiles, or alternative delivery forms known in the art.
  • the invention also provides a recombinogenic protoplast plant cell suitable for hosting in vivo sequence shuffling, said recombinogenic protoplast plant cell comprising a plant cell which is either stably or transiently transformed with a polynucleotide capable of expressing a recombinase activity which does not naturally occur in the plant species from which the plant cell was derived.
  • a recombinogenic plant cell can comprise a cell of a monocot or dicot plant which also has a polynucleotide encoding a bacterial recA recombinase (or a FLP recombinase or ere recombinase for site-specific recombination) in expressible form (e.g., under the transcriptional control of a plant promoter or plant virus promoter functional in said cell, and other variations.)
  • the invention provides a method for performing in vivo sequence shuffling of multiple species of a genetic sequence, the method comprising transforming a population of recombinogenic plant cell protoplasts with a plurality of library members, wherein library members each comprise a polynucleotide species of a genetic sequence, under conditions whereby greater than about 2 percent, preferably more than about 5 percent to about 10 percent or more, of the transformed recombinogenic plant cell protoplasts are co-transformed with multiple species of library members and expressed re
  • the encoded recombinase is inducible, such as by being operably linked to an inducible promoter which can be induced in a plant cell by application of induction conditions.
  • the method comprises the further step of selecting from the resultant population of co- transformed plant cell protoplasts at least one plant cell, or clonal progeny thereof, exhibiting the desired phenotype.
  • the in vivo shuffled library members can be recovered for subsequent transformation into plant cell protoplasts (recombinogenic or non-recombinogenic), either prior to or subsequent to a phenotype selection step.
  • the invention provides a plant cell protoplast and clonal progeny thereof containing a sequence-shuffled polynucleotide which is not encoded by the naturally occurring genome of the plant cell protoplast.
  • the invention also provides a collection of plant cell protoplasts transformed with a library of sequence-shuffled polynucleotides in expressible form.
  • the invention further provides a plant cell protoplast co-transformed with at least two species of library members wherein library members comprise sequence-shuffled polynucleotides encoding a genetic sequence.
  • the invention also provides a regenerated plant containing at least one species of replicable or integrated polynucleotide comprising a sequence-shuffled portion, typically in expressible form.
  • the invention provides a method variation wherein at least one round of phenotype selection is performed on regenerated plants derived from protoplasts transformed with sequence-shuffled library members.
  • the present invention provides a method for generating polynucleotide sequences encoding at least one novel or modified phenotype which can be selected on the basis of expression of a genetic sequence in a plant protoplast, plant cell culture, or organism regenerated from a plant protoplast.
  • phenotypes include: a biosynthetic enzyme, a multi-enzyme biosynthetic pathway, enzymatic activity, resistance to insect infestation, resistance to a plant pathogen, morphological characteristic, foodstuff content, flavor component, altered fruit ripening, vegetative growth, senescence, carbon- fixation rate, nitrogen fixation, interaction with Rhizobium and/or other microbes, photosynthetic efficiency, herbicide resistance, pesticide resistance, flowering, photoperiodism, shelf-life, growth rate, growth habit, starch content, protein content, frost resistance, pigment content, nutrient content, genes encoding functions that effect transformation efficiency and efficient somatic regeneration, and the like.
  • the phenotype modification can result from introduction of an optimized gene, gene fragment, or regulatory sequence derived from a genome of a plant (e.g., from a genome of an organism in the Kingdom Plantae), a plant virus genome, a microorganism genome
  • the optimized gene, gene fragment, or regulatory sequence is obtained by recursive sequence shuffling which is described further herein and in documents incorporated herein by reference.
  • the recursive sequence shuffling is typically employed to obtain and/or optimize function of the gene, gene product, gene segment, and/or regulatory sequence in a plant host, in a prokaryotic host that is suitable for agricultural use (e.g., Agrobacterium tumefaciens, and ice H leaf commensal bacteria, etc.), or in a plant virus.
  • the method employs at least one step wherein sequence-shuffled polynucleotides are introduced into plant cell protoplasts to produce a library of transformed protoplasts which can be selected for the presence of a desired predetermined phenotype, either directly or by performing selection on clonal progeny of the transformed protoplast.
  • the invention provides a kit for obtaining a polynucleotide encoding a predetermined phenotype, the kit comprising a plant cell line suitable for forming transformable protoplasts and a collection sequence-shuffled polynucleotides formed by in vitro sequence shuffling.
  • the kit often further comprises a transformation enhancing agent (e.g., lipofection agent, PEG, etc.) and/or a transformation device
  • kits e.g., a biolistics gene gun
  • a plant viral vector which can infect plant cells or protoplasts thereof.
  • the kit also optionally comprises buffers, containers, packaging materials, instructions for practicing the methods herein, or the like.
  • the disclosed method for altering a agricultural organism phenotype by iterative gene shuffling and phenotype selection is a pioneering method which enables a broad range of novel and advantageous agricultural compositions, methods, kits, uses, plant cultivars, and apparatus which will be apparent to those skilled in the art in view of the present disclosure.
  • Figure 1 shows a schematic portrayal of a generic plasmid for transduction/transformation of cloned heterologous polynucleotide sequences into cells.
  • the present invention provides methods, reagents, genetically modified plants, plant cells and protoplasts thereof, microbes (e.g., Agrobacterium), polynucleotides, shuffled nucleic acids, other protoplasts (such as fungal protoplasts), plant cell and plant libraries, fungal cells and fungal organism libraries and compositions relating to the forced evolution of genetic sequences that confer selectable phenotypes to agricultural organisms, or portions thereof, having a desired phenotypic alteration generated by polynucleotide sequence shuffling of a plurality of polynucleotide sequences, typically having regions of substantial sequence identity to facilitate shuffling recombinations.
  • microbes e.g., Agrobacterium
  • polynucleotides e.g., shuffled nucleic acids
  • other protoplasts such as fungal protoplasts
  • plant cell and plant libraries e.g., fungal cells and fungal organism libraries and compositions relating to the forced evolution of genetic
  • the invention provides methods and related compositions for introducing libraries of shuffled nucleic acids into plant protoplasts, and selecting the protoplasts (or corresponding regenerated plant cells or plants) for a desired trait or property.
  • Nucleic acids from the plants or protoplasts can be isolated to produce secondary libraries which can be transduced into cells or protoplasts, which are again selected for a desired trait or property. This process can be repeated one, two, three, four or more times until a desired trait or property is obtained.
  • plants, cells or protoplasts which are selected can be transduced with one or more additional library of nucleic acids, which recombine in the plants, cells or protoplasts, and which are selected for a desired trait or property.
  • This process can also be repeated one or several times and multiple cycles of recombination can be performed prior to selection (or between rounds of selection) to increase the diversity available during screening stages.
  • Libraries of materials can be shuffled nucleic acids produced by any available shuffling methodology, or can be focused or random libraries of nucleic acids.
  • nucleic acids of the libraries can remain unrecombined in cells or protoplasts into which the nucleic acids are transduced, or the nucleic acids can recombine with nucleic acids previously present in the cells, plants or protoplasts (e.g., genomic or episomal DNAs).
  • plant cells, plants or protoplasts can be transduced with genes which encode recombinogenic proteins (such as recA), or libraries of materials can be coated with the recombinogenic proteins themselves (e.g., the recA protein).
  • transduction of recombinogenic factors nucleic acids, proteins or other materials
  • Nucleic acids can be present in transducing vectors such as Agrobacterium vectors which facilitate recombination of sequences of interest into host DNAs (e.g., genomic DNAs).
  • destructive testing is defined herein as a procedure to determine a biochemical, biophysical, genetic, or other property or parameter of a plant cell or protoplast, which procedure results in the assayed cells thereby becoming non-replicable and/or non-viable.
  • destructive testing can include assays which use cell lysis (irreparable damage to the cell membrane and/or cell wall), exposure to genotoxic or toxic chemicals, ionizing or ultraviolet irradiation at flux levels sufficient to lethally damage the irradiated cells, and the like.
  • derivative refers to a component (e.g., a library of molecules) made using a specified parental (e.g., an original library of molecules) component.
  • DNA shuffling is used herein to indicate recombination between substantially homologous but non-identical polynucleotide sequences.
  • DNA shuffling may involve crossover via nonhomologous recombination, such as via cre/lox and/or flp/frt systems or via oligonucleotide or in silico shuffling, or the like, such that recombination need not require substantially similar polynucleotide sequences.
  • Homologous and non- homologous recombination formats can be used, and, in some embodiments, can generate molecular chimeras and/or molecular hybrids of substantially dissimilar sequences.
  • Viral recombination systems such as template-switching and the like can also be used to generate molecular chimeras and recombined genes, or portions thereof.
  • a general description of shuffling is provided in commonly-assigned WO98/13487 and WO98/13485, both of which are incorporated herein in their entirety by reference; in case of any conflicting description of definition between any of the incorporated documents and the text of this specification, the present specification provides the principal basis for guidance and disclosure of the present invention.
  • related polynucleotides means that regions or areas of the polynucleotides at issue are identical and regions or areas of the polynucleotides are heterologous.
  • chimeric polynucleotide means that the polynucleotide comprises regions which are w ild-type and regions which are mutated, or that the polynucleotide has nucleic acid subsequences derived from more than one source, depending on the context herein. It can also mean that the polynucleotide comprises wild-type regions from one polynucleotide and wild-type regions from another related polynucleotide.
  • the term “cleaving” means digesting the polynucleotide with enzymes or breaking the polynucleotide (e.g., by chemical or physical means), or generating partial length copies of a parent sequence(s) via partial PCR extension, PCR stuttering, differential fragment amplification, or other means of producing partial length copies of one or more parental sequences.
  • the term “population” as used herein means a collection of components such as polynucleotides, nucleic acid fragments or proteins.
  • a “mixed population” means a collection of components which belong to the same family of nucleic acids or proteins (i.e. are related) but which differ in their sequence (i.e. are not identical) and hence in their biological activity.
  • mutants means changes in the sequence of a parent nucleic acid sequence (e.g., a gene or a microbial genome, transferable element, or episome) or changes in the sequence of a parent polypeptide. Such mutations may be point mutations such as transitions or transversions. The mutations may be deletions, insertions or duplications.
  • recursive sequence recombination refers to a method whereby a population of polynucleotide sequences are recombined with each other by any suitable recombination means (e.g., sexual PCR, homologous recombination, site-specific recombination, etc.) to generate a library of sequence- recombined species which is then screened or subjected to selection to obtain those sequence-recombined species having a desired property; the selected species are then subjected to at least one additional cycle of recombination with themselves and/or with other polynucleotide species and at subsequent selection or screening for the desired property.
  • suitable recombination means e.g., sexual PCR, homologous recombination, site-specific recombination, etc.
  • amplification means that the number of copies of a nucleic acid fragment is increased.
  • naturally-occurring refers to the fact that an object can be found in nature.
  • a polypeptide or polynucleotide sequence that is present in an organism that can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory is naturally-occurring.
  • laboratory strains and established cultivars of plants which may have been selectively bred according to classical genetics are considered naturally-occurring.
  • naturally-occurring polynucleotide and polypeptide sequences are those sequences, including natural variants thereof, which can be found in a source in nature, or which are sufficiently similar to known natural sequences that a skilled artisan would recognize that the sequence could have arisen by natural mutation and recombination processes.
  • predetermined means that the cell type, non-human animal, or virus may be selected at the discretion of the practitioner on the basis of a known phenotype.
  • linked means in polynucleotide linkage (i.e., phosphodiester linkage).
  • Unlinked means not linked to another polynucleotide sequence; hence, two sequences are unlinked if each sequence has a free 5' terminus and a free 3' terminus.
  • operably linked refers to a linkage of polynucleotide elements in a functional relationship.
  • a nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence.
  • a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the coding sequence.
  • Operably linked means that the DNA sequences being linked are typically contiguous and, where appropriate to join two protein coding regions, contiguous and in reading frame.
  • enhancers generally function when separated from the promoter by several kilobases and intronic sequences may be of variable lengths, some polynucleotide elements may be operably linked but not contiguous.
  • a structural gene (e.g., a ESPSP gene) which is operably linked to a polynucleotide sequence corresponding to a transcriptional regulatory sequence of an endogenous gene is generally expressed in substantially the same temporal and cell type-specific pattern as is the naturally-occurring gene.
  • an expression cassette refers to a polynucleotide comprising a promoter sequence and, optionally, an enhancer and/or silencer element(s), operably linked to a structural sequence, such as a cDNA sequence or genomic DNA sequence.
  • an expression cassette may also include polyadenylation site sequences to ensure polyadenylation of transcripts.
  • an expression cassette comprises: (1) a promoter, such as a CaMV 35S promoter, a NOS promoter or a rbcS promoter, or other suitable promoter known in the art, (2) a cloned polynucleotide sequence, such as a cDNA or genomic fragment ligated to the promoter in sense orientation so that transcription from the promoter will produce a RNA that encodes a functional protein, and (3) a polyadenylation sequence.
  • an expression cassette of the invention may comprise the cDNA expression cloning vectors, pCD and ⁇ NMT (Okayama H and Berg P (1983) Mol. Cell. Biol. 3: 280; Okayama H and Berg P (1985) Mol. Cell. Biol. 5: 1136, incorporated herein by reference).
  • transcriptional modulation is used herein to refer to the capacity to either enhance transcription or inhibit transcription of a structural sequence linked in cis; such enhancement or inhibition may be contingent on the occurrence of a specific event, such as stimulation with an inducer and/or may only be manifest in certain cell types.
  • the altered ability to modulate transcriptional enhancement or inhibition may affect the inducible transcription of a gene or may effect the basal level transcription of a gene, or both.
  • transcription regulatory elements such as specific enhancers and silencers, are known to those of skill in the art and may be selected for use in the methods and polynucleotide constructs of the invention on the basis of the practitioner's desired application.
  • a transcription regulatory element may be constructed by synthesis (and ligation, if necessary) of oligonucleo tides made on the basis of available sequence information (e.g., GenBank sequences).
  • transcriptional unit or “transcriptional complex” refers to a polynucleotide sequence that comprises a structural gene (exons), a cis-acting linked promoter and other cis-acting sequences necessary for efficient transcription of the structural sequences, distal regulatory elements necessary for appropriate tissue-specific and developmental transcription of the structural sequences, and additional cis sequences important for efficient transcription and translation (e.g., polyadenylation site, mRNA stability controlling sequences).
  • transcription regulatory region refers to a DNA sequence comprising a functional promoter and any associated transcription elements (e.g., enhancer, CCAAT box, TATA box, LRE, ethanol-inducible element, etc.) that are essential for transcription of a polynucleotide sequence that is operably linked to the transcription regulatory region.
  • transcription elements e.g., enhancer, CCAAT box, TATA box, LRE, ethanol-inducible element, etc.
  • xenogeneic is defined in relation to a recipient genome, host cell, or organism and means that an amino acid sequence or polynucleotide sequence is not encoded by or present in, respectively, the naturally- occurring genome of the recipient genome, host cell, or organism.
  • Xeno genie DNA sequences are foreign DNA sequences.
  • a nucleic acid sequence that has been substantially mutated is xenogeneic with respect to the genome from which the sequence was originally derived, if the mutated sequence does not naturally occur in the genome.
  • minilocus refers to a heterologous gene construct wherein one or more nonessential segments of a gene are deleted with respect to the naturally-occurring gene.
  • deleted segments are intronic sequences of at least about 100 basepairs to several kilobases, and may span up to several tens of kilobases or more. Isolation and manipulation of large (i.e., greater than about 50 kilobases) targeting constructs is frequently difficult and may reduce the efficiency of transferring the targeting construct into a host cell. Thus, it is frequently desirable to reduce the size of a targeting construct by deleting one or more nonessential portions of the gene. Typically, intronic sequences that do not encompass essential regulatory elements may be deleted.
  • a deletion of the intronic sequence may be produced by: (1) digesting the cloned DNA with the appropriate restriction enzymes, (2) separating the restriction fragments (e.g., by electrophoresis), (3) isolating the restriction fragments encompassing the essential exons and regulatory elements, and (4) ligating the isolated restriction fragments to form a minigene wherein the exons are in the same linear order as is present in the germline copy of the naturally-occurring gene.
  • ligation of partial genomic clones which encompass essential exons but which lack portions of intronic sequence will be apparent to those of skill in the art (e.g., ligation of partial genomic clones which encompass essential exons but which lack portions of intronic sequence).
  • the gene segments comprising a minigene will be arranged in the same linear order as is present in the germline gene, however, this will not always be the case.
  • Some desired regulatory elements e.g., enhancers, silencers
  • an enhancer may be located at a different distance from a promoter, in a different orientation, and/or in a different linear order.
  • an enhancer that is located 3' to a promoter in germline configuration might be located 5' to the promoter in a minigene.
  • some genes may have exons which are alternatively spliced at the RNA level, and thus a minigene may have fewer exons and/or exons in a different linear order than the corresponding germline gene and still encode a functional gene product.
  • a cDNA encoding a gene product may also be used to construct a minigene. However, since it is generally desirable that the heterologous minigene be expressed similarly to the cognate naturally-occurring nonhuman gene, transcription of a cDNA minigene typically is driven by a linked gene promoter and enhancer from the naturally-occurring gene.
  • nucleotide sequence “5'-TATAC” corresponds to a reference sequence "5'-TATAC” and is complementary to a reference sequence "5'-GTATA”.
  • Physiological conditions refers to temperature, pH, ionic strength, viscosity, and like biochemical parameters that are compatible with a viable plant organism or agricultural microorganism (e.g., Rhizobium,
  • in vitro physiological conditions can comprise 50-200 mM NaCl or KC1, pH 6.5-8.5, 20- 45°C and 0.001-10 mM divalent cation (e.g., Mg " ⁇ , Ca* " ); preferably about 150 mM NaCl or KC1, pH 7.2-7.6, 5 mM divalent cation, and often include 0.01-1.0 percent nonspecific protein (e.g., BSA).
  • BSA nonspecific protein
  • a non-ionic detergent (Tween, NP-40, Triton X-100) can often be present, usually at about 0.001 to 2%, typically 0.05-0.2% (v/v).
  • Particular aqueous conditions may be selected by the practitioner according to conventional methods. For general guidance, the following buffered aqueous conditions may be applicable: 10-250 mM NaCl, 5-50 mM Tris HCl, pH 5-8, with optional addition of divalent cation(s), metal chelators, nonionic detergents, membrane fractions, antifoam agents, and/or scintillants.
  • label refers to incorporation of a detectable marker, e.g., a radiolabeled amino acid or a recoverable label (e.g. biotinyl moieties that can be recovered by avidin or streptavidin).
  • Recoverable labels can include covalently linked polynucleobase sequences that can be recovered by hybridization to a complementary sequence polynucleotide.
  • a detectable marker e.g., a radiolabeled amino acid or a recoverable label (e.g. biotinyl moieties that can be recovered by avidin or streptavidin).
  • recoverable labels can include covalently linked polynucleobase sequences that can be recovered by hybridization to a complementary sequence polynucleotide.
  • Various methods of labeling polypeptides, PNAs, and polynucleotides are known in the art and may be used.
  • labels include, but are not limited to, the following: radioisotopes (e.g., 3 H, 14 C, 35 S, 125 1, 131 I), fluorescent or phosphorescent labels (e.g., FITC, rhodamine, lanthanide phosphors), enzymatic labels (e.g., horseradish peroxidase, ⁇ - galactosidase, luciferase, alkaline phosphatase), biotinyl groups, predetermined polypeptide epitopes recognized by a secondary reporter (e.g., leucine zipper pair sequences, binding sites for antibodies, transcriptional activator polypeptide, metal binding domains, epitope tags).
  • labels are attached by spacer arms of various lengths, e.g., to reduce potential steric hindrance.
  • statically significant means a result (i.e., an assay readout) that generally is at least two standard deviations above or below the mean of at least three separate determinations of a control assay readout and/or that is statistically significant as determined by Student's t-test or other art-accepted measure of statistical significance.
  • transcriptional modulation is used herein to refer to the capacity to either enhance transcription or inhibit transcription of a structural sequence linked in cis; such enhancement or inhibition may be contingent on the occurrence of a specific event, such as stimulation with an inducer and/or may only be manifest in certain cell types.
  • agent is used herein to denote a chemical compound, a mixture of chemical compounds, a biological macromolecule, or an extract made from biological materials such as bacteria, plants, fungi, or animal cells or tissues. Agents are evaluated for potential activity as antiviral agents by inclusion in screening assays described hereinbelow.
  • substantially pure means an object species is the predominant species present (i.e., on a molar basis it is more abundant than any other individual macromolecular species in the composition), and preferably a substantially purified fraction is a composition wherein the object species comprises at least about 50 percent (on a molar basis) of all macromolecular species present. Generally, a substantially pure composition will comprise more than about 80 to 90 percent of all macromolecular species present in the composition. Most preferably, the object species is purified to essential homogeneity (contaminant species cannot be detected in the composition by conventional detection methods) wherein the composition consists essentially of a single macromolecular species. Solvent species, small molecules ( ⁇ 500 Daltons), and elemental ion species are not considered macromolecular species.
  • phenotype means an observable or otherwise detectable manifestation of a heritable trait or encoded function.
  • a phenotype can comprise an enzyme activity, a metabolic pathway that produces a detectable product or depletes a detectable substrate.
  • a phenotype can comprise a detectable change in the rate of uptake or production of a metabolite, insect resistance, herbicide resistance, and other detectable manifestations of gene expression.
  • the procedures herein involve, e.g., making libraries of nucleic acids and transducing protoplasts with the libraries. More generally, the nomenclature used hereafter and the laboratory procedures in agriculture, cell culture (especially plant cell culture), molecular genetics, virology (e.g., of plant viruses and virus-based vectors), and nucleic acid chemistry and hybridization described below are those well known and commonly employed in the art. Standard techniques are used for recombinant nucleic acid methods, polynucleotide synthesis, and microbial culture and transformation (e.g., biolistics, Agrobacterium (Ti plasmid), electroporation, lipofection).
  • Standard techniques are used for recombinant nucleic acid methods, polynucleotide synthesis, and microbial culture and transformation (e.g., biolistics, Agrobacterium (Ti plasmid), electroporation, lipofection).
  • Oligonucleotides can be synthesized on an Applied Bio Systems oligonucleotide synthesizer according to specifications provided by the manufacturer, or by a variety of other known techniques, or can be ordered from any of a variety of sources, including, e.g., Operon Technologies (Alameda, CA). See also, Beaucage and Caruthers (1981), Tetrahedron Letts.. 22(20): 1859-1862 and NeedhamNanDevanter et al. (1984) Nucleic Acids Res.. 12:6159-6168.
  • Leaf PCR is suitable for genotype analysis of transgenote plants. All sequences referred to herein by GenBank database file designation or a commonly used reference name which is indexed in GenBank or otherwise published are incorporated herein by reference and are publicly available.
  • the methods of the invention entail performing recombination ("shuffling") and screening or selection to "evolve" individual genes, whole plasmids or viruses, multigene clusters, or even whole genomes (Stemmer (1995) Bio/Technology 13:549-553).
  • This recombination can occur before or after introduction of nucleic acids into, e.g., plant protoplasts.
  • Reiterative cycles of recombination and screening/selection can be performed to further evolve the nucleic acids of interest.
  • Such techniques do not require the extensive analysis and computation required by conventional methods for polypeptide and genetic engineering.
  • Shuffling allows the recombination of large numbers of mutations in a minimum number of selection cycles, in contrast to natural pairwise recombination events (e.g., as occur during sexual replication).
  • sequence recombination techniques described herein provide particular advantages in that they provide recombination between mutations in any or all of these, thereby providing a very fast way of exploring the manner m which different combinations of mutations can affect a desired result.
  • structural and/or functional information is available which, although not required for sequence recombination, provides opportunities for modification of the technique.
  • DNA Shuffling can also be found in WO95/22625, WO97/ 20078, WO96/33207, WO97/33957, WO98/27230, WO97/35966, WO98/ 31837, WO98/13487, WO98/13485 and WO98/42832.
  • the invention involves creating recombinant libraries of polynucleotides that are then screened to identify those library members that exhibit a desired property.
  • the recombinant libraries can be created using any of the various methods herein, as well as many others which would be apparent to one of skill.
  • Methods for obtaining recombinant polynucleotides and/or for obtaining diversity in nucleic acids, e.g., as in molecular libraries of such polynucleotides, e.g., used as the substrates for DNA shuffling as described herein include, for example, homologous recombination (e.g., PCT/US98/05223; Publ. No.
  • oligonucleotide-directed mutagenesis for review see, Smith, Ann. Rev. Genet. 19: 423-462 (1985); Botstein and Shortle, Science 229: 1193-1201 (1985); Carter, Biochem. J. 237: 1-7 (1986); Kunkel, "The efficiency of oligonucleotide directed mutagenesis" in Nucleic acids & Molecular Biology, Eckstein and Lilley, eds., Springer Verlag, Berlin (1987)).
  • oligonucleotide-directed mutagenesis Zoller and Smith, Nucl. Acids Res. 10: 6487-6500 (1982), Methods in Enzymol. 100: 468-500 (1983), and Methods in Enzymol. 154: 329-350 (1987)) phosphothioate-modified DNA mutagenesis (Taylor et al., Nucl. Acids Res. 13: 8749-8764 (1985); Taylor et al., Nucl. Acids Res. 13: 8765-8787 (1985); Nakamaye and Eckstein, Nucl. Acids Res. 14: 9679-9698 (1986); Sayers et al., Nucl. Acids Res.
  • Kits for mutagenesis are commercially available (e.g., Bio-Rad, Amersham International, Yalen Biotechnology).
  • the recombinant libraries are prepared using DNA shuffling.
  • the shuffling and screening or selection can be used to "evolve" individual genes, whole plasmids or viruses, multigene clusters, or even whole genomes (Stemmer (1995) Bio/Technology 13:549-553).
  • Reiterative cycles of recombination and screening/selection can be performed to further evolve the nucleic acids of interest. These cycles can occur before or after transduction of libraries into protoplasts, and multiple cycles of recombination can be performed prior to cycles of selection (conversely, especially where a population is highly diverse and selective forces are weak, multiple cycles of selection can be performed between cycles of recombination).
  • sequence recombination techniques described herein provide particular advantages in that they provide recombination between mutations in any or all of these, thereby providing a very fast way of exploring the manner in which different combinations of mutations can affect a desired result. In some instances, however, structural and/or functional information is available which, although not required for sequence recombination, provides opportunities for modification of the technique.
  • the invention relates in part to a generally applicable method for generating novel or improved agricultural organisms (e.g., plants or fungi) or genetic sequences relating thereto comprising genotypes and phenotypes which do not naturally occur or would be anticipated to occur at a substantial frequency in nature.
  • a broad aspect of the method employs recursive nucleotide sequence recombination, termed "sequence shuffling", which enables the rapid generation of a collection of broadly diverse phenotypes that can be selectively bred for a broader range of novel phenotypes or more extreme phenotypes than would otherwise occur by natural evolution in the same time period.
  • a basic variation of the method is a recursive process comprising: (1) sequence shuffling of a plurality of species of a genetic sequence, which species may differ by as little as a single nucleotide difference or may be substantially different, yet retain sufficient regions of sequence similarity or site-specific recombination junction sites to support shuffling recombination (this step is optionally reiterated before performing step 2, or can be repeatedly performed on material selected in step 2); (2) selection of the resultant shuffled genetic sequence to isolate or enrich a plurality of shuffled genetic sequences having a desired phenotype(s) (this step is also optionally reiterated); and (3) repeating steps (1) and (2) on the plurality of shuffled genetic sequences having the desired phenotype(s) until one or more variant genetic sequences encoding a sufficiently optimized desired phenotype is obtained.
  • alternative formats oligonucleotide mediated shuffling, or "in silico" formats are used to generate shuffled libraries.
  • the methods herein facilitate the "forced evolution" of a novel or improved genetic sequence to encode a desired phenotype which natural selection and evolution has heretofore not generated in the reference agricultural organism.
  • Shuffling and selection steps can be performed prior to introduction of materials into protoplasts, or subsequent to introduction of materials into protoplasts, or both.
  • a plurality of genetic sequences of the same gene locus from the same taxonomic classification of organism are shuffled and selected by the present method.
  • a common use of the method is to shuffle mutant variants of a genetic sequence of a plant or fungal genome or a genetic sequence of a microorganism which may function in a plant or fungus, to obtain a variant of the genetic sequence that possesses a novel desired phenotype or an improved desired phenotype.
  • the method can be used with a plurality of alleles, homologs, or cognate genes of a gentic locus, or even with a plurality of genetic sequences from related organisms, and in some instances with unrelated genetic sequences or portions thereof which have recombinogenic portions (either naturally or generated via genetic engineering or via in silico or oligonucleotide-mediated recombination methods).
  • the method can be used to evolve a heterologous sequence (e.g., a non-naturally occurring mutant gene) to optimize its phenotypic expression (e.g., function) in a particular genomic background, and/or in a particular host cell or expression system (e.g., an expression cassette or expression replicon).
  • a heterologous sequence e.g., a non-naturally occurring mutant gene
  • phenotypic expression e.g., function
  • host cell or expression system e.g., an expression cassette or expression replicon
  • sequence shuffling in broad application, consists of a method for generating a selected polynucleotide sequence or population of selected polynucleotide sequences, typically in the form of amplified and/or cloned polynucleotides, whereby the selected polynucleotide sequence(s) possess or encode a desired phenotypic characteristic (e.g., encode a polypeptide, promote transcription of linked polynucleotides, modify transformation efficiency, bind a protein, and the like) which can be selected for.
  • a desired phenotypic characteristic e.g., encode a polypeptide, promote transcription of linked polynucleotides, modify transformation efficiency, bind a protein, and the like
  • One method of identifying polypeptides that possess a desired structure or functional property involves the screening of a large library of polynucleotides for individual library members which possess or encode the desired structure or functional property conferred by the polynucleotide sequence.
  • the invention provides a method, termed "sequence shuffling", for generating libraries of recombinant polynucleotides having a desired characteristic which can be selected or screened for.
  • Libraries of recombinant polynucleotides are generated from a population of related-sequence polynucleotides which comprise sequence regions which have substantial sequence identity and can be homologously recombined in vitro or in vivo.
  • at least two species of the related-sequence polynucleotides are combined in a recombination system suitable for generating sequence-recombined polynucleotides, wherein said sequence- recombined polynucleotides comprise a portion of at least one first species of a related-sequence polynucleotide with at least one adjacent portion of at least one second species of a related-sequence polynucleotide.
  • Recombination systems suitable for generating sequence-recombined polynucleotides can be either: (1) in vitro systems for homologous recombination or sequence shuffling via amplification or other formats described herein, or (2) in vivo systems for homologous recombination or site-specific recombination as described herein.
  • the population of sequence- recombined polynucleotides comprises a subpopulation of polynucleotides which possess desired or advantageous characteristics and which can be selected by a suitable selection or screening method.
  • the selected sequence-recombined polynucleotides which are typically related-sequence polynucleotides, can then be subjected to at least one recursive cycle wherein at least one selected sequence- recombined polynucleotide is combined with at least one distinct species of related- sequence polynucleotide (which may itself be a selected sequence-recombined polynucleotide) in a recombination system suitable for generating sequence- recombined polynucleotides, such that additional generations of sequence-recombined polynucleotide sequences are generated from the selected sequence-recombined polynucleotides obtained by the selection or screening method employed.
  • related- sequence polynucleotide which may itself be a selected sequence-recombined polynucleotide
  • recursive sequence recombination generates library members which are sequence-recombined polynucleotides possessing desired characteristics.
  • characteristics can be any property or attribute capable of being selected for or detected in a screening system, and may include properties of: an encoded protein, a transcriptional element, a sequence controlling transcription, RNA processing, RNA stability, chromatin conformation, translation, or other expression property of a gene or transgene, a replicative element, a protein-binding element, or the like, such as any feature which confers a selectable or detectable property.
  • Nucleic acid sequence shuffling is a method for recursive in vitro or in vivo homologous or nonhomologous recombination of pools of nucleic acid fragments or polynucleotides (e.g., genes from agricultural organisms or portions thereof).
  • Mixtures of related nucleic acid sequences or polynucleotides are randomly or pseudorandomly fragmented, and reassembled to yield a library or mixed population of recombinant nucleic acid molecules or polynucleotides.
  • the present invention is directed to a method for generating a selected polynucleotide sequence (e.g., a plant gene or microbe gene, or combinations thereof) or population of selected polynucleotide sequences, typically in the form of amplified and/or cloned polynucleotides, whereby the selected polynucleotide sequence(s) possess a desired phenotypic characteristic (e.g., encode a polypeptide, promote transcription of linked polynucleotides, bind a protein, metabolize a compound, confer toxicity to insects or pathogenic viruses, and the like) which can be selected for, and whereby the selected polynucleotide sequences are genetic sequences having a desired functionality and/or conferring a desired phenotypic property to an agricultural organism in which the polynucleotide has been transferred into.
  • a desired phenotypic characteristic e.g., encode a polypeptide, promote transcription of linked polynucleotides
  • One method of identifying novel genetic sequences that possess a desired structure or functional property in a plant or soil microbe, such as having an altered metabolism involves the screening of a large library of recombinant sequences (which can be a component of a genome - e.g., part of a gene, non-coding transcriptional regulatory sequence, origin of replication, - or a complete genome of an organelle or microbe) for individual library members which possess the desired structure or functional property conferred by the novel genetic sequence.
  • a large library of recombinant sequences which can be a component of a genome - e.g., part of a gene, non-coding transcriptional regulatory sequence, origin of replication, - or a complete genome of an organelle or microbe
  • the invention provides a method, termed "sequence shuffling" for use in plants and other agricultural organisms of interest such as fungi and even animals, for generating libraries of recombinant polynucleotides having a desired characteristic which can be selected or screened for in the relevant system, e.g., in plant cell protoplasts or progeny thereof (plant cells, plants, etc.).
  • Libraries of recombinant polynucleotides are generated from a population of related-sequence polynucleotides which comprise sequence regions which have substantial sequence identity and can be homologously recombined in vitro or in vivo.
  • At least two species of the related-sequence polynucleotides are combined in a recombination system suitable for generating sequence-recombined polynucleotides, wherein said sequence-recombined polynucleotides comprise a portion of at least one first species of a related-sequence polynucleotide with at least one adjacent portion of at least one second species of a related-sequence polynucleotide.
  • Recombination systems suitable for generating sequence-recombined polynucleotides can be either: (1) in vitro systems for homologous recombination or sequence shuffling via amplification or other formats described herein, or (2) in vivo systems for homologous recombination or site-specific recombination as described herein, or template-switching of a retroviral genome replication event.
  • the population of sequence-recombined polynucleotides comprises a subpopulation of polynucleotides which possess desired or advantageous characteristics and which can be selected by a suitable selection or screening method.
  • the selected sequence-recombined polynucleotides which are typically related- sequence polynucleotides, can then be subjected to at least one recursive cycle wherein at least one selected sequence-recombined polynucleotide is combined with at least one distinct species of related-sequence polynucleotide (which may itself be a selected sequence-recombined polynucleotide) in a recombination system suitable for generating sequence-recombined polynucleotides, such that additional generations of sequence-recombined polynucleotide sequences are generated from the selected sequence-recombined polynucleotides obtained by the selection or screening method employed.
  • recursive sequence recombination generates library members which are sequence-recombined polynucleotides possessing desired characteristics.
  • characteristics can be any property or attribute capable of being selected for or detected in a screening system, and may include properties of: an encoded protein, a transcriptional element, a sequence controlling transcription, RNA processing, RNA stability, chromatin conformation, translation, or other expression property of a gene or transgene, a replicative element, a protein-binding element, or the like, such as any feature which confers a selectable or detectable property.
  • Screening/selection produces a subpopulation of genetic sequences (or protoplasts, plants fungi or cells) expressing recombinant forms of gene(s) that have evolved toward acquisition of a desired property. These recombinant forms can then be subjected to further rounds of recombination and screening/selection in any order. For example, a second round of screening/selection can be performed analogous to the first resulting in greater enrichment for genes having evolved toward acquisition of the desired property.
  • the stringency of selection can be increased between rounds (e.g., if selecting for drug resistance, the concentration of drug in the media can be increased).
  • the method of shuffling can generate libraries of polynucleotides (microbial enzymes adapted to perform a desired catalytic process in a plant cell, transgene polynucleotides) encoding selectable properties, which can compose all or a part of a genetic sequence or host cell transgene, wherein the library is suitable for function optimization of a gene or regulatory sequence or phenotypic screening.
  • polynucleotides microbial enzymes adapted to perform a desired catalytic process in a plant cell, transgene polynucleotides
  • the method can include (1) obtaining a first plurality of library members comprising an agricultural organism genome, gene, regulatory or replication sequence, or host cell transgene (or encoding sequence or expression cassette thereof), and obtaining from said library a polynucleotide, or copy thereof, complete or partial, of at least one selected library member having a detectable desired phenotype, optionally introducing mutations into said polynucleotide or copy(ies), and (2) shuffling these nucleic acids by any available method, e.g., by pooling and fragmenting, by nuclease digestion, partial extension PCR amplification, PCR stuttering, or other suitable fragmenting means, typically producing random fragments or fragment equivalents, said selected polynucleotide(s) or copies to form fragments thereof under conditions suitable for PCR amplification, performing PCR amplification and optionally mutagenesis, and thereby homologously recombining said fragments to form a shuffled pool of recombined polynucleo
  • the method comprises the additional step of screening the library members of the shuffled pool to identify individual shuffled library members having the desired functional ability or phenotype.
  • novel shuffled genes, genome sequences, and transgene sequences that are identified from such libraries can be used and/or can be subjected to one or more additional cycles of shuffling and/or functional optimization or phenotype selection for further optimization.
  • the method can be modified such that the step of selecting is for a phenotypic characteristic other than a metabolic trait, gene function, transcriptional regulatory sequence function, or the like. Oligonucleotide and in silico shuffling approaches can also be used.
  • the first plurality of selected library members is fragmented and homologously recombined by PCR in vitro.
  • Fragment generation is by nuclease digestion, partial extension PCR amplification, PCR stuttering, or other suitable fragmenting means, such as described herein and in WO95/22625 published 24 August 1995, and in commonly owned U.S.S.N. 08/621,859 filed 25 March 1996, PCT/US96/05480 filed 18 April 1996, which are incorporated herein by reference).
  • Stuttering is fragmentation by incomplete polymerase extension of templates.
  • a recombination format based on very short PCR extension times can be employed to create partial PCR products, which continue to extend off a different template in the next (and subsequent) cycle(s), and effect de facto fragmentation.
  • Template-switching and other formats which accomplish sequence shuffling between a plurality of sequence-related polynucleotides can be used. Such alternative formats will be apparent to those skilled in the art.
  • the first plurality of selected library members is fragmented in vitro, the resultant fragments transferred into a host cell or organism and homologously recombined to form shuffled library members in vivo.
  • the first plurality of selected library members is cloned or amplified on episomally replicable vectors, a multiplicity of said vectors is transferred into a cell and homologously recombined to form shuffled library members in vivo.
  • the first plurality of selected library members is not fragmented, but is cloned or amplified on an episomally replicable vector as a direct repeat or indirect (or inverted) repeat, which each repeat comprising a distinct species of selected library member sequence, said vector is transferred into a cell and homologously recombined by intra- vector or inter-vector recombination to form shuffled library members in vivo.
  • first plurality of selected library members is replicated under conditions wherein retroviral template switching between at least two xenogeneic genomes cloned into retrovirus vectors occurs, typically involving non- retroviral genes cloned into a retroviral replication system.
  • viral (and viral vector) systems such as gemini viruses, positive stranded RNA viruses and DNA viruses can be used.
  • combinations of in vitro and in vivo shuffling are provided to enhance combinatorial diversity.
  • the recombination cycles in vitro or in vivo
  • the recombination cycles can be performed in any order desired by the practitioner.
  • the present invention provides a method for generating libraries of shuffled polynucleotides suitable for functional screening (i.e., which is measured without respect to a phenotype conferred on a plant or related agricultural organism) or phenotypic screening (i.e., which is detected as a phenotype of a plant or other agricultural organism).
  • the method generally comprises (1) obtaimng a first plurality of selected library member polynucleotides comprising a polynucleotide conferring a selectable phenotype, and wherein said selected library member polynucleotides comprise a region of substantially identical sequence, optionally introducing mutations into said library member polynucleotides or copies, and (2) pooling and fragmenting, by chemical fragmentation, nuclease digestion, partial extension PCR amplification, PCR stuttering, site-specific recombination, or other suitable fragmenting means, typically producing random fragments or fragment equivalents, to form fragments thereof under conditions suitable for PCR amplification, performing PCR amplification and optionally mutagenesis, and thereby homologously recombining said fragments to form a shuffled pool of recombined polynucleotides, whereby a substantial fraction (e.g., greater than 10 percent) of the recombined polynucleotides of said shu
  • the method can be modified such that the step of selecting is for a phenotypic characteristic not naturally found in the host organism (e.g., for a herbicide catalytic activity, viral resistance, drug resistance, or other non-native detectable phenotype conferred on a host cell or organism).
  • the method can be modified such that the step of selecting is for a modified phenotype which is enhanced or diminished, or otherwise changed in character, as compared to the phenotype which naturally occurs in the host cell or host organism.
  • the first plurality of selected library members is fragmented and homologously recombined by PCR in vitro.
  • Fragment generation is by nuclease digestion, partial extension PCR amplification, PCR stuttering, or other suitable fragmenting means, such as described herein and in the documents incorporated herein by reference.
  • Stuttering is fragmentation by incomplete polymerase extension of templates.
  • the first plurality of selected library members is fragmented in vitro, the resultant fragments transferred into a host cell or organism and homologously recombined to form shuffled library members in vivo.
  • the host cell is a plant cell which has been engineered to contain enhanced recombination systems, such as an enhanced system for general homologous recombination (e.g., a plant expressing a recA protein or a plant recombinase from a transgene or plant virus) or a site-specific recombination system (e.g., a cre/LOX or frt/FLP system encoded on a transgene or plant virus).
  • enhanced recombination systems such as an enhanced system for general homologous recombination (e.g., a plant expressing a recA protein or a plant recombinase from a transgene or plant virus) or a site-specific recombination system (e.g., a cre/
  • the first plurality of selected library members is cloned or amplified on episomally replicable vectors, a multiplicity of said vectors is transferred into a cell and homologously recombined to form shuffled library members in vivo in a plant cell, fungal cell, algae cell, or bacterial cell.
  • Other cell types may be used, if desired.
  • the first plurality of selected library members is not fragmented, but is cloned or amplified on an episomally replicable vector as a direct repeat or indirect (or inverted) repeat, which each repeat comprising a distinct species of selected library member sequence, said vector is transferred into a cell and homologously recombined by intra-vector or inter- vector recombination to form shuffled library members in vivo in a plant cell, algae cell, or microorganism.
  • combinations of in vitro and in vivo shuffling are provided to enhance combinatorial diversity.
  • shuffling provides methods, compositions, and uses related to creating novel or improved plants, plant cells, algal cells, soil microbes, plant pathogens, pharmaceuticals, commensal microbes, or other plant-related organisms having art-recognized importance to the agricultural, horticultural, and argonomic areas (collectively, "agricultural organisms").
  • the invention provides a method for creating or altering a phenotype of an agricultural organism by introducing a shuffled polynucleotide into said agricultural organism to generate a modified agricultural organism having a phenotype conferred by the introduced shuffled polynucleotide.
  • the invention also provides the modified agricultural organisms made by this method, and uses thereof.
  • the method comprises the further step of performing a selection or screening step on the modified agricultural organism to identify or quantitate a detectable phenotypic property.
  • phenotypes can be, for example and not limitation, a herbicide-resistance trait, organ morphology, life-cycle modification (e.g., conversion of a short-day plant into a long-day plant, rapid fruit formation, delayed ripening, suppressed seed formation), metabolic biosynthesis (e.g., carbon-fixation efficiency, lipid content, bulk protein composition, starch content, etc.), or any phenotype that the artisan skilled in agriculture, botany, plant sciences, plant pathology, biochemistry, nutrition, food processing, or horticulture would recognize as a detectable phenotype.
  • the invention provides a method for obtaining polynucleotide sequences conferring a desired phenotype on an agricultural organism, the method comprising the steps of: (1) contacting or transforming a population of plant cells, algae cells, bacterial cells, fungal cells plant viruses, plants or explanted organs therefrom, with a first plurality of polynucleotide species having at least one region of substantial sequence identity to support shuffling to generate a first transformed population, (2) selecting, from the first transformed population, a subpopulation having at least one desired phenotype, and recovering from the subpopulation a plurality of selected polynucleotide species, (3) recombining, by shuffling, said plurality of said selected polynucleotide species, thereby generating a collection of shuffled polynucleotide species, and (4) contacting or transforming a population of plant cells, algae cells, bacterial cells, plant viruses, plants or explanted organs therefrom, with said collection of shuffled polynu
  • At least one, preferably a plurality of, selected, shuffled polynucleotide(s) are recovered from the at least one cell or organism selected from the second population and having at least one desired phenotype; the selected, shuffled polynucleotide(s) are subjected to at least one subsequent round of shuffling (with each other, with related unshuffled sequences, with spike sequences, with mutagenic methods, or the like), transformation or contacting, and selection; this additional step can be repeated iteratively (with or without modification or variance in one or more cycles) from 1 to about 1000 cycles or as deemed suitable by the practitioner.
  • the recombination in step (3) is performed in vitro or by an in vivo recombination method which substantially does not occur naturally in a plant cell at a recombination frequency of more than 10% of the frequency of the recombination methods described herein for polynucleotide sequence shuffling.
  • naturally occurring in vivo recombination mechanisms of plants, agricultural microorganisms, or vector-host cells for intermediate replication can be used in conjunction with a collection of shuffled polynucleotide sequence variants having a desired phenotypic property to be optimized further; in this way, a natural recombination mechanism can be combined with intelligent selection of variants in an iterative manner to produce optimized variants by "forced evolution", wherein the forced evolved variants are not expected to, nor are observed to, occur in nature, nor are predicted to occur at an appreciable frequency.
  • the practitioner may further elect to supplement and/or the mutational drift by introducing intentionally mutated polynucleotide species suitable for shuffling, or portions thereof, into the pool of initial polynucleotide species and/or into the plurality of selected, shuffled polynucleotide species which are to be recombined.
  • Mutational drift may also be supplemented by the use of mutagens (e.g., chemical mutagens or mutagenic irradiation), or by employing replication conditions which enhance the mutation rate.
  • the invention provides a method of performing recursive shuffling on a transgene portion or complete transgene, comprising: (1) introducing into a population of site-specific recombination plant cells a site-specific recombination transgene having loxP or FLP sites, or equivalents, and obtaining site-specific integration or recombination of the transgene into a site-specific target site in the plant genome, (2) selecting from the population of plant cells a subpopulation having or encoding a desired phenotype, which may be an enzymatic function, a morphological trait observable following regeneration from the plant cell, or the like, (3) recovering a plurality of transgene sequences from the subpopulation, (4) shuffling the recovered transgene sequences to create a shufflant library of transgenes having suitable site- specific recombination site(s), and (5) repeating steps 1 through 5 with the shufflant library of transgenes for at least one cycle of recursion, preferably for sufficient
  • the selection step involves a biochemical assay or herbicide resistance assay that can be performed in plant cell culture without substantial development of an adult plant organism, and preferably is done in a high-throughput format, as by cell colony screening (e.g., using a reporter system in the cells) or by multiwell plate format, otr the like.
  • the invention provides regenerable plant cells and non-regenerable plant cell lines having homologous recombination systems with a detectable recombination frequency of at least 50 percent greater than the naturally-occurring plant cells of the same species and cell type.
  • An embodiment comprises a plant cell expressing a transgene-encoded heterologous recombinase (e.g., recA or the like), which may be of plant origin, animal origin (e.g., a general recombinase, the V-D-J recombinase, and the like), fungal origin, or bacterial origin (e.g., recA).
  • a transgene-encoded heterologous recombinase e.g., recA or the like
  • animal origin e.g., a general recombinase, the V-D-J recombinase, and the like
  • fungal origin e.g., bacterial origin
  • a method of the invention employs such plant cells expressing a recombinase and homologous transgene constructs, to facilitate homologous gene targeting and homologous transgene integration into plant genomes so as to either inactivate or replace an endogenous plant gene, and/or to homologously integrate a heterologous gene into a plant genome.
  • the invention also provides for the shuffled polynucleotide sequence(s) conferring the desired phenotype(s) on an agricultural organism, and the modified agricultural organisms themselves, produced by the method of polynucleotide sequence shuffling; the exact structures of said produced polynucleotide sequences and modified agricultural organisms are definable a priori only by reference to the method by which they are generated.
  • the invention includes a shuffled polynucleotide sequence conferrring the desired phenotype, or a plurality thereof, produced by the methods described herein.
  • the shuffled polynucleotides(s) produced thereby are easily distinguishable from naturally occurring genome sequences by virtue of their atypical modified or novel phenotype(s) which is/are normally not present in the population of naturally occurring agricultural organism.
  • the shuffled polynucleotide sequence can be further distinguished from naturally-occurring plant, animal, or microbe genome sequences by reference to sequence databases and published sequence data, wherein the shuffled polynucleotide will generally comprise a constellation of mutations as compared to the reference dataset which would be recognized by the skilled artisan as a polynucleotide sequence which is substantially improbable of having evolved by natural evolution or classical breeding.
  • one or more encoding sequences or transcriptional regulatory sequences derived from a plant genome are jointly or separately optimized (or improved for function) in a predetermined plant cell and/or host plant species as distinct genetic elements isolated from the remainder of the plant genome.
  • the optimized or improved portions of the encoding sequence and/or transcriptional regulatory sequence is then introduced into the plant genome(s).
  • the optimized or improved portions can be used in conjunction with one or more heterologous polynucleotide sequence(s), such as genes or transcriptional regulatory sequences from other plant species or from non-plant genomes to confer a desired functional or structural property, such as transcriptional regulation or franslational regulation, to the improved portions.
  • Optimized or improved portions of a plant gene often can be marketed as a commercial product, either alone or in combination with one or more heterologous sequences.
  • the invention also encompasses compositions of such shuffled plant polynucleotides encoding at least one modified phenotype of an agricultural organism.
  • the compositions can include a plurality of species of shuffled polynucleotides, or can represent a single purified polynucleotide species.
  • Certain shuffled polynucleotides encode variants which possess detectable phenotypes that are not naturally occurring and which can be selected for; selected phenotypes often are characterized by desirable properties.
  • At least two additional related formats are useful in the practice of the present invention, i.e., for producing libraries of shuffled materials to be screened in protoplasts. These additional methods can be used individually or in combination with each other and with the formats noted herein, e.g., those above.
  • the first, referred to as "in silico" shuffling utilizes computer algorithms to perform "virtual" shuffling using genetic operators in a computer.
  • gene sequence strings are recombined in a computer system and desirable products (such as libraries for transduction into protoplasts) are made, e.g., by reassembly PCR of synthetic oligonucleotides.
  • desirable products such as libraries for transduction into protoplasts
  • genetic operators are used to model recombinational or mutational events which can occur in one or more nucleic acid, e.g., by aligning nucleic acid sequence strings (using standard alignment software, or by manual inspection and alignment) and predicting recombinational outcomes.
  • the predicted recombinational outcomes are used to produce corresponding molecules, e.g., by oligonucleotide synthesis and reassembly PCR.
  • the second useful format is referred to as "oligonucleotide mediated shuffling" in which oligonucleotides corresponding to a family of related homologous nucleic acids (e.g., as applied to the present invention, interspecific or allelic variants of a nucleic acid) which are recombined to produce selectable nucleic acids.
  • This format is described in detail in Crameri et al. "OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION" filed February 5, 1999, USSN 60/118,813 and Crameri et al. "OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION” filed June 24, 1999, USSN 60/141,049.
  • the technique can be used to recombine homologous or even non-homologous nucleic acid sequences.
  • oligonucleotide-mediated recombination is the ability to recombine homologous nucleic acids with low sequence similarity, or even non-homologous nucleic acids.
  • these low-homology oligonucleotide shuffling methods one or more set of fragmented nucleic acids are recombined, e.g., with a with a set of crossover family diversity oligonucleotides.
  • crossover oligonucleotides have a plurality of sequence diversity domains corresponding to a plurality of sequence diversity domains from homologous or non-homologous nucleic acids with low sequence similarity.
  • the fragmented oligonucleotides which are derived by comparison to one or more homologous or non-homologous nucleic acids, can hybridize to one or more region of the crossover oligos, facilitating recombination.
  • sets of overlapping family gene shuffling oligonucleotides (which are derived by comparison of homologous nucleic acids and synthesis of oligonucleotide fragments) are hybridized and elongated (e.g., by reassembly PCR), providing a population of recombined nucleic acids, which can be selected for a desired trait or property.
  • the set of overlapping family shuffling gene oligonucleotides include a plurality of oligonucleotide member types which have consensus region subsequences derived from a plurality of homologous target nucleic acids.
  • family gene shuffling oligonucleotide are provided by aligning homologous nucleic acid sequences to select conserved regions of sequence identity and regions of sequence diversity.
  • a plurality of family gene shuffling oligonucleotides are synthesized (serially or in parallel) which correspond to at least one region of sequence diversity.
  • Sets of fragments, or subsets of fragments used in oligonucletoide shuffling approaches can be provided by cleaving one or more homologous nucleic acids (e.g., with a DNase), or, more commonly, by synthesizing a set of oligonucleotides corresponding to a plurality of regions of at least one nucleic acid (typically oligonucleotides corresponding to a full-length nucleic acid are provided as members of a set of nucleic acid fragments).
  • homologous nucleic acids e.g., with a DNase
  • synthesizing a set of oligonucleotides corresponding to a plurality of regions of at least one nucleic acid typically oligonucleotides corresponding to a full-length nucleic acid are provided as members of a set of nucleic acid fragments.
  • these cleavage fragments can be used in conjunction with family gene shuffling oligonucleotides, e.g., in one or more recombination reaction to produce recombinant nucleic acids.
  • a first nucleic acid sequence encoding a first polypeptide sequence is selected.
  • a plurality of codon altered nucleic acid sequences, each of which encode the first polypeptide, or a modified or related polypeptide is then selected (e.g., a library of codon altered nucleic acids can be selected in a biological assay which recognizes library components or activities), and the plurality of codon-altered nucleic acid sequences is recombined to produce a target codon altered nucleic acid encoding a second protein.
  • the target codon altered nucleic acid is then screened for a detectable functional or structural property, optionally including comparison to the properties of the first polypeptide and/or related polypeptides.
  • a nucleic acid encoding such a polypeptide can be used in essentially any procedure desired, including introducing the target codon altered nucleic acid into a cell, vector, virus, attenuated virus (e.g., as a component of a vaccine or immunogenic composition), transgenic organism, or the like.
  • the present method can be used to create variant plant genes which exhibit altered function, stability, or expression by employing the rapid forced evolution of shuffling to generate variant genetic sequences that are adapted to the desired phenotype which is expressible in plant cell culture or in a regenerated plant or plant organ.
  • the method is general and can be employed to modify a genetically conferred phenotype of substantially any agricultural organism suitable for recursive sequence shuffling.
  • the present method can also be employed to force evolution of plant host cells and polygenic transgenes to support enhanced transformation by
  • Multiple genetic sequences may be allowed to co-evolve, or the individual genetic sequences can be optimized individually and later recombined.
  • the present method can be used with substantially any type of agricultural organism having a genome or gene portion suitable for in vitro or in vivo sequence shuffling with expression in plant cells and phenotype selection thereon.
  • the recovered sequences can be shuffled with other genetic sequences and/or with one or more spiked polynucleotide specie(s) (e.g., mutation-bearing gene sequences or mutation-bearing sequences), which may include optimized components of a genotype that have been separately optimized by shuffling.
  • Optimized components typically can include expression cassettes encoding plant or microbe metabolic genes, plant viral sequences, origins or replication, non-coding sequences important for replication, transcriptional control sequences, xenogeneic proteins, and the like. It is also possible to combine one or more cycle(s) of individual component/segment evolution with one or more cycle(s) of collective component/segment evolution, in any order.
  • a plurality of genetic sequences are shuffled and the resultant shuffled genetic sequences are selected for the capacity to confer a desired phenotype to a host cell or organism harboring the shuffled sequence(s).
  • the present invention provides a method for generating libraries of genomes or genetic sequences suitable for phenotype screening, such as to generate enhanced function in a cell type and/or agricultural organism species, modify metabolism, resistance phenotype, or other desired property.
  • the method comprises (1) obtaining a first plurality of library members comprising a genome polynucleotide or portion thereof, (2) pooling and fragmenting said polynucleotides or copies to form fragments thereof under conditions suitable for PCR amplification and thereby homologously recombining said fragments to form a shuffled pool of recombined polynucleotides comprising novel combinations of sequences, whereby a substantial fraction (e.g., greater than 10 percent) of the recombined polynucleotides of said shuffled pool comprise genome sequence combinations which are not present in the first plurality of library members, said shuffled pool composing a library of viral genome sequences comprising sequence combinations suitable for phenotype screening.
  • the plurality of selected shuffled library members can be shuffled and screened iteratively, from 1 to about 1000 cycles or as desired until library members having a desired binding affinity are obtained. Often, from 2 to 25 cycles of recursion are performed before a sufficiently optimized shufflant (i.e., selected shuffled library member) is obtained.
  • a sufficiently optimized shufflant i.e., selected shuffled library member
  • the degree of optimization for any particular application will vary based on the specific intended use and other considerations (e.g., time, minimization of mutational drift, etc.) that are selected by the practitioner.
  • the format of the assay used to select library members will depend on the trait to be selected. For example, where the desired trait is herbicide resistance, survival of cells or protoplasts on media containing herbicides can be used to select desirable herbicide resistance traits. Similarly, where production of a metabolite (e.g., an oil, vitamin, phytohormone or phytochemical) is desired, the presence of the metabolite can be monitored, e.g., in a high-throughput fashion.
  • a metabolite e.g., an oil, vitamin, phytohormone or phytochemical
  • one high throughput method for detecting analyte molecules from a complex biological mixture is by electrospray tandem mass spectrometry as taught in "HIGH THROUGHPUT MASS SPECTROMETRY" by Sun Ai Raillard, USSN 60/119,766, filed 02/11/1999.
  • methods which utilize off-line parallel sample purification and fast flow-injection analysis typically reducing the time of analysis to 30 to 40 seconds per sample. All steps starting from cell/protoplast picking, growth, sample preparation and analysis are automated and can be carried out overnight by various robotic workstations.
  • the ability to detect a subtle increase in the performance of a shuffled library member over that of a parent strain relies on the sensitivity of the assay.
  • the chance of finding the organisms having an improvement is increased by the number of individual mutants that can be screened by the assay.
  • a prescreen that increases the number of mutants processed by 10-fold can be used.
  • the goal of the primary screen is to quickly identify mutants having roughly equal or better product titers than the parent strain(s) and to move only these mutants forward to liquid cell culture for subsequent analysis.
  • FORCED EVOLUTIONOF GENES The invention provides a means to evolve gene variants and/or host cells, as well as providing a model system for evaluating a library of agents to identify candidate agents that could find use as agricultural reagents (e.g., herbicide) for commercial applications.
  • agricultural reagents e.g., herbicide
  • the methods of the invention can be used to force the evolution of a gene which has a beneficial property in one organism into a shuffled variant that can confer that same phenotype to a second organism in which the gene was substantially non-functional or inadequate.
  • Suitable transcriptional regulatory sequences include: cauliflower mosaic virus 19S and 35S promoters, NOS promoter, OCS promoter, rbcS promoter, Brassica heat shock promoter, synthetic promoters, non-plant promoters modified, if advantageous, for function in plant cells, substantially any promoter that naturally occurs in a plant genome, promoters of plant viruses or Ti plasmids, tissue- preferential promoters or cis-acting elements, light-responsive promote :s or cis-acting elements (e.g., rbcS LRE), hormone-responsive cis-acting elements, developmental stage-specific promoters and cis-acting elements, viral promoters (e.g., from Tobacco),
  • a transcriptional regulatory sequence from a first plant species is optimized for functionality in a second plant species by application of recursive sequence shuffling.
  • the "granularity" of a shuffling event refers to the relative average density of recombination joints per unit length (e.g., per kilobase) or per recombined polynucleotide molecule (e.g., per functional viral genome).
  • a coarse granularity could be an average of one or less recombination joint per polynucleotide resulting from a shuffling (i.e., sequence recombination event); a coarse granularity of shuffling generates a "low crossover library.” It is often desirable to alter the granularity of shuffling in different recursion cycles, although this is not necessary in many cases.
  • the granularity desired can frequently be selected by the practitioner and is typically accomplished by controlling the degree of recombination in the recombination format selected (e.g., for a fragmentation reassembly format, a high degree of fragmentation will generate a small average fragment size and hence a finer granularity; increasing the number of polynucleotide species shuffled can also be used to obtain finer granularity, among other ways apparent to those skilled in the art upon review of the many references incorporated herein related to shuffling).
  • the average size of segment from the parental sequence(s) represented in the library of sequence- recombined polynucleotides is denoted as the "average segment length", and may be expressed by unit length (e.g., per kilobase) or as a fraction of the parental sequence
  • the present method permits the construction of a library of shuffled genes (or gene portions) wherein the library contains a population of shuffled genes of any granularity desired by the practitioner.
  • Libraries prepared from a plurality of parental genes can be made to have substantially any granularity; for example a gene library having, on average, at least two recombination joints (e.g., three distinct segments) per sequence-recombined genome can be generated, as can viral genomes having three, four, five, six, seven, eight, nine, ten, or more recombination joints (e.g., a genomic polynucleotide composed of 4, 5, 6, 7, 8, 9, 10, or 11 or more distinct sequence segments).
  • the basic sequence shuffling methodology can be used to shuffle a collection of related sequences, wherein most or all of the related sequences substantially span a certain physical portion of a gene or genome (e.g., a structural gene, a transcriptional regulatory sequence, a replication origin, or an entire viral genome).
  • a gene or genome e.g., a structural gene, a transcriptional regulatory sequence, a replication origin, or an entire viral genome.
  • the collection of related polynucleotides could represent, e.g., alleles of a gene locus, variant genes).
  • One methodological modification to focus sequence diversity on a particular segment of a genome is to "spike" a recombination reaction with additional polynucleotides which represent only a subset of the locus being shuffled.
  • These "spiking polynucleotides” can enhance the potential sequence diversity at the locus subset (e.g., randomly or pseudorandomly increase mutation density at the locus subset), or can overrepresent (or underrepresent) certain predetermined sequences in order to steer the sequence diversity in a predetermined direction (e.g., to overrepresent mutations which tend to produce a beneficial result based on prior results).
  • Superfluous mutations can be removed by backcrossing, which is shuffling the selected shuffled gene(s) with one or more parental gene and/or naturally-occurring gene(s) (or portions thereof) and selecting the resultant collection of shufflants for those species that retain the desired phenotype.
  • a pea Rubisco subunit gene (small subunit) can be shuffled and selected for the capacity to substantially function in any Angiosperm plant cells; the resultant selected shufflants can be backcrossed with one or more Rubisco genes of a particular plant species and selected for the capacity to retain the capacity to confer the phenotype. After several cycles of such backcrossing, the backcrossing will yield gene(s) which contain the mutations necessary for the desired phenotype, and will otherwise have a genomic sequence substantially identical to the genome(s) of the host genome.
  • Isolated components e.g., genes, regulatory sequences, packaging sequences, replication origins, and the like
  • parental sequences e.g., genes, regulatory sequences, packaging sequences, replication origins, and the like
  • Transgenes and expression vectors can be constructed by any suitable method known in the art; by either PCR or RT-PCR amplification from a suitable cell type or by ligating or amplifying a set of overlapping synthetic oligonucleotides; publicly available sequence databases and the literature can be used to select the polynucleotide sequence(s) to encode the specific protein desired, including any mutations, consensus sequence, or mutation kernal desired by the practitioner.
  • the coding sequence(s) are operably linked to a transcriptional regulatory sequence and, if desired, an origin of replication.
  • Antisense or sense-suppression transgenes and genetic sequences can be optimized or adapted for particular host cells and organisms by the described methods.
  • transgene(s) and or expression vectors are transferred into host cells, protoplasts, pluripotent embryonic plant cells, microbes, or fungi by a suitable method, such as for example lipofection, electroporation, microinjection, biolistics,
  • Stable transfectant host cells can be prepared by art-known methods, as can transgenic cell lines. Phenotypic Traits
  • traits also include traits (or "phenotypic traits" or
  • phenotypes are selectable with appropriate procedures and sufficient numbers of fransgenotes.
  • Such traits include, but are not limited to, visible traits, environmental or stress related traits, disease related traits and ripening traits, such traits also include flower or plant color, flower shape and size, leaf shape and size, flower number per plant, leaf number per plant, pest resistance, plant height, plant bushiness, time to flowering, cold hardiness, drought tolerance, tolerance to high temperatures, chemical resistance, flavor, and aroma. These traits are dependent upon the synthesis of structural proteins and enzymes which catalyze biosynthetic or degradative reactions of plant metabolism.
  • plant refers to either a whole plant, a plant part, a plant cell, or a group of plant cells.
  • the class of plants which can be used in the method of the invention is generally as broad as the class of higher plants amenable to protoplast transformation techniques, including both monocotyledonous and dicotyledonous plants. It includes plants of a variety of ploi y levels, including polyploid, diploid and haploid, and may employ non-regenerable cells for certain aspects which do not require development of an adult plant for selection or in vivo shuffling.
  • transformation means alteration of the genotype of a host plant by the introduction of a nucleic acid sequence.
  • the nucleic acid sequence need not necessarily originate from a different source, but it will, at some point, have been external to the cell into which it is to be introduced.
  • the foreign nucleic acid is mechanically transferred by microinjection directly into plant cells by use of micropipettes.
  • the foreign nucleic acid may be transferred into the plant cell by using polyethylene glycol.
  • This forms a precipitation complex with the genetic material that is taken up by the cell e.g., by incubation of protoplasts with "naked DNA” in the presence of polyethylenelycol)(Paszkowski et al., (1984) EMBO J. 3:2717-22; Baker et al (1985) Plant Genetics, 201-211; Li et al. (1990) Plant Molecular Biology Report 8(4)276-291].
  • the introduced gene may be introduced into the plant or other cells by electroporation (Fromm et al., (1985) "Expression of Genes Transferred into Monocot and Dicot Plant Cells by
  • Electroporation Proc. Natl Acad. Sci. USA 82:5824, which is incorporated herein by reference).
  • plant protoplasts are electroporated in the presence of plasmids or nucleic acids containing the relevant genetic construct. Electrical impulses of high field strength reversibly permeabilize biomembranes allowing the introduction of the plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form a plant callus. Selection of the transformed plant cells with the transformed gene can be accomplished using phenotypic markers.
  • Cauliflower mosaic virus may also be used as a vector for introducing the foreign nucleic acid into plant and other cells (Hohn et al., (1982) "Molecular Biology of Plant Tumors," Academic Press, New York, pp.549-560;
  • CaMV viral DNA genome is inserted into a parent bacterial plasmid creating a recombinant DNA molecule which can be propagated in bacteria.
  • the recombinant plasmid again may be cloned and further modified by introduction of the desired DNA sequence into the unique restriction site of the linker.
  • the modified viral portion of the recombinant plasmid is then excised from the parent bacterial plasmid, and used to inoculate the plant cells or plants.
  • tobacco mosaic virus, potato virus or other viral systems can be used.
  • nucleic acid segments Another method of introduction of nucleic acid segments is high velocity ballistic penetration by small particles with the nucleic acid either within the matrix of small beads or particles, or on the surface (Klein et al., (1987) Nature 327:70-73). Although typically only a single introduction of a new nucleic acid segment is required, this method particularly provides for multiple introductions.
  • a method of introducing the nucleic acid segments into plant cells is to infect a plant cell, an explant, a meristem or a seed with Agrobacterium tumefaciens transformed with the segment. Under appropriate conditions known in the art, the transformed plant cells are grown to form shoots, roots, and develop further into plants.
  • the nucleic acid segments can be introduced into appropriate plant cells, for example, by means of the Ti plasmid of Agrobacterium tumefaciens.
  • the Ti plasmid is transmitted to plant cells upon infection by Agrobacterium tumefaciens, and is stably integrated into the plant genome (Horsch et al., (1984) "Inheritance of Functional Foreign Genes in Plants," Science. 233:496-498; Fraley et al., (1983) Proc. Natl. Acad. Sci. USA 80:4803).
  • Ti plasmids contain two regions essential for the production of transformed cells. One of these, named transfer DNA (T DNA), induces tumor formation. The other, termed virulent region, is essential for the introduction of the T DNA into plants.
  • T DNA transfer DNA
  • the transfer DNA region which transfers to the plant genome, can be increased in size by the insertion of the foreign nucleic acid sequence without its transferring ability being affected.
  • the modified Ti plasmid can then be used as a vector for the transfer of the gene constructs of the invention into an appropriate plant cell, such being a "disabled Ti vector.”
  • an appropriate plant cell such being a "disabled Ti vector.”
  • All plant cells which can be transformed by Agrobacterium and whole plants regenerated from the transformed cells can also be transformed according to the invention so as to produce transformed whole plants which contain the transferred foreign nucleic acid sequence.
  • Method (1) uses, e.g., an established culture system that allows culturing protoplasts and plant regeneration from cultured protoplasts.
  • Method (2) uses, e.g., (a) plant cells or tissues that can be transformed by Agrobacterium and (b) induced to regenerate into whole plants.
  • Method (3) uses, e.g., micropropagation.
  • two plasmids are used: a T-DNA containing plasmid and a vir plasmid. Any one of a number of T-DNA containing plasmids can be used; the main caveat is that it may be desirable to be able to select independently for each of the two plasmids.
  • those plant cells or plants transformed by the Ti plasmid so that the desired DNA segment is integrated can be selected by an appropriate phenotypic marker.
  • These phenotypic markers include, but are not limited to, antibiotic resistance, herbicide resistance or visual observation. Other phenotypic markers are known in the art and may be used in this invention.
  • Protoplasts can be prepared for both bacterial and eukaryotic cells, including mammalian cells, fungal cells and plant cells, by several means, including chemical treatment to strip cell walls.
  • cell walls can be stripped by digestion with a cell wall degrading enzyme such as lysozyme in a 10-20% sucrose, 50 mM EDTA buffer. Conversion of cells to spherical protoplasts can be monitored by phase-contrast microscopy.
  • Protoplasts can also be prepared by propagation of cells in media supplemented with an inhibitor of cell wall synthesis, or use of mutant strains lacking capacity for cell wall formation.
  • Eukaryotic cells are optionally synchronized in Gl phase by arrest with inhibitors such as ⁇ -factor, K. lactis killer toxin, leflonamide and adenylate cyclase inhibitors.
  • some protoplasts to be fused can be killed and or have their DNA fragmented by treatment with ultraviolet irradiation, hydroxylamine or cupferon (Reeves et al., FEMS Microbiol. Lett. 99, 193-198 (1992)).
  • killed protoplasts are referred to as donors, and viable protoplasts as acceptors.
  • dead donors cells e.g., comprising a previously introduced shuffled library
  • breaking up DNA in donor cells is advantageous for stimulating recombination with acceptor DNA.
  • acceptor and/or fused cells can also be briefly, but nonlethally, exposed to UV irradiation further to stimulate recombination in the protoplast or in protoplast fusions.
  • protoplasts can be stabilized in a variety of osmolytes and compounds such as sodium chloride, potassium chloride, sodium phosphate, potassium phosphate, sucrose, sorbitol, etc., e.g., in the presence of DTT.
  • the combination of buffer, pH, reducing agent, and osmotic stabilizer can be optimized for different cell types.
  • Protoplasts can be induced to fuse by treatment with a chemical such as PEG, calcium chloride or calcium propionate or electro fusion (Tsoneva, Acta Microbiologica Bulgaria 24, 53-59 (1989)). A method of cell fusion employing electric fields has also been described. See Chang US, 4,970,154. Conditions can be optimized for different strains.
  • Fused cells are heterokaryons containing genomes from two or more component protoplasts.
  • Fused cells can be enriched from unfused parental cells by sucrose gradient sedimentation or cell sorting.
  • the two nuclei in the heterokaryons can fuse (karyogamy) and homologous recombination can occur between the genomes.
  • the chromosomes can also segregate asymmetrically resulting in regenerated protoplasts that have lost or gained whole chromosomes.
  • the frequency of recombination can be increased by treatment with ultraviolet irradiation or by use of strains overexpressing recA or other recombination genes, or the yeast rad genes, and cognate variants thereof in other species, or by the inhibition of gene products of wtS, wtL, or MutD.
  • Overexpression can be either the result of introduction of exogenous recombination genes or the result of selecting strains, which as a result of natural variation or induced mutation, overexpress endogenous recombination genes.
  • the fused protoplasts are propagated under conditions allowing regeneration of cell walls, recombination and segregation of recombinant genomes into progeny cells from the heterokaryon and expression of recombinant genes. This process can be reiteratively repeated to increase the diversity of any set of protoplasts or cells. After, or occasionally before or during, recovery of fused cells, the cells are screened or selected for evolution toward a desired property.
  • Subsequent rounds of recombination can be performed by preparing protoplasts from cells (or whole organisms, or protoplasts, depending on the format) surviving selection/screening in a previous round.
  • the protoplasts are optionally fused, with recombination occurring in fused protoplasts.
  • Cells, tissues or whole organisms are optionally regenerated from the fused protoplasts. This process can again be reiteratively repeated to increase the diversity of the starting population.
  • Protoplasts, or regenerated or regenerating cells are subject to further selection or screening.
  • Suitable plants for protoplasting include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciohorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum,
  • Monocots may also be transformed by techniques or with vectors other than Agrobacterium. For example, monocots have been transformed by electroporation (Fromm et al. [1986] Nature 319:791-793: Rhodes et al. Science [1988] 240: 204- 207), direct gene transfer (Baker et al. [1985] Plant Genetics 201-211), by using pollen-mediated vectors (EP 0270 356), and by injection of DNA into floral tillers (de la Pena et al. [1987], Nature 325:274-276).
  • Additional plant genera that may be transformed by Agrobacterium include Chrysanthemum. Dianthus, Gerbera, Euphorbia, Pelaronium, Ipomoea, Passiflora, Cyclamen, Malus, Prunus, Rosa, Rubus, Populus, Santalum, Allium, Lilium, Narcissus, Ananas, Arachis, Phaseolus and
  • Monocots include plants in the grass family (Gramineae), such as plants in the sub families Fetucoideae and Poacoideae, which together include several hundred genera including plants in the genera Agrostis,
  • Additional preferred targets include other commercially important crops, e.g., from the families Compositae (the largest family of vascular plants, including at least 1 ,000 genera, including important commercial crops such as sunflower), and Leguminosae or "pea family,” which includes several hundred genera, including many commercially valuable crops such as pea, beans, lentil, peanut, yam bean, cowpeas, velvet beans, soybean, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, and sweetpea.
  • Common crops applicable to the methods of the invention include Zea mays (corn), rice, soybean, sorghum, wheat, oats, barley, millet, sunflower, and canola.
  • Shuffling Fungi the largest family of vascular plants, including at least 1 ,000 genera, including important commercial crops such as sunflower
  • pea family which includes several hundred genera, including many commercially valuable crops such as pea, beans, lentil, peanut, yam bean, cowpeas,
  • fungal cells can also be protoplasted and shuffled in the mariner described herein for plants. Spores from a frozen stock, a lyophilized stock, or fresh from an agar plate are used to inoculate suitable liquid medium. Spores are germinated resulting in hyphal growth. Mycelia are harvested, and washed by filtration and/or centrifugation. Optionally the sample is pretreated with DTT to enhance protoplast formation. Protoplasting is performed in an osmotically stabling medium (e.g., 1 m NaCl/20mM MgSO4, pH 5.8) by the addition of cell wall- degrading enzyme (e.g., Novozyme 234).
  • an osmotically stabling medium e.g., 1 m NaCl/20mM MgSO4, pH 5.8
  • cell wall- degrading enzyme e.g., Novozyme 234
  • Protoplasts can be separated from mycelia, debris and spores by filtration through miracloth, and density centrifugation. Protoplasts are harvested by centrifugation and resuspended to the appropriate concentration. This step may lead to some protoplast fusion. Fusion can be stimulated by addition of PEG (e.g., PEG 3350), and/or repeated centrifugation and resuspension with or without PEG. Electrofusion can also be performed. Fused protoplasts can optionally be enriched from unfused protoplasts by sucrose gradient sedimentation (or other methods of screening described above).
  • PEG e.g., PEG 3350
  • Electrofusion can also be performed. Fused protoplasts can optionally be enriched from unfused protoplasts by sucrose gradient sedimentation (or other methods of screening described above).
  • Fused protoplasts can optionally be treated with ultraviolet irradiation to stimulate recombination.
  • Protoplasts are cultured on osmotically stabilized agar plates to regenerate cell walls and form mycelia. The mycelia are used to generate spores, which are used as the starting material in the next round of shuffling.
  • Selection for a desired property can be performed either on regenerated mycelia or spores derived therefrom.
  • protoplasts are formed by inhibition of one or more enzymes required for cell wall synthesis.
  • the inhibitor should be fungistatic rather than fungicidal under the conditions of use.
  • inhibitors include antifungal compounds described by (e.g., Georgopapadakou & Walsh, Antimicrob. Ag. Chemother. 40, 279-291 (1996); Lyman & Walsh, Drugs 44, 9-35 (1992)).
  • Other examples include chitin synthase inhibitors (polyoxin or nikkomycin compounds) and/or glucan synthase inhibitors (e.g. echinocandins, papulocandins, pneumocandins).
  • Inhibitors should be applied in osmotically stabilized medium.
  • Fungi which can be shuffled include filamentous fungi, which are particularly suited to performing the shuffling methods described above. Filamentous fungi are divided into four main classifications based on their structures for sexual reproduction: Phycomycetes, Ascomycetes, Basidiomycetes and the Fungi Imperfecti. Phycomycetes (e.g., Rhizopus, Mucor) form sexual spores in sporangium. The spores can be uni or multinucleate and often lack septated hyphae (coenocytic). Ascomycetes.
  • Phycomycetes e.g., Rhizopus, Mucor
  • Basidiomycetes include mushrooms, and smuts and form sexual spores on the surface of a basidium. In holobasidiomycetes, such as mushrooms, the basidium is undivided. In hemibasidiomycetes, such as ruts (Uredinales) and smut fungi (Ustilaginales), the basidium is divided. Fungi imperfecti, which include most human pathogens, have no known sexual stage.
  • Transgenote refers to the immediate product of the transformation process and to resultant whole transgenic plants.
  • regeneration means growing a whole plant from a plant cell, a group of plant cells, a plant part or a plant piece (e.g. from a protoplast, callus, or tissue part).
  • embryo formation can then be induced from the protoplast suspension, to the stage of ripening and germination as natural embryos.
  • the culture media will generally contain various amino acids and hormones, such as auxin and cytokinins. It is sometimes advantageous to add glutamic acid and proline to the medium, especially for such species as corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration will depend on the medium, on the genotype, and on the history of the culture. If these three variables are controlled, then regeneration is fully reproducible and repeatable.
  • Regeneration also occurs from plant callus, explants, organs or parts. Transformation can be performed in the context of organ or plant part regeneration. See, Methods in Enzvmology, supra; also Methods in Enzvmology, Vol. 118; and Klee et al., (1987) Annual Review of Plant Physiology. 38:467-486.
  • vegetatively propagated crops the mature transgenic plants are propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants for trialling, such as testing for production characteristics. Selection of desirable fransgenotes is made and new varieties are obtained thereby, and propagated vegetatively for commercial sale.
  • seed propagated crops the mature transgenic plants are self crossed to produce a homozygous inbred plant. The inbred plant produces seed containing the gene for the newly introduced foreign gene activity level. These seeds can be grown to produce plants that would produce the selected phenotype.
  • the inbreds according to this invention can be used to develop new hybrids.
  • a selected inbred line is crossed with another inbred line to produce the hybrid.
  • the offspring resulting from the first experimental crossing of two parents is known in the art as the FI hybrid, or first filial generation.
  • the two parents crossed to produce FI progeny according to the present invention one or both parents can be transgenic plants.
  • Parts obtained from the regenerated plant such as flowers, seeds, leaves, branches, fruit, and the like are covered by the invention, provided that these parts comprise cells which have been so transformed. Progeny and variants, and mutants of the regenerated plants are also included within the scope of this invention, provided that these parts comprise the introduced DNA sequences. Progeny and variants, and mutants of the regenerated plants are also included within the scope of this invention.
  • Microspores are haploid (In) male spores that develop into pollen grains. Anthers contain a large numbers of microspores in early-uninucleate to first-mitosis stages. Microspores have been successfully induced to develop into plants for most species, such as, e.g., rice (Chen, CC 1977 In Vitro. 13: 484-489), tobacco (Atanassov, I. et al. 1998 Plant Mol Biol. 38:1169-1178), Tradescantia
  • microspores The plants derived from microspores are predominantly haploid or diploid (infrequently polyploid and aneuploid).
  • the diploid plants are homozygous and fertile and can be generated in a relatively short time.
  • Microspores obtained from FI hybrid plants represent great diversity, thus being an excellent model for targeting and studying recombination.
  • microspores can be transformed with
  • T-DNA introduced by agrobacterium or other available means and then regenerated into individual plants.
  • protoplasts can be made from microspores and they can be fused similar to what occur in fungi and bacteria.
  • Microspores generated from microspores are pooled and fused.
  • Microspores obtained from plants generated by protoplast fusion are optionally pooled and fused again, increasing the genetic diversity of the resulting microspores .
  • Microspores can also be subjected to mutagenesis in various ways, such as by chemical mutagenesis, radiation-induced mutagenesis and, e.g., t-DNA transformation, prior to fusion or regeneration. New mutations which are generated can be recombined through the recursive processes described above and herein.
  • Vectors
  • the vector need be no more than the mimmal nucleic acid sequences necessary to confer the desired traits, without the need for additional other sequences.
  • the possible vectors include the Ti plasmid vectors, shuttle vectors designed merely to maximally yield high numbers of copies, episomal vectors containing minimal sequences necessary for ultimate replication once transformation has occurred, transposon vectors, homologous recombination vectors, mini-chromosome vectors, and viral vectors, including the possibility of RNA forms of the gene sequences.
  • the selection of vectors and methods to construct them are commonly known to persons of ordinary skill in the art and are described in general technical references (Methods in Enzymology, supra).
  • any additional attached vector sequences which will confer resistance to degradation of the nucleic acid fragment to be introduced, which assists in the process of genomic integration or provides a means to easily select for those cells or plants which are actually, in fact, transformed are advantageous and greatly decrease the difficulty of selecting useable transgenotes.
  • shuffled genetic sequences can be recovered for further shuffling or for direct use by any applicable method, including but not limited to: recovery of DNA, RNA, or cDNA from cells (or PCR-amplified copies thereof) from cells or medium, recovery of sequences from host chromosomal DNA or PCR- amplified copies thereof, recovery of episome (e.g., expression vector) such as a plasmid, cosmid, viral vector, artificial chromosome, and the like, or other suitable recovery method known in the art.
  • Libraries of nucleic acids are also thus obtained from populations of organisms, e.g., cells or protoplasts comprising shuffled nucleic acids. These secondary libraries can be used to transform additional protoplasts, plants, or the like.
  • Any suitable art-known method including RT-PCR or PCR, can be used to obtain the selected shufflant sequence(s) for subsequent manipulation and shuffling.
  • RT-PCR RT-PCR
  • PCR PCR
  • EPSP synthase (EPSPS) genes are isolated from commercially available cDNA libraries of Arabidopsis, tomato, tobacco, maize and other plants. The gene is alternatively isolated from cDNA prepared from poly (A+) mRNA from floral organs of different parts (Gasser et al. J. Biol Chem. 263: 4280-4289, 1988, incorporated herein by reference). Primers for isolation of cDNA specific for EPSPS are designed based on consensus sequences derived from public information (J. Biol. Chem, above and Padgette et al.
  • EPSPS genes isolated from cDNAs of different plants contain the transit sequences for targeting of the genes to the chloroplasts.
  • the EPSPS genes from various plants which have nucleotide homology in the range 75-93%, are shuffled according to published procedures for polynucleotide shuffling. Briefly, this procedure involves random fragmentation of the genes with DNAse I and selecting nucleotide fragments of 100-300 bp. The fragments are reassembled based on sequence similarity by primerless PCR. Recombination as well as variable levels of mutations that are introduced by the PCR reaction generate the diversity. The assembled gene is cloned into a plasmid such as the Ti-based vector pBinl 9 used in Agrobacterium tumefaciens-mediated transformation.
  • a plasmid such as the Ti-based vector pBinl 9 used in Agrobacterium tumefaciens-mediated transformation.
  • FIG. 1 The schematic representation of the plasmid is shown in Figure 1 (see, Dyer WE in Herbicide Resistant Crops Duke S (ed.) pp 37-51).
  • Shuffled EPSPS genes are cloned into multiple cloning sites shown in the plasmid and directly electroporated into tobacco protoplasts. Preparation of protoplasts from tobacco leaves and subsequence transformation and culturing conditions are described in the literature.
  • Transformed tobacco protoplasts, carrying EPSPS resistant to glyphosate are selected directly on a growth medium containing glyphosate.
  • the level of glyphosate used is determined by plating untransformed tobacco protoplasts in a range of herbicide concentrations. At least lOx the lethal concentration (between
  • EPSPS genes are isolated from this callus (or calli if multiple individuals are selected) and used for a subsequent rounds of sequence-shuffling and phenotype selection for glyphosate resistance. Eventually, the optimized gene is assayed for magnitude of resistance and quantification of other properties.
  • the resultant genetic sequence encoding glyphosate resistance is cloned into a plant cell protoplast capable of regeneration as a transgene or other stable, replication sequence that segregates with germplasm, an adult plant is regenerated, and the resultant regenerated plant species is bred to establish a germplasm which can be used to produce glyphosate-resistance plants which can be sold commercially as seed or as vegetative plants.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Cell Biology (AREA)
  • Ecology (AREA)
  • Botany (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
EP99943983A 1998-08-31 1999-08-30 Transformation, selektion und screening von sequenzdurchmischten polynukleotiden zur entwicklung und optimierung von pflanzenphänotypen. Withdrawn EP1109889A1 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US9852898P 1998-08-31 1998-08-31
US98528P 1998-08-31
PCT/US1999/019732 WO2000012680A1 (en) 1998-08-31 1999-08-30 Transformation, selection, and screening of sequence-shuffled polynucleotides for development and optimization of plant phenotypes

Publications (1)

Publication Number Publication Date
EP1109889A1 true EP1109889A1 (de) 2001-06-27

Family

ID=22269690

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99943983A Withdrawn EP1109889A1 (de) 1998-08-31 1999-08-30 Transformation, selektion und screening von sequenzdurchmischten polynukleotiden zur entwicklung und optimierung von pflanzenphänotypen.

Country Status (3)

Country Link
EP (1) EP1109889A1 (de)
AU (1) AU5696899A (de)
WO (1) WO2000012680A1 (de)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999021979A1 (en) 1997-10-28 1999-05-06 Maxygen, Inc. Human papillomavirus vectors
IL140125A0 (en) 1998-06-17 2002-02-10 Maxygen Inc Method for producing polynucleotides with desired properties
IL154596A0 (en) 2000-08-25 2003-09-17 Basf Plant Science Gmbh Plant polynucleotides encoding novel prenyl proteases
US8008459B2 (en) 2001-01-25 2011-08-30 Evolva Sa Concatemers of differentially expressed multiple genes
DK1356037T3 (da) 2001-01-25 2011-06-27 Evolva Ltd Bibliotek med en samling af celler
US7336174B1 (en) 2001-08-09 2008-02-26 Key Control Holding, Inc. Object tracking system with automated system control and user identification
AU2003235582A1 (en) * 2002-01-16 2003-07-30 Cropdesign N.V. Herbicide mode of action determination
DE10212158A1 (de) * 2002-03-19 2003-10-02 Metanomics Gmbh & Co Kgaa Population transgener Pflanzen, davon abgeleitetes biologisches Material, entsprechende Plasmidkollektion und Population transformierter Wirtsorganismen, sowie deren Verwendung und Verfahren zu deren Erzeugung
US7572618B2 (en) 2006-06-30 2009-08-11 Bristol-Myers Squibb Company Polynucleotides encoding novel PCSK9 variants
CN116411021B (zh) * 2023-06-07 2023-08-15 隆平生物技术(海南)有限公司 一种番茄草甘膦筛选体系的转化方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0012680A1 *

Also Published As

Publication number Publication date
AU5696899A (en) 2000-03-21
WO2000012680A1 (en) 2000-03-09

Similar Documents

Publication Publication Date Title
US6483011B1 (en) Modified ADP-glucose pyrophosphorylase for improvement and optimization of plant phenotypes
CN108795972B (zh) 不使用转基因标记序列分离细胞的方法
US6703240B1 (en) Modified starch metabolism enzymes and encoding genes for improvement and optimization of plant phenotypes
US6531316B1 (en) Encryption of traits using split gene sequences and engineered genetic elements
US20060117409A1 (en) Modified ribulose 1,5-bisphosphate carboxylase/oxygenase for improvement and optimization of plant phenotypes
AU3391900A (en) Encryption of traits using split gene sequences
US11519000B2 (en) Methodologies and compositions for creating targeted recombination and breaking linkage between traits
EP3296403A1 (de) Transgenes sojabohnenereignis mon87751 und verfahren zur detektion und verwendung davon
US20080287314A1 (en) Methods for modulating cellular and organismal phenotypes
WO2000061740A1 (en) Modified lipid production
JP2002522089A (ja) 除草剤選択性作物を生成するためのdnaシャッフリング
US20060272044A1 (en) Methods for Improving a Photosynthetic Carbon Fixation Enzyme
CN111819285A (zh) 防碎基因和突变
WO2000012680A1 (en) Transformation, selection, and screening of sequence-shuffled polynucleotides for development and optimization of plant phenotypes
CN117051035A (zh) 不使用转基因标记序列分离细胞的方法
WO2023111961A1 (en) Spatio-temporal promoters for polynucleotide expression in plants
EP1129185A1 (de) Modifizierte phosphoenolpyruvat carboxylase zur verbesserung und optimierung des phenotyps von pflanzen
AU6082399A (en) Uracil permease from arabidopsis as herbicidal target gene
WO2000061731A2 (en) Modified starch metabolism enzymes and encoding genes for improvement and optimization of plant phenotypes
WO2024127362A1 (en) Spatio-temporal promoters for polynucleotide expression in plants
CN116445521A (zh) 一种抗除草剂的基因及其应用

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20010402

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20031210