EP1045899A2 - Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host - Google Patents

Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host

Info

Publication number
EP1045899A2
EP1045899A2 EP99903208A EP99903208A EP1045899A2 EP 1045899 A2 EP1045899 A2 EP 1045899A2 EP 99903208 A EP99903208 A EP 99903208A EP 99903208 A EP99903208 A EP 99903208A EP 1045899 A2 EP1045899 A2 EP 1045899A2
Authority
EP
European Patent Office
Prior art keywords
nucleic acid
plant
viral
ofthe
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP99903208A
Other languages
German (de)
French (fr)
Inventor
Guy Della-Cioppa
Robert L. Erwin
Wayne P. Fitzmaurice
Kathleen M. Hanley
Monto H. Kumagai
John A. Lindbo
David R. Mcgee
Hal S. Padgett
Gregory P. Pogue
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Biosource Technologies Inc
Original Assignee
Biosource Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Biosource Technologies Inc filed Critical Biosource Technologies Inc
Publication of EP1045899A2 publication Critical patent/EP1045899A2/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1079Screening libraries by altering the phenotype or phenotypic trait of the host
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids

Definitions

  • the present invention relates generally to the field of molecular biology and plant genetics. Specifically, the present invention relates to a method for determining the function of nucleotide sequences and genes by transfecting the same into a host.
  • Arabidopsis thaliana may be an attractive target for gene function discovery because a very large set of ESTs have already been produced in this organism, and these sequences tag more than 50% of the expected Arabidopsis genes.
  • sequence information so generated is enormous if gene function can be determined. It may become possible to engineer commercial seeds for agricultural use to convey any number of desirable traits to food and fiber crops and thereby increase agricultural production and the world food supply. Research and development of commercial seeds has so far focused primarily on traditional plant breeding, however there has been increased interest in biotechnology as it relates to plant characteristics. Knowledge ofthe genomes involved and the function of genes contained therein for both monocotyledonous and dicotyledonous plants is essential to realizing positive effects from such technology.
  • gene profiling in cotton can lead to an understanding ofthe types of genes being expressed primarily in fiber cells.
  • the genes or promoters derived from these genes may be important in genetic engineering of cotton fiber for increased strength or for "built-in" fiber color.
  • gene profiling coupled to physiological trait analysis can lead to the identification of predictive markers that will be increasingly important in marker assisted breeding programs. Mining the DNA sequence of a particular crop for genes important for yield, quality, health, appearance, color, taste, etc., are applications of obvious importance for crop improvement.
  • the recombinant plant viral nucleic acids and recombinant viruses such as those demonstrated by Donson et al. which have been demonstrated to infect plant cells and express the foreign genetic material systemically are generally characterized as comprising a native plant viral subgenomic promoter, at least one non-native plant viral subgenomic promoter, a plant viral coat protein coding sequence, and at least one non-native nucleic acid sequence.
  • the value of using such plant viral nucleic acids to effect systemic expression of non-native nucleic acids in a plant host is significant. This tool, if coupled with a rational design for elucidating the function ofthe non-native nucleic acids, would make significant strides in understanding the large amount of sequence information produced by sequencing efforts.
  • the present invention is directed to a method of determining the function of nucleic acid sequences including genes and the proteins they encode in host organisms such as bacteria, yeast, plants, or animals, by transfecting the nucleic acid sequences into the organisms in a manner so as to effect localized or systemic expression ofthe nucleic acid sequences.
  • the present inventors have determined methods for determining the function of nucleic acid sequences and the proteins they encode by tr.ansfecting organisms with nucleic acids of interest thereby providing a more rapid means for elucidating the function of these nucleic acids including genes and subsequently utilizing the rapidly expanding information in the field of genomics.
  • a nucleic acid is introduced into a plant host wherein the plant host may be a monocotyledonous or dicotyledonous plant, plant tissue or plant cell.
  • the nucleic acid may be introduced by way of a plant viral nucleic acid.
  • plant viral nucleic acids are stable for the maintenance and transcription or expression of non-native nucleic acid sequences and are capable of locally or systemically transcribing or expressing such sequences in the plant host.
  • Especially preferred recombinant plant viral nucleic acids useful in the methods ofthe present invention comprise a native plant viral subgenomic promoter, a plant viral coat protein coding sequence, and at least one non-native nucleic acid sequence.
  • Some viral vectors used in accordance with the present invention may be encapsidated by the coat proteins encoded by the recombinant plant virus.
  • the recombinant plant viral nucleic acid or recombinant plant virus is used to infect appropriate hosts such as plants.
  • the recombinant plant viral nucleic acid is capable of replication in the host, localized or systemic spread in the host, and transcription or expression ofthe non-native nucleic acid in the host to produce the desired product.
  • Such products may be for example, useful polypeptides or proteins including enzymes, complex biomolecules, ribozymes, or polypeptides or protein products resulting from positive-sense or anti-sense RNA expression.
  • the nucleic acid of interest may be expressed with the genomic DNA or RNA ofthe viral vectors and hence be under the control of a genomic promoter.
  • Some other viral vectors used in accordance with the present invention comprise recombinant animal viruses or portions thereof.
  • animal viral vectors are useful to infect appropriate hosts such as animals.
  • the recombinant animal viral nucleic acid is capable if replication in the host, systemic or localized spread in the host, and transcription or expression ofthe non-native nucleic acid in the host to produce the desired product.
  • the present method uses a viral expression vector encoding for at least one protein non-native to the vector that is released from at least one polyprotein expressed by said vector by proteolytic processing.
  • recombinant plant viruses are used which encode for the expression of a fusion between a plant viral coat protein and the amino acid product ofthe nucleic acid of interest.
  • a nucleic acid sequence of interest including a gene may be placed within any suitable vector construct such as a virus for infecting the host organism. That is, the present method may be practiced without concern for the position ofthe nucleic acid sequence of interest within the vector used to infect the host organism.
  • the invention is not intended to be limited to any particular viral constructs but specifically contemplates using all operable constructs. Those skilled in the art will understand that these embodiments are representative only of many constructs which may be useful to produce localized or systemic expression of nucleic acids in host organisms such as plants. All such constructs .are contemplated and intended to be within the scope ofthe present invention.
  • the function of a nucleic acid may be determined by complementation analysis. That is, the function ofthe nucleic acid of interest may be determined by observing the endogenous gene or genes whose function is replaced or augmented by introducing the nucleic acid of interest. A discussion of such phenomenon is provided by Napoli et al, The Plant Cell 2:279-289 (1990).
  • the function of a nucleic acid may be determined by analyzing the biochemical alterations in the accumulation of substrates or products from enzymatic reactions according to any one ofthe means known by those skilled in the art.
  • the function of a nucleic acid may be determined by observing phenotypic changes in the host by methods including morphological, macroscopic or microscopic analysis.
  • the function of a nucleic acid may be determined by observing any changes in biochemical pathways which may be modified in the host organism as a result of expression ofthe nucleic acid.
  • the function of a nucleic acid may be determined utilizing techniques known by those skilled in the art to observe inhibition of endogenous gene expression in the cytoplasm of cells as a result of expression ofthe nucleic acid.
  • the function of a nucleic acid may be determined utilizing techniques known by those skilled in the art to observe changes in the RNA or protein profile as a result of expression ofthe nucleic acid.
  • the function of a nucleic acid may be determined by selection of organisms such as plants or human cells and tissues capable of growing or maintaining viability in the presence of noxious or toxic substances, such as, for example herbicides and pharmaceutical ingredients.
  • a second aspect ofthe present invention is a method of silencing endogenous genes in a host by introducing nucleic acids into the host by way of a viral nucleic acid such as a plant or animal viral nucleic acid suitable to produce expression of a nucleic acid in a transfected host.
  • the host is a plant, but those skilled in the art will understand that other hosts such as bacteria, yeast and animals including humans may also be utilized.
  • This method utilizes the principle of post-transcription gene silencing ofthe endogenous host gene homo log. Since the replication mechanism ofthe transfected non-native nucleic acid produces both sense and antisense RNA sequences, the orientation of the non-native nucleic acid insert is not crucial to providing gene silencing. Particularly, this aspect ofthe invention is especially useful for silencing a multigene family as is frequently found in plants. The prior art has not demonstrated an effective means for silencing a multigene family in plants.
  • a third aspect ofthe present invention is a method for selecting desired functions of RNAs .and proteins by the use of virus vectors to express libraries of nucleic acid sequence variants.
  • Libraries of sequence variants may be generated by means of in vitro mutagenenisis and/or recombination. Rapid in vitro evolution can be used to improve virus-specific or protein-specific functions.
  • plant RNA virus expression vectors may be used as tools to bear libraries containing variants of nucleic acid, genes from virus, plant or other sources, and to be applied to plants or plant cells such that the desired altered effects in the RNA or protein products can be determined, selected and improved.
  • nucleic acid shuffling techniques may be employed to construct shuffled gene libraries.
  • Random, semi-random or known sequences of virus origin may also be inserted in virus expression vectors between native virus sequences and foreign gene sequences, to increase the genetic stability of foreign genes in expression vectors as well as the translation ofthe foreign gene and the stability ofthe mRNA encoding the foreign gene in vivo.
  • the desired function of RNA and protein may include the promoter activities, replication properties, translational efficiencies, movement properties (local and systemic), signaling pathway, or virus host range, among others.
  • the desired function alteration can be identified by assaying infected plants and the nature of mutation can be determined by analysis of sequence variants in the virus vector.
  • Methods to increase the representation of gene sequences in virus expression libraries may also be achieved by bypassing the genetic bottleneck of propagation in E. coli.
  • cell-free methods may be used to clone sequence libraries or individual arrayed sequences into virus expression vectors and reconstruct an infectious virus, such that the final ligation product can be transcribed and the resulting RNA can be used for plant or plant cell inoculation/infection with the output being gene function discovery or protein production.
  • RNA viruses or RNA virus vectors as populations or individuals in parallel to identify individuals with novel and augmented virus-encoded functions in replication and virus movement, foreign gene sequence retention in vectors and proper folding, activity and expression of protein products, novel gene expression, effects on host metabolism, and resistance or susceptibility of plants to exogenous agents.
  • Variation in the sequence of a native virus gene(s) or heterologous nucleotide sequence(s) may be introduced into an RNA virus or an RNA virus expression vector by many methods as a means to screen a population of variants in batch or individuals in parallel for novel properties exhibited by the virus itself or conferred on the host plant or cell by the virus vector.
  • Variant populations can be transfected as populations or individual clones into "host”: 1) protoplasts; 2) whole plants; or 3) inoculated leaves of whole plants and screened for various traits including protein expression (increase or decrease), RNA expression (increase or decrease), secondary metabolites or other host property gained or loss as a result ofthe virus infection.
  • a virus vector which simultaneously expressed the green fluorescent protein (GFP) or other selectable marker gene and the variant sequence, is used to screen quantitatively for levels of resistance or sensitivity to the agent in question conferred upon the host by the variant sequence expressed from the viral vector.
  • GFP green fluorescent protein
  • transcription factors regulating the expression of defense proteins such as systemin peptides, or protease inhibitors could be identified by transfecting hosts with virus libraries and the expression of systemin or protease inhibitors or their RNAs be directly assayed.
  • the promoters responsible for expressing these genes could be genetically fused to the green fluorescent protein .and introduced into hosts as transient expression constructs or into stable transformed host cells/tissues. The resulting cells would be transfected with viral vector libraries.
  • Hosts now could be screened rapidly by following relative GFP expression following vector fransfection.
  • coupling the transfecting of hosts with virus libraries with the treatment of plants with methyl jasmonate could identify sequences that reverse or enhance the gene induction events induced by this metabolite.
  • This approach could be applied to other factors involved in promotion of higher biomass in plants such as Leafy or DET2.
  • the expression of these factors could be directly assayed or via promoters genetically fused to GFP. This technique will enable screening of populations or individuals followed by rescue ofthe sequence from virus vectors conferring desired trait by RT-PCR and re-screening of particular variant sequences in secondary screens.
  • a fourth aspect ofthe present invention is a method for inhibiting an endogenous protease of a plant host comprising the step of treating the plant host with a compound which induces the production of an endogenous inhibitor of said protease.
  • jasmonic acid may be used to treat the plant host to induce the production of an endogenous inhibitor of an endogenous protease.
  • the treatment ofthe plant host with a compound results an increased representation of an exogenous nucleic acid or the protein product thereof.
  • transgenic hosts expressing protease inhibitors may be used to decrease the degradation of proteins expressed by virus expression vectors.
  • jasmonic acid may be used to treat plants infected with virus expression vectors to decrease degradation of proteins expressed by virus expression vectors.
  • a fifth aspect ofthe present invention are genes and fragments thereof, nucleotide sequences, and gene products obtained by way ofthe method ofthe present invention.
  • the present invention features expressing selected nucleotide sequences in a host organism.
  • gene products of such nucleotide sequences may be isolated using techniques known to those skilled in the art. Such gene products may exhibit biological activity as pharmaceuticals, herbicides, and other similar functions.
  • FIG. 1 depicts the vector TTOl/PSY +.
  • FIG. 2 represents the vector TTO1A/PDS+.
  • FIG. 3 represents the vector TT01A/Ca CCS+.
  • FIG. 4 represents the vector TTU51 CTP CrtB.
  • FIG. 5 represents the vector TTOSA1CTP CrtI 491.
  • FIG. 6 represents the Erwinia herbicola phytoene desaturase gene (plasmid pAU211).
  • FIG. 7 represents the plasmid KS+/Crt/* 491.
  • FIG. 8 represents the plasmid pBS736.
  • FIG. 9 represents the plasmid pBS 712.
  • FIG. 10 represents the 72 kDa gene product ofthe genomic clone encoding alcohol oxidase ZZA1.
  • FIG. 11 represents the plasmid TTOS 1 APE ZZA1.
  • FIG. 12 represents the plasmid TTO1 A 103L.
  • FIG. 13 represents the plasmid TTU51A QSEO #3.
  • FIG. 14 represents the plasmid KS+ TVCVK #23.
  • FIG. 15 represents the plasmid pBS735.
  • FIG. 16 represents the plasmid pBS740.
  • FIG. 17 represents the plasmid pBS723.
  • FIG. 18 represents the plasmid pBS731.
  • FIG. 19 represents the plasmid pBS740 AT #120.
  • FIG. 20 represents the nucleotide sequence alignment of 740 AT #120 to human ADP-ribosylation factor (ARF3) M33384.
  • FIG. 21 represents the plasmid pBS740 AT #88.
  • FIG. 22 represents the nucleotide sequence alignment of 740 AT #88 to L33574 mRNA for rhodopsin.
  • FIG. 23 represents the nucleotide sequence alignment of 740 AT #88 to X07797 Octopus mRNA for rhodopsin.
  • FIG. 24 represents the protein sequence alignment of 740 AT #88 to an Arabidopsis est ORF ATTS2938.
  • FIG. 25 represents the protein sequence alignment of 740 AT #88 to Octopus rhodopsin P31356.
  • FIG. 26 represents amino acid sequence comparison of 740 AT #2441 to tobacco RAN-B1 GTP binding protein.
  • FIG. 27 represents nucleotide sequence comparison of 740 AT #2441 to human RAN GTP-binding protein.
  • FIG. 28 represents a schematic diagram of cell free cloning.
  • the present invention is directed to a method of determining the function of a nucleic acid sequence including a gene and a protein encoded thereby in an organism such as bacteria, fungi, yeast, animals and plants by transfecting the nucleic acid sequence into the organism.
  • the present inventors have determined methods for determining the function of nucleic acid sequences by transfecting organisms with the nucleic acids thereby providing a more rapid means for determining gene function and utilizing the rapidly expanding sequence information in the field of genomics.
  • a nucleic acid is introduced into a plant host.
  • the nucleic acid may be introduced by way of a viral nucleic acid.
  • Such recombinant viral nucleic acids are stable for the maintenance and transcription or expression of non-native nucleic acid sequences and are capable of systemically transcribing or expressing such non-native sequences in the plant host.
  • Especially preferred recombinant plant viral nucleic acids useful in the present invention comprise a native plant viral subgenomic promoter, a plant viral coat protein coding sequence, and at least one non-native nucleic acid sequence.
  • plant viral nucleic acid sequences used in the method ofthe present invention are characterized by the deletion ofthe native coat protein coding sequence and comprise a non-native plant viral coat protein coding sequence and a non-native promoter, preferably the subgenomic promoter ofthe non-native coat protein coding sequence, capable of expression in the plant host, packaging ofthe recombinant plant viral nucleic acid, and ensuring a systemic infection ofthe host by the recombinant plant viral nucleic acid.
  • the recombinant plant viral nucleic acid may contain one or more additional native or non-native subgenomic promoters.
  • Each non-native subgenomic promoter is capable of transcribing or expressing adjacent genes or nucleic acid sequences in the plant host and incapable of recombination with each other and with native subgenomic promoters.
  • One or more non-native nucleic acids may be inserted adjacent to the native plant viral subgenomic promoter or the native and non- native plant viral subgenomic promoters if more than one nucleic acid sequence is included.
  • two or more heterologous non-native subgenomic promoters may be used.
  • the non-native nucleic acid sequences may be transcribed or expressed in the host plant under the control ofthe subgenomic promoter to produce the products ofthe nucleic acids of interest.
  • plant viral nucleic acids are used in the present invention wherein the native coat protein coding sequence is placed adjacent one ofthe non-native coat protein subgenomic promoters instead of a non-native coat protein coding sequence.
  • plant viral nucleic acids are used in the present invention wherein the native coat protein gene is adjacent its subgenomic promoter and one or more non-native subgenomic promoters have been inserted into the viral nucleic acid.
  • the inserted non-native subgenomic promoters are capable of transcribing or expressing adjacent genes in a plant host and are incapable of recombination with each other and with native subgenomic promoters.
  • Non-native nucleic acid sequences may be inserted adjacent the non- native subgenomic plant viral promoters such that the sequences are transcribed or expressed in the host plant under control ofthe subgenomic promoters to produce the product ofthe non-native nucleic acid.
  • the native coat protein coding sequence may be replaced by a non-native coat protein coding sequence.
  • the viral vectors used in accordance with the present invention may be encapsidated by the coat proteins encoded by the recombinant plant virus.
  • the recombinant plant viral nucleic acid or recombinant plant virus is used to infect appropriate hosts such as plants.
  • the recombinant plant viral nucleic acid is capable of replication in the host, localized or systemic spread in the host, and transcription or expression ofthe non-native nucleic acid in the host to produce the desired product.
  • Such products may be for example, therapeutics and other useful polypeptides or proteins including enzymes, complex biomolecules, ribozymes, or polypeptides or protein products resulting from positive-sense or anti-sense RNA expression.
  • the nucleic acid of interest may be under the control of a genomic promoter and therefore be expressed with the genome of the virus.
  • the present method uses a viral expression vector encoding at least one protein non-native to the vector that is released from at least one polyprotein expressed by said vector by proteolytic processing catalyzed by at least one protease in said polyprotein wherein said vector comprises at least one promoter, DNA having a sequence which codes for at least one polyprotein from a polyprotein-producing virus, at least one restriction site flanking a 3' terminus of said DNA and a cloning vehicle.
  • Additional embodiments use a viral expression vector encoding for at least one protein non-native to the vector that is released from at least one polyprotein expressed by the vector by proteolytic processing catalyzed by at least one protease in the polyprotein wherein the vector comprises at least one promoter, DNA having a sequence which codes for at least one polyprotein from a polyprotein-producing virus, may contain at least one restriction site flanking a 3' terminus of said cDNA and a cloning vehicle.
  • Preferred embodiments include using a potyvirus as the polyprotein-producing virus, and especially preferred embodiments may use TEV (tobacco etch virus).
  • TEV tobacco etch virus
  • recombinant plant viruses which encode for the expression of a fusion between a plant viral coat protein and the amino acid product ofthe nucleic acid of interest.
  • a recombinant plant virus provides for high level expression of a nucleic acid of interest.
  • the location or locations where the viral coat protein is joined to the amino acid product ofthe nucleic acid of interest may be referred to as the fusion joint.
  • a given product of such a construct may have one or more fusion joints.
  • the fusion joint may be located at the carboxyl terminus ofthe viral coat protein or the fusion joint may be located at the amino terminus ofthe coat protein portion ofthe construct.
  • the nucleic acid of interest may be located 5', 3', upstream, downstream or within the coat protein.
  • a "leaky” start or stop codon may occur at a fusion joint which sometimes does not result in translational termination.
  • a more detailed description of some recombinant plant viruses according to this embodiment of the invention may be found in co-pending U.S. Patent Application Serial No. 08/324,003 the disclosure of which is incorporated herein by reference.
  • a nucleic acid sequence of interest or a gene may be placed within any suitable vector construct such as a virus for infecting the host organism.
  • the present method may be practiced without concern for the position ofthe nucleic acid sequence of interest within the vector used to infect the host organism.
  • the invention is not intended to be limited to any particular viral constructs but specifically contemplates using all operable constructs. Specifically, those skilled in the art may choose to transfer DNA or RNA of any size up to and including an entire genome into a host organism in order to determine the function thereof.
  • Adjacent A position in a nucleotide sequence proximate to and 5' or 3' to a defined sequence. Generally, adjacent means within 2 or 3 nucleotides ofthe site of reference.
  • Animal cell A single functional cell found within an animal organism.
  • Animal tissue refers to one or more cells grouped or organized to perform one or more functions.
  • Animal organ refers to one or more tissues morphologically arranged to perform one or more functions within .an organism.
  • Anti-Sense Inhibition A type of gene regulation based on cytoplasmic, nuclear or organelle inhibition of gene expression due to the presence in a cell of an RNA molecule complementary to at least a portion ofthe mRNA being translated. It is specifically contemplated that DNA molecules may be from either an RNA virus or mRNA from the host cells genome or from a DNA virus.
  • Cell Culture A proliferating group of cells which may be in either an undifferentiated or differentiated state, growing contiguously or non- contiguously.
  • Chimeric Sequence or Gene A nucleotide sequence derived from at least two heterologous parts.
  • the sequence may comprise DNA or RNA.
  • Coding Sequence A deoxyribonucleotide or ribonucleotide sequence which, when either transcribed and translated or simply translated, results in the formation of a cellular polypeptide or a ribonucleotide sequence which, when translated, results in the formation of a cellular polypeptide.
  • Compatible The capability of operating with other components of a system.
  • a vector or plant or animal viral nucleic acid which is compatible with a host is one which is capable of replicating in that host.
  • a coat protein which is compatible with a viral nucleotide sequence is one capable of encapsidating that viral sequence.
  • Complementation Analysis As used herein, this term refers to observing the changes produced in an organism when a nucleic acid sequence is introduced into that organism after a selected gene has been deleted or mutated so that it no longer functions fully in its normal role.
  • a complementary gene to the deleted or mutated gene can restore the genetic phenotype ofthe selected gene.
  • Constitutive expression Gene expression which features substantially constant or regularly cyclical gene transcription. Generally, genes which are constitutively expressed are substantially free of induction from an external stimulus.
  • Differentiated cell A cell which has substantially matured to perform one or more biochemical or physiological functions.
  • Dual Heterologous Subgenomic Promoter Expression System a plus stranded RNA vector having a dual heterologous subgenomic promoter expression system to increase, decrease, or change the expression of proteins, peptides or RNAs, preferably those described in U.S. Patent Nos. 5,316,931, 5,811,653, 5,589,367, and 5,866,785, the disclosure of which is incorporated herein by reference.
  • ESTs Relatively short single-pass DNA sequences obtained from one or more ends of cDNA clones and RNA derived therefrom. They may be present in either the 5' or the 3' orientation. ESTs have been shown useful for identifying particular genes.
  • Gene A discrete nucleic acid sequence responsible for producing one or more cellular products and/or performing one or more intercellular or intracellular functions.
  • Gene silencing A reduction in gene expression.
  • a viral vector expressing gene sequences from a host may induce gene silencing of homologous gene sequences.
  • Growth cycle As used herein, the term is meant to include the replication of a nucleus, an organelle, a cell, or an organism.
  • Host A cell, tissue or organism capable of replicating a nucleic acid such as a vector or plant viral nucleic acid and which is capable of being infected by a virus containing the viral vector or viral nucleic acid.
  • This term is intended to include prokaryotic and eukaryotic cells, organs, tissues or organisms, where appropriate. Bacteria, fungi, yeast, animal (cell, tissues, or organisms), and plant (cell, tissues, or organisms) are examples of a host.
  • induction refer generally to a gene and a promoter operably linked thereto which is in some manner dependent upon an external stimulus, such as a molecule, in order to actively transcribe and/or translate the gene.
  • Infection The ability of a virus to transfer its nucleic acid to a host or introduce a viral nucleic acid into a host, wherein the viral nucleic acid is replicated, viral proteins are synthesized, and new viral particles assembled.
  • transmissible and infective are used interchangeably herein.
  • the term is also meant to include the ability of a selected nucleic acid sequence to integrate into a genome, chromosome or gene of a target organism.
  • Multigene family A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes.
  • multigene families include those which encode the histones, hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins.
  • Non-Native Any RNA or DNA sequence that does not normally occur in the cell or organism in which it is placed. Examples include recombinant plant viral nucleic acids and genes or ESTs contained therein. That is, an RNA or DNA sequence may be non-native with respect to a viral nucleic acid. Such an RNA or DNA sequence would not naturally occur in the viral nucleic acid. Also, an RNA or DNA sequence may be non-native with respect to a host organism. That is, such a RNA or DNA sequence would not naturally occur in the host organism. Conversely, the term non-native does not imply that an RNA or DNA sequence must be non-native with respect to both a viral nucleic acid and a host organism concurrently. The present invention specifically contemplates placing an RNA or DNA sequence which is native to a host organism into a viral nucleic acid in which it is non-native. Nucleic acid: As used herein the term is meant to include any DNA or
  • RNA sequence from the size of one or more nucleotides up to and including a complete gene sequence.
  • the term is intended to encompass all nucleic acids whether naturally occurring in a particular cell or organism or non-naturally occurring in a particular cell or organism.
  • Nucleic acid of interest The term is used interchangeably with the term
  • nucleic acid and is intended to refer to the nucleic acid sequence whose function is to be determined. The sequence will normally be non-native to the viral vector but may be native or non-native to the host organism.
  • Organism The term organism and "host organism” as used herein is specifically intended to include animals including humans, plants, viruses, fungi, .and bacteria.
  • Phenotypic Trait An observable, measurable or detectable property resulting from the expression or suppression of a gene or genes.
  • Plant Cell The structural and physiological unit of plants, consisting of a protoplast and the cell wall.
  • Plant Organ A distinct and visibly differentiated part of a plant, such as root, stem, leaf or embryo.
  • Plant Tissue Any tissue of a plant in planta or in culture. This term is intended to include a whole plant, plant cell, plant organ, protoplast, cell culture, or any group of plant cells organized into a structural and functional unit.
  • Positive-sense inhibition A type of gene regulation based on cytoplasmic inhibition of gene expression due to the presence in a cell of an RNA molecule substantially homologous to at least a portion ofthe mRNA being translated.
  • Promoter The 5 '-flanking, non-coding sequence substantially adjacent a coding sequence which is involved in the initiation of transcription ofthe coding sequence.
  • Protoplast An isolated plant or bacterial cell without some or all of its cell wall.
  • Plant viral nucleic acid which has been modified to contain non-native nucleic acid sequences. These non-native nucleic acid sequences may be from any organism or purely synthetic, however, they may also include nucleic acid sequences naturally occurring in the organism into which the recombinant plant viral nucleic acid is to be introduced.
  • Recombinant Plant Virus A plant virus containing the recombinant plant viral nucleic acid.
  • Subgenomic Promoter A promoter of a subgenomic mRNA of a viral nucleic acid.
  • Substantial Sequence Homology Denotes nucleotide sequences that are substantially functionally equivalent to one another. Nucleotide differences between such sequences having substantial sequence homology will be de minimis in affecting function ofthe gene products or an RNA coded for by such sequence.
  • Transposon A nucleotide sequence such as a DNA or RNA sequence which is capable of transferring location or moving within a gene, a chromosome or a genome.
  • Transgenic plant A plant which contains a foreign nucleotide sequence inserted into either its nuclear genome or organellar genome.
  • RNA polymerase Production of an RNA molecule by RNA polymerase as a complementary copy of a DNA sequence or subgenomic mRNA.
  • Vector A self-replicating RNA or DNA molecule which transfers an
  • RNA or DNA segment between cells such as bacteria, yeast, plant, or animal cells.
  • Virus An infectious agent composed of a nucleic acid which may or may not be encapsidated in a protein.
  • a virus may be a mono-, di-, tri-, or multi- partite virus, as described above.
  • the present invention provides for the infection of a plant host by a recombinant plant virus containing a recombinant plant viral nucleic acid or by the recombinant plant viral nucleic acid which contains one or more non-native nucleic acid sequences which are subsequently transcribed or expressed in the infected tissues ofthe plant host.
  • the product ofthe coding sequences may be recovered from the plant, produce a phenotypic trait in the plant, effect biochemical pathways within the plant or effect endogenous gene expression within the plant.
  • the present invention has a number of advantages.
  • the instant invention allows practitioners to determine the function of a nucleic acid sequence which has been heretofore unknown.
  • chimeric genes and vectors and recombinant plant viral nucleic acids used in this invention are constructed using techniques well known in the art. Suitable techniques have been described in Sambrook et al. (2nd ed.), Cold Spring Harbor Laboratory, Cold Spring Harbor (1982, 1989); Methods in
  • An important feature ofthe present invention is the use of recombinant plant viral nucleic acids which are capable of replication, local and/or systemic spread in a compatible plant host, and which contain one or more non-native subgenomic promoters which are capable of transcribing or expressing adjacent nucleic acid sequences in the plant host.
  • the recombinant plant viral nucleic acids may be further modified to delete all or part ofthe native coat protein coding sequence and to contain a non-native coat protein coding sequence under control ofthe native or one ofthe non-native subgenomic promoters, or put the native coat protein coding sequence under the control of a non-native plant viral subgenomic promoter.
  • the recombinant plant viral nucleic acids have substantial sequence homology to plant viral nucleotide sequences.
  • a partial listing of suitable viruses is described, infra.
  • the nucleotide sequence may be or may be derived from an RNA, DNA, cDNA or a chemically synthesized RNA or DNA.
  • the first step in producing recombinant plant viral nucleic acids according to this particular embodiment for use in the present invention is to modify the nucleotide sequences ofthe plant viral nucleotide sequence by known conventional techniques such that one or more non-native subgenomic promoters are inserted into the plant viral nucleic acid without destroying the biological function ofthe plant viral nucleic acid.
  • the subgenomic promoters are capable of transcribing or expressing adjacent nucleic acid sequences in a plant host infected by the recombination plant viral nucleic acid or recombinant plant virus.
  • the native coat protein coding sequence may be deleted in some embodiments, placed under the control of a non-native subgenomic promoter in other embodiments, or retained in a further embodiment. If it is deleted or otherwise inactivated, a non- native coat protein gene is inserted under control of one ofthe non-native subgenomic promoters, or optionally under control ofthe native coat protein gene subgenomic promoter.
  • the non-native coat protein is capable of encapsidating the recombinant plant viral nucleic acid to produce a recombinant plant virus.
  • the recombinant plant viral nucleic acid contains a coat protein coding sequence, which may be native or a normative coat protein coding sequence, under control of one ofthe native or non-native subgenomic promoters.
  • the coat protein is involved in the systemic infection ofthe plant host.
  • viruses which meet this requirement, and therefore have been shown to be suitable for use according to the methods ofthe present invention, include viruses from the tobamovirus group such as Tobacco Mosaic virus (TMV), Ribgrass Mosaic Virus (RGM), Cowpea Mosaic virus (CMV), Alfalfa Mosaic virus (AMV), Cucumber Green Mottle Mosaic virus watermelon strain (CGMMV-W) and Oat Mosaic virus (OMV) and viruses from the brome mosaic virus group such as Brome Mosaic virus (BMV), broad bean mottle virus and cowpea chlorotic mottle virus.
  • TMV Tobacco Mosaic virus
  • RGM Ribgrass Mosaic Virus
  • CMV Alfalfa Mosaic virus
  • AMV Alfalfa Mosaic virus
  • CGMMV-W Cucumber Green Mottle Mosaic virus watermelon strain
  • Oat Mosaic virus Oat Mosaic virus
  • viruses from the brome mosaic virus group such as Brome Mosaic virus (BMV
  • Suitable viruses include Rice Necrosis virus (RNV), and geminiviruses such as Tomato Golden Mosaic virus (TGMV), Cassava Latent virus (CLV) and Maize Streak virus (MS V).
  • RMV Rice Necrosis virus
  • TGMV Tomato Golden Mosaic virus
  • CLV Cassava Latent virus
  • MS V Maize Streak virus
  • TOBAMOVIRUS GROUP Tobacco Mosaic virus is a member ofthe Tobamo viruses.
  • the TMV virion is a tubular filament, and comprises coat protein sub-units arranged in a single right-handed helix with the single-stranded RNA intercalated between the turns ofthe helix.
  • TMV infects tobacco as well as other plants. TMV is transmitted mechanically and may remain infective for a year or more in soil or dried leaf tissue.
  • the TMV virions may be inactivated by subjection to an environment with a pH of less than 3 or greater than 8, or by formaldehyde or iodine. Preparations of TMV may be obtained from plant tissues by (NH 4 ) 2 SO 4 precipitation, followed by differential centrifugation.
  • the TMV single-stranded RNA genome is about 6400 nucleotides long, and is capped at the 5 '-end but not polyadenylated.
  • the genomic RNA can serve as mRNA for protein of a molecular weight of about 130,000 (130K) and another produced by read-through of molecular weight about 180,000 (180K). However, it cannot function as a messenger for the synthesis of coat protein.
  • Other genes are expressed during infection by the formation of monocistronic, 3'-coterminal subgenomic mRNAs, including one (LMC) encoding the 17.5K coat protein and another (I 2 ) encoding a 3 OK protein.
  • the 3 OK protein has been detected in infected protoplasts as described in Miller, J., Virology 132:71 (1984), and it is involved in the cell-to-cell transport ofthe virus in an infected plant as described by Deom et al, Science 237:389 (1987).
  • the functions ofthe two large proteins are unknown, however, they are thought to function in RNA replication and transcription.
  • RNA molecules including double-str.anded RNAs corresponding to the genomic, I 2 and LMC RNAs, have been detected in plant tissues infected with TMV. These RNA molecules are presumably intermediates in genome replication and/or mRNA synthesis processes which appear to occur by different mechanisms.
  • TMV assembly apparently occurs in plant cell cytoplasm, although it has been suggested that some TMV assembly may occur in chloroplasts since transcripts of ctDNA have been detected in purified TMV virions. Initiation of TMV assembly occurs by interaction between ring-shaped aggregates ("discs") of coat protein (each disc consisting of two layers of 17 subunits) and a unique internal nucleation site in the RNA; a hairpin region about 900 nucleotides from the 3 '-end in the common strain of TMV. Any RNA, including subgenomic RNAs containing this site, may be packaged into virions. The discs apparently assume a helical form on interaction with the RNA, and assembly (elongation) then proceeds in both directions (but much more rapidly in the 3'- to 5'- direction from the nucleation site).
  • CGMMV-W Cucumber Green Mottle Mosaic virus watermelon strain
  • Subgroup I which includes the vulgare, OM, and tomato strain, has an origin of assembly about 800-1000 nucleotides from the 3 '-end ofthe RNA genome, and outside the coat protein cistron. Lebeurier et al, Proc. Natl. Acad. Sci. USA 74:149 (1977); and Fukuda et al, Virology 101:493 (1980).
  • Subgroup II which includes CGMMV-W and cornpea strain (Cc) has an origin of assembly about 300-500 nucleotides from the 3 '-end ofthe RNA genome and within the coat-protein cistron.
  • the coat protein cistron of CGMMV-W is located at nucleotides 176-661 from the 3 '-end.
  • the 3' noncoding region is 175 nucleotides long.
  • the origin of assembly is positioned within the coat protein cistron. Meshi et al, Virology 127:54 (1983).
  • BMV BROME MOSAIC VIRUS GROUP Brome Mosaic virus
  • BMV BROME MOSAIC VIRUS GROUP Brome Mosaic virus
  • Each member of the bromo viruses infects a narrow range of plants. Mechanical transmission of bromoviruses occurs readily, and some members are transmitted by beetles.
  • other bromoviruses include broad bean mottle virus and cowpea chlorotic mottle virus.
  • a bromovirus virion is icosahedral, with a diameter of about 26 ⁇ m, containing a single species of coat protein.
  • the bromovirus genome has three molecules of linear, positive-sense, single-stranded RNA, and the coat protein mRNA is also encapsidated.
  • the RNAs each have a capped 5 '-end, and a tRNA-like structure (which accepts tyrosine) at the 3 '-end.
  • Virus assembly occurs in the cytoplasm.
  • the complete nucleotide sequence of BMV has been identified and characterized as described by Ahlquist et al., J. Mol. Biol. 153:23 (1981).
  • RICE NECROSIS VIRUS Rice Necrosis virus is a member ofthe Potato Virus Y Group or Potyviruses.
  • the Rice Necrosis virion is a flexuous filament comprising one type of coat protein (molecular weight about 32,000 to about 36,000) and one molecule of linear positive-sense single-stranded RNA.
  • the Rice Necrosis virus is transmitted by Polymyxa oraminis (a eukaryotic intracellular parasite found in plants, algae and fungi).
  • Geminiviruses are a group of small, single-stranded DNA-containing plant viruses with virions of unique morphology. Each virion consists of a pair of isometric particles (incomplete icosahedral), composed of a single type of protein (with a molecular weight of about 2.7-3.4X10 4 ). Each geminivirus virion contains one molecule of circular, positive-sense, single-stranded DNA. In some geminiviruses (i.e., Cassava latent virus and bean golden mosaic virus) the genome appears to be bipartite, containing two single-str.anded DNA molecules.
  • TEV tobacco etch virus
  • TEV tobacco etch virus
  • TEV is a well characterized potyvirus and contains a positive-strand RNA genome of 9.5 kilobases encoding for a single, large polyprotein that is processed by three virus- specific proteinases.
  • the nuclear inclusion protein "a" proteinase is involved in the maturation of several replication-associated proteins and capsid protein.
  • the helper component-proteinase (HC-Pro) and 35-kDa proteinase both catalyze cleavage only at their respective C-termini.
  • the proteolytic domain in each of these proteins is located near the C-terminus.
  • the 35-kDa proteinase and HC-Pro derive from the N-terminal region ofthe TEV polyprotein.
  • viruses which are associated with animal hosts. Some of these viruses are discussed, infra.
  • the alphaviruses are a genus ofthe viruses ofthe family Togaviridae. Almost all ofthe members of this genus are transmitted by mosquitoes, and may cause diseases in man or animals. Some of the alphaviruses are grouped into three serologicallly defined complexes. The complex-specific antigen is associated with the El protein ofthe virus, and the species-specific antigen is associated with the E2 protein ofthe virus.
  • the Semliki Forest virus complex includes Bebaru virus, Chikungunya Fever virus, Getah virus, Mayaro Fever virus, O'nyongnyong Fever virus, Ross River virus, Sagiyama virus, Semliki Forest virus and Una virus.
  • the Venezuelan Equine Encephalomyelitis virus complex includes Cabassou virus, Everglades virus, Mucambo virus, Pixuna virus and Venezuelan Equine Encephalomyelitis virus.
  • the Western Equine Encephalomyelitis virus complex includes Aura virus, Fort Morgan virus, Highlands J virus, Kyzylagach virus, Sindbis virus, Western Equine Encephalomyelitis virus and Whataroa virus.
  • the alphaviruses contain an icosahedral nucleocapsid consisting of 180 copies of a single species of capsid protein complexed with a plus-stranded mRNA.
  • the alphaviruses mature when preassembled nucleocapsid is surrounded by a lipid envelope containing two virus-encoded integral membrane glycoproteins, called El and E2.
  • the envelope is acquired when the capsid, assembled in the cytoplasm, buds through the plasma membrane.
  • the envelope consists of a lipid bilayer derived from the host cell.
  • the mRNA encodes a glycoprotein which is cotranslationally cleaved into nonstructural proteins and structural proteins.
  • the 3' one-third ofthe RNA genome consists of a 26S mRNA which encodes for the capsid protein and the E3, E2, K6 and El glycoproteins.
  • the capsid is cotranslationally cleaved from the E3 protein. It is hypothesized that the amino acid triad of His, Asp and Ser at the COOH terminus ofthe capsid protein comprises a serine protease responsible for cleavage. Hahn et al, Proc. Natl. Acad. Sci. USA 82:4648 (1985). Cotranslational cleavage also occurs between E2 and K proteins.
  • two proteins PE2 which consists of E3 and E2 prior to cleavage and an El protein comprising K6 and El are formed. These proteins are cotranslationally inserted into the endoplasmic reticulum ofthe host cell, glycosylated and transported via the Golgi apparatus to the plasma membrane where they can be used for budding. At the point of virion maturation the E3 and E2 proteins are separated. The El and E2 proteins are incorporated into the lipid envelope. It has been suggested that the basic amino-terminal half ofthe capsid protein stabilizes the interaction of capsid with genomic RNA or interacts with genomic RNA to initiate a encapsidation, Strauss et al, in the Togaviridae and Flaviviridaei.
  • Temperature sensitivity results from an A-U change at nucleotide 9502.
  • the t s lesion present cleavage of PE2 to E2 and E3 and the final maturation of progeny virions at the nonpermissive temperature.
  • Hahn et al, supra reported three temperature sensitive mutations in the capsid protein which prevents cleavage ofthe precursor polyprotein at the nonpermissive temperature. The failure of cleavage resulted in no capsid formation and very little envelope protein.
  • Dl particles Defective interfering RNAs (Dl particles) of Sindbis virus are helper- dependent deletion mutants which interfere specifically with the replication ofthe homologous standard virus. Perrault, J., Microbiol. Immunol. 93:151 (1981). Dl particles have been found to be functional vectors for introducing at least one foreign gene into cells. Levis, R., Proc. Natl. Acad. Sci. USA 84:4811 (1987). It has been found that it is possible to replace at least 1689 internal nucleotides of a Dl genome with a foreign sequence and obtain RNA that will replicate and be encapsidated. Deletions ofthe Dl genome do not destroy biological activity.
  • a 42S mRNA region which is packaged into nature virions and functions as the message for the nonstructural proteins
  • 26S mRNA which encodes the structural polypeptides.
  • the 26S mRNA is homologous to the 3' third ofthe 42S mRNA. It is translated into a 13 OK polyprotein that is cotranslationally cleaved and processed into the capsid protein and two glycosylated membrane proteins, El and E2.
  • EEEE Eastern Equine Encephalomyelitis
  • the 26S mRNA region encodes the capsid proteins, E3, E2, 6K and El.
  • the .amino terminal end ofthe capsid protein is thought to either stabilize the interaction of capsid with mRNA or to interact with genomic RNA to initiate encapsidation.
  • PE2 Uncleaved E3 and E2 proteins called PE2 is inserted into the host endoplasmic reticulum during protein synthesis.
  • the PE2 is thought to have a region common to at least five alphaviruses which interacts with the viral nucleocapsid during morphogenesis.
  • the 6K protein is thought to function as a signal sequence involved in translocation ofthe El protein through the membrane.
  • the El protein is thought to mediate virus fusion and anchoring ofthe El protein to the virus envelope.
  • the rhino viruses are a genus of viruses ofthe family Picornaviridae.
  • the rhinoviruses are acid-labile, and are therefore rapidly inactivated at pH values of less than about 6.
  • the rhinoviruses commonly infect the upper respiratory tract of mammals.
  • Human rhinoviruses are the major causal agents ofthe common cold, and many serotypes are known.
  • Rhinoviruses may be propagated in various human cell cultures, and have an optimum growth temperature of about 33°C. Most strains of rhinoviruses are stable at or below room temperature and can withstand freezing.
  • Rhinoviruses can be inactivated by citric acid, tincture of iodine or phenol/alcohol mixtures.
  • HRV2 human rhinovirus 2
  • the genome consists of 7102 nucleotides with a long open reading frame of 6450 nucleotides which is initiated 611 nucleotides from the 5'- end and stops 42 nucleotides from the poly(A) tract. Three capsid proteins and their cleavage cites have been identified.
  • Rhinovirus RNA is single-stranded and positive-sense.
  • the RNA is not capped, but is joined at the 5 '-end to a small virus-encoded protein, virion-protein genome-linked (VPg). Translation is presumed to result in a single polyprotein which is broken by proteolytic cleavage to yield individual virus proteins.
  • An icosahedral viral capsid contains 60 copies each of 4 virus proteins VP1, VP2, VP3 and VP4 and surrounds the RNA genome. Medappa, K., Virology 44:259 (1971). Analysis of the 610 nucleotides preceding the long open reading frame shows several short open reading frames.
  • the capsid proteins of HRV2: VP4, VP2, VP3 and VP1 begin at nucleotide 611, 818, 1601 .and 2311, respectively.
  • the cleavage point between VP1 and P2A is thought to be .around nucleotide 3255. Skern et al, Nucleic Acids Research 13:2111 (1985).
  • HRV89 Human rhinovirus type 89
  • Polioviruses are the causal agents of poliomyelitis in man, and are one of three groups of enteroviruses. Entero viruses are a genus ofthe family Picornaviridae (also the family of rhinoviruses). Most enteroviruses replicate primarily in the mammalian gastrointestinal tract, although other tissues may subsequently become infected. Many enteroviruses can be propagated in primarily cultures of human or monkey kidney cells and in some cell lines (e.g.
  • enteroviruses may be accomplished with heat (about 50°C), formaldehyde (3%), hydrochloric acid (0.1N) or chlorine (ca. 0.3-0.5 ppm free residual Cl 2 ).
  • Poliovirus translation produces a single polyprotein which is cleaved by proteolytic processing. Kitamura et al, Nature
  • the structures ofthe serotypes of poliovirus have a high degree of sequence homology. Their coding sequences code for the same proteins in the same order. Therefore, genes for structural proteins are similarly located.
  • PV1, PV2 and PV3 the polyprotein begins translation near the 750 nucleotide.
  • the four structural proteins VP4, VP2, VP3 and VPI begin at about 745, 960, 1790 and 2495, respectively, with VPI ending at about 3410. They are separated in vivo by proteolytic cleavage, rather than by stop/start codons.
  • Simian virus 40 is a virus ofthe genus Polyomavirus, and was originally isolated from the kidney cells of the rhesus monkey. The virus is commonly found, in its latent form, in such cells. Simian virus 40 is usually non- pathogenic in its natural host.
  • Simian virus 40 virions are made by the assembly of three structural proteins, VPI, VP2 and VP3. Girard et al, Biochem. Biophys. Res. Commun. 40:97 (1970); Prives et al, Proc. Natl. Acad. Sci. USA 21 :302 (1974); and Jacobson et al, Proc. Natl. Acad. Sci. USA 73:2747 (1976).
  • the three conesponding viral genes are organized in a partially overlapping manner. They constitute the late genes portion ofthe genome. Tooze, J., Molecular Biology of Tumor Viruses Appendix A The SV40 Nucleotide Sequence, 2nd Ed. Part 2, pp.
  • Capsid proteins VP2 and VP3 are encoded by nucleotides 545 to 1601 and 899 to 1601, respectively, and both are read in the same frame. VP3 is therefore a subset of VP2.
  • Capsid protein VPI is encoded by nucleotides 1488-2574. The end of the VP2-VP3 open reading frame therefore overlaps the VPI by 113 nucleotides but is read in an alternative frame. Tooze, J., supra. Wychowski et al, J. Virology 61:3862 (1987).
  • ADENOVIRUSES Adenovirus type 2 is a member ofthe adenovirus family or adeno virus.
  • This family of viruses are non-enveloped, icosahedral, linear, double-stranded DNA-containing viruses which infect mammals or birds.
  • the adenovirus virion consists of an icosahedral capsid enclosing a core in which the DNA genome is closely associated with a basic (argmine-rich) viral polypeptide VII.
  • the capsid is composed of 252 capsomeres: 240 hexons
  • Each penton consists of a penton base (composed of viral polypeptide III) associated with one (in mammalian adenoviruses) or two (in most avian adenoviruses) glycoprotein fibres (viral polypeptide IV).
  • the fibres can act as haemagglutinins and are the sites of attachment ofthe virion to a host cell-surface receptor.
  • the hexons each consist of three molecules of viral polypeptide II; they make up the bulk ofthe icosahedron. Various other minor viral polypeptides occur in the virion.
  • the adenovirus dsDNA genome is covalently linked at the 5 '-end of each strand to a hydrophobic 'terminal protein', TP (molecular weight about 55,000 Da); the DNA has an inverted terminal repeat of different length in different adenoviruses. In most adenoviruses examined, the 5 '-terminal residue is dCMP.
  • the virion attaches via its fibres to a specific cell-surface receptor, and enters the cell by endocytosis or by direct penetration of the plasma membrane. Most ofthe capsid proteins are removed in the cytoplasm. The virion core enters the nucleus, where the uncoating is completed to release viral DNA almost free of virion polypeptides.
  • Virus gene expression then begins.
  • the viral dsDNA contains genetic information on both strands.
  • Early genes (regions Ela, Elb, E2a, E3, E4) are expressed before the onset of viral DNA replication.
  • Late genes (regions LI, L2, L3, L4 .and L5) are expressed only after the initiation of DNA synthesis.
  • Intermediate genes (regions E2b and Iva 2 ) are expressed in the presence or absence of DNA synthesis.
  • Region Ela encodes proteins involved in the regulation of expression of other early genes, and is also involved in transformation.
  • the RNA transcripts are capped (with m 7 G 5 ppp 5 N) .and polyadenylated in the nucleus before being transferred to the cytoplasm for translation.
  • Viral DNA replication requires the terminal protein, TP, as well as virus- encoded DNA polymerase and other viral and host proteins.
  • TP is synthesized as an 80K precursor, pTP, which binds covalently to nascent replicating DNA strands.
  • pTP is cleaved to the mature 55K TP late in virion assembly; possibly at this stage, pTP reacts with a dCTP molecule and becomes covalently bound to a dCMP residue, the 3' OH of which is believed to act as a primer for the initiation of DNA synthesis.
  • Late gene expression resulting in the synthesis of viral structural proteins, is accompanied by the cessation of cellular protein synthesis, and virus assembly may result in the production of up to 10 5 virions per cell.
  • viral expression system in bacteria and yeast cells may also be employed. See
  • the nucleic acid of any suitable plant virus can be utilized to prepare a recombinant plant viral nucleic acid for use in the present invention, and the foregoing are only exemplary of such suitable plant viruses.
  • the nucleotide sequence ofthe plant virus is modified, using conventional techniques, by the insertion of one or more subgenomic promoters into the plant viral nucleic acid.
  • the subgenomic promoters are capable of functioning in the specific host plant. For example, if the host is tobacco, TMV, TEV, or other viruses containing subgenomic promoter may be utilized.
  • the inserted subgenomic promoters should be compatible with the TMV nucleic acid and capable of directing transcription or expression of adjacent nucleic acid sequences in tobacco.
  • the native coat protein gene could also be retained and a non-native nucleic acid sequence inserted within it to create a fusion protein.
  • the native or non-native coat protein gene is utilized in the recombinant plant viral nucleic acid. Whichever non-native nucleic acid is utilized may be positioned adjacent its natural subgenomic promoter or adjacent one ofthe other available subgenomic promoters.
  • the non-native coat protein as is the case for the native coat protein, is capable of encapsidating the recombinant plant viral nucleic acid and providing for systemic spread ofthe recombinant plant viral nucleic acid in the host plant.
  • the coat protein is selected to provide a systemic infection in the plant host of interest.
  • the TMV-O coat protein provides systemic infection in N benthamiana
  • TMV-U1 coat protein provides systemic infection in N tabacum.
  • the recombinant plant viral nucleic acid is prepared by cloning a viral nucleic acid. If the viral nucleic acid is D ⁇ A, it can be cloned directly into a suitable vector using conventional techniques. One technique is to attach an origin of replication to the viral D ⁇ A which is compatible with the cell to be transfected. If the viral nucleic acid is R ⁇ A, a full-length D ⁇ A copy ofthe viral genome is first prepared by well-known procedures.
  • the viral R ⁇ A is transcribed into D ⁇ A using reverse transcriptase to produce subgenomic D ⁇ A pieces, and a double-stranded D ⁇ A made using D ⁇ A polymerases.
  • the cD ⁇ A is then cloned into appropriate vectors and cloned into a cell to be transfected.
  • the cD ⁇ A's ligated into the vector may be directly transcribed into infectious R ⁇ A in vitro and inoculated onto the plant host.
  • the cD ⁇ A pieces are mapped and combined in proper sequence to produce a full-length D ⁇ A copy of the viral R ⁇ A genome, if necessary.
  • D ⁇ A sequences for the subgenomic promoters, with or without a coat protein gene are then inserted into the nucleic acid at non-essential sites, according to the particular embodiment ofthe invention utilized.
  • Non-essential sites are those that do not affect the biological properties ofthe plant viral nucleic acid.
  • the RNA genome is the infective agent
  • the cDNA is positioned adjacent a suitable promoter so that the RNA is produced in the production cell.
  • the RNA is capped using conventional techniques, if the capped RNA is the infective agent.
  • the capped RNA can be packaged in vitro with added coat protein from TMV to make assembled virions. These assembled virions can then be used to inoculate plants or plant tissues.
  • an uncapped RNA may also be employed in the embodiments ofthe present mvention. Contrary to the practiced art in scientific literature .and in issued patent (Ahlquist et al, U.S. Patent No. 5,466,788), uncapped transcripts for virus expression vectors are infective on both plants and in plant cells. Capping is not a prerequisite for establishing an infection of a virus expression vector in plants, although capping increases the efficiency of infection.
  • nucleotides may be added between the transcription start site ofthe promoter and the start ofthe cDNA of a viral nucleic acid to construct an infectious viral vector. One or more nucleotides may be added.
  • the inserted nucleotide sequence contains a G at the 5 '-end.
  • the inserted nucleotide sequence is GNN, GTN, or their multiples, (GNN) X or (GTN) X .
  • nucleic acid sequences capable of being transcribed in the plant host.
  • These nucleic acid sequences may be native nucleic acid sequences which occur in the host organism or they may be non-native nucleic acid sequences which do not normally occur in the host organism.
  • the nucleic acid sequence is placed adjacent one ofthe non- native viral subgenomic promoters and/or the native coat protein gene promoter depending on the particular embodiment used.
  • the nucleic acid is inserted by conventional techniques, or the nucleic acid sequence can be inserted into or adjacent the native coat protein coding sequence such that a fusion protein is produced.
  • the nucleic acid sequence which is transcribed may be transcribed as an RNA which is capable of regulating the expression of a phenotypic trait by an anti-sense or a positive-sense mechanism.
  • the nucleic acid sequence in the recombinant plant viral nucleic acid may be transcribed and translated in the plant host to produce a phenotypic trait.
  • the nucleic acid sequence(s) may also code for the expression of more than one phenotypic trait.
  • the recombinant plant viral nucleic acid containing the nucleic acid sequence is constructed using conventional techniques such that the nucleic acid sequence(s) are in proper orientation to whichever viral subgenomic promoter is utilized.
  • a double-stranded DNA ofthe recombinant plant viral nucleic acid or a complementary copy ofthe recombinant plant viral nucleic acid is cloned into the cell to be transfected.
  • the viral nucleic acid is a RNA molecule
  • the nucleic acid (cDNA) is first attached to a promoter which is compatible with the production cell.
  • the recombinant plant viral nucleic acid can then be cloned into any suitable vector which is compatible with the production cell. In this manner, only RNA copies ofthe chimeric nucleotide sequence are produced in the production cell.
  • the CaMV promoter can be used when plant cells are to be transfected.
  • the recombinant plant viral nucleic acid is inserted in a vector adjacent a promoter which is compatible with the production cell. If the viral nucleic acid is a DNA molecule, it can be cloned directly into a production cell by attaching it to an origin of replication which is compatible with the cell to be transfected. In this manner, DNA copies ofthe chimeric nucleotide sequence are produced in the transfected cell.
  • a further alternative when creating the recombinant plant viral nucleic acid is to prepare more than one nucleic acid (i.e., to prepare the nucleic acids necessary for a multipartite viral vector construct). In this case, each nucleic acid would require its own origin of assembly. Each nucleic acid could be prepared to contain a subgenomic promoter and a non-native nucleic acid.
  • the insertion of a non-native nucleic acid into the nucleic acid of a monopartite virus may result in the creation of two nucleic acids (i.e., the nucleic acid necessary for the creation of a bipartite viral vector).
  • Each nucleic acid would have to have its own origin of assembly.
  • the host can be infected with the recombinant plant virus by conventional techniques. Suitable techniques include, but are not limited to, leaf abrasion, abrasion in solution, high velocity water spray and other injury of a host as well as imbibing host seeds with water containing the recombinant plant virus. More specifically, suitable techniques include:
  • Hand inoculations of the encapsidated vector are performed using a neutral pH, low molarity phosphate buffer, with the addition of celite or carborundum (usually about 1%).
  • celite or carborundum usually about 1%.
  • One to four drops ofthe preparation is put onto the upper surface of a leaf and gently rubbed.
  • Plant bed inoculations are performed by spraying (gas-propelled) the vector solution into a tractor- driven mower while cutting the leaves. Alternatively, the plant bed is mowed and the vector solution sprayed immediately onto the cut leaves.
  • High Pressure Spray of Single Leaves Single plant inoculations can also be performed by spraying the leaves with a narrow, directed spray (50 psi, 6-12 inches from the leaf) containing approximately 1% carborundum in the buffered vector solution.
  • Vacuum Infiltration may be accomplished by subjecting the host organism to a substantially vacuum pressure environment in order to facilitate infection.
  • An alternative method for introducing a recombinant plant viral nucleic acid into a plant host is a technique known as agroinfection or Agrobacterium- mediated transformation (sometimes called Agro-infection) as described by Grimsley et al, Nature 325:177 (1987).
  • This technique makes use of a common feature of Agrobacterium which colonizes plants by transferring a portion of their DNA (the T-DNA) into a host cell, where it becomes integrated into nuclear DNA.
  • the T-DNA is defined by border sequences which are 25 base pairs long, and any DNA between these border sequences is transfened to the plant cells as well.
  • the insertion of a recombinant plant viral nucleic acid between the T-DNA border sequences results in transfer ofthe recombinant plant viral nucleic acid to the plant cells, where the recombinant plant viral nucleic acid is replicated, and then spreads systemically through the plant.
  • Agro-infection has been accomplished with potato spindle tuber viroid (PSTV) (Gardner et al, Plant Mol. Biol. 6:221 (1986); CaV (Grimsley et al, Proc. Natl. Acad. Sci. USA 83:3282 (1986)); MSV (Grimsley et al, Nature 325:177 (1987)), and Lazarowitz, S., Nucl. Acids Res.
  • PSTV potato spindle tuber viroid
  • agro-infection of a susceptible plant could be accomplished with a virion containing a recombinant plant viral nucleic acid based on the nucleotide sequence of any ofthe above viruses. Particle bombardment or electrosporation or any other methods known in the art may also be used.
  • Infection may also be attained by placing a selected nucleic acid sequence into an organism such as E. coli, or yeast, either integrated into the genome of such organism or not and then applying the organism to the surface ofthe host organism. Such a mechanism may thereby produce secondary transfer ofthe selected nucleic acid sequence into the host organism. This is a particularly practical embodiment when the host organism is a plant.
  • infection may be attained by first packaging a selected nucleic acid sequence in a pseudovirus. Such a method is described in WO 94/10329, the teachings of which are inco ⁇ orated herein by reference. Though the teachings of this reference may be specific for bacteria, those of skill in the art will readily appreciate that the same procedures could easily be adapted to other organisms.
  • the function of a nucleic acid may be determined by complementation analysis. That is, the function ofthe nucleic acid of interest may be determined by observing the endogenous gene or genes whose function is replaced or augmented by introducing the nucleic acid of interest.
  • a discussion of this principle is provided by Napoli et al, The Plant Cell 2:279-289 (1990) which is inco ⁇ orated herein by reference. Further teachings in these regards are provided by WO 97/42210, the disclosure of which is also inco ⁇ orated herein by reference.
  • the function of a nucleic acid may be determined by analyzing the biochemical alterations in the accumulation of substrates or products from enzymatic reactions according to any one ofthe means known by those skilled in the art.
  • the function of a nucleic acid may be determined by observing phenotypic changes in the host by methods including mo ⁇ hological, macroscopic or microscopic analysis.
  • the function of a nucleic acid may be determined by observing the change in biochemical pathways which may be modified in the host as a result ofthe local and/or systemic expression ofthe non- native nucleic acids.
  • the function of a nucleic acid may be determined utilizing techniques known by those skilled in the art to observe inhibition of gene expression in the cytoplasm of cells as a result of expression of the non-native nucleic acid.
  • a particularly useful way to determine gene function is by observing the phenotype in a whole plant when a particular gene function has been silenced.
  • Useful phenotypic traits in plant cells which may be observed microscopically, macroscopically or by other methods include, but are not limited to, improved tolerance to herbicides, improved tolerance to extremes of heat or cold, drought, salinity or osmotic stress; improved resistance to pests (insects, nematodes or arachnids) or diseases (fungal, bacterial or viral) production of enzymes or secondary metabolites; male or female sterility; dwarfhess; early maturity; improved yield, vigor, heterosis, nutritional qualities, flavor or processing properties, and the like.
  • phenotypic traits include the production of important proteins or other products for commercial use, such as lipase, melanin, pigments, alkaloids, antibodies, hormones, pharmaceuticals, antibiotics and the like.
  • Another useful phenotypic trait is the production of degradative or inhibitory enzymes, such as are utilized to prevent or inhibit root development in malting barley or that determine response or non-response to a systemically administered drug in a human.
  • the phenotypic trait may also be a secondary metabolite whose production is desired in a bioreactor.
  • Another particularly useful means to determine function of nucleic acids transfected into a host is to observe the effects of gene silencing.
  • EST/DNA library from an organism for example Arabidopsis thaliana
  • the DNA sequences in the transcription plasmid library may then be introduced into plant cells as part of a functional RNA virus which post-transcriptionally silences the homologous target gene.
  • the EST/DNA sequences may be introduced into a plant viral vector in either the plus or minus sense orientation, and the orientation can be either directed or random based on the cloning strategy.
  • a high-throughput, automated cloning scheme based on robotics may be used to assemble and characterize the library.
  • double stranded RNA may also be an effective stimulator of gene silencing/co- suppression in transgenic plant. Gene silencing/co-suppression of plant genes may be induced by delivering an RNA capable of base pairing with itself to form double stranded regions. This approach could be used with any plant or non- plant gene to assist in the identification ofthe function of a particular gene sequence.
  • a particularly troublesome problem with gene silencing in plant hosts is that many plant genes exist in a multigene family. Therefore, effective silencing of a gene function may be especially problematic.
  • nucleic acids may be inserted into the genome to effectively silence a particular gene function or to silence the function of a multigene family. It is presently believed that about 20% of plant genes exist in multigene families.
  • a single nucleotide sequence of about 20 to 100 or more bases having about 70% or more homology to a gene may silence an entire plant gene family having two or more homologous genes.
  • RNA can reduce the expression of a target gene through inhibitory RNA interactions with target mRNA that occur in the cytoplasm and/or the nucleus of a cell.
  • Full-length cDNAs may be accessed from public and private repositories or extracted from field samples for insertion of unknown open reading frames into viral vectors for expression of nucleic acids in the host organism and thereby utilized as an alternative to antisense gene knockout.
  • This technology may be implemented by PCR amplification and cloning of all cDNAs that do not share homology with gene sequences in public and or private databases.
  • the cDNAs may be expressed in plants transfected with one or more plant viral vectors for subsequent analysis of novel phenotype ofthe whole plant (biochemical and mo ⁇ hological).
  • Selected cDNA sequences from maize, rice, soybean canola and other crop species may be used to assemble the cDNA libraries. This method may thus be used to search for useful dominant gene phenotypes from novel cDNA libraries through the gene expression.
  • An EST/cDNA library from an organism such as Arabidopsis thaliana may be assembled into a plant viral transcription plasmid background.
  • the cDNA sequences in the transcription plasmid library can then be introduced into plant cells as cytoplasmic RNA in order to post-transcriptionally silence the endogenous genes.
  • the EST/cDNA sequences may be introduced into the plant viral transcription plasmid in either the plus or anti-sense orientation (or both), and the orientation can be either directed or random based on the cloning strategy.
  • a high-throughput, automated cloning strategy using robotics can be used to assemble the library.
  • the EST clones can be inserted behind a duplicated subgenomic promoter such that they are represented as subgenomic transcripts during viral replication in plant cells.
  • the EST/cDNA sequences can be inserted into the genomic RNA of a plant viral vector such that they are represented as genomic RNA during the viral replication in plant cells.
  • the library of EST clones is then transcribed into infectious RNA and inoculated onto individual platelets of Arabidopsis thaliana (or other plant species).
  • the viral RNA containing the EST/cDNA sequences contributed from the original library are now present in a sufficiently high concentration in the cytoplasm such that they cause post-transcriptional gene silencing ofthe endogenous plant-gene homologs.
  • the orientation ofthe EST/cDNA insert is normally irrelevant in terms of producing the desired gene-silenced phenotype in the tissue.
  • Partial cDNA sequences cloned into a plant viral vector in the sense orientation have previously been shown to also confer a gene silencing phenotype (Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679 (1995)), the teachings of which are inco ⁇ orated herein by reference.
  • the actual mechanism of gene silencing has not been fully determined. This phenomenon may be similar to the gene silencing via cosuppression observed in transgenic plants.
  • the plant tissue may then be taken for sophisticated biochemical analysis in order to determine which metabolic pathway has been affected by the EST/DNA gene silencing, and in particular, which steps in a given metabolic pathway have been affected by the EST/DNA gene silencing.
  • Biochemical analysis may be done, for example, in a high-throughput, fully automated fashion using robotics. Suitable biochemical analysis may include MALDI-TOF, LC/MS, GC/MS, two-dimensional IEF/SDS-PAGE, ELISA or other methods of analyses.
  • the clones in the EST/plant viral vector library may then be functionally classified based on metabolic pathway affected or visual/selectable phenotype produced in the plant. This process enables the rapid determination of gene function for unknown EST/DNA sequences of plant origin. Furthermore, this process can be used to rapidly confirm function of full-length DNA's of unknown gene function. Functional identification of unknown ESTDNA sequences in a plant library may then rapidly lead to identification of similar unknown sequences in expression libraries for other crop species based on
  • Links may be made between sequences from various species predicted to carry out similar biochemical or regulatory functions. Links may also be generated between predicted enzymatic activities and visually displayed biochemical and regulatory pathways. Likewise, links may be generated between predicted enzymatic or regulatory activity and known small molecule inhibitors, activators, substrates or substrate analogs. Phenotypic data from expression libraries expressed in transfected hosts maybe automatically linked within such a relational database. Genes with similar predicted roles of interest in other crop plants or crop plant pests may thereby be more rapidly discovered. A complete classification scheme of gene functionality for a fully sequenced eukaryotic organism has been established for yeast. This classification scheme may be modified for plants and divided into the appropriate categories. Such organizational structure may be utilized to rapidly identify herbicide target loci which may confer dominant lethal phenotypes, and thereby is useful in helping to design rational herbicide programs.
  • a second aspect ofthe present invention is a method of silencing endogenous genes in a host by introducing nucleic acids into the host by way of a viral nucleic acid suitable to produce the local and systemic expression ofthe nucleic acid of interest.
  • the host is a plant, but those skilled in the art will imderstand that other hosts may also be utilized.
  • This method utilizes the principle of post-transcription gene silencing ofthe endogenous host gene homolog as described above. Since the replication mechanism produces both sense and anti-sense RNA sequences as disclosed above, the orientation of the non-native nucleic acid insert is not crucial to providing gene silencing. More information describing some aspects ofthe "gene silencing" effect is provided in co-pending U.S. Patent Application Serial No.
  • RNA can reduce the expression of a target gene through inhibitory RNA interactions with target mRNA that occur in the cytoplasm and/or the nucleus of a cell.
  • Silencing of endogenous genes can be achieved with homologous (but not identical) sequences from distant plant species.
  • the Nicotiana benthamiana gene for phytoene desaturase may be silenced by fransfection with a partial tomato cDNA for PDS (cloned in either the positive or antisense orientation).
  • the tomato PDS cDNA is 92% homologous at the nucleotide level yet is still able to confer efficient gene silencing in an unrelated plant species (Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679 (1995)).
  • Identification of EST/cDNA gene function in Arabidopsis thaliana could then be extrapolated to similar EST/cDNA sequences of unknown function that exist in other libraries (e.g., soybean, maize, rice, oilseed rape, etc.).
  • a third aspect ofthe present invention is a method for selecting desired functions of RNAs and proteins by the use of virus vectors to express libraries of nucleic acid sequence variants.
  • Libraries of sequence variants may be generated by means of in vitro mutagenenisis and/or recombination. Rapid in vitro evolution can be used to improve virus-specific or protein-specific functions.
  • plant RNA virus expression vectors may be used as tools to bear libraries containing variants of nucleic acid, genes from virus, plant or other sources, and to be applied to plants or plant cells such that the desired altered effects in the RNA or protein products can be determined, selected and improved.
  • nucleic acid shuffling techniques may be employed to construct shuffled gene libraries.
  • Random, semi-random or known sequences of virus origin may also be inserted in virus expression vectors between native virus sequences and foreign gene sequences, to increase the genetic stability of foreign genes in expression vectors as well as the translation ofthe foreign gene and the stability ofthe mRNA encoding the foreign gene in vivo.
  • the desired function of RNA and protein may include the promoter activities, replication properties, translational efficiencies, movement properties (local and systemic), signaling pathway, or virus host range, among others.
  • the desired function alteration can be identified by assaying infected plants and the nature of mutation can be determined by analysis of sequence variants in the virus vector.
  • Methods to increase the representation of gene sequences in virus expression libraries may also be achieved by bypassing the genetic bottleneck of propagation in E. coli.
  • cell-free methods may be used to clone sequence libraries or individual arrayed sequences into virus expression vectors and reconstruct an infectious virus, such that the final ligation product can be transcribed and the resulting RNA can be used for plant or plant cell inoculation/infection with the output being gene function discovery or protein production.
  • RNA viruses or RNA virus vectors as populations or individuals in parallel to identify individuals with novel and augmented virus-encoded functions in replication and virus movement, foreign gene sequence retention in vectors and proper folding, activity and expression of protein products, novel gene expression, effects on host metabolism, and resistance or susceptibility of plants to exogenous agents.
  • Variation in the sequence of a native virus gene(s) or heterologous nucleotide sequence(s) may be introduced into an RNA virus or an RNA virus expression vector by many methods as a means to screen a population of variants in batch or individuals in parallel for novel properties exhibited by the virus itself or conferred on the host plant or cell by the virus vector.
  • Variant populations can be transfected as populations or individual clones into "host”: 1) protoplasts; 2) whole plants; or 3) inoculated leaves of whole plants and screened for various traits including protein expression (increase or decrease), RNA expression (increase or decrease), secondary metabolites or other host property gained or loss as a result ofthe virus infection.
  • a virus vector which simultaneously expressed the green fluorescent protein (GFP) or other selectable marker gene and the variant sequence, is used to screen quantitatively for levels of resistance or sensitivity to the agent in question conferred upon the host by the variant sequence expressed from the viral vector.
  • GFP green fluorescent protein
  • a new mode of enhanced or reduced expression could be identified by finding the factors that respond to cellular signals that in turn alter its particular expression.
  • transcription factors regulating the expression of defense proteins such as systemin peptides, or protease inhibitors could be identified by transfecting hosts with virus libraries and the expression of systemin or protease inhibitors or their RNAs be directly assayed.
  • the promoters responsible for expressing these genes could be genetically fused to the green fluorescent protein and introduced into hosts as transient expression constructs or into stable transformed host cells/tissues. The resulting cells would be transfected with viral vector libraries.
  • Hosts now could be screened rapidly by following relative GFP expression following vector fransfection.
  • coupling the transfecting of hosts with virus libraries with the treatment of plants with methyl jasmonate could identify sequences that reverse or enhance the gene induction events induced by this metabolite.
  • This approach could be applied to other factors involved in promotion of higher biomass in plants such as Leafy or DET2.
  • the expression of these factors could be directly assayed or via promoters genetically fused to GFP. This technique will enable screening of populations or individuals followed by rescue ofthe sequence from virus vectors conferring desired trait by RT-PCR and re-screening of particular variant sequences in secondary screens.
  • a fourth aspect ofthe present invention is a method for inhibiting an endogenous protease of a plant host comprising the step of treating the plant host with a compound which induces the production of an endogenous inhibitor of said protease.
  • jasmonic acid may be used to treat the plant host to induce the production of an endogenous inhibitor of an endogenous protease.
  • the treatment ofthe plant host with a compound results an increased representation of an exogenous nucleic acid or the protein product thereof.
  • transgenic hosts expressing protease inhibitors may be used to decrease the degradation of proteins expressed by virus expression vectors.
  • jasmonic acid may be used to treat plants infected with virus expression vectors to decrease the degradation of proteins expressed by virus expression vectors.
  • a fifth aspect ofthe present invention are genes and fragments thereof, nucleotide sequences, and gene products obtained by way ofthe method ofthe present invention.
  • the present invention features expressing selected nucleotide sequences in a host organism such as, for example, a plant.
  • a host organism such as, for example, a plant.
  • Such gene products may exhibit biological activity as pharmaceuticals, herbicides, and other similar functions.
  • a hybrid virus consisting of TMV-U1 and ToMV-F was constructed by swapping an 874- bp BamRl-Kpnl ToMV fragment into pBGC152, creating plasmid TTO1. The inserted fragment was verified by dideoxynucleotide sequencing.
  • a unique Avrll site was inserted downstream ofthe Xhol site in TTO1 by PCR mutagenesis, creating plasmid TTO1 A, using the following oligonucleotides: 5'-
  • Partial cDNAs were isolated from ripening tomato fruit RNA by polymerase chain reaction (PCR) using the following oligonucleotides: PSY, 5'-TATGTATGGTGCAGAAGAACAGAT-3' (upstream) (SEQ ID NO: 5), 5 * -AGTCGACTCTTCCTCTTCTGGCAT C-3' (downstream) (SEQ ID NO: 6); PDS, 5'- TGCTCGAGTGTGTTCTTCAGTTTTCTGTCA-3' (SEQ ID NO: 7)
  • TTO1/PSY + (FIGURE 1) contains the phytoene synthase cDNA in the positive orientation under the control ofthe TMV-U1 coat protein subgenomic promoter; while, the vector TTOl/PSY - contains the phytoene synthase cDNA in the antisense orientation.
  • TTO1A/PDS + (FIGURE 2) contains the phytoene desaturase cDNA in the positive orientation under the control ofthe TMV-U1 coat protein subgenomic promoter; while the vector TTO1A/PDS - contains the phytoene desaturase cDNA in the antisense orientation.
  • TTO1/PSY+ Transfection and analysis of N. benthamiana ITTO1/PSY+.
  • Infectious RNAs from TTO1/PSY+ (FIGURE 1), TTO 1 /PS Y-TTO 1 /PDS +, TTO1/PDS+ were prepared by in vitro transcription using SP6 DNA-dependent RNA polymerase as described previously (Dawson et al, Proc. Natl. Acad. Sci. USA 83:1832 (1986)) and were used to mechanically inoculate N benthamiana.
  • the hybrid viruses spread throughout all the non- inoculated upper leaves as verified by transmission electron microscopy, local lesion infectivity assay, and polymerase chain reaction (PCR) amplification.
  • the viral symptoms resulting from the infection consisted of distortion of systemic leaves and plant stunting with mild chlorosis.
  • the leaves from plants transfected with TTO1/PSY+ turned orange and accumulated high levels of phytoene while those transfected with TTO1 ⁇ /PDS+ and TTO1 ⁇ /PDS- turned white.
  • Agarose gel eletrophoresis of PCR cD ⁇ A isolated from virion R ⁇ A and Northern blot analysis of virion RNA indicate that the vectors are maintained in an extrachromosomal state and have not undergone any detectable intramolecular rearrangements.
  • the carotenoids were isolated from systemically infected tissue and analyzed by HPLC chromatography. Carotenoids were extracted in ethanol and identified by their peak retention time and abso ⁇ tion spectra on a 25-cm Spherisorb® ODS-1 5- m column using acetonitrile/methanol/2-propanol (85:10:5) as a developing solvent at a flow rate of 1 ml/min. They had identical retention time to a synthetic phytoene standard and ⁇ -carotene standards from carrot and tomato. The phytoene peak from N benthamiana transfected with TTO1/PSY + had an optical absorbance maxima at 276, 285, and 298 nm.
  • Plants transfected with viral encoded phytoene synthase showed a ten- fold increase in phytoene compared to the levels in noninfected plants.
  • the expression of sense and antisense R ⁇ A to a partial phytoene desaturase in transfected plants inhibited the synthesis of colored carotenoids and caused the systemically infected leaves to turn white. HPLC analysis of these plants revealed that they also accumulated phytoene. The white leaf phenotype was also observed in plants treated with the herbicide norflurazon which specifically inhibits phytoene desaturase.
  • This change in the levels of phytoene represents one ofthe largest increases of any carotenoid (secondary metabolite) in any genetically engineered plant.
  • Plants transfected with viral-encoded phytoene synthase showed a ten-fold increase in phytoene compared to the levels in noninfected plants.
  • the accumulation of phytoene in plants transfected with positive-sense or antisense phytoene desaturase suggests that viral vectors can be used as a potent tool to manipulate pathways in the production of secondary metabolites through cytoplasmic .antisense inhibition.
  • EXAMPLE 2 Expression of bell pepper cD ⁇ A in transfected plant confirms that it encodes capsanthin-capsorubin svnthase.
  • capsanthin In contrast to the situation prevailing in chromoplasts, capsanthin was not esterified and its increased level was balanced by a concomitant decrease ofthe major leaf xanthophylls, suggesting an autoregulatory control of chloroplast carotenoid composition. Capsanthin was exclusively recruited into the trimeric and monomeric light-harvesting complexes of Photosystem II. This demonstration that higher plant antenna complexes can accommodate non-native carotenoids provides compelling evidence for functional remodeling of photosynthetic membranes by rational design of carotenoids.
  • Ccs bell pepper capsanthin-capsorubin synthase
  • the 1.6-kb Xhol, Avrll cDNA fragment was placed under the control ofthe TMV-U1 coat protein subgenomic promoter by subcloning into TTO1A, creating plasmid TTO1A CCS+ (FIGURE 3) in the sense orientation as represented by FIGURE 3.
  • EXAMPLE 3 Expression of bacterial Crt-9 gene in transfected plants confirms that it encodes phytoene svnthase.
  • TTU51 tobacco mosaic virus strain Ul (TMV-U1) (Goelet et al, Proc. Natl. Acad. Sci. USA 79:5818- 5822 (1982)), and tobacco mild green mosaic virus (TMGMV; U5 strain) (Solis et al, "The complete nucleotide sequence ofthe genomic RNA ofthe tobamovirus tobacco mild green mosaic virus” (1990)).
  • TMV tobacco mosaic virus
  • This construct also contained the gene encoding the chloroplast targeting peptide (CTP) for the small subunit of ribulose- 1,5-bisphosphate carboxylase (RUBISCO) (O'Neal et al, Nucl. Acids Res. L5: 8661-8677 (1987)) and was called TTU51 CTP Crt-5 as represented by FIGURE 4.
  • CTP chloroplast targeting peptide
  • RUBISCO ribulose- 1,5-bisphosphate carboxylase
  • Phytoene synthase activity from plants transfected with TTU51 CTP Crt-5 was assayed using isolated chloroplasts and labeled [ 14 C] geranylgeranyl PP. There was a large increase in phytoene and an unidentified C 40 alcohol in the CrtB plants. Phytoene synthetase assay.
  • the chloroplasts were prepared as described previously (Camara, Methods Enzymol. 214:352-365 (1993)).
  • the phytoene synthase assays were carried out in an incubation mixture (0.5 ml final volume) buffered with Tris-HCL, pH 7.6, containing [ 14 C] geranylgeranyl PP (100,000 cpm) (prepared using pepper GGPP synthase expressed in E. coli), 1 mM ATP, 5 mM MnCl , 1 mM MgCl 2 , Triton X-100 (20 mg per mg of chloroplast protein) and chloroplast suspension equivalent to 2 mg protein.
  • reaction products were extracted with chloroform methanol (Camara, supra) and subjected to TLC onto silicagel plate developed with benzene/ethyl acetate (90/10) followed by autoradiography.
  • Chlorophyll synthetase assay Chlorophyll synthetase assay.
  • the isolated chloroplasts were lysed by osmotic shock before incubation.
  • the reaction mixture (0.2 ml, final volume) consisting of 50 mM Tris-HCL (pH 7.6) containing [ 14 C] geranylgeranyl PP (100,000 cpm), 5 MgCl 2 , 1 mM ATP, and ruptured plasmid suspension equivalent to 1 mg protein was incubated for 1 hr at 30°C.
  • the reaction products were analyzed as described previously.
  • TTU51 Tobamoviral vector, TTU51, was constructed.
  • PDS Erwinia phytoene desaturase
  • the chloroplast targeting, bacterial phytoene desaturase expression vector, TTOSAl CTP CrtI 491 #7 was constructed as follows. First, a unique Sphl site was inserted in the start codon for the Erwinia herbicola phytoene desaturase gene (plasmid pAU211, (FIGURE 6) by polymerase chain reaction (PCR) mutagenesis using the oligonucleotides CrtI HSM1 5'-GA CAG AAG CTT TGC AGC ATG CAA AAA ACC GTT-3' (upstream) (SEQ ID NO: 16) and IQ419A 5'-CGC GGT CAT TGC AGA TCC TCA ATC ATC AGG C-3' (downstream) (SEQ ID NO: 17).
  • the 1504 bp CrtI PCR fragment was subcloned into pBluescript® (Stratagene) by inserting it between the EcoRV and Hindlll sites, creating plasmid KS+/Crt/* 491 (FIGURE 7).
  • a 1481 bp Sphl, Avrll CrtI fragment from plasmid KS+ICrtI* 491 was then subcloned into the tobamo viral vector TTOSAl , creating TTOSAl CTP CrtI 491 #7.
  • Leaf tissue from a TTOSAl CTP CrtI 491 #7 infected plant was examined for ⁇ -carotene levels. Treating an uninoculated control plant with norflurazon resulted in severely depressed ⁇ -carotene levels (7.8% ofthe wild- type level). However, when a plant which had been previously inoculated with the viral construct TTOSAl CTP CrtI 491 #7 was treated with norflurazon, the ⁇ - carotene level were partially restored (28.3% ofthe wild-type level).
  • EXAMPLE 5 Expression of 5-enolpyruvylshikimate-3-phosphate svnthase -.EPSPS ("EPSPS”) genes in plants confers resistance to Roundup® herbicide.
  • EPSPS 5- enolpyruvylshikimate-3-phosphate synthase
  • the purpose of this experiment is to provide a method to systemically express EPSPS genes via a recombinant viral vector in fully-grown pl-ants.
  • Transfected plants that ove ⁇ roduce the enzyme EPSPS in vegetative tissue (root, stem, and leaf) are resistant to Roundup® herbicide.
  • the present invention provides a method for the production of plasmid-targeted EPSPS in plants via an RNA viral vector.
  • a dual subgenomic promoter vector encoding the full-length EPSPS gene from Nicotiana tabacum (Class I EPSPS) is shown in plasmid pBS736.
  • Systemic expression ofthe Nicotiana tabacum Class I EPSPS confers resistance to Roundup® herbicide in whole plants and tissue culture.
  • FIGURE 8 shows plasmid pBS736.
  • EXAMPLE 6 Cvtoplasmic inhibition of 5-enolpyruwlshikimate-3-phosphate svnthase (EPSPS) genes in plants blocks aromatic amino acid biosynthesis.
  • EPSPS 5-enolpyruwlshikimate-3-phosphate svnthase
  • EPSPS 5-enolpyruvylshikimate-3-phosphate synthase
  • a transgenic plant or naturally occurring plant mutant may have a nonfunctional gene such as the one which produces EPSP synthase.
  • a plant deficient or lacking in the EPSP synthase gene could grow only in the presence of added aromatic amino acids.
  • Transfection of plants with a viral vector containing a functional EPSP synthase gene or cDNA sequence encoding the same would cause the plant to produce a functional EPSP synthase gene product.
  • a plant so transfected would then be able to grow normally without added aromatic amino acids to its environment.
  • the EPSP synthase mutation in the plant would be complemented in trans by the viral nucleic acid sequence containing the native or foreign EPSP synthase cDNA sequence.
  • EXAMPLE 8 Expression of methylotrophic veast ZZA1 gene in transfected plants confirms that it encodes alcohol oxidase.
  • Infectious RNA from TTO1 APE ZZA1 (FIGURE 11) was prepared by in vitro transcription using SP6 DNA-dependent RNA polymerase and used to mechanically inoculate N benthamiana.
  • the 72-kDa protein accumulated in systemically infected tissue and was analyzed by immunoblotting, using Pichia pastoris alcohol oxidase as a standard. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plant extracts.
  • Thermus aquaticus DNA polymerase (2.5U; Perkin-Elmer Cetus) consisted of an initial 2 minute incubation at 97°C followed by two cycles at 97°C (Imin.), 45°C (Imin.), 60°C (1 min.), thirty-five cycles at 94°C (1 min.), 45°C (1 min.), 60°C (1 min.), and a final DNA polymerase extension at 60°C for 7 min.
  • the 3273 base pair fragment containing ZZA1 gene was phenol/chloroform treated and precipitated with ammonium acetate/ethanol.
  • the alcohol oxidase genomic clone KS-AO7'8' was characterized by restriction mapping and dideoxynucleotide sequencing.
  • Rapid, high-level expression of rice OS 103 cDNA in transfected plants confirms that it encodes glycosylated rice ⁇ -amylase.
  • the hybrid virus spread throughout the noninoculated upper leaves as verified by transmission electron microscopy, local lesion infectivity assay, and PCR amplification.
  • the viral symptoms consisted of plant stunting with mild chlorosis and distortion of systemic leaves.
  • the 46-kDa ⁇ -amylase accumulated to levels of at least 5% total soluble protein, and was analyzed by immunoblotting, using yeast expressed ⁇ -amylase as a standard. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plant extracts.
  • the expression level ofthe recombinant enzyme produced in transfected plants was at least ten times higher than the amount of thermostable bacterial ⁇ -amylase produced in transgenic tobacco.
  • the ⁇ -amylase was purified using ion exchange chromatography and its structural and biological properties were analyzed.
  • the secreted protein had an approximate relative molecular mass of 46 kDa, cross-reacted with anti- ⁇ -amylase antibody, and hydro lyzed starch and oligomaltose in an in vitro assay.
  • the recombinant enzyme from transfected N benthamiana was glycosylated at an asparagine residue via an N-glycosidic linkage.
  • the heterologously expressed ⁇ -amylases from transfected N benthamiana .and from transformed strains of S. cerevisiae and P. pastoris were treated with endo-H and were compared by Western blot SDS-PAGE analysis. There was an equivalent mobility shift for the enzymes expressed in S. cerevisiae and P. pastoris. The extent ofthe change in mobility suggests that the yeast expressed enzymes are hyperglycosylated while the recombinant protein from transfected plants is similar to that ofthe native rice ⁇ -amylase.
  • the 1.4-kb Xhol, Avrll OS 103 PCR fragment was subcloned into pTTOlA, creating plasmid TTOl A 103L.
  • the ⁇ -amylase was eluted with a linear gradient of 0.0- 1 M ⁇ aCl in 50 mM Tris-bis propane pH 7.0.
  • the ⁇ -amylase eluted in fraction 16, 17 and its enzyme activity was analyzed (Sigma Kit #576-3).
  • Fractions containing cross-reacting material to ⁇ -amylase antibody were concentrated with a Centriprep-30® (Amicon) and the buffer was exchanged by diafiltration (50 mM Tris-bis propane, pH 7.0).
  • the sample was then loaded on a POROS HQ/M column (Perceptive Biosystems), eluted with a linear gradient of 0.0-1 M ⁇ aCl in 50 mM Tris-bis propane pH 7.0, and assayed for ⁇ -amylase activity. Fractions containing cross-reacting material to ⁇ -amylase antibody were concentrated with a Centriprep-30 and the buffer was exchanged by diafiltration (20 mM Sodium Acetate/HEPES/MES, pH 6.0). The sample was finally loaded on a POROS HQ/M column (Perceptive Biosystems), eluted with a linear gradient of 0.0-1 M ⁇ aCl in 50 mM Tris-bis propane pH 7.0, and assayed for ⁇ -amylase activity. Fractions containing cross-reacting material to ⁇ -amylase antibody were concentrated with a Centriprep-30 and the buffer was exchanged by diafiltration (20 mM Sodium Acetate/HEPES/MES,
  • HS/TVI column Perceptive Biosystems
  • Total soluble plant protein concentrations were determined using bovine serum albumin as a standard.
  • the proteins were analyzed on a 0.1 % SDS/ 10% polyacrylamide gel and fransfened by elecfrob lotting for 1 hour to a nitrocellulose membrane. The blotted membrane was incubated for 1 hr with a 2000- fold dilution of anti- ⁇ -amylase antiserum.
  • the antisera was raised in rabbits against S cerevisiae expressed rice ⁇ -amylase.
  • the enhanced chemiluminescence horseradish peroxidase-linked, goat anti-rabbit IgG assay (Cappel Laboratories) was performed according to the manufacturer's (Amersham) specifications.
  • the blotted membrane was subjected to film exposure times of up to 10 sec.
  • the quantity of total recombinant ⁇ -amylase in an extracted leaf sample was determined (using a 1-sec exposure ofthe blotted membrane) by comparing the crude extract chemiluminescent signal to the signal obtained from known quantities of ⁇ -amylase. Shorter and longer chemiluminescent exposure times ofthe blotted membrane gave the same quantitative results.
  • recombinant protein Approximately 5 ⁇ g of recombinant protein was dissolved in 1 M acetic acid and subjected to matrix-assisted laser deso ⁇ tion/ionization time of flight (MALDI-TOF) analysis (Karas et al, Anal Chem. 60:2299-2301 (1988)).
  • MALDI-TOF matrix-assisted laser deso ⁇ tion/ionization time of flight
  • endo H endo--5-N-acetylglucosaminidase H
  • 2 ⁇ g ofthe recombinant ⁇ -amylases were denatured in 0.5% SDS/ 1% ⁇ -mercaptoethanol at 100°C for 10 minutes.
  • EXAMPLE 10 Expression of Chinese cucumber cDNA clone pO21D in transfected plants confirms that it encodes ⁇ -trichosanthin.
  • the ⁇ - trichosanthin open reading frame was verified by dideoxy sequencing, and placed under the control ofthe TMV-U1 coat protein subgenomic promoter by subcloning into TTU51 A, creating plasmid TTU51 A QSEO #3.
  • N benthaminana plants were inoculated with in vitro transcripts of Kpn I- digested TTU51A QSEO #3 as previously described (Dawson et al, supra). Virions were isolated from N benthamiana leaves infected with TTU51A QSEO #3 transcripts.
  • the hemoglobin expression vector, RED1 was constructed in several subcloning steps. A unique Sphl site was inserted in the start codon for the human ⁇ -globin and a Xbal site was placed downstream ofthe stop codon by
  • This construct consists of a 1020 bp fragment from the tobacco mild green mosaic virus (TMGMV; U5 strain) containing the viral subgenomic promoter, coat protein gene, and the 3'-end that was isolated by PCR using TMGMV primers 5' ⁇ GGC TGT GAA ACT CGA AAA GGT TCC GG-3* (upstream) (SEQ ID NO: 28) and 5'-CGG GGT ACC TGG GCC GCT ACC GGC GGT TAG GGG AGG-3' (downstream) (SEQ ID NO: 29).
  • TMGMV tobacco mild green mosaic virus
  • a hybrid sequence encoding rice alpha-amylase signal peptide and ⁇ - chain of human hemoglobin was placed under the control ofthe tobacco mosaic virus (TMV-U1) coat protein subgenomic promoter. Infectious RNA was made in vitro and directly applied to N benthamiana. One to two weeks post- inoculation transfected plants had accumulated recombinant hemoglobin. The 16-KDa ⁇ -globin accumulated in systemically infected leaves and was analyzed by immunoblotting, using hum.an hemoglobin as a standard. The recombinant hemoglobin was detected in transfected plants using a rabbit anti-human hemoglobin antibody. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plants.
  • TMV-U1 tobacco mosaic virus
  • EXAMPLE 12 Construction of a tobamoviral vector for expression of heterologous genes in A. thaliana. Virions that were prepared as a crude aqueous extract of tissue from turnip infected with RMV were used to inoculate N benthamiana, N tabacum, A. thaliana, and oilseed rape (canola). Two to three weeks after fransfection, systemically infected plants were analyzed by immunoblotting, using purified RMV as a standard.
  • Total soluble plant protein concentrations were determined using bovine serum albumin as a standard. The proteins were analyzed on a 0.1% SDS/12.5% polyacrylamide gel and transfened by electroblotting for 1 hr to a nitrocellulose membrane. The blotted membrane was incubated for 1 hr with a 2000-fold dilution of anti-ribgrass mosaic virus coat antiserum. Using standard protocols, the antisera was raised in rabbits against purified RMV coat protein. The enhanced chemiluminescence horseradish peroxidase- linked, goat anti-rabbit IgG assay (Cappel Laboratories) was performed according to the manufacturer's (Amersham) specifications. The blotted membrane was subjected to film exposure times of up to 10 sec.
  • Ribgrass mosaic virus is a member ofthe tobamovirus group that infects crucifers.
  • a partial RMV cD ⁇ A containing the 30K subgenomic promoter, 30K ORF, coat subgenomic promoter, coat ORF, and 3 '-end was isolated by RT-PCR using by using oligonucleotides TVCV183X 5'-TAC TCG AGG TTC ATA AGA CCG CGG TAG GCG G-3' (upstream) (SEQ ID NO: 30) and TVCV Kpnl 5*-CGG GGT ACC TGG GCC CCT ACC CGG GGT TTA GGG AGG-3' (downstream) (SEQ ID NO: 31), and subcloned into the EcoRV site of KS+, creating plasmid KS+ TVCV #23 (FIGURE 14).
  • the RMV cDNA was characterized by restriction mapping and dideoxy nucleotide sequencing.
  • the partial nucleotide sequence is as follows: 5'- ctcgaggttcataagaccgcggtaggcggagcgtttgtttactgtagtataattaaatatttgtcagataaaggttgttt aaagatttgttttttgtttgactgagtcgataATGTCTTACGAGCCTAAAGTTAGTGACTTC CTTGCTCTTACGAAAAAGGAGGAAATTTTACCCAAGGCTTTGACGAGA TTAAAGACTGTCTCTATTAGTACTAAGGATGTTATATCTGTTAAGGAG TCTGAGTCCCTGTGATATTGATTTGTTAGTGAATGCCATTAGATA AGTATAGGTATGGGTGTTTTGGGTGTTGTTTTC ACCGGTGAATGGC TGGTACCGGATTTCGTTAAAGGTGGGGTAACAGTGAGCGTGATT
  • the 1543 base pair from the partial RMV cDNA was compared (PCGENE) to oilseed rape mosaic virus (ORMV).
  • ORMV oilseed rape mosaic virus
  • the nucleotide sequence identity was 97.8%.
  • the RMV 30K and coat ORF were compared to ORMV and the amino acid identity was 98.11% (30K) and 98.73% (coat), respectively.
  • RMV cDNA containing the 5 '-end and part ofthe replicase was isolated by RT-PCR from RMV RNA using by using oligonucleotides RGMV1 5'"GAT GGC GCC TTA ATA CGA CTC ACT ATA GTT TTA TTT TTG TTG CAA CAA CAA C-3' (upstream) (SEQ ID NO: 33) and RGR 132 5'-CTT GTG CCC TTC ATG ACG AGC TAT ATC ACG-3' (downstream) (SEQ ID NO: 34).
  • the RMV cDNA was characterized by dideoxy nucleotide sequencing.
  • the partial nucleotide sequence containing the T7 RNA polymerase promoter and part ofthe RMV cDNA is as follows: 5'- ccttaatacgactcactataGTTTTATTTTTGTTGCAACAACAACAACAAATTACAA TAACAACAAAACAAATACAAACAACAACAACAACATGGCACAATTTCAAC AAACAGTAAACATGCAAACATTGCAGGCTGCCGCAGGGCGCAACAGC CTGGTGAATGATTTAGCCTCACGACGTGTTTATGACAATGCTGTCGAG GAGCTAAATGCACGCTCGAGACGCCCTAAGGTTCATTACTCCAAATCA GTGTCTACGGAACAGACGCTGTTAGCTTCAAACGCTTATCCGGAGTTT GAGATTTCCTTTACTCATACCCAACATGCCGTACACTCCCTTGCGGGT GGCCTAAGGACTCTTGAGTTAGTATCTCATGATGCAAGTTCCGTTC GGTTCTCTGACGTACGACATCGGTGGTAACTTTGCAGCGCACCTTTTC AAAGGA
  • the uppercase letters are nucleotide sequences from RMV cDNA.
  • the lower case letters are nucleotide sequences from T7 RNA polymerase promoter.
  • the nucleotide sequences from the 5' and 3' oligonucleotides are underlined.
  • Full length infectious RMV cDNA clones were obtained by RT-PCR from
  • RMV RNA using by using oligonucleotides RGMV1 5'"GAT GGC GCC TTA ATA CGA CTC ACT ATA GTT TTA TTT TTG TTG CAA CAA CAA C- 3' (upstream) (SEQ ID NO: 36) and RG1 APE 5'-ATC GTT TAA ACT GGG CCC CTA CCC GGG GTT AGG GAG G-3' (downstream) (SEQ ID NO: 37).
  • the RMV cDNA was characterized by dideoxy nucleotide sequencing.
  • the partial nucleotide sequence containing the T7 RNA polymerase promoter and part of the RMV cDNA is as follows: 5'-
  • the uppercase letters are nucleotide sequences from RMV cDNA.
  • the nucleotide sequences from the 5' .and 3' oligonucleotides are underlined.
  • Full length infectious RMV cDNA clones were obtained by RT-PCR from RMV RNA using oligonucleotides
  • ABRC Biological Resource Center
  • the cDNA inserts in the CD4-13 were recovered by digestion with NotI. Digestion with NotI in most cases liberates the entire Arabidopsis thaliana cD ⁇ A insert because the original library was assembled with NotI adapters. NotI is an 8-base cutter that infrequently cleaves plant D ⁇ A.
  • the pBS735 transcription plasmid (FIGURE 15) was digested with PacVXhol and ligated to an adapter D ⁇ A sequence created from the oligonucleotides 5'-TCGAGCGGCCGCAT-3' (SEQ ID NO: 41) and 5'-
  • the resulting plasmid pBS740 (FIGURE 16) contains a unique NotI restriction site for bidirectional insertion of NotI fragments from the CD4-13 library. Recovered colonies were prepared from these for plasmid minipreps with a Qiagen BioRobot 9600®. The plasmid D ⁇ A preps performed on the BioRobot 9600® are done in 96-well format and yield transcription quality D ⁇ A.
  • An Arabidopsis cD ⁇ A library was transformed into the plasmid and analyzed by agarose gel electrophoresis to identify clones with inserts. Clones with inserts may be transcribed in vitro .and inoculated onto N benthamiana .and/or Arabidopsis thaliana. Selected leaf disks from transfected plants may be then taken for biochemical analysis.
  • EXAMPLE 14 Expression and targeting to the chloroplasts of a green fluorescent protein in Arabidopsis thaliana via a recombinant viral nucleic acid vector.
  • the gene encoding green fluorescent protein (GFP) was fused at the ⁇ - terminus to the chloroplast transit peptide (CTP) sequence of RuBPCase to create plasmid pBS723 (FIGURE 17).
  • Plasmid pBS723 was modified by PCR mutagenesis to create a unique Pad site upstream ofthe ATG start codon ofthe CTP-GFP gene fusion.
  • the PCR .amplification product obtained from plasmid pBS723 was digested PacllSall and cloned into plasmid GFP-30B/clone 60 (also digested with PacllSall) to create plasmid pBS731 (FIGURE 18).
  • Plasmid pBS731 was linearized at a unique Kpnl restriction site and transcribed into infectious R ⁇ A with T7 R ⁇ A polymerase according to standard procedures. Infectious RNA transcripts that were inoculated onto Nicotiana benthamiana plants showed systemic expression in the upper leaves of CTP-GFP within six days.
  • RNA transcripts from plasmid pBS731 were harvested by grinding the leaves with a mortar and pestle to obtain recombinant virions derived from pBS731 infectious RNA transcripts.
  • Virions from pBS731 were inoculated onto Arabidopsis thaliana leaves.
  • the inoculated leaves of Arabidopsis thaliana plants showed strong green fluorescence under UV light, thus indicating successful expression ofthe CTP-GFP reporter gene.
  • Inoculation of subject organisms such as plants may be effected by using means of high throughput robotics.
  • Arabidopsis thaliana were grown in microtiter plates such as the standard 96-well and 384- well micro titer plates.
  • a robotic handling arm then moved the plates containing the organism to a colony picker or other robot that may deliver inoculations to each plant in the well.
  • inoculation was performed in a very high speed and high throughput manner. It is preferable in the case of plants that the organism be a germinating seed at least in the development cycle to enable access to the cells to be transfected.
  • Equipment used for automated robotic production line could include, but not be limited to, robots of these types: electronic multichannel pipetmen, Qiagen BioRobot 9600®, Robbins Hydra liquid handler, Flexys Colony Picker, New Brunswick automated plate pourer, GeneMachines HiGro shaker incubator, New Brunswick floor shaker, three Qiagen BioRobots, MJ Research PCR machines (PTC-200, Tetrad), ABI 377 sequencer and Tecan Genesis RSP200 liquid handler.
  • robots of these types electronic multichannel pipetmen, Qiagen BioRobot 9600®, Robbins Hydra liquid handler, Flexys Colony Picker, New Brunswick automated plate pourer, GeneMachines HiGro shaker incubator, New Brunswick floor shaker, three Qiagen BioRobots, MJ Research PCR machines (PTC-200, Tetrad), ABI 377 sequencer and Tecan Genesis RSP200 liquid handler.
  • Genomic DNA library construction in a recombinant viral nucleic acid vector Genomic DNA represented in BAC (bacterial artificial chromosome) or
  • YAC (yeast artificial chromosome) libraries may be obtained from the Arabidopsis Biological Resource Center (ABRC).
  • the BAC/YAC DNA can be mechanically size- fractionated, ligated to adapters with cohesive ends, and shotgun-cloned into recombinant viral nucleic acid vectors.
  • mechanically size-fractionated genomic DNA can be blunt-end ligated into a recombinant viral nucleic acid vector.
  • Recovered colonies can be prepared for plasmid minipreps with a Qiagen BioRobot 9600®.
  • the plasmid DNA preps done on the BioRobot 9600® may be assembled in 96-well format and yield transcription quality DNA.
  • the recombinant viral nucleic acid; 'Arabidopsis genomic DNA library may be analyzed by agarose gel electrophoresis (template quality control step) to identify clones with inserts. Clones with inserts can then be transcribed in vitro and inoculated onto N benthamiana and/or Arabidopsis thaliana. Selected leaf disks from transfected plants can then be taken for biochemical analysis.
  • Genomic D ⁇ A from Arabidopsis typically contains a gene every 2.5 kb (kilobases) on average. Genomic D ⁇ A fragments of 0.5 to 2.5 kb obtained by random shearing of D ⁇ A were shotgun assembled in a recombinant viral nucleic acid expression/knockout vector library. Given a genome size of Arabidopsis of approximately 120,000 kb, a random recombinant viral nucleic acid genomic D ⁇ A library would need to contain minimally 48,000 independent inserts of 2.5 kb in size to achieve IX coverage of ' the Arabidopsis genome.
  • a random recombinant viral nucleic acid genomic D ⁇ A library would need to contain minimally 240,000 independent inserts of 0.5 kb in size to achieve IX coverage ofthe Arabidopsis genome.
  • Assembling recombinant viral nucleic acid expression/knockout vector libraries from genomic D ⁇ A rather than cD ⁇ A has the potential to overcome known difficulties encountered when attempting to clone rare, low-abundance mR ⁇ A's in a cD ⁇ A library.
  • a recombinant viral nucleic acid expression/knockout vector library made with genomic D ⁇ A would be especially useful as a gene silencing knockout library.
  • DHSPES expression/knockout vector library made with genomic D ⁇ A would be especially useful for expression of genes lacking introns.
  • other plant species with moderate to small genomes e.g. rose, approximately 80,000 kb
  • a recombinant viral nucleic acid expression/knockout vector library could be made from existing BAC/YAC genomic D ⁇ A or from newly-prepared genomic D ⁇ A for any plant species.
  • a recombinant viral nucleic acid expression/knockout vector library could be made with genomic DNA obtained from yeast, bacteria, or animals including humans.
  • Genomic DNA or cDNA library construction in a DHSPES vector and fransfection of individual clones from said vector library onto T-DNA tagged or transposon tagged or mutated plants.
  • Genomic DNA or cDNA library construction in a recombinant viral nucleic acid vector, and fransfection of individual clones from the vector library onto T-DNA tagged or transposon tagged or mutated plants may be performed according the procedure set forth in Example 16.
  • Such a protocol may be easily designed to complement mutations introduced by random insertional mutagenesis of T-DNA sequences or transposon sequences.
  • Clone B is one ocytotoxic T lymphocyte (CTL) cell clone shown to recognize an epitope present in both the P. yoelii and P. berghei CS proteins. Clone B recognizes the following amino acid sequence; SYVPSAEQILEFVKQISSQ (SEQ ID NO: 43) and when adoptively fransfened to mice protects against infection from both species of malaria sporozoites. Construction of a genetically modified tobamovirus designed to carry this malarial CTL epitope fused to the surface of virus particles is set forth herein.
  • PBGC215 was formed by cloning the 0.15 kb Accl-Nsil fragment of pBGC214 into pBGC235. The 0.9 kb Ncol-Kp ⁇ l fragment from pBGC215 was cloned in pBGC152 to form pBGC216.
  • a 0.07 kb synthetic fragment was formed by annealing PYCS.2p with PYCS.2m and the resulting double stranded fragment, encoding the P. yoelii CTL malarial epitope, was cloned into the -4vrII site of pBGC215 made blunt ended by treatment with mung bean nuclease and creating a unique -4 ⁇ tII site, to form pBGC262.
  • a 0.03 kb synthetic -4 ⁇ tII fragment was formed by annealing TLS.1EXP with TLS.1EXM and the resulting double stranded fragment, encoding the leaky-stop sequence and a stuffer sequence used to facilitate cloning, was cloned into Aaill digested ⁇ BGC262 to form pBGC263.
  • PBGC262 was digested with-4 ⁇ tll and ligated to itself removing the 0.02 kb stuffer fragment to form pBGC264.
  • the 1.0 kb Ncol-Kpnl fragment of pBGC264 was cloned into pSNC004 to form pBGC289.
  • the virus TMV289 produced by transcription of plasmid pBGC289 in vitro contains a leaky stop signal resulting in the removal of four amino acids from the C terminus ofthe wild type TMV coat protein gene and is therefore predicted to synthesize a truncated coat protein and coat protein with a CTL epitope fused at the C terminus at a ratio of 20: 1.
  • the recombinant TMVCP/CTL epitope fusion present in TMV289 is with the stop codon decoded as the amino acid Y (amino acid residue 156).
  • the amino acid sequence ofthe coat protein of virus TMV216 produced by transcription ofthe plasmid pBGC216 in vitro is truncated by four amino acids.
  • the epitope SYVPSAEQILEFVKQISSQ (SEQ ID NO: 43) is calculated to be present at approximately 0.5% ofthe weight ofthe virion using the same assumptions confirmed by quantitative ELISA analysis.
  • Sample ID No. TMV289.1 lBla Fifteen tobacco plants were grown for 33 days post inoculation accumulating 595 g fresh weight of harvested leaf biomass not including the two lower inoculated leaves. Purified Sample ID No. TMV289.11B2 was recovered (383 mg) at a yield of 0.6 mg virion per gram of fresh weight. Therefore, 3 g of 19-mer peptide was obtained per gram of fresh weight extracted. Tobacco plants infected with TMV289 accumulated greater than 1.4 micromoles of peptide per kilogram of leaf tissue.
  • TMV289 Partial confirmation ofthe sequence ofthe epitope coding region of TMV289 was obtained by restriction digestion analysis of PCR amplified cDNA using viral RNA isolated from Sample ID No. TMV289.11B2. The presence of proteins in TMV289 with the predicted mobility ofthe cp fusion at 20 kD and the truncated cp at 17.1 kD was confirmed by denaturing polyacrylamide gel electrophoresis.
  • EXAMPLE 19 Identification of nucleotide sequences involved in the regulation of plant growth by cvtoplasmic inhibition of gene expression using viral derived RNA.
  • Antisense RNA has been used to down regulate gene expression in transgenic and transfected plants. The effectiveness of antisense on the inhibition of eukaryotic gene expression was first demonstrated by Izant et al. (Cell 36(4): 1007-1015 (1984)). Since then, the down-regulation of numerous genes from transgenic plants has been reported. In addition, there is evidence that "co- suppression" of genes occurs in transgenic plants containing sense RNA by readthrough transcription from distal promoters located on the opposite strand of the DNA (Van der Krol et al, Plant Cell 2(4):291-299 (1990) and Napoli et al, Plant Cell 2:279-289 (1990)).
  • RNA viral vectors which replicate solely in the cytoplasm, can be used to identify genes involved in the regulation of plant growth by inhibiting the expression of specific endogenous genes. This example will enable one to characterize specific genes and biochemical pathways in transfected plants using an RNA viral vector.
  • Tobamoviral vectors have been developed for the heterologous expression of uncharacterized nucleotide sequences in transfected plants.
  • a partial Arabidopsis thaliana cDNA library was placed under the transcriptional control of a tobamovirus subgenomic promoter in a RNA viral vector. Colonies from transformed E. coli were automatically picked using a Flexys robot and transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid DNAs were isolated from overnight cultures using a BioRobot and infectious RNAs from 430 independent clones were directly applied to plants.
  • cDNA encoding Nicotiana benthamiana ADP-ribosylation factor Partial cDNAs from Nicotiana benthamiana leaf RNA may be isolated by polymerase chain reaction (PCR) using the following oligonucleotides: ATARFM1X, 5'-GCC TCG AGT GCA GCA TGG GGT TGT CAT TCG GAA AGT TGT TC-3' (upstream) (SEQ ID NO: 45) and ATARFA181 A, 5'-TAC CTA GGC CTT GCT TGC GAT GTT GTT GGA GAG-3' (downstream) (SEQ ID NO: 46).
  • PCR polymerase chain reaction
  • a full-length cDNA encoding ARF may be isolated by screening a cDNA library by colony hybridization using a 32 P labeled Arabidopsis thaliana ARF PCR product. Hybridization can be carried out at 42°C for 48h in 50% formamide, 5X SSC, 0.02M phosphate buffer, 5X Denhart's solution, and 0.1 mg/ml sheared calf thymus DNA. Filters may be washed at 65°C in 0.1X SSC and 0.1% SDS, prior to autoradiography. PCR products and the ARF cDNA clones may be verified by dideoxynucleotide sequencing.
  • EXAMPLE 20 Identification of nucleotide sequences involved in the regulation of plant development by cvtoplasmic inhibition of gene expression using viral derived RNA.
  • a partial Arabidopsis thaliana cD ⁇ A library was placed under the transcriptional control of a tobamovirus subgenomic promoter in a R ⁇ A viral vector. Colonies from transformed E. coli were automatically picked using a Flexys robot and transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid D ⁇ As were isolated from overnight cultures using a BioRobot and infectious R ⁇ As from 430 independent clones were directly applied to plants. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, mo ⁇ hology, and color.
  • An Arabidopsis thaliana CD4-13 cDNA library was digested with NotI. D ⁇ A fragments between 500 and 1000 bp were isolated by trough elution and subcloned into the NotI site of pBS740. E. coli C600 competent cells were transformed with the pBS740 AT library and colomes containing Arabidopsis cD ⁇ A sequences were selected on LB Amp 50 ug/ml. Recombinant C600 cells were automatically picked using a Flexys robot and then transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid D ⁇ As were isolated from overnight cultures using a BioRobot (Qiagen) and infectious R ⁇ As from 430 independent clones were directly applied to plants.
  • D ⁇ A sequencing and computer analysis A 750 bp NotI fragment of 740 AT #88 containing the G-protein coupled receptor cD ⁇ A was characterized. D ⁇ A sequence of NotI fragment of 740 AT #88 (750 bp) is as follows: 5'-
  • nucleotide sequence analysis and amino acid sequence comparisons were performed using DNA Strider, PCGENE and NCBI Blast programs.
  • the nucleotide sequence from 740 AT #88 was compared to Brassica rapa cDNA L33574 (FIGURE 22), the octopus rhodopsin mRNA X07797 (FIGURE 23).
  • the amino acid sequence derived from 740 AT #88 was compared to an Arabidopsis EST ORF ATTS2938 (FIGURE 24) and octopus rhodopsin P31356 (FIGURE 25).
  • EXAMPLE 21 Identification of nucleotide sequences involved in the regulation of plant growth bv cvtoplasmic inhibition of gene expression using viral derived RNA.
  • Antisense RNA has been used to down regulate gene expression in transgenic and transfected plants.
  • the pu ⁇ ose of this example is again to demonstrate that novel positive strand viral vectors, which replicate solely in the cytoplasm, can be used to identify genes involved in the regulation of plant growth by inhibiting the expression of specific endogenous genes. This example will enable one to characterize specific genes and biochemical pathways in transfected plants using an RNA viral vector.
  • Tobamoviral vectors have been developed for the heterologous expression of uncharacterized nucleotide sequences in transfected plants.
  • a partial Arabidopsis thaliana cDNA library was placed under the transcriptional control of a tobamovirus subgenomic promoter in a RNA viral vector.
  • Colonies from transformed E. coli were automatically picked using a Flexys robot and transfered to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml.
  • TB terrific broth
  • Approximately 2000 plasmid DNAs were isolated from overnight cultures using a BioRobot and infectious RNAs from 430 independent clones were directly applied to plants.
  • An Arabidopsis thaliana CD4-13 cDNA library was digested with NotI. D ⁇ A fragments between 500 and 1000 bp were isolated by trough elution and subcloned into the NotI site of pBS740. E. coli C600 competent cells were transformed with the pBS740 AT library and colonies containing Arabidopsis cD ⁇ A sequences were selected on LB Amp 50 ug/ml. Recombinant C600 cells were automatically picked using a Flexys robot and then transfered to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid D ⁇ As were isolated from overnight cultures using a BioRobot (Qiagen) and infectious R ⁇ As from 430 independent clones were directly applied to plants.
  • DNA sequence of NotI fragment of 740 AT #2441 (350 bp) is as follows: 5'- CTTCACTTTCGCCGATGGCTCTACCTAACCAGCAAACCGTGGATTACC CTAGCTTCAAGCTCGTTATCGTTGGCGATGGAGGCACAGGGAAGACC ACATTTGTAAAGAGACATCTTACTGGAGAGTTTGAGAAGAAGTATGA ACCCACTATTGGTGTTGAGGTTCATCCTCTTGATTTCTTCACTAACTGT GGCAAGATCCGTTTCTACTGTTGGGATACTGCTGGCCAAGAGAAATTT GGTGGTCTTAGGGATGGTTACTACATCCATGGACAATGTGCTATCATC ATGTTTGATGTCACAAGCACGACTGACATACAAGAATGTTCC AACATG GCACCGTGATCTTTG-3' (SEQ ID NO. 48)
  • the nucleotide sequencing of 740 AT #2441 was carried out by dideoxy termination using double stranded templates (Sanger et al, Proc. Natl. Acad. Sci. USA 24(12):5463-5467 (1977)). Nucleotide sequence analysis and amino acid sequence comparisons were performed using DNA Strider, PCGENE and NCBI Blast programs.
  • the nucleotide sequence from 740 AT #2441 was compared to tobacco RAN-Bl GTP binding protein (FIGURE 26).
  • the nucleotide sequence from 740 AT #2441 was compared to human RAN GTP-binding protein (FIGURE 27).
  • EXAMPLE 22 Gene silencing/co-supression of genes induced by delivering an RNA capable of base pairing with itself to form double stranded regions.
  • RNA silencing has been used to down regulate gene expression in transgenic plants. Recent experimental evidence suggests that double stranded RNA may be an effective stimulator of gene silencing/co-suppression phenomenon in transgenic plant. For example, Waterhouse et al. (Proc. Natl. Acad. Sci. USA 95:13959-13964 (1998), inco ⁇ orated herein by reference) described that virus resistance and gene silencing in plants could be induced by simultaneous expression of sense and antisense RNA. Gene silencing/co- suppression of plant genes may be induced by delivering an RNA capable of base pairing with itself to form double stranded regions. This example shows: (1) a novel method for generating an RNA virus vector capable of producing an RNA capable of forming double stranded regions, and (2) a process to silence plant genes by using such a viral vector.
  • Step 1 Construction of a DNA sequence which after it is transcribed would generate an RNA molecule capable of base pairing with itself.
  • Two identical, or nearly identical, ds DNA sequences can be ligated together in an inverted orientation to each other (i.e., in either a head to tail or tail to head orientation) with or without a linking nucleotide sequence between the homologous sequences.
  • the resulting DNA sequence can then be cloned into a cDNA copy of a plant viral vector genome.
  • Step 2 Cloning, screening, transcription of clones of interest using known methods in the art.
  • Step 3 Infect plant cells with transcripts from clones.
  • RNA from foreign gene will base pair upon itself, forming double-stranded RNA regions.
  • This approach could be used with any plant or non-plant gene and used to silence plant gene homologous to assist in identification ofthe function of a particular gene sequence.
  • a full- length cDNA copy ofthe Eastern Equine Encephalomyelitis Virus (EEEV) genome is prepared and inserted into the Pstl site of pUC18 as described by Chang et al, J. Gen. Virol. 68:2129 (1987).
  • the sequence for the viral coat protein and its adjacent El and E2 glycoprotein fransmissibility factors are located on the region conesponding to the 26S RNA region.
  • the vector containing the cDNA copy ofthe EEEV genome is digested with the appropriate restriction enzymes and exonucleases to delete the coding sequence ofthe coat protein and the El and E2 proteins (structural protein coding sequence).
  • the structural protein coding sequence is removed by partial digestion with Mbol, followed by religation to remove a vital portion ofthe structural gene.
  • the vector is cut at the 3 '-end ofthe viral structural gene.
  • the viral DNA is sequentially removed by digestion with Bal31 or Micrococcal SI nuclease up through the start codon ofthe structural protein sequence.
  • the DNA sequence containing the sequence ofthe viral 3 '-tail is then ligated to the remaining 5 '-end.
  • the deletion ofthe coding sequence for the structural proteins is confirmed by isolating EEEV RNA and using it to infect an equine cell culture.
  • the isolated EEEV RNA is found to be non-infective under natural conditions.
  • the coat protein coding sequence is removed by partial digestion with Mbol followed by religation to reattach the 3 '-tail ofthe virus. This will remove a vital portion ofthe coat protein gene.
  • a second alternative method for removing only the coat protein sequence is to cut the vector at the 3 '-end ofthe viral coat protein gene.
  • the viral DNA is removed by digestion with Bal31 or Micrococcal S 1 nuclease up through the start codon ofthe coat protein sequence.
  • the synthetic DNA sequence containing the sequence ofthe 3 '-tail is then ligated to the remaining 5 '-end.
  • the deletion ofthe coding sequence for the coat protein is confirmed by isolating EEEV RNA and using it to infect an equine cell culture.
  • the isolated EEEV RNA is found to be non-infective under natural conditions.
  • Sindbis viruses Methods for genetic manipulation of Sindbis viruses are described in Garoff et al, Curr. Opin. Biotechnol. 9(5):464-9 (1998); Agapov et al, Proc. Natl. Acad. Sci. USA 95(22): 12989-94 (1998); Frolov et al, J. Virol. Apr;71(4):2819-29 (1997), all of which are inco ⁇ orated herein by reference.
  • a full-length cDNA copy ofthe Sindbis virus genome is prepared and inserted into the Sm ⁇ l site of a plasmid derived from pBR322 as described by Lindquist et al, Virology 151:10 (1986).
  • the sequence for the viral coat protein and the adjacent El and E2 glycoprotein fransmissibility factors are located on the region conesponding to the 26S RNA region.
  • the vector containing the cDNA copy of the Sindbis virus genome is digested with the appropriate restriction enzymes and exonucleases to delete the coding sequence for the structural proteins.
  • the structural protein coding sequence is removed by partial digestion with Bi ⁇ l, followed by religation to remove a vital portion ofthe structural gene.
  • the vector is cut at the 3'-end ofthe viral nucleic acid.
  • the viral DNA is removed by digestion with Ba 1 or Micrococcal S 1 nuclease up through the start codon ofthe structural protein sequence.
  • the synthetic DNA sequence containing the sequence ofthe viral 3 '-tail is then ligated to the remaining 5 '-end.
  • the deletion ofthe coding sequence for the structural proteins is confirmed by isolating Sindbis RNA and using it to infect an avian cell culture. The isolated Sindbis RNA is found to be non-infective under natural conditions.
  • the coding sequence for the coat protein is deleted and the sequence for the El and E2 glycoproteins remain in the vector containing the cDNA copy ofthe Sindbis genome.
  • the coat protein coding sequence is removed by partial digestion with Aflll followed by religation to reattach the 3 '-tail ofthe virus.
  • a second alternative method for removing only the coat protein sequence is to cut the vector at the 3 '-end ofthe viral nucleic acid.
  • the viral DNA is removed by digestion with BaB 1 or Micrococcal S 1 nuclease up through the start codon ofthe coat protein sequence (the same start codon as for the sequence for all the structural proteins).
  • the synthetic DNA sequence containing the sequence ofthe 3 '-tail is then ligated to the remaining 5 '-end.
  • the deletion ofthe coding sequence for the coat protein is confirmed by isolating Sindbis RNA and using it to infect an avian cell culture.
  • the isolated Sindbis RNA is found to be non-infective under natural conditions.
  • EXAMPLE 25 Preparation of a Non-Transmissible Western Equine Encephalomyelitis Virus Nucleotide Sequence.
  • the vector containing the cDNA copy ofthe WEEV genome is digested with the appropriate restriction enzymes and exonucleases to delete the coding sequence ofthe coat protein and the El and E2 proteins (structural protein coding sequence).
  • structural protein coding sequence is removed by partial digestion with N ⁇ cI, followed by religation to remove a vital portion ofthe structural protein sequence.
  • the vector is cut at the 3 '-end ofthe structural protein D ⁇ A sequence.
  • the viral D ⁇ A is removed by digestion with -5 ⁇ /31 or Micrococcal SI nuclease up through the start codon ofthe structural protein sequence.
  • the D ⁇ A sequence ofthe viral 3 '-tail is then ligated to the remaining 5 '-end.
  • the deletion ofthe coding sequence for the structural proteins is confirmed by isolating WEEV R ⁇ A and using it to infect a Vero cell culture.
  • the isolated WEEV R ⁇ A is found to be non-infective under natural conditions.
  • only the coding sequence for the coat protein is deleted and the sequence for the El and E2 glycoproteins remain in the vector containing the cD ⁇ A copy ofthe WEEV genome.
  • the coat protein coding sequence is removed by partial digestion with HgiAI followed by religation to reattach the 3 '-tail of the vims.
  • a second alternative method for removing only the coat protein sequence is to cut the vector at the 3'-end ofthe viral coat protein sequence.
  • D ⁇ A is removed by digestion with -5 ⁇ /31 or Micrococcal SI nuclease up through the a vital portion ofthe coat protein sequence.
  • the D ⁇ A sequence containing the sequence ofthe 3'-tail is then ligated to the remaining 5'-end.
  • the deletion ofthe coding sequence for the coat protein is confirmed by isolating WEEV RNA and using it to infect a Vero cell culture.
  • the isolated WEEV RNA is found to be non-infective, i.e., biologically contained, under natural conditions.
  • the VPI coat protein coding sequence is removed by partial digestion with -5 ⁇ mHI nuclease, and then treated with EcoRI, filled in with Klenow enzyme and recircularized.
  • the deletion ofthe coding sequence for the coat protein VPI is confirmed by isolating SV40 RNA and using it to infect simian cell cultures.
  • the isolated SV40 RNA is found to be non-infective, i.e., biologically contained, under natural conditions.
  • Novel requirements for production of infectious viral vector in vitro derived RNA transcripts Novel requirements for production of infectious viral vector in vitro derived RNA transcripts.
  • This example demonstrates the production of highly infectious viral vector transcripts containing 5' nucleotides with reference to the vims vector. Construction of a library of subgenomic cDNA clones of TMV and BMV has been described in Dawson et al, Proc. Natl. Acad. Sci. USA 83:1832-1836
  • Nucleotides were added between the transcriptional start site ofthe promoter for in vitro transcription, in this case T7, .and the start ofthe cDNA of TMV in order to maximize transcription product yield and possibly obviate the need to cap vims transcripts to insure infectivity.
  • the relevant sequence is the T7 promoter ...TATAG ⁇ TATTTT.... where the ⁇ indicates the base preceding is the start site for transcription and the bold letter is the first base ofthe TMV cDNA.
  • Three approaches were taken: 1) addition of G, GG or GGG between the start site of transcription and the TMV cDNA ( ... TATAGGTATTT...
  • the GTN would allow the mimicking of two potential sites for initiation, the added and the native sequence, and facilitate more ready mis-initiation of transcription in vivo to restore the native TMV cDNA sequence.
  • Approaches included cloning GFP expressing TMV vector sequences into vectors containing extra G, GG or GGG bases using standard molecular biology techniques.
  • full length PCR of TMV expression clone 1056 was done to add N2, N3 and GTN bases between the T7 promoter and the TMV cDNA. Subsequently, these PCR products were cloned into pUC based vectors.
  • Capped and uncapped transcripts were made in vitro .and inoculated to tobacco protoplasts or Nicotiana benthamiana plants, wild type and 30k expressing transgenics.
  • the results are that an extra G, ... TATAGGTATTTT..., or a GTC, ... TATAGTCGTATTTT..., were found to be well tolerated as additional 5' nucleotides on the 5' of TMV vector RNA transcripts and were quite infectious on both plant types and protoplasts as capped or non-capped franscripts. Other sequences may be screened to find other options.
  • infectious transcripts may be derived with extra 5' nucleotides.
  • the replicated progeny may exhibit the results of reversion events that would yield the wild type vims 5' vims sequence, but may include portions or entire sets of introduced additional base sequences.
  • This strategy can be applied to a range of RNA vimses or RNA viral vectors of various genetic anangements derived from wild type vims genome. This would require the use of sequences particular to that ofthe vims used as a vector.
  • TMV-based vims expression vectors were initially used in these studies pBTI 1056 which contains the T7 promoter followed directly by the vims cDNA sequence f... TAT AGT ATT).
  • pBTI SBS60-29 which contains the T7 promoter (underlined) followed by an extra guanine residue then the vims cDNA sequence (...TATAGGTATT).
  • Both expression vectors express the cycle 3 shuffled green fluorescent protein (GFPc3) in localized infection sites and systemically infected tissue of infected plants. Transcriptions of each plasmid were carried out in the absence of cap analogue (uncapped) or in the presence of 8-fold greater concentration of RNA cap analogue than rGTP (capped).
  • Nicotiana tabacum protoplasts were infected with either capped or uncapped transcriptions (as described above) of pBTI SBS60 which contains the T7 promoter followed directly by the vims cDNA sequence ( " TATAGTATT). This expression vector also expresses the GFPc3 gene in infected cells and tissues. Nicotiana tabacum protoplasts were transfected with 1 mcl of each transcriptions. Approximately 36 hours post infection fransfected protoplasts were viewed under UV illumination and cells showing GFPc3 expression. Approximately 80% cells transfected with the capped PBTI SBS60 transcripts showed GFP expression while 5% of cells transfected with uncapped franscripts showed GFP expression.
  • the expression ofthe 30K movement protein of TMV in transgenic plants also has the unexpected effect of equalizing the relative specific infectivity of uncapped verses capped franscripts.
  • the mechanism behind this effect is not fully understood, but could arise from the RNA binding activity ofthe movement protein stabilizing the uncapped transcript in infected cells from prereplication cytosolic degradation.
  • TMV-based vims expression vector pBTI 1056 GTN#28 which contains the T7 promoter (underlined) followed GTC bases (bold) then the vims cDNA sequence (...TATAGTCGTATT).
  • This expression vector expresses the cycle 3 shuffled green fluorescent protein (GFPc3) in localized infection sites and systemically infected tissue of infected plants.
  • GFPc3 cycle 3 shuffled green fluorescent protein
  • This vector was transcribed in vitro in the presence (capped) and absence (uncapped) of cap analogue. Transcriptions were mixed with abrasive and inoculated on expanded older leaves of a wild type Nicotiana benthamiana (Nb) plant and a Nb plant expressing a TMV Ul 30k movement protein transgene (Nb 30K).
  • Elicitor recognition and the response cascades occurring in plants form an essential link between the environmental stress and plant survival responses.
  • Many products are induced following induction by environmental stimuli or pathogen infection, which include, but are not limited to, proteases, protease inhibitors, alkaloids and other metabolites. Glazebrook, et al, Annu. Rev. Gen. 31:547-569 (1997); Grahm, et al, J. Biol. Chem. 260:6555-6560 (1985); and Ryan, et al, Ann. Rev. Cell Dev. Biol. 14:1-17 (1998), all inco ⁇ orated herein by reference. The components ofthe recognition and response pathways are poorly understood, yet have tremendous practical value for input traits in genetically improved crops.
  • Vims expression vectors capable of either overexpressing gene products or suppressing the expression of particular endogenous host genes provide a unique tool to discover the nature of the genes whose products contribute to the response pathways.
  • This example describes methods for inhibiting endogenous plant proteases which interfere with the expression and purification of recombinant proteins in plants.
  • this example shows methods for inhibiting proteolytic activity in planta which is responsible for the degradation of a viral vector- expressed recombinant protein.
  • These methods are also applicable to the protection of recombinant proteins expressed via a stable transformation system or endogenous plant proteins.
  • Viral vectors have been constructed to include an N-terminal signal peptide sequence. This sequence directs the recombinant protein through the secretory pathway to the cell surface and ultimately accumulating in the plant intercellular fluid (IF) (Kermode, Critical Reviews in Plant Sciences 5(4):285-423 (1996), inco ⁇ orated herein by reference).
  • IF plant intercellular fluid
  • the target protein was cleaved abenantly in vivo.
  • Three examples include a mammalian growth hormone and single chain antibody and an avian interferon.
  • In vivo residence time in the IF led to the accumulation ofthe cleavage product(s) as detected by immunoblotting. Cleavage was either complete in vivo or continued in vitro following IF extraction (Co-pending U.S. Patent Application Serial No.
  • transgenic plants and virally- expressed proteins Another possible approach is to combine transgenic plants and virally- expressed proteins.
  • JA jasmonic acid
  • JA is produced as part of a general plant defense mechanism and is known to induce specific proteinase inhibitors (Lightner et al, JMol Gen Genet. 241:595-601 (1993), inco ⁇ orated herein by reference).
  • Exogenous application of JA as been used to induce a plant defense response in Nicotiana attenuata to against herbivore attack (Baldwin, PNAS, 95(14):8113-8118 (1998), inco ⁇ orated herein by reference).
  • JA jasmonic acid
  • JA is produced as part of a general plant defense mechanism and is known to induce specific proteinase inhibitors (Lightner et al, JMol Gen Genet. 241:595-601 (1993), inco ⁇ orated herein by reference).
  • Exogenous application of JA as been used to induce a plant defense response in Nicotiana attenuata to against herbivore attack (Baldwin, PNAS,
  • the desired phenotype in host plants used for gene discovery program using vims expression vectors is reduced proteolytic activities in the cytosol, secretory pathway or apoplast so to increase the half-life of virally produced proteins. This will allow virally expressed proteins to exert their influence on plant biochemistry, development and growth optimally. Rapid or premature degradation may reduce the amount ofthe expressed protein below the necessary threshold to exert a measurable effect.
  • Transgenic expression of protease inhibitors such as those induced by the systemin pathway (Ryan, et al, Ann. Rev. Cell Dev. Biol. 14:1-17 (1998)), will provide a continuous source of inhibitor to slow particular degradation processes.
  • EXAMPLE 30 Selection of optimized RNA and protein activities by use of vims vectors to express libraries of sequence variants generated by means of in vitro mutagenenisis and/or recombination.
  • DNA shuffling is a process for recursive mutation and in vitro recombination, performed by random fractionation .and re-assembly of a gene of interest to generate a pool of related, yet not identical, gene sequences.
  • Fractionation occurs through the treatment of DNA sequences with limiting amounts of nuclease and re-assembly typically requires two steps, first primerless PCR to re-align fragments based on local homology and then primer driven PCR to recover full length assembled fragments.
  • DNA shuffling has been successfully applied to prokaryotic or cell-based systems to select sequences of desired protein activities.
  • the ability to introduce shuffled sequences throughout an organism in a rapid and high throughput manner necessary to harness the full potential of this technology has not been demonstrated.
  • the properties confened by the selected shuffled sequences were demonstrated to be inherited by progeny vimses.
  • Two aspects that must be continually improved in vims expression vectors are: 1) their ability to move in a facile manner both locally and systemically in plants, and 2) the need for greater levels of foreign gene expression.
  • Both of these functions can potentially be affected by modifications to the 30 kDa ORF.
  • Functions within the 30 kDa coding region include the movement protein (MP), the vims origin of virion assembly and the subgenomic promoter used for coat protein synthesis. This is the promoter used for expression of foreign gene sequences in most tobamovirus vectors. It has been demonstrated that natural variation in viral populations can be the substrates for selection of improved characters in viral vectors can lead to dramatic improvements in their performance. This work further showed that single or multiple amino acid substitutions in the 30 kDa ORF can significantly effect the movement properties of vims vectors.
  • Vimses function genomically, as an integrated whole of RNA and protein sequences, suggesting that either individual elements, such as the 30 kDa ORF, or the entire plant vims genomes could be subjected to shuffling so to improve plant vims vector performance.
  • Obvious following the application of shuffling in this context is the use of plant vims vectors to house shuffled foreign gene populations which, following inoculation onto plants, gene products with optimized activities can be selected.
  • Plant vims vectors are the ultimate tool for shuttling genes into plants for selection of optimized activities. No other tool, transient or stable expression methods, can match the ability of plant vims vectors to develop optimized genes for plant activities.
  • p30B GFP vector is the TMV Ul infectious cDNA (bases 1-5756) containing the 5' NTR, replicase genes (126 and 183 kDa proteins), movement protein gene with associated subgenomic promoter and an RNA leader derived from the Ul coat protein gene. Following the RNA leader is a unique Pad site and the green fluorescent protein (GFP) gene. Following a unique Xhol site, the clone continues with a portion ofthe TMV Ul 3' NTR followed by a subgenomic promoter, coat protein gene and 3' NTR from TMV U5 strain.
  • GFP green fluorescent protein
  • the first stage ofthe project required the construction of a vector into which shuffled DNA fragments could be reintroduced.
  • the polymerase chain reaction (PCR) was used to amplify a DNA fragment from the TMV vector p30B comprising the T7 promoter, 5' non-translated region (NTR), and the reading frames for the 126 and 183 kDa replicase proteins.
  • the 5' primer covered the T7 promoter and initial bases ofthe TMV genome while the second primer modified the context surrounding the start codon for the 30 kDa MP of TMV. This allowed DNA fragments to be ligated into the modified vector, designated 3 OB GFP d30K, as vrll, Pa restriction endonuclease digested fragments.
  • This modification allowed the ready insertion of modified 30 kDa gene fragments into a vims vector and have them expressed in plant cells, tissues or systemically.
  • the wild type GFP ORF is the reporter gene since the visual level of fluorescence as observed under long wave UV light conelates directly with levels of GFP protein present in plant tissues. This has been demonstrated by looking at different vims vectors expressing GFP, each having different strength subgenomic promoters, that were infected in plants and GFP levels determined by UV fluorescence and Western blotting using anti-GFP antibodies.
  • the procedure for shuffling ofthe 30 kDa gene is similar to that described by Crameri et al, Nature Biotech. 15:436 (1997), and contained the following steps.
  • the 30 kDa gene fragment also containing the coat protein RNA leader was amplified from tobamoviras expression vectors using primers: TMVUl 3 OK 5'A (5'-GGCCCTAGGATGGCTCTAGTTGTTAAAGG-3') (SEQ ID NO: 49) and 3-5' Pac primer (5'-GTTCTTCTCCTTTGCTAGCCATTTAATTAATGAC- 3') (SEQ ID NO: 50).
  • the PCR DNA product was gel isolated and then incompletely digested with DNasel.
  • DNA fragments of 500 bp or smaller were isolated by using DEAE blotting paper technique and then eluted. Purified DNA fragments were mixed together with taq DNA polymerase and allowed to "reassemble” for 40 cycles. "Reassembly” reaction was assayed by gel electrophoresis for DNA bands of approximately 800-850 bp. Approximately 1 mcl ofthe "reassembly” reaction was then subjected to PCR using primers TMV Ul 30K 5'A and 3-5' Pac that hybridize to terminal DNA ends of reassembled fragments.
  • the reassembled fragments will be gel isolated and digested with restriction enzymes Avrll and Pad (sites present in the terminal primers) to allow for facile cloning back into the p30B d30k ANP digested with -4vrII and Pad.
  • Ligations of shuffled genes into p30B d30k ANP resulted in pooled libraries of sequences containing 100 to 50,000 members in five separate experiments. Pooled vims vectors with libraries of variant 30 kDa coding regions were transcribed with T7 RNA polymerase and then inoculated by standard PEG fransfection into 0.5 x 10 6 Nicotiana tabacum protoplasts per sample.
  • the protoplast extracts were then inoculated on leaves of wild type and transgenic Nicotiana tabacum c.v. MD609 expressing the TMV Ul 30 kDa movement protein.
  • Three to five days post inoculation localized infection sites were observed expressing GFP.
  • a variety of intensities of GFP fluorescence were observed varying from that observed with the wild type GFP gene to much duller to very bright, as observed from the viral expression ofthe shuffled GFP gene of Crameri et al, Nature Biotech. (1996) (GFPc3).
  • the occurrence of vimses expressing enhanced GFP fluorescence varied between libraries tested from 1/200 to 1/50 infection foci depending on libraries tested.
  • the protoplast extracts isolated from transfections with vims libraries were inoculated on one half of wild type Nicotiana tabacum c.v. MD609 and Nicotiana benthamiana leaves. To the other leaf half, vims derived from p30B GFP was inoculated. Some infection sites resulting from infection of vimses containing shuffled 30 kDa ORFs grew more rapidly than those ofthe average from p30B GFP. These events occurred at a frequency of 1/100 to 1/500 infection foci depending on the vims library analyzed. These more rapidly growing infection foci were excised and inoculated on young Nicotiana tabacum c.v. MD609 plants.
  • p30B GFP was inoculated on similar sized and aged plants.
  • the p30B GFP vector does not move systemically on tobacco plants.
  • some shuffled 30 kDa ORF variant vectors that were identified as rapidly growing local infection sites, were able to move systemically on tobacco plants. The movement was primarily on phloem source tissue and were localized to veins and circular spots in green lamina. This movement ability was reproducible in multiple inoculations of these individual vims variants.
  • Sequence analysis ofthe vimses containing shuffled 30 kDa ORFs capable of systemic movement on Nicotiana tabacum plants demonstrated that localized amino acid substitutions were present and responsible for altered movement phenotype.
  • the phenotypes that can be assayed are protein activity in planta, as with the movement activities ofthe 30 kDa protein, enzyme activities in planta or in plant extracts or other surrogate features such as substrate or product accumulation.
  • These data demonstrate the power of vims expression vectors to be effective tools for shuttling sequence variants into plants and allow the selection of genes encoding the desired altered property. This tool allows one to mine the hidden activities, enhance the isolated activities of enzymes or eliminate allosteric inhibition of enzyme activities. This could be applied to any plant gene or genes from other sources to optimize the activities desired for agronomic, pharmaceutical or developmental effects caused by altered genes.
  • EXAMPLE 31 Composite clomng to facilitate cloning of libraries in vims vectors and/or their introduction into host cells for expression of sequences.
  • Vims vector clones could be integrated into lambda phage or cosmid clones to facilitate library constmction, clone representation, elimination of cell based amplification by direct transcription and archiving of individual clones.
  • cis-acting elements allowing for expression in plant cells or integration into plant DNA could be included into such plasmids to facilitate inoculation of DNA for direct expression, obviating the need for transcription of vector cDNA, or construction of dedicated plant transformation vectors.
  • Vims vectors are tools housing libraries of sequences that can be screened for novel gene discovery. However libraries are often first constructed in plasmid or phage shuttle vectors before excising and introduction into vims vectors. Likewise, sequences can be screened in hosts using vims vectors, but must be subcloned into appropriate eukaryotic expression vectors before the trait identified in the vector transfected host will become a stable trait in the host by gene integration. Additional hurdles to overcome are: (1) constmction of libraries to most efficiently represent the clones in a cDNA library, (2) obtaining maximal fransfection efficiency into bacterial hosts (if used), and (3) archiving DNA samples without the need for fransfection into bacteria and transcription of ligated DNA.
  • a virus vector into a cosmid clone, or lambda phage itself, (both termed phagmids here) could allow a multi-pu ⁇ ose vector to be generated to be both the repository of primary generated library sequences, source for ligation transcriptions, high efficiency bacterial fransfection and direct expression in higher eukaryotic hosts.
  • a non symmetrical restriction such as BstXI: CCANNNNNNTGG
  • the 3' part ofthe vector will be inserted into another arm with a non-symmetrical restriction (such as BstXI: CCANNNNNNTGG) containing a second unique sticky sequence (the N's).
  • the vector would be split at the determined restriction site (e.g. BstXI) within the site for foreign sequence expression in the vims vector.
  • the 5'-end ofthe vims cDNA would be appropriately fused to a promoter for in vitro transcription (e.g. T7) or for in vivo expression (e.g. an appropriate higher eukaryotic RNA polymerase promoter).
  • the 3'-end ofthe vims cDNA would terminate with a ribozyme for in vitro cleavage and/or a 3' terminator from a gene from host organism to lead to in vivo termination of transcription.
  • Left and right T-DNA borders that promote the integration of sequences in between into plant genomic DNA, could flank the promoter and terminator sequences.
  • At the terminus of each arm would be cos sequences to allow complete regeneration ofthe phagmid upon ligation in the presence of foreign library DNA containing the two unique sticky sequences at each respective termini.
  • These library DNA fragments could be generated by PCR amplification using determined restriction sites (e.g., BstXI) to generate unique sticky ends complementary to those in the phagmid-vector arms integrated in the PCR primers.
  • the 5' and 3' primers would each have unique recognition sequences in the BstXI restriction site (the N's) that would match the sticky sites on the respective sides ofthe vims vector.
  • the sites could be switched on a second set of PCR primers to allow the amplification of DNA to be ligated into the phagmid-viral vector arms in the "sense" and "anti-sense” orientation.
  • These constructions would allow for efficient in vitro ligation and use of cmde ligation mix as template for E. coli transformation, plant transformation, in vitro lambda packaging to 10 9 pfu/mcg or in vitro transcription. In this manner, the vector and flexibility for its screening could be maximized.
  • the goal of this example is to improve the host plant by introducing foreign genetic material via interspecific hybridization.
  • Host plant species vary in their ability to support expression of a sequence inserted into a plant viral vector. Some species support expression to a high specific activity, such as
  • Nicotiana benthamiana but have relatively low biomass.
  • Other species such as N tabacum, have high biomass and/or other desirable properties for growth in the field, but have a relatively low specific activity ofthe expressed sequence.
  • the desirable properties of two or more species are combined by making an interspecific hybrid by standard methods. After chromosome doubling to restore fertility, the primary hybrid may have suitable properties, or it may be desirable to backcross toward either parent selecting or screening at each generation for the desired property(ies) ofthe non-recunent parent, for example, infrogress the superior biomass of N tabacum into N benthamiana, or infrogress the superior viral vector performance of N benthamiana into N tabacum, among others.
  • a viral vector expressing the green fluorescent protein (GFP) is one example of a useful tool for screening the level of systemic expression in candidate hybrid plants.
  • EXAMPLE 33 Libraries of heterologous nucleic acid sequences in DHSPES constructs generated in a restriction-endonuclease-free and cell-free manner.
  • the goal of this example is to generate libraries of DHSPES constmcts containing heterologous sequences while avoiding the potential problems associated with the use of restriction enzymes for preparation ofthe inserted nucleic acids and with passage ofthe resultant constructions through E. coli.
  • DNA fragments are generated by restriction endonuclease treatment and ligated into a DHSPES vector with compatible termini.
  • libraries of DHSPES constructs harboring cDNA molecules in a restriction endonuclease-free and E. co/t-free manner are constructed. Such a system will permit the inclusion into DHSPES constmcts of molecules that harbor inconvenient internal restriction sites. This method of "cell-free cloning" will also allow us to obtain DHSPES- derived vimses containing genes that are not well tolerated by E. coli in traditional cloning approaches.
  • cell-free cloning will entail the in vitro assembly of partial viral sequences with a DNA fragment into a configuration that that will yield infectious viral RNA molecules upon in vitro transcription.
  • the viral sequences are divided into two "arms"; the left arm and the right arm.
  • the left arm encodes a T7 RNA polymerase promoter followed by viral sequences encoding replicase followed by the gene encoding movement protein and the subgenomic promoter that controls expression ofthe desired gene.
  • the right arm will contain sequences ofthe viral genome that encode the viral coat protein and the sequences that control its expression, the viral 3' untranslated region, and a ribozyme sequence for generating the desired 3' terminus on the transcribed molecules.
  • FIGURE 28 A schematic diagram for cell free cloning is shown in FIGURE 28.
  • the left arm and right arm will each have separate asymmetric (non- palindromic, thus self-incompatible) overhangs that will permit the two arms to be brought together by an intervening insert that is derived either from PCR product, cDNA reaction, or elsewhere.
  • the insert will have termini that are compatible with both the left and right arms. The termini of these molecules are such that ligation of left and right arms to insert will ensure assembly into the proper configuration to yield infectious viral transcripts.
  • the sequence contained in the insert will then be in the conect orientation and genomic position to permit its expression from the vims in plant cells.
  • the right arm will be synthesized by PCR and will have a biotin group inco ⁇ orated into the reverse (3') primer.
  • the resulting biotinylated PCR product representing the right arm will then be immobilized upon streptavidin paramagnetic beads.
  • Treatment ofthe DNA with T4 DNA polymerase and a single dNTP (in the present case, dGTP) will give a 5' overhang as a result ofthe exonuclease activity ofthe polymerase.
  • the insert DNA being PCR product, restriction fragment, or cDNA will be treated with T4 DNA polymerase with a single dNTP to generate 5' overhangs on its termini; the 3' of which is compatible with the 5' ofthe right arm.
  • the 5' terminus ofthe insert DNA will be compatible with the left .arm 3' terminus that had been generated similarly.
  • ligation reactions in the assembly ofthe vims on the paramagnetic beads will be carried out sequentially, with the insert being ligated to the immobilized right .arm first, followed by washing ofthe bead complex and then ligation ofthe left arm. Following the subsequent wash, in vitro transcription will be carried out to generate infectious RNA transcripts.
  • a DNA fragment comprising the left arm ofthe vims was then ligated to the resulting DNA-bead complex to generate a full-length vims clone that was subsequently used as template for in vitro transcription.
  • DNA-bead complexes were washed by sedimenting them in a magnetic field and resuspending them in the appropriate buffer.
  • aliquots were taken for analysis to confirm that the desired reaction had occurred.
  • the infectious RNA products ofthe franscription reaction were introduced into protoplasts of tobacco cell suspension cultures.
  • EXAMPLE 34 Use of undefined sequences to increase the genetic stability of foreign genes in vims expression vectors.
  • Insertion of foreign gene sequences into vims expression vectors can result in anangements of sequences that interfere with normal vims function and thereby, establish a selection landscape that favors the genetic deletion ofthe foreign sequence. Such events are adverse to the use of such expression vectors to stably express gene sequences systemically in plants.
  • a method that would allow sequences to be identified that may "insulate" functional virus sequences from the potential adverse effects of insertion of foreign gene sequences would greatly augment the expression potential of vims expression vectors.
  • identification of such "insulating" sequences that simultaneously enhanced the translation ofthe foreign gene product or the stability ofthe mRNA encoding the foreign gene would be quite helpful.
  • the example below demonstrates how libraries of random sequences can be introduced into vims vectors flanking foreign gene sequences.
  • the genetic stability ofthe human growth hormone gene (hGH) or an Ubiquitin fusion to hGH (Ubiq hGH) in the tob-amo virus expression vector p30B is rather poor, such that no stable vims preparations could not be made to serially passage infection onto plants and detect the expression of hGH recombinant protein.
  • the site of gene insertion is following a Pad site (underlined) in the vims vector. This sequence is known as a leader sequence and has been derived from the native leader and coding region from the native TMV Ul coat protein gene. In this leader, the normal coat protein ATG has been mutated to a Aga sequence (underlined in
  • a particular subset of this leader sequence (TCTTACAGTATCACTACTCCATCTCAGTTCGTGTTCTTGTCA) has been known to increase genetic stability and gene expression when compared with vims construct lacking the leader sequence.
  • the start site of subgenomic RNA synthesis is found at the GTTTT...
  • An oligonucleotide RL-1 (GTTTTAAATAGATCTTAC N(20)TTAATTAAGGCC ) was used with a primer homologous to the NcollApal region ofthe TMV genome to amplify a portion ofthe TMV movement protein.
  • the population of sequences were cloned into the Apal and Pad sites ofthe p30B hGH vector.
  • Vectors containing the undefined sequences leading the hGH genes were transcribed and inoculated onto Nicotiana benthamiana plants. 14 days post inoculation, systemic leaves were ground .and the plant extracts were inoculated onto a second set of plants.
  • a selectable marker could be used to discover which ofthe undefined sequences yield the desired function.
  • the amount ofthe GFP protein conelates with the level of fluorescence seen under long wave UV light and the amount of herbicide resistance gene product conelates with survival of plant cells or plants upon treatment with the herbicide. Therefore introduction of undefined sequences sunounding the GFP or herbicide resistance genes and then screening for individual vimses that either express the greatest level of fluorescence or cells that survive the highest amount of herbicide.
  • EXAMPLE 35 Method for using reporter genes fused to regulated or constitutive promoters as a surrogate marker for identifying genes impacting gene regulation.
  • transgenic hosts expressing a reporter gene under the control of various promoter types; 2) means to use such hosts to identify genes from libraries expressed in vims expression vectors that alter gene regulation.
  • the initial constmction ofthe reporter gene expression cassette will require identification ofthe appropriate reporter gene, which could include GFP (fluorescent in live plants under long wave UV light), GUS (fluorescent and color-based assay in detected tissue), herbicide resistance genes (live or death phenotype upon treatment with herbicide) or other scoreable gene products known to the art.
  • Promoter sequences can express RNA in constitutive or induced conditions.
  • An example of a regulated promoter would be that of tomato or potato protease inhibitor type I gene (Graham, et al, J. Biol. Chem. 260:6555- 6560 (1985)). These promoters are up regulated in the presence of jasmonic acid or herbivore damage to plant tissues.
  • Constitutive promoters are readily identifiable from anyone skilled in the art inspecting the relevant literature. Such combinations of inducible or constitutive promoters using appropriate reporter genes would be integrated into binary plant transformation vectors, fransformed into Agrobacterium and transformed into Nicotiana benthamiana leaf disks. Upon identification ofthe appropriate gene construct in regenerated tissues, the primary transformants would be selfed to obtain the first stable line of plants for assay. Libraries of cDNAs, full-length for gene overexpression or gene fragments for sense or anti-sense based gene suppression, would be ligated into vims expression vectors by normal molecular biology techniques. These libraries would be prepared for inoculation by the methods described in this patent application.
  • hosts with inducible promoters fused to reporter genes Once inoculated, hosts with inducible promoters fused to reporter genes, maintained in uninduced state, would be monitored for abenant expression ofthe reporter gene in tissue that contains replicating vims. If hosts containing constitutive promoter fusions to reporter genes .are used, monitoring for hyper- or hypo-expression conditions ofthe reporter gene would be the focus. In this manner, genes that augment pathways that induce or upregulate the activity of certain promoters could be identified by following the surrogate marker of reporter gene expression. Conversely, gene that down-regulate or halt reporter gene expression could be identified as products that negatively effect the activities ofthe promoter or signaling pathway to which it is responsive. Vims vectors containing sequences that effected reporter gene expression by overexpression or suppression positive or negative regulatory factors can be isolated, and foreign gene contained may be sequenced and analyzed by bioinformatic methods.
  • EXAMPLE 36 Method to induce the expression of alternative splicing variants to discover biological effects in host organisms and to use said host organism as a source for novel cDNA libraries enriched for alternatively spliced variants of genes.
  • Transcription of nuclear genes in higher eukaryotic organisms results in a primary RNA transcript that contains both coding (exon) and non-coding (intron) information.
  • a cmcial step in RNA maturation before exporting to the cytosol for translation is the splicing of introns from the primary transcript and the rendering of contiguous exons for coding ofthe desired product.
  • Gene families can include the SF2/ASF-like group of splicing factors (Lopato et al, PNAS 92:7672- 7676 (1995)), the RS-rich family of splicing factors (Lapato et al, The Plant Cell :2255-2264 (1996)) and other splicing families that have been identified in the literature in lower or upper eukaryotic systems.
  • the gene libraries will be subcloned into vims expression vectors and vims libraries will be inoculated as individuals or pools onto plants or plant cells.
  • splicing factors Once individual or groups of splicing factors are overexpressed or have their expression suppressed in plant cells, novel forms of splicing will occur due to the role of these proteins in alternative splicing of many transcription factors, splicing factors or other gene products.
  • the high level of expression achieved by vims expression vectors and their ability to infect most cell types in plants should raise the overall level of abenantly expressed mRNAs in the plant.
  • the transfected plants will be used as the starting point for the isolation of poly A(+) RNA for the constmction of cDNAs enriched for alternatively spliced genes.
  • the alterations in the alternative splicing could be the splicing of a greater or lesser number of introns from the primary mRNA than normally occurs in non-transfected plants.
  • These enriched cDNA libraries can now be cloned into vims expression vectors and the functions of these novel spliced forms of genes can be assayed on plants transfected with these vector libraries.
  • Similar methods could be to derive novel cDNA libraries by using vims vectors to express factors responsible for transcriptional regulation of genes in plants.
  • targeted cloning of transcription factor families would be ligated into vims expression vectors. Families could include homeodomain, Zn finger, leucine zipper and other transcription factor families appearing in pro or eukaryotic genomes. Schwechheimer, et al., Ann. Rev. Plant Phys. and Plant Mol. Biol. 49 (1998), in press.
  • the gene libraries will be sub-cloned into vims expression vectors and vims libraries will be inoculated as individuals or pools onto plants or plant cells.
  • cDNAs would be used to construct expression or gene suppression libraries that will be enriched for these rare or abenantly expressed cDNAs.
  • These enriched cDNA libraries can now be cloned into vims expression vectors and the functions of these novel spliced forms of genes can be assayed on plants transfected with these vector libraries.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Plant Pathology (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)

Abstract

The present invention provides methods for rapidly determining the function of nucleic acid sequences by transfecting the same into a host organism to effect expression. Phenotypic and biochemical changes produced thereby are then analyzed to ascertain the function of the nucleic acids which have been transfected into the host organism. The invention also provides methods for silencing endogenous genes by transfecting hosts with nucleic acid sequences to effect expression of the same. The present invention also provides methods for selecting desired functions of RNAs and proteins by the use of virus vectors to express libraries of nucleic acid sequence variants. Moreover, the present invention provides methods for inhibiting an endogenous protease of a plant host.

Description

METHOD OF DETERMINING THE FUNCTION OF NUCLEOTIDR SEQUENCES AND THE PROTEINS THEY ENCODE BY TRANSFECTING
THE SAME INTO A HOST
This application is a Continuation-In-Part application of U.S. Patent Application Serial No. 09/008,186, filed on January 16, 1998, which is incorporated by reference.
FIELD OF THE INVENTION The present invention relates generally to the field of molecular biology and plant genetics. Specifically, the present invention relates to a method for determining the function of nucleotide sequences and genes by transfecting the same into a host.
BACKGROUND OF THE INVENTION Great interest exists in launching genome projects in plants comparable to the human genome project. Valuable and basic agricultural plants, including by way of example but without limitation, corn, soybeans and rice are targets for such projects because the information obtained thereby may prove very beneficial for increasing world food production and improving the quality and value of agricultural products. The United States Congress is considering launching a corn genome project. By helping to unravel the genetics hidden in the corn genome, the project could aid in understanding and combating common diseases of grain crops. It could also provide a big boost for efforts to engineer plants to improve grain yields and resist drought, pests, salt, and other extreme environmental conditions. Such advances are critical for a world population expected to double by 2050. Currently, there are four species which provide 60% of all human food: wheat, rice, corn, and potatoes, and the strategies for increasing the productivity of these plants is dependent on rapid discovery ofthe function of unknown gene sequences determined as a result of genomics research. Moreover, such information could identify genes and products encoded by genes useful for human and animal healthcare such as pharmaceuticals. 0 One strategy that has been proposed to assist in such efforts is to create a database of expressed sequence tags (ESTs) that can be used to identify expressed genes. Accumulation and analysis of expressed sequence tags (ESTs) have become an important component of genome research. EST data may be used to identify gene products and thereby accelerate gene cloning. Various sequence databases have been established in an effort to store and relate the tremendous amount of sequence information being generated by the ongoing sequencing efforts. Some have suggested sequencing 500,000 ESTs for corn and 100,000 ESTs each for rice, wheat, oats, barley, and sorghum. Efforts at sequencing the genomes of plant species will undoubtedly rely upon these computer databases to share the sequence data as it is generated. Arabidopsis thaliana may be an attractive target for gene function discovery because a very large set of ESTs have already been produced in this organism, and these sequences tag more than 50% of the expected Arabidopsis genes.
Estimates of several ofthe important grain genome sizes (in reference to microbes and humans) have been suggested. These include Oryza sativa (rice) at about 430 million bases or about 20,000 genes, Sorghum bicolor (sorghum) at about 760 million bases or about 30,000 genes, Zea mays (corn) at about 2 billion bases or about 30,000 genes, and Triticum aestivum (wheat) at about 16 billion bases or about 30,000 genes.
Potential use ofthe sequence information so generated is enormous if gene function can be determined. It may become possible to engineer commercial seeds for agricultural use to convey any number of desirable traits to food and fiber crops and thereby increase agricultural production and the world food supply. Research and development of commercial seeds has so far focused primarily on traditional plant breeding, however there has been increased interest in biotechnology as it relates to plant characteristics. Knowledge ofthe genomes involved and the function of genes contained therein for both monocotyledonous and dicotyledonous plants is essential to realizing positive effects from such technology.
The impact of genomic research in seeds is potentially far reaching. For example, gene profiling in cotton can lead to an understanding ofthe types of genes being expressed primarily in fiber cells. The genes or promoters derived from these genes may be important in genetic engineering of cotton fiber for increased strength or for "built-in" fiber color. In plant breeding, gene profiling coupled to physiological trait analysis can lead to the identification of predictive markers that will be increasingly important in marker assisted breeding programs. Mining the DNA sequence of a particular crop for genes important for yield, quality, health, appearance, color, taste, etc., are applications of obvious importance for crop improvement.
Work has been conducted in the area of developing suitable vectors for expressing foreign DNA and RNA in plant hosts. Ahlquist, U.S. Patent Nos. 4,885,248 and 5,173,410 describes preliminary work done in devising transfer vectors which might be useful in transferring foreign genetic material into a plant host for the purpose of expression therein. Additional aspects of hybrid RNA viruses and RNA transformation vectors are described by Ahlquist et al. in U.S. Patent Nos. 5,466,788, 5,602,242, 5,627,060 and 5,500,360, all of which are incorporated herein by reference. Donson et al, U.S. Patent Nos. 5,316,931, 5,589,367 and 5,866,785, incorporated herein by reference, demonstrate for the first time plant viral vectors suitable for the systemic expression of foreign genetic material in plants. Donson et al. describe plant viral vectors having heterologous subgenomic promoters for the systemic expression of foreign genes. Carrington et al, U.S. Patent 5,491,076, describe particular potyvirus vectors also useful for expressing foreign genes in plants. The expression vectors described by Carrington et al. are characterized by utilizing the unique ability of viral polyprotein proteases to cleave heterologous proteins from viral polyproteins. These include Potyviruses such as Tobacco Etch Virus. Additional suitable vectors are described in U.S. Patent No. 5,811,653 and U.S. Patent Application Serial No. 08/324,003, both of which are incoφorated herein by reference.
Construction of plant RNA viruses for the introduction and expression of non- viral foreign genes in plants has also been demonstrated by Brisson et al, Methods in Enzymology 118:659 (1986), Guzman et al, Communications in
Molecular Biology: Viral Vectors, Cold Spring Harbor Laboratory, pp. 172-189 (1988), Dawson et al, Virology 172:285-292 (1989), Takamatsu et al, EMBO J. 6:307-311 (1987), French et al, Science 231:1294-1297 (1986), and Takamatsu et al, FEBS Letters 269:73-76 (1990). However, these viral vectors have not been shown capable of systemic spread in the plant and expression ofthe non- viral foreign genes in the majority of plant cells in the whole plant. Moreover, many of these viral vectors have not proven stable for the maintenance of non- viral foreign genes. However, the viral vectors described by Donson et al, in U.S. Patent Nos. 5,316,931, 5,589,367, and 5,866,785, Turpen in U.S. Patent No. 5,811,653, Carrington et al in U.S. Patent No. 5,491,076, and in co-pending U.S. Patent Application Serial No. 08/324,003, have proven capable of infecting plant cells with foreign genetic material and systemically spreading in the plant and expressing the non- viral foreign genes contained therein in plant cells locally or systemically. Likely, additional vehicles having greater infectivity and enhanced local or systemic expression of foreign genetic material will be developed either independently or as improvements ofthe vectors described in the patents and pending applications noted above. All patents, patent applications, and references cited in the instant application are hereby incorporated by reference.
The recombinant plant viral nucleic acids and recombinant viruses such as those demonstrated by Donson et al. which have been demonstrated to infect plant cells and express the foreign genetic material systemically are generally characterized as comprising a native plant viral subgenomic promoter, at least one non-native plant viral subgenomic promoter, a plant viral coat protein coding sequence, and at least one non-native nucleic acid sequence. The value of using such plant viral nucleic acids to effect systemic expression of non-native nucleic acids in a plant host is significant. This tool, if coupled with a rational design for elucidating the function ofthe non-native nucleic acids, would make significant strides in understanding the large amount of sequence information produced by sequencing efforts.
SUMMARY OF THE INVENTION In one aspect, the present invention is directed to a method of determining the function of nucleic acid sequences including genes and the proteins they encode in host organisms such as bacteria, yeast, plants, or animals, by transfecting the nucleic acid sequences into the organisms in a manner so as to effect localized or systemic expression ofthe nucleic acid sequences. The present inventors have determined methods for determining the function of nucleic acid sequences and the proteins they encode by tr.ansfecting organisms with nucleic acids of interest thereby providing a more rapid means for elucidating the function of these nucleic acids including genes and subsequently utilizing the rapidly expanding information in the field of genomics. In one embodiment, a nucleic acid is introduced into a plant host wherein the plant host may be a monocotyledonous or dicotyledonous plant, plant tissue or plant cell. Preferably, the nucleic acid may be introduced by way of a plant viral nucleic acid. Such plant viral nucleic acids are stable for the maintenance and transcription or expression of non-native nucleic acid sequences and are capable of locally or systemically transcribing or expressing such sequences in the plant host. Especially preferred recombinant plant viral nucleic acids useful in the methods ofthe present invention comprise a native plant viral subgenomic promoter, a plant viral coat protein coding sequence, and at least one non-native nucleic acid sequence.
Some viral vectors used in accordance with the present invention may be encapsidated by the coat proteins encoded by the recombinant plant virus. The recombinant plant viral nucleic acid or recombinant plant virus is used to infect appropriate hosts such as plants. The recombinant plant viral nucleic acid is capable of replication in the host, localized or systemic spread in the host, and transcription or expression ofthe non-native nucleic acid in the host to produce the desired product. Such products may be for example, useful polypeptides or proteins including enzymes, complex biomolecules, ribozymes, or polypeptides or protein products resulting from positive-sense or anti-sense RNA expression. Moreover, in alternate embodiments, the nucleic acid of interest may be expressed with the genomic DNA or RNA ofthe viral vectors and hence be under the control of a genomic promoter.
Some other viral vectors used in accordance with the present invention comprise recombinant animal viruses or portions thereof. Likewise, such animal viral vectors are useful to infect appropriate hosts such as animals. The recombinant animal viral nucleic acid is capable if replication in the host, systemic or localized spread in the host, and transcription or expression ofthe non-native nucleic acid in the host to produce the desired product.
In another embodiment, the present method uses a viral expression vector encoding for at least one protein non-native to the vector that is released from at least one polyprotein expressed by said vector by proteolytic processing.
In yet other preferred embodiments according to the present method, recombinant plant viruses are used which encode for the expression of a fusion between a plant viral coat protein and the amino acid product ofthe nucleic acid of interest.
In yet other preferred embodiments according to the present method, a nucleic acid sequence of interest including a gene may be placed within any suitable vector construct such as a virus for infecting the host organism. That is, the present method may be practiced without concern for the position ofthe nucleic acid sequence of interest within the vector used to infect the host organism. The invention is not intended to be limited to any particular viral constructs but specifically contemplates using all operable constructs. Those skilled in the art will understand that these embodiments are representative only of many constructs which may be useful to produce localized or systemic expression of nucleic acids in host organisms such as plants. All such constructs .are contemplated and intended to be within the scope ofthe present invention. Those of skill in the art will readily understand that there are many methods to determine the function ofthe nucleic acid once localized or systemic expression in a host, such as a plant, plant cell, transgenic plant, animal or animal cell is attained. In one embodiment the function of a nucleic acid may be determined by complementation analysis. That is, the function ofthe nucleic acid of interest may be determined by observing the endogenous gene or genes whose function is replaced or augmented by introducing the nucleic acid of interest. A discussion of such phenomenon is provided by Napoli et al, The Plant Cell 2:279-289 (1990). In a second embodiment, the function of a nucleic acid may be determined by analyzing the biochemical alterations in the accumulation of substrates or products from enzymatic reactions according to any one ofthe means known by those skilled in the art. In a third embodiment, the function of a nucleic acid may be determined by observing phenotypic changes in the host by methods including morphological, macroscopic or microscopic analysis. In a fourth embodiment, the function of a nucleic acid may be determined by observing any changes in biochemical pathways which may be modified in the host organism as a result of expression ofthe nucleic acid. In a fifth embodiment, the function of a nucleic acid may be determined utilizing techniques known by those skilled in the art to observe inhibition of endogenous gene expression in the cytoplasm of cells as a result of expression ofthe nucleic acid. In a sixth embodiment, the function of a nucleic acid may be determined utilizing techniques known by those skilled in the art to observe changes in the RNA or protein profile as a result of expression ofthe nucleic acid. In a seventh embodiment, the function of a nucleic acid may be determined by selection of organisms such as plants or human cells and tissues capable of growing or maintaining viability in the presence of noxious or toxic substances, such as, for example herbicides and pharmaceutical ingredients.
A second aspect ofthe present invention is a method of silencing endogenous genes in a host by introducing nucleic acids into the host by way of a viral nucleic acid such as a plant or animal viral nucleic acid suitable to produce expression of a nucleic acid in a transfected host. In one embodiment, the host is a plant, but those skilled in the art will understand that other hosts such as bacteria, yeast and animals including humans may also be utilized. This method utilizes the principle of post-transcription gene silencing ofthe endogenous host gene homo log. Since the replication mechanism ofthe transfected non-native nucleic acid produces both sense and antisense RNA sequences, the orientation of the non-native nucleic acid insert is not crucial to providing gene silencing. Particularly, this aspect ofthe invention is especially useful for silencing a multigene family as is frequently found in plants. The prior art has not demonstrated an effective means for silencing a multigene family in plants.
A third aspect ofthe present invention is a method for selecting desired functions of RNAs .and proteins by the use of virus vectors to express libraries of nucleic acid sequence variants. Libraries of sequence variants may be generated by means of in vitro mutagenenisis and/or recombination. Rapid in vitro evolution can be used to improve virus-specific or protein-specific functions. In particular, plant RNA virus expression vectors may be used as tools to bear libraries containing variants of nucleic acid, genes from virus, plant or other sources, and to be applied to plants or plant cells such that the desired altered effects in the RNA or protein products can be determined, selected and improved. In a preferred embodiment, nucleic acid shuffling techniques may be employed to construct shuffled gene libraries. Random, semi-random or known sequences of virus origin may also be inserted in virus expression vectors between native virus sequences and foreign gene sequences, to increase the genetic stability of foreign genes in expression vectors as well as the translation ofthe foreign gene and the stability ofthe mRNA encoding the foreign gene in vivo. The desired function of RNA and protein may include the promoter activities, replication properties, translational efficiencies, movement properties (local and systemic), signaling pathway, or virus host range, among others. The desired function alteration can be identified by assaying infected plants and the nature of mutation can be determined by analysis of sequence variants in the virus vector.
Methods to increase the representation of gene sequences in virus expression libraries may also be achieved by bypassing the genetic bottleneck of propagation in E. coli. For example, in one ofthe preferred embodiments ofthe instant invention, cell-free methods may be used to clone sequence libraries or individual arrayed sequences into virus expression vectors and reconstruct an infectious virus, such that the final ligation product can be transcribed and the resulting RNA can be used for plant or plant cell inoculation/infection with the output being gene function discovery or protein production.
Techniques to screen sequence libraries can be introduced into RNA viruses or RNA virus vectors as populations or individuals in parallel to identify individuals with novel and augmented virus-encoded functions in replication and virus movement, foreign gene sequence retention in vectors and proper folding, activity and expression of protein products, novel gene expression, effects on host metabolism, and resistance or susceptibility of plants to exogenous agents. Variation in the sequence of a native virus gene(s) or heterologous nucleotide sequence(s) may be introduced into an RNA virus or an RNA virus expression vector by many methods as a means to screen a population of variants in batch or individuals in parallel for novel properties exhibited by the virus itself or conferred on the host plant or cell by the virus vector. Variant populations can be transfected as populations or individual clones into "host": 1) protoplasts; 2) whole plants; or 3) inoculated leaves of whole plants and screened for various traits including protein expression (increase or decrease), RNA expression (increase or decrease), secondary metabolites or other host property gained or loss as a result ofthe virus infection.
For treatment of hosts with agents that result in cell death or down regulation in general metabolic function, a virus vector, which simultaneously expressed the green fluorescent protein (GFP) or other selectable marker gene and the variant sequence, is used to screen quantitatively for levels of resistance or sensitivity to the agent in question conferred upon the host by the variant sequence expressed from the viral vector. By quantitatively screening pools or individual infection events, those viruses containing unique variant sequences allowing sustained metabolic life of host are identified by fluorescence under long wave UV light. Those that do not confer this phenotype will fail to or poorly fluoresce. In this manner, high throughput screening in multi-well dishes in plate readers is possible where the average fluorescence ofthe well would be expressed as a ratio ofthe adsorption (measuring the cell mass) thereby giving a comparable quantitative value. This technique enables screening of populations or individuals followed by rescue ofthe sequence from virus vectors conferring desired trait by RT-PCR and re-screening of particular variant sequences in secondary screens. The functions of transcription factors or factors contributing to the signal transduction pathway of host cells are monitored by using specific proteomic, mRNA or metanomic traits to be assayed following fransfection with a virus expression library. The contribution of a particular protein or product to a valuable trait may be known from the literature, but a new mode of enhanced or reduced expression could be identified by finding the factors that respond to cellular signals that in turn alter its particular expression. For example, transcription factors regulating the expression of defense proteins such as systemin peptides, or protease inhibitors could be identified by transfecting hosts with virus libraries and the expression of systemin or protease inhibitors or their RNAs be directly assayed. Conversely, the promoters responsible for expressing these genes could be genetically fused to the green fluorescent protein .and introduced into hosts as transient expression constructs or into stable transformed host cells/tissues. The resulting cells would be transfected with viral vector libraries. Hosts now could be screened rapidly by following relative GFP expression following vector fransfection. Likewise, coupling the transfecting of hosts with virus libraries with the treatment of plants with methyl jasmonate could identify sequences that reverse or enhance the gene induction events induced by this metabolite. This approach could be applied to other factors involved in promotion of higher biomass in plants such as Leafy or DET2. The expression of these factors could be directly assayed or via promoters genetically fused to GFP. This technique will enable screening of populations or individuals followed by rescue ofthe sequence from virus vectors conferring desired trait by RT-PCR and re-screening of particular variant sequences in secondary screens. A fourth aspect ofthe present invention is a method for inhibiting an endogenous protease of a plant host comprising the step of treating the plant host with a compound which induces the production of an endogenous inhibitor of said protease. In a preferred embodiment, jasmonic acid may be used to treat the plant host to induce the production of an endogenous inhibitor of an endogenous protease. In another preferred embodiment, the treatment ofthe plant host with a compound results an increased representation of an exogenous nucleic acid or the protein product thereof. In particular, transgenic hosts expressing protease inhibitors may be used to decrease the degradation of proteins expressed by virus expression vectors. In a preferred embodiment, jasmonic acid may be used to treat plants infected with virus expression vectors to decrease degradation of proteins expressed by virus expression vectors.
A fifth aspect ofthe present invention are genes and fragments thereof, nucleotide sequences, and gene products obtained by way ofthe method ofthe present invention. The present invention features expressing selected nucleotide sequences in a host organism. Those of skill in the art will readily appreciate that the gene products of such nucleotide sequences may be isolated using techniques known to those skilled in the art. Such gene products may exhibit biological activity as pharmaceuticals, herbicides, and other similar functions.
BRIEF DESCRIPTION OF THE FIGURES FIG. 1 depicts the vector TTOl/PSY +. FIG. 2 represents the vector TTO1A/PDS+. FIG. 3 represents the vector TT01A/Ca CCS+. FIG. 4 represents the vector TTU51 CTP CrtB.
FIG. 5 represents the vector TTOSA1CTP CrtI 491.
FIG. 6 represents the Erwinia herbicola phytoene desaturase gene (plasmid pAU211). FIG. 7 represents the plasmid KS+/Crt/* 491.
FIG. 8 represents the plasmid pBS736.
FIG. 9 represents the plasmid pBS 712.
FIG. 10 represents the 72 kDa gene product ofthe genomic clone encoding alcohol oxidase ZZA1. FIG. 11 represents the plasmid TTOS 1 APE ZZA1.
FIG. 12 represents the plasmid TTO1 A 103L.
FIG. 13 represents the plasmid TTU51A QSEO #3.
FIG. 14 represents the plasmid KS+ TVCVK #23.
FIG. 15 represents the plasmid pBS735. FIG. 16 represents the plasmid pBS740.
FIG. 17 represents the plasmid pBS723.
FIG. 18 represents the plasmid pBS731.
FIG. 19 represents the plasmid pBS740 AT #120.
FIG. 20 represents the nucleotide sequence alignment of 740 AT #120 to human ADP-ribosylation factor (ARF3) M33384.
FIG. 21 represents the plasmid pBS740 AT #88.
FIG. 22 represents the nucleotide sequence alignment of 740 AT #88 to L33574 mRNA for rhodopsin.
FIG. 23 represents the nucleotide sequence alignment of 740 AT #88 to X07797 Octopus mRNA for rhodopsin.
FIG. 24 represents the protein sequence alignment of 740 AT #88 to an Arabidopsis est ORF ATTS2938.
FIG. 25 represents the protein sequence alignment of 740 AT #88 to Octopus rhodopsin P31356. FIG. 26 represents amino acid sequence comparison of 740 AT #2441 to tobacco RAN-B1 GTP binding protein.
FIG. 27 represents nucleotide sequence comparison of 740 AT #2441 to human RAN GTP-binding protein. FIG. 28 represents a schematic diagram of cell free cloning.
DETAILED DESCRIPTION OF THE INVENTION In one aspect, the present invention is directed to a method of determining the function of a nucleic acid sequence including a gene and a protein encoded thereby in an organism such as bacteria, fungi, yeast, animals and plants by transfecting the nucleic acid sequence into the organism. The present inventors have determined methods for determining the function of nucleic acid sequences by transfecting organisms with the nucleic acids thereby providing a more rapid means for determining gene function and utilizing the rapidly expanding sequence information in the field of genomics.
In one embodiment, a nucleic acid is introduced into a plant host. Preferably, the nucleic acid may be introduced by way of a viral nucleic acid. Such recombinant viral nucleic acids are stable for the maintenance and transcription or expression of non-native nucleic acid sequences and are capable of systemically transcribing or expressing such non-native sequences in the plant host. Especially preferred recombinant plant viral nucleic acids useful in the present invention comprise a native plant viral subgenomic promoter, a plant viral coat protein coding sequence, and at least one non-native nucleic acid sequence. In a second embodiment, plant viral nucleic acid sequences used in the method ofthe present invention are characterized by the deletion ofthe native coat protein coding sequence and comprise a non-native plant viral coat protein coding sequence and a non-native promoter, preferably the subgenomic promoter ofthe non-native coat protein coding sequence, capable of expression in the plant host, packaging ofthe recombinant plant viral nucleic acid, and ensuring a systemic infection ofthe host by the recombinant plant viral nucleic acid. The recombinant plant viral nucleic acid may contain one or more additional native or non-native subgenomic promoters. Each non-native subgenomic promoter is capable of transcribing or expressing adjacent genes or nucleic acid sequences in the plant host and incapable of recombination with each other and with native subgenomic promoters. One or more non-native nucleic acids may be inserted adjacent to the native plant viral subgenomic promoter or the native and non- native plant viral subgenomic promoters if more than one nucleic acid sequence is included. Moreover, it is specifically contemplated that two or more heterologous non-native subgenomic promoters may be used. The non-native nucleic acid sequences may be transcribed or expressed in the host plant under the control ofthe subgenomic promoter to produce the products ofthe nucleic acids of interest.
In a third embodiment, plant viral nucleic acids are used in the present invention wherein the native coat protein coding sequence is placed adjacent one ofthe non-native coat protein subgenomic promoters instead of a non-native coat protein coding sequence. In a fourth embodiment, plant viral nucleic acids are used in the present invention wherein the native coat protein gene is adjacent its subgenomic promoter and one or more non-native subgenomic promoters have been inserted into the viral nucleic acid. The inserted non-native subgenomic promoters are capable of transcribing or expressing adjacent genes in a plant host and are incapable of recombination with each other and with native subgenomic promoters. Non-native nucleic acid sequences may be inserted adjacent the non- native subgenomic plant viral promoters such that the sequences are transcribed or expressed in the host plant under control ofthe subgenomic promoters to produce the product ofthe non-native nucleic acid. Alternatively, the native coat protein coding sequence may be replaced by a non-native coat protein coding sequence.
The viral vectors used in accordance with the present invention may be encapsidated by the coat proteins encoded by the recombinant plant virus. The recombinant plant viral nucleic acid or recombinant plant virus is used to infect appropriate hosts such as plants. The recombinant plant viral nucleic acid is capable of replication in the host, localized or systemic spread in the host, and transcription or expression ofthe non-native nucleic acid in the host to produce the desired product. Such products may be for example, therapeutics and other useful polypeptides or proteins including enzymes, complex biomolecules, ribozymes, or polypeptides or protein products resulting from positive-sense or anti-sense RNA expression. Moreover, the nucleic acid of interest may be under the control of a genomic promoter and therefore be expressed with the genome of the virus. In another embodiment, the present method uses a viral expression vector encoding at least one protein non-native to the vector that is released from at least one polyprotein expressed by said vector by proteolytic processing catalyzed by at least one protease in said polyprotein wherein said vector comprises at least one promoter, DNA having a sequence which codes for at least one polyprotein from a polyprotein-producing virus, at least one restriction site flanking a 3' terminus of said DNA and a cloning vehicle. Additional embodiments use a viral expression vector encoding for at least one protein non-native to the vector that is released from at least one polyprotein expressed by the vector by proteolytic processing catalyzed by at least one protease in the polyprotein wherein the vector comprises at least one promoter, DNA having a sequence which codes for at least one polyprotein from a polyprotein-producing virus, may contain at least one restriction site flanking a 3' terminus of said cDNA and a cloning vehicle. Preferred embodiments include using a potyvirus as the polyprotein-producing virus, and especially preferred embodiments may use TEV (tobacco etch virus). A more detailed description of such vectors useful according to the method ofthe present invention may be found in U.S. Patent No. 5,491,076 which is incorporated herein by reference.
In yet other preferred embodiments according to the present method, recombinant plant viruses are used which encode for the expression of a fusion between a plant viral coat protein and the amino acid product ofthe nucleic acid of interest. Such a recombinant plant virus provides for high level expression of a nucleic acid of interest. The location or locations where the viral coat protein is joined to the amino acid product ofthe nucleic acid of interest may be referred to as the fusion joint. A given product of such a construct may have one or more fusion joints. The fusion joint may be located at the carboxyl terminus ofthe viral coat protein or the fusion joint may be located at the amino terminus ofthe coat protein portion ofthe construct. In instances where the nucleic acid of interest is located internal with respect to the 5' and 3' residues ofthe nucleic acid sequence encoding for the viral coat protein, there are two fusion joints.
That is, the nucleic acid of interest may be located 5', 3', upstream, downstream or within the coat protein. In some embodiments of such recombinant plant viruses, a "leaky" start or stop codon may occur at a fusion joint which sometimes does not result in translational termination. A more detailed description of some recombinant plant viruses according to this embodiment of the invention may be found in co-pending U.S. Patent Application Serial No. 08/324,003 the disclosure of which is incorporated herein by reference. In yet other embodiments according to the present method, a nucleic acid sequence of interest or a gene may be placed within any suitable vector construct such as a virus for infecting the host organism. That is, the present method may be practiced without concern for the position ofthe nucleic acid sequence of interest within the vector used to infect the host organism. The invention is not intended to be limited to any particular viral constructs but specifically contemplates using all operable constructs. Specifically, those skilled in the art may choose to transfer DNA or RNA of any size up to and including an entire genome into a host organism in order to determine the function thereof.
Those skilled in the art will understand that these embodiments are representative only of many constructs which may be useful to produce localized or systemic expression of nucleic acids in host organisms such as plants. All such constructs are contemplated and intended to be within the scope ofthe present invention.
In order to provide an even clearer and more consistent understanding of the specification and the claims, including the scope given herein to such terms, the following definitions are provided:
Adjacent: A position in a nucleotide sequence proximate to and 5' or 3' to a defined sequence. Generally, adjacent means within 2 or 3 nucleotides ofthe site of reference. Animal cell: A single functional cell found within an animal organism.
Animal tissue refers to one or more cells grouped or organized to perform one or more functions. Animal organ refers to one or more tissues morphologically arranged to perform one or more functions within .an organism.
Anti-Sense Inhibition: A type of gene regulation based on cytoplasmic, nuclear or organelle inhibition of gene expression due to the presence in a cell of an RNA molecule complementary to at least a portion ofthe mRNA being translated. It is specifically contemplated that DNA molecules may be from either an RNA virus or mRNA from the host cells genome or from a DNA virus. Cell Culture: A proliferating group of cells which may be in either an undifferentiated or differentiated state, growing contiguously or non- contiguously.
Chimeric Sequence or Gene: A nucleotide sequence derived from at least two heterologous parts. The sequence may comprise DNA or RNA.
Coding Sequence: A deoxyribonucleotide or ribonucleotide sequence which, when either transcribed and translated or simply translated, results in the formation of a cellular polypeptide or a ribonucleotide sequence which, when translated, results in the formation of a cellular polypeptide. Compatible: The capability of operating with other components of a system. A vector or plant or animal viral nucleic acid which is compatible with a host is one which is capable of replicating in that host. A coat protein which is compatible with a viral nucleotide sequence is one capable of encapsidating that viral sequence. Complementation Analysis: As used herein, this term refers to observing the changes produced in an organism when a nucleic acid sequence is introduced into that organism after a selected gene has been deleted or mutated so that it no longer functions fully in its normal role. A complementary gene to the deleted or mutated gene can restore the genetic phenotype ofthe selected gene. Constitutive expression: Gene expression which features substantially constant or regularly cyclical gene transcription. Generally, genes which are constitutively expressed are substantially free of induction from an external stimulus.
Differentiated cell: A cell which has substantially matured to perform one or more biochemical or physiological functions.
Dual Heterologous Subgenomic Promoter Expression System (DHSPES): a plus stranded RNA vector having a dual heterologous subgenomic promoter expression system to increase, decrease, or change the expression of proteins, peptides or RNAs, preferably those described in U.S. Patent Nos. 5,316,931, 5,811,653, 5,589,367, and 5,866,785, the disclosure of which is incorporated herein by reference.
Expressed sequence tags (ESTs): Relatively short single-pass DNA sequences obtained from one or more ends of cDNA clones and RNA derived therefrom. They may be present in either the 5' or the 3' orientation. ESTs have been shown useful for identifying particular genes.
Expression: The term as used herein is meant to incorporate one or more of transcription, reverse transcription and translation. Gene: A discrete nucleic acid sequence responsible for producing one or more cellular products and/or performing one or more intercellular or intracellular functions.
Gene silencing: A reduction in gene expression. A viral vector expressing gene sequences from a host may induce gene silencing of homologous gene sequences.
Growth cycle: As used herein, the term is meant to include the replication of a nucleus, an organelle, a cell, or an organism.
Host: A cell, tissue or organism capable of replicating a nucleic acid such as a vector or plant viral nucleic acid and which is capable of being infected by a virus containing the viral vector or viral nucleic acid. This term is intended to include prokaryotic and eukaryotic cells, organs, tissues or organisms, where appropriate. Bacteria, fungi, yeast, animal (cell, tissues, or organisms), and plant (cell, tissues, or organisms) are examples of a host.
Induction: The terms "induce", "induction" and "inducible" refer generally to a gene and a promoter operably linked thereto which is in some manner dependent upon an external stimulus, such as a molecule, in order to actively transcribe and/or translate the gene.
Infection: The ability of a virus to transfer its nucleic acid to a host or introduce a viral nucleic acid into a host, wherein the viral nucleic acid is replicated, viral proteins are synthesized, and new viral particles assembled. In this context, the terms "transmissible" and "infective" are used interchangeably herein. The term is also meant to include the ability of a selected nucleic acid sequence to integrate into a genome, chromosome or gene of a target organism. Multigene family: A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those which encode the histones, hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins.
Non-Native: Any RNA or DNA sequence that does not normally occur in the cell or organism in which it is placed. Examples include recombinant plant viral nucleic acids and genes or ESTs contained therein. That is, an RNA or DNA sequence may be non-native with respect to a viral nucleic acid. Such an RNA or DNA sequence would not naturally occur in the viral nucleic acid. Also, an RNA or DNA sequence may be non-native with respect to a host organism. That is, such a RNA or DNA sequence would not naturally occur in the host organism. Conversely, the term non-native does not imply that an RNA or DNA sequence must be non-native with respect to both a viral nucleic acid and a host organism concurrently. The present invention specifically contemplates placing an RNA or DNA sequence which is native to a host organism into a viral nucleic acid in which it is non-native. Nucleic acid: As used herein the term is meant to include any DNA or
RNA sequence from the size of one or more nucleotides up to and including a complete gene sequence. The term is intended to encompass all nucleic acids whether naturally occurring in a particular cell or organism or non-naturally occurring in a particular cell or organism. Nucleic acid of interest: The term is used interchangeably with the term
"nucleic acid" and is intended to refer to the nucleic acid sequence whose function is to be determined. The sequence will normally be non-native to the viral vector but may be native or non-native to the host organism.
Organism: The term organism and "host organism" as used herein is specifically intended to include animals including humans, plants, viruses, fungi, .and bacteria.
Phenotypic Trait: An observable, measurable or detectable property resulting from the expression or suppression of a gene or genes.
Plant Cell: The structural and physiological unit of plants, consisting of a protoplast and the cell wall.
Plant Organ: A distinct and visibly differentiated part of a plant, such as root, stem, leaf or embryo. Plant Tissue: Any tissue of a plant in planta or in culture. This term is intended to include a whole plant, plant cell, plant organ, protoplast, cell culture, or any group of plant cells organized into a structural and functional unit.
Positive-sense inhibition: A type of gene regulation based on cytoplasmic inhibition of gene expression due to the presence in a cell of an RNA molecule substantially homologous to at least a portion ofthe mRNA being translated.
Promoter: The 5 '-flanking, non-coding sequence substantially adjacent a coding sequence which is involved in the initiation of transcription ofthe coding sequence. Protoplast: An isolated plant or bacterial cell without some or all of its cell wall.
Recombinant Plant Viral Nucleic Acid: Plant viral nucleic acid which has been modified to contain non-native nucleic acid sequences. These non-native nucleic acid sequences may be from any organism or purely synthetic, however, they may also include nucleic acid sequences naturally occurring in the organism into which the recombinant plant viral nucleic acid is to be introduced.
Recombinant Plant Virus: A plant virus containing the recombinant plant viral nucleic acid.
Subgenomic Promoter: A promoter of a subgenomic mRNA of a viral nucleic acid.
Substantial Sequence Homology: Denotes nucleotide sequences that are substantially functionally equivalent to one another. Nucleotide differences between such sequences having substantial sequence homology will be de minimis in affecting function ofthe gene products or an RNA coded for by such sequence.
Systemic Infection: Denotes infection throughout a substantial part of an organism including mechanisms of spread other than mere direct cell inoculation but rather including transport from one infected cell to additional cells either nearby or distant. Transposon: A nucleotide sequence such as a DNA or RNA sequence which is capable of transferring location or moving within a gene, a chromosome or a genome. Transgenic plant: A plant which contains a foreign nucleotide sequence inserted into either its nuclear genome or organellar genome.
Transcription: Production of an RNA molecule by RNA polymerase as a complementary copy of a DNA sequence or subgenomic mRNA. Vector: A self-replicating RNA or DNA molecule which transfers an
RNA or DNA segment between cells, such as bacteria, yeast, plant, or animal cells.
Virus: An infectious agent composed of a nucleic acid which may or may not be encapsidated in a protein. A virus may be a mono-, di-, tri-, or multi- partite virus, as described above.
In preferred embodiments, the present invention provides for the infection of a plant host by a recombinant plant virus containing a recombinant plant viral nucleic acid or by the recombinant plant viral nucleic acid which contains one or more non-native nucleic acid sequences which are subsequently transcribed or expressed in the infected tissues ofthe plant host. The product ofthe coding sequences may be recovered from the plant, produce a phenotypic trait in the plant, effect biochemical pathways within the plant or effect endogenous gene expression within the plant.
The present invention has a number of advantages. The instant invention allows practitioners to determine the function of a nucleic acid sequence which has been heretofore unknown.
The chimeric genes and vectors and recombinant plant viral nucleic acids used in this invention are constructed using techniques well known in the art. Suitable techniques have been described in Sambrook et al. (2nd ed.), Cold Spring Harbor Laboratory, Cold Spring Harbor (1982, 1989); Methods in
Enzymol. (Vols. 68, 100, 101, 118, and 152-155) (1979, 1983, 1986 and 1987); and DNA Cloning, D.M. Clover, Ed., IRL Press, Oxford (1985). Medium compositions have been described by Miller, J., Experiments in Molecular Genetics, Cold Spring Harbor Laboratory, New York (1972), as well as the references previously identified, all of which are incorporated herein by reference. DNA manipulations and enzyme treatments are carried out in accordance with manufacturers' recommended procedures in making such constructs. An important feature ofthe present invention is the use of recombinant plant viral nucleic acids which are capable of replication, local and/or systemic spread in a compatible plant host, and which contain one or more non-native subgenomic promoters which are capable of transcribing or expressing adjacent nucleic acid sequences in the plant host. The recombinant plant viral nucleic acids may be further modified to delete all or part ofthe native coat protein coding sequence and to contain a non-native coat protein coding sequence under control ofthe native or one ofthe non-native subgenomic promoters, or put the native coat protein coding sequence under the control of a non-native plant viral subgenomic promoter. The recombinant plant viral nucleic acids have substantial sequence homology to plant viral nucleotide sequences. A partial listing of suitable viruses is described, infra. The nucleotide sequence may be or may be derived from an RNA, DNA, cDNA or a chemically synthesized RNA or DNA. The first step in producing recombinant plant viral nucleic acids according to this particular embodiment for use in the present invention is to modify the nucleotide sequences ofthe plant viral nucleotide sequence by known conventional techniques such that one or more non-native subgenomic promoters are inserted into the plant viral nucleic acid without destroying the biological function ofthe plant viral nucleic acid. The subgenomic promoters are capable of transcribing or expressing adjacent nucleic acid sequences in a plant host infected by the recombination plant viral nucleic acid or recombinant plant virus. The native coat protein coding sequence may be deleted in some embodiments, placed under the control of a non-native subgenomic promoter in other embodiments, or retained in a further embodiment. If it is deleted or otherwise inactivated, a non- native coat protein gene is inserted under control of one ofthe non-native subgenomic promoters, or optionally under control ofthe native coat protein gene subgenomic promoter. The non-native coat protein is capable of encapsidating the recombinant plant viral nucleic acid to produce a recombinant plant virus. Thus, the recombinant plant viral nucleic acid contains a coat protein coding sequence, which may be native or a normative coat protein coding sequence, under control of one ofthe native or non-native subgenomic promoters. The coat protein is involved in the systemic infection ofthe plant host. Some ofthe viruses which meet this requirement, and therefore have been shown to be suitable for use according to the methods ofthe present invention, include viruses from the tobamovirus group such as Tobacco Mosaic virus (TMV), Ribgrass Mosaic Virus (RGM), Cowpea Mosaic virus (CMV), Alfalfa Mosaic virus (AMV), Cucumber Green Mottle Mosaic virus watermelon strain (CGMMV-W) and Oat Mosaic virus (OMV) and viruses from the brome mosaic virus group such as Brome Mosaic virus (BMV), broad bean mottle virus and cowpea chlorotic mottle virus. Additional suitable viruses include Rice Necrosis virus (RNV), and geminiviruses such as Tomato Golden Mosaic virus (TGMV), Cassava Latent virus (CLV) and Maize Streak virus (MS V). Each of these groups of suitable viruses is characterized below. However, the invention should not be construed as limited to using these particular viruses, but rather the method ofthe present invention is contemplated to include all plant viruses at a minimum.
TOBAMOVIRUS GROUP Tobacco Mosaic virus (TMV) is a member ofthe Tobamo viruses. The TMV virion is a tubular filament, and comprises coat protein sub-units arranged in a single right-handed helix with the single-stranded RNA intercalated between the turns ofthe helix. TMV infects tobacco as well as other plants. TMV is transmitted mechanically and may remain infective for a year or more in soil or dried leaf tissue.
The TMV virions may be inactivated by subjection to an environment with a pH of less than 3 or greater than 8, or by formaldehyde or iodine. Preparations of TMV may be obtained from plant tissues by (NH4)2SO4 precipitation, followed by differential centrifugation.
The TMV single-stranded RNA genome is about 6400 nucleotides long, and is capped at the 5 '-end but not polyadenylated. The genomic RNA can serve as mRNA for protein of a molecular weight of about 130,000 (130K) and another produced by read-through of molecular weight about 180,000 (180K). However, it cannot function as a messenger for the synthesis of coat protein. Other genes are expressed during infection by the formation of monocistronic, 3'-coterminal subgenomic mRNAs, including one (LMC) encoding the 17.5K coat protein and another (I2) encoding a 3 OK protein. The 3 OK protein has been detected in infected protoplasts as described in Miller, J., Virology 132:71 (1984), and it is involved in the cell-to-cell transport ofthe virus in an infected plant as described by Deom et al, Science 237:389 (1987). The functions ofthe two large proteins are unknown, however, they are thought to function in RNA replication and transcription.
Several double-stranded RNA molecules, including double-str.anded RNAs corresponding to the genomic, I2 and LMC RNAs, have been detected in plant tissues infected with TMV. These RNA molecules are presumably intermediates in genome replication and/or mRNA synthesis processes which appear to occur by different mechanisms.
TMV assembly apparently occurs in plant cell cytoplasm, although it has been suggested that some TMV assembly may occur in chloroplasts since transcripts of ctDNA have been detected in purified TMV virions. Initiation of TMV assembly occurs by interaction between ring-shaped aggregates ("discs") of coat protein (each disc consisting of two layers of 17 subunits) and a unique internal nucleation site in the RNA; a hairpin region about 900 nucleotides from the 3 '-end in the common strain of TMV. Any RNA, including subgenomic RNAs containing this site, may be packaged into virions. The discs apparently assume a helical form on interaction with the RNA, and assembly (elongation) then proceeds in both directions (but much more rapidly in the 3'- to 5'- direction from the nucleation site).
Another member ofthe Tobamoviruses, the Cucumber Green Mottle Mosaic virus watermelon strain (CGMMV-W) is related to the cucumber virus. Nozu et al, Virology -45:577 (1971). The coat protein of CGMMV-W interacts with RNA of both TMV and CGMMV to assemble viral particles in vitro. Kurisu et al, Virology 70:214 (1976).
Several strains ofthe tobamovirus group are divided into two subgroups, on the basis ofthe location ofthe assembly of origin. Subgroup I, which includes the vulgare, OM, and tomato strain, has an origin of assembly about 800-1000 nucleotides from the 3 '-end ofthe RNA genome, and outside the coat protein cistron. Lebeurier et al, Proc. Natl. Acad. Sci. USA 74:149 (1977); and Fukuda et al, Virology 101:493 (1980). Subgroup II, which includes CGMMV-W and cornpea strain (Cc) has an origin of assembly about 300-500 nucleotides from the 3 '-end ofthe RNA genome and within the coat-protein cistron. The coat protein cistron of CGMMV-W is located at nucleotides 176-661 from the 3 '-end. The 3' noncoding region is 175 nucleotides long. The origin of assembly is positioned within the coat protein cistron. Meshi et al, Virology 127:54 (1983).
BROME MOSAIC VIRUS GROUP Brome Mosaic virus (BMV) is a member of a group of tripartite, single- stranded, RNA-containing plant viruses commonly referred to as the bromoviruses. Each member of the bromo viruses infects a narrow range of plants. Mechanical transmission of bromoviruses occurs readily, and some members are transmitted by beetles. In addition to BV, other bromoviruses include broad bean mottle virus and cowpea chlorotic mottle virus.
Typically, a bromovirus virion is icosahedral, with a diameter of about 26 μm, containing a single species of coat protein. The bromovirus genome has three molecules of linear, positive-sense, single-stranded RNA, and the coat protein mRNA is also encapsidated. The RNAs each have a capped 5 '-end, and a tRNA-like structure (which accepts tyrosine) at the 3 '-end. Virus assembly occurs in the cytoplasm. The complete nucleotide sequence of BMV has been identified and characterized as described by Ahlquist et al., J. Mol. Biol. 153:23 (1981).
RICE NECROSIS VIRUS Rice Necrosis virus is a member ofthe Potato Virus Y Group or Potyviruses. The Rice Necrosis virion is a flexuous filament comprising one type of coat protein (molecular weight about 32,000 to about 36,000) and one molecule of linear positive-sense single-stranded RNA. The Rice Necrosis virus is transmitted by Polymyxa oraminis (a eukaryotic intracellular parasite found in plants, algae and fungi).
GEMINIVIRUSES Geminiviruses are a group of small, single-stranded DNA-containing plant viruses with virions of unique morphology. Each virion consists of a pair of isometric particles (incomplete icosahedral), composed of a single type of protein (with a molecular weight of about 2.7-3.4X104). Each geminivirus virion contains one molecule of circular, positive-sense, single-stranded DNA. In some geminiviruses (i.e., Cassava latent virus and bean golden mosaic virus) the genome appears to be bipartite, containing two single-str.anded DNA molecules.
POTYVIRUSES Potyviruses are a group of plant viruses which produce polyprotein. A particularly preferred potyvirus is tobacco etch virus (TEV). TEV is a well characterized potyvirus and contains a positive-strand RNA genome of 9.5 kilobases encoding for a single, large polyprotein that is processed by three virus- specific proteinases. The nuclear inclusion protein "a" proteinase is involved in the maturation of several replication-associated proteins and capsid protein. The helper component-proteinase (HC-Pro) and 35-kDa proteinase both catalyze cleavage only at their respective C-termini. The proteolytic domain in each of these proteins is located near the C-terminus. The 35-kDa proteinase and HC-Pro derive from the N-terminal region ofthe TEV polyprotein.
Other particularly useful viruses according to some embodiments ofthe present invention feature viruses which are associated with animal hosts. Some of these viruses are discussed, infra.
ALPHAVIRUSES The alphaviruses are a genus ofthe viruses ofthe family Togaviridae. Almost all ofthe members of this genus are transmitted by mosquitoes, and may cause diseases in man or animals. Some of the alphaviruses are grouped into three serologicallly defined complexes. The complex-specific antigen is associated with the El protein ofthe virus, and the species-specific antigen is associated with the E2 protein ofthe virus.
The Semliki Forest virus complex includes Bebaru virus, Chikungunya Fever virus, Getah virus, Mayaro Fever virus, O'nyongnyong Fever virus, Ross River virus, Sagiyama virus, Semliki Forest virus and Una virus. The Venezuelan Equine Encephalomyelitis virus complex includes Cabassou virus, Everglades virus, Mucambo virus, Pixuna virus and Venezuelan Equine Encephalomyelitis virus. The Western Equine Encephalomyelitis virus complex includes Aura virus, Fort Morgan virus, Highlands J virus, Kyzylagach virus, Sindbis virus, Western Equine Encephalomyelitis virus and Whataroa virus.
The alphaviruses contain an icosahedral nucleocapsid consisting of 180 copies of a single species of capsid protein complexed with a plus-stranded mRNA. The alphaviruses mature when preassembled nucleocapsid is surrounded by a lipid envelope containing two virus-encoded integral membrane glycoproteins, called El and E2. The envelope is acquired when the capsid, assembled in the cytoplasm, buds through the plasma membrane. The envelope consists of a lipid bilayer derived from the host cell.
The mRNA encodes a glycoprotein which is cotranslationally cleaved into nonstructural proteins and structural proteins. The 3' one-third ofthe RNA genome consists of a 26S mRNA which encodes for the capsid protein and the E3, E2, K6 and El glycoproteins. The capsid is cotranslationally cleaved from the E3 protein. It is hypothesized that the amino acid triad of His, Asp and Ser at the COOH terminus ofthe capsid protein comprises a serine protease responsible for cleavage. Hahn et al, Proc. Natl. Acad. Sci. USA 82:4648 (1985). Cotranslational cleavage also occurs between E2 and K proteins. Thus, two proteins PE2 which consists of E3 and E2 prior to cleavage and an El protein comprising K6 and El are formed. These proteins are cotranslationally inserted into the endoplasmic reticulum ofthe host cell, glycosylated and transported via the Golgi apparatus to the plasma membrane where they can be used for budding. At the point of virion maturation the E3 and E2 proteins are separated. The El and E2 proteins are incorporated into the lipid envelope. It has been suggested that the basic amino-terminal half ofthe capsid protein stabilizes the interaction of capsid with genomic RNA or interacts with genomic RNA to initiate a encapsidation, Strauss et al, in the Togaviridae and Flaviviridaei. Ed. S. Schlesinger & M. Schlesinger, Plenum Press, New York, pp. 35-90 (1980). These suggestions imply that the origin of assembly is located either on the unencapsidated genomic RNA or at the amino-terminus ofthe capsid protein. It has been suggested that E3 and K6 function as signal sequences for the insertion of PE2 and El, respectively, into the endoplasmic reticulum. Work with temperature sensitive mutants of alphaviruses has shown that failure of cleavage ofthe structural proteins results in failure to form mature virions. Lindquist et al, Virology 151:10 (1986) characterized a temperature sensitive mutant of Sindbis virus, ts 20. Temperature sensitivity results from an A-U change at nucleotide 9502. The ts lesion present cleavage of PE2 to E2 and E3 and the final maturation of progeny virions at the nonpermissive temperature. Hahn et al, supra, reported three temperature sensitive mutations in the capsid protein which prevents cleavage ofthe precursor polyprotein at the nonpermissive temperature. The failure of cleavage resulted in no capsid formation and very little envelope protein.
Defective interfering RNAs (Dl particles) of Sindbis virus are helper- dependent deletion mutants which interfere specifically with the replication ofthe homologous standard virus. Perrault, J., Microbiol. Immunol. 93:151 (1981). Dl particles have been found to be functional vectors for introducing at least one foreign gene into cells. Levis, R., Proc. Natl. Acad. Sci. USA 84:4811 (1987). It has been found that it is possible to replace at least 1689 internal nucleotides of a Dl genome with a foreign sequence and obtain RNA that will replicate and be encapsidated. Deletions ofthe Dl genome do not destroy biological activity. The disadvantages ofthe system are that Dl particles undergo apparently random rearrangements ofthe internal RNA sequence and size alterations. Monroe et al, J. Virology 49:865 (1984). Expression of a gene inserted into the internal sequence is not as high as expected. Levis et al, supra, found that replication ofthe inserted gene was excellent but translation was low. This could be the result of competition with whole virus particles for translation sites and/or also from disruption ofthe gene due to rearrangement through several passages.
Two species of mRNA are present in alphavirus-infected cells: A 42S mRNA region, which is packaged into nature virions and functions as the message for the nonstructural proteins, and a 26S mRNA, which encodes the structural polypeptides. the 26S mRNA is homologous to the 3' third ofthe 42S mRNA. It is translated into a 13 OK polyprotein that is cotranslationally cleaved and processed into the capsid protein and two glycosylated membrane proteins, El and E2. The 26S mRNA of Eastern Equine Encephalomyelitis (EEE) strain 82V- 2137 was cloned and analyzed by Chang et al, J. Gen. Virol. 68:2129 (1987). The 26S mRNA region encodes the capsid proteins, E3, E2, 6K and El. The .amino terminal end ofthe capsid protein is thought to either stabilize the interaction of capsid with mRNA or to interact with genomic RNA to initiate encapsidation.
Uncleaved E3 and E2 proteins called PE2 is inserted into the host endoplasmic reticulum during protein synthesis. The PE2 is thought to have a region common to at least five alphaviruses which interacts with the viral nucleocapsid during morphogenesis.
The 6K protein is thought to function as a signal sequence involved in translocation ofthe El protein through the membrane. The El protein is thought to mediate virus fusion and anchoring ofthe El protein to the virus envelope.
RHINOVIRUSES
The rhino viruses are a genus of viruses ofthe family Picornaviridae. The rhinoviruses are acid-labile, and are therefore rapidly inactivated at pH values of less than about 6. The rhinoviruses commonly infect the upper respiratory tract of mammals. Human rhinoviruses are the major causal agents ofthe common cold, and many serotypes are known. Rhinoviruses may be propagated in various human cell cultures, and have an optimum growth temperature of about 33°C. Most strains of rhinoviruses are stable at or below room temperature and can withstand freezing. Rhinoviruses can be inactivated by citric acid, tincture of iodine or phenol/alcohol mixtures.
The complete nucleotide sequence of human rhinovirus 2 (HRV2) has been sequenced. The genome consists of 7102 nucleotides with a long open reading frame of 6450 nucleotides which is initiated 611 nucleotides from the 5'- end and stops 42 nucleotides from the poly(A) tract. Three capsid proteins and their cleavage cites have been identified.
Rhinovirus RNA is single-stranded and positive-sense. The RNA is not capped, but is joined at the 5 '-end to a small virus-encoded protein, virion-protein genome-linked (VPg). Translation is presumed to result in a single polyprotein which is broken by proteolytic cleavage to yield individual virus proteins. An icosahedral viral capsid contains 60 copies each of 4 virus proteins VP1, VP2, VP3 and VP4 and surrounds the RNA genome. Medappa, K., Virology 44:259 (1971). Analysis of the 610 nucleotides preceding the long open reading frame shows several short open reading frames. However, no function can be assigned to the translated proteins since only two sequences show homology throughout HRV2, HRV14 and the 3 sterotypes of poliovirus. These two sequences may be critical in the life cycle ofthe virus. They are a stretch of 16 bases beginning at 436 in HRV2 and a stretch of 23 bases beginning at 531 in HRV2. Cutting or removing these sequences from the remainder ofthe sequence for non-structural proteins could have an unpredictable effect upon efforts to assemble a mature virion.
The capsid proteins of HRV2: VP4, VP2, VP3 and VP1 begin at nucleotide 611, 818, 1601 .and 2311, respectively. The cleavage point between VP1 and P2A is thought to be .around nucleotide 3255. Skern et al, Nucleic Acids Research 13:2111 (1985).
Human rhinovirus type 89 (HRV89) is very similar to HRV2. It contains a genome of 7152 nucleotides with a single large open reading frame of 2164 condons. Translation begins at nucleotide 619 and ends 42 nucleotides before the poly(A) tract. The capsid structural proteins, VP4, VP2, VP3 and VP1 are the first to be translated. Translation of VP4 begins at 619. Cleavage cites occur at:
VP4/VP2 825 determined
VP2/VP3 1627 determined
VP3/VP1 2340 determined
VP1/P2-A 3235 presumptive
Duechler et al, Proc. Natl. Acad. Sci. USA 84:2605 (1987).
POLIOVIRUSES
Polioviruses are the causal agents of poliomyelitis in man, and are one of three groups of enteroviruses. Entero viruses are a genus ofthe family Picornaviridae (also the family of rhinoviruses). Most enteroviruses replicate primarily in the mammalian gastrointestinal tract, although other tissues may subsequently become infected. Many enteroviruses can be propagated in primarily cultures of human or monkey kidney cells and in some cell lines (e.g.
HeLa, Vero, WT-e8). Inactivation ofthe enteroviruses may be accomplished with heat (about 50°C), formaldehyde (3%), hydrochloric acid (0.1N) or chlorine (ca. 0.3-0.5 ppm free residual Cl2).
The complete nucleotide sequence of poliovirus PV2 (Sab) and PV3 (Sab) have been determined. They are 7439 and 7434 nucleotide in length, respectively. There is a single long open reading frame which begins more than
700 nucleotides from the 5 '-end. Poliovirus translation produces a single polyprotein which is cleaved by proteolytic processing. Kitamura et al, Nature
291:547 (1981).
It is speculated that these homologous sequences in the untranslated regions play an essential role in viral replication such as:
1. viral-specific RNA synthesis; 2. viral-specific protein synthesis; and
3. packaging
Toyoda, H. et al, J. Mol. Biol. 174:561 (1984).
The structures ofthe serotypes of poliovirus have a high degree of sequence homology. Their coding sequences code for the same proteins in the same order. Therefore, genes for structural proteins are similarly located. In PV1, PV2 and PV3, the polyprotein begins translation near the 750 nucleotide. The four structural proteins VP4, VP2, VP3 and VPI begin at about 745, 960, 1790 and 2495, respectively, with VPI ending at about 3410. They are separated in vivo by proteolytic cleavage, rather than by stop/start codons.
SIMIAN VIRUS 40 Simian virus 40 (SV40) is a virus ofthe genus Polyomavirus, and was originally isolated from the kidney cells of the rhesus monkey. The virus is commonly found, in its latent form, in such cells. Simian virus 40 is usually non- pathogenic in its natural host.
Simian virus 40 virions are made by the assembly of three structural proteins, VPI, VP2 and VP3. Girard et al, Biochem. Biophys. Res. Commun. 40:97 (1970); Prives et al, Proc. Natl. Acad. Sci. USA 21 :302 (1974); and Jacobson et al, Proc. Natl. Acad. Sci. USA 73:2747 (1976). The three conesponding viral genes are organized in a partially overlapping manner. They constitute the late genes portion ofthe genome. Tooze, J., Molecular Biology of Tumor Viruses Appendix A The SV40 Nucleotide Sequence, 2nd Ed. Part 2, pp. 799-831 (1980), Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. Capsid proteins VP2 and VP3 .are encoded by nucleotides 545 to 1601 and 899 to 1601, respectively, and both are read in the same frame. VP3 is therefore a subset of VP2. Capsid protein VPI is encoded by nucleotides 1488-2574. The end of the VP2-VP3 open reading frame therefore overlaps the VPI by 113 nucleotides but is read in an alternative frame. Tooze, J., supra. Wychowski et al, J. Virology 61:3862 (1987).
ADENOVIRUSES Adenovirus type 2 is a member ofthe adenovirus family or adeno virus.
This family of viruses are non-enveloped, icosahedral, linear, double-stranded DNA-containing viruses which infect mammals or birds.
The adenovirus virion consists of an icosahedral capsid enclosing a core in which the DNA genome is closely associated with a basic (argmine-rich) viral polypeptide VII. The capsid is composed of 252 capsomeres: 240 hexons
(capsomers each surrounded by 6 other capsomers) and 12 pentons (one at each vertex, each surrounded by 5 'peripentonal' hexons). Each penton consists of a penton base (composed of viral polypeptide III) associated with one (in mammalian adenoviruses) or two (in most avian adenoviruses) glycoprotein fibres (viral polypeptide IV). The fibres can act as haemagglutinins and are the sites of attachment ofthe virion to a host cell-surface receptor. The hexons each consist of three molecules of viral polypeptide II; they make up the bulk ofthe icosahedron. Various other minor viral polypeptides occur in the virion.
The adenovirus dsDNA genome is covalently linked at the 5 '-end of each strand to a hydrophobic 'terminal protein', TP (molecular weight about 55,000 Da); the DNA has an inverted terminal repeat of different length in different adenoviruses. In most adenoviruses examined, the 5 '-terminal residue is dCMP. During its replication cycle, the virion attaches via its fibres to a specific cell-surface receptor, and enters the cell by endocytosis or by direct penetration of the plasma membrane. Most ofthe capsid proteins are removed in the cytoplasm. The virion core enters the nucleus, where the uncoating is completed to release viral DNA almost free of virion polypeptides. Virus gene expression then begins. The viral dsDNA contains genetic information on both strands. Early genes (regions Ela, Elb, E2a, E3, E4) are expressed before the onset of viral DNA replication. Late genes (regions LI, L2, L3, L4 .and L5) are expressed only after the initiation of DNA synthesis. Intermediate genes (regions E2b and Iva2) are expressed in the presence or absence of DNA synthesis. Region Ela encodes proteins involved in the regulation of expression of other early genes, and is also involved in transformation. The RNA transcripts are capped (with m7G5ppp5N) .and polyadenylated in the nucleus before being transferred to the cytoplasm for translation. Viral DNA replication requires the terminal protein, TP, as well as virus- encoded DNA polymerase and other viral and host proteins. TP is synthesized as an 80K precursor, pTP, which binds covalently to nascent replicating DNA strands. pTP is cleaved to the mature 55K TP late in virion assembly; possibly at this stage, pTP reacts with a dCTP molecule and becomes covalently bound to a dCMP residue, the 3' OH of which is believed to act as a primer for the initiation of DNA synthesis. Late gene expression, resulting in the synthesis of viral structural proteins, is accompanied by the cessation of cellular protein synthesis, and virus assembly may result in the production of up to 105 virions per cell. In addition to the plant and animal viruses described above, viral expression system in bacteria and yeast cells may also be employed. See
Munishkin et al, Nature 333(6172):473-5 (1988) and Priano et al, J. Mol. Biol. 221(3):299-310 (1997) for viral expression system in bacteria and Janda et al, Cell 72(6):961-70 (1993) and Ishikawa et al, J. Virol. 7_l(10):7781-90 (1997) for viral expression in yeast. The teachings of these references are incoφorated herein by reference.
The nucleic acid of any suitable plant virus can be utilized to prepare a recombinant plant viral nucleic acid for use in the present invention, and the foregoing are only exemplary of such suitable plant viruses. The nucleotide sequence ofthe plant virus is modified, using conventional techniques, by the insertion of one or more subgenomic promoters into the plant viral nucleic acid. The subgenomic promoters are capable of functioning in the specific host plant. For example, if the host is tobacco, TMV, TEV, or other viruses containing subgenomic promoter may be utilized. The inserted subgenomic promoters should be compatible with the TMV nucleic acid and capable of directing transcription or expression of adjacent nucleic acid sequences in tobacco. The native coat protein gene could also be retained and a non-native nucleic acid sequence inserted within it to create a fusion protein. The native or non-native coat protein gene is utilized in the recombinant plant viral nucleic acid. Whichever non-native nucleic acid is utilized may be positioned adjacent its natural subgenomic promoter or adjacent one ofthe other available subgenomic promoters. The non-native coat protein, as is the case for the native coat protein, is capable of encapsidating the recombinant plant viral nucleic acid and providing for systemic spread ofthe recombinant plant viral nucleic acid in the host plant. The coat protein is selected to provide a systemic infection in the plant host of interest. For example, the TMV-O coat protein provides systemic infection in N benthamiana, whereas TMV-U1 coat protein provides systemic infection in N tabacum. The recombinant plant viral nucleic acid is prepared by cloning a viral nucleic acid. If the viral nucleic acid is DΝA, it can be cloned directly into a suitable vector using conventional techniques. One technique is to attach an origin of replication to the viral DΝA which is compatible with the cell to be transfected. If the viral nucleic acid is RΝA, a full-length DΝA copy ofthe viral genome is first prepared by well-known procedures. For example, the viral RΝA is transcribed into DΝA using reverse transcriptase to produce subgenomic DΝA pieces, and a double-stranded DΝA made using DΝA polymerases. The cDΝA is then cloned into appropriate vectors and cloned into a cell to be transfected. Alternatively, the cDΝA's ligated into the vector may be directly transcribed into infectious RΝA in vitro and inoculated onto the plant host. The cDΝA pieces are mapped and combined in proper sequence to produce a full-length DΝA copy of the viral RΝA genome, if necessary. DΝA sequences for the subgenomic promoters, with or without a coat protein gene, are then inserted into the nucleic acid at non-essential sites, according to the particular embodiment ofthe invention utilized. Non-essential sites are those that do not affect the biological properties ofthe plant viral nucleic acid. Since the RNA genome is the infective agent, the cDNA is positioned adjacent a suitable promoter so that the RNA is produced in the production cell. The RNA is capped using conventional techniques, if the capped RNA is the infective agent. In addition, the capped RNA can be packaged in vitro with added coat protein from TMV to make assembled virions. These assembled virions can then be used to inoculate plants or plant tissues. Alternatively, an uncapped RNA may also be employed in the embodiments ofthe present mvention. Contrary to the practiced art in scientific literature .and in issued patent (Ahlquist et al, U.S. Patent No. 5,466,788), uncapped transcripts for virus expression vectors are infective on both plants and in plant cells. Capping is not a prerequisite for establishing an infection of a virus expression vector in plants, although capping increases the efficiency of infection. In addition, nucleotides may be added between the transcription start site ofthe promoter and the start ofthe cDNA of a viral nucleic acid to construct an infectious viral vector. One or more nucleotides may be added. In a preferred embodiment ofthe present invention, the inserted nucleotide sequence contains a G at the 5 '-end. In a particularly preferred embodiment, the inserted nucleotide sequence is GNN, GTN, or their multiples, (GNN)X or (GTN)X.
Another feature of these recombinant plant viral nucleic acids useful in the present invention is that they further comprise one or more nucleic acid sequences capable of being transcribed in the plant host. These nucleic acid sequences may be native nucleic acid sequences which occur in the host organism or they may be non-native nucleic acid sequences which do not normally occur in the host organism. The nucleic acid sequence is placed adjacent one ofthe non- native viral subgenomic promoters and/or the native coat protein gene promoter depending on the particular embodiment used. The nucleic acid is inserted by conventional techniques, or the nucleic acid sequence can be inserted into or adjacent the native coat protein coding sequence such that a fusion protein is produced. The nucleic acid sequence which is transcribed may be transcribed as an RNA which is capable of regulating the expression of a phenotypic trait by an anti-sense or a positive-sense mechanism. Alternatively, the nucleic acid sequence in the recombinant plant viral nucleic acid may be transcribed and translated in the plant host to produce a phenotypic trait. The nucleic acid sequence(s) may also code for the expression of more than one phenotypic trait. The recombinant plant viral nucleic acid containing the nucleic acid sequence is constructed using conventional techniques such that the nucleic acid sequence(s) are in proper orientation to whichever viral subgenomic promoter is utilized.
A double-stranded DNA ofthe recombinant plant viral nucleic acid or a complementary copy ofthe recombinant plant viral nucleic acid is cloned into the cell to be transfected. If the viral nucleic acid is a RNA molecule, the nucleic acid (cDNA) is first attached to a promoter which is compatible with the production cell. The recombinant plant viral nucleic acid can then be cloned into any suitable vector which is compatible with the production cell. In this manner, only RNA copies ofthe chimeric nucleotide sequence are produced in the production cell. For example, the CaMV promoter can be used when plant cells are to be transfected. Alternatively, the recombinant plant viral nucleic acid is inserted in a vector adjacent a promoter which is compatible with the production cell. If the viral nucleic acid is a DNA molecule, it can be cloned directly into a production cell by attaching it to an origin of replication which is compatible with the cell to be transfected. In this manner, DNA copies ofthe chimeric nucleotide sequence are produced in the transfected cell.
A further alternative when creating the recombinant plant viral nucleic acid is to prepare more than one nucleic acid (i.e., to prepare the nucleic acids necessary for a multipartite viral vector construct). In this case, each nucleic acid would require its own origin of assembly. Each nucleic acid could be prepared to contain a subgenomic promoter and a non-native nucleic acid.
Alternatively, the insertion of a non-native nucleic acid into the nucleic acid of a monopartite virus may result in the creation of two nucleic acids (i.e., the nucleic acid necessary for the creation of a bipartite viral vector). This would be advantageous when it is desirable to keep the replication and transcription or expression ofthe nucleic acid of interest separate from the replication and translation of some ofthe coding sequences ofthe native nucleic acid. Each nucleic acid would have to have its own origin of assembly. The host can be infected with the recombinant plant virus by conventional techniques. Suitable techniques include, but are not limited to, leaf abrasion, abrasion in solution, high velocity water spray and other injury of a host as well as imbibing host seeds with water containing the recombinant plant virus. More specifically, suitable techniques include:
(a) Hand Inoculations. Hand inoculations of the encapsidated vector are performed using a neutral pH, low molarity phosphate buffer, with the addition of celite or carborundum (usually about 1%). One to four drops ofthe preparation is put onto the upper surface of a leaf and gently rubbed.
(b) Mechanized Inoculations of Plant Beds. Plant bed inoculations are performed by spraying (gas-propelled) the vector solution into a tractor- driven mower while cutting the leaves. Alternatively, the plant bed is mowed and the vector solution sprayed immediately onto the cut leaves. (c) High Pressure Spray of Single Leaves. Single plant inoculations can also be performed by spraying the leaves with a narrow, directed spray (50 psi, 6-12 inches from the leaf) containing approximately 1% carborundum in the buffered vector solution.
(d) Vacuum Infiltration. Inoculations may be accomplished by subjecting the host organism to a substantially vacuum pressure environment in order to facilitate infection.
(e) High Speed Robotics Inoculation. Especially applicable when the organism is a plant, individual organisms may be grown in mass array such as in microtiter plates. Machinery such as robotics may then be used to transfer the nucleic acid of interest.
An alternative method for introducing a recombinant plant viral nucleic acid into a plant host is a technique known as agroinfection or Agrobacterium- mediated transformation (sometimes called Agro-infection) as described by Grimsley et al, Nature 325:177 (1987). This technique makes use of a common feature of Agrobacterium which colonizes plants by transferring a portion of their DNA (the T-DNA) into a host cell, where it becomes integrated into nuclear DNA. The T-DNA is defined by border sequences which are 25 base pairs long, and any DNA between these border sequences is transfened to the plant cells as well. The insertion of a recombinant plant viral nucleic acid between the T-DNA border sequences results in transfer ofthe recombinant plant viral nucleic acid to the plant cells, where the recombinant plant viral nucleic acid is replicated, and then spreads systemically through the plant. Agro-infection has been accomplished with potato spindle tuber viroid (PSTV) (Gardner et al, Plant Mol. Biol. 6:221 (1986); CaV (Grimsley et al, Proc. Natl. Acad. Sci. USA 83:3282 (1986)); MSV (Grimsley et al, Nature 325:177 (1987)), and Lazarowitz, S., Nucl. Acids Res. 16:229 (1988)) digitaria streak virus (Donson et al, Virology 162:248 (1988)), wheat dwarf virus (Hayes et al, J. Gen. Virol 69:891 (1988)) and tomato golden mosaic virus (TGMV) (Elmer et al, Plant Mol. Biol. 10:225 (1988) and Gardiner et al, EMBO J. 7:899 (1988)). Therefore, agro-infection of a susceptible plant could be accomplished with a virion containing a recombinant plant viral nucleic acid based on the nucleotide sequence of any ofthe above viruses. Particle bombardment or electrosporation or any other methods known in the art may also be used.
Infection may also be attained by placing a selected nucleic acid sequence into an organism such as E. coli, or yeast, either integrated into the genome of such organism or not and then applying the organism to the surface ofthe host organism. Such a mechanism may thereby produce secondary transfer ofthe selected nucleic acid sequence into the host organism. This is a particularly practical embodiment when the host organism is a plant. Likewise, infection may be attained by first packaging a selected nucleic acid sequence in a pseudovirus. Such a method is described in WO 94/10329, the teachings of which are incoφorated herein by reference. Though the teachings of this reference may be specific for bacteria, those of skill in the art will readily appreciate that the same procedures could easily be adapted to other organisms.
Those of skill in the art will readily understand that there are many methods to determine the function of a nucleic acid once expression in a host, such as a plant is attained. In one embodiment the function of a nucleic acid may be determined by complementation analysis. That is, the function ofthe nucleic acid of interest may be determined by observing the endogenous gene or genes whose function is replaced or augmented by introducing the nucleic acid of interest. A discussion of this principle is provided by Napoli et al, The Plant Cell 2:279-289 (1990) which is incoφorated herein by reference. Further teachings in these regards are provided by WO 97/42210, the disclosure of which is also incoφorated herein by reference. In a second embodiment, the function of a nucleic acid may be determined by analyzing the biochemical alterations in the accumulation of substrates or products from enzymatic reactions according to any one ofthe means known by those skilled in the art. In a third embodiment, the function of a nucleic acid may be determined by observing phenotypic changes in the host by methods including moφhological, macroscopic or microscopic analysis. In a fourth embodiment, the function of a nucleic acid may be determined by observing the change in biochemical pathways which may be modified in the host as a result ofthe local and/or systemic expression ofthe non- native nucleic acids. In a fifth embodiment, the function of a nucleic acid may be determined utilizing techniques known by those skilled in the art to observe inhibition of gene expression in the cytoplasm of cells as a result of expression of the non-native nucleic acid.
A particularly useful way to determine gene function is by observing the phenotype in a whole plant when a particular gene function has been silenced. Useful phenotypic traits in plant cells which may be observed microscopically, macroscopically or by other methods include, but are not limited to, improved tolerance to herbicides, improved tolerance to extremes of heat or cold, drought, salinity or osmotic stress; improved resistance to pests (insects, nematodes or arachnids) or diseases (fungal, bacterial or viral) production of enzymes or secondary metabolites; male or female sterility; dwarfhess; early maturity; improved yield, vigor, heterosis, nutritional qualities, flavor or processing properties, and the like. Other examples include the production of important proteins or other products for commercial use, such as lipase, melanin, pigments, alkaloids, antibodies, hormones, pharmaceuticals, antibiotics and the like. Another useful phenotypic trait is the production of degradative or inhibitory enzymes, such as are utilized to prevent or inhibit root development in malting barley or that determine response or non-response to a systemically administered drug in a human. The phenotypic trait may also be a secondary metabolite whose production is desired in a bioreactor. Another particularly useful means to determine function of nucleic acids transfected into a host is to observe the effects of gene silencing. Traditionally, functional gene knockout has been achieved following inactivation due to insertion of transposable elements or random integration of T-DNA into the chromosome, followed by characterization of conditional, homozygous-recessive mutants obtained upon backcrossing. Some teachings in these regards are provided by WO 97/42210 which is herein incoφorated by reference. As an alternative to traditional knockout analysis, an EST/DNA library from an organism, for example Arabidopsis thaliana, may be assembled into a plant viral transcription plasmid. The DNA sequences in the transcription plasmid library may then be introduced into plant cells as part of a functional RNA virus which post-transcriptionally silences the homologous target gene. The EST/DNA sequences may be introduced into a plant viral vector in either the plus or minus sense orientation, and the orientation can be either directed or random based on the cloning strategy. A high-throughput, automated cloning scheme based on robotics may be used to assemble and characterize the library. In addition, double stranded RNA may also be an effective stimulator of gene silencing/co- suppression in transgenic plant. Gene silencing/co-suppression of plant genes may be induced by delivering an RNA capable of base pairing with itself to form double stranded regions. This approach could be used with any plant or non- plant gene to assist in the identification ofthe function of a particular gene sequence.
A particularly troublesome problem with gene silencing in plant hosts is that many plant genes exist in a multigene family. Therefore, effective silencing of a gene function may be especially problematic. According to the present invention, however, nucleic acids may be inserted into the genome to effectively silence a particular gene function or to silence the function of a multigene family. It is presently believed that about 20% of plant genes exist in multigene families. A single nucleotide sequence of about 20 to 100 or more bases having about 70% or more homology to a gene may silence an entire plant gene family having two or more homologous genes.
A detailed discussion of some aspects ofthe "gene silencing" effect is provided in co-pending U.S. Patent Application Serial No. 08/260,546 (WO95/34668 published 12/21/95) the disclosure of which is incoφorated herein by reference. RNA can reduce the expression of a target gene through inhibitory RNA interactions with target mRNA that occur in the cytoplasm and/or the nucleus of a cell. Full-length cDNAs may be accessed from public and private repositories or extracted from field samples for insertion of unknown open reading frames into viral vectors for expression of nucleic acids in the host organism and thereby utilized as an alternative to antisense gene knockout. This technology may be implemented by PCR amplification and cloning of all cDNAs that do not share homology with gene sequences in public and or private databases. The cDNAs may be expressed in plants transfected with one or more plant viral vectors for subsequent analysis of novel phenotype ofthe whole plant (biochemical and moφhological). Selected cDNA sequences from maize, rice, soybean canola and other crop species may be used to assemble the cDNA libraries. This method may thus be used to search for useful dominant gene phenotypes from novel cDNA libraries through the gene expression.
An EST/cDNA library from an organism such as Arabidopsis thaliana may be assembled into a plant viral transcription plasmid background. The cDNA sequences in the transcription plasmid library can then be introduced into plant cells as cytoplasmic RNA in order to post-transcriptionally silence the endogenous genes. The EST/cDNA sequences may be introduced into the plant viral transcription plasmid in either the plus or anti-sense orientation (or both), and the orientation can be either directed or random based on the cloning strategy. A high-throughput, automated cloning strategy using robotics can be used to assemble the library. The EST clones can be inserted behind a duplicated subgenomic promoter such that they are represented as subgenomic transcripts during viral replication in plant cells. Alternatively, the EST/cDNA sequences can be inserted into the genomic RNA of a plant viral vector such that they are represented as genomic RNA during the viral replication in plant cells. The library of EST clones is then transcribed into infectious RNA and inoculated onto individual platelets of Arabidopsis thaliana (or other plant species). The viral RNA containing the EST/cDNA sequences contributed from the original library are now present in a sufficiently high concentration in the cytoplasm such that they cause post-transcriptional gene silencing ofthe endogenous plant-gene homologs. Since the replication mechanism ofthe virus produces both sense and antisense RNA sequences, the orientation ofthe EST/cDNA insert is normally irrelevant in terms of producing the desired gene-silenced phenotype in the tissue. Partial cDNA sequences cloned into a plant viral vector in the sense orientation have previously been shown to also confer a gene silencing phenotype (Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679 (1995)), the teachings of which are incoφorated herein by reference. The actual mechanism of gene silencing has not been fully determined. This phenomenon may be similar to the gene silencing via cosuppression observed in transgenic plants.
The plant tissue may then be taken for sophisticated biochemical analysis in order to determine which metabolic pathway has been affected by the EST/DNA gene silencing, and in particular, which steps in a given metabolic pathway have been affected by the EST/DNA gene silencing. Biochemical analysis may be done, for example, in a high-throughput, fully automated fashion using robotics. Suitable biochemical analysis may include MALDI-TOF, LC/MS, GC/MS, two-dimensional IEF/SDS-PAGE, ELISA or other methods of analyses. The clones in the EST/plant viral vector library may then be functionally classified based on metabolic pathway affected or visual/selectable phenotype produced in the plant. This process enables the rapid determination of gene function for unknown EST/DNA sequences of plant origin. Furthermore, this process can be used to rapidly confirm function of full-length DNA's of unknown gene function. Functional identification of unknown ESTDNA sequences in a plant library may then rapidly lead to identification of similar unknown sequences in expression libraries for other crop species based on sequence homology.
Large amounts of DNA sequence information is being generated in the public domain and may be entered into a relational database. Links may be made between sequences from various species predicted to carry out similar biochemical or regulatory functions. Links may also be generated between predicted enzymatic activities and visually displayed biochemical and regulatory pathways. Likewise, links may be generated between predicted enzymatic or regulatory activity and known small molecule inhibitors, activators, substrates or substrate analogs. Phenotypic data from expression libraries expressed in transfected hosts maybe automatically linked within such a relational database. Genes with similar predicted roles of interest in other crop plants or crop plant pests may thereby be more rapidly discovered. A complete classification scheme of gene functionality for a fully sequenced eukaryotic organism has been established for yeast. This classification scheme may be modified for plants and divided into the appropriate categories. Such organizational structure may be utilized to rapidly identify herbicide target loci which may confer dominant lethal phenotypes, and thereby is useful in helping to design rational herbicide programs.
A second aspect ofthe present invention is a method of silencing endogenous genes in a host by introducing nucleic acids into the host by way of a viral nucleic acid suitable to produce the local and systemic expression ofthe nucleic acid of interest. In one embodiment, the host is a plant, but those skilled in the art will imderstand that other hosts may also be utilized. This method utilizes the principle of post-transcription gene silencing ofthe endogenous host gene homolog as described above. Since the replication mechanism produces both sense and anti-sense RNA sequences as disclosed above, the orientation of the non-native nucleic acid insert is not crucial to providing gene silencing. More information describing some aspects ofthe "gene silencing" effect is provided in co-pending U.S. Patent Application Serial No. 08/260,546 (WO 95/34668 published 12/21/95) the disclosure of which is incoφorated herein by reference. RNA can reduce the expression of a target gene through inhibitory RNA interactions with target mRNA that occur in the cytoplasm and/or the nucleus of a cell.
Silencing of endogenous genes can be achieved with homologous (but not identical) sequences from distant plant species. For example, the Nicotiana benthamiana gene for phytoene desaturase (PDS) may be silenced by fransfection with a partial tomato cDNA for PDS (cloned in either the positive or antisense orientation). The tomato PDS cDNA is 92% homologous at the nucleotide level yet is still able to confer efficient gene silencing in an unrelated plant species (Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679 (1995)). Identification of EST/cDNA gene function in Arabidopsis thaliana could then be extrapolated to similar EST/cDNA sequences of unknown function that exist in other libraries (e.g., soybean, maize, rice, oilseed rape, etc.).
A third aspect ofthe present invention is a method for selecting desired functions of RNAs and proteins by the use of virus vectors to express libraries of nucleic acid sequence variants. Libraries of sequence variants may be generated by means of in vitro mutagenenisis and/or recombination. Rapid in vitro evolution can be used to improve virus-specific or protein-specific functions. In particular, plant RNA virus expression vectors may be used as tools to bear libraries containing variants of nucleic acid, genes from virus, plant or other sources, and to be applied to plants or plant cells such that the desired altered effects in the RNA or protein products can be determined, selected and improved. In a preferred embodiment, nucleic acid shuffling techniques may be employed to construct shuffled gene libraries. Random, semi-random or known sequences of virus origin may also be inserted in virus expression vectors between native virus sequences and foreign gene sequences, to increase the genetic stability of foreign genes in expression vectors as well as the translation ofthe foreign gene and the stability ofthe mRNA encoding the foreign gene in vivo. The desired function of RNA and protein may include the promoter activities, replication properties, translational efficiencies, movement properties (local and systemic), signaling pathway, or virus host range, among others. The desired function alteration can be identified by assaying infected plants and the nature of mutation can be determined by analysis of sequence variants in the virus vector.
Methods to increase the representation of gene sequences in virus expression libraries may also be achieved by bypassing the genetic bottleneck of propagation in E. coli. For example, in one ofthe preferred embodiments ofthe instant invention, cell-free methods may be used to clone sequence libraries or individual arrayed sequences into virus expression vectors and reconstruct an infectious virus, such that the final ligation product can be transcribed and the resulting RNA can be used for plant or plant cell inoculation/infection with the output being gene function discovery or protein production.
Techniques to screen sequence libraries can be introduced into RNA viruses or RNA virus vectors as populations or individuals in parallel to identify individuals with novel and augmented virus-encoded functions in replication and virus movement, foreign gene sequence retention in vectors and proper folding, activity and expression of protein products, novel gene expression, effects on host metabolism, and resistance or susceptibility of plants to exogenous agents. Variation in the sequence of a native virus gene(s) or heterologous nucleotide sequence(s) may be introduced into an RNA virus or an RNA virus expression vector by many methods as a means to screen a population of variants in batch or individuals in parallel for novel properties exhibited by the virus itself or conferred on the host plant or cell by the virus vector. Variant populations can be transfected as populations or individual clones into "host": 1) protoplasts; 2) whole plants; or 3) inoculated leaves of whole plants and screened for various traits including protein expression (increase or decrease), RNA expression (increase or decrease), secondary metabolites or other host property gained or loss as a result ofthe virus infection.
For treatment of hosts with agents that result in cell death or down regulation in general metabolic function, a virus vector, which simultaneously expressed the green fluorescent protein (GFP) or other selectable marker gene and the variant sequence, is used to screen quantitatively for levels of resistance or sensitivity to the agent in question conferred upon the host by the variant sequence expressed from the viral vector. By quantitatively screening pools or individual infection events, those viruses containing unique variant sequences allowing sustained metabolic life of host are identified by fluorescence under long wave UV light. Those that do not confer this phenotype will fail to or poorly fluoresce. In this manner, high throughput screening in multi-well dishes in plate readers is possible where the average fluorescence ofthe well would be expressed as a ratio ofthe adsoφtion (measuring the cell mass) thereby giving a comparable quantitative value. This technique enables screening of populations or individuals followed by rescue ofthe sequence from virus vectors conferring desired trait by RT-PCR and re-screening of particular variant sequences in secondary screens. The functions of transcription factors or factors contributing to the signal transduction pathway of host cells are monitored by using specific proteomic, mRNA or metanomic traits to be assayed following fransfection with a virus expression library. The contribution of a particular protein or product to a valuable trait may be known from the literature, but a new mode of enhanced or reduced expression could be identified by finding the factors that respond to cellular signals that in turn alter its particular expression. For example, transcription factors regulating the expression of defense proteins such as systemin peptides, or protease inhibitors could be identified by transfecting hosts with virus libraries and the expression of systemin or protease inhibitors or their RNAs be directly assayed. Conversely, the promoters responsible for expressing these genes could be genetically fused to the green fluorescent protein and introduced into hosts as transient expression constructs or into stable transformed host cells/tissues. The resulting cells would be transfected with viral vector libraries. Hosts now could be screened rapidly by following relative GFP expression following vector fransfection. Likewise, coupling the transfecting of hosts with virus libraries with the treatment of plants with methyl jasmonate could identify sequences that reverse or enhance the gene induction events induced by this metabolite. This approach could be applied to other factors involved in promotion of higher biomass in plants such as Leafy or DET2. The expression of these factors could be directly assayed or via promoters genetically fused to GFP. This technique will enable screening of populations or individuals followed by rescue ofthe sequence from virus vectors conferring desired trait by RT-PCR and re-screening of particular variant sequences in secondary screens. A fourth aspect ofthe present invention is a method for inhibiting an endogenous protease of a plant host comprising the step of treating the plant host with a compound which induces the production of an endogenous inhibitor of said protease. In a preferred embodiment, jasmonic acid may be used to treat the plant host to induce the production of an endogenous inhibitor of an endogenous protease. In another preferred embodiment, the treatment ofthe plant host with a compound results an increased representation of an exogenous nucleic acid or the protein product thereof. In particular, transgenic hosts expressing protease inhibitors may be used to decrease the degradation of proteins expressed by virus expression vectors. In a preferred embodiment, jasmonic acid may be used to treat plants infected with virus expression vectors to decrease the degradation of proteins expressed by virus expression vectors. A fifth aspect ofthe present invention are genes and fragments thereof, nucleotide sequences, and gene products obtained by way ofthe method ofthe present invention. The present invention features expressing selected nucleotide sequences in a host organism such as, for example, a plant. Those of skill in the art will readily appreciate that the gene products of such nucleotide sequences may be isolated using techniques known to those skilled in the art. Such gene products may exhibit biological activity as pharmaceuticals, herbicides, and other similar functions.
EXAMPLES OF THE PREFERRED EMBODIMENTS
The following examples further illustrate the present invention. These examples are intended merely to be illustrative ofthe present invention and are not to be construed as being limiting.
EXAMPLE 1
Cytoplasmic inhibition of phytoene desaturase in transfected plant confirms that the partial tomato PDS sequence encodes phytoene desaturase.
Isolation of tomato mosaic virus cDNA. An 861 base pair fragment (5524-6384) from the tomato mosaic virus (fruit necrosis strain F; tom-F) containing the putative coat protein subgenomic promoter, coat protein gene, and the 3'-end was isolated by PCR using primers 5'-CTCGCAAAGTTTCGAACCAAATCCTC-3' (upstream) (SEQ ID NO: 1) and 5'-
CGGGGTACCTGGGCCCCAACCGGGGGTTCCGGGGG-3 ' (downstream) (SEQ ID NO: 2) and subcloned into the H cII site of pBluescript KS-. A hybrid virus consisting of TMV-U1 and ToMV-F was constructed by swapping an 874- bp BamRl-Kpnl ToMV fragment into pBGC152, creating plasmid TTO1. The inserted fragment was verified by dideoxynucleotide sequencing. A unique Avrll site was inserted downstream ofthe Xhol site in TTO1 by PCR mutagenesis, creating plasmid TTO1 A, using the following oligonucleotides: 5'-
TCCTCGAGCCTAGGCTCGCAAAGTTTCGAACCAAATCCTCA-3' (upstream) (SEQ ID NO: 3), 5'- CGGGGTACCTGGGCCCCAACCGGGGGTTCCGGGGG-3' (downstream) (SEQ ID NO: 4).
Isolation of a cDNA encoding tomato phytoene svnthase and a partial cDNA encoding tomato phytoene desaturase. Partial cDNAs were isolated from ripening tomato fruit RNA by polymerase chain reaction (PCR) using the following oligonucleotides: PSY, 5'-TATGTATGGTGCAGAAGAACAGAT-3' (upstream) (SEQ ID NO: 5), 5*-AGTCGACTCTTCCTCTTCTGGCAT C-3' (downstream) (SEQ ID NO: 6); PDS, 5'- TGCTCGAGTGTGTTCTTCAGTTTTCTGTCA-3' (SEQ ID NO: 7)
(upstream), 5'-AACTCGAGCGCTTTGATTTCTCCGAAGCTT-3' (downstream) (SEQ ID NO: 8). Approximately 3 X 104 colonies from a Lycopersicon esculentum cDNA library were screened by colony hybridization using a 32P labeled tomato phytoene synthase PCR product. Hybridization was carried out at 42°C for 48 hours in 50% formamide, 5X SSC, 0.02 M phosphate buffer, 5X Denhart's solution, and 0.1 mg/ml sheared calf thymus DNA. Filters were washed at 65°C in 0.1X SSC, 0.1% SDS prior to autoradiography. PCR products and the phytoene synthase cDNA clones were verified by dideoxynucleotide sequencing.
DNA sequencing and computer analysis. A Pstl, BamHl fragment containing the phytoene synthase cDNA and the partial phytoene desaturase cDNA was subcloned into pBluescript® KS+ (Stratagene, La Jolla, California). The nucleotide sequencing of KS+/PDS #38 and KS+/ 5'3'PSY was carried out by dideoxy termination using single-stranded templates (Maniatis, Molecular Cloning, 1st Ed.) Nucleotide sequence analysis and amino acid sequence comparisons were performed using PCGENE® and DNA Inspector® HE programs.
Construction ofthe tomato phytoene svnthase expression vector. A Xhol fragment containing the tomato phytoene synthase cDNA was subcloned into TTO1. The vector TTO1/PSY + (FIGURE 1) contains the phytoene synthase cDNA in the positive orientation under the control ofthe TMV-U1 coat protein subgenomic promoter; while, the vector TTOl/PSY - contains the phytoene synthase cDNA in the antisense orientation.
Construction of a viral vector containing a partial tomato phytoene desaturase cDNA. A Xhol fragment containing the partial tomato phytoene desaturase cDNA was subcloned into TTO1. The vector TTO1A/PDS + (FIGURE 2) contains the phytoene desaturase cDNA in the positive orientation under the control ofthe TMV-U1 coat protein subgenomic promoter; while the vector TTO1A/PDS - contains the phytoene desaturase cDNA in the antisense orientation.
Transfection and analysis of N. benthamiana ITTO1/PSY+. TTO1/PSY-. TTO1Δ/PDS +. TTO1/PDS -1. Infectious RNAs from TTO1/PSY+ (FIGURE 1), TTO 1 /PS Y-TTO 1 /PDS +, TTO1/PDS+ were prepared by in vitro transcription using SP6 DNA-dependent RNA polymerase as described previously (Dawson et al, Proc. Natl. Acad. Sci. USA 83:1832 (1986)) and were used to mechanically inoculate N benthamiana. The hybrid viruses spread throughout all the non- inoculated upper leaves as verified by transmission electron microscopy, local lesion infectivity assay, and polymerase chain reaction (PCR) amplification. The viral symptoms resulting from the infection consisted of distortion of systemic leaves and plant stunting with mild chlorosis. The leaves from plants transfected with TTO1/PSY+ turned orange and accumulated high levels of phytoene while those transfected with TTO1Δ/PDS+ and TTO1Δ/PDS- turned white. Agarose gel eletrophoresis of PCR cDΝA isolated from virion RΝA and Northern blot analysis of virion RNA indicate that the vectors are maintained in an extrachromosomal state and have not undergone any detectable intramolecular rearrangements.
Purification and analysis of carotenoids from transfected plants. The carotenoids were isolated from systemically infected tissue and analyzed by HPLC chromatography. Carotenoids were extracted in ethanol and identified by their peak retention time and absoφtion spectra on a 25-cm Spherisorb® ODS-1 5- m column using acetonitrile/methanol/2-propanol (85:10:5) as a developing solvent at a flow rate of 1 ml/min. They had identical retention time to a synthetic phytoene standard and β-carotene standards from carrot and tomato. The phytoene peak from N benthamiana transfected with TTO1/PSY + had an optical absorbance maxima at 276, 285, and 298 nm. Plants transfected with viral encoded phytoene synthase showed a ten- fold increase in phytoene compared to the levels in noninfected plants. The expression of sense and antisense RΝA to a partial phytoene desaturase in transfected plants inhibited the synthesis of colored carotenoids and caused the systemically infected leaves to turn white. HPLC analysis of these plants revealed that they also accumulated phytoene. The white leaf phenotype was also observed in plants treated with the herbicide norflurazon which specifically inhibits phytoene desaturase.
This change in the levels of phytoene represents one ofthe largest increases of any carotenoid (secondary metabolite) in any genetically engineered plant. Plants transfected with viral-encoded phytoene synthase showed a ten-fold increase in phytoene compared to the levels in noninfected plants. In addition, the accumulation of phytoene in plants transfected with positive-sense or antisense phytoene desaturase suggests that viral vectors can be used as a potent tool to manipulate pathways in the production of secondary metabolites through cytoplasmic .antisense inhibition. These data are presented by Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679-1683 (1995).
EXAMPLE 2 Expression of bell pepper cDΝA in transfected plant confirms that it encodes capsanthin-capsorubin svnthase.
The biosynthesis of leaf carotenoids in Nicotiana benthamiana was altered by rerouting the pathway to the synthesis of capsanthin, a non-native chromoplast-specific xanthophyll, using an RΝA viral vector. A cDΝA encoding capsanthin-capsorubin synthase (Ccs), was placed under the transcriptional control of a tobamovirus subgenomic promoter. Leaves from transfected plants expressing Ccs developed an orange phenotype and accumulated high levels of capsanthin. This phenomenon was associated by thylakoid membrane distortion and reduction of grana stacking. In contrast to the situation prevailing in chromoplasts, capsanthin was not esterified and its increased level was balanced by a concomitant decrease ofthe major leaf xanthophylls, suggesting an autoregulatory control of chloroplast carotenoid composition. Capsanthin was exclusively recruited into the trimeric and monomeric light-harvesting complexes of Photosystem II. This demonstration that higher plant antenna complexes can accommodate non-native carotenoids provides compelling evidence for functional remodeling of photosynthetic membranes by rational design of carotenoids.
Construction ofthe Ccs expression vector. Unique Xhol. Avrll sites were inserted into the bell pepper capsanthin-capsorubin synthase (Ccs) cDNA by polymerase chain reaction (PCR) mutagenesis using oligonucleo tides: 5'- GCCTCGAGTGCAGCATGGAAACCCTTCTAAAGCTTTTCC-3' (upstream) (SEQ ID NO: 9), 5'-TCCCTAGGTCAAAGGCTCTCTATTGCTAGATTGCCC- 3' (downstream) (SEQ ID NO: 10). The 1.6-kb Xhol, Avrll cDNA fragment was placed under the control ofthe TMV-U1 coat protein subgenomic promoter by subcloning into TTO1A, creating plasmid TTO1A CCS+ (FIGURE 3) in the sense orientation as represented by FIGURE 3.
Carotenoid analysis. Twelve days after inoculation upper leaves from 12 plants were harvested and lyophilized. The resulting non-saponified extract was evaporated to dryness under argon and weighed to determine the total lipid content. Pigment analysis from the total lipid content was performed by HPLC and also separated by thin layer chromatography on silica gel G using hexane / acetone (60v / 40v). Plants transfected with TTO1A CCS+ accumulated high levels of capsanthin (36% of total carotenoids).
EXAMPLE 3 Expression of bacterial Crt-9 gene in transfected plants confirms that it encodes phytoene svnthase.
We developed a new viral vector, TTU51, consisting of tobacco mosaic virus strain Ul (TMV-U1) (Goelet et al, Proc. Natl. Acad. Sci. USA 79:5818- 5822 (1982)), and tobacco mild green mosaic virus (TMGMV; U5 strain) (Solis et al, "The complete nucleotide sequence ofthe genomic RNA ofthe tobamovirus tobacco mild green mosaic virus" (1990)). The open reading frame (ORF) for Erwinia herbicola phytoene synthase (CrtE) (Armstrong et al, Proc. Natl. Acad. Sci. USA 82:9975-9979 (1990)) was placed under the control ofthe tobacco mosaic virus (TMV) coat protein subgenomic promoter in the vector TTU51. This construct also contained the gene encoding the chloroplast targeting peptide (CTP) for the small subunit of ribulose- 1,5-bisphosphate carboxylase (RUBISCO) (O'Neal et al, Nucl. Acids Res. L5: 8661-8677 (1987)) and was called TTU51 CTP Crt-5 as represented by FIGURE 4. Infectious RNA was prepared by in vitro transcription using SP6 DNA-dependent RNA polymerase (Dawson et al, Proc. Natl. Acad. Sci. USA 83_:1832-1836 (1986)); Susek et al, Cell 24:787-799 (1993)) and was used to mechanically inoculate N benthamiana. The hybrid virus spread throughout all the non-inoculated upper leaves and was verified by local lesion infectivity assay and polymerase chain reaction (PCR) amplification. The leaves from plants transfected with TTU51 CTP CrtB developed an orange pigmentation that spread systemically during plant growth and viral replication.
Leaves from plants transfected with TTU51 CTP CrtB had a decrease in chlorophyll content (result not shown) that exceeded the slight reduction that is usually observed during viral infection. Since previous studies have indicated that the pathways of carotenoid and chlorophyll biosynthesis are interconnected (Susek et al, Cell 24:787-799 (1993)), we decided to compare the rate of synthesis of phytoene to chlorophyll. Two weeks post-inoculation, chloroplasts from plants infected with TTU51 CTP CrtB transcripts were isolated and assayed for enzyme activity. The ratio of phytoene synthetase to chlorophyll syntheses was 0.55 in transfected plants and 0.033 in uninoculated plants (control). Phytoene synthase activity from plants transfected with TTU51 CTP Crt-5 was assayed using isolated chloroplasts and labeled [14C] geranylgeranyl PP. There was a large increase in phytoene and an unidentified C40 alcohol in the CrtB plants. Phytoene synthetase assay.
The chloroplasts were prepared as described previously (Camara, Methods Enzymol. 214:352-365 (1993)). The phytoene synthase assays were carried out in an incubation mixture (0.5 ml final volume) buffered with Tris-HCL, pH 7.6, containing [14C] geranylgeranyl PP (100,000 cpm) (prepared using pepper GGPP synthase expressed in E. coli), 1 mM ATP, 5 mM MnCl , 1 mM MgCl2, Triton X-100 (20 mg per mg of chloroplast protein) and chloroplast suspension equivalent to 2 mg protein. After 2 h incubation at 30°C, the reaction products were extracted with chloroform methanol (Camara, supra) and subjected to TLC onto silicagel plate developed with benzene/ethyl acetate (90/10) followed by autoradiography.
Chlorophyll synthetase assay.
For the chlorophyll synthetase assay, the isolated chloroplasts were lysed by osmotic shock before incubation. The reaction mixture (0.2 ml, final volume) consisting of 50 mM Tris-HCL (pH 7.6) containing [14C] geranylgeranyl PP (100,000 cpm), 5 MgCl2, 1 mM ATP, and ruptured plasmid suspension equivalent to 1 mg protein was incubated for 1 hr at 30°C. The reaction products were analyzed as described previously.
Plasmid Constructions.
The chloroplast targeting, phytoene synthase expression vector, TTU51 CTP CrtB as represented in FIGURE 4, was constructed in several subcloning steps. First, a unique Sphl site was inserted in the start codon for the Erwinia herbicola phytoene synthase gene by polymerase chain reaction (PCR) mutagenesis (Saiki et al, Science 230:1350-1354 (1985)) using oligonucleotides
Crt-5 MIS 5'"CCA AGC TTC TCG AGT GCA GCA TGC AGC AAC CGC
CGC TGC TTG AC-3' (upstream) (SEQ ID NO: 11) and Crt-5 P300 5'"AAG ATC TCT CGA GCT AAA CGG GAC GCT GCC AAA GAC CGG CCG G-3* (downstream) (SEQ ID NO: 12). The CrtB PCR fragment was subcloned into pBluescript® (Stratagene) at the EcoRV site, creating plasmid pBS664. A 938 bp Sphl, Xhol CrtB fragment from pBS664 was then subcloned into a vector containing the sequence encoding the N tabacum chloroplast targeting peptide (CTP) for the small subunit of RUBISCO, creating plasmid pBS670. Next, the tobamoviral vector, TTU51, was constructed. A 1020 base pair fragment from the tobacco mild green mosaic virus (TMGMV; U5 strain) containing the viral subgenomic promoter, coat protein gene, and the 3'-end was isolated by PCR using TMGMV primers 5'"GGC TGT GAA ACT CGA AAA GGT TCC GG-3'
(upstream) (SEQ ID NO: 13) and 5'"CGG GGT ACC TGG GCC GCT ACC GGC GGT TAG GGG AGG-3' (downstream) (SEQ ID NO: 14), subcloned into the Hindi site of Bluescript KS-, and verified by dideoxynucleotide sequencing. This clone contains a naturally occurring duplication of 147 base pairs (SEQ ID NO: 15) that includes the whole upstream pseudoknot domain in the 3' noncoding region. The hybrid viral cDNA consisting of TMV-U1 and TMGMV was constructed by swapping a 1-Kb Xhol-Kpnl TMGMV fragment into TTO1 (Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679-1683 (1995)), creating plasmid TTU51. Finally, the 1.1 Kb Xhol CTP Crt-5 fragment from ρBS670 was subcloned into the Xhol of TTU51, creating plasmid TTU51 CTP Crt-5. As a CTP negative control, a 942 bp Xhol fragment containing the Crt-5 gene from pBS664 was subcloned into TTU51, creating plasmid TTU51 Crt-5 #15.
EXAMPLE 4
Expression of bacterial phytoene desaturase (Crt gene in transfected plants confers resistance to norflurazon herbicide.
Erwinia phytoene desaturase (PDS), which is encoded by the gene CrtI (Armstrong et al, 1990), converts phytoene to lycopene through four desaturation steps. While plant PDS is sensitive to the bleaching herbicide norflurazon,
Erwinia PDS is not inhibited by norflurazon (Misawa et al, Plant J. 6(4):481- 489 (1994)). The open reading frame (ORF) for CrtI was placed under the control ofthe tobacco mosaic virus (TMV) coat protein subgenomic promoter in the vector TTOSA1. This construct also contained the gene encoding the chloroplast targeting peptide (CTP) for the small subunit of ribulose- 1,5- bisphosphate carboxylase (RUBISCO) and was called TTOSA1 CTP CrtI 491 #7 Infectious RNA was prepared by in vitro transcription using SP6 DNA-dependent RNA polymerase (Dawson et al, Proc. Natl. Acad. Sci. USA 83:1832-1836 (1986)) and was used to mechanically inoculate N benthamiana. The hybrid virus spread throughout all the non-inoculated upper leaves, conferring resistance to norflurazon to the entire plant. TTOSAl CTP CrtI 491 #7 (FIGURE 5) inoculated plants remained green instead of bleaching white, and maintained higher levels of β-carotene compared to uninoculated control plants.
Plasmid Constructions.
The chloroplast targeting, bacterial phytoene desaturase expression vector, TTOSAl CTP CrtI 491 #7 (FIGURE 5) was constructed as follows. First, a unique Sphl site was inserted in the start codon for the Erwinia herbicola phytoene desaturase gene (plasmid pAU211, (FIGURE 6) by polymerase chain reaction (PCR) mutagenesis using the oligonucleotides CrtI HSM1 5'-GA CAG AAG CTT TGC AGC ATG CAA AAA ACC GTT-3' (upstream) (SEQ ID NO: 16) and IQ419A 5'-CGC GGT CAT TGC AGA TCC TCA ATC ATC AGG C-3' (downstream) (SEQ ID NO: 17). The 1504 bp CrtI PCR fragment was subcloned into pBluescript® (Stratagene) by inserting it between the EcoRV and Hindlll sites, creating plasmid KS+/Crt/* 491 (FIGURE 7). A 1481 bp Sphl, Avrll CrtI fragment from plasmid KS+ICrtI* 491 was then subcloned into the tobamo viral vector TTOSAl , creating TTOSAl CTP CrtI 491 #7.
Treatment of Transfected Plants with Norflurazon and Results.
Starting 7 days after viral inoculation, the plants were treated with 5 ml of a 10 mg/ml SolicamΘDF (Sandoz Agro, Inc.) norflurazon herbicide solution [(4- chloro-5-(methylamino)-2-(alpha, alpha, alpha-trifluoro-m-tolyl)-3(2H)- pyridazinone)] every 4 days by applying to leaves and soil. Five days after initiating treatment, uninfected plants were almost entirely white, especially in the upper leaves and meristematic areas. Plants infected with TTOSAl CTP CrtI 491 #7 were still green and were almost identical in appearance to the non- norflurazon treated infected controls. Leaf Analysis.
The spread ofthe virally expressed CrtI gene throughout the plant was verified by Northern blotting (Alwine et al, Proc. Natl. Acad. Sci. USA 74:5350- 5354 (1977)). Viral RNA was purified from uninoculated upper leaves and was probed with the 1.5 kb CrtI gene. Positive results were obtained from plants inoculated with TTOSAl CTP CrtI 491 #7.
Leaf tissue from a TTOSAl CTP CrtI 491 #7 infected plant was examined for β-carotene levels. Treating an uninoculated control plant with norflurazon resulted in severely depressed β-carotene levels (7.8% ofthe wild- type level). However, when a plant which had been previously inoculated with the viral construct TTOSAl CTP CrtI 491 #7 was treated with norflurazon, the β- carotene level were partially restored (28.3% ofthe wild-type level). This is similar to the level of β-carotene in TTOSAl CTP CrtI 491 #7 samples not treated with norflurazon (an average of 38.3% of wild-type), indicating that the herbicide norflurazon had little effect on β-carotene levels in previously transfected plants. The expression ofthe bacterial phytoene desaturase in systematically infected tissue conferred resistance to the herbicide norflurazon.
EXAMPLE 5 Expression of 5-enolpyruvylshikimate-3-phosphate svnthase -.EPSPS") genes in plants confers resistance to Roundup® herbicide.
Systemic expression via a recombinant viral vector of 5- enolpyruvylshikimate-3-phosphate synthase (EPSPS) genes in plants confers resistance to Roundup® herbicide. See also della-Cioppa, et al, "Genetic Engineering of herbicide resistance in plants," Frontiers of Chemistry:
Biotechnology. Chemical Abstract Service, ACS, Columbus, OH, pp. 665-70 (1989). The purpose of this experiment is to provide a method to systemically express EPSPS genes via a recombinant viral vector in fully-grown pl-ants. Transfected plants that oveφroduce the enzyme EPSPS in vegetative tissue (root, stem, and leaf) are resistant to Roundup® herbicide. The present invention provides a method for the production of plasmid-targeted EPSPS in plants via an RNA viral vector. A dual subgenomic promoter vector encoding the full-length EPSPS gene from Nicotiana tabacum (Class I EPSPS) is shown in plasmid pBS736. Systemic expression ofthe Nicotiana tabacum Class I EPSPS confers resistance to Roundup® herbicide in whole plants and tissue culture. FIGURE 8 shows plasmid pBS736.
EXAMPLE 6 Cvtoplasmic inhibition of 5-enolpyruwlshikimate-3-phosphate svnthase (EPSPS) genes in plants blocks aromatic amino acid biosynthesis.
Cytoplasmic inhibition of 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) genes in plants blocks aromatic amino acid biosynthesis and causes a systemic bleaching phenotype similar to Roundup® herbicide. See also della- Cioppa, et al, "Genetic Engineering of herbicide resistance in plants," Frontiers of Chemistry: Biotechnology. Chemical Abstract Service, ACS, Columbus, OH, pp. 665-70 (1989). A dual subgenomic promoter vector encoding 1097 base pairs of an antisense EPSPS gene from Nicotianan tabacum (Class I EPSPS) is shown in plasmid pBS712. FIGURE 9 shows plasmid pBS712. Systemic expression ofthe Nicotiana tabacum Class I EPSPS gene in the antisense orientation causes a systemic bleaching phenotype similar to Roundup® herbicide.
EXAMPLE 7 Exemplary complementation analysis.
A transgenic plant or naturally occurring plant mutant may have a nonfunctional gene such as the one which produces EPSP synthase. A plant deficient or lacking in the EPSP synthase gene could grow only in the presence of added aromatic amino acids. Transfection of plants with a viral vector containing a functional EPSP synthase gene or cDNA sequence encoding the same would cause the plant to produce a functional EPSP synthase gene product. A plant so transfected would then be able to grow normally without added aromatic amino acids to its environment. In this transfected plant, the EPSP synthase mutation in the plant would be complemented in trans by the viral nucleic acid sequence containing the native or foreign EPSP synthase cDNA sequence. EXAMPLE 8 Expression of methylotrophic veast ZZA1 gene in transfected plants confirms that it encodes alcohol oxidase.
A genomic clone encoding alcohol oxidase ZZA1, the first enzyme involved in methanol utilization, was isolated from a newly described Pichia pastoris strain. Kumagai et al, Bio/Technology 11:606-610 (1993). Sequence analysis indicates that gene encodes a polypepide of approximately 72 -kDa (FIGURE 10). Comparison ofthe amino acid sequence to Pichia pastoris AOXI and AOX2 alcohol oxidases indicates that they show 97.4% and 96.4% similarity to each other, respectively. The open reading frame (ORF) for alcohol oxidase, from the a genomic clone containing ZZA1, was placed under the control ofthe tobamo viral subgenomic promoter in TTO1 A, a hybrid tobacco mosaic virus (TMV) and tomato mosaic virus (ToMV) vector. Infectious RNA from TTO1 APE ZZA1 (FIGURE 11) was prepared by in vitro transcription using SP6 DNA-dependent RNA polymerase and used to mechanically inoculate N benthamiana. The 72-kDa protein accumulated in systemically infected tissue and was analyzed by immunoblotting, using Pichia pastoris alcohol oxidase as a standard. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plant extracts.
Isolation ofthe alcohol oxidase gene.
Three hundred nanograms ofthe yeast Pichia pastoris genomic DΝA digested with Pstl .and Xhol was amplified by PCR using a 25-mer oligonucleotide (5'-TTG CAC TCT GTT GGC TCA TGA CGA T-3') (SEQ ID NO: 18) conesponding to the nucleotide sequence of AOXI promoter and a 26- mer oligonucleotide (5'-CAA GCT TGC ACA AAC GAA CGT CTC AC-3') (SEQ ID NO: 19) conesponding to a nucleotide sequence derived from the -4O-Y7 terminator. The PCR conditions using Thermus aquaticus DNA polymerase (2.5U; Perkin-Elmer Cetus) consisted of an initial 2 minute incubation at 97°C followed by two cycles at 97°C (Imin.), 45°C (Imin.), 60°C (1 min.), thirty-five cycles at 94°C (1 min.), 45°C (1 min.), 60°C (1 min.), and a final DNA polymerase extension at 60°C for 7 min. The 3273 base pair fragment containing ZZA1 gene was phenol/chloroform treated and precipitated with ammonium acetate/ethanol. After digestion with Sacl the fragment was purified by 1% low melt agarose electrophoresis and subcloned into the Sacl/EcoRV sites in pBluescript KS-. The alcohol oxidase genomic clone KS-AO7'8' was characterized by restriction mapping and dideoxynucleotide sequencing.
Plasmid Constructions.
Unique Xhol, Avrll sites were inserted into the Pichia pastoris clone KS- AO7'8' by polymerase chain reaction (PCR) mutagenesis using oligonucleotides: 5'-CAC TCG AGA GCA TGG CTA TTC CCG AAG AAT TTG ATA TTA TCG-3' (upstream) (SEQ ID NO: 20) and 5'-TCC CTA GGT TAG AAT CTA
GCA AGA CCG GTC TTC TCG-3' (downstream) (SEQ ID NO: 21). The 2.0-kb Xhol, Avrll ZZA1 PCR fragment was subcloned into pTTOl APE, creating plasmid TTOl APE ZZA1.
EXAMPLE 9
Rapid, high-level expression of rice OS 103 cDNA in transfected plants confirms that it encodes glycosylated rice α-amylase.
The open reading frame (ORF) for rice α-amylase, from the cDNA clone pOS103 (O'Neill et al, Mol. Gen. Genet. 221:235-244 (1990)), was placed under the control ofthe tobamoviral subgenomic promoter in TTOl A (Kumagai et al, Proc. Natl. Acad. Sci. USA 92:1679-1683 (1995)), a hybrid tobacco mosaic virus (TMV) and tomato mosaic virus (ToMV) vector. Infectious RNA from TTOl A 103L (FIGURE 12) was prepared by in vitro transcription using SP6 DNA- dependent RNA polymerase and used to mechanically inoculate N benthamiana. The hybrid virus spread throughout the noninoculated upper leaves as verified by transmission electron microscopy, local lesion infectivity assay, and PCR amplification. The viral symptoms consisted of plant stunting with mild chlorosis and distortion of systemic leaves. The 46-kDa α-amylase accumulated to levels of at least 5% total soluble protein, and was analyzed by immunoblotting, using yeast expressed α-amylase as a standard. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plant extracts. The expression level ofthe recombinant enzyme produced in transfected plants was at least ten times higher than the amount of thermostable bacterial α-amylase produced in transgenic tobacco. The α-amylase was purified using ion exchange chromatography and its structural and biological properties were analyzed. The secreted protein had an approximate relative molecular mass of 46 kDa, cross-reacted with anti-α-amylase antibody, and hydro lyzed starch and oligomaltose in an in vitro assay.
The recombinant enzyme from transfected N benthamiana was glycosylated at an asparagine residue via an N-glycosidic linkage. The heterologously expressed α-amylases from transfected N benthamiana .and from transformed strains of S. cerevisiae and P. pastoris were treated with endo-H and were compared by Western blot SDS-PAGE analysis. There was an equivalent mobility shift for the enzymes expressed in S. cerevisiae and P. pastoris. The extent ofthe change in mobility suggests that the yeast expressed enzymes are hyperglycosylated while the recombinant protein from transfected plants is similar to that ofthe native rice α-amylase. While it is known that mannose-rich and complex oligosaccharide side chains are covalently attached to the mature rice seed α-amylase (Mitsui et al, Plant Physiol. 82:880-884 (1986)), the actual carbohydrate composition and structure ofthe recombinant plant glycoprotein remains to be determined. MALDI-TOF analysis revealed that the relative molecular mass (Mr) of the N benthamiana expressed sample was 46,064 Da. The Mr ofthe α-amylase determined by MALDI-TOF was 918 Da larger than the Mr derived from the amino acid sequence (PCGEΝE). The change in molecular mass (-ΪMr) ofthe plant expressed enzyme was smaller than the ΔMr of α-amylases produced in yeast. This result suggests that there is a difference in glycosylation patterns between foreign proteins expressed in plants and those that are secreted in yeast.
Plasmid Constructions.
Unique Xhol, Avrll sites were inserted into the rice α-amylase pOS103 cDΝA by PCR mutagenesis using oligonucleotides: 5'-CTC TCG AGA TCA
ATC ATC CAT CTC CGA AGT GTG TCT GC-3* (upstream) (SEQ ID NO: 22) and 5'-TCC CTA GGT CAG ATT TTC TCC CAG ATT GCG TAG C-3' (downstream) (SEQ ID NO: 23). The 1.4-kb Xhol, Avrll OS 103 PCR fragment was subcloned into pTTOlA, creating plasmid TTOl A 103L.
Purification. Immunological Detection, .and in vitro Assay of α-amylase. Ten days after inoculation, total soluble protein was isolated from 10 g of upper, noninoculated N benthamiana leaf tissue. The leaves were frozen in liquid nitrogen and ground in 20 ml of 5% 2-mercaptoethanol/lO mM Tris-bis propane, pH 6.0. The suspension was centrifuged .and the supernatant, containing recombinant α-amylase, was bound to a POROS® 50 HQ ion exchange column (PerSeptive Biosystems). The α-amylase was eluted with a linear gradient of 0.0- 1 M ΝaCl in 50 mM Tris-bis propane pH 7.0. The α-amylase eluted in fraction 16, 17 and its enzyme activity was analyzed (Sigma Kit #576-3). Fractions containing cross-reacting material to α-amylase antibody were concentrated with a Centriprep-30® (Amicon) and the buffer was exchanged by diafiltration (50 mM Tris-bis propane, pH 7.0). The sample was then loaded on a POROS HQ/M column (Perceptive Biosystems), eluted with a linear gradient of 0.0-1 M ΝaCl in 50 mM Tris-bis propane pH 7.0, and assayed for α-amylase activity. Fractions containing cross-reacting material to α-amylase antibody were concentrated with a Centriprep-30 and the buffer was exchanged by diafiltration (20 mM Sodium Acetate/HEPES/MES, pH 6.0). The sample was finally loaded on a POROS
HS/TVI column (Perceptive Biosystems), eluted with a linear gradient of 0.0-1 M ΝaCl in 20 mM Sodium Acetate/HEPES/MES, pH 6.0, and assayed for α- amylase activity. Total soluble plant protein concentrations were determined using bovine serum albumin as a standard. The proteins were analyzed on a 0.1 % SDS/ 10% polyacrylamide gel and fransfened by elecfrob lotting for 1 hour to a nitrocellulose membrane. The blotted membrane was incubated for 1 hr with a 2000- fold dilution of anti-α-amylase antiserum. Using standard protocols, the antisera was raised in rabbits against S cerevisiae expressed rice α-amylase. The enhanced chemiluminescence horseradish peroxidase-linked, goat anti-rabbit IgG assay (Cappel Laboratories) was performed according to the manufacturer's (Amersham) specifications. The blotted membrane was subjected to film exposure times of up to 10 sec. The quantity of total recombinant α-amylase in an extracted leaf sample was determined (using a 1-sec exposure ofthe blotted membrane) by comparing the crude extract chemiluminescent signal to the signal obtained from known quantities of α-amylase. Shorter and longer chemiluminescent exposure times ofthe blotted membrane gave the same quantitative results.
Analysis of post-translational modifications of recombinant α-amylases.
Approximately 5 μg of recombinant protein was dissolved in 1 M acetic acid and subjected to matrix-assisted laser desoφtion/ionization time of flight (MALDI-TOF) analysis (Karas et al, Anal Chem. 60:2299-2301 (1988)). For treatment with endo--5-N-acetylglucosaminidase H (endo H), 2 μg ofthe recombinant α-amylases were denatured in 0.5% SDS/ 1% β-mercaptoethanol at 100°C for 10 minutes. After the addition of 500 U of endo H (New England Biolabs) the samples were incubated at 37°C for 4 hours in 50 mM sodium citrate (pH 5.5 @ 25°C) and then subjected to Western blot analysis using anti-α- amylase antiserum.
EXAMPLE 10 Expression of Chinese cucumber cDNA clone pO21D in transfected plants confirms that it encodes α-trichosanthin.
We have developed a plant viral vector that directs the expression of α- trichosanthin in transfected plants. The open reading frame (ORF) for α- trichosanthin, from the genomic clone SEO, was placed under the control ofthe TMV coat protein subgenomic promoter. Infectious RNA from TTU51A QSEO #3 (FIGURE 13) was prepared by in vitro transcription using SP6 DNA- dependent RNA polymerase and was used to mechanically inoculate N. benthamiana. The hybrid virus spread throughout all the non-inoculated upper leaves as verified by local lesion infectivity assay, and PCR amplification. The viral symptoms consisted of plant stunting with mild chlorosis and distortion of systemic leaves. The 27-kDa α-trichosanthin accumulated in upper leaves (14 days after inoculation) and cross-reacted with an anti-trichosanthin antibody. Plasmid Constructions.
An 0.88-kb Xhol, Avrll fragment, containing the α-trichosanthin coding sequence, was amplified from genomic DNA isolated from Trichosanthes kirilowii Maximowicz by PCR mutagenesis using oligonucleotides QMIX: 5'- GCC TCG AGT GCA GCA TGA TCA GAT TCT TAG TCC TCT CTT TGC-3 ' (upstream) (SEQ ID NO: 24) and Q1266A 5'-TCC CTA GGC TAA ATA GCA TAA CTT CCA CAT CA AAGC-3' (downstream) (SEQ ID NO: 25). The α- trichosanthin open reading frame was verified by dideoxy sequencing, and placed under the control ofthe TMV-U1 coat protein subgenomic promoter by subcloning into TTU51 A, creating plasmid TTU51 A QSEO #3.
In vitro Transcriptions. Inoculations, and Analysis of Transfected Plants.
N benthaminana plants were inoculated with in vitro transcripts of Kpn I- digested TTU51A QSEO #3 as previously described (Dawson et al, supra). Virions were isolated from N benthamiana leaves infected with TTU51A QSEO #3 transcripts.
Purification. Immunological Detection, and in vitro Assay of α-Trichosanthin. Two weeks after inoculation, total soluble protein was isolated from upper, noninoculated N benthamiana leaf tissue and assayed from cross- reactivity to a α-trichosanthin antibody. The proteins from systemically infected tissue were analyzed on a 0.1% SDS/12.5% polyacrylamide gel and transfened by electroblotting for 1 hr to a nitrocellulose membrane. The blotted membrane was incubated for 1 hr with a 2000-fold dilution of goat anti-α-trichosanthin antiserum. The enhanced chemiluminescence horseradish peroxidase-linked, rabbit anti-goat IgG assay (Cappel Laboratories) was performed according to the manufacturer's (Amersham) specifications. The blotted membrane was subjected to film exposure times of up to 10 sec. Shorter and longer chemiluminescent exposure times ofthe blotted membrane gave the same quantitative results. EXAMPLE 11 Expression of human β-globin cDNA clone in transfected plants confirms that it encodes hemoglobin.
The hemoglobin expression vector, RED1, was constructed in several subcloning steps. A unique Sphl site was inserted in the start codon for the human β-globin and a Xbal site was placed downstream ofthe stop codon by
PCR mutagenesis by using oligonucleotides 5'"CAC TCG AGA GCA TGC TGC
ACC TGA CTC CTG AGG AGA AG-3' (upstream) (SEQ ID NO: 26) and 5'" CGT CTA GAT TAG TGA TAC TTG TGG GCC AGC GCA TTA GC-3* (downstream) (SEQ ID NO: 27). The 452 bp Sphl-Xbal hemoglobin fragment was subcloned into the Sphl-Avrll site of a modified tobamoviral vector. This construct consists of a 1020 bp fragment from the tobacco mild green mosaic virus (TMGMV; U5 strain) containing the viral subgenomic promoter, coat protein gene, and the 3'-end that was isolated by PCR using TMGMV primers 5'~ GGC TGT GAA ACT CGA AAA GGT TCC GG-3* (upstream) (SEQ ID NO: 28) and 5'-CGG GGT ACC TGG GCC GCT ACC GGC GGT TAG GGG AGG-3' (downstream) (SEQ ID NO: 29). In this vector, an artificial 40 base pair 5' untranslated coat protein leader was fused to a hybrid cDNA encoding rice α- amylase signal peptide and human β-globin. A hybrid sequence encoding rice alpha-amylase signal peptide and β- chain of human hemoglobin was placed under the control ofthe tobacco mosaic virus (TMV-U1) coat protein subgenomic promoter. Infectious RNA was made in vitro and directly applied to N benthamiana. One to two weeks post- inoculation transfected plants had accumulated recombinant hemoglobin. The 16-KDa β-globin accumulated in systemically infected leaves and was analyzed by immunoblotting, using hum.an hemoglobin as a standard. The recombinant hemoglobin was detected in transfected plants using a rabbit anti-human hemoglobin antibody. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plants. The β-globin from transfected plants co-migrated with an authentic human standard and appears to form homodimers. This result suggests that rice α-amylase signal peptide was removed and that it may be possible to rapidly secrete functional hemoglobin in transfected plants. EXAMPLE 12 Construction of a tobamoviral vector for expression of heterologous genes in A. thaliana. Virions that were prepared as a crude aqueous extract of tissue from turnip infected with RMV were used to inoculate N benthamiana, N tabacum, A. thaliana, and oilseed rape (canola). Two to three weeks after fransfection, systemically infected plants were analyzed by immunoblotting, using purified RMV as a standard. Total soluble plant protein concentrations were determined using bovine serum albumin as a standard. The proteins were analyzed on a 0.1% SDS/12.5% polyacrylamide gel and transfened by electroblotting for 1 hr to a nitrocellulose membrane. The blotted membrane was incubated for 1 hr with a 2000-fold dilution of anti-ribgrass mosaic virus coat antiserum. Using standard protocols, the antisera was raised in rabbits against purified RMV coat protein. The enhanced chemiluminescence horseradish peroxidase- linked, goat anti-rabbit IgG assay (Cappel Laboratories) was performed according to the manufacturer's (Amersham) specifications. The blotted membrane was subjected to film exposure times of up to 10 sec. No detectable cross-reacting protein was observed in the noninfected N benthamiana control plant extracts. A 18 kDa protein cross-reacted to the anti-RMV coat antibody from systemically infected N benthamiana, N. tabacum, A. thaliana, and oilseed rape (canola). This result demonstrates that RMV can systemically infect N benthamiana, N tabacum, A. thaliana, and oilseed rape (canola).
Plasmid constructions.
Ribgrass mosaic virus (RMV) is a member ofthe tobamovirus group that infects crucifers. A partial RMV cDΝA containing the 30K subgenomic promoter, 30K ORF, coat subgenomic promoter, coat ORF, and 3 '-end was isolated by RT-PCR using by using oligonucleotides TVCV183X 5'-TAC TCG AGG TTC ATA AGA CCG CGG TAG GCG G-3' (upstream) (SEQ ID NO: 30) and TVCV Kpnl 5*-CGG GGT ACC TGG GCC CCT ACC CGG GGT TTA GGG AGG-3' (downstream) (SEQ ID NO: 31), and subcloned into the EcoRV site of KS+, creating plasmid KS+ TVCV #23 (FIGURE 14). The RMV cDNA was characterized by restriction mapping and dideoxy nucleotide sequencing. The partial nucleotide sequence is as follows: 5'- ctcgaggttcataagaccgcggtaggcggagcgtttgtttactgtagtataattaaatatttgtcagataaaaggttgttt aaagatttgttttttgtttgactgagtcgataATGTCTTACGAGCCTAAAGTTAGTGACTTC CTTGCTCTTACGAAAAAGGAGGAAATTTTACCCAAGGCTTTGACGAGA TTAAAGACTGTCTCTATTAGTACTAAGGATGTTATATCTGTTAAGGAG TCTGAGTCCCTGTGTGATATTGATTTGTTAGTGAATGTGCCATTAGATA AGTATAGGTATGTGGGTGTTTTGGGTGTTGTTTTC ACCGGTGAATGGC TGGTACCGGATTTCGTTAAAGGTGGGGTAACAGTGAGCGTGATTGAC AAACGGCTTGAAAATTCCAGAGAGTGCATAATTGGTACGTACCGAGC TGCTGTAAAGGACAGAAGGTTCCAGTTCAAGCTGGTTCCAAATTACTT CGTATCCATTGCGGATGCCAAGCGAAAACCGTGGCAGGTTCATGTGC GAATTC AAAATCTGAAGATCGAAGCTGGATGGC AACCTCTAGCTCTA GAGGTGGTTTCTGTTGCCATGGTTACTAATAACGTGGTTGTTAAAGGT TTGAGGGAAAAGGTCATCGCAGTGAATGATCCGAACGTCGAAGGTTT CGAAGGTGTGGTTGACGATTTCGTCGATTCGGTTGCTGCATTCAAGGC GATTGACAGTTTCCGAAAGAAAAAGAAAAAGATTGGAggaagggatGTAA ATAATAATAAGTATAGATATAGACCGGAGAGATACGCCGGTCCTGAT TCGTTACAATATAAAGAAGAAAaTGGTTTACAACATCACGAGCTCGAA TCAGTACCAGTATTTCGCAGCGATGTGGGCAGAGCCCACAGCGATGCT TAAccaGTGCGTGTCTGCGTTGTCGCAATCGTATCAAACTCAGGCGGCA AgAGATACTGTTAGACAGCAGTTCTCTAACCTTCTGAGTGCGATTGTG ACACCGAACCAGCGGTTTCCAgAAACAGGATACCGGGTGTATATTAAT TCAGCAGTTCTAAAACCGTTGTACGAGTCTCTCATGAAGTCCTTTGAT ACTAGAAATAGGATCATTGAAACTGAAGAAGAGTCGCGTCCATCGGC TTCCGAAGTATCTAATGCAACACAACGTGTTGATGATGCGACCGTGGC CATCAGGAGTCAAATTCAGCTTTTGCTGAACGAGCTCTCCAACGGACA TGGTCTGATGAACAGGGCAGAGTTCGAGGTTTTATTACCTTGGGCTAC TGCGCCAGCTACATAGgcgtggtgcacacgatagtgcatagtgtttttctctccacttaaatcgaagag atatacttacggtgtaattccgcaagggtggcgtaaaccaaattacgcaatgttttaggttccatttaaatcgaaacctgt tatttcctggatcacctgttaacgtacgcgtggcgtatattacagtgggaataactaaaagtgagaggttcgaatcctcc ctaaccccgggtaggggccca-3'(SEQ ID NO: 32).
The 1543 base pair from the partial RMV cDNA was compared (PCGENE) to oilseed rape mosaic virus (ORMV). The nucleotide sequence identity was 97.8%. The RMV 30K and coat ORF were compared to ORMV and the amino acid identity was 98.11% (30K) and 98.73% (coat), respectively. A partial RMV cDNA containing the 5 '-end and part ofthe replicase was isolated by RT-PCR from RMV RNA using by using oligonucleotides RGMV1 5'"GAT GGC GCC TTA ATA CGA CTC ACT ATA GTT TTA TTT TTG TTG CAA CAA CAA CAA C-3' (upstream) (SEQ ID NO: 33) and RGR 132 5'-CTT GTG CCC TTC ATG ACG AGC TAT ATC ACG-3' (downstream) (SEQ ID NO: 34). The RMV cDNA was characterized by dideoxy nucleotide sequencing. The partial nucleotide sequence containing the T7 RNA polymerase promoter and part ofthe RMV cDNA is as follows: 5'- ccttaatacgactcactataGTTTTATTTTTGTTGCAACAACAACAACAAATTACAA TAACAACAAAACAAATACAAACAACAACAACATGGCACAATTTCAAC AAACAGTAAACATGCAAACATTGCAGGCTGCCGCAGGGCGCAACAGC CTGGTGAATGATTTAGCCTCACGACGTGTTTATGACAATGCTGTCGAG GAGCTAAATGCACGCTCGAGACGCCCTAAGGTTCATTACTCCAAATCA GTGTCTACGGAACAGACGCTGTTAGCTTCAAACGCTTATCCGGAGTTT GAGATTTCCTTTACTCATACCCAACATGCCGTACACTCCCTTGCGGGT GGCCTAAGGACTCTTGAGTTAGAGTATCTCATGATGCAAGTTCCGTTC GGTTCTCTGACGTACGACATCGGTGGTAACTTTGCAGCGCACCTTTTC AAAGGACGCGACTACGTTCACTGCTGTATGCCAAACTTGGATGTACGT GATATAGCT-3' (SEQ ID NO: 35). The uppercase letters are nucleotide sequences from RMV cDNA. The lower case letters are nucleotide sequences from T7 RNA polymerase promoter. The nucleotide sequences from the 5' and 3' oligonucleotides are underlined. Full length infectious RMV cDNA clones were obtained by RT-PCR from
RMV RNA using by using oligonucleotides RGMV1 5'"GAT GGC GCC TTA ATA CGA CTC ACT ATA GTT TTA TTT TTG TTG CAA CAA CAA CAA C- 3' (upstream) (SEQ ID NO: 36) and RG1 APE 5'-ATC GTT TAA ACT GGG CCC CTA CCC GGG GTT AGG GAG G-3' (downstream) (SEQ ID NO: 37). The RMV cDNA was characterized by dideoxy nucleotide sequencing. The partial nucleotide sequence containing the T7 RNA polymerase promoter and part of the RMV cDNA is as follows: 5'-
CCTTAATACGACTCACTATAGTTTTATTTTTGTTGCAACAACAACAAC AAATTACAATAACAACAAAACAAATACAAACAACAACAACATGGCAC AATTTCAACAAACAGTAAACATGCAAACATTCCAGGCTGCCGCAGGG CGCAACAGCCTGGTGAATGATTTAGCCTC ACGACGTGTTTATGACAAT GCTGTCGAGGAGCTAAATGCACGCTCGAGACGCCCTAAGGTTCATTAC TCCAAATCAGTGTCTACGGAACAGACGCTGTTAGCTTCAAACGCTTAT CCGGAGTTTGAGATTTCCTTTACTCATACCCAAACATGCCGTACACTC CCTTGCGGGTGGCCTAAGGACTCTTGAGTTAGAGTATCTCATGATGCA AGTTCCGTTCGGTTCTCTGACGTACGACATCGGTGGTAACTTTGC AGC GCACCTTTTCAAAGGACGCGACTACGTTCACTGCTGTATGCCAAACTT GGATGTACGTGATATAGCT-3 ' (SEQ ID NO: 38). The uppercase letters are nucleotide sequences from RMV cDNA. The nucleotide sequences from the 5' .and 3' oligonucleotides are underlined. Full length infectious RMV cDNA clones were obtained by RT-PCR from RMV RNA using oligonucleotides
RGMV1 5 '-gat ggc gcc tta ata cga etc act ata gtt tta ttt ttg ttg caa caa caa caa c-3' (upstream) (SEQ ID NO: 39) and RG1 APE 5'-ATC GTT TAA ACT GGG CCC CTA CCC GGG GTT AGG GAG G-3' (downstream) (SEQ ID NO: 40).
EXAMPLE 13
Arabidopsis thaliana cDNA library construction in a dual subgenomic promoter vector.
Arabidopsis thaliana cDNA libraries obtained from the Arabidopsis
Biological Resource Center (ABRC). The four libraries from ABRC were size- fractionated with inserts of 0.5-1 kb (CD4-13), 1-2 kb (CD4-14), 2-3 kb (CD4-
15), and 3-6 kb (CD4-16). All libraries are of high quality and have been used by several dozen groups to isolate genes. The pBluescript® phagemids from the Lambda ZAP II vector were subjected to mass excision and the libraries were recovered as plasmids according to standard procedures.
Alternatively, the cDNA inserts in the CD4-13 (Lambda ZAP II vector) were recovered by digestion with NotI. Digestion with NotI in most cases liberates the entire Arabidopsis thaliana cDΝA insert because the original library was assembled with NotI adapters. NotI is an 8-base cutter that infrequently cleaves plant DΝA. In order to insert the NotI fragments into a transcription plasmid, the pBS735 transcription plasmid (FIGURE 15) was digested with PacVXhol and ligated to an adapter DΝA sequence created from the oligonucleotides 5'-TCGAGCGGCCGCAT-3' (SEQ ID NO: 41) and 5'-
GCGGCCGC-3' (SEQ ID NO: 42). The resulting plasmid pBS740 (FIGURE 16) contains a unique NotI restriction site for bidirectional insertion of NotI fragments from the CD4-13 library. Recovered colonies were prepared from these for plasmid minipreps with a Qiagen BioRobot 9600®. The plasmid DΝA preps performed on the BioRobot 9600® are done in 96-well format and yield transcription quality DΝA. An Arabidopsis cDΝA library was transformed into the plasmid and analyzed by agarose gel electrophoresis to identify clones with inserts. Clones with inserts may be transcribed in vitro .and inoculated onto N benthamiana .and/or Arabidopsis thaliana. Selected leaf disks from transfected plants may be then taken for biochemical analysis.
EXAMPLE 14 Expression and targeting to the chloroplasts of a green fluorescent protein in Arabidopsis thaliana via a recombinant viral nucleic acid vector. The gene encoding green fluorescent protein (GFP) was fused at the Ν- terminus to the chloroplast transit peptide (CTP) sequence of RuBPCase to create plasmid pBS723 (FIGURE 17). Plasmid pBS723 was modified by PCR mutagenesis to create a unique Pad site upstream ofthe ATG start codon ofthe CTP-GFP gene fusion. The PCR .amplification product obtained from plasmid pBS723 was digested PacllSall and cloned into plasmid GFP-30B/clone 60 (also digested with PacllSall) to create plasmid pBS731 (FIGURE 18). Plasmid pBS731 was linearized at a unique Kpnl restriction site and transcribed into infectious RΝA with T7 RΝA polymerase according to standard procedures. Infectious RNA transcripts that were inoculated onto Nicotiana benthamiana plants showed systemic expression in the upper leaves of CTP-GFP within six days. Plants infected with RNA transcripts from plasmid pBS731 were harvested by grinding the leaves with a mortar and pestle to obtain recombinant virions derived from pBS731 infectious RNA transcripts. Virions from pBS731 were inoculated onto Arabidopsis thaliana leaves. The inoculated leaves of Arabidopsis thaliana plants showed strong green fluorescence under UV light, thus indicating successful expression ofthe CTP-GFP reporter gene.
EXAMPLE 15
High throughput robotics.
Inoculation of subject organisms such as plants may be effected by using means of high throughput robotics. For example, Arabidopsis thaliana were grown in microtiter plates such as the standard 96-well and 384- well micro titer plates. A robotic handling arm then moved the plates containing the organism to a colony picker or other robot that may deliver inoculations to each plant in the well. By this procedure, inoculation was performed in a very high speed and high throughput manner. It is preferable in the case of plants that the organism be a germinating seed at least in the development cycle to enable access to the cells to be transfected. Equipment used for automated robotic production line could include, but not be limited to, robots of these types: electronic multichannel pipetmen, Qiagen BioRobot 9600®, Robbins Hydra liquid handler, Flexys Colony Picker, New Brunswick automated plate pourer, GeneMachines HiGro shaker incubator, New Brunswick floor shaker, three Qiagen BioRobots, MJ Research PCR machines (PTC-200, Tetrad), ABI 377 sequencer and Tecan Genesis RSP200 liquid handler.
EXAMPLE 16 Genomic DNA library construction in a recombinant viral nucleic acid vector. Genomic DNA represented in BAC (bacterial artificial chromosome) or
YAC (yeast artificial chromosome) libraries may be obtained from the Arabidopsis Biological Resource Center (ABRC). The BAC/YAC DNA can be mechanically size- fractionated, ligated to adapters with cohesive ends, and shotgun-cloned into recombinant viral nucleic acid vectors. Alternatively, mechanically size-fractionated genomic DNA can be blunt-end ligated into a recombinant viral nucleic acid vector. Recovered colonies can be prepared for plasmid minipreps with a Qiagen BioRobot 9600®. The plasmid DNA preps done on the BioRobot 9600® may be assembled in 96-well format and yield transcription quality DNA. The recombinant viral nucleic acid; 'Arabidopsis genomic DNA library may be analyzed by agarose gel electrophoresis (template quality control step) to identify clones with inserts. Clones with inserts can then be transcribed in vitro and inoculated onto N benthamiana and/or Arabidopsis thaliana. Selected leaf disks from transfected plants can then be taken for biochemical analysis.
Genomic DΝA from Arabidopsis typically contains a gene every 2.5 kb (kilobases) on average. Genomic DΝA fragments of 0.5 to 2.5 kb obtained by random shearing of DΝA were shotgun assembled in a recombinant viral nucleic acid expression/knockout vector library. Given a genome size of Arabidopsis of approximately 120,000 kb, a random recombinant viral nucleic acid genomic DΝA library would need to contain minimally 48,000 independent inserts of 2.5 kb in size to achieve IX coverage of 'the Arabidopsis genome. Alternatively, a random recombinant viral nucleic acid genomic DΝA library would need to contain minimally 240,000 independent inserts of 0.5 kb in size to achieve IX coverage ofthe Arabidopsis genome. Assembling recombinant viral nucleic acid expression/knockout vector libraries from genomic DΝA rather than cDΝA has the potential to overcome known difficulties encountered when attempting to clone rare, low-abundance mRΝA's in a cDΝA library. A recombinant viral nucleic acid expression/knockout vector library made with genomic DΝA would be especially useful as a gene silencing knockout library. In addition, the DHSPES expression/knockout vector library made with genomic DΝA would be especially useful for expression of genes lacking introns. Furthermore, other plant species with moderate to small genomes (e.g. rose, approximately 80,000 kb) would be especially useful for recombinant viral nucleic acid expression/knockout vector libraries made with genomic DΝA. A recombinant viral nucleic acid expression/knockout vector library could be made from existing BAC/YAC genomic DΝA or from newly-prepared genomic DΝA for any plant species. Alternatively, a recombinant viral nucleic acid expression/knockout vector library could be made with genomic DNA obtained from yeast, bacteria, or animals including humans.
EXAMPLE 17
Genomic DNA or cDNA library construction in a DHSPES vector, and fransfection of individual clones from said vector library onto T-DNA tagged or transposon tagged or mutated plants.
Genomic DNA or cDNA library construction in a recombinant viral nucleic acid vector, and fransfection of individual clones from the vector library onto T-DNA tagged or transposon tagged or mutated plants may be performed according the procedure set forth in Example 16. Such a protocol may be easily designed to complement mutations introduced by random insertional mutagenesis of T-DNA sequences or transposon sequences.
EXAMPLE 18
Production of a malarial CTL epitope genetically fused to the C terminus ofthe
TMVCP.
Malarial immunity induced in mice by inadiated sporozites of P. yoelii is also dependent on CD8+ T lymphocytes. Clone B is one ocytotoxic T lymphocyte (CTL) cell clone shown to recognize an epitope present in both the P. yoelii and P. berghei CS proteins. Clone B recognizes the following amino acid sequence; SYVPSAEQILEFVKQISSQ (SEQ ID NO: 43) and when adoptively fransfened to mice protects against infection from both species of malaria sporozoites. Construction of a genetically modified tobamovirus designed to carry this malarial CTL epitope fused to the surface of virus particles is set forth herein.
Construction of plasmid pBGC289. A 0.5 kb fragment of pBGCl 1 was
PCR amplified using the 5' primer TB2ClaI5' and the 3' primer C/-5_4vrII. The amplified product was cloned into the Smαl site of pBstKS+ (Stratagene Cloning
Systems) to form pBGC214. PBGC215 was formed by cloning the 0.15 kb Accl-Nsil fragment of pBGC214 into pBGC235. The 0.9 kb Ncol-Kpήl fragment from pBGC215 was cloned in pBGC152 to form pBGC216.
A 0.07 kb synthetic fragment was formed by annealing PYCS.2p with PYCS.2m and the resulting double stranded fragment, encoding the P. yoelii CTL malarial epitope, was cloned into the -4vrII site of pBGC215 made blunt ended by treatment with mung bean nuclease and creating a unique -4αtII site, to form pBGC262. A 0.03 kb synthetic -4αtII fragment was formed by annealing TLS.1EXP with TLS.1EXM and the resulting double stranded fragment, encoding the leaky-stop sequence and a stuffer sequence used to facilitate cloning, was cloned into Aaill digested ρBGC262 to form pBGC263. PBGC262 was digested with-4αtll and ligated to itself removing the 0.02 kb stuffer fragment to form pBGC264. The 1.0 kb Ncol-Kpnl fragment of pBGC264 was cloned into pSNC004 to form pBGC289. The virus TMV289 produced by transcription of plasmid pBGC289 in vitro contains a leaky stop signal resulting in the removal of four amino acids from the C terminus ofthe wild type TMV coat protein gene and is therefore predicted to synthesize a truncated coat protein and coat protein with a CTL epitope fused at the C terminus at a ratio of 20: 1. The recombinant TMVCP/CTL epitope fusion present in TMV289 is with the stop codon decoded as the amino acid Y (amino acid residue 156). The amino acid sequence ofthe coat protein of virus TMV216 produced by transcription ofthe plasmid pBGC216 in vitro, is truncated by four amino acids. The epitope SYVPSAEQILEFVKQISSQ (SEQ ID NO: 43) is calculated to be present at approximately 0.5% ofthe weight ofthe virion using the same assumptions confirmed by quantitative ELISA analysis.
Propagation and purification ofthe epitope expression vector. Infectious transcripts were synthesized from i-jcwl-linearized pBGC289 using T7 RNA polymerase and cap (7mGpppG) according to the manufacturer (New England Biolabs). An increased quantity of recombinant virus was obtained by passaging
Sample ID No. TMV289.1 lBla. Fifteen tobacco plants were grown for 33 days post inoculation accumulating 595 g fresh weight of harvested leaf biomass not including the two lower inoculated leaves. Purified Sample ID No. TMV289.11B2 was recovered (383 mg) at a yield of 0.6 mg virion per gram of fresh weight. Therefore, 3 g of 19-mer peptide was obtained per gram of fresh weight extracted. Tobacco plants infected with TMV289 accumulated greater than 1.4 micromoles of peptide per kilogram of leaf tissue.
Product analysis. Partial confirmation ofthe sequence ofthe epitope coding region of TMV289 was obtained by restriction digestion analysis of PCR amplified cDNA using viral RNA isolated from Sample ID No. TMV289.11B2. The presence of proteins in TMV289 with the predicted mobility ofthe cp fusion at 20 kD and the truncated cp at 17.1 kD was confirmed by denaturing polyacrylamide gel electrophoresis.
EXAMPLE 19 Identification of nucleotide sequences involved in the regulation of plant growth by cvtoplasmic inhibition of gene expression using viral derived RNA.
Antisense RNA has been used to down regulate gene expression in transgenic and transfected plants. The effectiveness of antisense on the inhibition of eukaryotic gene expression was first demonstrated by Izant et al. (Cell 36(4): 1007-1015 (1984)). Since then, the down-regulation of numerous genes from transgenic plants has been reported. In addition, there is evidence that "co- suppression" of genes occurs in transgenic plants containing sense RNA by readthrough transcription from distal promoters located on the opposite strand of the DNA (Van der Krol et al, Plant Cell 2(4):291-299 (1990) and Napoli et al, Plant Cell 2:279-289 (1990)). In this example and examples 20 and 21, we show: (1) a novel method for producing sense/antisense RNA using an RNA viral vector, (2) a process to produce viral-derived sense/antisense RNA in the cytoplasm, (3) a process to inhibit the expression of endogenous plant proteins in the cytoplasm by viral antisense RNA, (4) a process to "co-suppress" the expression of endogenous plant proteins in the cytoplasm by viral RNA, and (5) a process to produce transfected plants containing viral antisense RNA which is much faster than the time required to obtain genetically engineered antisense transgenic plants. Systemic infection and expression of viral antisense RNA occurs as short as four days post inoculation, whereas it takes several months or longer to create a single transgenic plant. This example demonstrates that novel positive strand viral vectors, which replicate solely in the cytoplasm, can be used to identify genes involved in the regulation of plant growth by inhibiting the expression of specific endogenous genes. This example will enable one to characterize specific genes and biochemical pathways in transfected plants using an RNA viral vector.
Tobamoviral vectors have been developed for the heterologous expression of uncharacterized nucleotide sequences in transfected plants. A partial Arabidopsis thaliana cDNA library was placed under the transcriptional control of a tobamovirus subgenomic promoter in a RNA viral vector. Colonies from transformed E. coli were automatically picked using a Flexys robot and transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid DNAs were isolated from overnight cultures using a BioRobot and infectious RNAs from 430 independent clones were directly applied to plants. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, moφhology, and color. One set of plants transfected with 740 AT #120 were severely stunted. DNA sequence analysis revealed that this clone contained an Arabidopsis GTP binding protein open reading frame (ORF) in the antisense orientation. This demonstrates that an episomal RNA viral vector can be used to deliberately manipulate a signal transduction pathway in plants. In addition, our results suggest that the Arabidopsis .antisense transcript can turn off the expression ofthe N benthamiana gene.
Construction of an Arabidopsis thaliana cDΝA library in an RΝA viral vector. An Arabidopsis thaliana CD4-13 cDΝA library was digested with NotI. DΝA fragments between 500 and 1000 bp were isolated by trough elution and subcloned into the NotI site of pBS740. E. coli C600 competent cells were transformed with the pBS740 AT library and colonies containing Arabidopsis cDΝA sequences were selected on LB Amp 50 ug/ml. Recombinant C600 cells were automatically picked using a Flexys robot and then transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug ml. Approximately 2000 plasmid DΝAs were isolated from overnight cultures using a BioRobot (Qiagen) and infectious RNAs from 430 independent clones were directly applied to plants.
Isolation of a gene encoding a GTP binding protein. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, moφhology, and color. Plants transfected with 740 AT #120 (FIGURE 19) were severely stunted.
DNA sequencing and computer analysis. A 782 bp NotI fragment of 740 AT # 120 containing the ADP-ribosylation factor (ARF) cDΝA was characterized. DΝA sequence of NotI fragment of 740 AT #120 (774 base pairs) is as follows: 5'-
CCGAAACATTCTTCGTAGTGAAGCAAAATGGGGTTGAGTTTCGCCAAG CTGTTTAGCAGGCTTTTTGCCAAGAAGGAGATGCGAATTCTGATGGTT GGTCTTGATGCTGCTGGTAAGACCACAATCTTGTACAAGCTCAAGCTC GGAGAGATTGTCACCACCATCCCTACTATTGGTTTCAATGTGGAAACT GTGGAATACAAGAACATTAGTTTCACCGTGTGGGATGTCGGGGGTCA GGACAAGATCCGTCCCTTGTGAGACACTACTTCCAGAACACTCAAGGT CTAATCTTTGTTGTTGATAGCAATGACAGAGACAGAGTTGTTGAGGCT CGAGATGAACTCCACAGGATGCTGAATGAGGACGAGCTGCGTGATGC TGTGTTGCTTGTGTTTGCCAACAAGCAAGATCTTCCAAATGCTATGAA CGCTGCTGAAATCACAGATAAGCTTGGCCTTCACTCCCTCCGTCAGCG TCATTGGTATATCCAGAGCACATGTGCCACTTCAGGTGAAGGGCTTTA TGAAGGTCTGGACTGGCTCTCCAACAACATCGCTGGCAAGGCATGAT GAGGGAGAAATTGCGTTGCATCGAGATGATTCTGTCTGCTGTGTTGGG ATCTCTCTCTGTCTTGATGCAAGAGAGATTATAAATATTATCTGAACC TTTTTGCTTTTTTGGGTATGTGAATGTTTCTTATTGTGCAAGTAGATGG TCTTGTACCTAAAAATTTACTAGAAGAACCCTTTTAAATAGCTTTCGT GTATTGT-3' (SEQ ID NO: 44) The nucleotide sequencing of 740 AT #120 was carried out by dideoxy termination using double stranded templates (Sanger et al, Proc. Natl. Acad. Sci. USA 24(12):5463-5467 (1977)). Nucleotide sequence analysis and amino acid sequence comparisons were performed using DNA Strider, PCGENE and NCBI Blast programs. The nucleotide sequence from 740 AT #120 was compared the human ADP-ribosylation factor (ARF3) W3384 (FIGURE 20).
Isolation of a cDNA encoding Nicotiana benthamiana ADP-ribosylation factor. Partial cDNAs from Nicotiana benthamiana leaf RNA may be isolated by polymerase chain reaction (PCR) using the following oligonucleotides: ATARFM1X, 5'-GCC TCG AGT GCA GCA TGG GGT TGT CAT TCG GAA AGT TGT TC-3' (upstream) (SEQ ID NO: 45) and ATARFA181 A, 5'-TAC CTA GGC CTT GCT TGC GAT GTT GTT GGA GAG-3' (downstream) (SEQ ID NO: 46). A full-length cDNA encoding ARF may be isolated by screening a cDNA library by colony hybridization using a 32P labeled Arabidopsis thaliana ARF PCR product. Hybridization can be carried out at 42°C for 48h in 50% formamide, 5X SSC, 0.02M phosphate buffer, 5X Denhart's solution, and 0.1 mg/ml sheared calf thymus DNA. Filters may be washed at 65°C in 0.1X SSC and 0.1% SDS, prior to autoradiography. PCR products and the ARF cDNA clones may be verified by dideoxynucleotide sequencing.
EXAMPLE 20 Identification of nucleotide sequences involved in the regulation of plant development by cvtoplasmic inhibition of gene expression using viral derived RNA.
This example again demonstrates that an episomal RNA viral vector can be used to deliberately manipulate a signal transduction pathway in plants. In addition, our results suggest that the Arabidopsis antisense transcript can turn off the expression ofthe N. benthamiana gene.
A partial Arabidopsis thaliana cDΝA library was placed under the transcriptional control of a tobamovirus subgenomic promoter in a RΝA viral vector. Colonies from transformed E. coli were automatically picked using a Flexys robot and transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid DΝAs were isolated from overnight cultures using a BioRobot and infectious RΝAs from 430 independent clones were directly applied to plants. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, moφhology, and color. One set of plants transfected with 740 AT #88 developed a white phenotype on the infected leaf tissue. DNA sequence analysis revealed that this clone contained an Arabidopsis G-protein coupled receptor open reading frame (ORF) in the antisense orientation.
Construction of an Arabidopsis thaliana cDNA library in an RNA viral vector. An Arabidopsis thaliana CD4-13 cDNA library was digested with NotI. DΝA fragments between 500 and 1000 bp were isolated by trough elution and subcloned into the NotI site of pBS740. E. coli C600 competent cells were transformed with the pBS740 AT library and colomes containing Arabidopsis cDΝA sequences were selected on LB Amp 50 ug/ml. Recombinant C600 cells were automatically picked using a Flexys robot and then transfened to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid DΝAs were isolated from overnight cultures using a BioRobot (Qiagen) and infectious RΝAs from 430 independent clones were directly applied to plants.
Isolation of a gene encoding a G-protein coupled receptor. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, moφhology, and color. Plants transfected with 740 AT #88 (FIGURE 21) developed a white phenotype on the infected leaf tissue.
DΝA sequencing and computer analysis. A 750 bp NotI fragment of 740 AT #88 containing the G-protein coupled receptor cDΝA was characterized. DΝA sequence of NotI fragment of 740 AT #88 (750 bp) is as follows: 5'-
TTTCGATCTAAGGTTCGTGATCTCCTTCTTCTCTACGAAGTTTACACTT TTTCTTCAAAGGAAACAATGAGCCAGTACAATCAACCTCCCGTTGGTG TTCCTCCTCCTCAAGGTTATCCACCGGAGGGATATCCAAAAGATGCTT ATCCACCACAAGGATATCCTCCTCAGGGATATCCTCAGCAAGGCTATC CACCTCAGGGATATCCTCAACAAGGTTATCCTCAGCAAGGATATCCTC CACCGTACGCGCCTCAATATCCTCCACCACCGCAAGCATCAGCAACA ACAGAGCAAGTCCTGGCTTTCTAGAAGGATGTCTTGCTGCTCTGTGTT GTTGCTGTCTCTTGGATGCTTGCTTCTGATTGGAGTCTCTCTCTCTCTG CATAAAGCTTCGGGATTTATTTGTAAGAGGGTTTTTGGGTTAAACAAA AACCTTAATTGATTTGTGGGGCATTAAAAATGAATCTCTCGATGATTC TCTTCGTTTATGTGGTAATGTTCTTCGGTTATAACATTTAACATTGCTA TCGACGTTCTGCCTAGTTGGATTTGATTATTGGGAATGTAAATTGGTT GGGAAGACACCGGGCCGTTAATGACAGAACCCGAACTGAGATGGAGT ATGATCTGAAATATTTAAAACAATCCTCGCGACATAGCCTCCAATCTC ATCGTAAATATTCTTTTTAAACTATTCCCAATCTTAACTTTTATAGTCT GGTCGACTGACCACTACTCTTTTTCCTT-3' (SEQ ID NO: 47) The nucleotide sequencing of 740 AT #88 was carried out by dideoxy termination using double stranded templates (Sanger et al, Proc. Natl. Acad. Sci. USA 24(12):5463-5467 (1977)). Nucleotide sequence analysis and amino acid sequence comparisons were performed using DNA Strider, PCGENE and NCBI Blast programs. The nucleotide sequence from 740 AT #88 was compared to Brassica rapa cDNA L33574 (FIGURE 22), the octopus rhodopsin mRNA X07797 (FIGURE 23). The amino acid sequence derived from 740 AT #88 was compared to an Arabidopsis EST ORF ATTS2938 (FIGURE 24) and octopus rhodopsin P31356 (FIGURE 25).
EXAMPLE 21 Identification of nucleotide sequences involved in the regulation of plant growth bv cvtoplasmic inhibition of gene expression using viral derived RNA.
Antisense RNA has been used to down regulate gene expression in transgenic and transfected plants. The puφose of this example is again to demonstrate that novel positive strand viral vectors, which replicate solely in the cytoplasm, can be used to identify genes involved in the regulation of plant growth by inhibiting the expression of specific endogenous genes. This example will enable one to characterize specific genes and biochemical pathways in transfected plants using an RNA viral vector.
The protocols of this example are analogous to those of examples 19 and 20. Tobamoviral vectors have been developed for the heterologous expression of uncharacterized nucleotide sequences in transfected plants. A partial Arabidopsis thaliana cDNA library was placed under the transcriptional control of a tobamovirus subgenomic promoter in a RNA viral vector. Colonies from transformed E. coli were automatically picked using a Flexys robot and transfered to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid DNAs were isolated from overnight cultures using a BioRobot and infectious RNAs from 430 independent clones were directly applied to plants. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, moφhology, and color. One set of plants transfected with 740 AT #2441 developed white leaves and were severely stunted. DNA sequence analysis revealed that this clone contained an Arabidopsis GTP binding protein open reading frame (ORF) in the positive orientation. This demonstrates that an episomal RNA viral vector can be used to deliberately manipulate a signal transduction pathway in plants.
Construction of an Arabidopsis thaliana cDNA library in an RNA viral vector. An Arabidopsis thaliana CD4-13 cDNA library was digested with NotI. DΝA fragments between 500 and 1000 bp were isolated by trough elution and subcloned into the NotI site of pBS740. E. coli C600 competent cells were transformed with the pBS740 AT library and colonies containing Arabidopsis cDΝA sequences were selected on LB Amp 50 ug/ml. Recombinant C600 cells were automatically picked using a Flexys robot and then transfered to a 96 well flat bottom block containing terrific broth (TB) Amp 50 ug/ml. Approximately 2000 plasmid DΝAs were isolated from overnight cultures using a BioRobot (Qiagen) and infectious RΝAs from 430 independent clones were directly applied to plants.
Isolation of a gene encoding a GTP binding protein. One to two weeks after inoculation, transfected Nicotiana benthamiana plants were visually monitored for changes in growth rates, moφhology, and color. Plants transfected with 740 AT #2441 developed white leaves and were severely stunted. DNA sequencing and computer analysis. A NotI fragment of 740 AT #2441 containing the RAN GTP binding protein ORF cDNA was characterized. DNA sequence of NotI fragment of 740 AT #2441 (350 bp) is as follows: 5'- CTTCACTTTCGCCGATGGCTCTACCTAACCAGCAAACCGTGGATTACC CTAGCTTCAAGCTCGTTATCGTTGGCGATGGAGGCACAGGGAAGACC ACATTTGTAAAGAGACATCTTACTGGAGAGTTTGAGAAGAAGTATGA ACCCACTATTGGTGTTGAGGTTCATCCTCTTGATTTCTTCACTAACTGT GGCAAGATCCGTTTCTACTGTTGGGATACTGCTGGCCAAGAGAAATTT GGTGGTCTTAGGGATGGTTACTACATCCATGGACAATGTGCTATCATC ATGTTTGATGTCACAAGCACGACTGACATACAAGAATGTTCC AACATG GCACCGTGATCTTTG-3' (SEQ ID NO. 48)
The nucleotide sequencing of 740 AT #2441 was carried out by dideoxy termination using double stranded templates (Sanger et al, Proc. Natl. Acad. Sci. USA 24(12):5463-5467 (1977)). Nucleotide sequence analysis and amino acid sequence comparisons were performed using DNA Strider, PCGENE and NCBI Blast programs. The nucleotide sequence from 740 AT #2441 was compared to tobacco RAN-Bl GTP binding protein (FIGURE 26). The nucleotide sequence from 740 AT #2441 was compared to human RAN GTP-binding protein (FIGURE 27).
EXAMPLE 22 Gene silencing/co-supression of genes induced by delivering an RNA capable of base pairing with itself to form double stranded regions.
Gene silencing has been used to down regulate gene expression in transgenic plants. Recent experimental evidence suggests that double stranded RNA may be an effective stimulator of gene silencing/co-suppression phenomenon in transgenic plant. For example, Waterhouse et al. (Proc. Natl. Acad. Sci. USA 95:13959-13964 (1998), incoφorated herein by reference) described that virus resistance and gene silencing in plants could be induced by simultaneous expression of sense and antisense RNA. Gene silencing/co- suppression of plant genes may be induced by delivering an RNA capable of base pairing with itself to form double stranded regions. This example shows: (1) a novel method for generating an RNA virus vector capable of producing an RNA capable of forming double stranded regions, and (2) a process to silence plant genes by using such a viral vector.
Step 1 : Construction of a DNA sequence which after it is transcribed would generate an RNA molecule capable of base pairing with itself. Two identical, or nearly identical, ds DNA sequences can be ligated together in an inverted orientation to each other (i.e., in either a head to tail or tail to head orientation) with or without a linking nucleotide sequence between the homologous sequences. The resulting DNA sequence can then be cloned into a cDNA copy of a plant viral vector genome.
Step 2: Cloning, screening, transcription of clones of interest using known methods in the art.
Step 3: Infect plant cells with transcripts from clones.
As virus expresses foreign gene sequence, RNA from foreign gene will base pair upon itself, forming double-stranded RNA regions. This approach could be used with any plant or non-plant gene and used to silence plant gene homologous to assist in identification ofthe function of a particular gene sequence.
EXAMPLE 23
Preparation of a Non-Infective Eastern Equine Encephalomyelitis Vims Nucleotide Sequence.
Methods for genetic manipulation of Eastern Equine Encephalomyelitis Virus are described in Garoff et al, Curr. Opin. Biotechnol. 9(5):464-9 (1998); Pushko et al, Virology 239(2^:389-401 (1997); and Davis et al., J. Virol.
2Q(6):3781-7 (1996), all of which are incoφorated herein by reference. A full- length cDNA copy ofthe Eastern Equine Encephalomyelitis Virus (EEEV) genome is prepared and inserted into the Pstl site of pUC18 as described by Chang et al, J. Gen. Virol. 68:2129 (1987). The sequence for the viral coat protein and its adjacent El and E2 glycoprotein fransmissibility factors are located on the region conesponding to the 26S RNA region. The vector containing the cDNA copy ofthe EEEV genome is digested with the appropriate restriction enzymes and exonucleases to delete the coding sequence ofthe coat protein and the El and E2 proteins (structural protein coding sequence).
For example, the structural protein coding sequence is removed by partial digestion with Mbol, followed by religation to remove a vital portion ofthe structural gene. Alternatively, the vector is cut at the 3 '-end ofthe viral structural gene. The viral DNA is sequentially removed by digestion with Bal31 or Micrococcal SI nuclease up through the start codon ofthe structural protein sequence. The DNA sequence containing the sequence ofthe viral 3 '-tail is then ligated to the remaining 5 '-end. The deletion ofthe coding sequence for the structural proteins is confirmed by isolating EEEV RNA and using it to infect an equine cell culture. The isolated EEEV RNA is found to be non-infective under natural conditions.
Alternatively, only the coding sequence for the coat protein is deleted and the sequence for the El and E2 glycoproteins remain in the vector containing the cDNA copy ofthe EEEV genome. In this case, the coat protein coding sequence is removed by partial digestion with Mbol followed by religation to reattach the 3 '-tail ofthe virus. This will remove a vital portion ofthe coat protein gene.
A second alternative method for removing only the coat protein sequence is to cut the vector at the 3 '-end ofthe viral coat protein gene. The viral DNA is removed by digestion with Bal31 or Micrococcal S 1 nuclease up through the start codon ofthe coat protein sequence. The synthetic DNA sequence containing the sequence ofthe 3 '-tail is then ligated to the remaining 5 '-end.
The deletion ofthe coding sequence for the coat protein is confirmed by isolating EEEV RNA and using it to infect an equine cell culture. The isolated EEEV RNA is found to be non-infective under natural conditions.
EXAMPLE 24 Preparation of a Non-Transmissible Sindbis Virus Nucleotide Sequence.
Methods for genetic manipulation of Sindbis viruses are described in Garoff et al, Curr. Opin. Biotechnol. 9(5):464-9 (1998); Agapov et al, Proc. Natl. Acad. Sci. USA 95(22): 12989-94 (1998); Frolov et al, J. Virol. Apr;71(4):2819-29 (1997), all of which are incoφorated herein by reference. A full-length cDNA copy ofthe Sindbis virus genome is prepared and inserted into the Smαl site of a plasmid derived from pBR322 as described by Lindquist et al, Virology 151:10 (1986). The sequence for the viral coat protein and the adjacent El and E2 glycoprotein fransmissibility factors are located on the region conesponding to the 26S RNA region. The vector containing the cDNA copy of the Sindbis virus genome is digested with the appropriate restriction enzymes and exonucleases to delete the coding sequence for the structural proteins.
For example, the structural protein coding sequence is removed by partial digestion with Biήl, followed by religation to remove a vital portion ofthe structural gene. Alternatively, the vector is cut at the 3'-end ofthe viral nucleic acid. The viral DNA is removed by digestion with Ba 1 or Micrococcal S 1 nuclease up through the start codon ofthe structural protein sequence. The synthetic DNA sequence containing the sequence ofthe viral 3 '-tail is then ligated to the remaining 5 '-end. The deletion ofthe coding sequence for the structural proteins is confirmed by isolating Sindbis RNA and using it to infect an avian cell culture. The isolated Sindbis RNA is found to be non-infective under natural conditions.
Alternatively only the coding sequence for the coat protein is deleted and the sequence for the El and E2 glycoproteins remain in the vector containing the cDNA copy ofthe Sindbis genome. In this case, the coat protein coding sequence is removed by partial digestion with Aflll followed by religation to reattach the 3 '-tail ofthe virus.
A second alternative method for removing only the coat protein sequence is to cut the vector at the 3 '-end ofthe viral nucleic acid. The viral DNA is removed by digestion with BaB 1 or Micrococcal S 1 nuclease up through the start codon ofthe coat protein sequence (the same start codon as for the sequence for all the structural proteins). The synthetic DNA sequence containing the sequence ofthe 3 '-tail is then ligated to the remaining 5 '-end.
The deletion ofthe coding sequence for the coat protein is confirmed by isolating Sindbis RNA and using it to infect an avian cell culture. The isolated Sindbis RNA is found to be non-infective under natural conditions. EXAMPLE 25 Preparation of a Non-Transmissible Western Equine Encephalomyelitis Virus Nucleotide Sequence.
Methods for genetic manipulation of Western Equine Encephalomyelitis Vims are described in Garoff et al, Curr. Opin. Biotechnol. 9(5):464-9 (1998) and Weaver et al, J. Virol. 21(l):613-23 (1997), both of which are incoφorated herein by reference. A full-length cDNA copy ofthe Western Equine Encephalomyelitis Vims (WEEV) genome is prepared as described by Hahn et al, Proc. Natl. Acad. Sci. USA 85:5997 (1988). The sequence for the viral coat protein and its adjacent El and E2 glycoprotein fransmissibility factors are located on the region conesponding to the 26S RNA region. The vector containing the cDNA copy ofthe WEEV genome is digested with the appropriate restriction enzymes and exonucleases to delete the coding sequence ofthe coat protein and the El and E2 proteins (structural protein coding sequence). For example, the structural protein coding sequence is removed by partial digestion with NαcI, followed by religation to remove a vital portion ofthe structural protein sequence. Alternatively, the vector is cut at the 3 '-end ofthe structural protein DΝA sequence. The viral DΝA is removed by digestion with -5α/31 or Micrococcal SI nuclease up through the start codon ofthe structural protein sequence. The DΝA sequence ofthe viral 3 '-tail is then ligated to the remaining 5 '-end. The deletion ofthe coding sequence for the structural proteins is confirmed by isolating WEEV RΝA and using it to infect a Vero cell culture. The isolated WEEV RΝA is found to be non-infective under natural conditions. Alternatively, only the coding sequence for the coat protein is deleted and the sequence for the El and E2 glycoproteins remain in the vector containing the cDΝA copy ofthe WEEV genome. In this case, the coat protein coding sequence is removed by partial digestion with HgiAI followed by religation to reattach the 3 '-tail of the vims.
A second alternative method for removing only the coat protein sequence is to cut the vector at the 3'-end ofthe viral coat protein sequence. The viral
DΝA is removed by digestion with -5α/31 or Micrococcal SI nuclease up through the a vital portion ofthe coat protein sequence. The DΝA sequence containing the sequence ofthe 3'-tail is then ligated to the remaining 5'-end. The deletion ofthe coding sequence for the coat protein is confirmed by isolating WEEV RNA and using it to infect a Vero cell culture. The isolated WEEV RNA is found to be non-infective, i.e., biologically contained, under natural conditions.
EXAMPLE 26 Preparation of a Non-Infective Simian Vims 40 Nucleotide Sequence.
Methods for genetic manipulation of Simian vimses are described in Piechaczek et al, Nucleic Acids Res. 22(2):426-428 (1999) and Chittenden et al, J. Virol. 65(11):5944-51 (1991), both of which are incoφorated herein by reference. A full-length cDNA copy ofthe Simian vims 40 (SV40) genome is prepared, and inserted into the AccI site of plasmid pCW18 as described by Wychowski et al, J. Virol. 61:3862 (1987). The nucleotide sequence ofthe viral coat protein VPI is located between position 1488 and 2574 ofthe genome. The vector containing the DNA copy ofthe SV40 genome is digested with the appropriate restriction enzymes and exonucleases to delete the coat protein coding sequence.
For example, the VPI coat protein coding sequence is removed by partial digestion with -5αmHI nuclease, and then treated with EcoRI, filled in with Klenow enzyme and recircularized. The deletion ofthe coding sequence for the coat protein VPI is confirmed by isolating SV40 RNA and using it to infect simian cell cultures. The isolated SV40 RNA is found to be non-infective, i.e., biologically contained, under natural conditions.
EXAMPLE 27
Novel requirements for production of infectious viral vector in vitro derived RNA transcripts.
This example demonstrates the production of highly infectious viral vector transcripts containing 5' nucleotides with reference to the vims vector. Construction of a library of subgenomic cDNA clones of TMV and BMV has been described in Dawson et al, Proc. Natl. Acad. Sci. USA 83:1832-1836
(1986) and Ahlquist et al, Proc. Natl. Acad. Sci. USA 81:7066-7070 (1984).
Nucleotides were added between the transcriptional start site ofthe promoter for in vitro transcription, in this case T7, .and the start ofthe cDNA of TMV in order to maximize transcription product yield and possibly obviate the need to cap vims transcripts to insure infectivity. The relevant sequence is the T7 promoter ...TATAGΛTATTTT.... where the Λ indicates the base preceding is the start site for transcription and the bold letter is the first base ofthe TMV cDNA. Three approaches were taken: 1) addition of G, GG or GGG between the start site of transcription and the TMV cDNA ( ... TATAGGTATTT... and associated sequences); 2) addition of G and a random base (GN or N2) or a G and two random bases (GNN or N3) between the start site of transcription and the TMV cDNA (...TATAGNTATTT... and associated sequences), and the addition of a GT and a single random base between the start site of transcription and the TMV cDNA (...TATAGTNGTATTT... and associated sequences). The use of random bases was based on the hypothesis that a particular base may be best suited for an additional nucleotide attached to the cDNA, since it will be complementary to the normal nontemplated base incoφorated at the 3 '-end ofthe TMV (-) strand RNA. This allows for more ready mis-initiation and restoration of wild type sequence. The GTN would allow the mimicking of two potential sites for initiation, the added and the native sequence, and facilitate more ready mis-initiation of transcription in vivo to restore the native TMV cDNA sequence. Approaches included cloning GFP expressing TMV vector sequences into vectors containing extra G, GG or GGG bases using standard molecular biology techniques. Likewise, full length PCR of TMV expression clone 1056 was done to add N2, N3 and GTN bases between the T7 promoter and the TMV cDNA. Subsequently, these PCR products were cloned into pUC based vectors. Capped and uncapped transcripts were made in vitro .and inoculated to tobacco protoplasts or Nicotiana benthamiana plants, wild type and 30k expressing transgenics. The results are that an extra G, ... TATAGGTATTTT..., or a GTC, ... TATAGTCGTATTTT..., were found to be well tolerated as additional 5' nucleotides on the 5' of TMV vector RNA transcripts and were quite infectious on both plant types and protoplasts as capped or non-capped franscripts. Other sequences may be screened to find other options. Clearly, infectious transcripts may be derived with extra 5' nucleotides. Other derivatives based on the putative mechanistic function ofthe GTN strategy that yielded the GTC functional vector are to use multiple GTN motifs preceeding the 5' most nt ofthe vims cDNA or the duplication of larger regions ofthe 5'-end ofthe TMV genome. For example: TATAΛGTNGTNGTATT... or TATAΛGTNGTNGTNGTNGTATT.... or TATAΛGTATTTGTATTT... . In this manner the replication mediated repair mechanism may be potentiated by the use of multiple recognition sequences at the 5 '-end of transcribed RNA. The replicated progeny may exhibit the results of reversion events that would yield the wild type vims 5' vims sequence, but may include portions or entire sets of introduced additional base sequences. This strategy can be applied to a range of RNA vimses or RNA viral vectors of various genetic anangements derived from wild type vims genome. This would require the use of sequences particular to that ofthe vims used as a vector.
EXAMPLE 28
Infectivity of uncapped transcripts.
Two TMV-based vims expression vectors were initially used in these studies pBTI 1056 which contains the T7 promoter followed directly by the vims cDNA sequence f... TAT AGT ATT...), and pBTI SBS60-29 which contains the T7 promoter (underlined) followed by an extra guanine residue then the vims cDNA sequence (...TATAGGTATT...). Both expression vectors express the cycle 3 shuffled green fluorescent protein (GFPc3) in localized infection sites and systemically infected tissue of infected plants. Transcriptions of each plasmid were carried out in the absence of cap analogue (uncapped) or in the presence of 8-fold greater concentration of RNA cap analogue than rGTP (capped).
Transcriptions were mixed with abrasive and inoculated on expanded older leaves of a wild type Nicotiana benthamiana (Nb) plant and a Nb plant expressing a TMV Ul 30k movement protein transgene (Nb 3 OK). Four days post inoculation (dpi) long wave UV light was used to judge the number of infection sites on the inoculated leaves ofthe plants. Systemic, noninoculated tissues, were monitored from 4 dpi on for appearance of systemic infection indicating vascular movement ofthe inoculated vims. Table 1 shows data from one representative experiment. Table 1
Construct Local infection sites Systemic Infection
Nb Nb 30K Nb Nb 30K pBTI1056
Capped 5 6 yes yes
Uncapped 0 5 no yes
PBTI SBS60-29 Capped 6 6 yes yes
Uncapped 1 5 yes yes
Nicotiana tabacum protoplasts were infected with either capped or uncapped transcriptions (as described above) of pBTI SBS60 which contains the T7 promoter followed directly by the vims cDNA sequence ("TATAGTATT...). This expression vector also expresses the GFPc3 gene in infected cells and tissues. Nicotiana tabacum protoplasts were transfected with 1 mcl of each transcriptions. Approximately 36 hours post infection fransfected protoplasts were viewed under UV illumination and cells showing GFPc3 expression. Approximately 80% cells transfected with the capped PBTI SBS60 transcripts showed GFP expression while 5% of cells transfected with uncapped franscripts showed GFP expression. These experiments were repeated with higher amounts of uncapped inoculum. In this case a higher proportion of cells, >30% were found to be infected at this time with uncapped transcripts, where >90% of cells infected with greater amounts of capped franscripts were scored infected. These results indicate that, contrary to the practiced art in scientific literature and in issued patents (Ahlquist et al, U.S. Patent No. 5,466,788), uncapped transcripts for vims expression vectors are infective on both plants and in plant cells, however with much lower specific infectivity. Therefore, capping is not a prerequisite for establishing an infection of a vims expression vector in plants; capping just increases the efficiency of infection. This reduced efficiency can be overcome, to some extent, by providing excess in vitro transcription product in an infection reaction for plants or plant cells.
The expression ofthe 30K movement protein of TMV in transgenic plants also has the unexpected effect of equalizing the relative specific infectivity of uncapped verses capped franscripts. The mechanism behind this effect is not fully understood, but could arise from the RNA binding activity ofthe movement protein stabilizing the uncapped transcript in infected cells from prereplication cytosolic degradation.
Extra guanine residues located between the T7 promoter and the first base of a vims cDNA lead to increased amount of RNA transcript as predicted by previous work with phage polymerases. These polymerases tend to initiate more efficiently at ... TATAGG or ...TATAGGG than ...TATAG. This has an indirect effect on the relative infectivity of uncapped transcripts in that greater amounts are synthesized per reaction resulting in enhanced infectivity.
Data concerning cap dependent transcription of pBTI1056 GTN#28.
TMV-based vims expression vector pBTI 1056 GTN#28 which contains the T7 promoter (underlined) followed GTC bases (bold) then the vims cDNA sequence (...TATAGTCGTATT...). This expression vector expresses the cycle 3 shuffled green fluorescent protein (GFPc3) in localized infection sites and systemically infected tissue of infected plants. This vector was transcribed in vitro in the presence (capped) and absence (uncapped) of cap analogue. Transcriptions were mixed with abrasive and inoculated on expanded older leaves of a wild type Nicotiana benthamiana (Nb) plant and a Nb plant expressing a TMV Ul 30k movement protein transgene (Nb 30K). Four days post inoculation (dpi) long wave UV light was used to judge the number of infection sites on the inoculated leaves ofthe plants. Systemic, non-inoculated tissues, were monitored from 4 dpi on for appearance of systemic infection indicating vascular movement ofthe inoculated vims. Table 2 shows data from two representative experiments at 11 dpi.
Table 2
Construct Local infection sites Systemic Infection
Nb Nb 30K Nb Nb
30K
Experiment 1 pBTI1056 GTN#28 Capped 18 25 yes yes Uncapped 2 4 yes yes Experiment 2 pBTI1056 GTN#28
Capped 8 12 yes yes
Uncapped 3 7 yes yes
These data further support the claims concerning the utility of uncapped transcripts to initiate infections by plant vims expression vectors and further demonstrates that the introduction of extra, non- viral nucleotides at the 5 '-end of in vitro transcripts does not preclude infectivity of uncapped transcripts.
EXAMPLE 29 Methods for inhibiting endogenous proteolytic activity in plants in vivo.
Elicitor recognition and the response cascades occurring in plants form an essential link between the environmental stress and plant survival responses. Many products are induced following induction by environmental stimuli or pathogen infection, which include, but are not limited to, proteases, protease inhibitors, alkaloids and other metabolites. Glazebrook, et al, Annu. Rev. Gen. 31:547-569 (1997); Grahm, et al, J. Biol. Chem. 260:6555-6560 (1985); and Ryan, et al, Ann. Rev. Cell Dev. Biol. 14:1-17 (1998), all incoφorated herein by reference. The components ofthe recognition and response pathways are poorly understood, yet have tremendous practical value for input traits in genetically improved crops. Traditional methods of mutagenesis or biochemistry are leading to slow and incremental advances in our understanding. However, if these pathways are to be elucidated, understood and exploited, more rapid discovery methods must be brought to bear on the problem. Vims expression vectors capable of either overexpressing gene products or suppressing the expression of particular endogenous host genes provide a unique tool to discover the nature of the genes whose products contribute to the response pathways.
This example describes methods for inhibiting endogenous plant proteases which interfere with the expression and purification of recombinant proteins in plants. In particular, this example shows methods for inhibiting proteolytic activity in planta which is responsible for the degradation of a viral vector- expressed recombinant protein. These methods are also applicable to the protection of recombinant proteins expressed via a stable transformation system or endogenous plant proteins. Viral vectors have been constructed to include an N-terminal signal peptide sequence. This sequence directs the recombinant protein through the secretory pathway to the cell surface and ultimately accumulating in the plant intercellular fluid (IF) (Kermode, Critical Reviews in Plant Sciences 5(4):285-423 (1996), incoφorated herein by reference). In some instances, the target protein was cleaved abenantly in vivo. Three examples include a mammalian growth hormone and single chain antibody and an avian interferon. In vivo residence time in the IF led to the accumulation ofthe cleavage product(s) as detected by immunoblotting. Cleavage was either complete in vivo or continued in vitro following IF extraction (Co-pending U.S. Patent Application Serial No.
09/037,751, incoφorated herein by reference). Quantitation of western blots using UVP Gelbase/Gelblot-Pro software revealed as much as 40-50% ofthe expressed protein was cleaved.
We designed in vitro experiments to inhibit the plant proteolytic activity. When we added protease inhibitors to an isolated IF fraction in vitro, we were able to inhibit further degradation of our recombinant protein. In addition, when we treated an IF fraction from an unrelated virally infected plant with protease inhibitors and incubated that with a known susceptible protein, we completely inhibited the protease and protected the protein from degradation. Following the observation that the cleavage was occurring in vivo by a plant protease that could be inhibited by proteinase inhibitors, we designed experiments to inhibit this activity in planta. Three possible methods to inhibit the protease are as follows: 1. Recombinant expression of a proteinase inhibitor: The activity ofthe plant protease may be inhibited by the recombinant expression of a plant proteinase inhibitor secreted to the IF based on the following results:
(1) We cloned a tomato proteinase inhibitor gene (Wingate, et al, J. Biol. Chem. 264:17734-17738 (1989), incoφorated herein by reference) into our viral vector. We verified that the expression ofthe recombinant inhibitor protein was in the IF fraction by western detection. Virally-expressed proteinase inhibitor protected our recombinant (E. co/t-derived) mammalian growth hormone protein standard that was known to be susceptible to the plant protease in an in vitro assay; (2) Virally-expressed proteinase inhibitor specifically inhibited an IF-localized protease in vivo as per detection on Zymogram gelatin Tris-glycine gels; and
(3) Co-inoculation ofthe vims vector proteinase inhibitor construct and the viral vector mammalian growth hormone construct resulted in the expression of both proteins in systemic leaves and partial protection ofthe growth hormone in the IF.
Another possible approach is to combine transgenic plants and virally- expressed proteins. One could either inoculate the vims vector proteinase inhibitor construct on transgenic plants expressing a target protein or make a proteinase inhibitor transgenic plant and inoculate with a viral vector construct expressing the target sequence.
2. Induction of endogenous proteinase inhibitors:
One could also induce the endogenous production of plant proteinase inhibitors using an elicitor. For example, jasmonic acid (JA) is produced as part of a general plant defense mechanism and is known to induce specific proteinase inhibitors (Lightner et al, JMol Gen Genet. 241:595-601 (1993), incoφorated herein by reference). Exogenous application of JA as been used to induce a plant defense response in Nicotiana attenuata to against herbivore attack (Baldwin, PNAS, 95(14):8113-8118 (1998), incoφorated herein by reference). To protect against specific endogenous proteolysis of a recombinant protein, one could treat the plant material with JA to induce the synthesis ofthe proteinase inhibitor and then inoculate with a viral vector construct expressing the target sequence.
The desired phenotype in host plants used for gene discovery program using vims expression vectors is reduced proteolytic activities in the cytosol, secretory pathway or apoplast so to increase the half-life of virally produced proteins. This will allow virally expressed proteins to exert their influence on plant biochemistry, development and growth optimally. Rapid or premature degradation may reduce the amount ofthe expressed protein below the necessary threshold to exert a measurable effect. Transgenic expression of protease inhibitors, such as those induced by the systemin pathway (Ryan, et al, Ann. Rev. Cell Dev. Biol. 14:1-17 (1998)), will provide a continuous source of inhibitor to slow particular degradation processes. Conversely, as outlined in the example above, treating vims vector infected plants with JA will induce the response pathways and result in the expression of various inhibitors in infected/treated plants. In both ways, by specific protease inhibitor expression or by induction of response cascade, the half-lives of many proteins, whose presence is requisite for detecting the novel functions of gene products, are increased.
EXAMPLE 30 Selection of optimized RNA and protein activities by use of vims vectors to express libraries of sequence variants generated by means of in vitro mutagenenisis and/or recombination.
DNA shuffling is a process for recursive mutation and in vitro recombination, performed by random fractionation .and re-assembly of a gene of interest to generate a pool of related, yet not identical, gene sequences. Stemmer et al, U.S. Patent Nos. 5,830,721 and 5,811,238, incoφorated herein by reference. Fractionation occurs through the treatment of DNA sequences with limiting amounts of nuclease and re-assembly typically requires two steps, first primerless PCR to re-align fragments based on local homology and then primer driven PCR to recover full length assembled fragments. The advantages of this approach are many: (1) gene or sequence function can be optimized or improved without first determining the sites within the sequence that require alteration; (2) several generations of "improved" sequences can be generated, given proper selection, in time frame unattainable by natural circumstances; (3) mutations of every sort are randomly dispersed throughout the gene sequence allowing a "saturation" approach to determine the genetic potential of a given sequence. Crameri et al, Nature Biotech. 14:315 (1996); Crameri et al, Nature Biotech. 15:436 (1997); Zhang et al, Proc. Natl. Acad. Sci. USA 94:4504 (1997); Zhao and Arnold, Proc. Natl. Acad. Sci. USA 94:7997 (1997).
DNA shuffling has been successfully applied to prokaryotic or cell-based systems to select sequences of desired protein activities. However, the ability to introduce shuffled sequences throughout an organism in a rapid and high throughput manner necessary to harness the full potential of this technology has not been demonstrated. In this example, we describe the use of plant vims expression vectors to bear populations of shuffled DNA sequences and were applied to plant hosts and those sequences with desired properties were selected and further characterized. The properties confened by the selected shuffled sequences were demonstrated to be inherited by progeny vimses. Two aspects that must be continually improved in vims expression vectors are: 1) their ability to move in a facile manner both locally and systemically in plants, and 2) the need for greater levels of foreign gene expression. Both of these functions can potentially be affected by modifications to the 30 kDa ORF. Functions within the 30 kDa coding region include the movement protein (MP), the vims origin of virion assembly and the subgenomic promoter used for coat protein synthesis. This is the promoter used for expression of foreign gene sequences in most tobamovirus vectors. It has been demonstrated that natural variation in viral populations can be the substrates for selection of improved characters in viral vectors can lead to dramatic improvements in their performance. This work further showed that single or multiple amino acid substitutions in the 30 kDa ORF can significantly effect the movement properties of vims vectors. Vimses function genomically, as an integrated whole of RNA and protein sequences, suggesting that either individual elements, such as the 30 kDa ORF, or the entire plant vims genomes could be subjected to shuffling so to improve plant vims vector performance. Obvious following the application of shuffling in this context is the use of plant vims vectors to house shuffled foreign gene populations which, following inoculation onto plants, gene products with optimized activities can be selected. Plant vims vectors are the ultimate tool for shuttling genes into plants for selection of optimized activities. No other tool, transient or stable expression methods, can match the ability of plant vims vectors to develop optimized genes for plant activities.
Experiments to demonstrate the ability of plant vimses to house libraries of sequence variants focused on optimizing the coding region for the 30 kDa movement protein from TMV Ul for movement properties in Nicotiana tabacum and subgenomic promoter activity responsible for coat protein mRNA production. The base expression vector, p30B GFP, was used as a tool to be modified as desired for a shuffling vector. p30B GFP vector is the TMV Ul infectious cDNA (bases 1-5756) containing the 5' NTR, replicase genes (126 and 183 kDa proteins), movement protein gene with associated subgenomic promoter and an RNA leader derived from the Ul coat protein gene. Following the RNA leader is a unique Pad site and the green fluorescent protein (GFP) gene. Following a unique Xhol site, the clone continues with a portion ofthe TMV Ul 3' NTR followed by a subgenomic promoter, coat protein gene and 3' NTR from TMV U5 strain.
The first stage ofthe project required the construction of a vector into which shuffled DNA fragments could be reintroduced. The polymerase chain reaction (PCR) was used to amplify a DNA fragment from the TMV vector p30B comprising the T7 promoter, 5' non-translated region (NTR), and the reading frames for the 126 and 183 kDa replicase proteins. The 5' primer covered the T7 promoter and initial bases ofthe TMV genome while the second primer modified the context surrounding the start codon for the 30 kDa MP of TMV. This allowed DNA fragments to be ligated into the modified vector, designated 3 OB GFP d30K, as vrll, Pa restriction endonuclease digested fragments.
Native TMV 183/30 kDa junction and 30k/GFP junction
183 kDa ORF AGT TTG TTT ATA GAT GGC TCT AGT TGT TAA AGG AAA A... GAT TCG TTT TAA (cont.) S L F I D G S S C *
M A L V V K G K ... D S F * 30kDa ORF ATAgaTCTTACAGTATCACTACTCCATCTCAGTTCGTGTTCTTGTCATTAATTAA ATG ...
Pad GFP ORF
Modified TMV 183/30 kDa/GFP junction (without 30 kDa gene): P30B d30k ANP 183 kDa ORF
AGTTTG TTTATA GAc GGCTCTAGTTGTTAA g CCTAGG A GCCGGCTTAATTAA ATG...
GFP ORF
S L F I D G S S C * Ayrll NeoMI Pad
Modified TMV 183/30 kDa junction and 30k/GFP junction (with 30 kDa gene present)
183 kDa ORF
AGT TTG TTT ATA GAT GGC TCT AGT TGT TAA g ATG GCT CTA GTT GTT AAA GGA AAA..
S L F I D G S S C * Ayrll M A L V V K G K ... ..GTTTTAAATAgaTCTTACAGTATCACTACTCCATCTCAGTTCGTGTTCTTGTCATTAATTAA ATG ..
Pad GFP ORF
This modification allowed the ready insertion of modified 30 kDa gene fragments into a vims vector and have them expressed in plant cells, tissues or systemically. The wild type GFP ORF is the reporter gene since the visual level of fluorescence as observed under long wave UV light conelates directly with levels of GFP protein present in plant tissues. This has been demonstrated by looking at different vims vectors expressing GFP, each having different strength subgenomic promoters, that were infected in plants and GFP levels determined by UV fluorescence and Western blotting using anti-GFP antibodies.
The procedure for shuffling ofthe 30 kDa gene is similar to that described by Crameri et al, Nature Biotech. 15:436 (1997), and contained the following steps. The 30 kDa gene fragment also containing the coat protein RNA leader was amplified from tobamoviras expression vectors using primers: TMVUl 3 OK 5'A (5'-GGCCCTAGGATGGCTCTAGTTGTTAAAGG-3') (SEQ ID NO: 49) and 3-5' Pac primer (5'-GTTCTTCTCCTTTGCTAGCCATTTAATTAATGAC- 3') (SEQ ID NO: 50). The PCR DNA product was gel isolated and then incompletely digested with DNasel. DNA fragments of 500 bp or smaller were isolated by using DEAE blotting paper technique and then eluted. Purified DNA fragments were mixed together with taq DNA polymerase and allowed to "reassemble" for 40 cycles. "Reassembly" reaction was assayed by gel electrophoresis for DNA bands of approximately 800-850 bp. Approximately 1 mcl ofthe "reassembly" reaction was then subjected to PCR using primers TMV Ul 30K 5'A and 3-5' Pac that hybridize to terminal DNA ends of reassembled fragments. The reassembled fragments will be gel isolated and digested with restriction enzymes Avrll and Pad (sites present in the terminal primers) to allow for facile cloning back into the p30B d30k ANP digested with -4vrII and Pad. Ligations of shuffled genes into p30B d30k ANP resulted in pooled libraries of sequences containing 100 to 50,000 members in five separate experiments. Pooled vims vectors with libraries of variant 30 kDa coding regions were transcribed with T7 RNA polymerase and then inoculated by standard PEG fransfection into 0.5 x 106 Nicotiana tabacum protoplasts per sample. Inspection of cells 24 hours post inoculation revealed varied intensities of GFP fluorescence in individual cells indicating possible different levels of GFP accumulation and possible effects in the subgenomic promoter activity as desired. Cells were incubated for 48 hours post inoculation, harvested by centrifugation and then lysed using freeze/thaw and grinding with a mortar and pestle. The virions that accumulated in protoplasts were released by the grinding.
The protoplast extracts were then inoculated on leaves of wild type and transgenic Nicotiana tabacum c.v. MD609 expressing the TMV Ul 30 kDa movement protein. Three to five days post inoculation localized infection sites were observed expressing GFP. A variety of intensities of GFP fluorescence were observed varying from that observed with the wild type GFP gene to much duller to very bright, as observed from the viral expression ofthe shuffled GFP gene of Crameri et al, Nature Biotech. (1996) (GFPc3). The occurrence of vimses expressing enhanced GFP fluorescence varied between libraries tested from 1/200 to 1/50 infection foci depending on libraries tested. These local infection sites with enhanced GFP fluorescence were excised from the leaves and inoculated on Nicotiana benthamiana plants. The bright local infection variants were then purified on the inoculated leaves of these plants from contaminating vimses expressing less GFP protein. These vimses expressing brighter GFP proteins were found to express larger amounts of GFP protein in systemic tissues than the starting p30B GFP vims. Sequencing and genetic studies indicated that no mutations accumulated in the GFP genes and that the effects were due to mutations in the TMV Ul 30 kDa ORF that up regulated the subgenomic promoter. The accumulation of GFP in the shuffled variants with brighter GFP phenotype was 3.4 fold greater than that produced by p30B GFP as measured by quantitative Western blotting of plant extracts using an anti-GFP sera. These data demonstrated that shuffling could be used to enhance the cis-acting functions of RNA sequences and that plant RNA vims expression vectors are effective tools to shuttle large diversity of sequence variants in whole plants and plant cells.
The protoplast extracts isolated from transfections with vims libraries were inoculated on one half of wild type Nicotiana tabacum c.v. MD609 and Nicotiana benthamiana leaves. To the other leaf half, vims derived from p30B GFP was inoculated. Some infection sites resulting from infection of vimses containing shuffled 30 kDa ORFs grew more rapidly than those ofthe average from p30B GFP. These events occurred at a frequency of 1/100 to 1/500 infection foci depending on the vims library analyzed. These more rapidly growing infection foci were excised and inoculated on young Nicotiana tabacum c.v. MD609 plants. As a control, p30B GFP was inoculated on similar sized and aged plants. The p30B GFP vector does not move systemically on tobacco plants. However, some shuffled 30 kDa ORF variant vectors, that were identified as rapidly growing local infection sites, were able to move systemically on tobacco plants. The movement was primarily on phloem source tissue and were localized to veins and circular spots in green lamina. This movement ability was reproducible in multiple inoculations of these individual vims variants. Sequence analysis ofthe vimses containing shuffled 30 kDa ORFs capable of systemic movement on Nicotiana tabacum plants demonstrated that localized amino acid substitutions were present and responsible for altered movement phenotype.
Further recursive shuffling ofthe top 5-10% of GFP expressing vectors or those that demonstrated an enhanced ability to invade systemic tissues of tobacco could be carried out to meld synergistic mutations to lead to greater gains in expression or vims movement. Likewise, the 30 kDa ORFs that contain the most potent subgenomic promoters and most enabled movement activities in tobacco could be shuffled together so to bring both sets of properties into the same 30 kDa ORF. It is also apparent from these data that by testing vims expression vectors containing libraries of these shuffled variants, one can select the variant with the protein or RNA activity that one desires. The phenotypes that can be assayed are protein activity in planta, as with the movement activities ofthe 30 kDa protein, enzyme activities in planta or in plant extracts or other surrogate features such as substrate or product accumulation. These data demonstrate the power of vims expression vectors to be effective tools for shuttling sequence variants into plants and allow the selection of genes encoding the desired altered property. This tool allows one to mine the hidden activities, enhance the isolated activities of enzymes or eliminate allosteric inhibition of enzyme activities. This could be applied to any plant gene or genes from other sources to optimize the activities desired for agronomic, pharmaceutical or developmental effects caused by altered genes. EXAMPLE 31 Composite clomng to facilitate cloning of libraries in vims vectors and/or their introduction into host cells for expression of sequences.
Vims vector clones could be integrated into lambda phage or cosmid clones to facilitate library constmction, clone representation, elimination of cell based amplification by direct transcription and archiving of individual clones. Likewise, cis-acting elements allowing for expression in plant cells or integration into plant DNA could be included into such plasmids to facilitate inoculation of DNA for direct expression, obviating the need for transcription of vector cDNA, or construction of dedicated plant transformation vectors.
Vims vectors are tools housing libraries of sequences that can be screened for novel gene discovery. However libraries are often first constructed in plasmid or phage shuttle vectors before excising and introduction into vims vectors. Likewise, sequences can be screened in hosts using vims vectors, but must be subcloned into appropriate eukaryotic expression vectors before the trait identified in the vector transfected host will become a stable trait in the host by gene integration. Additional hurdles to overcome are: (1) constmction of libraries to most efficiently represent the clones in a cDNA library, (2) obtaining maximal fransfection efficiency into bacterial hosts (if used), and (3) archiving DNA samples without the need for fransfection into bacteria and transcription of ligated DNA. The integration of a virus vector into a cosmid clone, or lambda phage itself, (both termed phagmids here) could allow a multi-puφose vector to be generated to be both the repository of primary generated library sequences, source for ligation transcriptions, high efficiency bacterial fransfection and direct expression in higher eukaryotic hosts. Using normal cloning procedures, the 5' half of the vims vector to be inserted into one arm of a phagmid DNA clone with a non symmetrical restriction (such as BstXI: CCANNNNNNTGG) containing a unique sticky sequence (the N's). The 3' part ofthe vector will be inserted into another arm with a non-symmetrical restriction (such as BstXI: CCANNNNNNTGG) containing a second unique sticky sequence (the N's). The vector would be split at the determined restriction site (e.g. BstXI) within the site for foreign sequence expression in the vims vector. The 5'-end ofthe vims cDNA would be appropriately fused to a promoter for in vitro transcription (e.g. T7) or for in vivo expression (e.g. an appropriate higher eukaryotic RNA polymerase promoter). The 3'-end ofthe vims cDNA would terminate with a ribozyme for in vitro cleavage and/or a 3' terminator from a gene from host organism to lead to in vivo termination of transcription. Left and right T-DNA borders that promote the integration of sequences in between into plant genomic DNA, could flank the promoter and terminator sequences. At the terminus of each arm would be cos sequences to allow complete regeneration ofthe phagmid upon ligation in the presence of foreign library DNA containing the two unique sticky sequences at each respective termini. These library DNA fragments could be generated by PCR amplification using determined restriction sites (e.g., BstXI) to generate unique sticky ends complementary to those in the phagmid-vector arms integrated in the PCR primers. The 5' and 3' primers would each have unique recognition sequences in the BstXI restriction site (the N's) that would match the sticky sites on the respective sides ofthe vims vector. The sites could be switched on a second set of PCR primers to allow the amplification of DNA to be ligated into the phagmid-viral vector arms in the "sense" and "anti-sense" orientation. These constructions would allow for efficient in vitro ligation and use of cmde ligation mix as template for E. coli transformation, plant transformation, in vitro lambda packaging to 109 pfu/mcg or in vitro transcription. In this manner, the vector and flexibility for its screening could be maximized. These tools we can directly build complex libraries into and simultaneously be the enabling tool for analysis.
EXAMPLE 32 Improvement of Host Plant Performance with a Viral Expression System via Interspecific Hybridization.
The goal of this example is to improve the host plant by introducing foreign genetic material via interspecific hybridization. Host plant species vary in their ability to support expression of a sequence inserted into a plant viral vector. Some species support expression to a high specific activity, such as
Nicotiana benthamiana, but have relatively low biomass. Other species, such as N tabacum, have high biomass and/or other desirable properties for growth in the field, but have a relatively low specific activity ofthe expressed sequence. In this example, the desirable properties of two or more species are combined by making an interspecific hybrid by standard methods. After chromosome doubling to restore fertility, the primary hybrid may have suitable properties, or it may be desirable to backcross toward either parent selecting or screening at each generation for the desired property(ies) ofthe non-recunent parent, for example, infrogress the superior biomass of N tabacum into N benthamiana, or infrogress the superior viral vector performance of N benthamiana into N tabacum, among others. A viral vector expressing the green fluorescent protein (GFP) is one example of a useful tool for screening the level of systemic expression in candidate hybrid plants.
Many hybrids are possible, especially within the genus Nicotiana. For example, we have hybrids between N benthamiana and N tabacum. N. benthamiana and N. clevelandii, N. benthamiana .and N excelsior, N. benthamiana and N africana, N. clevelandii and N africana, N. umbratica and N africana, N. umbratica and N otophora, and N. bigelovii and N. excelsior. In addition, hybrids with more than two parents are possible. For example, we have N benthamiana! tabacuml africana and N benthamiana! clevelandii! tabacum.
EXAMPLE 33 Libraries of heterologous nucleic acid sequences in DHSPES constructs generated in a restriction-endonuclease-free and cell-free manner.
The goal of this example is to generate libraries of DHSPES constmcts containing heterologous sequences while avoiding the potential problems associated with the use of restriction enzymes for preparation ofthe inserted nucleic acids and with passage ofthe resultant constructions through E. coli. Normally, DNA fragments are generated by restriction endonuclease treatment and ligated into a DHSPES vector with compatible termini. However, when a complex population of DNA molecules, such as that found in a cDNA library, is used as starting material and a given restriction endonuclease is used to treat the insert DNA to render the appropriate termini for ligation to the cloning vector, the recognition sequence for that enzyme will occur with a certain frequency within the population, rendering the molecule bearing that sequence truncated after digestion. Passage of certain plasmid-based viral clones through E. coli has been observed to result in instability ofthe plasmid a certain proportion ofthe time. The cause of this instability is unclear, but may be related to insert size, sequence or to toxicity resulting from expression ofthe gene from cryptic promoter sequences present in the DHSPES viral sequences.
In order to avoid the above-mentioned problems, libraries of DHSPES constructs harboring cDNA molecules in a restriction endonuclease-free and E. co/t-free manner are constructed. Such a system will permit the inclusion into DHSPES constmcts of molecules that harbor inconvenient internal restriction sites. This method of "cell-free cloning" will also allow us to obtain DHSPES- derived vimses containing genes that are not well tolerated by E. coli in traditional cloning approaches.
In essence, cell-free cloning will entail the in vitro assembly of partial viral sequences with a DNA fragment into a configuration that that will yield infectious viral RNA molecules upon in vitro transcription. In one system, the viral sequences are divided into two "arms"; the left arm and the right arm. The left arm encodes a T7 RNA polymerase promoter followed by viral sequences encoding replicase followed by the gene encoding movement protein and the subgenomic promoter that controls expression ofthe desired gene. The right arm will contain sequences ofthe viral genome that encode the viral coat protein and the sequences that control its expression, the viral 3' untranslated region, and a ribozyme sequence for generating the desired 3' terminus on the transcribed molecules. A schematic diagram for cell free cloning is shown in FIGURE 28. The left arm and right arm will each have separate asymmetric (non- palindromic, thus self-incompatible) overhangs that will permit the two arms to be brought together by an intervening insert that is derived either from PCR product, cDNA reaction, or elsewhere. The insert will have termini that are compatible with both the left and right arms. The termini of these molecules are such that ligation of left and right arms to insert will ensure assembly into the proper configuration to yield infectious viral transcripts. The sequence contained in the insert will then be in the conect orientation and genomic position to permit its expression from the vims in plant cells. Specifically, the right arm will be synthesized by PCR and will have a biotin group incoφorated into the reverse (3') primer. The resulting biotinylated PCR product representing the right arm will then be immobilized upon streptavidin paramagnetic beads. Treatment ofthe DNA with T4 DNA polymerase and a single dNTP (in the present case, dGTP) will give a 5' overhang as a result ofthe exonuclease activity ofthe polymerase. The insert DNA, being PCR product, restriction fragment, or cDNA will be treated with T4 DNA polymerase with a single dNTP to generate 5' overhangs on its termini; the 3' of which is compatible with the 5' ofthe right arm. The 5' terminus ofthe insert DNA will be compatible with the left .arm 3' terminus that had been generated similarly.
The ligation reactions in the assembly ofthe vims on the paramagnetic beads will be carried out sequentially, with the insert being ligated to the immobilized right .arm first, followed by washing ofthe bead complex and then ligation ofthe left arm. Following the subsequent wash, in vitro transcription will be carried out to generate infectious RNA transcripts.
In this cell-free manner, replication-competent vimses expressing the GFP gene were constmcted. Using PCR, a biotinylated right arm was prepared. Following immobilization on avidincoated paramagnetic beads and treatment with T4 DNA polymerase and a single nucleotide (dGTP) to generate the appropriate 5' overhang, the right arm was ligated to a PCR product encoding the GFP gene that had been treated with T4 DNA polymerase .and dCTP to render a compatible 5' overhang. A DNA fragment comprising the left arm ofthe vims was then ligated to the resulting DNA-bead complex to generate a full-length vims clone that was subsequently used as template for in vitro transcription. After each step of enzymatic manipulation ofthe magnetic bead-bound DNA, DNA-bead complexes were washed by sedimenting them in a magnetic field and resuspending them in the appropriate buffer. In addition, after each manipulation, aliquots were taken for analysis to confirm that the desired reaction had occurred. The infectious RNA products ofthe franscription reaction were introduced into protoplasts of tobacco cell suspension cultures. At 12-18 hours after protoplast infection, fluorescence emitted by the GFP encoded by the virus clone was observed in a majority ofthe cells confirming that the RNA transcript derived from the DNA-bead complexes was infectious, and hence, that the sequentially assembled vims-encoding DNA molecules had been assembled in the desired configuration so as to permit vims replication and expression ofthe inserted foreign gene sequences.
EXAMPLE 34 Use of undefined sequences to increase the genetic stability of foreign genes in vims expression vectors.
Insertion of foreign gene sequences into vims expression vectors can result in anangements of sequences that interfere with normal vims function and thereby, establish a selection landscape that favors the genetic deletion ofthe foreign sequence. Such events are adverse to the use of such expression vectors to stably express gene sequences systemically in plants. A method that would allow sequences to be identified that may "insulate" functional virus sequences from the potential adverse effects of insertion of foreign gene sequences would greatly augment the expression potential of vims expression vectors. In addition, identification of such "insulating" sequences that simultaneously enhanced the translation ofthe foreign gene product or the stability ofthe mRNA encoding the foreign gene would be quite helpful. The example below demonstrates how libraries of random sequences can be introduced into vims vectors flanking foreign gene sequences. Upon analysis, a subset of introduced sequences allowed a foreign gene sequence that was previously prone to genetic deletion to remain stabily in the vims vectors upon serial passage. The use of undefined sequences to enhance the stability of foreign gene sequences can be extrapolated to the use of undefined sequences to enhance the translation of foreign genes and the stability of coding mRNAs by those skilled in the art.
The genetic stability ofthe human growth hormone gene (hGH) or an Ubiquitin fusion to hGH (Ubiq hGH) in the tob-amo virus expression vector p30B is rather poor, such that no stable vims preparations could not be made to serially passage infection onto plants and detect the expression of hGH recombinant protein. The site of gene insertion is following a Pad site (underlined) in the vims vector. This sequence is known as a leader sequence and has been derived from the native leader and coding region from the native TMV Ul coat protein gene. In this leader, the normal coat protein ATG has been mutated to a Aga sequence (underlined in
GTTTTAAATAgaTCTTACAGTATCACTACTCCATCTCAGTTCGTGTTCTT GTCATTAATTAA ATG ... (hGH GENE)). A particular subset of this leader sequence (TCTTACAGTATCACTACTCCATCTCAGTTCGTGTTCTTGTCA) has been known to increase genetic stability and gene expression when compared with vims construct lacking the leader sequence. The start site of subgenomic RNA synthesis is found at the GTTTT... An oligonucleotide RL-1 (GTTTTAAATAGATCTTAC N(20)TTAATTAAGGCC ) was used with a primer homologous to the NcollApal region ofthe TMV genome to amplify a portion ofthe TMV movement protein. The population of sequences were cloned into the Apal and Pad sites ofthe p30B hGH vector. Vectors containing the undefined sequences leading the hGH genes were transcribed and inoculated onto Nicotiana benthamiana plants. 14 days post inoculation, systemic leaves were ground .and the plant extracts were inoculated onto a second set of plants.
Following the onset of vims symptoms in the second set of plants, Western blot analysis was used to detect if hGH or Ubiq-hGH fusions were present in the serially inocuated plants. Several variants containing novel sequences in the non- translated leader sequence were identified that were associated with vimses that were genetically stable and allowed successful passage of hGH expression on plants inoculated with serially passaged vims. Whereas the parental controls, p30B hGH and p30B Ubiq-hGH, did not. Vimses derived from undefined sequence library, p30B hGH vims #2 and #5, were shown to genetically stable upon virion passage and likewise, p30B Ubiq hGH #6 showed expression ofthe Ubiq-hGH expression upon serial virion passage. Again, this property was never observed in each ofthe starting vimses p30B hGH and p30B Ubiq hGH. The sequence sunounding the leader was determined and compared with that ofthe control vims vectors. p30B #5 HGH GTTTTAAATAGATCTTAC--TATAACATGAATAGTCATCG p30B #5 HGH GTTTTAAATAGATCTTAC--TATACCATGAATTAGTACCG p30B #6 UbiqHGH GΓΓTTAAATAGATCTTAC--ACTCGGTTGAGATAAAACTAAACTA p30B #2 HGH GTTTTAAATAGATCTTAC--TCCGACGTATAGTCACCACG p30B HGH GTTTTAAATAGATCTTAC--
AGTATCACTACTCCATCTCAGTTCGTGTTCT p30BUbiqHGH GTTTTAAATAGATCTTAC--
AGTATCACTACTCCATCTCAGTTCGTGTTCT p30B #5 HGH TTAATTAAAATGGGA— p30B #5 HGH TTATTTAAAATGGGAAAAATGGCTTCTCTATTTGCCACATTTTTA p30B #6 UbiqHGH
TT-AATTAA-^-ATGGGAAAAATGGCTCTCTTATTGGCCCCATTTTTA p30B #2 HGH TTAATTA-AAAATGCAGATTTTCGTCAAGACTTTGACCGGG p30B HGH
TGTCATTAATTA-AAATGGGAAAAATGGCTTCTCTATTTGCCACATTTTTA p30B UbiqHGH TGTCATT-^TTAAAATGCAGATTTTCGTCAAGACTTTGACCGGT
****************
* indicates sequences that are identical in all viruses.
— indicates end of defined primer and start of N(20) region ofthe oligonucleotide that was introduced during PCR amplification.
The result was that undefined leader constmcts transcribed were passageable as vims, while the parental 3 OB vectors with native leaders were not.
The nature ofthe random leaders indicates that each are unique and that multiple solutions are readily available to solve RNA based stability problems. Likewise, such random sequence introductions could also increase the translational efficiency.
In order to select for undefined sequences that may increase the translational efficiency of foreign genes or increases the stability ofthe mRNA encoding the foreign gene derived from a vims expression vector, a selectable marker could be used to discover which ofthe undefined sequences yield the desired function. The amount ofthe GFP protein conelates with the level of fluorescence seen under long wave UV light and the amount of herbicide resistance gene product conelates with survival of plant cells or plants upon treatment with the herbicide. Therefore introduction of undefined sequences sunounding the GFP or herbicide resistance genes and then screening for individual vimses that either express the greatest level of fluorescence or cells that survive the highest amount of herbicide. In this manner the cells with the vimses with the highest foreign gene activity would be then purified and characterized by sequencing and more thorough analysis such as Northern and Western blotting to access the stability ofthe mRNA and the abundance ofthe foreign gene of interest. EXAMPLE 35 Method for using reporter genes fused to regulated or constitutive promoters as a surrogate marker for identifying genes impacting gene regulation.
In this example we will show 1) a method to construct transgenic hosts expressing a reporter gene under the control of various promoter types; 2) means to use such hosts to identify genes from libraries expressed in vims expression vectors that alter gene regulation.
The initial constmction ofthe reporter gene expression cassette will require identification ofthe appropriate reporter gene, which could include GFP (fluorescent in live plants under long wave UV light), GUS (fluorescent and color-based assay in detected tissue), herbicide resistance genes (live or death phenotype upon treatment with herbicide) or other scoreable gene products known to the art. Promoter sequences can express RNA in constitutive or induced conditions. An example of a regulated promoter would be that of tomato or potato protease inhibitor type I gene (Graham, et al, J. Biol. Chem. 260:6555- 6560 (1985)). These promoters are up regulated in the presence of jasmonic acid or herbivore damage to plant tissues. Constitutive promoters are readily identifiable from anyone skilled in the art inspecting the relevant literature. Such combinations of inducible or constitutive promoters using appropriate reporter genes would be integrated into binary plant transformation vectors, fransformed into Agrobacterium and transformed into Nicotiana benthamiana leaf disks. Upon identification ofthe appropriate gene construct in regenerated tissues, the primary transformants would be selfed to obtain the first stable line of plants for assay. Libraries of cDNAs, full-length for gene overexpression or gene fragments for sense or anti-sense based gene suppression, would be ligated into vims expression vectors by normal molecular biology techniques. These libraries would be prepared for inoculation by the methods described in this patent application. Once inoculated, hosts with inducible promoters fused to reporter genes, maintained in uninduced state, would be monitored for abenant expression ofthe reporter gene in tissue that contains replicating vims. If hosts containing constitutive promoter fusions to reporter genes .are used, monitoring for hyper- or hypo-expression conditions ofthe reporter gene would be the focus. In this manner, genes that augment pathways that induce or upregulate the activity of certain promoters could be identified by following the surrogate marker of reporter gene expression. Conversely, gene that down-regulate or halt reporter gene expression could be identified as products that negatively effect the activities ofthe promoter or signaling pathway to which it is responsive. Vims vectors containing sequences that effected reporter gene expression by overexpression or suppression positive or negative regulatory factors can be isolated, and foreign gene contained may be sequenced and analyzed by bioinformatic methods.
EXAMPLE 36 Method to induce the expression of alternative splicing variants to discover biological effects in host organisms and to use said host organism as a source for novel cDNA libraries enriched for alternatively spliced variants of genes. Transcription of nuclear genes in higher eukaryotic organisms results in a primary RNA transcript that contains both coding (exon) and non-coding (intron) information. A cmcial step in RNA maturation before exporting to the cytosol for translation is the splicing of introns from the primary transcript and the rendering of contiguous exons for coding ofthe desired product. It is interesting to note that, although, splicing may occur in defined sites constitutively in certain gene, many genes can be spliced to produce multiple protein products, each with separate functions. The process of splicing out different sets of intron .and splicing together of different anay and order of exons for the same primary transcript is known is alternative splicing. This is powerful way genetic economy can be achieved in higher organisms to encode for multiple functions in a single gene cistron. The events of alternative splicing are regulated by families of small nuclear RNAs .and associated proteins. These factors are responsible for the choice of splice sites used in primary RNA transcript and the nature ofthe mature mRNA reconstmcted from the splicing process. Many alternative splicing events produce rare or tissue specific RNAs that result in the translation of specific protein products that have unique activities. The most famous of which is the alternative splicing of a Drosophila transcription factor results in the sex determination ofthe developing embryo. For a reference describing general alternative splicing, see Lopez, Ann. Rev. Genetics, 32 (1998), in press.
Since alternatively spliced mRNAs encode for proteins with differing functions, it would be interesting to investigate hosts that are deficient in these factors or hosts that no longer express such factors. It is difficult to accurately and effectively represent this diversity in standard cDNA libraries constructed from unaltered eukaryotic hosts. However, the use of vims expression vectors to overexpress or suppress the expression of factors involved in the splicing process will make it possible to increase the proportion of alternatively spliced mRNA in the host organism. Focused gene libraries will be constructed for the overexpression and the sense or antisense suppression of factors with potential and actual activities in the RNA splicing process in plants. Gene families can include the SF2/ASF-like group of splicing factors (Lopato et al, PNAS 92:7672- 7676 (1995)), the RS-rich family of splicing factors (Lapato et al, The Plant Cell :2255-2264 (1996)) and other splicing families that have been identified in the literature in lower or upper eukaryotic systems. The gene libraries will be subcloned into vims expression vectors and vims libraries will be inoculated as individuals or pools onto plants or plant cells. Once individual or groups of splicing factors are overexpressed or have their expression suppressed in plant cells, novel forms of splicing will occur due to the role of these proteins in alternative splicing of many transcription factors, splicing factors or other gene products. The high level of expression achieved by vims expression vectors and their ability to infect most cell types in plants should raise the overall level of abenantly expressed mRNAs in the plant. The transfected plants will be used as the starting point for the isolation of poly A(+) RNA for the constmction of cDNAs enriched for alternatively spliced genes. The alterations in the alternative splicing could be the splicing of a greater or lesser number of introns from the primary mRNA than normally occurs in non-transfected plants. These enriched cDNA libraries can now be cloned into vims expression vectors and the functions of these novel spliced forms of genes can be assayed on plants transfected with these vector libraries.
In this example, one can discover the plietropic functions of factors effecting alternative or normal splicing functions in plants from primary directed vims libraries with original splicing factor genes, or from vims libraries derived from plants containing induced novel spliced mRNAs.
Similar methods could be to derive novel cDNA libraries by using vims vectors to express factors responsible for transcriptional regulation of genes in plants. In this example, targeted cloning of transcription factor families would be ligated into vims expression vectors. Families could include homeodomain, Zn finger, leucine zipper and other transcription factor families appearing in pro or eukaryotic genomes. Schwechheimer, et al., Ann. Rev. Plant Phys. and Plant Mol. Biol. 49 (1998), in press. The gene libraries will be sub-cloned into vims expression vectors and vims libraries will be inoculated as individuals or pools onto plants or plant cells. Once individual or groups of franscription factors are overexpressed or have their expression suppressed in plant cells or plants, novel patterns of gene expression patterns will be induced. This will result in the appearance of a higher proportion of cDNAs normally present at low levels in the plant tissue or that are normally developmentally regulated. However, with the high level of expression achieved by vims expression vectors and their ability to infect most cell types in plants should induce these tissue specific cDNAs in abenant cell types and at much higher than normal levels. The transfected plants will be used as the starting point for the isolation of poly A(+) RNA for the constmction of cDNAs enriched for alternatively lowly expressed or developmentally expressed cDNAs. These cDNAs would be used to construct expression or gene suppression libraries that will be enriched for these rare or abenantly expressed cDNAs. These enriched cDNA libraries can now be cloned into vims expression vectors and the functions of these novel spliced forms of genes can be assayed on plants transfected with these vector libraries.
Although the invention has been described with reference to the presently prefened embodiments, it should be understood that various modifications can be made without departing from the spirit ofthe invention. It is further understood that the instant invention applies to all plus stranded RNA viral vectors.

Claims

CLAIMSWhat is claimed is:
1. A method of determining the function of a nucleic acid sequence in an organism comprising the steps of: (a) introducing the nucleic acid sequence into a host organism by way of a viral nucleic acid capable of expressing the nucleic acid sequence; and
(b) observing changes resulting from the expression ofthe nucleic acid sequence in the organism.
2. The method of claim 1 further comprising the step of (c) recovering the expressed product ofthe nucleic acid sequence.
3. The method of claim 2 wherein said expressed product is selected from the group consisting of peptides, proteins, polyproteins, enzymes, ribozymes, antibodies and antigens.
4. A functional RNA, DNA, or amino acid sequence molecule recovered according to the method of claim 1 or 2.
5. The method of claim 1 wherein said observing comprises complementation analysis.
6. The method of claim 1 wherein said observing comprises analyzing the biochemical alterations in the accumulation of subsfrates or products from enzymatic reactions.
7. The method of claim 1 wherein said observing comprises determining phenotypic changes in the host transfected with a nucleic acid.
8. The method of claim 1 wherein said observing comprises noting changes in one or more biochemical pathways in the host transfected with the nucleic acid.
Ill
9. The method of claim 6 wherein analyzing the biochemical alterations is performed by a method selected from the group consisting of MALDI-TOF, LC/MS, GC/MS, two-dimensional IEF/SDS-PAGE and ELISA.
10. The method of claim 1 wherein said observing comprises observing inhibition of endogenous gene expression in the cytoplasm of cells resulting from expression ofthe nucleic acid.
11. The method of claim 1 wherein said observing comprises comparing an RNA expression profile in the nucleus of a cell contaimng the selected nucleic acid with the RNA expression profile in a cell which does not contain the selected nucleotide sequence.
12. The method of claim 1 wherein said observing comprises comparing an RNA expression profile in the cytoplasm of a cell containing the selected nucleic acid with the RNA expression profile in a cell which does not contain the selected nucleotide sequence.
13. The method of claim 1 wherein said observing comprises comparing an RNA expression profile in an organelle of a cell containing the selected nucleic acid with the RNA expression profile in a cell which does not contain the selected nucleotide sequence.
14. The method of claim 1 wherein said observing comprises comparing the protein expression profile in the cytoplasm of a cell containing the selected nucleic acid sequence with the protein expression profile in the cytoplasm of a cell which does not contain the selected nucleotide sequence.
15. The method of claim 1 wherein said viral nucleic acid comprises a native plant viral subgenomic promoter, a plant viral coat protein coding sequence and at least one non-native nucleic acid sequence.
16. The method of claim 1 wherein said viral nucleic acid comprises a native plant viral subgenomic promoter, at least one non-native plant viral subgenomic promoter and a plant viral coat protein coding sequence wherein said native plant viral subgenomic promoter initiates franscription ofthe plant viral coat protein sequence and said non-native plant viral subgenomic promoter initiates franscription of an operably joined nucleic acid sequence in a host plant and wherein said recombinant plant viral nucleic acid is expressed locally or systemically in a host plant.
17. The method of claim 1 wherein said viral nucleic acid comprises a native plant viral subgenomic promoter, at least one non-native plant viral subgenomic promoter and a plant viral coat protein coding sequence, wherein said native plant viral subgenomic promoter initiates transcription of an operably joined nucleic acid sequence and said non-native plant viral subgenomic promoter initiates transcription ofthe plant viral coat protein sequence in a host plant and wherein said viral nucleic acid and said recombinant plant viral nucleic acid is expressed locally or systemically in a host plant.
18. The method of claim 1 wherein said viral nucleic acid contains a nucleic acid sequence of interest and wherein said nucleic acid sequence of interest is transcribed with the genomic DNA or RNA ofthe recombinant plant viral nucleic acid.
19. The method of claim 1 wherein the nucleic acid sequence comprises all or a part of the cDNA library of an organism.
20. The method of claim 19 wherein said cDNA library is normalized or subtracted.
21. The method of claim 1 wherein said viral nucleic acid is selected from the group consisting of a potyvirus, a tobamoviras, a bromovirus, a rice necrosis vims, a geminivirus, a rhinovirus, a poliovims, a simian vims, a polyomaviras and an adenovirus.
22. The method of claim 1 wherein said viral nucleic acid comprises a nucleic acid fusion of a first nucleic acid sequence encoding a viral coat protein and a second nucleic acid sequence.
23. The method of claim 1 wherein the viral nucleic acid is present in a transgenic plant.
24. The method of claim 1 wherein said viral nucleic acid comprises a nucleic acid sequence of interest and at least a part of a viral genome.
25. The method of claim 21 wherein the nucleic acid sequence is present in the viral genomic DNA or RNA of a viral nucleic acid.
26. The method of claim 1 wherein the host organism is a plant or a plant cell.
27. The method of claim 1 wherein the host organism is selected from the group consisting of an animal, an animal cell, a plant, a plant cell, a bacterium and a fungus.
28. The method of claim 1 wherein the host organism is a cell or tissue derived from tissues, organs or organisms subjected to treatment with drags or chemicals or which have been infected with disease-causing agents.
29. A method of silencing one or more endogenous genes in an organism, a cell or a tissue comprising the step of introducing a nucleic acid into the organism by way of a viral nucleic acid suitable to express the nucleic acid wherein said nucleic acid is antisense or positive-sense to said endogenous genes.
30. The method of claim 29 wherein the nucleic acid is operable to silence a gene or a multigene family.
31. The method of claim 29 or 30 further comprising the step of recovering primary or secondary metabolites whose expression is affected by the gene silencing.
32. A functional RNA, DNA or amino acid-containing molecule recovered according to the process of any one of claims 29, 30 or 31.
33. The method of claim 29 wherein said host is selected from the group consisting of a plant, an animal, a bacterium and a yeast.
34. The method of claim 29 wherein the host organism is a cell or tissue derived from tissues, organs or organisms subjected to freatment with drags or chemicals or which have been infected with disease-causing agents.
35. The method of claim 29 wherein said viral nucleic acid is suitable to produce local or systemic expression of a non-native nucleic acid and comprises a native plant viral subgenomic promoter, at least one non-native plant viral subgenomic promoter and a plant viral coat protein coding sequence, wherein said native plant viral subgenomic promoter initiates franscription of an operably joined nucleic acid sequence and said non-native plant viral subgenomic promoter initiates transcription ofthe plant viral coat protein sequence in a host plant and wherein said recombinant plant viral nucleic acid is expressed locally or systemically in a host plant.
36. The method of claim 29 wherein said viral nucleic acid comprises a native plant viral subgenomic promoter, at least one non-native plant viral subgenomic promoter and a plant viral coat protein coding sequence wherein said native plant viral subgenomic promoter initiates franscription ofthe plant viral coat protein sequence and said non-native plant viral subgenomic promoter initiates transcription of an operably joined nucleic acid sequence in a host plant and wherein said recombinant plant viral nucleic acid is expressed locally or systemically in a host plant.
37. The method of claim 29 wherein said viral nucleic acid is selected from the group consisting of a potyvims, a tobamoviras, a bromovirus, a rice necrosis virus, a geminivirus, a rhinovirus, a poliovirus, a simian vims, a polyomaviras and an adenovirus.
38. A method of determining the function of a nucleic acid sequence in an organism comprising the steps of:
(a) cloning one or more expressed sequence tag cDNAs into a viral nucleic acid suitable to produce expression of one or more expressed sequence tag cDNAs in the organism;
(b) transfecting the organism with said viral nucleic acid comprising said expressed sequence tag cDNAs; and
(c) observing changes resulting from the expression ofthe nucleic acid sequence in said organism.
39. The method of claim 38 further comprising the step of (d) recovering the expressed product ofthe nucleic acid sequence.
40. A functional DNA, RNA or amino acid molecule recovered according to the method of either claim 38 or 39.
41. The method of claim 38 wherein said observing comprises complementation analysis.
42. The method of claim 38 wherein said observing comprises analyzing the biochemical alterations in the accumulation of substrates or products from enzymatic reactions.
43. The method of claim 38 wherein said observing comprises determining phenotypic changes in the host transfected with a nucleic acid.
44. The method of claim 38 wherein said observing comprises noting changes in one or more biochemical pathways in the host transfected with the nucleic acid.
45. The method of claim 42 wherein analyzing the biochemical alterations is performed by a method selected from the group consisting of MALDI-TOF, LC/MS, GC/MS, two-dimensional IEF/SDS-PAGE and ELISA.
46. The method of claim 38 wherein said observing comprises observing inhibition of endogenous gene expression in the cytoplasm of cells resulting from expression ofthe nucleic acid.
47. The method of claim 38 wherein said observing comprises comparing an RNA expression profile in the nucleus of a cell containing the selected nucleic acid with the RNA expression profile in a cell which does not contain the selected nucleotide sequence.
48. The method of claim 38 wherein said observing comprises comparing an RNA expression profile in an organelle of a cell containing the selected nucleic acid with the RNA expression profile in a cell which does not contain the selected nucleotide sequence.
49. The method of claim 38 wherein said observing comprises comparing the protein expression profile in the cytoplasm of a cell containing the selected nucleic acid sequence with the protein expression profile in the cytoplasm of a cell which does not contain the selected nucleotide sequence.
50. The method of claim 38 wherein said observing comprises comparing the RNA expression profile in the cytoplasm of a cell containing the selected nucleic acid sequence with the RNA expression profile in the cytoplasm of a cell which does not contain the selected nucleic acid sequence.
51. A method of determining the function of a nucleic acid sequence in an organism comprising the steps of:
(a) altering the genome of a host organism;
(b) introducing a nucleic acid sequence of interest; and (c) observing changes resulting from expression ofthe nucleic acid sequence in the organism.
52. The method of claim 51 further comprising the step of (d) recovering the expressed product ofthe nucleic acid sequence.
53. A functional DNA, RNA or amino acid sequence recovered according to the method of either claim 51 or 52.
54. The method according to claim 51 wherein mutating the genome of a host organism is performed by introducing one or more transposon DNA sequences into the organism.
55. The method according to claim 51 wherein said observing comprises complementation analysis.
56. A method for identifying the function of a nucleic acid sequence in a transgenic plant comprising the steps of:
(a) introducing said nucleic acid into said transgenic plant; and
(b) observing changes resulting from expression ofthe nucleic acid sequence in the transgenic plant.
57. The method of claim 56 further comprising the step of (c) recovering the expressed product ofthe nucleic acid sequence.
58. A functional DNA, RNA or amino acid sequence molecule recovered according to the method of either claim 56 or 57.
59. The method of claim 56 wherein the transgenic plant comprises at least one non-native transposon DNA sequence.
60. A method for identifying a gene function in a transgenic plant carrying a conditional lethal mutation in a gene comprising the steps of:
(a) growing the plant under first permissive conditions;
(b) exposing the plant from step (a) to restrictive conditions for a period of time of at least about one growth cycle;
(c) shifting the plant from step (b) to second permissive conditions for a period of time of at least about one growth cycle; and
(d) selecting a plant having a lethal mutation, thereby identifying a plant carrying a lethal mutation that is sensitive to the restrictive condition and essential for survival ofthe organism.
61. The method of claim 60, further comprising after step (d), the step of (e) complementing a transgenic plant carrying a recessive or dominant conditional lethal mutation by transfecting with a viral vector containing a functional copy ofthe mutated gene.
62. The method of claim 60, further comprising after step (e), the step of (f) isolating from said viral vector a gene conecting or complementing said mutation.
63. The method of claim 62, further comprising after the step of isolating said gene, a step selected from (i) identifying the function of said gene, (ii) identifying the product expressed by said gene, and (iii) sequencing said gene.
64. The method of claim 60 in which the first permissive conditions include a complete growth medium for the plant tissue, plant cell or plant organ.
65. The method of claim 60 in which the first permissive conditions include a growth medium at low osmotic strength.
66. The method of claim 60 in which the first permissive conditions include a temperature between about 5 and 15┬░C below the optimal growth temperature for the wild type.
67. The method of claim 60 in which the restrictive conditions include a temperature between the optimal growth temperature for the organism and at least about 15┬░C above the optimal growth temperature for the organism.
68. The method of claim 60 in which the second permissive conditions are substantially the same as the first permissive conditions.
69. The method of claim 60 in which the plant cells in growing step (a) are replica plated plant cells on plant leaf disks.
70. The method of claim 60 in which the period of time in step (c) is equivalent to at least one growth cycle.
71. The method of claim 60 in which the plants from step (a) are selected from the group consisting of monocotyledons and dicotyledons.
72. The method of claim 60 in which the plants from step (a) have been mutagenized by insertion mutagenesis with T-DNA or transposon nucleic acid sequences.
73. The method of claim 72 in which the plants have been mutagenized with a mutagen selected from the group consisting of nucleic acid alkylating agents, intercalating agents, ionizing radiation, heat, and sound.
74. The method of claim 73 wherein said alkylating and intercalating agents are selected from the group consisting of methanesulfonate, methyl methanesulfonate, methylnitrosoguanidine, 4-nitroquinoline-l -oxide, 2- aminopurine, 5-bromouracil, ICR 191 and other acridine derivatives, ethidium bromide, nitrous acid, and N-methyl-N'-nitroso-N-nitroguanidine.
75. A method for identifying a gene product target of an antimicrobial drag, herbicide, pesticide or fungicide compound comprising the steps of: (a) growing plants under first permissive conditions; (b) exposing the plants from step (a) to restrictive conditions for a period of time equivalent to at least one growth cycle;
(c) shifting the plants from step (b) to second permissive conditions for a period of time equivalent to at least one growth cycle;
(d) selecting a plant having a gene carrying a conditional lethal mutation; and
(e) identifying the gene product conesponding to the conditional lethal mutation, thereby identifying a gene product target of a pesticidal or herbicidal compound.
76. The method of claim 75, wherein the restrictive conditions include changing the temperature and the conditional lethal mutation is a temperature- sensitive lethal mutation.
77. The method of claim 75, wherein the method includes at least 50 plants in steps (a)-(c).
78. A method for constructing an infectious viral vector comprising the step of inserting one or more nucleotides between the transcription start site of a promoter sequence of a viral nucleic acid and the cDNA start site ofthe viral nucleic acid.
79. The method of claim 78 wherein a single nucleotide is inserted.
80. The method of claim 78 wherein two nucleotides are inserted.
81. The method of claim 78 wherein three nucleotides are inserted.
82. The method of claim 79 wherein the inserted single nucleotide is a G.
83. The method of claim 80 or 81 wherein the inserted nucleotides contain a G at the 5 '-end.
84. The method according to claim 78 wherein the inserted nucleotides are GNN, GTN, or multiples thereof.
85. The method of claim 78 wherein said viral nucleic acid is selected from the group consisting of a potyvims, a tobamovims, a bromovims, a rice necrosis vims, a rhinovims, and a poliovirus.
86. A functional RNA, DNA, or amino acid containing molecule recovered according to the method of claim 78.
87. A method for infecting a plant host comprising the step of transcribing a viral nucleic acid in the absence of a cap analogue.
88. A method for infecting a plant host comprising the step of transcribing a viral nucleic acid in the presence of a cap analogue.
89. The method of claim 87 wherein the viral nucleic acid contains a sequence encoding a movement protein.
90. The method of claim 87 wherein the viral nucleic acid contains additional one or more nucleotides between the transcription start site of a promoter sequence of said viral nucleic acid and the cDNA start site of said viral nucleic acid.
91. The method of claim 87 wherein said viral nucleic acid is selected from the group consisting of a potyvirus, a tobamovims, a bromovims, a rice necrosis vims, a rhinovims, and a poliovims.
92. A functional RNA, DNA, or .amino acid containing molecule recovered according to the method of claims 87 or 88.
93. A method for inhibiting an endogenous protease of a plant host comprising the step of treating the plant host with a compound which induces the production of an endogenous inhibitor of said protease.
94. The method according to claim 93 wherein said compound is jasmonic acid.
95. The method according to claim 93 wherein the treatment ofthe plant host with a compound results an increased representation of an exogenous nucleic acid or the protein product thereof.
96. The method of claim 93 wherein said viral nucleic acid is selected from the group consisting of a potyvims, a tobamoviras, a bromovims, a rice necrosis vims, a geminivirus, a rhinovirus, a poliovirus, a simian vims, a polyomaviras and an adenovirus.
97. A functional RNA, DNA, or amino acid containing molecule recovered according to the method of claim 93.
98. A method for improving the expression of a foreign sequence in a plant host comprising the step of interspecific hybridization.
99. A method for optimizing the function of a nucleic acid sequence in a plant host comprising the steps of:
(a) constructing a viral expression vector comprising a library containing variants ofthe nucleic acid sequence;
(b) introducing the viral nucleic acid library into the plant host by way of expressing variants ofthe nucleic acid sequence; and
(c) observing changes resulting from the expression ofthe variants ofthe nucleic acid sequence.
100. The method according to claim 99 further comprising the step of (d) sequencing one or more variants ofthe nucleic acid sequence.
101. The method according to claim 99 wherein said library containing random variants ofthe nucleic acid sequence.
102. The method according to claim 99 wherein said library containing known variants ofthe nucleic acid sequence.
103. The method according to claim 99 wherein said library is constructed by shuffling the nucleic acid.
104. The method according to claim 99 wherein the nucleic acid is non- native to the viral expression vector.
105. The method according to claim 99 wherein the nucleic acid is native to the viral expression vector.
106. The method according to claim 99 wherein the movement ability ofthe nucleic acid sequence is optimized.
107. The method according to claim 99 wherein the promoter ofthe nucleic acid sequence is optimized.
108. The method according to claim 99 wherein the host range ofthe nucleic acid sequence is optimized.
109. The method according to claim 99 wherein the signaling function ofthe nucleic acid sequence is optimized.
110. The method according to claim 99 wherein the replication stability ofthe nucleic acid sequence is optimized.
111. The method according to claim 99 wherein the translation efficiency ofthe nucleic acid sequence is optimized.
112. The method according to claim 99 wherein said library is constructed in a cell-free manner.
113. The method of claim 99 wherein said viral expression vector is selected from the group consisting of a potyvirus, a tobamovirus, a bromoviras, a rice necrosis virus, a geminivirus, a rhinovirus, a poliovirus, a simian virus, a polyomaviras and an adenovirus.
114. A functional RNA, DNA, or amino acid containing molecule recovered according to the method of any one of claims 99 or 100.
115. A method for increasing the representation of nucleic acid sequences in a viral expression library comprising the step of propagating said library in the absence of E. coli.
116. A method for determining the function of a gene wherein one or more reporter genes are fused to one or more constitutive or induced promoters in a viral expression vector.
117. A method for constructing a novel cDNA library from a plant host comprising the steps of:
(a) constructing a viral expression vector containing a non-native nucleic acid sequence;
(b) infecting the plant host with said viral expression vector; (c) measuring transcription or processing of one or more RNA molecules in the plant host; and
(d) synthesizing said cDNA library from said one or more RNA molecules.
EP99903208A 1998-01-16 1999-01-15 Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host Withdrawn EP1045899A2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US818698A 1998-01-16 1998-01-16
US8186 1998-01-16
PCT/US1999/001164 WO1999036516A2 (en) 1998-01-16 1999-01-15 Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host

Publications (1)

Publication Number Publication Date
EP1045899A2 true EP1045899A2 (en) 2000-10-25

Family

ID=21730222

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99903208A Withdrawn EP1045899A2 (en) 1998-01-16 1999-01-15 Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host

Country Status (6)

Country Link
EP (1) EP1045899A2 (en)
JP (1) JP2002508957A (en)
KR (1) KR20010040337A (en)
AU (1) AU761367B2 (en)
CA (1) CA2318662A1 (en)
WO (1) WO1999036516A2 (en)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6300133B1 (en) 1998-01-16 2001-10-09 Large Scale Biology Corporation RNA transformation vectors derived from an uncapped single-component RNA virus
US6468745B1 (en) 1998-01-16 2002-10-22 Large Scale Biology Corporation Method for expressing a library of nucleic acid sequence variants and selecting desired traits
US6300134B1 (en) 1998-01-16 2001-10-09 Large Scale Biology Corporation RNA transformation vectors derived from a single-component RNA virus and contain an intervening sequence between the cap and the 5′ end
US6700040B2 (en) * 1998-01-16 2004-03-02 Large Scale Biology Corporation Cytoplasmic gene inhibition or gene expression in transfected plants by a tobraviral vector
US6303848B1 (en) * 1998-01-16 2001-10-16 Large Scale Biology Corporation Method for conferring herbicide, pest, or disease resistance in plant hosts
CA2370515C (en) * 1999-04-20 2011-08-09 Aventis Cropscience N.V. Methods and means for delivering inhibitory rna to plants and applications thereof
AU6235000A (en) * 1999-07-21 2001-02-13 Large Scale Biology Corporation Method for enhancing rna or protein production using non-native 5' untranslated sequences in recombinant viral nucleic acids
AU6608300A (en) * 1999-07-21 2001-02-13 Large Scale Biology Corporation Method of correlating sequence function by transfecting a nucleic acid sequence of a donor organism into a plant host in an anti-sense or positive sense orientation
WO2001038513A2 (en) * 1999-11-23 2001-05-31 Maxygen, Inc. Shuffling of agrobacterium and viral genes, plasmids and genomes for improved plant transformation
AU2001234980A1 (en) 2000-02-09 2001-08-20 Genvec, Inc. Methods of preparing and using a viral vector library
AU2001248371A1 (en) * 2000-04-18 2001-10-30 Syngenta Participations Ag Amino acid:glyoxylate aminotransferase genes from plants and uses thereof
WO2002004628A2 (en) 2000-07-06 2002-01-17 Genvec, Inc. Method of identifying a binding partner of a gene product
US6830885B1 (en) 2000-08-18 2004-12-14 Phenogene Therapeutiques Inc. Nucleic acid molecule, method and kit for selecting a nucleic acid having a desired feature
DE10049587A1 (en) * 2000-10-06 2002-05-02 Icon Genetics Ag Vector system for plants
DE10061150A1 (en) 2000-12-08 2002-06-13 Icon Genetics Ag Methods and vectors for the production of transgenic plants
DE10102389A1 (en) 2001-01-19 2002-08-01 Icon Genetics Ag Methods and vectors for plastid transformation of higher plants
KR20030090626A (en) * 2001-01-25 2003-11-28 비륵시스 코포레이션 Methods and compositions for identifying gene function
US6800748B2 (en) 2001-01-25 2004-10-05 Large Scale Biology Corporation Cytoplasmic inhibition of gene expression and expression of a foreign protein in a monocot plant by a plant viral vector
JP2002238574A (en) * 2001-02-16 2002-08-27 Japan International Research Center For Agricultural Services Whole length genomic rna of papaya deformed leaf mosaic virus
DE10114209A1 (en) 2001-03-23 2002-12-05 Icon Genetics Ag Site-directed transformation using amplification vectors
DE10115507A1 (en) 2001-03-29 2002-10-10 Icon Genetics Ag Method for coding information in nucleic acids of a genetically modified organism
DE10121283B4 (en) 2001-04-30 2011-08-11 Icon Genetics GmbH, 80333 Methods and vectors for amplification or expression of desired nucleic acid sequences in plants
DE10132780A1 (en) 2001-07-06 2003-01-16 Icon Genetics Ag Plastid gene expression via autonomously replicating vectors
US7635798B2 (en) 2001-08-31 2009-12-22 Dow Agrosciences, Llc Nucleic acid compositions conferring altered metabolic characteristics
US7608758B2 (en) 2001-08-31 2009-10-27 Riken Plant system for comprehensive gene function analysis with the use of full-length cDNA
DE10143238A1 (en) 2001-09-04 2003-03-20 Icon Genetics Ag Identification of eukaryotic internal ribosome entry sites (IRES) elements
DE10143237A1 (en) 2001-09-04 2003-03-20 Icon Genetics Ag Manufacture of artificial internal ribosomal entry point elements (Ires elements)
WO2003052108A2 (en) 2001-12-18 2003-06-26 Bayer Bioscience N.V. Improved methods and means for delivering inhibitory rna to plants and applications thereof
AU2003235582A1 (en) * 2002-01-16 2003-07-30 Cropdesign N.V. Herbicide mode of action determination
DE10212158A1 (en) * 2002-03-19 2003-10-02 Metanomics Gmbh & Co Kgaa Population of transgenic plants, derived biological material, corresponding plasmid collection and population of transformed host organisms, as well as their use and methods for their production
MXPA05001034A (en) * 2002-07-29 2005-06-08 Univ Laval Method for enhancing yield of recombinant protein production from plants.
AU2003212136A1 (en) * 2003-03-07 2004-09-28 National Research Council Of Canada The determination of function of mammalian proteins by analysis of global gene expression profiles
CN102498998B (en) * 2011-11-24 2013-09-18 湖南杂交水稻研究中心 Method for purification, retainment and propagation of seeds of rice photo-thermo-sensitive male sterile line
DE102013210506A1 (en) * 2013-06-06 2014-12-11 Henkel Ag & Co. Kgaa Alcohol oxidase with activity for glycerol
WO2019006263A1 (en) * 2017-06-30 2019-01-03 Dow Global Technologies Llc Coating for aldehyde remediation and method of making
AR112465A1 (en) 2017-07-31 2019-10-30 Reynolds Tobacco Co R METHODS AND COMPOSITIONS FOR THE EDITION OF VIRAL BASED GENES IN PLANTS
CN111328347A (en) 2017-09-11 2020-06-23 R.J.雷诺兹烟草公司 Methods and compositions for increasing expression of a gene of interest in a plant by co-expression with P21

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5922602A (en) * 1988-02-26 1999-07-13 Biosource Technologies, Inc. Cytoplasmic inhibition of gene expression
GB8916213D0 (en) * 1989-07-14 1989-08-31 Ici Plc Dna constructs,cells and plants derived therefrom
GB9703146D0 (en) * 1997-02-14 1997-04-02 Innes John Centre Innov Ltd Methods and means for gene silencing in transgenic plants

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9936516A2 *

Also Published As

Publication number Publication date
AU761367B2 (en) 2003-06-05
WO1999036516A2 (en) 1999-07-22
CA2318662A1 (en) 1999-07-22
WO1999036516A3 (en) 2000-03-16
AU2328699A (en) 1999-08-02
JP2002508957A (en) 2002-03-26
KR20010040337A (en) 2001-05-15

Similar Documents

Publication Publication Date Title
AU761367B2 (en) Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host
US20050005319A1 (en) Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host
US6800748B2 (en) Cytoplasmic inhibition of gene expression and expression of a foreign protein in a monocot plant by a plant viral vector
AU2002361997A1 (en) Improved methods and means for delivering inhibitory rna to plants and applications thereof
EP0999281A1 (en) Non-nuclear chromosomal transformation
US7498480B2 (en) Cytoplasmic gene inhibition or gene expression in transfected plants by a tobraviral vector
Vidal et al. Use of gene transfer technology for functional studies in grapevine
JPH11507217A (en) Recombinant virus nucleic acid
US6426185B1 (en) Method of compiling a functional gene profile in a plant by transfecting a nucleic acid sequence of a donor plant into a different host plant in an anti-sense orientation
US6468745B1 (en) Method for expressing a library of nucleic acid sequence variants and selecting desired traits
US6300133B1 (en) RNA transformation vectors derived from an uncapped single-component RNA virus
AU2003220702A1 (en) Method of determining the function of nucleotide sequences and the proteins they encode by transfecting the same into a host
US20020164585A1 (en) Method for enhancing RNA or protein production using non-native 5' untranslated sequences in recombinant viral nucleic acids
AU6608300A (en) Method of correlating sequence function by transfecting a nucleic acid sequence of a donor organism into a plant host in an anti-sense or positive sense orientation
US20030167512A1 (en) Method of determining the presence of a trait in a plant by transfecting a nucleic acid sequence of a donor plant into a different host plant in a positive orientation
US6300134B1 (en) RNA transformation vectors derived from a single-component RNA virus and contain an intervening sequence between the cap and the 5′ end
US20030166169A1 (en) Method for constructing viral nucleic acids in a cell-free manner
US20030041355A1 (en) Method of humanizing plant cDNA
WO2001007613A2 (en) Method for enhancing rna or protein production using non-native 5' untranslated sequences in recombinant viral nucleic acids
US20030064392A1 (en) Method of humanizing plant cDNAs by transfecting a nucleic acid sequence of a non-plant donor into a host plant in an anti-sense orientation
US20030097683A1 (en) Single-component RNA vectors derived from a virus and containing an intervening sequence between the cap and the 5' end and able to replicate in a host plant cell within a host plant
AU2002242001A1 (en) Cytoplasmic gene inhibition or gene expression in transfected plants by a tobraviral vector
AU3400100A (en) Recombinant viral nucleic acids

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20000727

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

17Q First examination report despatched

Effective date: 20041018

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20050301