WO2001064022A2 - Genes du type cotyledon1 feuillu et leurs utilisations - Google Patents

Genes du type cotyledon1 feuillu et leurs utilisations Download PDF

Info

Publication number
WO2001064022A2
WO2001064022A2 PCT/US2001/005454 US0105454W WO0164022A2 WO 2001064022 A2 WO2001064022 A2 WO 2001064022A2 US 0105454 W US0105454 W US 0105454W WO 0164022 A2 WO0164022 A2 WO 0164022A2
Authority
WO
WIPO (PCT)
Prior art keywords
seq
promoter
nucleic acid
expression cassette
lec
Prior art date
Application number
PCT/US2001/005454
Other languages
English (en)
Inventor
John Harada
Tamar Lotan
Masa-Aki Ohto
Robert B. Goldberg
Robert L. Fischer
Anhthu Bui
Raymond Kwong
Original Assignee
The Regents Of The University Of California
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/516,052 external-priority patent/US6781035B1/en
Application filed by The Regents Of The University Of California filed Critical The Regents Of The University Of California
Priority to EP01912861A priority Critical patent/EP1263280A4/fr
Priority to AU4160001A priority patent/AU4160001A/xx
Priority to AU2001241600A priority patent/AU2001241600B2/en
Priority to CA002399886A priority patent/CA2399886A1/fr
Priority to JP2001562933A priority patent/JP2004500823A/ja
Publication of WO2001064022A2 publication Critical patent/WO2001064022A2/fr

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8222Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
    • C12N15/823Reproductive tissue-specific promoters
    • C12N15/8234Seed-specific, e.g. embryo, endosperm
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8262Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield involving plant development
    • C12N15/8267Seed dormancy, germination or sprouting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146Genetically Modified [GMO] plants, e.g. transgenic plants

Definitions

  • the present invention is directed to plant genetic engineering.
  • it relates to new embryo-specific genes useful in improving agronomically important plants.
  • Embryogenesis in higher plants is a critical stage of the plant life cycle in which the primary organs are established.
  • Embryo development can be separated into two main phases: the early phase in which the primary body organization of the embryo is laid down and the late phase which involves maturation, desiccation and dormancy.
  • the early phase the symmetry of the embryo changes from radial to bilateral, giving rise to a hypocotyl with a shoot meristem surrounded by the two cotyledonary primordia at the apical pole and a root meristem at the basal pole.
  • the late phase during maturation the embryo achieves its maximum size and the seed accumulates storage proteins and lipids. Maturation is ended by the desiccation stage in which the seed water content decreases rapidly and the embryo passes into metabolic quiescent state.
  • Dormancy ends with seed germination, and development continues from the shoot and the root meristem regions.
  • abscisic acid ABA
  • ABA abscisic acid
  • the VP1 gene has been shown to function as a transcription activator (McCarty. et al., ( 1991) Cell 66:895-906). It has been suggested that ABI3 has a similar function.
  • Another class of embryo defective mutants involves three genes: LEAFY
  • COTYLEDON 1 and 2 (LEC 1 , LEC2) and FUSCA3 (FUS3). These genes are thought to play a central role in late embryogenesis (Baumlein, et al. ( 1994) Plant J. 6:379-387; Meinke. D. W. (1992) Science 258: 1647-1650; Meinke et al.. Plant Cell 6: 1049- 1064: West et al., ( 1994) Plant Cell 6: 1731 -1745). Like the abi3 mutant, leafy cotyledon-type mutants are defective in late embryogenesis. In these mutants, seed morphology is altered, the shoot meristem is activated early, storage proteins are lacking and developing cotyledons accumulate anthocyanin.
  • abi3 mutants As with abi3 mutants, they are desiccation intolerant and therefore die during late embryogenesis. Nevertheless, the immature mutants embryos can be rescued to give rise to mature and fertile plants. However, unlike abi3 when the immature mutants germinate they exhibit trichomes on the adaxial surface of the cotyledon. Trichomes are normally present only on leaves, stems and sepals, not cotyledons. Therefore, it is thought that the leafy cotyledon type genes have a role in specifying cotyledon identity during embryo development.
  • the lee 1 mutant exhibits the most extreme phenotype during embryogenesis.
  • the maturation and postgermination programs are active simultaneously in the lecl mutant (West et al., 1994). suggesting a critical role for LEC 1 in gene regulation during late embryogenesis.
  • the present invention is based, in part, on the isolation and characterization of
  • LEC l genes The invention provides isolated nucleic acid molecules comprising a LEC l polynucleotide sequence which is at least 68% identical to the B domain of SEQ ID NO:2.
  • the invention also provides expression cassettes comprising a promoter operably linked to a heterologous polynucleotide sequence or complement thereof, encoding a LEC l polypeptide comprising a sequence which is at least 68% identical to the B domain of SEQ ID NO:2.
  • the polynucleotide sequence is heterologous to any element in the expression cassette.
  • the B domain comprises a polypeptide between about amino acid residue 28 and amino acid residue 1 17 of SEQ ID NO:2.
  • the B domain comprises a polypeptide sequence with an amino terminus at amino acid residues 28-35 and a carboxy terminus at amino acid residues 103-1 17 of SEQ ID NO:2.
  • the LECl polypeptide is shown in SEQ ID NO:20 or 22.
  • Such LEC l polypeptides can be encoded by the polynucleotide sequences shown in SEQ ID NO: 19 or SEQ ID NO:21. respectively.
  • the LECl polypeptide is a fusion between two or more LEC l polypeptides of polypeptide subsequences.
  • the expression cassette comprises a promoter operably linked to the LEC l polynucleotide or its complement.
  • the promoter can be a constitutive promoter.
  • the promoter can be a promoter from a LEC 1 gene.
  • the LEC 1 promoter can be from about nucleotide 1 to about nucleotide 1998 of SEQ ID NO:3.
  • the heterologous polynucleotide can be linked to the promoter in the antisense orientation.
  • the promoter is SEQ ID NO:23.
  • the promoter can further compromise SEQ ID NO:24.
  • the invention provides an expression cassette comprising a promoter operably linked to a heterologous polynucleotide sequence, or complement thereof, encoding a LEC l polypeptide comprising a subsequence at least 90% identical to the A or C domain of a LEC l polypeptide.
  • the polynucleotide sequence can be heterologous to any element in the expression cassette.
  • Such expression cassettes can encode fusions of two or more LEC l polypeptides or polypeptide subsequences.
  • the invention also provides for an expression cassette for the expression of heterologous polypeptides in a plant.
  • the expression cassette comprises a LECl promoter operably linked to a heterologous polynucleotide.
  • the LEC 1 promoter is at least 70% identical to SEQ ID NO:23.
  • the expression cassette promoter comprises a promoter at least 70% identical to SEQ ID NO:24.
  • the promoter comprises the sequence displayed in SEQ ID NO:24.
  • the invention also provides an isolated nucleic acid or complement thereof, encoding a LECl polypeptide comprising a subsequence at least 68% identical to the B domain of SEQ ID NO:2, with the proviso that the nucleic acid is not clone MN J7.
  • the B domain comprises a polypeptide sequence with an amino terminus at amino acids 28-35 and a carboxy terminus at amino acids 103- 1 1 7 of SEQ ID NO:2.
  • the LEC l polypeptide is shown in SEQ ID NO: 20 or SEQ ID NO:22.
  • LECl polypeptides can be encoded by the polynucleotide sequences shown in SEQ ID NO: 19 or SEQ ID NO:21 , respectively.
  • the LEC 1 polypeptide is a fusion between two or more LEC 1 polypeptides of polypeptide subsequences.
  • the isolated nucleic acid can further compromise a promoter operably linked to the LECl -encoding nucleic acid.
  • the promoter can be a constitutive promoter.
  • the promoter can be a promoter from a LEC l gene.
  • the LEC l promoter can be from about nucleotide 1 to about nucleotide 1998 of SEQ ID NO:3.
  • the heterologous polynucleotide can be linked to the promoter in the antisense orientation.
  • the invention provides a host cell comprising expression cassettes or nucleic acids of the invention.
  • the host cells of the invention comprise an expression cassette comprising a promoter operably linked to a heterologous a polynucleotide sequence, or complement thereof, encoding a LEC l polypeptide with a subsequence at least 68%o identical to the B domain of SEQ ID NO:2.
  • the host cell of the invention comprises an expression cassette comprising a promoter operably linked to a heterologous polynucleotide sequence or complement thereof, encoding a LEC 1 polypeptide with a subsequence at least 90%) identical to the A or C domain of a LECl polypeptide.
  • hosts cells comprising an expression cassette comprising a promoter at least 70%) identical to SEQ ID NO:23 or an isolated nucleic acid comprising a subsequence at least 68%) identical to the B domain of SEQ ID NO:2, so long as the nucleic acid is not clone MNJ7.
  • the invention also provides isolated polypeptides comprising amino acid sequences at least 68%o identical to the B domain of SEQ ID NO:2 and capable of exhibiting at least one of the biological activities of the polypeptides encoded in SEQ ID NO: l .
  • SEQ ID NO: 19 or SEQ ID NO:21 or a fragment thereof.
  • Antibodies capable of binding the above- described polypeptide are also provided.
  • the method comprises providing an expression cassette of nucleic acid of the invention as described herein and contacting the expression cassette or nucleic acid with the host cell under conditions that permit insertion of the nucleic acid into the host cell.
  • the invention also provides transgenic plant cells or plants comprising an expression cassette comprising a promoter operably linked to a heterologous polynucleotide sequence, or complement thereof, encoding a LEC l polypeptide comprising a sequence which is at least 68%> identical to the B domain of SEQ ID NO:2.
  • the LECl polypeptide is shown in SEQ ID NO: 20 or SEQ ID NO:22.
  • Such LEC l polypeptides can be encoded by the polynucleotide sequences shown in SEQ ID NO: 19 or SEQ ID NO:21 , respectivel) .
  • the invention also provides plants that are regenerated from the plant cells discussed above.
  • the expression cassette promoter can be a constitutive promoter.
  • the promoter can be a promoter from a LEC l gene.
  • the LEC l promoter can be from about nucleotide 1 to about nucleotide 1998 of SEQ ID NO:3.
  • the heterologous polynucleotide can be linked to the promoter in the antisense orientation.
  • the promoter is SEQ ID NO:23.
  • the promoter can also further comprise SEQ ID NO:24.
  • the invention also provides an expression cassette for the expression of a heterologous polynucleotide in a plant cell, comprising a promoter polynucleotide at least
  • the promoter polynucleotide is SEQ ID NO:23.
  • the promoter can also further comprise a polynucleotide at least 70%) identical to SEQ ID NO:24.
  • the promoter comprises SEQ ID NO:24.
  • the invention also provides methods of modulating transcription comprising, introducing into the plant an expression cassette containing a plant promoter operabh linked to a heterologous LECl polynucleotide, the heterologous LECl polynucleotide encoding a LEC l polypeptide comprising a subsequence at least 68%) identical to the B domain of SEQ ID NO:2 and detecting a plant with modulated transcription.
  • Embodiments of these methods include where the LECl polynucleotide is SEQ ID NO:2. SEQ ID NO:20 or SEQ ID NO:22. In other embodiments, the LECl polypeptides are encoded by SEQ ID NO: l . SEQ ID NO: 19 or SEQ ID NO:21.
  • Preferred embodiments of the invention include the method where transcription modulation results in induction of embyonic characteristics in a plant. In an alternative embodiment, transcription modulation results in induction of seed development.
  • the invention also provides a method of detecting a nucleic acid in a sample. The method comprises providing an isolated LECl nucleic acid molecule comprising a polynucleotide sequence, or complement thereof, encoding a LEC 1 polypeptide with a subsequence at least 68%> identical to the B domain of SEQ ID NO:2., contacting the isolated nucleic acid molecule with a sample under conditions which permit a comparison of the sequence of the isolated nucleic acid molecule with the sequence of DNA in the sample; and analyzing the result of the comparison.
  • the isolated nucleic acid molecule and the sample are contacted under conditions which permit the formation of a duplex between complementary nucleic acid sequences. Definitions
  • nucleic acid refers to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. Nucleic acids may also include modified nucleotides that permit correct read through by a polymerase and do not alter expression of a polypeptide encoded by that nucleic acid.
  • polynucleotide sequence or “nucleic acid sequence” includes both the sense and antisense strands of a nucleic acid as either individual single strands or in the duplex. It includes, but is not limited to, self-replicating plasmids, chromosomal sequences, and infectious polymers of DNA or RNA.
  • nucleic acid sequence encoding refers to a nucleic acid which directs the expression of a specific protein or peptide.
  • the nucleic acid sequences include both the DNA strand sequence that is transcribed into RNA and the RNA sequence that is translated into protein.
  • the nucleic acid sequences include both the full length nucleic acid sequences as well as non-full length sequences derived from the full length sequences. It should be further understood that the sequence includes the degenerate codons of the native sequence or sequences which may be introduced to provide codon preference in a specific host cell.
  • promoter refers to a region or sequence determinants located upstream or downstream from the start of transcription and which are involved in recognition and binding of RNA polymerase and other proteins to initiate transcription.
  • a "plant promoter” is a promoter capable of initiating transcription in plant cells. Such promoters need not be of plant origin, for example, promoters derived from plant viruses, such as the CaMV35S promoter, can be used in the present invention.
  • plant includes whole plants, shoot vegetative organs/structures (e.g. leaves, stems and tubers), roots, flowers and floral organs/structures (e.g. bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, and seed coat) and fruit (the mature ovary), plant tissue (e.g. vascular tissue, ground tissue, and the like) and cells (e.g. guard cells, egg cells, trichomes and the like), and progeny of same.
  • shoot vegetative organs/structures e.g. leaves, stems and tubers
  • roots e.g. bracts, sepals, petals, stamens, carpels, anthers and ovules
  • seed including embryo, endosperm, and seed coat
  • fruit the mature ovary
  • plant tissue e.g. vascular tissue, ground tissue, and the like
  • cells e.g. guard cells, egg cells, trichomes
  • the class of plants that can be used in the method of the invention is generally as broad as the class of higher and lower plants amenable to transformation techniques, including angiosperms (monocotyledonous and dicotyledonous plants), gymnosperms. ferns, and multicellular algae. It includes plants of a variety of ploidy levels, including aneuploid. polyploid. diploid, haploid and hemizygous.
  • a polynucleotide sequence is "heterologous to" an organism or a second polynucleotide sequence if it originates from a foreign species, or. if from the same species, is modified from its original form.
  • a promoter operably linked to a heterologous coding sequence refers to a coding sequence from a species different from that from which the promoter was derived, or, if from the same species, a coding sequence which is different from any naturally occurring allelic variants.
  • a modified LEC l coding sequence which is heterologous to an operably linked LEC l promoter does not include the T- DNA insertional mutants as described in West et al.. The Plant Cell 6: 1731 -1745 ( 1994).
  • a polynucleotide "exogenous to" an individual plant is a polynucleotide which is introduced into the plant by any means other than by a sexual cross. Examples of means by which this can be accomplished are described below, and include Agrobacterium- mediated transformation, biolistic methods, electroporation, in planta techniques, and the like. Such a plant containing the exogenous nucleic acid is referred to here as an R
  • embryo-specific gene or “seed specific gene” is a gene that is preferentially expressed during embryo development in a plant.
  • embryo development begins with the first cell divisions in the zygote and continues through the late phase of embryo development (characterized by maturation, desiccation, dormancy), and ends with the production of a mature and desiccated seed.
  • Embryo-specific genes can be further classified as “early phase-specific " and "late phase- specific”.
  • Early phase-specific genes are those expressed in embryos up to the end of embryo morphogenesis.
  • Late phase-specific genes are those expressed from maturation through to production of a mature and desiccated seed.
  • a "LECl polynucleotide” is a nucleic acid sequence comprising (or consisting of) a coding region of about 100 to about 900 nucleotides, sometimes from about 300 to about 630 nucleotides, which hybridizes to SEQ ID NO: l under stringent conditions (as defined below), or which encodes a LEC l polypeptide.
  • LEC l polynucleotides can also be identified by their ability to hybridize under low stringency conditions (e.g., Tm ⁇ 40°C) to nucleic acid probes having a sequence from position 1 to 81 in SEQ ID NO: l or from position 355 to 627 in SEQ ID NO: l .
  • a “promoter from a LEC l gene” or “LECl promoter” will typically be about 500 to about 2000 nucleotides in length, usually from about 750 to 1500. Exemplary promoter sequences are shown as nucleotides 1 - 1998 of SEQ ID NO:3 or as SEQ ID NO:23.
  • a LECl promoter can also be identified by its ability to direct expression in all. or essentialK all, proglobular embryonic cells, as well as cotyledons and axes of a late embryo.
  • a "LECl polypeptide” is a sequence of about 50 to about 210. sometimes 100 to 150, amino acid residues encoded by a LECl polynucleotide.
  • a full length LECl polypeptide and fragments containing a CCAAT binding factor (CBF) domain can act as a subunit of a protein capable of acting as a transcription factor in plant cells.
  • LEC l polypeptides are often distinguished by the presence of a sequence which is required for binding the nucleotide sequence: CCAAT.
  • a short region of seven residues (MPIANVI) at residues 34-40 of SEQ ID NO: 3 shows a high degree of similarity to a region that has been shown to required for binding the CCAAT box.
  • residues 61 -72 of SEQ ID NO: 3 (IQECVSEYISFV) is nearly identical to a region that contains a subunit interaction domain (Xing, et al.. ( 1993 ) EMBO J. 12:4647-4655).
  • a homolog of a particular embryo-specific gene is a second gene in the same plant type or in a different plant type, which has a polynucleotide sequence of at least 50 contiguous nucleotides which are substantially identical (determined as described below) to a sequence in the first gene. It is believed that. in general, homologs share a common evolutionary past.
  • “Increased or enhanced LEC l activity or expression of the LECl gene” refers to an augmented change in LECl activity. Examples of such increased activity or expression include the following. LEC l activit ⁇ or expression of the LECl gene is increased above the level of that in wild-type, non-transgenic control plants (i.e. the quantity of L ⁇ C 1 activity or expression of the LECl gene is increased). LECl activity or expression of the LECl gene is in an organ, tissue or cell where it is not normally detected in wild-type, non-transgenic control plants (i.e. spatial distribution of LECl activity or expression of the LECl gene is increased).
  • LECl activity or expression is increased when LEC l activity or expression of the LECl gene is present in an organ, tissue or cell for a longer period than in a wild-type, non- transgenic controls (i.e. duration of LEC l activity or expression of the LE l gene is increased).
  • a "polynucleotide sequence from" a particular embryo-specific gene is a subsequence or full length polynucleotide sequence of an embryo-specific gene which, when present in a transgenic plant, has the desired effect, for example, inhibiting expression of the endogenous gene driving expression of an heterologous polynucleotide.
  • a full length sequence of a particular gene disclosed here may contain about 95%, usually at least about 98%o of an entire sequence shown in the Sequence Listing, below.
  • reproductive tissues includes fruit, ovules, seeds. pollen, pistols, flowers, or any embryonic tissue.
  • polynucleotide sequence from a particular embryo-specific gene, such as LECl .
  • the term specifically includes sequences (e.g., full length sequences) substantially identical (determined as described below) with a LEC 1 gene sequence and that encode proteins that retain the function of a LEC l polypeptide.
  • the introduced sequence need not be perfectly identical to a sequence of the target endogenous gene.
  • the introduced polynucleotide sequence will typically be at least substantially identical (as determined below) to the target endogenous sequence.
  • nucleic acid sequences or polypeptides are said to be “identical” if the sequence of nucleotides or amino acid residues, respectively, in the two sequences is the same when aligned for maximum correspondence as described below.
  • complementary to is used herein to mean that the sequence is complementary to all or a portion of a reference polynucleotide sequence.
  • Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Add. APL. Math. 2:482 ( 1981), by the homology alignment algorithm of Needle man and Wunsch J. Mol. Biol 48:443 ( 1970), by the search for similarity method of Pearson and Lipman Proc. Natl. Acad. Sci. (U.S.A.) 85 : 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), or by inspection.
  • Percentage of sequence identity is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions ( i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
  • substantially identical of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 25%o sequence identity. Alternatively, percent identity can be any integer from 25% to 100%). More preferred embodiments include at least: 25%, 30%, 35%. 40%), 45%, 50%, 55%. 60%, 65%, 70%, 75%, 80%, 85%, 90%. 95%o, or 99%o. compared to a reference sequence using the programs described herein; preferably BLAST using standard parameters, as described below.
  • LEC l sequences of the invention include nucleic acid sequences that have substantial identity to SEQ ID NO: l , SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO: 19 and SEQ ID NO:21 .
  • LEC l sequences of the invention include polypeptide sequences having substantial identify to SEQ ID NO:2.
  • SEQ ID NO:20 or SEQ ID NO:22 One of skill will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like.
  • Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 40%.
  • Preferred percent identity of polypeptides can be any integer from 40%o to 100%.
  • More preferred embodiments include at least 60%, 65%, 70%, 75%, 80%, 85%o. 90%, 95%, or 99%. Most preferred embodiments include 67%. 68%, 69%, 70%, 71%, 72%, 73%, 74% and 75%.
  • Polypeptides which are "substantially similar" share sequences as noted above except that residue positions which are not identical may differ by conservative amino acid changes. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine.
  • alanine, valine, leucine, and isoleucine a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine.
  • Preferred conservative amino acids substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanme-valine. aspartic acid-glutamic acid, and asparagine-glutamine.
  • Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other, or a third nucleic acid, under stringent conditions. Stringent conditions are sequence dependent and will be different in different circumstances. Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.
  • the Tin is the temperature (under defined ionic strength and pH) at which 50%> of the target sequence hybridizes to a perfectly matched probe.
  • stringent conditions will be those in which the salt concentration is about 0.02 molar at pFI 7 and the temperature is at least about 60 ⁇ C.
  • RNA encoded by embryo-specific genes of the invention can be identified in Northern blots under stringent conditions using cDNAs of the invention or fragments of at least about 100 nucleotides.
  • stringent conditions for such RNA-DNA hybridizations are those which include at least one wash in 0.2X SSC at 63 ⁇ C for 20 minutes, or equivalent conditions.
  • Genomic DNA or cDNA comprising genes of the invention can be identified using the same cDNAs (or fragments of at least about 100 nucleotides) under stringent conditions, which for purposes of this disclosure, include at least one wash (usually 2) in 0.2X SSC at a temperature of at least about 50°C, usually about 55°C, for 20 minutes, or equivalent conditions.
  • Figure 1A shows a schematic representation of the three domains of the LEC l polypeptide.
  • Figure 2B shows a comparison of the predicted amino acid sequence of the B domain encoded by LECl with HAP3 homologs from maize, chicken, lamprey, Xenopus laveis, human, mouse, rat, Emericella nidulans, Schizosaccharomyces pombe,
  • the present invention provides new embryo-specific genes useful in genetically engineering plants.
  • Polynucleotide sequences from the genes of the invention can be used, for instance, to direct expression of desired heterologous genes in embryos (in the case of promoter sequences) or to modulate development of embryos or embyonic characteristics on other organs (e.g., by enhancing expression of the gene in a transgenic plant).
  • the invention provides a new gene from Arabidopsis referred to here as LEC l .
  • LEC l encodes polypeptides which subunits of a protein which acts as a transcription factor.
  • modulation of the expression of this gene can be used to manipulate a number of useful traits, such as increasing or decreasing storage protein content in cotyledons or leaves.
  • oligonucleotide probes based on the sequences disclosed here can be used to identify the desired gene in a cDNA or genomic DNA library from a desired plant species.
  • genomic libraries large segments of genomic DNA are generated by random fragmentation, e.g. using restriction endonucleases. and are ligated with vector DNA to form concatemers that can be packaged into the appropriate vector.
  • the cDNA or genomic library can then be screened using a probe based upon the sequence of a cloned embryo-specific gene such as the polynucleotides disclosed here. Probes may be used to hybridize with genomic DNA or cDNA sequences to isolate homologous genes in the same or different plant species.
  • the nucleic acids of interest can be amplified from nucleic acid samples using amplification techniques. For instance, polymerase chain reaction (PCR) technology to amplify the sequences of the genes directly from mRNA. from cDNA. from genomic libraries or cDNA libraries. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences that code for proteins to be expressed, to make nucleic acids to use as probes for detecting the presence of the desired mRNA in samples, for nucleic acid sequencing, or for other purposes.
  • PCR polymerase chain reaction
  • Appropriate primers and probes for identifying embryo-specific genes from plant tissues are generated from comparisons of the sequences provided herein.
  • PCR Protocols A Guide to Methods and Applications. (Innis. M. Gelfand, D., Sninsky, J. and White, T., eds.). Academic Press. San Diego ( 1990).
  • Appropriate primers for this purpose include, for instance: UP primer - 5' GGA ATT CAG CAA CAA CCC AAC CCC A 3" and LP primer - 5' LP primer - 5' GCT CTA GAC ATA CAA CAC TTT TCC TTA 3'.
  • primer pairs can be used: 5' ATG ACC AGC TCA GTC ATA GTA GC 3' and 5' GCC ACA CAT GGT GGT TGC TGC TG 3' or 5' GAG ATA GAG ACC GAT CGT GGT TC 3' and 5' TCA CTT ATA CTG ACC ATA ATG GTC 3'.
  • a third set of primers include: 5'-AGG ATC CAT GGA ACG TGG
  • the amplifications conditions are typically as follows. Reaction components: 10 mM Tris-HCl. pH 8.3, 50 mM potassium chloride, 1.5 mM magnesium chloride, 0.001 % gelatin, 200 microM (uM) dATP, 200 microM dCTP. 200 microM dGTP, 200 microM dTTP, 0.4 microM primers, and 100 units per ml Taq polymerase. Program: 96 C for 3 min., 30 cycles of 96 C for 45 sec, 50 C for 60 sec, 72 for 60 sec, followed by 72 C for 5 min.
  • Polynucleotides may also be synthesized by well-known techniques as described in the technical literature. See, e.g., Carruthers et al , Cold Spring Harbor Symp. Quant. Biol. 47:41 1-418 (1982). and Adams et al., J. Am. Chem. Soc. 105:661 (1983). Double stranded DNA fragments may then be obtained either by synthesizing the complementary strand and annealing the strands together under appropriate conditions, or by adding the complementary strand using DNA polymerase with an appropriate primer sequence. Analysis of LEC l Gene Sequences
  • LEC l nucleic acid sequences of the invention includes genes and gene products identified and characterized by analysis using the sequences nucleic acid sequences, including SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4. SEQ ID NO: 19 and SEQ ID NO:21 , and protein sequences, including SEQ ID NO:2. SEQ ID NO:20 and SEQ ID NO:22.
  • LEC l sequences of the invention include nucleic acid sequences having substantial identity to SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4. SEQ ID NO: 19 and SEQ ID NO:21 .
  • LEC l sequences of the invention include polypeptide sequences having substantial identify to SEQ ID NO:2, SEQ ID NO:20 and SEQ ID NO:22.
  • LECl nucleic acid sequences also include fusions between two or more LEC l genes. Different domains of different genes can be fused. LEC l gene fusions can be linked directly or can be attached by additional amino acids that link the two of more fusion partners.
  • Gene fusions can be generated by basic recombinant DNA techniques as described below. Selection of gene fusions will depend on the desired phenotype caused by the gene fusion. For instance, if phenotypes associated with the A domain of one LEC l protein are desired with phenotypes associated with the B domain of a second LEC 1 protein, the a fusion of the first LEC l protein ' s A domain to the second LEC l 's B domain would be created. The fusion can subsequently be tested in vitro or in vivo for the desired phenotypes.
  • nucleic acids of the invention to inhibit eene expression
  • expression cassettes of the invention can be used to suppress endogenous LEC l gene expression.
  • Inhibiting expression can be useful, for instance, in weed control (by transferring an inhibitory sequence to a weedy species and allowing it to be transmitted through sexual crosses) or to produce fruit with small and non-viable seed.
  • antisense technology can be conveniently used. To accomplish this, a nucleic acid segment from the desired gene is cloned and operably linked to a promoter such that the antisense strand of RNA will be transcribed. The expression cassette is then transformed into plants and the antisense strand of RNA is produced.
  • antisense RNA inhibits gene expression by preventing the accumulation of mRNA which encodes the enzyme of interest, see, e.g., Sheehy et al.. Proc. Nat. Acad. Sci. USA, 85:8805-8809 ( 1988), and Hiatt et al. U.S. Patent No. 4,801 ,340.
  • the antisense nucleic acid sequence transformed into plants will be substantially identical to at least a portion of the endogenous embryo-specific gene or genes to be repressed.
  • the sequence does not have to be perfectly identical to inhibit expression.
  • the vectors of the present invention can be designed such that the inhibitory effect applies to other proteins within a family of genes exhibiting homology or substantial homology to the target gene.
  • the introduced sequence also need not be full length relative to either the primary transcription product or fully processed mRNA. Generally, higher homology can be used to compensate for the use of a shorter sequence. Furthermore. the introduced sequence need not have the same intron or exon pattern, and homology of non- coding segments may be equally effective. Normally, a sequence of between about 30 or 40 nucleotides and about full length nucleotides should be used, though a sequence of at least about 100 nucleotides is preferred, a sequence of at least about 200 nucleotides is more preferred, and a sequence of at least about 500 nucleotides is especially preferred. Catalytic RNA molecules or ribozymes can also be used to inhibit expression of embryo-specific genes.
  • ribozymes that specifically pair with virtually any target RNA and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA.
  • the ribozyme In carrying out this cleavage, the ribozyme is not itself altered, and is thus capable of recycling and cleaving other molecules, making it a true enzyme.
  • the inclusion of ribozyme sequences within antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the activity of the constructs.
  • RNAs A number of classes of ribozymes have been identified.
  • One class of ribozymes is derived from a number of small circular RNAs that are capable of self-cleavage and replication in plants.
  • the RNAs replicate either alone (viroid RNAs) or with a helper virus (satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the satellite RNAs from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco mottle virus, solanum nodiflorum mottle virus and subterranean clover mottle virus.
  • the design and use of target RNA-specific ribozymes is described in Haseloff et al. Nature, 334:585-591 (1988).
  • Another method of suppression is sense suppression.
  • Introduction of expression cassettes in which a nucleic acid is configured in the sense orientation with respect to the promoter has been shown to be an effective means by which to block the transcription of target genes.
  • this method to modulate expression of endogenous genes see, Napoli et al.. The Plant Cell 2:279-289 ( 1990). and U.S. Patents Nos. 5,034,323, 5,231 ,020, and 5.283, 184.
  • the introduced sequence generally will be substantially identical to the endogenous sequence intended to be repressed. This minimal identity will typically be greater than about 65%o, but a higher identity might exert a more effective repression of expression of the endogenous sequences. Substantially greater identity of more than about 80%> is preferred, though about 95% to absolute identity would be most preferred. As with antisense regulation, the effect should apply to any other proteins within a similar family of genes exhibiting homology or substantial homology.
  • the introduced sequence in the expression cassette needing less than absolute identity, also need not be full length, relative to either the primary transcription product or fully processed mRNA. This may be preferred to avoid concurrent production of some plants which are overexpressers. A higher identity in a shorter than full length sequence compensates for a longer, less identical sequence. Furthermore, the introduced sequence need not have the same intron or exon pattern, and identity of non- coding segments will be equally effective. Normally, a sequence of the size ranges noted above for antisense regulation is used.
  • families of homologous genes can be suppressed with a single sense or antisense transcript.
  • a sense or antisense transcript is designed to have a sequence that is conserved among a family of genes (e.g. , the B domain of LECl), then multiple members of a gene family can be suppressed.
  • the sense or antisense transcript should be targeted to sequences with the most vairance between family members.
  • an antisense transcript identical to the A and C domains of LEC l can be used to suppress LECl without suppressing related genes such as described in SEQ ID NO: 19 or SEQ ID NO:21.
  • LECl function in a plant is by creation of dominant negative mutations.
  • non-functional, mutant LECl polypeptides which retain the ability to interact with wild-type subunits are introduced into a plant.
  • Identification of residues that can be changed to create a dominant negative can be determined by published work examining interaction of different subunits of CBF homologs from different species (see, e.g., Sinha et al., ( 1995). Proc. Nail. Acad. Sci. USA 92: 1624-1628.)
  • nucleic acids of the invention to enhance gene expression
  • Isolated sequences prepared as described herein can also be used to prepare expression cassettes which enhance or increase endogenous LEC l gene expression. Where overexpression of a gene is desired, the desired gene from a different species may be used to decrease potential sense suppression effects. Enhanced expression of LEC l polynucleotides is useful, for example, to increase storage protein content in plant tissues. Such techniques may be particularly useful for improving the nutritional value of plant tissues.
  • Any of a number of means well known in the art can be used to increase LEC 1 activity in plants.
  • Enhanced expression is useful, for example, to induce embyonic characteristics in plants or plant organs.
  • Any organ can be targeted, such as shoot vegetative organs/structures (e.g. leaves, stems and tubers), roots, flowers and floral organs/structures (e.g. bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, and seed coat) and fruit.
  • shoot vegetative organs/structures e.g. leaves, stems and tubers
  • roots e.g. bracts, sepals, petals, stamens, carpels, anthers and ovules
  • seed including embryo, endosperm, and seed coat
  • fruit including embryo, endosperm, and seed coat
  • one or several LEC 1 genes can be expressed constitutively (e.g., using the CaMV 35S promoter).
  • polypeptides encoded by the genes of the invention like other proteins, have different domains which perform different functions.
  • LEC l polypeptides are related to CCAAT box-binding factor (CBF) proteins.
  • CBFs are highly conserved family of transcription factors that regulate gene activity in eukaryotic organisms (see, e.g,, Mantvani ( 1992) Nucl. Acids Res. 20: 1087 -1091 ; Li (1992) Nucleic Acids Res. 20: 1087-1091 ).
  • LEC l was found to have high similarity to a portion of the HAP3 subunit of CBF. Thus, without being bound to any particular theory or mechanism, LECl is likely to act as a transcriptional modulator.
  • HAP3 is divided into three domains, an amino terminal A domain, a central B domain, and a carboxyl terminal C domain, as shown diagrammatically in Figure 2A.
  • LEC l has between about 75%o and 85% sequence similarity, which is equivalent to 55% to 63%> sequence identity, with the B domains of the other HAP3 homologs shown in Figure 2B; see also, Example 1 , below.
  • Figure 2B shows the amino acid sequence homology between LEC 1 and other CBF homologs.
  • the LEC l polypeptide also has an amino terminal A domain, a central B domain, and a carboxyl terminal C domain.
  • the three domains of the LEC l polypeptide are defined as follows: in SEQ ID NO:2, the A domain is located between about amino acid position 1 to about position 27; the B domain is located between about amino acid position 28 to about position 1 17; and, the C domain is located between about position 1 18 to about position 208.
  • the B domain of LEC l , L1L and Phaseolus L1 L are all closely related, whereas the A and C domains display almost no homology to each other.
  • the nucleotide sequence for LEC l corresponding to each domain is displayed in SEQ ID NO 1 , e.g.. the A domain is located between about nucleotide position 1 to about nucleotide position 82; the B domain is located between about nucleotide position 83 to about nucleotide position 351 ; the C domain is located between about nucleotide position 352 to about nucleotide position 624.
  • domain boundaries are approximate.
  • the boundaries for the domains of the LEC l polypeptides and nucleotides can vary from 1 to 20 amino acids residues ( 1 -60 nucleotides) from the boundaries listed above.
  • LECl polypeptides is thought to be modulated by a short region of seven residues, MPIANV1 (found, e.g. , at residues 34-40 of SEQ ID NO: 2). Thus, the polypeptides of the invention will often retain these sequences.
  • LEC l genes Methods for introducing genetic mutations into plant genes and selecting plants with desired traits are well known. For instance, seeds or other plant material can be treated with a mutagenic chemical substance, according to standard techniques. Such chemical substances include, but are not limited to. the following: diethyl sulfate. ethylene imine, ethyl methanesulfonate and N-nitroso-N-ethylurea. Alternatively, ionizing radiation from sources such as, X-rays or gamma rays can be used.
  • Modified protein chains can also be readily designed utilizing various recombinant DNA techniques well known to those skilled in the art and described for instance, in Sambrook et al., supra. Hydroxylamine can also be used to introduce single base mutations into the coding region of the gene (Sikorski, et al.. (1991 ). Meth. Enzymol. 194: 302-318).
  • the chains can vary from the naturally occurring sequence at the primary structure level by amino acid substitutions, additions, deletions, and the like. These modifications can be used in a number of combinations to produce the final modified protein chain.
  • homologous recombination can be used to induce targeted gene modifications by specifically targeting the LECl gene in vivo (see, generally, Grewal and Klar, Genetics 146: 1221 -1238 ( 1997) and Xu el al.. Genes Dev. 10: 241 1 -2422 ( 1996)). Homologous recombination has been demonstrated in plants (Puchta et al. , Experientia 50: 277-284 (1994), Swoboda et al. , EMBO J. 13: 484-489 ( 1994); Offringa el al.. Proc. Natl Acad. Sci. USA 90: 7346-7350 ( 1993); and Kempin et al. Nature 389:802-803 ( 1997)).
  • oligonucleotides composed of a contiguous stretch of RNA and DNA residues in a duplex conformation with double hairpin caps on the ends can be used.
  • the RNA/DNA sequence is designed to align with the sequence of the target LECl gene and to contain the desired nucleotide change.
  • Introduction of the chimeric oligonucleotide on an extrachromosomal T-DNA plasmid results in efficient and specific LEC l gene conversion directed by chimeric molecules in a small number of transformed plant cells. This method is described in Cole-Strauss et al, Science 273: 1386-1389 (1996) and Yoon et al Proc. Natl. Acad. Sci. USA 93: 2071 -2076 (1996).
  • Desired modified LECl polypeptides can be identified using assays to screen for the presence or absence of wild type LECl activity. Such assays can be based on the ability of the LECl protein to functionally complement the hap3 mutation in yeast. As noted above, it has been shown that homologs from different species functionally interact with yeast subunits of the CBF. (Sinha, et al, (1995). Proc. Natl. Acad. Sci. USA 92: 1624-1628); see, also, Becker, et al., (1991). Proc. Natl. Acad. Sci. USA 88: 1968-1972).
  • the reporter for this screen can be any of a number of standard reporter genes such as the lacZ gene encoding beta-galactosidase that is fused with the regulatory DNA sequences and promoter of the yeast CYC1 gene. This promoter is regulated by the yeast CBF.
  • a plasmid containing the LECl cD A clone is mutagenized in vitro according to techniques well known in the art.
  • the cDNA inserts are excised from the plasmid and inserted into the cloning site of a yeast expression vector such as pYES2 (Invitrogen).
  • the plasmid is introduced into hap3- yeast containing a lacZ reporter that is regulated by the yeast CBF such as pLG265UP l -lacZ (Guarente, et al , ( 1984) Cell 36: 317-321). Transformants are then selected and a filter assay is used to test colonies for beta-galactosidase activity.
  • the promoters derived from the LEC l genes of the invention can be used to drive expression of heterologous genes in an embryo-specific or seed-specific manner, such that desired gene products are present in the embryo, seed, or fruit.
  • Suitable structural genes that could be used for this purpose include genes encoding proteins useful in increasing the nutritional value of seed or fruit. Examples include genes encoding enzymes involved in the biosynthesis of antioxidants such as vitamin A. vitamin C. vitamin E and melatonin.
  • Other suitable genes encoding proteins involved in modification of fatty acids, or in the biosynthesis of lipids, proteins, and carbohydrates are examples of the promoters derived from the LEC l genes of the invention.
  • promoters are identified by analyzing the 5' sequences of a genomic clone corresponding to the embryo-specific genes described here. Sequences characteristic of promoter sequences can be used to identify the promoter. Sequences controlling eukaryotic gene expression have been extensively studied. For instance, promoter sequence elements include the TATA box consensus sequence (TATAAT), which is usually 20 to 30 base pairs upstream of the transcription start site. In most instances the TATA box is required for accurate transcription initiation.
  • TATAAT TATA box consensus sequence
  • DNA sequence coding for the desired polypeptide for example a cDNA sequence encoding a full length protein, will preferably be combined with transcriptional and translational initiation regulatory sequences which will direct the transcription of the sequence from the gene in the intended tissues of the transformed plant.
  • a plant promoter fragment may be employed which will direct expression of the gene in all tissues of a regenerated plant.
  • Such promoters are referred to herein as "constitutive" promoters and are active under most environmental conditions and states of development or cell differentiation.
  • constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1 '- or 2'- promoter derived from T-DNA of Agrobacterium tumafaciens, and other transcription initiation regions from various plant genes known to those of skill.
  • the plant promoter may direct expression of the polynucleotide of the invention in a specific tissue (tissue-specific promoters) or may be otherwise under more precise environmental control (inducible promoters).
  • tissue-specific promoters under developmental control include promoters that initiate transcription only in certain tissues, such as fruit, seeds, or flowers.
  • the promoters from the LEC l genes described here are particularly useful for directing gene expression so that a desired gene product is located in embryos or seeds.
  • Other suitable promoters include those from genes encoding storage proteins or the lipid body membrane protein, oleosin. Examples of environmental conditions that may affect transcription by inducible promoters include anaerobic conditions, elevated temperature, or the presence of light.
  • polyadenylation region at the 3'- end of the coding region should be included.
  • the polyadenylation region can be derived from the natural gene, from a variety of other plant genes, or from T-DNA.
  • the vector comprising the sequences (e.g., promoters or coding regions) from genes of the invention will typically comprise a marker gene which confers a selectable phenotype on plant cells.
  • the marker may encode biocide resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosluforon or Basta.
  • LEC l nucleic acid sequences of the invention are expressed recombinantly in plant cells to enhance and increase levels of endogenous LEC l polypeptides.
  • antisense or other LECl constructs are used to suppress LEC l levels of expression.
  • a variety of different expression constructs, such as expression cassettes and vectors suitable for transformation of plant cells can be prepared. Techniques for transforming a wide variety of higher plant species are well known and described in the technical and scientific literature. See. e.g., Weising et al. Ann. Rev. Genet. 22:421 -477 (1988). A DNA sequence coding for a LEC l polypeptide.
  • a cDNA sequence encoding a full length protein can be combined with cis-acting (promoter) and trans-acting (enhancer) transcriptional regulatory sequences to direct the timing, tissue type and levels of transcription in the intended tissues of the transformed plant.
  • promoter cis-acting
  • enhancer trans-acting
  • Translational control elements can also be used.
  • the invention provides a LEC l nucleic acid operably linked to a promoter which, in a preferred embodiment, is capable of driving the transcription of the LEC l coding sequence in plants.
  • the promoter can be, e.g.. derived from plant or viral sources.
  • the promoter can be, e.g., constitutively active, inducible. or tissue specific
  • a different promoters can be chosen and employed to differentially direct gene expression, e.g., in some or all tissues of a plant or animal.
  • promoter sequence elements include the TATA box consensus sequence (TATAAT), which is usually 20 to 30 base pairs upstream of the transcription start site. In most instances the TATA box is required for accurate transcription initiation. In plants, further upstream from the TATA box, at positions -80 to -100, there is typically a promoter element with a series of adenines surrounding the trinucleotide G (or T) N G. J.
  • a promoter fragment can be employed which will direct expression of LEC 1 nucleic acid in all transformed cells or tissues, e.g. as those of a regenerated plant.
  • Such promoters are referred to herein as “constitutive” promoters and are active under most environmental conditions and states of development or cell differentiation. Promoters that drive expression continuously under physiological conditions are referred to as “constitutive” promoters and are active under most environmental conditions and states of development or cell differentiation.
  • constitutive promoters include those from viruses which infect plants, such as the cauliflower mosaic virus (CaMV) 35S transcription initiation region (see, e.g., Dagless (1997) Arch. Virol.
  • the 1 '- or 2'- promoter derived from T- DNA of Agrobacterium tumafaciens see, e.g., Mengiste ( 1997) supra; O'Grady ( 1995) Plant Mol Biol. 29:99-108
  • the promoter of the tobacco mosaic virus the promoter of Figw ort mosaic virus (see, e.g., Maiti ( 1997) Transgenic Res. 6: 143-156); actin promoters, such as the Arabidopsis actin gene promoter (see. e.g., Huang ( 1997) Plant Mol. Biol.
  • a plant promoter may direct expression of the LEC l nucleic acid of the invention under the influence of changing environmental conditions or developmental conditions.
  • environmental conditions that may effect transcription by inducible promoters include anaerobic conditions, elevated temperature, drought, or the presence of light.
  • inducible promoters are referred to herein as "inducible" promoters.
  • the invention incorporates the drought-inducible promoter of maize (Busk (1997) supra); the cold, drought, and high salt inducible promoter from potato ( Kirch (1997) Plant Mol. Biol 33:897-909).
  • plant promoters which are inducible upon exposure to plant hormones, such as auxins, are used to express the nucleic acids of the invention.
  • the invention can use the auxin-response elements El promoter fragment (AuxREs) in the soybean (Glycine max L.) (Liu ( 1997) Plant Physiol. 1 15:397-407); the auxin- responsive Arabidopsis GST6 promoter (also responsive to salicylic acid and hydrogen peroxide) (Chen ( 1996) Plant J. 10: 955-966); the auxin-inducible parC promoter from tobacco (Sakai (1996) 37:906-913); a plant biotin response element (Streit ( 1997) Mol Plant Microbe Interact.
  • Plant promoters which are inducible upon exposure to chemicals reagents which can be applied to the plant, such as herbicides or antibiotics, are also used to express the nucleic acids of the invention.
  • the maize In2-2 promoter activated by benzenesulfonamide herbicide safeners, can be used (De Veylder (1997) Plant Cell Physiol 38:568-577); application of different herbicide safeners induces distinct gene expression patterns, including expression in the root, hydathodes, and the shoot apical meristem.
  • LEC l coding sequence can also be under the control of. e.g.. a tetracycline-inducible promoter, e.g.. as described with transgenic tobacco plants containing the Avena sativa L. (oat) arginine decarboxylase gene (Masgrau ( 1997) Plant J. 1 1 :465-473); or, a salicylic acid-responsive element (Stange (1997) Plant J. 1 1 : 1315-1324.
  • a tetracycline-inducible promoter e.g. as described with transgenic tobacco plants containing the Avena sativa L. (oat) arginine decarboxylase gene (Masgrau ( 1997) Plant J. 1 1 :465-473); or, a salicylic acid-responsive element (Stange (1997) Plant J. 1 1 : 1315-1324.
  • the plant promoter may direct expression of the polynucleotide of the invention in a specific tissue (tissue-specific promoters).
  • tissue specific promoters are transcriptional control elements that are only active in particular cells or tissues at specific times during plant development, such as in vegetative tissues or reproductive tissues.
  • Promoters from the LECl genes of the invention are particularly useful for tissue-specific direction of gene expression so that a desired gene product is generated only or preferentially in embryos or seeds, as described below.
  • tissue-specific promoters under developmental control include promoters that initiate transcription only (or primarily only) in certain tissues, such as vegetative tissues, e.g., roots or leaves, or reproductive tissues, such as fruit, ovules, seeds, pollen, pistols, flowers, or any embryonic tissue.
  • Reproductive tissue-specific promoters may be, e.g., ovule-specific, embryo-specific, endosperm-specific, integument-specific, seed and seed coat-specific, pollen-specific, petal-specific, sepal-specific, or some combination thereof.
  • Suitable seed-specific promoters are derived from the following genes: MAC 1 from maize, Sheridan (1996) Genetics 142: 1009-1020; Cat3 from maize. GenBank No. L05934, Abler (1993) Plant Mol. Biol. 22: 10131 -1038; vivparous-1 from Arabidopsis, Genbank No. U93215; atmycl from Arabidopsis, Urao ( 1996) Plant Mol. Biol. 32:571 -57; Conceicao ( 1994) Plant 5:493-505; napA from Brassica napus. GenBank No. J02798,
  • the egg and central cell specific FIEl promoter is also a useful reproductive tissue-specific promoter.
  • Sepal and petal specific promoters are also used to express LEC l nucleic acids in a reproductive tissue-specific manner.
  • the Arabidopsis floral homeotic gene APETALA1 encodes a putative transcription factor that is expressed in young flower primordia, and later becomes localized to sepals and petals (see. e.g., Gustafson- Brown
  • a related promoter, for AP2. a floral homeotic gene that is necessary for the normal development of sepals and petals in floral whorls, is also useful (see, e.g.. Drews ( 1991 ) Cell 65 :991 -1002; Bowman ( 1991 ) Plant Cell 3:749-758).
  • Another useful promoter is that controlling the expression of the unusual floral organs (ufo) gene of Arabidopsis, whose expression is restricted to the junction between sepal and petal primordia (Bossinger (1996) Development 122: 1093-1 102).
  • a maize pollen-specific promoter has been identified in maize (Guerrero (1990) Mol. Gen. Genet. 224: 161-168). Other genes specifically expressed in pollen are described, e.g., by Wakeley (1998) Plant Mol. Biol. 37: 187-192; Ficker ( 1998) Mol. Gen. Genet. 257: 132-142; Kulikauskas (1997) Plant Mol. Biol. 34:809-814; Treacy (1997) Plant Mol. Biol. 34:603-61 1.
  • promoters include those from genes encoding embryonic storage proteins.
  • Arabidopsis the gene encoding oleosin 20kD from Brassica napus, GenBank No. M63985; the genes encoding oleosin A, Genbank No. U091 1 8. and, oleosin B, Genbank No. U091 19. from soybean; the gene encoding oleosin from Arabidopsis, Genbank No. Z17657: the gene encoding oleosin 18kD from maize, GenBank No. J05212. Lee (1994) Plant Mol. Biol. 26: 1981 -1987; and, the gene encoding low molecular weight sulphur rich protein from soybean, Choi (1995) Mol Gen, Genet. 246:266-268, can be used.
  • the tissue specific E8 promoter from tomato is particularly useful for directing gene expression so that a desired gene product is located in fruits.
  • a tomato promoter active during fruit ripening, senescence and abscission of leaves and, to a lesser extent, of flowers can be used (Blume ( 1997) Plant J. 12:731 -746).
  • Other exemplary promoters include the pistol specific promoter in the potato (Solanum tuberosum L.) SK2 gene, encoding a pistil-specific basic endochitinase (Ficker (1997) Plant Mol. Biol. -.5:425-431 ); the Blec4 gene from pea (Pisum sativum cv. Alaska), active in epidermal tissue of vegetative and floral shoot apices of transgenic alfalfa. This makes it a useful tool to target the expression of foreign genes to the epidermal layer of actively growing shoots.
  • promoters specifically active in vegetative tissues can also be used to express the LEC l nucleic acids of the invention.
  • promoters controlling patatin the major storage protein of the potato tuber
  • the ORF13 promoter from Agrobacterium rhizogenes which exhibits high activity in roots can also be used (Hansen (1997) Mol. Gen. Genet. 254:337-343.
  • vegetative tissue-specific promoters include: the tarin promoter of the gene encoding a globulin from a major taro (Colocasia esculenta L. Schott) corm protein family, tarin (Bezerra ( 1995) Plant Mol. Biol. 28: 137-144); the curculin promoter active during taro corm development (de Castro (1992) Plant Cell 4: 1549- 1559) and the promoter for the tobacco root-specific gene TobRB7, whose expression is localized to root meristem and immature central cylinder regions (Yamamoto (1991) Plant Cell 3:371 -382).
  • Leaf-specific promoters such as the ribulose biphosphate carboxylase (RBCS) promoters can be used.
  • RBCS ribulose biphosphate carboxylase
  • the tomato RBCS 1 , RBCS2 and RBCS3A genes are expressed in leaves and light-grown seedlings, only RBCS 1 and RBCS2 are expressed in developing tomato fruits (Meier (1997) FEBS Lett. 415:91 -95).
  • a ribulose bisphosphate carboxylase promoters expressed almost exclusively in mesophyll cells in leaf blades and leaf sheaths at high levels, described by Matsuoka ( 1994) Plant J. 6:31 1 -319, can be used.
  • Another leaf-specific promoter is the light harvesting chlorophyll a/b binding protein gene promoter, see, e.g., Shiina ( 1997) Plant Physiol 1 15:477-483; Casal ( 1998) Plant Physiol. 1 16: 1533-1538.
  • the Arabidopsis thaliana myb-related gene promoter (Atmyb5) described by Li (1996) FEBS Lett. 379: 1 17-121. is leaf-specific.
  • the Atmyb5 promoter is expressed in developing leaf trichomes, stipules, and epidermal cells on the margins of young rosette and cauline leaves, and in immature seeds. AtmybS mRNA appears between fertilization and the 16 cell stage of embryo development and persists beyond the heart stage.
  • a leaf promoter identified in maize by Busk (1997) Plant J. 1 1 : 1285-1295, can also be used.
  • Another class of useful vegetative tissue-specific promoters are meristematic (root tip and shoot apex) promoters.
  • meristematic (root tip and shoot apex) promoters For example, the "SHOOTMERISTEMLESS” and “SCARECROW” promoters, which are active in the developing shoot or root apical meristems, described by Di Laurenzio ( 1996) Cell 86:423-433; and, Long ( 1996) Nature 379:66-69; can be used.
  • Another useful promoter is that which controls the expression of 3-hydroxy-3- methylglutaryl coenzyme A reductase ITMG2 gene, whose expression is restricted to meristematic and floral (secretory zone of the stigma, mature pollen grains, gynoecium vascular tissue, and fertilized ovules) tissues (see, e.g., Enjuto ( 1 995) Plant Cell 7:517-527). Also useful are knl -related genes from maize and other species which show meristem-speeific expression, see, e.g.. Granger (1996) Plant Mol. Biol. 31 :373-378; Kerstetter ( 1994) Plant Cell 6: 1877-1887; Hake ( 1995) Philos. Trans.
  • tissue-specific promoter may drive expression of operably linked sequences in tissues other than the target tissue.
  • a tissue-specific promoter is one that drives expression preferentially in the target tissue, but may also lead to some expression in other tissues as well.
  • a LEC 1 nucleic acid is expressed through a transposable element.
  • This allows for constitutive, yet periodic and infrequent expression of the constitutively active polypeptide.
  • tissue-specific promoters derived from viruses which can include, e.g., the tobamovirus subgenomic promoter (Kumagai (1995) Proc. Natl Acad. Sci.
  • RTBV rice tungro bacilliform virus
  • CVMV cassava vein mosaic virus
  • DNA constructs of the invention may be introduced into the genome of the desired plant host by a variety of conventional techniques.
  • the DNA construct may be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation and microinjection of plant cell protoplasts, or the DNA constructs can be introduced directly to plant tissue using ballistic methods, such as DNA particle bombardment.
  • the DNA constructs may be combined with suitable T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell DNA when the cell is infected by the bacteria.
  • Microinjection techniques are known in the art and well described in the scientific and patent literature.
  • the introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski et al. Emb ⁇ J. 3 :2717-2722 ( 1984).
  • Electroporation techniques are described in Fromm et al. Proc. Natl. Acad. Sci. USA 82:5824 ( 1985).
  • Ballistic transformation techniques are described in Klein et al. Nature 327:70-73 (1987).
  • Agrobacterium tumefaciens-mediated transformation techniques including disarming and use of binary vectors, are well described in the scientific literature. See. for example Horsch et al. Science 233:496-498 (1984), and Fraley et al. Proc. Natl. Acad. Sci. USA 80:4803 (1983).
  • Transformed plant cells which are derived by any of the above transformation techniques can be cultured to regenerate a whole plant which possesses the transformed genotype and thus the desired phenotype such as seedlessness.
  • Such regeneration techniques rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the desired nucleotide sequences.
  • Plant regeneration from cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp. 124-176, MacMillilan Publishing Company, New York, 1983; and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21 -73. CRC Press. Boca Raton, 1985. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al. Ann. Rev. of Plant Phys. 38:467-486 ( 1987).
  • the nucleic acids of the invention can be used to confer desired traits on essentially any plant.
  • the invention has use over a broad range of plants, including species from the genera Asparagus, Atropa, Avena, Brassica, Citrus, Citrullus, Capsicum, Cucumis, Cucurbita, Daucus. Fragaria, Glycine, Gossypium, Helianthus, Heterocallis, Hordeum, Hyoscyamus, Lactuca, Linum, Lolium, Lycopersicon, Malus, Manihot, Majorana.
  • Medicago Nicotiana, Oryza, Panieum, Pannesetum, Persea, Pisum, Pyrus, Prunus, Raphanus, Secale, Senecio, Sinapis, Solanum, Sorghum, Trigonella, Triticum. Vitis, Vigna, and, Zea.
  • the LECl genes of the invention are particularly useful in the production of transgenic plants in the genus Brassica. Examples include broccoli, cauliflower, brussel sprouts, canola, and the like.
  • the LEC l nucleic acids of the invention can be expressed together with other structural or regulatory genes to achieve a desired effect.
  • a cell or plant such as a transformed cell or a transgenic plant, can be transformed, engineered or bred to co-express both LEC 1 nucleotide and/or LEC 1 polypeptide, and another gene or gene product.
  • two or more LEC l nucleic acids can be co-expressed together in the same plant or cell.
  • LEC l nucleic acids of the invention when expressed in plant reproductive or vegetative tissue, can induce ectopic embryo morphogenesis.
  • a LECl nucleic acid of the invention is expressed in a sense conformation in a transgenic plant to induce the expression of ectopic embryo-like structures, as discussed above.
  • LECl is co-expressed with a gene or nucleic acid that increases reproductive tissue mass, e.g., increases fruit size, seed mass, seed protein or seed oils.
  • ADC genes such as AP2 and RAP2 genes of Arabidopsis
  • the LEC l nucleic acids of the invention are expressed in plant reproductive or vegetative cells and tissues which lack the ability to produce functional ADC genes, such as AP2 and RAP2 genes.
  • the LEC 1 nucleic acid can be expressed in an ADC "knockout" transgenic plant.
  • the LEC l nucleic acid can be expressed in a cell, tissue or plant expressing a mutant ADC nucleic acid or gene product. Expression of LEC l nucleic acid in any of these non-functioning ADC models will also produce a cell, tissue or plant with increased fruit and seed mass, greater yields of embryonic storage proteins, and the like.
  • the expression cassette is stably incorporated in transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a number of standard breeding techniques can be used. depending upon the species to be crossed.
  • Example 1 This example describes the isolation and characterization of an exemplary LEC l gene. Experimental Procedures Plant Material
  • a lee 1-2 mutant was identified from a population of Arabidopsis thaliana ecotype Wassilewskija (Ws-O) lines mutagenized with T-DNA insertions as described before (West et al., 1994).
  • the abi3-3, fus3-3 and lecl -1 mutants were generously provided by Peter MeCourt, University of Toronto and David Meinke, Oklahoma State University. Wild type plants and mutants were grown under constant light at 22°C.
  • Double mutants were constructed by intercrossing the mutant lines lec l -1. lee 1-2, abi3-3, fus3-3, and lec2. The genotype of the double mutants was verified through backcrosses with each parental line. Double mutants were those who failed to complement both parent lines. Homozygous single and double mutants were generated by germinating intact seeds or dissected mature embryos before desiccation on basal media. Isolation and Sequence analysis of Genomic and cDNA Clones
  • Genomic libraries of Ws-0 wild type plants, lec l -1 and lee 1 -2 mutants were made in GEM1 1 vector according to the instructions of the manufacturer (Promega).
  • Two silique-specific cDNA libraries (stages globular to heart and heart to young torpedo) were made in ZAPII vector (Stratagene).
  • the genomic library of lee 1 -2 was screened using right and left T-DNA specific probes according to standard techniques. About 12 clones that cosegregate with the mutation, were isolated and purified and the entire DNAs were further labeled and used as probes to screen a southern blot containing wild type and lecl -1 genomic DNA. One clone hybridized with plant DNA and was further analyzed. A 7.1 kb Xhol fragment containing the left border and the plant sequence flanking the T-DNA was subcloned into pBluescript-KS plasmid (Stratagene) to form ML7 and sequenced using a left border specific primer (5' GCATAGATGCACTCGAAATCAGCC 3').
  • the T-DNA organization was partially verified using southern analysis with T-DNA left and right borders and PBR322 probes. The results suggested that the other end of the T-DNA is also composed of left border. This was confirmed by generating a PCR fragment using a genomic plant DNA primer (LP primer 5' GCT CTA GAC ATA CAA CAC TTT TCC TTA 3') and a T-DNA left border specific primer (5' GCTTGGTAATAATTGTCATTAG 3') and sequencing.
  • the EcoRI insert of ML7 was used to screen a wild type genomic library. Two overlapping clones were purified and a 7.4 EcoRI genomic fragment from the wild type DNA region was subcloned into pBluescript-KS plasmid making WT74. This fragment was sequenced (SEQ ID NO: 4) and was used to screen lecl -1 genomic library and wild type silique-specific cDNA libraries. 8 clones from the lecl -1 genomic library were identified and analyzed by restriction mapping.
  • RNA was prepared from siliques, two days old seedlings, stems, leaves, buds and roots.
  • Poly(A)+ RNA was purified from total RNA by oligo(dT) cellulose chromatography, and two micrograms of each Poly(A)+ RNA samples were separated in 1 % denatured formal dehyde-agarose gel. Hybridizations were done under stringent conditions unless it specifies otherwise. Radioactive probes were prepared as described above. Complementation of lee 1 mutants
  • a 3.4 kb Bstyl fragment of genomic DNA (SEQ ID NO: 3) containing sequences from 1.992 kb upstream of the ORF to a region 579 bp downstream from the poly A site was subcloned into the hygromycin resistant binary vector pBIB-Hyg.
  • the LECl cDNA was placed under the control of the 35 S promoter and the ocs polyadenylation signals by inserting a PCR fragment spanning the entire coding region into the plasmid pART7. The entire regulatory fragment was then removed by digestion with Notl and transferred into the hygromycin resistant binary vector BJ49.
  • the binary vectors were introduced into the Agrobacterium strain GV3101 , and constructions were checked by re-isolation of the plasmids and restriction enzyme mapping, or by PCR. Transformation to homozygous lecl -1 and lecl -2 mutants were done using the in planta transformation procedure (Bechtold. et al., (1993). Comptes Rendus de l'Academie des Sciences Serie III Sciences de la Vie, 316: 1 194-1 199. Dry seeds from lecl mutants were selected for transformants by their ability to germinate after desiccation on plates containing 5g/ml hygromycin. The transformed plants were tested for the present of the transgene by PCR and by screening the siliques for the present of viable seeds. In Situ Hybridization
  • the two classes of mutants were crossed to lecl -1 and lecl -2 mutants to construct plants homozygous to both mutations.
  • the lec l and lec2 mutations interact synergistically. resulting in a double mutant which is arrested in a stage similar to the late heart stage, the double mutant embryo, however, is larger.
  • the lecl or lec2 and fus3-3 double mutants did not display any epistasis and the resulting embryo had an intermediate phenotype.
  • the lecl/abi3-3 double mutants and lec2/abi3-3 double mutants were ABA insensitive and had a lec-like phenotype. There was no different between double mutants that consist of either lecl-1 or lecl-2.
  • LECl Functions Early in Embryogenesis The effects of lecl is not limited to late embryogenesis, it also has a role in early embryogenesis. The embryos of the lec l/lec2 double mutants were arrested in the early stages of development, while the single mutants developed into mature embryos, suggesting that these genes act early during development.
  • the suspensor of wild type embryo consists of a single file of six to eight cells, whereas the suspensors of the mutants are often enlarged and undergo periclinal divisions.
  • Leafy cotyledon mutants exhibit suspensor anomalies at the globular or transition stage whereas wild type and abi3 mutant do not show any abnormalities. The number of anomalous suspensors increases as the embryos continue to develop.
  • the wild type suspensor cells undergo programmed cell death, but in the mutants secondary embryos often develop from the abnormal suspensors and, when rescued, give rise to twins.
  • the Organization of the LECl Locus in Wild Type Plants and lec l Mutants Two mutant alleles of the LEC l gene have been reported, lec l - 1 and lec l -2
  • lecl -1 Only one clone hybridized with Arabidopsis DNA and also gave polymorphic restriction fragment in lecl -1 .
  • the lecl -1 polymorphism resulted from a small deletion, approximate ⁇ 2 kb in length.
  • genomic wild type DNA clones and the lecl -1 genomic clones were isolated.
  • An EcoRI fragment of 7.4 kb of the genomic wild type DNA that corresponded to the polymorphic restriction fragment in lecl -1 was further analyzed and sequenced. The exact site of the deletion in lec l- 1 was identified using a PCR fragment that was generated by primers, within the expected borders of the deleted fragment, and sequencing.
  • genomic wild type fragment indeed contained the ORF of the LEC l gene
  • the clone consists of a 3395 bp BstYI restriction fragment containing the gene and the promoter region.
  • the translation start codon (ATG) of the polypeptide is at 1999 and the stop codon is at 2625 (TGA). There are no introns in the gene.
  • the transformed plants were selected on hygromycin plates and were tested to contain the wild type DNA fragment by PCR analysis. Both transgenic mutants were able to produce viable progeny, that were desiccation tolerant and did not posses trichomes on their cotyledons. We concluded that the 3.4 kb fragment can complement the lec l mutation and since there is only one ORF in the deleted 2 kb fragment in lec l - 1 we suggest that this ORF corresponds to the LEC 1 gene.
  • the LECl Gene is a Member of Gene Family
  • the cDNA contains 626 bp ORF specifying 208 amino acid protein (SEQ ID NO: l and SEQ ID NO:2).
  • the LECl cDNA was used to hybridize a DNA gel blot containing Ws-0 genomic DNA digested with three different restriction enzymes. Using low stringency hybridization we found that there is at least one more gene. This confirmed our finding of two more Arabidopsis ESTs that show homology to the LECl gene.
  • the LECl gene is Embryo Specific
  • the lecl mutants are affected mostly during embryogenesis. Rescued mutants can give rise to homozygous plants that have no obvious abnormalities other than the presence of trichomes on their cotyledons and their production of defective progeny. Therefore, we expected the LEC l gene to have a role mainly during embryogenesis and not during vegetative growth.
  • poly (A)+ RNA was isolated from siliques. seedling, roots, leaves, stems and buds of wild type plants and from siliques of lecl plants. Only one band was detected on northern blots using either the LEC 1 gene as a probe or the 7.4 kb genomic DNA fragment suggesting that there is only one gene in the genomic DNA fragment which is active transcriptionally.
  • the transcript was detected only in siliques containing young and mature embryos and was not detected in seedlings, roots, leaves, stems and buds indicating that the LEC l gene is indeed embryo specific. In addition, no RNA was detected in siliques of both alleles of lec l mutants confirming that this ORF corresponds to the LECl gene.
  • the LECl gene specifies cotyledons identity
  • e analyzed its expression by in situ hybridization.
  • the LEC l transcript was first detected in proglobular embryos. The transcript was found in all cells of the proembryo and was also found in the suspensor and the endosperm. However, from the globular stage and on it accumulates more in the outer layer of the embryo, namely the protoderm and in the outer part of the ground meristem leaving the procambium without a signal.
  • the signal was stronger in the cotyledons and the root meristem, and was more limited to the protoderm layer.
  • the signal was present throughout the embryo and at the last stage of development when the embryo is mature and filling the whole seed we could not detect the LEC l transcript. This might be due to sensitivity limitation and may imply that if the LECl transcript is expressed at that stage it is not localized in the mature embryo, but rather spread throughout the embryo.
  • the LEC l gene encodes a Homolog of CCAAT binding factor.
  • CBF CCAAT box binding factor
  • the LEC l B domain shares between 75%>-85% similarity and 55%o-63% identity with different B domains that are found in organisms ranging from yeast to human. Within this central domain, two highly conserved amino acid segments are present. Deletion and mutagenesis analysis in the CBF-A yeast homolog hap3 protein demonstrated that a short region of seven residues (42-48) (LPIANVA) is required for binding the CCAAT box, while the subunit interaction domain lies in the region between residues 69-80 (MQECVSEFISFV) (Xing et al., supra). LECl protein shares high homology to those regions.
  • the lecl mutant belongs to the leafy cotyledon class that interferes mainly with the embryo program and therefore is thought to play a central regulatory role during embryo development. It was shown before that LEC l gene activity is required to suppress germination during the maturation stage. Therefore, we analyzed the genetic interaction ol ' homozygous double mutants of the different members of the leafy cotyledon class and the abi3 mutant that has an important role during embryo maturation. All the five different combinations of the double mutants showed either an intermediate phenotype or an additive effect. No epistatic relationship among the four genes was found. These findings suggest that the different genes act in parallel genetic pathways.
  • CBF is a transcription activator that regulates constitutively expressed genes, but also participates in differential activation of developmental genes Wingender, E. ( 1993). Gene Regulation in Eukaryotes (New York: VCH Publishers).
  • CBF-A In mammalian cells, three subunits have been identified CBF-A, CBF-B and CBF-C and all of which are required for DNA binding.
  • the CBF homolog HAP activates the CYC 1 and other genes involved in the mitochondrial electron transport Johnson, et al., Proteins. Annu. Rev. Biochem. 58, 799-840. ( 1989).
  • FIAP consists of four subunits hap2, hap3. hap4 and hap5. Only hap2. 3 and 5 are required for DNA binding.
  • CBF-A. B and C show high similarity to the yeast hap3. 2 and 5, respectively. It was also reported that mammalian CBF-A and B can be functionally interchangeable with the corresponding yeast subunits (Sinha et al., supra.).
  • the LEC l gene encodes a protein that shows more then 75%) similarity to the conserved region of CBF-A.
  • CCAAT motifs are not common in plants' promoters and their role in transcription regulation is not clear.
  • maize and Brassica homologs have been identified.
  • a search of the GenBank revealed several Arabidopsis ES s that show high similarity to CBF-A, B and C. Accession numbers of CBF-A (HAP3) homologs: H37368, H76589; CBF-B (HAP2) homologs: T20769; CBF-C (HAP5) homologs: T43909. T44300.
  • LEC l is a member of a heteromeric complex that functions as a transcription factor.
  • the model suggests that LEC l acts as transcription activator to several sets of genes, which keep the embryonic program on and repress the germination process. Defective LEC l expression partially shuts down the embryonic program and as a result the cotyledons lose their embryonic characteristics and the germination program is active in the embryo.
  • Example 2 This example demonstrates that LEC 1 is sufficient to induce embryonic pathways in transgenic plants.
  • the phenotype of lecl mutants and the gene ' s expression pattern indicated that LEC l functions specifically during embryogenesis.
  • a LEC l cDNA clone under the control of the cauliflower mosaic virus 35S promoter was transferred into lec l - 1 mutant plants in planta using standard methods as described above. Viable dry seeds were obtained from lecl -1 mutants transformed with the
  • 35S/LEC1 construct was produced. However, half the seeds that germinated (12/23 ) produced seedlings with an abnormal morphology. Unlike wild type seedlings, these 35S/LEC 1 seedlings possessed cotyledons that remained fleshy and that failed to expand. Roots often did not extend or extended abnormally and sometimes greened. These seedlings occasionally produced a single pair of organs on the shoot apex at the position normal occupied by leaves. Unlike wild type leaves, these organs did not expand and did not possess trichomes. Morphologically, these leaf-like structures more closely resembled embryonic cotyledons than leaves.
  • the other 75% of seeds contained embryos with either a wild type phenotype or a phenotype intermediate between lec l mutants and wild type. Only 25% of the dry seed from this plant germinated, and all seedlings resembled the embryo-like seedlings described above. Some seedlings continued to grow and displayed a striking phenotype.
  • These 35S/LEC1 plants developed two types of structures on leaves. One type resembled embryonic cotyledons while the other looked like intact torpedo stage embryos. Thus. ectopic expression of LEC l induces the morphogenesis phase of embryo development in vegetative cells.
  • LECl The ability of LECl to induce embryonic programs of development in vegetative cells establishes the gene as a central regulator of embryogenesis. LECl is sufficient to induce both the seed maturation pathway as indicated by the induction of storage protein genes in the 35S/LEC1 seedlings. The presence of ectopic embryos on leaf surfaces and cotyledons at the position of leaves also shows that LEC l can activate the embryo morphogenesis pathway. Thus, LECl regulates both early and late embryonic processes.
  • Example 3 This example shows that LEC l is expressed in zygotes and that the promoters of the invention can therefore be used to target expression in zygotes.
  • LEC l RNA levels were analyzed in the egg apparatus of mature female gametophytes before fertilization. in zygotes after fertilization, and in very early stage embryos containing an apical cell and two to three suspensor cells. In situ hybridization experiments showed that LEC l RNA was present in zygotes and early stage embryos but was not detected in female gametophytes. These results show that the LECl promoter becomes active in the zygote. The LECl is therefore useful to target the expression of sense or antisense versions of regulatory genes or cytotoxic genes to zygotes and early stage embryos.
  • This example shows the identification of a LEC 1 homolog from Arabidopsis designated the LEAFY COTYLEDON 1 -LIKE gene.
  • a Blast search was conducted through the Arabidopsis Database (http://genome-www.stanford.edu/Arabidopsis/) using the LECl cDNA nucleotide sequence as a probe to identify homologs of the HAP3 subunit of CCAAT box binding transcription factor from Arabidopsis.
  • the Arabidopsis BAC clone, MNJ7 contains a gene, designated LECl -Like (LIE), that displays the highest amino acid sequence identity with the LECl protein of any known Arabidopsis HAP3 gene.
  • LIE LECl -Like
  • PCR Polymerase Chain Reaction
  • BAMMNJ7-5 sequence is 5'-AGGATCCATGGAACGTGGAGGCTTCCAT-3 ' with the BamHI site underlined.
  • the reverse primer, 3-MNJ7XBA sequence is 5 ' - ATCTAGATCAGTACTTATGTTGTTGAGTCG-3' with the Xbal site underlined.
  • the PCR conditions were as follows: 30 cycles of 45 seconds at 94°C. 45 seconds at 53°C, and 3 minutes at 72°C. AmphiTaq DNA polymerase (Perkin Elmer Cetus, 761 Main Ave.,
  • L1L RNA was detected only in siliques containing all three stages of embryos.
  • the L1L clone was inserted into the LECl promoter/terminator cassette within the plant transformation vector BJ49.
  • the LECl promoter/terminator cassette consists of 1992 bp of DNA 5' of the LECl translation start codon and 770 bp 3' of the LECl cDNA translation stop codon (H.S. Lee, R.W. Kwong. and J.J. Harada, unpublished results).
  • the promoter and terminator are separated by a short polylinker with Bglll and Avrll restriction endonuclease sites in which the L1L gene was inserted.
  • This construct was transferred into homozygous led -I null mutants using in planta transformation procedures with Agrobacterium tumefaciens strain GV3101 . Unlike lecl-1 mutant plants whose progeny die following desiccation, plants transformed with the LIE construct produced viable seedlings. PCR amplification experiments confirmed that the viable seedlings have the /ec7-7 mutation and the transgene. Seedlings morphologically resembled wild type rather than lecl mutant plants. These results show that the L1L gene complements the 7ec7 mutation, suggesting overlapping functions for the two genes.
  • This example shows the identification of a LECl ortholog from scarlet runner bean.
  • first strand cDNA was synthesized from EP total RNA using Superscript II RNase H- reverse transcriptase (Gibco/BRL, cat # 18064-014) in the presence of an Sfi IB-site containing oligo-dT primer (CDSIII/3' PCR primer. Clontech) and a SMART III containing an Sfi IA- site primer (Clontech). Second strand was generated by polymerase chain reaction using 5'- and 3' PCR primers (Clontech).
  • Double-stranded cDNA was digested with Sfi I restriction enzyme (New England BioLabs, cat # 123S) and then size-fractionated over a CHROMA S- 400 sepharose column (Clontech). After analyzing collected fractions on a 1 .1 %) agarose gel. four fractions containing high amount of cDNAs in a range of 0.5 kb to 4 kb were pooled and precipitated in an ethanol/salt solution at -20°C overnight. A cDNA pellet was recovered b> centrifugation and resuspended in 7 uL of sterile water. cDNA inserts were ligated to Sfi 1- digested lambda arms (lTriplEx2, Clontech). Ligation mixtures were packaged into phage heads using Gigapack III Gold Packaging Extract (Stratagene).
  • the cDNA library was converted from a lambda form to a plasmid form via
  • Cre-Lox system in vivo excision, Clontech. Colonies were picked randomly for plasmid
  • the nucleotide sequences of cDNA clones were determined using BigDye terminator, a 5'-TriplEx sequencing primer (Clontech), and the ABI Prism 377 DNA sequencer (Perkin-Elmer Applied Biosvstems). The identity of the cDNA clone was determined by BlastX and BlastN analyses.
  • SEQ ID NO:22 with high amino acid sequence identity to the Arabidopsis LEC l . especially in the conserved B domain.
  • a BlastN result indicated that this SRB cDNA sequence is more similar to the Arabidopsis L1L gene at the nucleotide level.
  • the entire sequence of the pPCEPl 12 insert was determined to be 988 bp (SEQ ID NO:21).

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Cell Biology (AREA)
  • Reproductive Health (AREA)
  • Pregnancy & Childbirth (AREA)
  • Developmental Biology & Embryology (AREA)
  • Botany (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Medicinal Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Physiology (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
PCT/US2001/005454 2000-03-01 2001-02-21 Genes du type cotyledon1 feuillu et leurs utilisations WO2001064022A2 (fr)

Priority Applications (5)

Application Number Priority Date Filing Date Title
EP01912861A EP1263280A4 (fr) 2000-03-01 2001-02-21 Genes du type cotyledon1 feuillu et leurs utilisations
AU4160001A AU4160001A (en) 2000-03-01 2001-02-21 Leafy cotyledon1 genes and their uses
AU2001241600A AU2001241600B2 (en) 2000-03-01 2001-02-21 Leafy cotyledon1 genes and their uses
CA002399886A CA2399886A1 (fr) 2000-03-01 2001-02-21 Genes du type cotyledon1 feuillu et leurs utilisations
JP2001562933A JP2004500823A (ja) 2000-03-01 2001-02-21 Leafycotyledon1遺伝子およびその使用

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/516,052 US6781035B1 (en) 1997-02-21 2000-03-01 Leafy cotyledon1 genes and their uses
US09/516,052 2000-03-01

Publications (1)

Publication Number Publication Date
WO2001064022A2 true WO2001064022A2 (fr) 2001-09-07

Family

ID=24053920

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/005454 WO2001064022A2 (fr) 2000-03-01 2001-02-21 Genes du type cotyledon1 feuillu et leurs utilisations

Country Status (5)

Country Link
EP (1) EP1263280A4 (fr)
JP (1) JP2004500823A (fr)
AU (2) AU2001241600B2 (fr)
CA (1) CA2399886A1 (fr)
WO (1) WO2001064022A2 (fr)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7868229B2 (en) 1999-03-23 2011-01-11 Mendel Biotechnology, Inc. Early flowering in genetically modified plants
WO2013096562A1 (fr) 2011-12-22 2013-06-27 E. I. Du Pont De Nemours And Company Utilisation du promoteur de saccharose synthase de soja pour augmenter la teneur en lipides des graines de plantes
US8847012B2 (en) 2007-12-05 2014-09-30 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US8847011B2 (en) 2007-12-05 2014-09-30 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
WO2015017510A1 (fr) 2013-07-31 2015-02-05 E. I. Du Pont De Nemours And Company Modification de la composition de graine de soja pour améliorer la nourriture pour animaux, les aliments et d'autres applications industrielles de produits à base de soja
US9045786B2 (en) 2008-03-04 2015-06-02 Toyota Jidosha Kabushiki Kaisha Gene that increases production of plant fat-and-oil and method for using the same
US9169488B2 (en) 2009-06-04 2015-10-27 Toyota Jidosha Kabushiki Kaisha Gene capable of improving material productivity in seed and method for use thereof
WO2015164457A1 (fr) 2014-04-22 2015-10-29 E. I. Du Pont De Nemours And Company Gènes d'anhydrase carbonique plastidiale pour l'augmentation d'huile dans des graines présentant une expression augmentée de dgat
US9303265B2 (en) 2009-06-04 2016-04-05 Toyota Jidosha Kabushiki Kaisha Gene for increasing plant weight and method for using the same
US9309531B2 (en) 2009-06-04 2016-04-12 Toyota Jidosha Kabushiki Kaisha Plant with reduced protein productivity in seeds, and method for producing same

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6320102B1 (en) * 1997-02-21 2001-11-20 The Regents Of The University Of California Leafy cotyledon1 genes and their uses

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP1263280A4 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7868229B2 (en) 1999-03-23 2011-01-11 Mendel Biotechnology, Inc. Early flowering in genetically modified plants
US9062318B2 (en) 2007-12-05 2015-06-23 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US8847011B2 (en) 2007-12-05 2014-09-30 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US9068193B2 (en) 2007-12-05 2015-06-30 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US8847012B2 (en) 2007-12-05 2014-09-30 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US9012727B2 (en) 2007-12-05 2015-04-21 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US9012726B2 (en) 2007-12-05 2015-04-21 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US9018446B2 (en) 2007-12-05 2015-04-28 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US9018450B2 (en) 2007-12-05 2015-04-28 Toyota Jidosha Kabushiki Kaisha Genes that increase plant oil and method for using the same
US9045786B2 (en) 2008-03-04 2015-06-02 Toyota Jidosha Kabushiki Kaisha Gene that increases production of plant fat-and-oil and method for using the same
US9816099B2 (en) 2009-06-04 2017-11-14 Toyota Jidosha Kabushiki Kaisha Gene for increasing plant weight and method for using the same
US10000764B2 (en) 2009-06-04 2018-06-19 Toyota Jidosha Kabushiki Kaisha Gene for increasing plant weight and method for using the same
US9309529B2 (en) 2009-06-04 2016-04-12 Toyota Jidosha Kabushiki Kaisha Gene capable of improving material productivity in seed and method for use thereof
US9303265B2 (en) 2009-06-04 2016-04-05 Toyota Jidosha Kabushiki Kaisha Gene for increasing plant weight and method for using the same
US9309530B2 (en) 2009-06-04 2016-04-12 Toyota Jidosha Kabushiki Kaisha Gene capable of improving material productivity in seed and method for use thereof
US9309531B2 (en) 2009-06-04 2016-04-12 Toyota Jidosha Kabushiki Kaisha Plant with reduced protein productivity in seeds, and method for producing same
US9840717B2 (en) 2009-06-04 2017-12-12 Toyota Jidosha Kabushiki Kaisha Plant with reduced protein productivity in seeds and method for producing same
US9856488B2 (en) 2009-06-04 2018-01-02 Toyota Jidosha Kabushiki Kaisha Plant with reduced protein productivity in seeds and method for producing same
US9970020B2 (en) 2009-06-04 2018-05-15 Toyota Jidosha Kabushiki Kaisha Plant with reduced protein productivity in seeds and method for producing same
US9169488B2 (en) 2009-06-04 2015-10-27 Toyota Jidosha Kabushiki Kaisha Gene capable of improving material productivity in seed and method for use thereof
WO2013096562A1 (fr) 2011-12-22 2013-06-27 E. I. Du Pont De Nemours And Company Utilisation du promoteur de saccharose synthase de soja pour augmenter la teneur en lipides des graines de plantes
WO2015017510A1 (fr) 2013-07-31 2015-02-05 E. I. Du Pont De Nemours And Company Modification de la composition de graine de soja pour améliorer la nourriture pour animaux, les aliments et d'autres applications industrielles de produits à base de soja
WO2015164457A1 (fr) 2014-04-22 2015-10-29 E. I. Du Pont De Nemours And Company Gènes d'anhydrase carbonique plastidiale pour l'augmentation d'huile dans des graines présentant une expression augmentée de dgat

Also Published As

Publication number Publication date
CA2399886A1 (fr) 2001-09-07
JP2004500823A (ja) 2004-01-15
AU4160001A (en) 2001-09-12
EP1263280A4 (fr) 2005-07-27
AU2001241600B2 (en) 2007-02-01
EP1263280A1 (fr) 2002-12-11

Similar Documents

Publication Publication Date Title
US6235975B1 (en) Leafy cotyledon1 genes and methods of modulating embryo development in transgenic plants
US6781035B1 (en) Leafy cotyledon1 genes and their uses
US9200294B2 (en) Brassica indehiscent1sequences
US6320102B1 (en) Leafy cotyledon1 genes and their uses
AU2007200683B2 (en) Control of fruit dehiscence in arabidopsis by indehiscent1 genes
EP1263782B1 (fr) Genes de cotyledon2 feuillus et leurs utilisations
US8704044B2 (en) Nucleic acids that encode ERS1 from maize and their uses
US7612253B2 (en) Methods of modulating cytokinin related processes in a plant using B3 domain proteins
AU2001245729A1 (en) Leafy cotyledon2 genes and their uses
AU2001241600B2 (en) Leafy cotyledon1 genes and their uses
AU2001241600A1 (en) Leafy cotyledon1 genes and their uses
US20050120417A1 (en) Control of fruit dehiscence in plants by indehiscent1 genes
US20060041953A1 (en) Control of fruit dehiscence in Arabidopsis by indehiscenti genes
US7951993B2 (en) Increasing grain yield through targeted reduction in ethylene signaling
AU2001257024A1 (en) Control of fruit dehiscence in arabidopsis by indehiscent1 genes

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2399886

Country of ref document: CA

ENP Entry into the national phase in:

Ref country code: JP

Ref document number: 2001 562933

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 2001912861

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2001241600

Country of ref document: AU

WWP Wipo information: published in national office

Ref document number: 2001912861

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642