WO2006094976A2 - Expression enhancing intron sequences - Google Patents
Expression enhancing intron sequences
- Publication number
- WO2006094976A2 (PCT/EP2006/060513)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- intron
- seq
- plant
- expression
- Prior art date
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/8223—Vegetative tissue-specific promoters
- C12N15/8225—Leaf-specific, e.g. including petioles, stomata
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/8223—Vegetative tissue-specific promoters
- C12N15/8227—Root-specific
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/823—Reproductive tissue-specific promoters
- C12N15/8234—Seed-specific, e.g. embryo, endosperm
Definitions
- the invention relates to methods for the identification and use of introns with gene expression enhancing properties.
- the teaching of this invention enables the identification of introns causing intron-mediated enhancement (IME) of gene expression.
- the invention furthermore relates to recombinant expression constructs and vectors comprising said IME-introns operably linked with a promoter sequence and a nucleic acid sequence.
- the present invention also relates to transgenic plants and plant cells transformed with these recombinant expression constructs or vectors, to cultures, parts or propagation material derived therefrom, and to the use of same for the preparation of foodstuffs, animal feeds, seed, pharmaceuticals or fine chemicals, to improve plant biomass or yield, or to provide desirable phenotypes.
- the aim of plant biotechnology is the generation of plants with advantageous novel properties, such as pest and disease resistance, resistance to environmental stress (e.g., drought), improved qualities (e.g., high yield), or for the production of certain chemicals or pharmaceuticals.
- Appropriate gene expression rates play an important role in order to obtain the desired phenotypes.
- the gene expression rate is mainly modulated by the promoter, additional DNA sequences located in the 5' untranscribed and 5' untranslated regions, and the terminator sequences of a given gene. Promoters are the portion of DNA sequence located at the 5' end of a gene which contains signals for RNA polymerases to begin transcription so that protein synthesis can then proceed.
- Regulatory DNA sequences positioned in the 5' untranscribed region modulate gene expression in response to specific biotic (e.g. pathogen infection) or abiotic (e.g. salt-, heat-, drought-stress) stimuli.
- other so-called "enhancer" sequences have been identified that elevate the expression level of nearby genes in a position- and orientation-independent manner.
- "intron mediated enhancement" (IME) of gene expression (Mascarenhas et al. (1990) Plant Mol. Biol. 15:913-920). Introns known to stimulate expression in plants have been identified in maize genes (e.g. tubA1, Adh1, Sh1, Ubi1 (Jeon et al. (2000) Plant Physiol.
- Enhancement of gene expression by introns is not a general phenomenon because some intron insertions into recombinant expression cassettes fail to enhance expression (e.g. introns from dicot genes (rbcS gene from pea, phaseolin gene from bean and the stls-1 gene from Solanum tuberosum) and introns from maize genes (the ninth intron of the adh1 gene, the first intron of the hsp81 gene)) (Chee et al. (1986) Gene 41:47-57; Kuhlemeier et al. (1988) Mol Gen Genet 212:405-411; Mascarenhas et al.
- control elements including promoters, regulatory sequences (e.g., inducible elements, enhancers) or intron sequences that have an impact on gene expression rates. It is therefore an objective of the present invention to provide a highly reproducible and reliable method for the identification of introns with expression enhancing properties.
- a first subject matter of the invention therefore relates to a method for identifying an intron with expression enhancing properties in plants comprising selecting an intron from a plant genome, wherein said intron is characterized by at least the following features
- the invention relates to a method for enriching the number of introns with expression enhancing properties in plants in a population of plant introns to a percentage of at least 50% of said population, said method comprising selecting introns from said population, wherein said introns are characterized by at least the following features I) an intron length shorter than 1,000 base pairs, and
- V) an adenine plus thymine content of at least 40% over 100 nucleotides downstream from the 5' splice site
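The two selection features that survive in this excerpt (feature I, an intron length shorter than 1,000 base pairs, and feature V, an adenine plus thymine content of at least 40% over 100 nucleotides downstream from the 5' splice site) can be sketched as a simple filter. This is a minimal illustration, not the patent's own implementation: the function names are hypothetical, the remaining features are not reproduced in this excerpt and are therefore not checked, and the intron is assumed to be given 5'->3' starting with its first base after the 5' splice site.

```python
def at_fraction(seq: str) -> float:
    """Fraction of A and T (or U) bases in seq; 0.0 for an empty string."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(seq.count(base) for base in "ATU") / len(seq)


def passes_features_i_and_v(intron: str, max_len: int = 1000,
                            window: int = 100, min_at: float = 0.40) -> bool:
    """Check only feature I (length) and feature V (A+T content).

    Assumption: `intron` is written 5'->3' and its first base is the first
    intron base after the 5' splice site. Introns shorter than `window`
    are scored over their full length (also an assumption).
    """
    if len(intron) >= max_len:           # feature I: shorter than 1,000 bp
        return False
    head = intron[:window]               # first 100 nt downstream of the 5' splice site
    return at_fraction(head) >= min_at   # feature V: A+T content of at least 40%
```

A candidate that passes this filter would still need to satisfy the other listed features and the experimental evaluation described later in the text.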
- the population of plant introns chosen for the enrichment of introns with gene expression enhancing properties in plants comprises substantially all introns of a plant genome represented in a genomic DNA sequence database or a plant genomic DNA library.
- the intron with gene expression enhancing properties in plants ("IME-intron") is selected by the method of the invention for identifying IME-introns or the method of the invention for enriching the number of IME-introns in a population of plant introns.
- said intron is selected from the group consisting of introns located between two protein encoding exons or introns located within the 5' untranslated region of the corresponding gene.
- the IME-intron is identified or enriched by one of the inventive methods from a group or population of genes representing the 10% fraction of genes with the highest expression rate in a gene expression analysis experiment performed using a plant cell, plant tissue or a whole plant.
- the invention furthermore relates to a method wherein the gene sequence information used for the identification or enrichment of IME-introns is present in a DNA sequence database and the selection steps for identifying or enriching said introns are performed using an automated process, preferably by using a computer device and an algorithm that defines the instructions needed for accomplishing the selection steps for identifying or enriching said introns.
- the invention relates to a computer algorithm that defines the instructions needed for accomplishing the selection steps for identifying or enriching IME-introns from a plant genome or a population of introns selected from the group consisting of introns located between two protein encoding exons, and/or introns located within the 5' untranslated region of the corresponding gene and/or introns located in the DNA sequences of genes representing the 10% fraction of genes with the highest expression rate in a gene expression analysis experiment performed using a plant cell, plant tissue and/or a whole plant.
- the invention also relates to the computer device or data storage device comprising an algorithm as described above.
- the invention relates to methods for isolating, providing or producing IME-introns comprising the steps of performing an identification or enrichment of IME-introns as described above and providing the sequence information of said IME-introns identified or enriched, and providing the physical nucleotide sequence of said identified or enriched introns and evaluating the gene expression enhancing properties of the isolated introns in an in vivo or in vitro expression experiment, and isolating the IME-introns from the population of introns tested in the in vivo or in vitro expression experiment.
- the evaluation of the gene expression enhancing properties of the IME-intron is done in a plant cell, wherein the IME-intron enhances the expression of a given nucleic acid at least twofold.
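The "at least twofold" criterion amounts to comparing expression measured with the candidate intron against expression from an otherwise identical construct without it. The sketch below assumes two hypothetical reporter measurements in the same arbitrary units; the function names and the choice to raise an error on a non-positive control value are illustrative assumptions.

```python
def fold_enhancement(with_intron: float, without_intron: float) -> float:
    """Ratio of reporter expression with the intron to expression without it."""
    if without_intron <= 0:
        raise ValueError("control expression must be positive")
    return with_intron / without_intron


def meets_twofold_criterion(with_intron: float, without_intron: float,
                            threshold: float = 2.0) -> bool:
    """True if expression is enhanced at least `threshold`-fold (default: twofold)."""
    return fold_enhancement(with_intron, without_intron) >= threshold
```

For example, hypothetical readings of 300 units with the intron and 100 units without it give a threefold enhancement, which meets the criterion.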
- An additional subject matter of the invention relates to a recombinant DNA expression construct comprising at least one promoter sequence functioning in plant cells, at least one nucleic acid sequence and at least one intron selected from the group consisting of the sequences described by SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22, and functional equivalents thereof, wherein said promoter sequence and at least one of said intron sequences are functionally linked to said nucleic acid sequence and wherein said intron is heterologous to said nucleic acid sequence or to said promoter sequence.
- the invention relates to recombinant expression constructs comprising at least one promoter sequence functioning in plant cells, at least one nucleic acid sequence and at least one functional equivalent of an intron described by any of sequences SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22, wherein said functional equivalent comprises the functional elements of an intron and is characterized by a) a sequence having at least 50 consecutive base pairs of the intron sequence described by any of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22, or b) having an identity of at least 80% over a sequence of at least 95 consecutive nucleic acid base pairs to a sequence described by any of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22, or c) hybridizing under high-stringency conditions with a nucleic acid fragment of at least 50 consecutive base pairs of a nucleic acid molecule described
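Criterion a) above (at least 50 consecutive base pairs shared with a reference intron) can be checked with a plain substring comparison. The following is a rough sketch under stated assumptions: the function name is hypothetical, only the given strand is compared, and criteria b) (percent identity) and c) (hybridization) are not covered. The sequences used with it would be placeholders supplied by the reader, not the SEQ ID NOs of the patent.

```python
def shares_consecutive_run(candidate: str, reference: str, min_run: int = 50) -> bool:
    """True if `candidate` contains at least `min_run` consecutive bases of `reference`.

    Any shared run of min_run or more bases necessarily contains a shared
    window of exactly min_run bases, so comparing fixed-size windows suffices.
    """
    candidate, reference = candidate.upper(), reference.upper()
    if len(candidate) < min_run or len(reference) < min_run:
        return False
    # All windows of length min_run found in the reference sequence.
    windows = {reference[i:i + min_run]
               for i in range(len(reference) - min_run + 1)}
    return any(candidate[i:i + min_run] in windows
               for i in range(len(candidate) - min_run + 1))
```

The set of reference windows makes the lookup fast for short sequences such as introns; a genome-scale comparison would call for a dedicated alignment tool instead.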
- the recombinant DNA expression construct of the invention further contains one or more additional regulatory sequences functionally linked to the promoter.
- Those regulatory sequences can be selected from the group consisting of heat shock responsive-, anaerobic responsive-, pathogen responsive-, drought responsive-, low temperature responsive-, ABA responsive-elements, 5' untranslated gene region, 3' untranslated gene region, transcription terminators, polyadenylation signals and enhancers.
- the nucleic acid sequence of the inventive recombinant DNA expression construct may result in the expression of a protein and/or sense, antisense or double-stranded RNA encoded by said nucleic acid sequence.
- the nucleotide sequence encoding the transgenic expression construct of the invention is double-stranded. In yet another embodiment, the nucleotide sequence encoding the transgenic expression construct of the invention is single-stranded.
- the recombinant expression construct comprises a nucleic acid sequence encoding for a selectable marker protein, a screenable marker protein, an anabolic active protein, a catabolic active protein, a biotic or abiotic stress resistance protein, a male sterility protein or a protein affecting plant agronomic characteristics.
- the invention relates furthermore to vectors containing a transgenic expression construct of the invention. Additionally, the invention relates to transgenic cells or transgenic non-human organisms like bacteria, fungi, yeasts or plants comprising an expression vector containing a transgenic expression construct of the invention.
- the transgenic cell or transgenic non-human organism transformed with an expression construct of the invention is a monocotyledonous plant or is derived from such a plant.
- the monocotyledonous plant is selected from the group consisting of the genera Hordeum, Avena, Secale, Triticum, Sorghum, Zea, Saccharum, and Oryza.
- further embodiments of the invention relate to cell cultures, parts or propagation material derived from non-human organisms like bacteria, fungi, yeasts and/or plants, preferably monocotyledonous plants, most preferably plants selected from the group consisting of the genera Hordeum, Avena, Secale, Triticum, Sorghum, Zea, Saccharum, and Oryza, transformed with the inventive vectors or containing the inventive recombinant expression constructs.
- the invention furthermore relates to a method for providing an expression cassette for enhanced expression of a nucleic acid sequence in a plant or a plant cell, comprising the step of functionally linking at least one sequence selected from the group consisting of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22 to said nucleic acid sequence.
- the invention further relates to a method for enhancing the expression of a nucleic acid sequence in a plant or a plant cell, comprising functionally linking at least one sequence selected from the group consisting of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22 to said nucleic acid sequence.
- An additional embodiment of the invention relates to a method a) for providing an expression cassette for enhanced expression of a nucleic acid sequence in a plant or a plant cell, or b) for enhancing the expression of a nucleic acid sequence in a plant or a plant cell, said method comprising functionally linking at least one sequence selected from the group consisting of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22 to said nucleic acid sequence, wherein furthermore a promoter sequence functional in plants is linked to said nucleic acid sequence.
- At least one sequence selected from the group consisting of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22 is linked to a nucleic acid sequence by insertion into the plant genome via homologous recombination.
- said homologous recombination comprises at least the steps of a) providing in vivo or in vitro a DNA construct comprising said intron flanked by sequences ("recombination substrate") allowing homologous recombination into a pre-existing expression cassette between the promoter and the nucleic acid of said expression cassette, and b) transforming a recipient plant cell comprising said cassette of step a) and regenerating a transgenic plant, wherein said intron has been inserted into the genome of said plant.
- the site of integration into the genome of said plant is determined by the DNA sequence of the recombination substrate of step a), wherein said sequence shares sufficient homology (as defined herein) with said genomic target DNA sequence to allow sequence-specific integration via homologous recombination at said genomic target DNA locus.
- said recipient plant or plant cell is a monocotyledonous plant or plant cell, more preferably a plant or plant cell selected from the group consisting of the genera Hordeum, Avena, Secale, Triticum, Sorghum, Zea, Saccharum, and Oryza, most preferably a maize plant.
- the nucleic acid sequence to which one of the inventive introns is functionally linked encodes for a selectable marker protein, a screenable marker protein, an anabolic active protein, a catabolic active protein, a biotic or abiotic stress resistance protein, a male sterility protein or a protein affecting plant agronomic characteristics and/or a sense, antisense, or double-stranded RNA.
- the invention relates to the use of a transgenic organism of the invention or of cell cultures, parts or transgenic propagation material derived therefrom for the production of foodstuffs, animal feeds, seeds, pharmaceuticals or fine chemicals.
- the invention furthermore relates to a recombinant DNA expression construct comprising a) at least one promoter sequence functioning in plants or plant cells, and b) at least one intron selected from the group of introns with expression enhancing properties in plants or plant cells characterized by at least the following features
- This vector comprises the maize ubiquitin promoter, followed by the BPSI.1 intron, then the GUSint ORF (including the potato invertase [PIV]2 intron to prevent bacterial expression), followed by the nopaline synthase (NOS) terminator.
- This vector contains the attL1 and attL2 sites to make it compatible with modification via the Gateway® cloning technology from Invitrogen™.
- This vector is based on the pUC based expression vector pBPSMM267.
- the XmaI-RsrII digested BPSI.1 PCR product was ligated into the XmaI-RsrII digested pBPSMM267 to create pBPSMM291.
- the vectors pBPSMM293, pBPSMM294 and pBPSMM295 have been created accordingly (see table 6 and 1.6.1).
- the expression vector pBPSMM305 comprises the maize lactate dehydrogenase (LDH) promoter without intron driving expression of the GUSint ORF (including the potato invertase [PIV]2 intron to prevent bacterial expression), followed by the NOS terminator.
- This vector has been used to create the pUC based expression vectors pBPSJB041, pBPSJB042, pBPSJB043, pBPSJB044, pBPSJB045, pBPSJB046 and pBPSJB050 (see examples 2.3).
- Fig. 3 Map of pBPSMM350 (SEQ ID NO:111):
- the vector pBPSMM350 comprises the maize ubiquitin promoter, followed by the BPSI.1 intron, then the GUSint ORF (including the potato invertase [PIV]2 intron to prevent bacterial expression), followed by the nopaline synthase (NOS) terminator.
- the expression cassette has been transferred from the vector pBPSMM291 using the Gateway® cloning technology from Invitrogen™.
- the vectors pBPSMM353, pBPSMM312 and pBPSMM310 have been created accordingly
- the vector pBPSLM139 comprises the selectable marker expression cassette.
- PmeI/PacI fragments have been isolated from the vectors pBPSJB-042, -043, -044, -045, -046 and -050 and cloned into the PmeI-PacI digested pBPSLM130 (see examples 2.3 and 2.4)
- Fig. 5a-f Computer algorithm for retrieving sequence information from an NCBI GenBank file.
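The algorithm of Fig. 5 itself is not reproduced in this excerpt. As a loosely related illustration only, the sketch below shows one way intron coordinates could be derived from the exon ranges of a GenBank feature location such as `join(1..120,201..350,501..600)`. The function names are hypothetical, strand information (`complement(...)`) is ignored, and single-base exon positions are not handled.

```python
import re


def exon_ranges(location: str):
    """Extract (start, end) exon ranges (1-based, inclusive) from a location string."""
    return [(int(start), int(end))
            for start, end in re.findall(r"(\d+)\.\.(\d+)", location)]


def intron_ranges(location: str):
    """Return the gaps between consecutive exon ranges as intron coordinates.

    Assumption: exons do not overlap; strand is ignored, so coordinates
    always refer to the strand as numbered in the record.
    """
    exons = sorted(exon_ranges(location))
    return [(prev_end + 1, next_start - 1)
            for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
            if next_start - prev_end > 1]
```

Given the coordinates, the intron sequence can be sliced from the record's sequence and passed to whatever selection criteria are being applied.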
- Fig. 6 Transgenic plants containing promoter constructs with BPSI.1 intron (all but pBPSLM229) or BPSI.5 intron (only pBPSLM229) were tested for GUS expression at 5-leaf (A), flowering (B) and seed set (C) stages. Shown are examples of typical staining patterns obtained from at least 15 independent events. All samples were stained for 16 hours in GUS solution. Promoters in the constructs are: rice chloroplast protein 12 (Os.CP12; pBPSMM355), the maize hydroxyproline-rich glycoprotein (Zm.HRGP; pBPSMM370), the rice p-caffeoyl-CoA 3-O-methyltransferase (Os.CCoAMT1; pBPSMM358), the maize Globulin-
- the term “about” is used herein to mean approximately, roughly, around, or in the region of. When the term “about” is used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. In general, the term “about” is used herein to modify a numerical value above and below the stated value by a variance of 20 percent, preferably 10 percent up or down (higher or lower). As used herein, the word “or” means any one member of a particular list.
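The numerical reading of "about" given above (a variance of 20 percent, preferably 10 percent, above and below the stated value) can be written out as a small calculation. This is only an illustrative sketch; the function names are hypothetical, and applying the variance to each boundary of a range separately is an assumption about how the extension is meant.

```python
def about_value(value: float, variance: float = 0.20):
    """Return (low, high) bounds for 'about value' with the given variance."""
    delta = abs(value) * variance
    return value - delta, value + delta


def about_interval(low: float, high: float, variance: float = 0.20):
    """Extend a numerical range below its lower and above its upper boundary."""
    return about_value(low, variance)[0], about_value(high, variance)[1]
```

With the default 20 percent variance, "about 100" spans 80 to 120; with the preferred 10 percent, "about 50" spans 45 to 55.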
- Agrobacterium refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall.
- the term "Agrobacterium" includes, but is not limited to, the strains Agrobacterium tumefaciens (which typically causes crown gall in infected plants) and Agrobacterium rhizogenes (which causes hairy root disease in infected host plants). Infection of a plant cell with Agrobacterium generally results in the production of opines (e.g., nopaline, agropine, octopine etc.) by the infected cell.
- Agrobacterium strains which cause production of nopaline are referred to as "nopaline-type" Agrobacteria
- Agrobacterium strains which cause production of octopine (e.g., strain LBA4404, Ach5, B6) are referred to as "octopine-type" Agrobacteria
- Agrobacterium strains which cause production of agropine (e.g., strain EHA105, EHA101, A281) are referred to as "agropine-type" Agrobacteria
- Algorithm refers to the way computers process information, because a computer program is essentially an algorithm that tells the computer what specific steps to perform (in what specific order) in order to carry out a specified task, such as identification of coding regions of a set of genes.
- an algorithm can be considered to be any sequence of operations that can be performed by a computer system.
- Typically when an algorithm is associated with processing information, data is read from an input source or device, written to an output sink or device, and/or stored for further use.
- the algorithm must be rigorously defined: specified in the way it applies in all possible circumstances that could arise.
- any conditional steps must be systematically dealt with, case-by-case; the criteria for each case must be clear (and computable). Because an algorithm is a precise list of precise steps, the order of computation will almost always be critical to the functioning of the algorithm. Instructions are usually assumed to be listed explicitly, and are described as starting 'from the top' and going 'down to the bottom', an idea that is described more formally by flow of control.
- a script is a computer program that automates the sort of task that a user might otherwise do interactively at the keyboard. Languages that are largely used to write such scripts are called scripting languages.
- scripting languages are computer programming languages designed for "scripting" the operation of a computer. Early script languages were often called batch languages or job control languages.
- examples of script languages are: ACS, ActionScript, Active Server Pages (ASP), AppleScript, Awk, BeanShell (scripting for Java), bash, Brain, CobolScript, csh, ColdFusion, Dylan, Escapade (server side scripting), Euphoria, Groovy, Guile, Haskell, HyperTalk, ICI, IRC script, JavaScript, mIRC script, MS-DOS batch, Nwscript, Perl, PHP, Pike, ScriptBasic.
- Antisense is understood to mean a nucleic acid having a sequence complementary to a target sequence, for example a messenger RNA (mRNA)
- the terms “complementary” or “complementarity” are used in reference to nucleotide sequences related by the base-pairing rules.
- the sequence 5'-AGT-3' is complementary to the sequence 5'-ACT-3'.
- Complementarity can be "partial" or "total."
- Partial complementarity is where one or more nucleic acid bases is not matched according to the base pairing rules.
- Total or “complete” complementarity between nucleic acids is where each and every nucleic acid base is matched with another base under the base pairing rules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
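The base-pairing rules and the distinction between partial and total complementarity can be illustrated in a few lines. The sketch below assumes DNA sequences written 5'->3' and of equal length; the function names and the scoring as a simple fraction of matched positions are illustrative choices, not definitions taken from the text.

```python
_PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the strand that pairs with `seq`, also written 5'->3'."""
    return "".join(_PAIR[base] for base in reversed(seq.upper()))


def complementarity(strand_a: str, strand_b: str) -> float:
    """Fraction of positions that pair when two 5'->3' strands align antiparallel.

    1.0 corresponds to total complementarity; values below 1.0 are partial.
    """
    if not strand_a or len(strand_a) != len(strand_b):
        raise ValueError("strands must be non-empty and of equal length")
    partner = reverse_complement(strand_b)
    matches = sum(x == y for x, y in zip(strand_a.upper(), partner))
    return matches / len(strand_a)
```

Applied to the example in the text, `reverse_complement("AGT")` yields `"ACT"`, and `complementarity("AGT", "ACT")` is 1.0, i.e. total complementarity.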
- Sense is understood to mean a nucleic acid having a sequence that is homologous or identical to a target sequence, for example a sequence which is bound by a protein factor of the spliceosome.
- Bombarding refers to the process of accelerating particles (microprojectiles) towards a target biological sample (e.g., cell, tissue, etc.) to effect wounding of the cell membrane of a cell in the target biological sample and/or entry of the particles into the target biological sample.
- Methods for biolistic bombardment are known in the art (e.g., US 5,584,807, the contents of which are herein incorporated by reference), and are commercially available (e.g., the helium gas-driven microprojectile accelerator (PDS-1000/He) (BioRad)).
- Cell refers to a single cell.
- the term "cells" refers to a population of cells.
- the population may be a pure population comprising one cell type. Likewise, the population may comprise more than one cell type. In the present invention, there is no limit on the number of cell types that a cell population may comprise.
- the cells may be synchronized or not synchronized; preferably the cells are synchronized.
- Chromosomal DNA or chromosomal DNA-sequence is to be understood as the genomic DNA of the cellular nucleus independent from the cell cycle status. Chromosomal DNA might therefore be organized in chromosomes or chromatids, they might be condensed or uncoiled. An insertion into the chromosomal DNA can be demonstrated and analyzed by various methods known in the art like e.g., polymerase chain reaction (PCR) analysis, Southern blot analysis, fluorescence in situ hybridization (FISH), and in situ PCR.
- Coding region or coding sequence when used in reference to a gene refers to the nucleotide sequences which encode the amino acids found in the nascent polypeptide as a result of translation of a mRNA molecule.
- the coding region is bounded, in eucaryotes, on the 5'-side by the nucleotide triplet "ATG" which encodes the initiator methionine and on the 3'-side by one of the three triplets, which specify stop codons (i.e., TAA, TAG, TGA).
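The boundary rule just described (an initiator "ATG" on the 5' side, one of the stop triplets TAA, TAG or TGA on the 3' side) can be sketched as a scan over an already spliced sequence. This is a simplified illustration with a hypothetical function name: it returns the first ATG-initiated reading frame that ends in an in-frame stop codon, and does not model splicing, alternative start sites or the reverse strand.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_coding_region(seq: str):
    """Return the first ATG...stop stretch (stop codon included), or None.

    Assumption: `seq` is a spliced (intron-free) sequence in the DNA
    alphabet, read 5'->3'.
    """
    seq = seq.upper()
    start = seq.find("ATG")
    while start != -1:
        # Walk codon by codon in the frame set by this ATG.
        for i in range(start + 3, len(seq) - 2, 3):
            if seq[i:i + 3] in STOP_CODONS:
                return seq[start:i + 3]
        start = seq.find("ATG", start + 1)
    return None
```

Note that the "TGA" inside "ATGA" is out of frame relative to the ATG and is therefore not treated as a stop.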
- the complement of a nucleic acid sequence refers to a nucleotide sequence whose nucleic acids show total complementarity to the nucleic acids of the nucleic acid sequence.
- Decile when used in connection with statistical data is any of the 9 values that divide sorted data into 10 equal parts, so that each part represents 1/10th of the sample or population.
- the 1st decile cuts off lowest 10% of data
- the 9th decile cuts off lowest 90% or the highest 10% of data.
- a percentile is any of the 99 values that divide the sorted data into 100 equal parts, so that each part represents 1/100th of the sample or population.
- the 1st percentile cuts off lowest 1% of data
- the 98th percentile cuts off lowest 98% of data
- the 25th percentile cuts off lowest 25% of data.
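Decile and percentile cut points of this kind can be computed with Python's standard library, for example to pick out the 10% fraction of genes with the highest expression rate mentioned earlier in the text. The sketch below is one possible reading: the function names are hypothetical, and the exact cut values depend on the interpolation method (here the default of `statistics.quantiles`).

```python
import statistics


def decile_cut_points(values):
    """Return the 9 cut points that split the data into 10 equal parts."""
    return statistics.quantiles(values, n=10)


def percentile_cut_points(values):
    """Return the 99 cut points that split the data into 100 equal parts."""
    return statistics.quantiles(values, n=100)


def above_ninth_decile(values):
    """Values above the 9th decile, i.e. roughly the highest 10% of the data."""
    ninth = decile_cut_points(values)[8]
    return [v for v in values if v > ninth]
```

With 100 hypothetical expression values 1 to 100, `above_ninth_decile` returns the ten largest values.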
- DNA databases: in the field of bioinformatics, a DNA sequence database is a large collection of DNA sequences stored on a computer.
- a database can include sequences from only one organism, or it can include sequences from all organisms whose DNA has been sequenced.
- Enrichment or enriching when used in connection with the selection of inventive introns refers to an increase in the success rate of identifying introns with gene expression enhancing properties within a population of introns (e.g. a population of introns representing all introns of a plant genome present in a genomic DNA sequence database). The enrichment is achieved by reducing the number of candidate introns by using the inventive method and the inventive selection criteria.
- if the success rate of identifying an intron with expression enhancing properties from a given population of introns (using the herein described methods for measuring gene expression enhancement) is one out of ten analyzed introns, enrichment has to be understood as an increase in the number of identified introns with gene expression enhancing properties (using the inventive method) to at least five out of ten analyzed introns. Therefore, the number of introns needed to be analyzed in order to identify one inventive intron is reduced to two introns by using the inventive method as a preselection or filtering process.
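The arithmetic behind this example is that the expected number of introns to analyze per hit is the reciprocal of the success rate. The short sketch below reproduces the numbers from the text with exact fractions; the function names are hypothetical.

```python
from fractions import Fraction


def introns_per_hit(success_rate: Fraction) -> Fraction:
    """Expected number of introns to analyze to find one IME-intron."""
    if success_rate <= 0:
        raise ValueError("success rate must be positive")
    return 1 / success_rate


def fold_enrichment(rate_before: Fraction, rate_after: Fraction) -> Fraction:
    """Factor by which the success rate increases after preselection."""
    return rate_after / rate_before
```

A success rate of one in ten corresponds to ten introns per hit, while five in ten corresponds to two, which is a fivefold enrichment.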
- Evaluation of the expression enhancing properties can be done using methods known in the art.
- a candidate intron sequence whose gene expression enhancing effect is to be determined can be inserted into the 5' UTR of a nucleic acid sequence encoding for a reporter gene (e.g., a visible marker protein, a selectable marker protein) under control of an appropriate promoter active in plants or plant cells to generate a reporter vector.
- the reporter vector and an identical control reporter vector lacking the candidate intron can be introduced into a plant tissue using methods described herein, and the expression level of the reporter gene, depending on the presence of the candidate intron, can be measured and compared (e.g., by detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by the reporter gene).
- An intron with expression enhancing properties will result in a higher expression rate than a reference value obtained with an identical control reporter vector lacking the candidate intron under otherwise unchanged conditions.
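One way to express the comparison described above is as a percentage increase over the intron-less control. The sketch below is illustrative only: the function names and the reporter readings are hypothetical, and the source does not prescribe a particular formula or unit.

```python
def percent_increase(with_intron: float, control: float) -> float:
    """Percentage by which the reporter signal exceeds the control signal."""
    if control <= 0:
        raise ValueError("control signal must be positive")
    return (with_intron - control) / control * 100.0


def is_enhancing(with_intron: float, control: float) -> bool:
    """True if the candidate intron raised expression above the control."""
    return with_intron > control


# Hypothetical reporter readings in arbitrary units (not measured data).
control_signal = 120.0
intron_signal = 420.0
increase = percent_increase(intron_signal, control_signal)
```

Both vectors are assumed to have been measured under otherwise unchanged conditions, as the definition requires.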
- the reporter gene may express visible markers.
- Reporter gene systems which express visible markers include β-glucuronidase and its substrate (X-Gluc), luciferase and its substrate (luciferin), and β-galactosidase and its substrate (X-Gal) which are widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system (Rhodes (1995) Methods Mol Biol 55:121-131).
- the assay with β-glucuronidase (GUS) being very especially preferred (Jefferson et al., GUS fusions: beta-glucuronidase as a sensitive and versatile gene fusion marker in higher plants).
- β-glucuronidase (GUS) expression is detected by a blue color on incubation of the tissue with 5-bromo-4-chloro-3-indolyl-β-D-glucuronic acid.
- the selectable marker gene may confer antibiotic or herbicide resistance.
- reporter genes include, but are not limited to, the dhfr gene, which confers resistance to methotrexate (Wigler (1980) Proc Natl Acad Sci 77:3567-3570); npt, which confers resistance to the aminoglycosides neomycin and G-418 (Colbere-Garapin (1981) J. Mol. Biol. 150:1-14); and als or pat, which confer resistance to chlorsulfuron and phosphinothricin, respectively (pat encodes phosphinothricin acetyltransferase).
- Expect value when used in the context of DNA sequence alignments or DNA sequence database searches refers to the number of times a certain match or a better one would be expected to occur purely by chance in a search of the entire database. Thus, the lower the Expect value, the greater the similarity between the input sequence and the match.
- the Expect value (E) is a parameter that describes the number of hits one can "expect" to see just by chance when searching a database of a particular size. It decreases exponentially with the Similarity Score (S) that is assigned to a match between two sequences. The higher the score, the lower the E value. Essentially, the E value describes the random background noise that exists for matches between sequences.
- the Expect value is used as a convenient way to create a significance threshold for reporting results.
- an E value of 1 assigned to a hit can be interpreted as meaning that in a database of the current size one might expect to see 1 match with a similar score simply by chance.
- the E-value is influenced by: a) length of sequence (the longer the query the lower the probability that it will find a sequence in the database by chance), b) size of database (the larger the database the higher the probability that the query will find a match by chance), c) the scoring matrix (the less stringent the scoring matrix the higher the probability that the query will find a sequence in the database by chance).
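The exponential dependence on the score and the dependence on database size can be sketched with the Karlin-Altschul form commonly associated with BLAST-type searches, E = K·m·n·e^(−λS). The source does not state this formula, and the values of `k` and `lam` below are illustrative placeholders rather than parameters of any particular program or scoring matrix.

```python
import math


def expect_value(score: float, query_len: int, db_len: int,
                 k: float = 0.1, lam: float = 0.3) -> float:
    """Expect value for a match with the given similarity score.

    k and lam are placeholder statistical parameters (assumptions);
    real values depend on the scoring system in use.
    """
    return k * query_len * db_len * math.exp(-lam * score)


# Hypothetical search: a 100 nt query against a database of 1,000,000 nt.
e_low_score = expect_value(score=40, query_len=100, db_len=1_000_000)
e_high_score = expect_value(score=50, query_len=100, db_len=1_000_000)
e_bigger_db = expect_value(score=40, query_len=100, db_len=2_000_000)
```

In this sketch a higher score gives a lower E value, and doubling the database size doubles the E value for the same score.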
- Expressible nucleic acid sequence as used in the context of this invention is any nucleic acid sequence that is capable of being transcribed into RNA (e.g. mRNA, antisense RNA, double strand forming RNA etc.) or translated into a particular protein.
- Expression refers to the biosynthesis of a gene product.
- expression involves transcription of the structural gene into mRNA and - optionally - the subsequent translation of mRNA into one or more polypeptides.
- Functional equivalents of the inventive introns are to be understood as natural or artificial mutations of said introns described in any of the SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22. Mutations can be insertions, deletions or substitutions of one or more nucleic acids that do not diminish the expression enhancing properties of said introns.
- homologs are in particular homologs of said introns derived from other plant species.
- Homologs when used in reference to introns refers to introns with expression enhancing properties isolated from a genomic nucleic acid sequence that encodes for a protein
- the gene expression enhancing effect of the functional equivalent intron is at least 50% higher, preferably at least 100% higher, especially preferably at least 300% higher, very especially preferably at least 500% higher than a reference value obtained with any of the introns shown in SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 under otherwise unchanged conditions.
- Functionally linked or operably linked is to be understood as meaning, for example, the sequential arrangement of a regulatory element (e.g. a promoter) with a nucleic acid sequence to be expressed and, if appropriate, further regulatory elements (such as e.g., a terminator) in such a way that each of the regulatory elements can fulfill its intended function to allow, modify, facilitate or otherwise influence expression of said nucleic acid sequence.
- the expression may result depending on the arrangement of the nucleic acid sequences in relation to sense or antisense RNA. To this end, direct linkage in the chemical sense is not necessarily required. Genetic control sequences such as, for example, enhancer sequences, can also exert their function on the target sequence from positions that are further away, or indeed from other DNA molecules.
- the terms "functionally linked", "operably linked", "in operable combination" and "in operable order" as used herein with reference to an inventive intron with gene expression enhancing properties refer to the linkage of at least one of said introns to a nucleic acid sequence in such a way that the expression enhancing effect is realized and, if functional splice sites have been included, that the intron can be spliced out by the cell factors responsible for the splicing procedure.
- the intron is introduced into the 5' non-coding region of a nucleic acid sequence.
- inventive expression constructs, wherein an inventive intron is functionally linked to a nucleic acid sequence, are shown in the examples.
- More preferred arrangements are those in which an intron functioning in intron mediated expression enhancement is inserted between a promoter and a nucleic acid sequence, preferably into the transcribed nucleic acid sequence, or in case of a nucleic acid sequence encoding for a protein, into the 5' untranslated region of a nucleic acid sequence.
- the distance between the promoter sequence and the nucleic acid sequence to be expressed recombinantly is preferably less than 200 base pairs, especially preferably less than 100 base pairs, very especially preferably less than 50 base pairs.
- Operable linkage and an expression cassette can be generated by means of customary recombination and cloning techniques as are described, for example, in Maniatis T, Fritsch EF and Sambrook J (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor (NY), in Silhavy TJ, Berman ML and Enquist LW (1984) Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor (NY), in Ausubel FM et al. (1987) Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Interscience and in Gelvin et al. (1990) In: Plant Molecular Biology Manual.
- sequences which, for example, act as a linker with specific cleavage sites for restriction enzymes, or as a signal peptide, may also be positioned between the two sequences.
- the insertion of sequences may also lead to the expression of fusion proteins.
- the expression construct consisting of a linkage of promoter, intron and nucleic acid sequence to be expressed, can exist in a vector-integrated form and be inserted into a plant genome, for example by transformation.
- Gene refers to a coding region operably linked to appropriate regulatory sequences capable of regulating the expression of the polypeptide in some manner.
- a gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers, repressors, etc.) preceding (upstream) and following (downstream) the coding region (open reading frame, ORF) as well as, where applicable, intervening sequences (i.e., introns) between individual coding regions (i.e., exons). Genes may also include sequences located on both the 5'- and 3'-end of the sequences, which are present on the RNA transcript.
- such sequences are referred to as "flanking" sequences or regions (these flanking sequences are located 5' or 3' to the non-translated sequences present on the mRNA transcript).
- the 5'-flanking region may contain regulatory sequences such as promoters and enhancers, which control or influence the transcription of the gene.
- the 3'-flanking region may contain sequences, which direct the termination of transcription, posttranscriptional cleavage and polyadenylation.
- Gene expression enhancing properties, gene expression enhancing effect or intron mediated gene expression enhancement when made in reference to an intron sequence refers to the ability of the intron to enhance quantitatively the expression level of a nucleic acid sequence (e.g. a gene) that is part of a recombinant/transgenic DNA expression cassette (as defined herein), measured on the basis of the transcribed RNA, mRNA, protein amount or protein activity compared to the otherwise identical expression construct lacking the intron under otherwise unchanged conditions.
- Gene expression enhancing properties in plants refers to an intron that is able to enhance quantitatively the expression level of a plant derived nucleic acid sequence in a plant or plant cell and the enhancement of gene expression rate of a non-plant derived nucleic acid in a plant or a plant cell compared to the otherwise identical expression construct lacking the intron under otherwise unchanged conditions.
- the expression enhancing effect is understood as an increase in the RNA steady state level, the protein steady state level or the protein activity of a nucleic acid sequence or the corresponding protein (e.g.
- Furthermore, expression enhancing effect or intron mediated enhancement has to be understood as the ability of an intron to change the tissue, organ or cell specific expression pattern of a nucleic acid sequence (e.g. a gene) that is part of an inventive expression cassette.
- Changing the tissue, organ or cell specific expression pattern of a nucleic acid sequence that is part of an inventive expression cassette refers to the fact that due to the presence of an inventive intron, the expression level (mRNA or encoded protein steady state level, or the activity of a protein) of the respective gene is increased above the detection threshold of the used detection method.
- Gene silencing can be realized by antisense or double-stranded RNA or by co-suppression (sense-suppression).
- the skilled worker knows that he can use alternative cDNA or the corresponding gene as starting template for suitable antisense constructs.
- the 'antisense' nucleic acid is preferably complementary to the coding region of the target protein or part thereof. However, the 'antisense' nucleic acid may also be complementary to the non-coding region or part thereof. Starting from the sequence information on a target protein, an antisense nucleic acid can be designed in the manner with which the skilled worker is familiar, taking into consideration the Watson and Crick rules of base pairing.
- An antisense nucleic acid can be complementary to the entire or part of the nucleic acid sequence of a target protein. Likewise encompassed is the use of the above-described sequences in sense orientation, which, as is known to the skilled worker, can lead to co-suppression (sense-suppression). It has been demonstrated that expression of sense nucleic acid sequences can reduce or switch off expression of the corresponding gene, analogously to what has been described for antisense approaches (Goring (1991) Proc. Natl Acad. Sci. USA 88:1770-1774; Smith (1990) Mol. Gen. Genet. 224:447-481; Napoli (1990) Plant Cell 2:279-289; Van der Krol (1990) Plant Cell 2:291-299).
- the construct introduced may represent the gene to be reduced fully or only in part.
- the possibility of translation is not necessary.
- gene regulation methods by means of double-stranded RNAi ("double-stranded RNA interference").
- Such methods are known to the person skilled in the art (e.g., Matzke 2000; Fire 1998; WO 99/32619; WO 99/53050; WO 00/68374; WO 00/44914; WO 00/44895; WO 00/49035; WO 00/63364).
- The processes and methods described in the references stated are expressly referred to.
- Genome and genomic DNA of an organism as used herein is the whole hereditary information of an organism that is encoded in the DNA (or, for some viruses, RNA). This includes both the genes and the non-coding sequences.
- Said genomic DNA comprises the DNA of the nucleus (also referred to as chromosomal DNA) but also the DNA of the plastids (e.g., chloroplasts) and other cellular organelles (e.g., mitochondria).
- the terms genome or genomic DNA refer to the chromosomal DNA of the nucleus.
- the term "chromosomal DNA" or "chromosomal DNA sequence" is to be understood as the genomic DNA of the cellular nucleus independent of the cell cycle status.
- Chromosomal DNA might therefore be organized in chromosomes or chromatids, and might be condensed or uncoiled.
- An insertion into the chromosomal DNA can be demonstrated and analyzed by various methods known in the art like e.g., polymerase chain reaction (PCR) analysis, Southern blot analysis, fluorescence in situ hybridization (FISH), and in situ PCR.
- Heterologous with respect to a nucleic acid sequence refers to a nucleotide sequence, which is ligated to a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature.
- Hybridizing includes "any process by which a strand of nucleic acid joins with a complementary strand through base pairing.” (Coombs 1994, Dictionary of Biotechnology, Stockton Press, New York N. Y.). Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementarity between the nucleic acids, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids. As used herein, the term “Tm” is used in reference to the “melting temperature.” The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands.
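The dependence of the melting temperature on base composition can be illustrated with the Wallace rule, a widely used rough estimate for short oligonucleotides: Tm ≈ 2 °C × (A+T) + 4 °C × (G+C). The source does not name any Tm formula, so the choice of this rule, the function name and the example sequences are assumptions of this sketch; the rule ignores salt, formamide and probe concentration.

```python
def wallace_tm(oligo: str) -> int:
    """Rough Tm estimate in degrees Celsius for a short DNA oligonucleotide."""
    seq = oligo.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G and T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical 16-mers with different G:C content.
balanced = wallace_tm("ATGCATGCATGCATGC")
gc_rich = wallace_tm("GCGCGCGCGCGCGCGC")
at_rich = wallace_tm("ATATATATATATATAT")
```

Under this estimate the G:C-rich oligonucleotide forms the most stable hybrid and the A:T-rich one the least stable.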
- hybridization conditions may be employed to comprise either low or high stringency conditions; factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high hybridization stringency. Those skilled in the art know that higher stringencies are preferred to reduce or eliminate non-specific binding between the nucleotide sequence of an inventive intron and other nucleic acid sequences, whereas lower stringencies are preferred to detect a larger number of nucleic acid sequences having different homologies to the inventive nucleotide sequences.
- Identity when used in relation to nucleic acids refers to a degree of complementarity. Identity between two nucleic acids is understood as meaning the identity of the nucleic acid sequence over in each case the entire length of the sequence, which is calculated by comparison with the aid of the program algorithm GAP (Wisconsin Package Version 10.0, University of Wisconsin, Genetics Computer Group (GCG), Madison, USA) with the parameters being set as follows:
- Gap Weight: 12; Length Weight: 4
- a sequence with at least 95% identity to the sequence SEQ ID NO. 1 at the nucleic acid level is understood as meaning the sequence that, upon comparison with the sequence SEQ ID NO. 1 by the above program algorithm with the above parameter set, has at least 95% identity.
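The GAP program itself is not reproduced here. As a rough illustration of what a percent identity threshold means, the sketch below counts matching columns in two sequences that are assumed to be already aligned, with gaps written as "-". Counting gap columns as mismatches is an assumption of this sketch, and GAP's own alignment, scoring and identity calculation may differ.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        a == b and a != "-"
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
    )
    return matches / len(aligned_a) * 100.0


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 95.0) -> bool:
    """True if the aligned pair reaches the given identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical aligned fragments, not taken from any SEQ ID NO.
identity = percent_identity("ACGT", "ACGA")
```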
- Introducing a recombinant DNA expression construct in plant cells refers to a recombinant DNA expression construct that will be introduced into the genome of a plant by transformation and is stably maintained.
- the term "introducing" encompasses for example methods such as transfection, transduction or transformation.
- Identification with regard to transformation of plants has to be understood as a screening procedure to identify and select those plant cells in which the recombinant expression construct has been introduced stably into the genome.
- "Identifying" with regard to an intron with gene expression enhancing properties refers to a process for the selection of said intron out of a population of introns.
- "identifying" refers to an in silico selection process, more preferably to an automated in silico selection process, using the selection criteria of the inventive methods.
- Such an in silico identification process can comprise for instance the steps of (1) generating an intron sequence database on the basis of DNA sequences present in a DNA sequence database (e.g. genomic DNA databases publicly available via the internet) and (2) screening of the generated intron DNA sequence database, or other databases containing genomic DNA sequences, for introns with gene expression enhancing properties using the criteria according to the inventive method, wherein the steps for retrieving or generating the DNA sequences, the generation of an intron specific DNA sequence database and the screening of these DNA sequences, using the criteria according to the inventive method, will be performed with the aid of appropriate computer algorithms and computer devices.
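The two steps can be sketched as a small pipeline: introns are derived from exon coordinates of annotated genomic sequences, collected into an intron database, and then screened with a criterion function. Everything named here is hypothetical (the data layout, the function names and the toy gene), and the screening criterion is a placeholder only, since the inventive selection criteria are not given in this passage.

```python
from typing import Callable, Dict, List, Tuple

Exon = Tuple[int, int]  # 0-based start, end-exclusive


def extract_introns(genomic: str, exons: List[Exon]) -> List[str]:
    """Return the sequences lying between consecutive exons."""
    ordered = sorted(exons)
    return [
        genomic[prev_end:next_start]
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    ]


def build_intron_database(genes: Dict[str, Tuple[str, List[Exon]]]) -> Dict[str, str]:
    """Step 1: collect every intron of every annotated gene under a unique name."""
    database = {}
    for gene_id, (genomic, exons) in genes.items():
        for number, intron in enumerate(extract_introns(genomic, exons), start=1):
            database[f"{gene_id}_intron{number}"] = intron
    return database


def screen(database: Dict[str, str], criterion: Callable[[str], bool]) -> List[str]:
    """Step 2: keep the names of introns that satisfy the given criterion."""
    return [name for name, seq in database.items() if criterion(seq)]


# Toy annotated gene (hypothetical data): two exons flanking one intron.
genes = {"geneA": ("AAAGTCCCAGTTT", [(0, 3), (10, 13)])}
database = build_intron_database(genes)

# Placeholder criterion only; it stands in for the inventive selection criteria.
candidates = screen(database, lambda seq: seq.startswith("GT") and seq.endswith("AG"))
```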
- Intron refers to sections of DNA (intervening sequences) within a gene that do not encode part of the protein that the gene produces, and that are spliced out of the mRNA that is transcribed from the gene before it is exported from the cell nucleus.
- Intron sequence refers to the nucleic acid sequence of an intron.
- introns are those regions of DNA sequences that are transcribed along with the coding sequence (exons) but are removed during the formation of mature mRNA. Introns can be positioned within the actual coding region or in either the 5' or 3' untranslated leaders of the pre-mRNA (unspliced mRNA).
- Introns in the primary transcript are excised and the coding sequences are simultaneously and precisely ligated to form the mature mRNA.
- the junctions of introns and exons form the splice site.
- the sequence of an intron begins with GU and ends with AG.
- two examples of AU-AC introns have been described: the fourteenth intron of the RecA-like protein gene and the seventh intron of the G5 gene from Arabidopsis thaliana are AT-AC introns.
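The terminal dinucleotides described above lend themselves to a simple check. The sketch below classifies an intron sequence by its first and last two nucleotides; the function name and example sequences are hypothetical, and DNA input is converted to RNA notation (T to U) for the comparison.

```python
def classify_intron(sequence: str) -> str:
    """Classify an intron as 'GU-AG', 'AU-AC' or 'other' by its terminal dinucleotides."""
    rna = sequence.upper().replace("T", "U")
    if len(rna) < 4:
        return "other"
    ends = (rna[:2], rna[-2:])
    if ends == ("GU", "AG"):
        return "GU-AG"
    if ends == ("AU", "AC"):
        return "AU-AC"
    return "other"


# Hypothetical intron fragments written as DNA.
usual = classify_intron("GTAAGTTTTCAG")
rare = classify_intron("ATATCCTTAC")
```

A match on the terminal dinucleotides alone does not show that a sequence is spliced correctly; it is only a first filter.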
- Pre-mRNAs containing introns have three short sequences that are, besides other sequences, essential for the intron to be accurately spliced. These sequences are the 5' splice-site, the 3' splice-site, and the branchpoint.
- mRNA splicing is the removal of intervening sequences (introns) present in primary mRNA transcripts and joining or ligation of exon sequences. This is also known as cis-splicing which joins two exons on the same RNA with the removal of the intervening sequence (intron).
- the functional elements of an intron comprising sequences that are recognized and bound by the specific protein components of the spliceosome (e.g. splicing consensus sequences at the ends of introns). The interaction of the functional elements with the spliceosome results in the removal of the intron sequence from the premature mRNA and the rejoining of the exon sequences.
- Introns have three short sequences that are essential, although not sufficient, for the intron to be accurately spliced. These sequences are the 5' splice site, the 3' splice site and the branch point.
- the branchpoint sequence is important in splicing and splice-site selection in plants.
- the branchpoint sequence is usually located 10-60 nucleotides upstream of the 3' splice site. Plant sequences exhibit sequence deviations in the branchpoint, the consensus sequences being 5'-CURAY-3' (SEQ ID NO: 75) or 5'-YURAY-3' (SEQ ID NO: 76).
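A search for the branchpoint consensus can be sketched with a regular expression, reading R as A or G and Y as C or U (so YURAY also covers CURAY). How the 10-60 nucleotide distance is measured is an assumption here: the sketch only reports motifs lying entirely within the stretch from 60 to 10 nucleotides before the 3' end of the intron. The function name and test sequences are hypothetical.

```python
import re

# YURAY with R = A/G and Y = C/U; the lookahead also reports overlapping hits.
BRANCHPOINT = re.compile(r"(?=([CU]U[AG]A[CU]))")


def find_branchpoints(intron: str, min_upstream: int = 10, max_upstream: int = 60):
    """Return (position, motif) pairs for consensus matches in the search window."""
    rna = intron.upper().replace("T", "U")
    start = max(0, len(rna) - max_upstream)
    end = len(rna) - min_upstream
    window = rna[start:end]
    return [(start + m.start(), m.group(1)) for m in BRANCHPOINT.finditer(window)]


# Hypothetical introns: motif 20 nt before the 3' end, and motif too close to it.
in_window = find_branchpoints("GU" + "A" * 20 + "CUAAC" + "A" * 18 + "AG")
too_close = find_branchpoints("GU" + "A" * 30 + "CUAAC" + "AAAG")
```

Positions are 0-based offsets from the start of the intron.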
- "IME-intron" or intron mediated enhancement (IME)-intron when made in reference to an intron sequence refers to an intron with gene expression enhancing properties in plants as defined herein (see gene expression enhancing properties, gene expression enhancing effect or intron mediated gene expression enhancement).
- Isolation or isolated when used in relation to an intron or gene, as in "isolation of an intron sequence" or "isolation of a gene", refers to a nucleic acid sequence that is identified within and isolated/separated from its chromosomal nucleic acid sequence context within the respective source organism.
- Isolated nucleic acid is nucleic acid present in a form or setting that is different from that in which it is found in nature.
- nonisolated nucleic acids are nucleic acids such as DNA and RNA, which are found in the state they exist in nature. For example, a given DNA sequence (e.g.
- nucleic acid sequence may be present in single-stranded or double-stranded form.
- the nucleic acid sequence will contain at a minimum at least a portion of the sense or coding strand (i.e., the nucleic acid sequence may be single- stranded). Alternatively, it may contain both the sense and anti-sense strands (i.e., the nucleic acid sequence may be double-stranded).
- Nucleic acid refers to deoxyribonucleotides, ribonucleotides or polymers or hybrids thereof in single-or double-stranded, sense or antisense form. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated.
- the term "nucleic acid" can be used to describe a "gene", "cDNA", "DNA", "mRNA", "oligonucleotide" and "polynucleotide".
- Nucleic acid sequence refers to the consecutive sequence of deoxyribonucleotides or ribonucleotides (nucleotides) of a DNA fragment (oligonucleotide, polynucleotide, genomic DNA, cDNA etc.) as it can be made available by DNA sequencing techniques as a list of abbreviations, letters, characters or words, which represent nucleotides.
- Organ with respect to a plant (or "plant organ") means parts of a plant and may include (but shall not be limited to) for example roots, fruits, shoots, stem, leaves, anthers, sepals, petals, pollen, seeds, etc.
- Plant is generally understood as meaning any single-or multi-celled organism or a cell, tissue, organ, part or propagation material (such as seeds or fruit) of same which is capable of photosynthesis. Included for the purpose of the invention are all genera and species of higher and lower plants of the Plant Kingdom. Annual, perennial, monocotyledonous and dicotyledonous plants are preferred. The term includes the mature plants, seed, shoots and seedlings and their derived parts, propagation material (such as seeds or microspores), plant organs, tissue, protoplasts, callus and other cultures, for example cell cultures, and any other type of plant cell grouping to give functional or structural units. Mature plants refer to plants at any desired developmental stage beyond that of the seedling.
- Seedling refers to a young immature plant at an early developmental stage. Annual, biennial, monocotyledonous and dicotyledonous plants are preferred host organisms for the generation of transgenic plants. The expression of genes is furthermore advantageous in all ornamental plants, useful or ornamental trees, flowers, cut flowers, shrubs or lawns.
- Plants which may be mentioned by way of example but not by limitation are angiosperms, bryophytes such as, for example, Hepaticae (liverworts) and Musci (mosses); Pteridophytes such as ferns, horsetail and club mosses; gymnosperms such as conifers, cycads, ginkgo and Gnetatae; algae such as Chlorophyceae, Phaeophyceae, Rhodophyceae, Myxophyceae, Xanthophyceae, Bacillariophyceae (diatoms), and Euglenophyceae.
- Preferred are plants which are used for food or feed purpose such as the families of the Leguminosae such as pea, alfalfa and soya; Gramineae such as rice, maize, wheat, barley, sorghum, millet, rye, triticale, or oats; the family of the Umbelliferae, especially the genus Daucus, very especially the species carota (carrot) and Apium, very especially the species Graveolens dulce (celery) and many others; the family of the Solanaceae, especially the genus Lycopersicon, very especially the species esculentum (tomato) and the genus Solanum, very especially the species tuberosum (potato) and melongena (egg plant), and many others (such as tobacco); and the genus Capsicum, very especially the species annuum (peppers) and many others; the family of the Leguminosae, especially the genus Glycine, very especially the
- Producing when used in relation to an intron as in "producing an intron" refers to the synthesis of DNA molecules on the basis of DNA sequence information of an inventive intron.
- Promoter refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into mRNA.
- a promoter is a recognition site on a DNA sequence that provides an expression control element for a gene and to which RNA polymerase specifically binds and initiates RNA synthesis (transcription) of that gene.
- a promoter is typically, though not necessarily, located 5' (i.e., upstream) of a nucleotide sequence of interest (e.g., proximal to the transcriptional start site of a structural gene). Promoters may be tissue specific or cell specific.
- tissue specific refers to a promoter that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (e.g., petals) in the relative absence of expression of the same nucleotide sequence of interest in a different type of tissue (e.g., roots). Promoters may be constitutive or regulatable.
- constitutive when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock, chemicals, light, etc.). Typically, constitutive promoters are capable of directing expression of a transgene in substantially any cell and any tissue.
- a "regulatable" promoter is one which is capable of directing a level of transcription of an operably linked nucleic acid sequence in the presence of a stimulus (e.g., heat shock, chemicals, light, etc.) which is different from the level of transcription of the operably linked nucleic acid sequence in the absence of the stimulus.
- a promoter sequence functioning in plants is understood as meaning, in principle, any promoter which is capable of governing the expression of genes, in particular foreign genes, in plants or plant parts, plant cells, plant tissues or plant cultures. In this context, expression can be, for example, constitutive, inducible or development-dependent.
- a constitutive promoter is a promoter where the rate of RNA polymerase binding and initiation is approximately constant and relatively independent of external stimuli.
- Usable promoters are constitutive promoters (Benfey et al. (1989) EMBO J. 8:2195-2202), such as those which originate from plant viruses, such as 35S CaMV (Franck et al. (1980) Cell 21:285-294), 19S CaMV (see also US 5352605 and WO 84/02913), 34S FMV (Sanger et al. (1990) Plant Mol. Biol. 14:433-443), the parsley ubiquitin promoter, or plant promoters such as the Rubisco small subunit promoter described in US 4,962,028 or the plant promoters PRP1 [Ward et al. (1993) Plant Mol. Biol.
- An inducible promoter is a promoter where the rate of RNA polymerase binding and initiation is modulated by external stimuli. Such stimuli include light, heat, anaerobic stress, alteration in nutrient conditions, presence or absence of a metabolite, presence of a ligand, microbial attack, wounding and the like (for a review, see Gatz (1997) Annu. Rev. Plant Physiol. Plant Mol. Biol. 48:89-108). Chemically inducible promoters are particularly suitable when it is desired to express the gene in a time-specific manner.
- a viral promoter is a promoter with a DNA sequence substantially similar to the promoter found at the 5' end of a viral gene.
- a typical viral promoter is found at the 5' end of the gene coding for the p21 protein of MMTV described by Huang et al. ((1981) Cell 27:245).
- a synthetic promoter is a promoter that was chemically synthesized rather than biologically derived. Usually synthetic promoters incorporate sequence changes that optimize the efficiency of RNA polymerase initiation.
- a temporally regulated promoter is a promoter where the rate of RNA polymerase binding and initiation is modulated at a specific time during development. Examples of temporally regulated promoters are given in Chua et al. [(1989) Science 244:174-181].
- a spatially regulated promoter is a promoter where the rate of RNA polymerase binding and initiation is modulated in a specific structure of the organism such as the leaf, stem or root. Examples of spatially regulated promoters are given in Chua et al.
- a spatiotemporally regulated promoter is a promoter where the rate of RNA polymerase binding and initiation is modulated in a specific structure of the organism at a specific time during development.
- a typical spatiotemporally regulated promoter is the EPSP synthase-35S promoter described by Chua et al. [(1989) Science 244:174-181]. Suitable promoters are furthermore the oilseed rape napin gene promoter (US 5,608,152), the Vicia faba USP promoter (Baumlein et al.
- Advantageous seed-specific promoters are the sucrose binding protein promoter (WO 00/26388), the phaseolin promoter and the napin promoter.
- Suitable promoters which must be considered are the barley Ipt2 or Ipt1 gene promoter (WO 95/15389 and WO 95/23230), and the promoters described in WO 99/16890 (promoters from the barley hordein gene, the rice glutelin gene, the rice oryzin gene, the rice prolamin gene, the wheat gliadin gene, the wheat glutelin gene, the maize zein gene, the oat glutelin gene, the sorghum kafirin gene and the rye secalin gene).
- promoters are Amy32b, Amy 6-6 and Aleurain [US 5,677,474], Bce4 (oilseed rape) [US 5,530,149], glycinin (soya) [EP 571 741], phosphoenolpyruvate carboxylase (soya) [JP 06/62870], ADR12-2 (soya) [WO 98/08962], isocitrate lyase (oilseed rape) [US 5,689,040] or α-amylase (barley) [EP 781 849].
- promoters which are available for the expression of genes in plants are leaf-specific promoters such as those described in DE-A 19644478 or light-regulated promoters such as, for example, the pea petE promoter.
- suitable plant promoters are the cytosolic FBPase promoter or the potato ST-LSI promoter (Stockhaus et al. (1989) EMBO J. 8:2445), the Glycine max phosphoribosylpyrophosphate amidotransferase promoter (GenBank Accession No. U87999) or the node-specific promoter described in EP A 0 249 676.
- promoters are those which react to biotic or abiotic stress conditions, for example the pathogen-induced PRP1 gene promoter (Ward et al. (1993) Plant Mol. Biol. 22:361-366), the tomato heat-inducible hsp80 promoter (US 5,187,267), the potato chill-inducible alpha-amylase promoter (WO 96/12814) or the wound-inducible pinII promoter (EP-A-0 375 091) or others as described herein.
- Other promoters, which are particularly suitable, are those that bring about plastid-specific expression.
- Suitable promoters such as the viral RNA polymerase promoter are described in WO 95/16783 and WO 97/06250, and the Arabidopsis clpP promoter, which is described in WO 99/46394.
- Other promoters which are used for the strong expression of heterologous sequences in as many tissues as possible, in particular also in leaves, are, in addition to several of the abovementioned viral and bacterial promoters, preferably, plant promoters of actin or ubiquitin genes such as, for example, the rice actin 1 promoter.
- Further examples of constitutive plant promoters are the sugarbeet V-ATPase promoters (WO 01/14572).
- Examples of synthetic constitutive promoters are the Super promoter (WO 95/14098) and promoters derived from G-boxes (WO 94/12015). If appropriate, chemical inducible promoters may furthermore also be used, compare EP-A 388186, EP-A 335528, WO 97/06268.
- the above listed promoters can comprise other regulatory elements that affect gene expression in response to plant hormones (Xu et al., 1994, Plant Cell 6(8):1077-1085) or to biotic or abiotic environmental stimuli, such as stress conditions, as exemplified by drought (Tran et al. (2004) Plant Cell 16(9):2481-2498), heat, chilling, freezing, salt stress, oxidative stress (US 5,290,924) or biotic stressors like bacteria, fungi or viruses.
- Polypeptide, peptide, oligopeptide, gene product, expression product and protein are used interchangeably herein to refer to a polymer or oligomer of consecutive amino acid residues.
- Recombinant or transgenic DNA expression construct refers to all those constructs originating by experimental manipulations in which either a) said nucleic acid sequence, or b) a genetic control sequence linked operably to said nucleic acid sequence (a), for example a promoter, or c) (a) and (b) is not located in its natural genetic environment or has been modified by experimental manipulations, an example of a modification being a substitution, addition, deletion, inversion or insertion of one or more nucleotide residues.
- Natural genetic environment refers to the natural chromosomal locus in the organism of origin, or to the presence in a genomic library.
- the natural genetic environment of the nucleic acid sequence is preferably retained, at least in part.
- the environment flanks the nucleic acid sequence at least at one side and has a sequence of at least 50 bp, preferably at least 500 bp, especially preferably at least 1,000 bp, very especially preferably at least 5,000 bp, in length.
- a naturally occurring expression construct, for example the naturally occurring combination of a promoter with the corresponding gene, becomes a transgenic expression construct when it is modified by non-natural, synthetic "artificial" methods such as, for example, mutagenesis. Such methods have been described (US 5,565,350; WO 00/15815).
- Recombinant polypeptides or proteins refer to polypeptides or proteins produced by recombinant DNA techniques, i.e., produced from cells transformed by an exogenous recombinant DNA construct encoding the desired polypeptide or protein.
- Recombinant nucleic acids and polypeptides may also comprise molecules which as such do not exist in nature but are modified, changed, mutated or otherwise manipulated by man.
- An important use of the intron sequences of the invention will be the enhancement of the expression of a nucleic acid sequence, which encodes a particular protein, a polypeptide or DNA sequences that interfere with normal transcription or translation, e.g. interference- or antisense-RNA.
- the recombinant DNA expression construct confers expression of one or more nucleic acid molecules.
- Said recombinant DNA expression construct according to the invention advantageously encompasses a promoter functioning in plants, additional regulatory or control elements or sequences functioning in plants, an intron sequence with expression enhancing properties in plants and a terminator functioning in plants.
- the recombinant expression construct might contain additional functional elements such as expression cassettes conferring expression of e.g. positive and negative selection markers, reporter genes, recombinases or endonucleases effecting the production, amplification or function of the expression cassettes, vectors or recombinant organisms according to the invention.
- the recombinant expression construct can comprise nucleic acid sequences homologous to a plant gene of interest having a sufficient length in order to induce a homologous recombination (HR) event at the locus of the gene of interest after introduction in the plant.
- a recombinant transgenic expression cassette of the invention (or a transgenic vector comprising said transgenic expression cassette) can be produced by means of customary recombination and cloning techniques as are described (for example, in Maniatis 1989, Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor (NY); Silhavy 1984, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY; and in Ausubel 1987, Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Interscience).
- The introduction of an expression cassette according to the invention into an organism or cells, tissues, organs, parts or seeds thereof (preferably into plants or plant cells, tissue, organs, parts or seeds) can be effected advantageously using vectors, which comprise the above described nucleic acids, promoters, introns, terminators, regulatory or control elements and functional elements.
- Regeneration means growing a whole plant from a plant cell, a group of plant cells, a plant part or a plant piece (e.g., from a protoplast, callus, protocorm-like body, or tissue part).
- Regulatory sequence refers to promoters, enhancers or other segments of DNA where regulatory proteins such as transcription factors bind, thereby influencing the transcription rate of a given gene.
- Substantially all introns of a plant genome represented in a genomic DNA sequence database or genomic DNA library refers to more than 80%, preferably to more than 90%, more preferably to more than 95%, still more preferably more than 98% of all introns present in the genome of the plant used as a source for the preparation of the genomic DNA sequence database or genomic DNA library.
- the construction of genomic libraries and the subsequent sequencing of the genomic DNA and the construction of a genomic or genome DNA sequence database using the obtained sequence information is well established in the art (Mozo et al. (1998) Mol. Gen. Genet. 258:562-570; Choi et al. (1995) Weeds World 2:17-20; Lui et al. (1999) Proc. Natl. Acad. Sci. USA 96:6535-6540; The Arabidopsis Genome Initiative, Nature 402:761-777 (1999); The Arabidopsis Genome Initiative, Nature 408:796-826 (2000)).
- Structural gene as used herein is intended to mean a DNA sequence that is transcribed into mRNA which is then translated into a sequence of amino acids characteristic of a specific polypeptide.
- a homology sequence comprised in a DNA-construct is to be understood to comprise sequences of a length of at least 100 base pairs, preferably at least 250 base pairs, more preferably at least 500 base pairs, especially preferably at least 1,000 base pairs, most preferably at least 2,500 base pairs.
- the term "sufficient homology" with respect to a homology sequence comprised in a DNA-construct is to be understood to comprise sequences having a homology to the corresponding target sequence comprised in the chromosomal DNA (e.g., the target sequence A or B) of at least 70%, preferably at least 80%, more preferably at least 90%, especially preferably at least 95%, more especially preferably at least 99%, most preferably 100%, wherein said homology extends over a length of at least 50 base pairs, preferably at least 100 base pairs, more preferably at least 250 base pairs, most preferably at least 500 base pairs.
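The "sufficient homology" thresholds can be illustrated with a minimal sketch. It assumes the homology sequence and the target sequence have already been aligned without gaps and trimmed to equal length, and it equates "homology" with simple percent identity, which is a simplification. The function names `percent_identity` and `has_sufficient_homology` are hypothetical and not part of the source.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be pre-aligned, equal-length and non-empty")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def has_sufficient_homology(construct_seq: str, target_seq: str,
                            min_identity: float = 70.0,
                            min_length: int = 50) -> bool:
    """Apply the broadest thresholds from the text: >= 70% over >= 50 base pairs."""
    if len(construct_seq) != len(target_seq) or len(construct_seq) < min_length:
        return False
    return percent_identity(construct_seq, target_seq) >= min_identity
```

Stricter tiers from the text (for example 90% over 250 base pairs) can be tested by passing other values for `min_identity` and `min_length`.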
- Target region/sequence of a nucleic acid sequence is a portion of a nucleic acid sequence that is identified to be of interest.
- a "coding region" of a nucleic acid sequence is the portion of the nucleic acid sequence, which is transcribed and translated in a sequence-specific manner to produce a particular polypeptide or protein when placed under the control of appropriate regulatory sequences. The coding region is said to encode such a polypeptide or protein.
- Plant tissue with respect to a plant (or "plant tissue") means an arrangement of multiple plant cells including differentiated and undifferentiated tissues of plants.
- Plant tissues may constitute part of a plant organ (e.g., the epidermis of a plant leaf) but may also constitute tumor tissues and various types of cells in culture (e.g., single cells, protoplasts, embryos, calli, protocorm-like bodies, etc.).
- Plant tissue may be in planta, in organ culture, tissue culture, or cell culture.
- Transforming or transformation refers to the introduction of genetic material (e.g., a transgene) into a cell. Transformation of a cell may be stable or transient.
- transient transformation refers to the introduction of one or more transgenes into a cell in the absence of integration of the transgene into the host cell's genome. Transient transformation may be detected by, for example, enzyme-linked immunosorbent assay (ELISA) which detects the presence of a polypeptide encoded by one or more of the transgenes.
- transient transformation may be detected by detecting the activity of the protein (e.g., β-glucuronidase) encoded by the transgene (e.g., the uidA gene) as demonstrated herein [e.g., examples 1.6 and 2.4, histochemical assay of GUS enzyme activity by staining with X-gluc which gives a blue precipitate in the presence of the GUS enzyme; and a chemiluminescent assay of GUS enzyme activity using the GUS-Light kit (Tropix)].
- the term "transient transformant" refers to a cell which has transiently incorporated one or more transgenes.
- stable transformation refers to the introduction and integration of one or more transgenes into the genome of a cell, preferably resulting in chromosomal integration and stable heritability through meiosis.
- Stable transformation of a cell may be detected by Southern blot hybridization of genomic DNA of the cell with nucleic acid sequences, which are capable of binding to one or more of the transgenes.
- stable transformation of a cell may also be detected by the polymerase chain reaction of genomic DNA of the cell to amplify transgene sequences.
- stable transformant refers to a cell that has stably integrated one or more transgenes into the genomic DNA.
- a stable transformant is distinguished from a transient transformant in that, whereas genomic DNA from the stable transformant contains one or more transgenes, genomic DNA from the transient transformant does not contain a transgene. Transformation also includes introduction of genetic material into plant cells in the form of plant viral vectors involving extrachromosomal replication and gene expression, which may exhibit variable properties with respect to meiotic stability.
- Transgenic or recombinant when used in reference to a cell refers to a cell which contains a transgene, or whose genome has been altered by the introduction of a transgene.
- transgenic when used in reference to a tissue or to a plant refers to a tissue or plant, respectively, which comprises one or more cells that contain a transgene, or whose genome has been altered by the introduction of a transgene.
- Transgenic cells, tissues and plants may be produced by several methods including the introduction of a "transgene" comprising nucleic acid (usually DNA) into a target cell or integration of the transgene into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
- Wild-type, natural or of natural origin means, with respect to an organism, polypeptide, or nucleic acid sequence, that said organism, polypeptide, or nucleic acid sequence is naturally occurring or available in at least one naturally occurring organism, polypeptide, or nucleic acid sequence which is not changed, mutated, or otherwise manipulated by man.
- Vector is a DNA molecule capable of replication in a host cell. Plasmids and cosmids are exemplary vectors. Furthermore, the terms “vector” and “vehicle” are used interchangeably in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another, whereby the cells do not necessarily belong to the same organism (e.g. transfer of a DNA segment from an Agrobacterium cell to a plant cell).
- expression vector refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism.
- the teaching of the present invention enables the identification of introns causing intron mediated enhancement (IME) of gene expression. Furthermore, the present invention provides isolated plant introns that, if functionally combined with a promoter functioning in plants and a nucleic acid fragment, can enhance the expression rate of said nucleic acid in a plant or a plant cell.
- a first embodiment of the present invention relates to a method for identifying an intron with plant gene expression enhancing properties comprising selecting an intron from a plant genome, wherein said intron is characterized by at least the following features
- the invention relates to a method for enriching the number of introns with expression enhancing properties in plants in a population of plant introns to a percentage of at least 50% of said population, said method comprising selecting introns from said population, wherein said introns are characterized by at least the following features
- Insertion of any of the inventive introns described by SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 into the 5' untranslated region (UTR) of the β-glucuronidase gene (GUS) driven by the Zea mays Ubiquitin promoter has led to strong expression enhancement of the reporter gene in maize protoplasts (Black Mexican Sweet suspension cells) and stably transformed plants (see examples).
- Furthermore it could be shown that the gene expression enhancement properties of said introns are comparable to those known from the literature (e.g. the first intron of the Zea mays Ubiquitin gene, used as positive control in the expression assays).
- the number of introns with gene expression enhancing properties identified within a population of introns by applying the method of the invention for enrichment is enriched to a percentage of at least 50%, preferably at least 55%, more preferably at least 60%, especially preferably at least 65%, or very especially preferably at least 70% (i.e., a given population of 100 introns pre-selected by using the inventive method will comprise at least 50, preferably at least 55, more preferably at least 60, especially preferably at least 65 or 70 introns with gene expression enhancing properties).
- the number of introns with gene expression enhancing properties identified within a population of introns by applying the method of the invention for enrichment is enriched to a percentage of at least 50%, wherein the selected introns, if part of a recombinant DNA expression construct, lead to an increase in the gene expression of a given gene of at least 300% compared to the otherwise identical expression construct lacking the intron under otherwise unchanged conditions.
- the enrichment is at least 60%, wherein the selected introns increase the transcription of a gene driven by a given promoter by at least 200%.
- the enrichment is at least 70%, wherein the selected introns increase the transcription of a gene driven by a given promoter by at least 50%.
- the length of an inventive IME-intron is preferably shorter than 1,000 base pairs, more preferably shorter than 900 bp, most preferably shorter than 800 bp.
- the branchpoint sequence of the intron identified by a method of the invention is described by the nucleotide sequences 5'-CURAY-3' (SEQ ID NO. 75) or 5'-YURAY-3' (SEQ ID NO. 76), wherein the U and A are essential nucleotides, and purines and pyrimidines are preferred nucleotides at positions 3 and 5 respectively. In position 1, pyrimidines are preferred, with C preferred over U.
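A minimal sketch of how the branchpoint consensus could be searched in an intron sequence follows. It assumes the intron is supplied as genomic DNA, so the U of the RNA consensus is matched as T, and it expands the IUPAC codes R (A or G) and Y (C or T) into regular-expression character classes. The function name `find_branchpoints` is hypothetical, and the sketch does not restrict where in the intron the motif must lie.

```python
import re

# R = purine (A/G), Y = pyrimidine (C/T); U of the RNA consensus is T in DNA.
# A lookahead is used so that overlapping matches are also reported.
BRANCHPOINT_YURAY = re.compile(r"(?=([CT]T[AG]A[CT]))")
BRANCHPOINT_CURAY = re.compile(r"(?=(CT[AG]A[CT]))")


def find_branchpoints(intron_dna: str, strict: bool = False) -> list[tuple[int, str]]:
    """Return (0-based position, motif) for every consensus match.

    strict=True keeps only the 5'-CURAY-3' form; otherwise 5'-YURAY-3' is used.
    """
    seq = intron_dna.upper().replace("U", "T")
    pattern = BRANCHPOINT_CURAY if strict else BRANCHPOINT_YURAY
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq)]
```

For the toy sequence `GTAAGTCTGACTTTTTAATCAG`, the relaxed pattern reports `CTGAC` at position 6 and `TTAAT` at position 14, while the strict pattern keeps only `CTGAC`.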
- the sequence context of the 5' splice site surrounding the GT dinucleotide may vary. Preferred are 5' splice sites of the sequence 5'-RR/GT(RT)(RT)(GY)-3' (SEQ ID NO. 77), wherein R stands for the nucleotides G or A, and Y stands for the nucleotides C or T. The nucleotides given in brackets describe alternative nucleotides at the respective position.
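The preferred 5' splice site can be checked with a short sketch. It assumes the slash in the consensus marks the exon/intron boundary, so the two R positions are the last two exon nucleotides and GT opens the intron; a bracketed pair such as (RT) is read as "R or T", that is A, G or T, and (GY) as G, C or T. The function name `has_preferred_5prime_site` is hypothetical.

```python
import re

# RR / GT (R|T) (R|T) (G|Y) expanded to plain nucleotide classes.
FIVE_PRIME_SITE = re.compile(r"[AG]{2}GT[AGT]{2}[CGT]")


def has_preferred_5prime_site(upstream_exon: str, intron: str) -> bool:
    """Check the last 2 exon nt plus the first 5 intron nt against the consensus."""
    exon = upstream_exon.upper()
    intr = intron.upper()
    if len(exon) < 2 or len(intr) < 5:
        return False
    window = exon[-2:] + intr[:5]
    return FIVE_PRIME_SITE.fullmatch(window) is not None
```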
- the adenine/thymine (AT) content of an inventive intron over the entire sequence is at least 50%, more preferably at least 55%, even more preferably at least 60%.
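The length and AT-content criteria are simple to compute. The sketch below assumes a DNA string, counts only unambiguous A, C, G and T when computing the percentage, and uses the broadest thresholds named in the text (shorter than 1,000 base pairs, at least 50% AT). The function names are hypothetical.

```python
def at_content(seq: str) -> float:
    """Percent A+T over all unambiguous nucleotides of the sequence."""
    counted = [base for base in seq.upper().replace("U", "T") if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * sum(base in "AT" for base in counted) / len(counted)


def passes_length_and_at(seq: str, max_length: int = 1000, min_at: float = 50.0) -> bool:
    """True if the intron is shorter than max_length and has at least min_at percent AT."""
    return len(seq) < max_length and at_content(seq) >= min_at
```

The more preferred tiers (for example shorter than 800 bp and at least 60% AT) can be applied by passing `max_length=800, min_at=60.0`.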
- the population of plant introns to which the inventive methods will be applied comprises a) substantially all introns of a plant genome represented in a DNA sequence database or b) a plant genomic DNA library.
- the population of introns to which the inventive methods will be applied is selected from the group consisting of a) introns located between two protein encoding exons, and b) introns located within the 5' untranslated region of the corresponding gene.
- the coding regions and the 5' untranslated regions from a set of genes can be screened for the presence of introns located in said regions and the identified introns are subsequently screened using one of the inventive methods.
- Such an in silico identification process using bioinformatics tools known to the persons skilled in the art can be performed by screening a) specific DNA sequence databases (e.g., containing solely coding regions or the 5' untranslated regions), or b) other publicly accessible genomic DNA sequences containing databases.
- the introns with expression enhancing properties located in the 5' untranslated regions are identified by a method comprising the steps of: a. identifying coding sequences within a set of genes present in a sequence database, and b. identifying EST sequences corresponding to the genes identified under (a), and c. comparing said coding sequences and EST sequences with the genomic sequence of the respective genes, and d. selecting EST sequences comprising the 5' untranslated region, and e. identifying introns located in said 5' untranslated regions.
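Steps c to e above can be sketched under strong simplifying assumptions: an EST has already been aligned to the genomic sequence on the plus strand, the alignment is given as a list of (start, end) genomic blocks in 0-based half-open coordinates, and the genomic position of the start codon is known from the coding sequence. The representation and the function name `utr5_introns` are hypothetical; real spliced-alignment output would need to be converted into this form.

```python
def utr5_introns(est_blocks: list[tuple[int, int]], cds_start: int) -> list[tuple[int, int]]:
    """Return genomic (start, end) intervals of introns lying upstream of the start codon.

    A gap between two consecutive aligned EST blocks is treated as an intron;
    it is reported only if it ends at or before cds_start.
    """
    ordered = sorted(est_blocks)
    introns = []
    for (_, left_end), (right_start, _) in zip(ordered, ordered[1:]):
        if right_start > left_end and right_start <= cds_start:
            introns.append((left_end, right_start))
    return introns
```

For blocks `[(100, 150), (400, 520), (700, 900)]` and a start codon at position 450, only the gap `(150, 400)` lies in the 5' untranslated region.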
- the steps of retrieving or generating DNA sequences or the generation of specific DNA sequence database and screening the same e.g.
- the introns were selected from a population of introns derived from monocotyledonous plants; especially preferred are monocotyledonous plants selected from the group consisting of the genera Hordeum, Avena, Secale, Triticum, Sorghum, Zea, Saccharum and Oryza.
- the population of introns to which the inventive methods will be applied are selected from a population of plant genes representing the 10% fraction (9th decile) of genes with the highest expression rate in a gene expression analysis experiment performed using a plant cell, plant tissue or a whole plant.
- gene expression analysis systems could be employed in accordance with the instant invention, including, but not limited to microarray analysis, "digital northern", clone distribution analysis of cDNA libraries using the "DNA sequencing by hybridization method" (Strezoska, Z. et al. (1991) Proc. Natl. Acad. Sci. USA 88:10089-10093) and Serial Analysis of Gene Expression (SAGE, Velculescu, V. E. et al. (1995) Science 270:484-487).
- RNA array analysis has become a standard technique in the molecular biology laboratory for monitoring gene expression.
- Arrays can be made either by the mechanical spotting of pre-synthesized DNA products or by the de novo synthesis of oligonucleotides on a solid substrate, usually a derivatized glass slide.
- arrays are used to detect the presence of mRNAs that may have been transcribed from different genes and which encode different proteins.
- the RNA is extracted from many cells, or from a single cell type, then converted to cDNA or cRNA.
- the copies may be "amplified" by (RT-) PCR.
- Fluorescent tags are enzymatically incorporated into the newly synthesized strands or can be chemically attached to the new strands of DNA or RNA.
- a cDNA or cRNA molecule that contains a sequence complementary to one of the single-stranded probe sequences will hybridize, or stick, via base pairing to the spot at which the complementary probes are affixed. The spot will then fluoresce when examined using a microarray scanner. Increased or decreased fluorescence intensity indicates that cells in the sample have recently transcribed, or ceased transcription, of a gene that contains the probed sequence. The intensity of the fluorescence is proportional to the number of copies of a particular mRNA that were present and thus roughly indicates the activity or expression level of that gene.
- Microarrays (and the respective equipment needed to perform the expression analysis experiments) that can be employed in accordance with the present invention are commercially available.
- the GeneChip Arabidopsis ATH1 Genome Array produced by Affymetrix (Santa Clara, CA) contains more than 22,500 probe sets representing approximately 24,000 genes.
- the array is based on information from the international Arabidopsis sequencing project that was formally completed in December 2000 (http://www.affymetrix.com).
- the expression rate of the analyzed genes can be ranked (according to the intensity of the fluorescence of the respective genes after the hybridization process) and the genes belonging to the 10% of genes showing the highest gene expression rate can be identified by using microarray analysis.
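Ranking genes by signal and keeping the top 10% can be sketched as follows. The sketch assumes one intensity value per gene is already available as a dictionary (normalization and replicate handling are outside its scope); the function name `top_fraction` and the gene identifiers in the test data are hypothetical.

```python
def top_fraction(expression: dict[str, float], fraction: float = 0.10) -> list[str]:
    """Return the gene identifiers with the highest expression values.

    Genes are ranked by descending value and the top `fraction` is kept
    (at least one gene whenever the input is non-empty).
    """
    ranked = sorted(expression, key=expression.get, reverse=True)
    keep = max(1, int(len(ranked) * fraction))
    return ranked[:keep]
```

The stricter fractions mentioned elsewhere in the text (5%, 3% or 1%) correspond to other values of `fraction`.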
- microarray expression profiling results are publicly available via the internet, e.g. the Nottingham Arabidopsis Stock Center's microarray database or the OSMID (osmotic stress microarray information) database.
- the Nottingham Arabidopsis Stock Center's microarray database contains a wide selection of microarray data from Affymetrix gene chips (http://affymetrix.arabidopsis.info).
- the OSMID database (http://www.osmid.org) contains the results of approximately 100 microarray experiments performed at the University of Arizona.
- the expression rate of genes can be ranked (according to the clone distribution of the respective cDNA in the library) and genes belonging to the 10% of genes showing the highest (abundance) gene expression rate can be identified.
- the SAGE method is a further development of this technique, which requires only nine nucleotides as a tag, therefore allowing a larger throughput.
- the expression rate of the analyzed genes by using the "digital Northern" method can be ranked (according to the abundance of the tags of the respective gene in the cDNA library) and the genes belonging to the 10% of genes showing the highest (abundance) gene expression rate can be identified.
- mRNA is then extracted from each of the collected samples and used for the library production.
- the libraries can be generated from mRNA purified on oligo dT columns. Colonies from transformation of the cDNA library into E. coli are randomly picked and placed into microtiter plates, and their DNA is subsequently spotted onto a surface. The cDNA inserts from each clone from the microtiter plates are PCR amplified and spotted onto a nylon membrane. A battery of 288 ³³P-radiolabeled seven-mer oligonucleotides is then sequentially hybridized to the membranes.
- a blot image is captured during a phosphorimage scan to generate a profile for each single oligonucleotide. Absolute identity is maintained by barcoding for image cassette, filter and orientation within the cassette.
- the filters are then treated using relatively mild conditions to strip the bound probes and then returned to the hybridization chambers for another round.
- the hybridization and imaging cycle is repeated until the set of 288 oligomers is completed. After completion of hybridizations, each spot (representing a cDNA insert) will have recorded the amount of radio signal generated from each of the 288 seven-mer oligonucleotides.
- the profile of which oligomers bound, and to what degree, to each single cDNA insert (a spot on the membrane) is defined as the signature generated from that clone.
- Each clone's signature is compared with all other signatures generated from the same organism to identify clusters of related signatures. This process "sorts" all of the clones from an organism into so called "clusters" before sequencing.
- In the clustering process, complex or tissue specific cDNA libraries are "mined" using a series of 288 seven base-pair oligonucleotides. By collecting data on the hybridization signature of these oligos, the random set of clones in a library can be sorted into "clusters".
- a cluster is indicative of the abundance of each gene in a particular library and is therefore a measure of the gene expression rate of an individual gene.
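The idea that cluster size reflects transcript abundance can be illustrated with a deliberately simplified sketch. It assumes each clone's signature is a list of hybridization signals (one per oligonucleotide, 288 in the text), binarizes each signal at a fixed threshold, and groups clones whose binarized signatures are identical. Real signature clustering would tolerate noise and group related rather than identical signatures; the threshold and the function name `cluster_by_signature` are assumptions made for illustration only.

```python
from collections import defaultdict


def cluster_by_signature(signatures: dict[str, list[float]],
                         threshold: float = 0.5) -> list[list[str]]:
    """Group clone ids with identical binarized signatures, largest cluster first."""
    clusters: dict[tuple[bool, ...], list[str]] = defaultdict(list)
    for clone_id, signal in signatures.items():
        key = tuple(value >= threshold for value in signal)
        clusters[key].append(clone_id)
    # Cluster size serves as the abundance estimate for the underlying transcript.
    return sorted(clusters.values(), key=len, reverse=True)
```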
- the expression rate of genes can be ranked using the 'HySeq' technology and the genes belonging to the 10% of genes showing the highest (abundance) gene expression rate can be identified.
- genes, cDNAs or expressed sequence tags chosen for the identification of the inventive introns belong to the 10%, preferably 5%, more preferably 3%, most preferably 1% of genes showing the highest gene expression rate in a gene expression analysis experiment, wherein the gene expression rate can be calculated indirectly by using the above described methods.
- the nucleic acid sequences of the genes belonging to the 10% of genes showing the highest gene expression rate were used to isolate the complete genomic DNA sequence, including the intron sequences, of the respective genes by screening of e.g. appropriate databases containing DNA sequences, or genomic DNA or genomic DNA libraries using hybridization methods or RACE cloning techniques (rapid amplification of cDNA ends), or chromosome walking techniques.
- the intron sequences present in said genes were screened using the above described criteria to identify those introns, having expression enhancing properties.
- the described in silico methods for the selection of introns with expression enhancing properties have a high probability of success, but the efficiency of the described methods may be further increased by combination with other methods. Therefore, in one preferred embodiment of the invention independent validation of the genes representing the 10% of genes showing the highest gene expression rate in a gene expression analysis experiment is done using alternative gene expression analysis tools, like Northern analysis, or real time PCR analysis (see examples).
- the method for the identification or enrichment of introns with gene expression enhancing properties in plants is applied to DNA sequence databases using an automated process, more preferably using a computer device and an algorithm that defines the instructions needed for accomplishing the selection steps for identifying or enriching introns with gene expression enhancing properties in plants within the screened population of DNA sequences.
- a further embodiment of the invention is a computer algorithm that defines the instructions needed for accomplishing the selection steps for identifying or enriching introns with plant gene expression enhancing properties as described above.
- Useful computer algorithms are well known in the art of bioinformatics or computational biology. Bioinformatics or computational biology is the use of mathematical and informational techniques to analyze sequence data (e.g.
- bioinformatics are the data mining and analysis of data gathered from different sources. Other areas are sequence alignment and protein structure prediction.
- Another aspect of bioinformatics in sequence analysis is the automatic search for genes or regulatory sequences within a genome (e.g. intron sequences within a stretch of genomic DNA). Sequence databases can be searched using a variety of methods. The most common is probably searching for a sequence similar to a certain target gene whose sequence is already known to the user.
- a useful program of this type is BLAST (Basic Local Alignment Search Tool).
- BLAST is an algorithm for comparing biological sequences, such as DNA sequences of different genes.
- Given a library or database of sequences, a BLAST search enables a researcher to look for specific sequences.
- the BLAST algorithm and a computer program that implements it were developed by Stephen Altschul at the U.S. National Center for Biotechnology Information (NCBI) and are available on the web at http://www.ncbi.nlm.nih.gov/BLAST.
- the BLAST program can either be downloaded and run as a command-line utility "blastall" or accessed for free over the web.
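A command-line nucleotide search with the legacy `blastall` utility could be scripted roughly as below. The sketch assumes `blastall` is installed and on the PATH and that a nucleotide database has already been formatted; the file names and database name are placeholders. The options used are the legacy ones: `-p` (program), `-d` (database), `-i` (query file), `-o` (output file), `-e` (expectation value cutoff) and `-m 8` (tabular output). The E-value default is an arbitrary illustration, not a value from the text.

```python
import subprocess


def blastn_command(query_fasta: str, database: str, out_path: str,
                   evalue: float = 1e-10) -> list[str]:
    """Build the argument list for a nucleotide-nucleotide search with legacy blastall."""
    return [
        "blastall",
        "-p", "blastn",      # program: nucleotide query vs nucleotide database
        "-d", database,      # name of the pre-formatted database
        "-i", query_fasta,   # FASTA file with the query sequence(s)
        "-o", out_path,      # where the report is written
        "-e", str(evalue),   # expectation value cutoff
        "-m", "8",           # tabular output, one hit per line
    ]


def run_blastn(query_fasta: str, database: str, out_path: str) -> None:
    """Run the search; raises CalledProcessError if blastall exits with an error."""
    subprocess.run(blastn_command(query_fasta, database, out_path), check=True)
```

Building the argument list separately from running it keeps the command inspectable and avoids shell quoting issues.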
- the BLAST web server hosted by the NCBI, allows anyone with a web browser to perform similarity searches against constantly updated databases of proteins and DNA that include most of the newly sequenced organisms.
- BLAST is actually a family of programs (all included in the blastall executable) including, among others, the Nucleotide-nucleotide BLAST (BLASTN).
- This program, given a DNA query, returns the most similar DNA sequences from the DNA database that the user specifies.
- a person skilled in the art knows how to produce or retrieve sequence data from e.g. public sequence databases and to design algorithms to screen the set of sequences in a customized way (see examples).
- the invention relates to a computer algorithm that defines the instructions needed for accomplishing the selection steps for identifying or enriching introns with gene expression enhancing properties in plants from a plant genome or a population of introns selected from the group consisting of introns located between two protein encoding exons, and/or introns located within the 5' untranslated region of the corresponding gene and/or introns located in the DNA sequences of genes representing the 10% fraction of genes with the highest expression rate in a gene expression analysis experiment performed using a plant cell, plant tissue and/or a whole plant.
- Another embodiment of the invention is a computer device or data storage device comprising the algorithm.
- A storage device can be a hard disc (or hard drive) or an optical data storage medium like a CD-ROM (Compact Disc Read-Only Memory) or DVD (digital versatile disc), or any other mechanical, magnetic or optical data storage medium.
- Another embodiment of the invention relates to a method for isolating, providing or producing an intron with gene expression enhancing properties in plants comprising the steps of a) performing an identification or enrichment of introns with gene expression enhancing properties in plants as described above and providing the sequence information of said identified or enriched introns, and b) providing the physical nucleotide sequence of said introns identified or enriched under a), and c) evaluating the gene expression enhancing properties of the intron sequence provided under b) in an in vivo or in vitro expression experiment, and d) isolating introns from said expression experiment c) which demonstrate expression enhancing properties.
- Evaluation of the gene expression enhancing properties of the isolated introns comprises c1) providing a recombinant expression construct by functionally linking an individual nucleotide sequence from step b) with at least one promoter sequence functioning in plants or plant cells, and at least one readily quantifiable nucleic acid sequence, and c2) introducing said recombinant DNA expression construct in plant cells and evaluating the gene expression enhancing properties of the isolated intron.
- The evaluation of the gene expression enhancing properties is done in a plant cell or stably transformed plants, wherein said isolated intron enhances expression of a given gene at least twofold (see examples).
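The twofold criterion can be made concrete with a small calculation. The sketch below assumes that reporter activity is measured in replicate for a construct with the intron and for an intron-less control, and that the means are compared; the function names and the sample readings are hypothetical.

```python
from statistics import mean

def fold_enhancement(with_intron, control):
    """Ratio of mean reporter signal with the intron to the intron-less control."""
    baseline = mean(control)
    if baseline <= 0:
        raise ValueError("control signal must be positive")
    return mean(with_intron) / baseline

def is_enhancing(with_intron, control, threshold=2.0):
    """True if the intron raises expression at least `threshold`-fold."""
    return fold_enhancement(with_intron, control) >= threshold

# Made-up reporter readings in arbitrary units.
intron_readings = [30.0, 34.0, 32.0]
control_readings = [10.0, 9.0, 11.0]
fold = fold_enhancement(intron_readings, control_readings)
```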
- An additional subject matter of the invention relates to a recombinant DNA expression construct comprising at least one promoter sequence functioning in plant cells, at least one nucleic acid sequence and at least one intron selected from the group consisting of the sequences described by SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22, and functional equivalents thereof, wherein said promoter sequence and at least one of said intron sequences are functionally linked to said nucleic acid sequence and wherein said intron is heterologous to said nucleic acid sequence or to said promoter sequence.
- The invention relates to recombinant expression constructs comprising at least one promoter sequence functioning in plant cells, at least one nucleic acid sequence and at least one functional equivalent of an intron described by any of sequences SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22.
- Said functional equivalents comprise the functional elements of an intron, wherein said promoter sequence and at least one of said intron sequences are functionally linked to said nucleic acid sequence and wherein said intron is heterologous to said nucleic acid sequence or to said promoter sequence.
- The functional equivalent is further characterized by i) having at least 50 consecutive base pairs of the intron sequence described by any of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22, or ii) having an identity of at least 80% over a sequence of at least 95 consecutive nucleic acid base pairs to a sequence described by any of SEQ ID NOs: 1, 2, 3, 5, 6,
- The introns comprise at least 50 base pairs, more preferably at least 40 base pairs, most preferably 30 base pairs of the sequences/exons 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
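The two sequence criteria for a functional equivalent (a shared run of at least 50 consecutive base pairs, or at least 80% identity over at least 95 consecutive base pairs) can be checked in silico. The sketch below is a simplified illustration: it compares ungapped windows only, runs in quadratic time, and covers the sequence criteria alone, not the expression enhancing function. In practice an alignment tool would normally be used; all function names are hypothetical.

```python
def longest_shared_run(a, b):
    """Length of the longest stretch of consecutive bases common to a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for base_a in a:
        cur = [0] * (len(b) + 1)
        for j, base_b in enumerate(b, start=1):
            if base_a == base_b:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best

def best_window_identity(a, b, window=95):
    """Highest ungapped identity between any `window`-long stretches of a and b."""
    best = 0.0
    for i in range(len(a) - window + 1):
        for j in range(len(b) - window + 1):
            matches = sum(x == y for x, y in zip(a[i:i + window], b[j:j + window]))
            best = max(best, matches / window)
    return best

def meets_sequence_criteria(candidate, reference, min_run=50,
                            window=95, min_identity=0.80):
    """Criterion i) OR criterion ii) from the text, on uppercase DNA strings."""
    return (longest_shared_run(candidate, reference) >= min_run
            or best_window_identity(candidate, reference, window) >= min_identity)

# Toy sequences, not real introns.
reference = "ACGT" * 30          # 120 bp
fragment = reference[10:70]      # shares a 60 bp run with the reference
```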
- the recombinant DNA expression construct of the invention further comprises one or more additional regulatory sequences functionally linked to a promoter.
- Those regulatory sequences can be selected from the group consisting of heat shock-, anaerobic responsive-, pathogen responsive-, drought responsive-, low temperature responsive- and ABA responsive-elements, 5'-untranslated gene region, 3'-untranslated gene region, transcription terminators, polyadenylation signals and enhancers.
- Cis- and trans-acting factors involved in ABA-induced gene expression have been reviewed by Bray (1997) Trends Plant Sci. 2:48-54; Busk et al. (1998) Plant Mol. Biol. 37:425-435 and Shinozaki and Yamaguchi-Shinozaki (2000) Curr. Opin. Plant Biol. 3:217-223.
- Many ABA-inducible genes contain a conserved, ABA-responsive, cis-acting element named ABRE (ABA-responsive element; PyACGTGGC) in their promoter regions (Guiltinan et al. (1990) Science 250:267-271; Mundy et al. (1990) Proc. Natl. Acad. Sci. USA 87:406-410).
- The promoter region of the rd29A gene was analyzed, and a novel cis-acting element responsible for dehydration- and cold-induced expression was identified at the nucleotide sequence level (Yamaguchi-Shinozaki and Shinozaki (1994) Plant Cell 6:251-264).
- A 9-bp conserved sequence, TACCGACAT, termed the dehydration-responsive element (DRE), is essential for the regulation of dehydration-responsive gene expression.
- DRE-related motifs have been reported in the promoter regions of cold- and drought-inducible genes such as kin1, cor6.6 and rd17 (Wang et al. (1995) Plant Mol. Biol. 28:605-617; Iwasaki et al.
- Thermoinducibility of the heat shock genes is attributed to activation of heat shock factors (HSF).
- HSF act through a highly conserved heat shock promoter element (HSE) that has been defined as adjacent and inverse repeats of the motif 5'-nGAAn-3' (Amin et al. (1988) Mol Cell Biol 8:3761-3769).
- W-box (TTGACY) and W-box-like elements represent binding sites for plant-specific WRKY transcription factors involved in plant development and plant responses to environmental stresses (Eulgem et al. (2000) Trends Plant Sci 5:199-206; Robatzek S et al.
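The cis-acting elements named above (ABRE, DRE, the nGAAn unit of the HSE, and the W-box) are short motifs that can be located in a promoter sequence by pattern matching. The following sketch translates the motifs into regular expressions using IUPAC ambiguity codes (Y = C or T, N = any base). It scans the forward strand only and reports a single nGAAn unit rather than the full HSE repeat structure; the test sequence is made up.

```python
import re

# Subset of IUPAC nucleotide codes needed for the motifs below.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "Y": "[CT]", "R": "[AG]", "N": "[ACGT]"}

MOTIFS = {
    "ABRE": "YACGTGGC",    # PyACGTGGC, Py = pyrimidine
    "DRE": "TACCGACAT",
    "HSE_unit": "NGAAN",   # one 5'-nGAAn-3' unit
    "W-box": "TTGACY",
}

def to_regex(motif):
    """Translate an IUPAC motif string into a regular expression."""
    return "".join(IUPAC[base] for base in motif)

def scan(sequence, motifs=MOTIFS):
    """Return (name, 0-based start, matched text) for every forward-strand hit."""
    seq = sequence.upper()
    hits = []
    for name, motif in motifs.items():
        # A lookahead reports overlapping occurrences as well.
        pattern = re.compile(f"(?=({to_regex(motif)}))")
        for match in pattern.finditer(seq):
            hits.append((name, match.start(), match.group(1)))
    return sorted(hits, key=lambda hit: hit[1])

# Made-up sequence containing one DRE, one W-box and one ABRE.
hits = scan("GGTACCGACATCCTTGACCAACACGTGGCTT")
```

Scanning the reverse complement as well would be a natural extension, since these elements can function in either orientation.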
- Such regulatory sequences or elements that can be employed in conjunction with a described promoter encompass the 5'-untranslated regions, enhancer sequences and plant polyadenylation signals.
- Examples of translation enhancers which may be mentioned are the tobacco mosaic virus 5' leader sequence (Gallie et al. (1987) Nucl Acids Res 15:8693-8711), the enhancer from the octopine synthase gene and the like. Furthermore, they may promote tissue specificity (Rouster J et al.
- The recombinant DNA expression construct will typically include the gene of interest along with a 3' end nucleic acid sequence that acts as a signal to terminate transcription and subsequent polyadenylation of the RNA.
- Preferred plant polyadenylation signals are those which essentially correspond to T-DNA polyadenylation signals from Agrobacterium tumefaciens, in particular gene 3 of the T-DNA (octopine synthase) of the Ti plasmid pTiACH5 (Gielen et al. (1984) EMBO J 3:835-46) or functional equivalents thereof.
- Terminator sequences which are especially suitable are the OCS (octopine synthase) terminator and the NOS (nopaline synthase) terminator.
- An expression cassette and the vectors derived from it may comprise further functional elements.
- the term functional element is to be understood in the broad sense and refers to all those elements, which have an effect on the generation, amplification or function of the expression cassettes, vectors or recombinant organisms according to the invention. The following may be mentioned by way of example, but not by limitation:
- Selection markers are useful to select and separate successfully transformed or homologously recombined cells.
- A selectable marker confers resistance to a biocide (for example a herbicide), a metabolism inhibitor such as 2-deoxyglucose-6-phosphate (WO 98/45456) or an antibiotic on the cells which have successfully undergone recombination.
- The selection marker permits the selection of the transformed cells from untransformed ones (McCormick et al. (1986) Plant Cell Reports 5:81-84).
- Selection markers confer a resistance to a biocidal compound such as a metabolic inhibitor (e.g., 2-deoxyglucose-6-phosphate, WO 98/45456), antibiotics (e.g., kanamycin, G 418, bleomycin or hygromycin) or herbicides (e.g., phosphinothricin or glyphosate).
- Especially preferred negative selection markers are those which confer resistance to herbicides. Examples which may be mentioned are:
- Phosphinothricin acetyltransferases (also named Bialaphos resistance; bar; de Block et al. (1987) EMBO J 6:2513-2518)
- 5-Enolpyruvylshikimate-3-phosphate synthase (EPSPS)
- Glyphosate degrading enzymes (Glyphosate oxidoreductase; gox)
- Kanamycin- or G418-resistance genes (NPTII; NPTI)
- 2-Desoxyglucose-6-phosphate phosphatase (DOGR1 gene product; WO 98/45456; EP 0 807 836) conferring resistance against 2-desoxyglucose (Randez-Gil et al. (1995) Yeast 11:1233-1240).
- Additional suitable negative selection markers are the aadA gene, which confers resistance to the antibiotic spectinomycin, the streptomycin phosphotransferase (SPT) gene, which allows resistance to streptomycin, and the hygromycin phosphotransferase (HPT) gene, which mediates resistance to hygromycin.
- Further negative selection markers confer resistance against the toxic effects imposed by D-amino acids such as D-alanine and D-serine (WO 03/060133; Erikson 2004).
- Especially preferred as negative selection markers in this context are the dao1 gene (EC: 1.4.
- GenBank Acc.-No.: U60066
- yeast Rhodotorula gracilis (Rhodosporidium toruloides)
- E. coli gene dsdA, D-serine dehydratase (D-serine deaminase) [EC: 4.3.1.18; GenBank Acc.-No.: J01603].
- Counter selection markers are especially suitable to select organisms with defined deleted sequences comprising said marker (Koprek T et al. (1999) Plant J 19(6): 719-726).
- Examples of counter selection markers comprise thymidine kinases (TK), cytosine deaminases (Gleave AP et al. (1999) Plant Mol Biol. 40(2):223-35; Perera RJ et al. (1993) Plant Mol. Biol 23(4):793-799; Stougaard J. (1993) Plant J 3:755-761), cytochrome P450 proteins (Koprek et al.
- Positive selection markers can also be employed.
- Genes like isopentenyltransferase from Agrobacterium tumefaciens may, as a key enzyme of cytokinin biosynthesis, facilitate regeneration of transformed plants (e.g., by selection on cytokinin-free medium).
- Corresponding selection methods are described (Ebinuma 2000a, b). Additional positive selection markers, which confer a growth advantage to a transformed plant in comparison with a non-transformed one, are described e.g., in EP-A 0 601 092.
- Growth stimulation selection markers may include (but shall not be limited to) β-glucuronidase (in combination with e.g., a cytokinin glucuronide), mannose-6-phosphate isomerase (in combination with mannose), UDP-galactose-4-epimerase (in combination with e.g., galactose), wherein mannose-6-phosphate isomerase in combination with mannose is especially preferred.
- 2) Reporter genes
- Reporter genes encode readily quantifiable proteins and, via their color or enzyme activity, make possible an assessment of the transformation efficacy, the site of expression or the time of expression.
- genes encoding reporter proteins such as the green fluorescent protein (GFP) (Sheen et al. (1995) Plant Journal 8(5):777-784; Haseloff et al. (1997) Proc Natl Acad Sci USA 94(6):2122-2127; Reichel et al. (1996) Proc Natl Acad Sci USA 93(12):5888-5893; Tian et al.
- Origins of replication which ensure amplification of the expression cassettes or vectors according to the invention in, for example, E. coli.
- Examples which may be mentioned are ORI (origin of DNA replication), the pBR322 ori or the P15A ori (Sambrook et al.: Molecular Cloning. A Laboratory Manual, 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989).
- the inventive recombinant expression construct contains expressible nucleic acid sequences in addition to, or other than, nucleic acid sequences encoding for marker proteins.
- The recombinant DNA expression construct comprises a nucleic acid sequence encoding i) a protein or ii) a sense, antisense, or double-stranded RNA sequence.
- The recombinant DNA expression construct contains a nucleic acid sequence encoding a protein.
- The recombinant DNA expression construct may contain a DNA for the purpose of expressing RNA transcripts that function to affect plant phenotype without being translated into a protein.
- RNAi double strand forming RNA molecules
- The transgenic expression constructs of the invention can be employed for suppressing or reducing expression of endogenous target genes by "gene silencing".
- The skilled worker knows preferred genes or proteins whose suppression brings about an advantageous phenotype. Examples may include but are not limited to down-regulation of the β-subunit of Arabidopsis G protein for increasing root mass (Ullah et al.
- The transgenic expression constructs of the invention contain nucleic acids which, when transcribed, produce RNA enzymes (ribozymes) which can act as endonucleases and catalyze the cleavage of RNA molecules with selected sequences. The cleavage of the selected RNA can result in the reduced production of their encoded polypeptide products.
- Ribozymes have specific catalytic domains that possess endonuclease activity (Kim and Cech 1987, Proc. Natl. Acad. Sci. USA, 84:8788-8792; Gerlach et al., 1987, Nature, 328:802-805; Forster and Symons, 1987, Cell, 49:211-220).
- Several different ribozyme motifs have been described with RNA cleavage activity (Symons, 1992, Annu. Rev. Biochem., 61:641-671). Examples include sequences from group I self-splicing introns including Tobacco Ringspot Virus (Prody et al., 1986, Science, 231:1577-1580).
- ribozymes include sequences from RNaseP with cleavage activity (Yan et al. (1992) Proc. Natl. Acad. Sci. USA 87:4144-4148), hairpin ribozyme structures (Berzal-Herranz et al. (1992) Genes and Devel. 98:1207-1210) and Hepatitis Delta virus based ribozyme (U.S. Pat. No. 5,625,047).
- The general design and optimization of ribozyme-directed RNA cleavage activity has been discussed in detail (Haseloff and Gerlach (1988) Nature 334:585-591; Symons (1992) Annu. Rev. Biochem. 61:641-671).
- the choice of a particular nucleic acid sequence to be delivered to a host cell or plant depends on the aim of the transformation. In general, the main goal of producing transgenic plants is to add some beneficial traits to the plant.
- The recombinant expression construct comprises a nucleic acid sequence encoding a selectable marker protein, a screenable marker protein, an anabolic active protein, a catabolic active protein, a biotic or abiotic stress resistance protein, a male sterility protein or a protein affecting plant agronomic characteristics.
- Such traits include, but are not limited to, herbicide resistance or tolerance, insect resistance or tolerance, disease resistance or tolerance (viral, bacterial, fungal, nematode); stress tolerance, as exemplified by tolerance to drought, heat, chilling, freezing, salt stress, oxidative stress; increased yield, food content, male sterility, starch quantity and quality, oil content and quality, vitamin content and quality (e.g. vitamin E) and the like.
- The recombinant expression constructs of the invention can comprise artificial transcription factors (e.g. of the zinc finger protein type; Beerli (2000) Proc Natl Acad Sci USA 97(4): 1495
- Nucleic acids are those which encode the transcriptional activator CBF1 from Arabidopsis thaliana (GenBank Acc. No.: U77378) or the Myoxocephalus octodecemspinosus antifreeze protein (GenBank Acc. No.: AF306348), or functional equivalents of these.
- the nucleic acid molecule must be linked operably to a suitable promoter.
- The plant-specific promoter, regulatory element and terminator of the inventive recombinant expression construct need not be of plant origin, and may originate from viruses or microorganisms, in particular for example from viruses which attack plant cells.
- An additional subject matter of the invention is the introduction of an inventive intron sequence into a target nucleic acid sequence via homologous recombination (HR).
- the recombinant expression construct must contain fragments of the target nucleic acid sequence of sufficient length and homology.
- The intron sequence that is to be inserted into the gene of interest via HR is (within the recombinant expression construct) placed between a pair of DNA sequences identical to the regions 5' and 3' to the preferred place of insertion.
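The layout of such an HR insert (left homology arm, intron, right homology arm) can be sketched on plain strings. The function name, the coordinate convention and the default arm length are assumptions made for illustration; the text only requires flanking fragments "of sufficient length and homology".

```python
def build_hr_insert(target, insertion_site, intron, arm_length=500):
    """Return left arm + intron + right arm for insertion at `insertion_site`.

    `insertion_site` is a 0-based position in `target`; the intron is placed
    between target[insertion_site - 1] and target[insertion_site].
    """
    if not 0 <= insertion_site <= len(target):
        raise ValueError("insertion site lies outside the target sequence")
    left_arm = target[max(0, insertion_site - arm_length):insertion_site]
    right_arm = target[insertion_site:insertion_site + arm_length]
    return left_arm + intron + right_arm

# Toy example: lowercase marks the inserted intron.
insert = build_hr_insert("AAAACCCC", 4, "gtaagtttcag", arm_length=3)
```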
- The recombinant expression construct can comprise only the intron sequence and the nucleic acid sequences needed to induce the HR event.
- The intron sequence that is flanked by the nucleic acid sequence of the target DNA contains an expression cassette that enables the expression of a selectable marker protein which allows the selection of transgenic plants in which a homologous or illegitimate recombination has occurred subsequent to the transformation.
- The expression cassette driving the expression of the selection marker protein can be flanked by HR control sequences that are recognized by specific endonucleases or recombinases, facilitating the removal of the expression cassette from the genome.
- Marker excision methods, e.g. the cre/lox technology, permit a tissue-specific, if appropriate inducible, removal of the expression cassette from the genome of the host organism (Sauer B (1998) Methods. 14(4):381-92).
- Specific flanking sequences, which later allow removal by means of cre recombinase, are attached to the target gene.
- The present invention relates to transgenic expression cassettes comprising the following introns with gene expression enhancing properties in plants:
- the gene comprises two introns and three exons.
- The first intron of the Oryza sativa metallothionein-like gene (BPSI.1, SEQ ID NO:1) is flanked by the 5' (5'-GU-3', base pair (bp) 1-2 in SEQ ID NO:1) and 3' (5'-CAG-3', bp 582-584 in SEQ ID NO:1) splice sites.
- The first intron of the Oryza sativa metallothionein-like gene (BPSI.1, SEQ ID NO:1) comprises at least 28 base pairs, more preferably at least 40 base pairs, most preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively (SEQ ID NO: 82).
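The two properties described above (an intron that begins with GU, i.e. GT in DNA, and ends with CAG, taken together with a stretch of adjacent exon sequence on each side) can be checked on a genomic sequence as follows. This is a minimal sketch: the function names and the 0-based, end-exclusive coordinates are assumptions, and the toy sequence is not a real rice sequence.

```python
def has_canonical_splice_sites(intron):
    """True if the intron starts with GT (GU in RNA) and ends with CAG."""
    seq = intron.upper().replace("U", "T")
    return seq.startswith("GT") and seq.endswith("CAG")

def intron_with_flanks(genomic, start, end, flank_5=28, flank_3=28):
    """Return intron genomic[start:end] plus adjacent exon bases on both sides."""
    if start - flank_5 < 0 or end + flank_3 > len(genomic):
        raise ValueError("not enough flanking sequence in the genomic fragment")
    return genomic[start - flank_5:end + flank_3]

# Toy fragment: 30 exon bases, a 15 bp intron, 30 exon bases.
genomic = "A" * 30 + "GT" + "T" * 10 + "CAG" + "C" * 30
intron = genomic[30:45]
with_flanks = intron_with_flanks(genomic, 30, 45)
```

The default of 28 flanking bases mirrors the minimum stated for BPSI.1; other introns in the text use different minimum flank lengths.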
- The Oryza sativa metallothionein-like gene shares high homology or identity with the coding region of orthologous genes from other monocotyledonous or dicotyledonous plants, e.g. 89% identity to the Zea mays CL1155_3 mRNA sequence (acc.
- the gene comprises 13 introns and 14 exons.
- The first intron of the Oryza sativa Sucrose UDP Glucosyltransferase-2 gene (BPSI.2, SEQ ID NO: 2) is flanked by the 5' (5'-GU-3', bp 1-2 in SEQ ID NO:2) and 3' (5'-CAG-3', bp 726-728 in SEQ ID NO: 2) splice sites.
- The first intron of the Oryza sativa Sucrose UDP Glucosyltransferase-2 gene comprises at least 19 base pairs of the sequence 5' to the 5'-splice site and 23 base pairs of the sequences/exons 3' to the 3'-splice site of the intron (SEQ ID NO: 83).
- The intron BPSI.2 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The second intron of the Oryza sativa Sucrose UDP Glucosyltransferase-2 gene comprises at least 25 base pairs of the sequence 5' to the 5'-splice site and 30 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 84).
- The intron BPSI.3 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The Oryza sativa Sucrose UDP Glucosyltransferase-2 gene shares high homology or identity with the coding region of orthologous genes from other monocotyledonous or dicotyledonous plants, e.g. 88% identity to the Zea mays sucrose synthase (Sus1) mRNA (acc. No. L22296.1), 85% identity to the Triticum aestivum mRNA for sucrose synthase type 2 (acc. No. AJ000153), 85% identity to the H. vulgare mRNA for sucrose synthase (acc. No.
- The eighth intron of the Oryza sativa gene encoding the Sucrose transporter comprises at least 35 base pairs of the sequence 5' to the 5'-splice site and 30 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 86).
- The intron BPSI.5 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The 5' and 3' splice sites of the eighth intron are modified in order to match the plant consensus sequences for 5' splice sites 5'-AG::GTAAGT-3' (SEQ ID NO: 80) and 3' splice sites 5'-CAG::GT-3' (SEQ ID NO: 81) using a PCR mutagenesis approach (SEQ ID NO:87).
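The intended result of the splice-site modification can be illustrated on sequence strings. The sketch below only designs the target sequence in silico; it does not model the PCR mutagenesis itself. It assumes that the consensus 5'-AG::GTAAGT-3' maps onto the last two exon bases and the first six intron bases, and that 5'-CAG::GT-3' maps onto the last three intron bases and the first two bases of the downstream exon. The function name and toy sequences are hypothetical.

```python
def apply_consensus_splice_sites(exon_5, intron, exon_3):
    """Overwrite the bases around both junctions with the plant consensus.

    5' splice site: ...AG | GTAAGT...   3' splice site: ...CAG | GT...
    Sequence lengths are preserved.
    """
    if len(exon_5) < 2 or len(exon_3) < 2 or len(intron) < 9:
        raise ValueError("sequences too short to carry both consensus sites")
    new_exon_5 = exon_5[:-2] + "AG"
    new_intron = "GTAAGT" + intron[6:-3] + "CAG"
    new_exon_3 = "GT" + exon_3[2:]
    return new_exon_5, new_intron, new_exon_3

# Toy sequences, for illustration only.
modified = apply_consensus_splice_sites("CCCTT", "GTCCGGAAAATTTAG", "AACCC")
```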
- The fourth intron of the Oryza sativa gene comprises at least 34 base pairs of the sequence 5' to the 5'-splice site and 34 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 88).
- The intron BPSI.6 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The 5' and 3' splice sites of the fourth intron are modified in order to match the plant consensus sequences for 5' splice sites 5'-AG::GTAAGT-3' (SEQ ID NO: 80) and 3' splice sites 5'-CAG::GT-3' (SEQ ID NO: 81) using a PCR mutagenesis approach (SEQ ID NO:89).
- BAB90130 (SEQ ID NO:7) comprises at least 34 base pairs of the sequence 5' to the 5'-splice site and 26 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 90).
- The intron BPSI.7 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The 5' and 3' splice sites of the fourth intron are modified in order to match the plant consensus sequences for 5' splice sites 5'-AG::GTAAGT-3' (SEQ ID NO: 80) and 3' splice sites 5'-CAG::GT-3' (SEQ ID NO: 81) using a PCR mutagenesis approach (SEQ ID NO:91).
- AP003300 (SEQ ID NO:10) comprises at least 31 base pairs of the sequence 5' to the 5'-splice site and 31 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 94).
- The intron BPSI.10 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The 5' and 3' splice sites of the third intron are modified in order to match the plant consensus sequences for 5' splice sites 5'-AG::GTAAGT-3' (SEQ ID NO: 80) and 3' splice sites 5'-CAG::GT-3' (SEQ ID NO: 81) using a PCR mutagenesis approach (SEQ ID NO:95).
- The first intron of the Oryza sativa gene accession No. L37528 (SEQ ID NO:11) comprises at least 35 base pairs of the sequence 5' to the 5'-splice site and 34 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 96).
- The intron BPSI.11 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The 5' and 3' splice sites of the first intron are modified in order to match the plant consensus sequences for 5' splice sites 5'-AG::GTAAGT-3' (SEQ ID NO: 80) and 3' splice sites 5'-CAG::GT-3' (SEQ ID NO: 81) using a PCR mutagenesis approach (SEQ ID NO:97).
- The first intron of the Oryza sativa gene accession No. CB625805 (SEQ ID NO:12) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 26 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 98).
- The intron BPSI.12 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The first intron of the Oryza sativa gene accession No. CF297669 (SEQ ID NO:13) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 24 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 99).
- The intron BPSI.13 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The first intron of the Oryza sativa gene accession No. CB674940 (SEQ ID NO:14) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 25 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 100).
- The intron BPSI.14 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The sequence of the first intron (BPSI.15, SEQ ID NO:15) isolated from the 5' UTR of the Oryza sativa gene (accession No. BAD37295.1) encoding a putative SalT protein precursor. Said first intron (SEQ ID NO:15) is flanked by the 5' (5'-GU-3', bp 1-2 in SEQ ID NO:15) and 3' (5'-CAG-3', bp 312-314 in SEQ ID NO: 15) splice sites.
- The first intron of the Oryza sativa gene comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 25 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 101).
- The intron BPSI.15 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The first intron of the Oryza sativa gene accession No. BX928664 (SEQ ID NO:16) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 23 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 102).
- The intron BPSI.16 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The first intron of the Oryza sativa gene accession No. AA752970 (SEQ ID NO:17) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 35 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 103).
- The intron BPSI.17 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The sequence of the first intron (BPSI.18, SEQ ID NO:18) isolated from the Oryza sativa clone GI 40253643 (accession No. AK064428) is similar to AT4g33690. Said first intron (SEQ ID NO:18) is flanked by the 5' (5'-GU-3', bp 1-2 in SEQ ID NO:18) and 3' (5'-CAG-3', bp 544-546 in SEQ ID NO:18) splice sites.
- The first intron of the Oryza sativa gene accession No. AK064428 (SEQ ID NO:18) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 21 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 104).
- The intron BPSI.18 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The sequence of the first intron (BPSI.19, SEQ ID NO:19) isolated from the Oryza sativa clone GI 51091887 (accession No. AK062197). Said first intron (SEQ ID NO:19) is flanked by the 5' (5'-GU-3', bp 1-2 in SEQ ID NO:19) and 3' (5'-CAG-3', bp 810-812 in SEQ ID NO:19) splice sites.
- The first intron of the Oryza sativa gene accession No. AK062197 (SEQ ID NO:19) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 26 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 105).
- The intron BPSI.19 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The first intron of the Oryza sativa gene accession No. CF279761 (SEQ ID NO:20) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 27 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 106).
- The intron BPSI.20 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The first intron of the Oryza sativa gene accession No. CF326058 (SEQ ID NO:21) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 25 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 107).
- The intron BPSI.21 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- The sequence of the first intron (BPSI.22, SEQ ID NO:22) isolated from the Oryza sativa gene (accession No. C26044) encoding a putative ACT domain repeat protein.
- The first intron (SEQ ID NO:22) is flanked by the 5' (5'-GU-3', bp 1-2 in SEQ ID NO:22) and 3' (5'-CAG-3', bp 386-388 in SEQ ID NO:22) splice sites.
- The first intron of the Oryza sativa gene (accession No. C26044) (SEQ ID NO:22) comprises at least 26 base pairs of the sequence 5' to the 5'-splice site and 28 base pairs of the sequences 3' to the 3'-splice site of the intron (SEQ ID NO: 108).
- The intron BPSI.22 comprises at least 40 base pairs, more preferably at least 50 base pairs of the sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron, respectively.
- Table 1 Genes from which the introns of the invention are preferably isolated, putative function of said genes, cDNA and the protein encoded by said genes.
- The introns with the SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10 and 11 have an impact on the expression rate of the GUS gene in transient expression assays and stably transformed plants, respectively. It could be shown that the inclusion of said introns into the 5' UTR of the GUS gene led to a strong enhancement in the expression rate of this gene in transiently and stably transformed cells, respectively, compared to a control construct that lacks the first intron (see examples 1.6.1 (table 7), 1.6.2 (table 8), 2.4 (table 15)).
- the expression enhancing properties of the introns with the SEQ ID NOs: 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 can be demonstrated by performing the above described transient or stable expression assays.
- Functional equivalents of the inventive introns can be identified via homology searches in nucleic acid databases or via DNA hybridization (screening of genomic DNA libraries) using a fragment of at least 50 consecutive base pairs of the nucleic acid molecule described by any of the SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 and stringent hybridization conditions.
- the stringent hybridizing conditions can be chosen as follows:
- the hybridization buffer contains formamide, NaCl and PEG 6000 (polyethylene glycol, MW 6000).
- Formamide has a destabilizing effect on double-stranded nucleic acid molecules, thereby, when used in hybridization buffer, allowing the reduction of the hybridization temperature to 42°C without reducing the hybridization stringency.
- NaCl has a positive impact on the renaturation rate of a DNA duplex and the hybridization efficiency of a DNA probe with its complementary DNA target.
- PEG increases the viscosity of the hybridization buffer, which has in principle a negative impact on the hybridization efficiency.
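The effect of formamide and salt on duplex stability can be illustrated with a widely quoted empirical approximation for long DNA-DNA hybrids, Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%GC) - 0.61 (%formamide) - 500/L. This formula is not part of the present text, its coefficients vary slightly between references, and the input values below are arbitrary illustrations, so the sketch should be read as a rough guide only.

```python
import math


def estimate_tm(gc_percent, length_bp, na_molar, formamide_percent=0.0):
    """Approximate melting temperature (deg C) of a long DNA-DNA duplex.

    Empirical textbook formula; an assumption of this sketch, not a value
    or method taken from the document.
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)


# Illustrative inputs: 50% GC, 500 bp probe, 1 M monovalent cation.
tm_without = estimate_tm(50, 500, 1.0)
tm_with = estimate_tm(50, 500, 1.0, formamide_percent=50)
print(round(tm_without, 1), round(tm_with, 1), round(tm_without - tm_with, 1))
```

Under this approximation each percent of formamide lowers the estimated Tm by about 0.6°C, which is why a formamide-containing buffer permits hybridization at 42°C at comparable stringency.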
- the composition of the hybridization buffer is as follows:
- hybridization is preferably performed overnight at 42°C. In the morning, the hybridized filter will be washed 3 x for 10 minutes with 2x SSC + 0.1% SDS.
- Hybridization should advantageously be carried out with fragments of at least 50, 60, 70 or 80 bp, preferably at least 90 bp. In an especially preferred embodiment, the hybridization should be carried out with the entire nucleic acid sequence with conditions described above.
- the recombinant DNA expression construct comprises (functionally linked to an intron of the invention) a promoter sequence functioning in plants or plant cells selected from the group consisting of a) the rice chloroplast protein 12 (Os.CP12) promoter as described by nucleotide 1 to 854 of SEQ ID NO: 113 (the 'fragment'), or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined in the paragraph above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment, and b) the maize hydroxypro
- HRGP HRGP promoter as described by nucleotide 1 to 1184 of SEQ ID NO: 114, or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined in the paragraph above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment, and c) the rice p-caffeoyl-CoA 3-O-methyltransferase (Os.
- CCoAMTI CCoAMTI promoter as described by nucleotide 1 to 1034 of SEQ ID NO: 115, or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined in the paragraph above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment, and d) the maize Globulin-1 (Zm.Glbi) promoter (W64A) as described by nucleotide 1 to 1440 of SEQ ID NO: 116, or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent
- the putative Rice H+-transporting ATP synthase (Os.V-ATPase) promoter as described by nucleotide 1 to 1589 of SEQ ID NO: 117, or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined in the paragraph above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment, and f) the putative rice C-8,7 sterol isomerase (Os.C8,7 Sl) promoter as described by nucleotide 1 to 796 of SEQ ID NO: 118
- LDH LDH promoter as described by nucleotide 1 to 1062 of SEQ ID NO: 119, or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined in the paragraph above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment, and h) the rice Late Embryogenesis Abundant (Os.Lea) promoter as described by nucleotide 1 to 1386 of SEQ ID NO: 121, or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined in the paragraph above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment.
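The two sequence-based criteria used for the promoters above (a minimum percentage identity to the fragment, or a minimum number of consecutive nucleotides shared with it) can be sketched as follows. This is a simplified illustration: it assumes the two sequences have already been aligned by some external tool, it does not reproduce any particular identity algorithm that may be defined elsewhere in the document, and all names and toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identical columns in a pre-computed pairwise alignment ('-' = gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    pairs = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
             if not (a == "-" and b == "-")]
    if not pairs:
        raise ValueError("alignment contains no comparable columns")
    matches = sum(1 for a, b in pairs if a == b and a != "-")
    return 100.0 * matches / len(pairs)


def longest_shared_run(a: str, b: str) -> int:
    """Length of the longest stretch of consecutive nucleotides common to a and b."""
    a, b = a.upper(), b.upper()
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
shared = longest_shared_run("TTTACGTACGGG", "CCACGTACGAA")
print(identity, shared)
# A candidate would meet the broadest thresholds if identity >= 60 or shared >= 50.
```

The dynamic-programming loop runs in time proportional to the product of the two lengths, which is adequate for promoter-sized sequences of one to two kilobases.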
- said expression construct comprises a combination of one of the above defined promoters with at least one intron selected from the group consisting of i) the BPSI.1 intron as described by nucleotide 888 to 1470 of SEQ ID NO: 113 or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least
- nucleotide 1068 to 1318 of SEQ ID NO: 120 or a sequence having at least 60% (preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%) identity to said fragment, or a sequence hybridizing under stringent conditions (preferably under conditions equivalent to the high stringency conditions defined above) to said fragment, or a sequence comprising at least 50 (preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500) consecutive nucleotides of said fragment.
- More preferably, the expression construct comprises a combination of promoter and intron selected from the group consisting of i) sequences as described by any of SEQ ID NO: 113, 114, 115, 116, 117, 118, 119,
- sequences having at least 50 preferably at least 100, more preferably at least 200 or 300, most preferably at least 400 or 500
- sequences having an identity of at least 60% preferably at least 70% or 80%, more preferably at least 90% or 95%, most preferably at least 98% or 99%
- sequences hybridizing under stringent conditions preferably under conditions equivalent to the high stringency conditions defined above
- a preferred subject matter of the invention is a vector, preferably a plant transformation vector, containing an inventive recombinant expression construct.
- the expression cassette can be introduced into the vector via a suitable restriction cleavage site.
- the plasmid formed is first introduced into E.coli. Correctly transformed E.coli are selected, grown, and the recombinant plasmid is obtained by the methods familiar to the skilled worker. Restriction analysis and sequencing may serve to verify the cloning step.
- Preferred vectors are those, which make possible stable integration of the expression cassette into the host genome.
- An expression construct according to the invention can advantageously be introduced into cells, preferably into plant cells, using vectors.
- the methods of the invention involve transformation of organism or cells (e.g.
- transgenic expression vectors comprising at least a transgenic expression cassette of the invention.
- the methods of the invention are not limited to the expression vectors disclosed herein. Any expression vector which is capable of introducing a nucleic acid sequence of interest into a plant cell is contemplated to be within the scope of this invention.
- expression vectors comprise the transgenic expression cassette of the invention in combination with elements which allow cloning of the vector into a bacterial or phage host.
- the vector preferably, though not necessarily, contains an origin of replication which is functional in a broad range of prokaryotic hosts.
- a selectable marker is generally, but not necessarily, included to allow selection of cells bearing the desired vector. Preferred are those vectors that allow stable integration of the expression construct into the host genome.
- the plasmid used need not meet any particular requirements. Simple plasmids such as those of the pUC series can be used. If intact plants are to be regenerated from the transformed cells, it is necessary for an additional selectable marker gene to be present on the plasmid.
- a variety of possible plasmid vectors are available for the introduction of foreign genes into plants, and these plasmid vectors contain, as a rule, a replication origin for multiplication in E.coli and a marker gene for the selection of transformed bacteria. Examples are pBR322, pUC series, M13mp series, pACYC184 and the like.
- the expression construct can be introduced into the vector via a suitable restriction cleavage site.
- the plasmid formed is first introduced into E.coli. Correctly transformed E.coli are selected and grown, and the recombinant plasmid is obtained by methods known to the skilled worker. Restriction analysis and sequencing can be used for verifying the cloning step.
- Agrobacterium tumefaciens and A. rhizogenes are plant-pathogenic soil bacteria, which genetically transform plant cells.
- the Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectively, carry genes responsible for genetic transformation of the plant (Kado (1991) Crit Rev Plant Sci 10:1).
- Vectors of the invention may be based on the Agrobacterium Ti- or Ri-plasmid and may thereby utilize a natural system of DNA transfer into the plant genome.
- Agrobacterium transfers a defined part of its genomic information (the T-DNA; flanked by about 25 bp repeats, named left and right border) into the chromosomal DNA of the plant cell (Zupan (2000) Plant J 23(1):11-28).
- T-DNA genomic information
- vir genes part of the original Ti-plasmids
- Ti-plasmids were developed which lack the original tumor inducing genes ("disarmed vectors").
- the T-DNA was physically separated from the other functional elements of the Ti-plasmid (e.g., the vir genes), by being incorporated into a shuttle vector, which allowed easier handling (EP-A 120 516; US 4,940,838).
- These binary vectors comprise (beside the disarmed T-DNA with its border sequences), prokaryotic sequences for replication both in Agrobacterium and E. coli. It is an advantage of Agrobacterium-mediated transformation that in general only the DNA flanked by the borders is transferred into the genome and that preferentially only one copy is inserted.
- if a Ti or Ri plasmid is to be used for the transformation, at least the right border, but in most cases the right and left border, of the Ti or Ri plasmid T-DNA is linked to the transgenic expression construct to be introduced in the form of a flanking region.
- Binary vectors are preferably used. Binary vectors are capable of replication both in E. coli and in Agrobacterium. They may comprise a selection marker gene and a linker or polylinker (for insertion of e.g. the expression construct to be transferred) flanked by the right and left T-DNA border sequence. They can be transferred directly into Agrobacterium (Holsters (1978) Mol Gen Genet 163:181-187).
- the selection marker gene permits the selection of transformed agrobacteria and is, for example, the nptII gene, which confers resistance to kanamycin.
- the Agrobacterium which acts as host organism in this case should already contain a plasmid with the vir region. The latter is required for transferring the T-DNA to the plant cell.
- An Agrobacterium transformed in this way can be used for transforming plant cells.
- the use of T-DNA for transforming plant cells has been studied and described intensively (EP 120 516; Hoekema (1983) Nature 303:179-181; An (1985) EMBO J. 4:277-287; see also below).
- Common binary vectors are based on "broad host range"-plasmids like pRK252 (Bevan (1984) Nucl Acid Res 12:8711-8720) or pTJS75 (Watson (1985) EMBO J 4(2):277-284) derived from the P-type plasmid RK2. Most of these vectors are derivatives of pBIN19 (Bevan 1984, Nucl Acid Res 12:8711-8720). Various binary vectors are known, some of which are commercially available such as, for example, pBI101.2 or pBIN19 (Clontech Laboratories, Inc. USA). Additional vectors were improved with regard to size and handling (e.g.
- Agrobacterium strains for use in the practice of the invention include octopine strains, e.g., LBA4404 or agropine strains, e.g., EHA101 or EHA105.
- Suitable strains of A. tumefaciens for DNA transfer are, for example, EHA101[pEHA101] (Hood (1986) J Bacteriol 168:1291-1301), EHA105[pEHA105] (Li (1992) Plant Mol Biol 20:1037-1048), LBA4404[pAL4404] (Hoekema (1983) Nature 303:179-181), C58C1[pMP90] (Koncz (1986) Mol Gen Genet 204:383-396), and C58C1[pGV2260] (Deblaere (1985) Nucl Acids Res 13:4777-4788).
- Other suitable strains are Agrobacterium tumefaciens C58, a nopaline strain.
- Other suitable strains are A.
- the Agrobacterium strain used to transform the plant tissue pre-cultured with the plant phenolic compound contains a L,L-succinamopine type Ti-plasmid, preferably disarmed, such as pEHA101.
- the Agrobacterium strain used to transform the plant tissue pre-cultured with the plant phenolic compound contains an octopine-type Ti-plasmid, preferably disarmed, such as pAL4404. Generally, when using octopine-type Ti-plasmids or helper plasmids, it is preferred that the virF gene be deleted or inactivated (Jarchow (1991) Proc. Natl. Acad. Sci. USA 88:10426-10430). In a preferred embodiment, the Agrobacterium strain used to transform the plant tissue pre-cultured with the plant phenolic compound such as acetosyringone.
- the method of the invention can also be used in combination with particular Agrobacterium strains, to further increase the transformation efficiency, such as Agrobacterium strains wherein the vir gene expression and/or induction thereof is altered due to the presence of mutant or chimeric virA or virG genes (e.g. Hansen (1994) Proc. Natl. Acad. Sci. USA 91:7603-7607; Chen (1991) J. Bacteriol. 173:1139-1144; Scheeren-Groot (1994) J. Bacteriol 176:6418-6426).
- a binary vector or any other vector can be modified by common DNA recombination techniques, multiplied in E.
- Agrobacterium is grown and used as described in the art.
- the Agrobacterium strain comprising the vector may, for example, be grown for 3 days on YP medium (5 g/L yeast extract, 10 g/L peptone, 5 g/L NaCl, 15 g/L agar, pH 6.8) supplemented with the appropriate antibiotic (e.g., 50 mg/L spectinomycin). Bacteria are collected with a loop from the solid medium and resuspended.
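The per-litre composition of the YP medium can be scaled to any batch volume with simple arithmetic. The helper below is a hypothetical convenience sketch (taking the salt component as NaCl); the function and dictionary names are not from the document, and pH adjustment to 6.8 still has to be done at the bench.

```python
# Composition per litre as stated for YP medium (grams per litre).
YP_MEDIUM_G_PER_L = {
    "yeast extract": 5.0,
    "peptone": 10.0,
    "NaCl": 5.0,
    "agar": 15.0,
}


def yp_recipe(volume_ml: float, antibiotic_mg_per_l: float = 50.0) -> dict:
    """Return grams of each component (and mg of antibiotic) for `volume_ml` of medium."""
    litres = volume_ml / 1000.0
    recipe = {name: round(grams * litres, 3) for name, grams in YP_MEDIUM_G_PER_L.items()}
    recipe["antibiotic (mg)"] = round(antibiotic_mg_per_l * litres, 3)
    return recipe


batch = yp_recipe(250)  # a 250 mL batch, chosen only as an example
for component, amount in batch.items():
    print(component, amount)
```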
- An additional subject matter of the invention relates to transgenic non-human organisms transformed with at least one vector containing a transgenic expression construct of the invention.
- the invention relates to bacteria, fungi, yeasts, more preferably to plants or plant cells.
- the transgenic organism is a monocotyledonous plant.
- the monocotyledonous plant is selected from the group consisting of the genera Hordeum, Avena, Secale, Triticum, Sorghum, Zea, Saccharum and Oryza, very especially preferred are plants selected from the group consisting of Hordeum vulgare, Triticum aestivum, Triticum aestivum subsp. spelta, Triticale, Avena sativa, Secale cereale, Sorghum bicolor, Saccharum officinarum, Zea mays and Oryza sativa transformed with the inventive vectors or containing the inventive recombinant expression constructs.
- Preferred bacteria are bacteria of the genus Escherichia, Erwinia, Agrobacterium, Flavobacterium, Alcaligenes or cyanobacteria, for example of the genus Synechocystis.
- microorganisms which are capable of infecting plants and thus of transferring the constructs according to the invention.
- Preferred microorganisms are those from the genus Agrobacterium and, in particular, the species Agrobacterium tumefaciens.
- Preferred yeasts are Candida, Saccharomyces, Hansenula or Pichia.
- Preferred fungi are Aspergillus, Trichoderma, Ashbya, Neurospora, Fusarium, Beauveria or other fungi. Plant organisms are furthermore, for the purposes of the invention, other organisms which are capable of photosynthetic activity such as, for example, algae or cyanobacteria, and also mosses.
- Preferred algae are green algae such as, for example, algae of the genus Haematococcus, Phaeodactylum tricornutum, Volvox or Dunaliella.
- the invention relates to cell cultures, tissues, organs (e.g., leaves, roots and the like in the case of plant organisms), or propagation material derived from transgenic non-human organisms like bacteria, fungi, yeasts, plants or plant cells transformed with at least one vector containing a transgenic expression construct of the invention.
- An additional subject matter of the invention relates to a method for providing an expression cassette for enhanced expression of a nucleic acid in a plant or a plant cell, comprising the step of functionally linking the inventive introns to a plant expression cassette not comprising said intron.
- the invention relates to a method for enhancing the expression of a nucleic acid sequence in a plant or a plant cell, comprising functionally linking the inventive introns to said nucleic acid sequence.
- the method for providing an expression cassette for enhanced expression of a nucleic acid in a plant or a plant cell and the method for enhancing the expression of a nucleic acid sequence in a plant or a plant cell further comprise the steps of i) providing a recombinant expression cassette, wherein the nucleic acid sequence is functionally linked with a promoter sequence functional in plants and with an intron sequence selected from the group consisting of SEQ ID NOs: 1, 2, 3, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22, ii) introducing said recombinant expression cassette into a plant cell or a plant, iii) identifying or selecting the transgenic plant cell comprising said transgenic expression construct.
- the above-described method further comprises the steps of iv) regenerating transgenic plant tissue from the transgenic plant cell.
- the method further comprises v) regenerating a transgenic plant from the transgenic plant cell.
- the generation of a transformed organism or a transformed cell requires introducing the DNA in question into the host cell in question.
- a multiplicity of methods is available for this procedure, which is termed transformation (see also Keown (1990) Methods in Enzymology 185:527-537).
- the DNA can be introduced directly by microinjection or by bombardment via DNA-coated microparticles.
- the cell can be permeabilized chemically, for example using polyethylene glycol, so that the DNA can enter the cell by diffusion.
- the DNA can also be introduced by protoplast fusion with other DNA-containing units such as minicells, cells, lysosomes or liposomes.
- Another suitable method of introducing DNA is electroporation, where the cells are permeabilized reversibly by an electrical pulse.
- Methods for introduction of a transgenic expression construct or vector into plant tissue may include but are not limited to, e.g., electroinjection (Nan (1995) In “Biotechnology in Agriculture and Forestry,” Ed. Y. P. S. Bajaj, Springer-Verlag Berlin Heidelberg 34:145-155; Griesbach (1992) Hort Science 27:620); fusion with liposomes, lysosomes, cells, minicells or other fusible lipid-surfaced bodies (Fraley (1982) Proc. Natl. Acad. Sci.
- microprojectile particles are coated with DNA and accelerated by a mechanical device to a speed high enough to penetrate the plant cell wall and nucleus (WO 91/02071). The foreign DNA gets incorporated into the host DNA and results in a transformed cell.
- biolistics (Sanford (1990) Physiologia Plantarum 79:206-209; Fromm (1990) Bio/Technology 8:833-839; Christou (1988) Plant Physiol 87:671-674; Sautter (1991) Bio/Technology 9:1080-1085).
- the method has been used to produce stably transformed monocotyledonous plants including rice, maize, wheat, barley, and oats (Christou (1991) Bio/Technology 9:957-962; Gordon-Kamm (1990) Plant Cell 2:603-618; Vasil (1992) Bio/Technology 10:667-674, (1993) Bio/Technology 11:1153-1158; Wan (1994) Plant Physiol. 104:37-48; Somers (1992) Bio/Technology 10:1589-1594).
- transformation can also be effected by bacterial infection by means of Agrobacterium tumefaciens or Agrobacterium rhizogenes.
- T-DNA transferred DNA
- plant explants are cocultured with a transgenic Agrobacterium tumefaciens or Agrobacterium rhizogenes.
- a suitable medium which may contain, for example, antibiotics or biocides for selecting transformed cells.
- the plants obtained can then be screened for the presence of the DNA introduced, in this case the expression construct according to the invention.
- the genotype in question is, as a rule, stable and the insertion in question is also found in the subsequent generations.
- the plants obtained can be cultured and hybridized in the customary fashion. Two or more generations should be grown in order to ensure that the genomic integration is stable and hereditary.
- Agrobacterium may be enhanced by using a number of methods known in the art. For example, the inclusion of a natural wound response molecule such as acetosyringone (AS) to the Agrobacterium culture has been shown to enhance transformation efficiency with Agrobacterium tumefaciens (Shahla (1987) Plant Mol. Biol. 8:291-298).
- AS acetosyringone
- transformation efficiency may be enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may be achieved, for example, by punching, maceration, bombardment with microprojectiles, etc. (see, e.g., Bidney (1992) Plant Molec. Biol. 18:301-313).
- the expression construct integrated contains a selection marker, which imparts a resistance to a biocide (for example a herbicide) or an antibiotic such as kanamycin, G 418, bleomycin, hygromycin or phosphinothricin and the like to the transformed plant.
- the selection marker permits the selection of transformed cells from untransformed cells (McCormick (1986) Plant Cell Reports 5:81-84).
- the plants obtained can be cultured and hybridized in the customary fashion. Two or more generations should be grown in order to ensure that the genomic integration is stable and hereditary. The abovementioned methods are described, for example, in Jenes 1993 and in Potrykus 1991.
- the present invention provides transgenic plants.
- the transgenic plants of the invention are not limited to plants in which each and every cell expresses the nucleic acid sequence of interest under the control of the promoter sequences provided herein. Included within the scope of this invention is any plant which contains at least one cell which expresses the nucleic acid sequence of interest (e.g., chimeric plants). It is preferred, though not necessary, that the transgenic plant comprises the nucleic acid sequence of interest in more than one cell, and more preferably in one or more tissues.
- transgenic plants may be regenerated from this transgenic plant tissue using methods known in the art.
- Species from the following examples of genera of plants may be regenerated from transformed protoplasts: Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Hemerocallis, Nemesia, Pelargonium, Panicum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browallia, Glycine, Pisum, Lol
- a suspension of transformed protoplasts or a Petri plate containing transformed explants is first provided.
- Callus tissue is formed and shoots may be induced from callus and subsequently rooted.
- somatic embryo formation can be induced in the callus tissue.
- These somatic embryos germinate as natural embryos to form plants.
- the culture media will generally contain various amino acids and plant hormones, such as auxin and cytokinins. It is also advantageous to add glutamic acid and proline to the medium, especially for such species as corn and alfalfa. Efficient regeneration will depend on the medium, on the genotype, and on the history of the culture. These three variables may be empirically controlled to result in reproducible regeneration.
- Plants may also be regenerated from cultured cells or tissues.
- Dicotyledonous plants which have been shown capable of regeneration from transformed individual cells to obtain transgenic whole plants include, for example, apple (Malus pumila), blackberry (Rubus), blackberry/raspberry hybrid (Rubus), red raspberry (Rubus), carrot (Daucus carota), cauliflower (Brassica oleracea), celery (Apium graveolens), cucumber (Cucumis sativus), eggplant (Solanum melongena), lettuce (Lactuca sativa), potato (Solanum tuberosum), rape (Brassica napus), wild soybean (Glycine canescens), soybean (Glycine max), strawberry (Fragaria ananassa), tomato (Lycopersicon esculentum), walnut (Juglans regia), melon (Cucumis melo), grape (Vitis vinifera), and mango (Mangifera indica). Monocotyledonous plants which
- the regenerated plants are transferred to standard soil conditions and cultivated in a conventional manner. After the expression vector is stably incorporated into regenerated transgenic plants, it can be transferred to other plants by vegetative propagation or by sexual crossing.
- in vegetatively propagated crops, the mature transgenic plants are propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants.
- the mature transgenic plants are self-crossed to produce a homozygous inbred plant which is capable of passing the transgene to its progeny by Mendelian inheritance.
- the inbred plant produces seed containing the nucleic acid sequence of interest. These seeds can be grown to produce plants that would produce the selected phenotype.
- the inbred plants can also be used to develop new hybrids by crossing the inbred plant with another inbred plant to produce a hybrid.
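The self-crossing step described above can be illustrated with expected Mendelian genotype frequencies. The sketch assumes a primary transformant carrying a single, unlinked transgene insertion in the hemizygous state (written "T-"), no selection, and no viability differences; these assumptions, and all names used, are illustrative and are not stated in the text.

```python
from fractions import Fraction


def self_cross(freqs: dict) -> dict:
    """Expected genotype frequencies after one generation of selfing.

    'TT' = homozygous for the transgene, 'T-' = hemizygous, '--' = no transgene.
    Selfed 'T-' plants segregate 1:2:1; 'TT' and '--' plants breed true.
    """
    hemi = freqs["T-"]
    return {
        "TT": freqs["TT"] + hemi / 4,
        "T-": hemi / 2,
        "--": freqs["--"] + hemi / 4,
    }


t0 = {"TT": Fraction(0), "T-": Fraction(1), "--": Fraction(0)}
t1 = self_cross(t0)
t2 = self_cross(t1)
print(t1)
print(t2)
```

Under these assumptions a quarter of the first selfed generation is expected to be homozygous for the transgene, and progeny testing over further generations distinguishes these plants from hemizygous siblings.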
- Confirmation of the transgenic nature of the cells, tissues, and plants may be performed by PCR analysis, antibiotic or herbicide resistance, enzymatic analysis and/or Southern blots to verify transformation. Progeny of the regenerated plants may be obtained and analyzed to verify whether the transgenes are heritable. Heritability of the transgene is further confirmation of the stable transformation of the transgene in the plant. The resulting plants can be bred in the customary fashion. Two or more generations should be grown in order to ensure that the genomic integration is stable and hereditary. Corresponding methods are described (Jenes 1993; Potrykus 1991).
- transgenic plant organisms derived from the above-described transgenic organisms, and transgenic propagation material such as seeds or fruits.
- the method for enhancing the expression of a nucleic acid sequence in a plant or a plant cell further comprises, linking the introns with expression enhancing properties to the expression cassette by insertion via homologous recombination comprising the following steps: a) providing in vivo or in vitro a DNA construct comprising said introns flanked by sequences allowing homologous recombination into a pre-existing expression cassette between the promoter and the nucleic acid of said expression cassette, b) transforming a recipient plant cell comprising said cassette, regenerating a transgenic plant where said intron has been inserted into the genomic DNA of said promoter nucleic acid construct via homologous recombination.
- Homologous recombination is a reaction between any pair of DNA sequences having a similar sequence of nucleotides, where the two sequences interact (recombine) to form a new recombinant DNA species.
- the frequency of homologous recombination increases as the length of the shared nucleotide DNA sequences increases, and is higher with linearized plasmid molecules than with circularized plasmid molecules.
- Homologous recombination can occur between two DNA sequences that are less than identical, but the recombination frequency declines as the divergence between the two sequences increases.
- Introduced DNA sequences can be targeted via homologous recombination by linking a DNA molecule of interest to sequences sharing homology with endogenous sequences of the host cell. Once the DNA enters the cell, the two homologous sequences can interact to insert the introduced DNA at the site where the homologous genomic DNA sequences were located. Therefore, the choice of homologous sequences contained on the introduced DNA will determine the site where the introduced DNA is integrated via homologous recombination. For example, if the DNA sequence of interest is linked to DNA sequences sharing homology to a single copy gene of a host plant cell, the DNA sequence of interest will be inserted via homologous recombination at only that single specific site.
- the DNA sequence of interest can be inserted via homologous recombination at each of the specific sites where a copy of the gene is located.
- the introduced DNA should contain sequences homologous to the selected gene.
- a double recombination event can be achieved by flanking each end of the DNA sequence of interest (the sequence intended to be inserted into the genome) with DNA sequences homologous to the selected gene.
- a homologous recombination event involving each of the homologous flanking regions will result in the insertion of the foreign DNA. Thus only those DNA sequences located between the two regions sharing genomic homology become integrated into the genome.
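The outcome of such a double recombination event can be modelled on strings: only the sequence lying between the two homology arms of the introduced DNA ends up in the genome. The sketch below is a deliberate simplification that requires exact matches for both arms (real recombination tolerates some divergence); the function name and all toy sequences are hypothetical.

```python
def double_crossover(genome: str, left_arm: str, insert: str, right_arm: str) -> str:
    """Place `insert` between the genomic copies of the two homology arms.

    Any genomic sequence that lay between the arms is replaced by `insert`;
    sequences outside the arms are left unchanged.
    """
    left_start = genome.find(left_arm)
    if left_start == -1:
        raise ValueError("left homology arm not found in genome")
    left_end = left_start + len(left_arm)
    right_start = genome.find(right_arm, left_end)
    if right_start == -1:
        raise ValueError("right homology arm not found downstream of left arm")
    return genome[:left_end] + insert + genome[right_start:]


# Toy target locus: the two arms are adjacent, so the event is a pure insertion.
genome = "AAAA" + "GATTACA" + "TGCATGC" + "TTTT"
edited = double_crossover(genome, "GATTACA", "gtnnncag", "TGCATGC")
print(edited)
```

Lower-case letters mark the inserted sequence so that it is easy to spot in the printed result.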
- the inventive intron that has to be introduced into the chromosome, preferably into the 5' UTR of a gene (a preexisting expression cassette), is (for example) located on a DNA construct and is flanked 5' and 3' by nucleic acid sequences of sufficient homology to the target DNA into which the intron should be integrated (such a construct is called a "gene targeting substrate").
- Said flanking regions must be sufficient in length to make recombination possible. They are, as a rule, in the range of several hundred bases to several kilobases in length (Thomas KR and Capecchi MR (1987) Cell 51:503; Strepp et al.
- the gene targeting substrate comprises a selection marker that is co-integrated with the intron into the genomic region of interest, allowing the selection of recombination events.
- the gene targeting substrate is integrated via a double cross over event between pairs of homologous DNA sequences of sufficient length and homology resulting in the insertion of the intron sequence (and if desired additional nucleic acid sequences e.g. selection marker).
- an intron of the invention can be placed in the 5' non-coding region of the target gene (e.g., an endogenous plant gene) to be transgenically expressed, by linking said intron to DNA sequences which are homologous to, for example, endogenous sequences upstream and/or downstream of the reading frame of the target gene.
- the homologous sequences can interact and thus place the intron of the invention at the desired site so that the intron sequence of the invention becomes operably linked to the target gene and constitutes an expression construct of the invention.
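The double crossover described above can be pictured with a toy string model. The sketch below is only an illustration under simplifying assumptions (exact-match homology arms, a single copy of each arm, one strand only); the function name `double_crossover` and the example sequences are hypothetical and are not part of the disclosure.

```python
def double_crossover(genome: str, construct: str, left_arm: str, right_arm: str) -> str:
    """Toy model: replace the genomic interval between two homology arms
    with the payload that lies between the same arms on the construct."""
    # Locate both arms on the construct; the payload is what lies between them.
    c_left = construct.find(left_arm)
    c_right = construct.find(right_arm, c_left + len(left_arm))
    # Locate the same arms in the genome.
    g_left = genome.find(left_arm)
    g_right = genome.find(right_arm, g_left + len(left_arm))
    if min(c_left, g_left) < 0 or min(c_right, g_right) < 0:
        raise ValueError("both homology arms must occur in genome and construct")
    payload = construct[c_left + len(left_arm):c_right]
    # Sequences outside the arms on the construct are not integrated.
    return genome[:g_left + len(left_arm)] + payload + genome[g_right:]


# Hypothetical sequences: arms AAGG / GGTT, payload ATATAT (stands in for the intron).
genome = "TTTT" + "AAGG" + "CC" + "GGTT" + "CCCC"
construct = "GC" + "AAGG" + "ATATAT" + "GGTT" + "GCGC"
recombinant = double_crossover(genome, construct, "AAGG", "GGTT")
```

In this model the vector backbone (`GC` and `GCGC` on the construct) never reaches the genome, which mirrors the statement that only sequences located between the two homologous regions become integrated.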
- the host organism - for example a plant - is transformed with the recombination construct using the methods described herein, and clones, which have successfully undergone recombination, are selected, for example using a resistance to antibiotics or herbicides.
- site-directed integration of the nucleic acid sequence of interest into the plant cell genome may be achieved by, for example, homologous recombination using Agrobacterium-derived sequences.
- plant cells are incubated with a strain of Agrobacterium which contains a targeting vector in which sequences that are homologous to a DNA sequence inside the target locus are flanked by Agrobacterium transfer-DNA (T-DNA) sequences, as previously described (US 5,501,967, the entire contents of which are herein incorporated by reference).
- homologous recombination may be achieved using targeting vectors which contain sequences that are homologous to any part of the targeted plant gene, whether belonging to the regulatory elements of the gene, or the coding regions of the gene. Homologous recombination may be achieved at any region of a plant gene so long as the nucleic acid sequence of regions flanking the site to be targeted is known. Gene targeting is a relatively rare event in higher eukaryotes, especially in plants. Random integrations into the host genome predominate.
- Counter selection is a powerful approach in mammalian and plant systems to enrich for gene targeting events.
- the bacterial codA gene as a cell autonomous negative selection marker can be used for selection in tissue culture (Schlaman and Hooykaas Plant J 11:1377-1385, 1997; Thykjaer et al., Plant Mol Biol. 1997 Nov;35(4):523-30).
- Negative selection in plants allowed a more than thousand-fold suppression of random integration (Risseeuw et al., Plant J. 1997 Apr;11(4):717-28; Gallego et al., Plant Mol Biol. 1999 Jan;39(1):83-93; Terada et al., Nat Biotechnol.
- sequence-unspecific induction of DNA strand breaks is disadvantageous because of the potential mutagenic effect. Sequence-specific induction of DNA strand-breaks may also increase efficiency of HR but is limited to artificial scenarios (Siebert R and Puchta H (2002) Plant Cell 14(5):1121- 1131).
- site-specific integration or excision of transformation constructs prepared in accordance with the instant invention.
- An advantage of site-specific integration or excision is that it can be used to overcome problems associated with conventional transformation techniques, in which transformation constructs typically randomly integrate into a host genome in multiple copies. This random insertion of introduced DNA into the genome of host cells can be lethal if the foreign DNA inserts into an essential gene.
- the expression of a transgene may be influenced by "position effects" caused by the surrounding genomic DNA.
- DNA-constructs utilized within the method of this invention may comprise additional nucleic acid sequences. Said sequences may be for example localized in different positions with respect to the homology sequences.
- the additional nucleic acid sequences are localized between two homology sequences and may be introduced via homologous recombination into the chromosomal DNA, thereby resembling an insertion mutation of said chromosomal DNA.
- the additional sequences may also be localized outside of the homology sequences (e.g., at the 5'- or 3'-end of the DNA construct). In cases where the additional sequence is a counter selection marker, this may allow illegitimate insertion events to be distinguished from correct insertion events mediated by homologous recombination. Corresponding negative markers are described below and suitable methods are well known in the art (WO 99/20780).
- efficiency of the method of the invention may be further increased by combination with other methods suitable for increasing homologous recombination.
- Said methods may include, for example, expression of HR enhancing proteins (e.g., RecA; WO 97/08331; Reiss B et al. (1996) Proc Natl Acad Sci USA 93(7):3094-3098; Reiss B et al. (2000) Proc Natl Acad Sci USA 97(7):3358-3363) or treatment with PARP inhibitors (Puchta H et al. (1995) Plant J. 7:203-210).
- PARP inhibitors suitable for use within this invention are known to the person skilled in the art and may include, for example and preferably, 3-aminobenzamide, 8-hydroxy-2-methylquinazolin-4-one (NU1025), 1,11b-dihydro-[2H]benzopyrano[4,3,2-de]isoquinolin-3-one (GPI 6150), 5-aminoisoquinolinone, 3,4-dihydro-5-[4-(1-piperidinyl)butoxy]-1(2H)-isoquinolinone or compounds described in WO 00/26192, WO 00/29384, WO 00/32579, WO 00/64878, WO 00/68206, WO 00/67734, WO 01/23386 or WO 01/23390.
- the method may be combined with other methods facilitating homologous recombination and/or selection of the recombinants, e.g., positive/negative selection, excision of illegitimate recombination events or induction of sequence-specific or unspecific DNA double-strand breaks.
- the method for enhancing the expression of a nucleic acid sequence in a plant or a plant cell via linking the intron with expression enhancing properties to the expression cassette by insertion via homologous recombination is applied to monocotyledonous plants or plant cells, more preferably to plants selected from the group consisting of the genera Hordeum, Avena, Secale, Triticum, Sorghum, Zea, Saccharum, and Oryza, most preferably a maize plant.
- said nucleic acid sequence encodes a protein.
- the method is applied to recombinant DNA expression constructs that contain a DNA for the purpose of expressing RNA transcripts that function to affect plant phenotype without being translated into a protein.
- non protein expressing sequences comprising antisense RNA molecules, sense RNA molecules, RNA molecules with ribozyme activity, double strand forming RNA molecules (RNAi) as described above.
- This process can be used widely for fine chemicals such as enzymes, vitamins, amino acids, sugars, fatty acids, natural and synthetic flavorings, aroma substances and colorants.
- Culturing the transformed host organisms, and isolation from the host organisms or the culture medium is performed by methods known to the skilled worker.
- pharmaceuticals such as, for example, antibodies, vaccines, enzymes or pharmaceutically active proteins is described (Hood (1999) Curr Opin Biotechnol. 10(4):382-6; Ma (1999) Curr Top Microbiol. Immunol.
- the present invention relates to a recombinant DNA expression construct comprising at least one promoter sequence functioning in plants or plant cells, at least one intron with expression enhancing properties in plants or plant cells characterized by VIII) an intron length shorter than 1,000 base pairs, and
- XIV) an adenine plus thymine content of at least 55%, and a thymine content of at least 30% over the entire intron, and at least one nucleic acid sequence, wherein said promoter sequence and at least one of said intron sequences are functionally linked to said nucleic acid sequence and wherein said intron is heterologous to said nucleic acid sequence and/or to said promoter sequence.
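The length and base-composition features named above (shorter than 1,000 base pairs, at least 55% adenine plus thymine, at least 30% thymine over the entire intron) are simple to compute. The following is a minimal sketch, not the applicants' implementation; the function names and the example sequences are hypothetical, and the other characterizing features of the claim are not covered.

```python
def intron_composition(seq: str) -> dict:
    """Return length, A+T fraction and T fraction of an intron sequence."""
    seq = seq.upper().replace("U", "T")  # accept RNA notation as well
    if not seq:
        raise ValueError("empty sequence")
    a, t = seq.count("A"), seq.count("T")
    return {
        "length": len(seq),
        "at_fraction": (a + t) / len(seq),
        "t_fraction": t / len(seq),
    }


def meets_composition_criteria(seq: str, max_length: int = 1000,
                               min_at: float = 0.55, min_t: float = 0.30) -> bool:
    """Check only the three features quoted in the text above."""
    c = intron_composition(seq)
    return (c["length"] < max_length
            and c["at_fraction"] >= min_at
            and c["t_fraction"] >= min_t)


# Hypothetical toy sequences, far shorter than any real intron.
print(meets_composition_criteria("GTATTTTCAG"))  # AT-rich toy example
print(meets_composition_criteria("GTGCGCGCAG"))  # GC-rich toy example
```

The thresholds are passed as parameters so that the same check can be reused if stricter preferred ranges are applied.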
- SEQ ID NO: 1 BPSI.1 Sequence of the first intron isolated from the Oryza sativa metallothioneine-like gene (accession No. AP002540)
- SEQ ID NO: 2 BPSI.2 Sequence of the first intron isolated from the Oryza sativa Sucrose UDP Glucosyltransferase-2 gene (accession No. AC084380)
- SEQ ID NO: 3 BPSI.3 Sequence of the second intron isolated from the Oryza sativa Sucrose UDP Glucosyltransferase-2 gene (accession No. AC084380)
- SEQ ID NO: 4 BPSI.4 Sequence of the third intron isolated from the Oryza sativa Sucrose UDP Glucosyltransferase-2 gene (accession No. AC084380)
- SEQ ID NO: 5 BPSI.5 Sequence of the eighth intron isolated from the O. sativa gene encoding for the Sucrose transporter (accession No.
- SEQ ID NO: 6 BPSI.6 Sequence of the fourth intron isolated from the Oryza sativa gene (accession No. BAA94221) encoding for an unknown protein with homology to the A. thaliana chromosome II sequence from clones T22O13, F12K2 encoding for a putative lipase (AC006233).
- SEQ ID NO: 7 BPSI.7 Sequence of the fourth intron isolated from the Oryza sativa gene (accession No. BAB90130) encoding for a putative cinnamyl-alcohol dehydrogenase.
- SEQ ID NO: 8 BPSI.8 Sequence of the second intron isolated from the Oryza sativa gene (accession No. AC084766) encoding for a putative ribonucleoprotein.
- SEQ ID NO: 9 BPSI.9 Sequence of the fifth intron isolated from the Oryza sativa clone GI 12061241.
- SEQ ID NO: 10 BPSI.10 Sequence of the third intron isolated from the O. sativa gene (accession No. AP003300) encoding for a putative protein kinase.
- SEQ ID NO: 11 BPSI.11 Sequence of the first intron isolated from the O. sativa gene (accession No. L37528) encoding for a MADS3 box protein.
- SEQ ID NO: 12 BPSI.12 Sequence of the first intron isolated from the Oryza sativa gene (accession No. CB625805) encoding for a putative Adenosylmethionine decarboxylase.
- SEQ ID NO: 13 BPSI.13 Sequence of the first intron isolated from the O. sativa gene (accession No. CF297669) encoding for an Aspartic proteinase.
- SEQ ID NO: 14 BPSI.14 Sequence of the first intron isolated from the O. sativa gene (accession No. CB674940) encoding for a Lec14b protein.
- SEQ ID NO: 15 BPSI.15 Sequence of the first intron isolated from the Oryza sativa gene (accession No. BAD37295.1) encoding for a putative SalT protein precursor
- SEQ ID NO: 16 BPSI.16 Sequence of the first intron isolated from the O. sativa gene (accession No. BX928664) encoding for a putative Reticulon.
- SEQ ID NO: 17 BPSI.17 Sequence of the first intron isolated from the O. sativa gene (accession No. AA752970) encoding for a glycolate oxidase.
- SEQ ID NO: 18 BPSI.18 Sequence of the first intron isolated from the Oryza sativa clone (accession No. AK06442 encoding putative non-coding
- SEQ ID NO: 19 BPSI.19 Sequence of the first intron isolated from the Oryza sativa clone (accession No. AK062197) encoding putative non-coding
- SEQ ID NO: 21 BPSI.21 Sequence of the first intron isolated from the Oryza sativa gene (accession No. CF326058) encoding for a putative membrane transporter.
- SEQ ID NO: 22 BPSI.22 Sequence of the first intron isolated from the Oryza sativa gene (accession No. C26044) encoding for a putative ACT domain repeat protein
- SEQ ID NO: 23 Sucrose-UDP glucosyltransferase 2 forward (for) primer
- SEQ ID NO: 24 Sucrose-UDP glucosyltransferase 2 reverse (rev) primer
- SEQ ID NO: 25 Putative Bowman-Birk trypsin inhibitor (for) primer
- SEQ ID NO: 26 Putative Bowman-Birk trypsin inhibitor rev primer
- SEQ ID NO: 29 Phenylalanine ammonia-lyase (for) primer
- SEQ ID NO: 30 Phenylalanine ammonia-lyase rev primer
- SEQ ID NO: 33 Catalase (for) primer
- SEQ ID NO: 34 Catalase rev primer
- SEQ ID NO: 35 Putative stress-related protein (for) primer
- SEQ ID NO: 37 Putative translation initiation factor SUI1 (for) primer
- SEQ ID NO: 38 Putative translation initiation factor SUI1 rev primer
- SEQ ID NO: 39 Polyubiquitin (for) primer
- SEQ ID NO: 41 Glutathione S-transferase II (for) primer
- SEQ ID NO: 42 Glutathione S-transferase II rev primer
- SEQ ID NO: 45 Translational initiation factor eIF1 (for) primer
- SEQ ID NO: 48 OSJNBa0024F24.10 (unknown protein) rev primer
- SEQ ID NO: 49 Protein similar to Histone 3.2-614 (for) primer
- SEQ ID NO: 50 Protein similar to Histone 3.2-614 rev primer
- SEQ ID NO: 53 BPSI.1-5' primer
- SEQ ID NO: 54 BPSI.1-3' primer
- SEQ ID NO: 75 5'-CURAY-3' plant branchpoint consensus sequence 1
- SEQ ID NO: 80 5' splice site plant consensus sequence 5'-AG::GTAAGT-3'
- SEQ ID NO: 82 Sequence of the first intron isolated from the Oryza sativa metallothioneine-like gene (accession No. AP002540) including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.1 (SEQ ID NO:1)
- SEQ ID NO: 83 Sequence of the first intron isolated from the O. sativa Sucrose UDP Glucosyltransferase-2 gene (accession No. AC084380) including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.2 (SEQ ID NO:2)
- SEQ ID NO: 84 Sequence of the second intron isolated from the O. sativa Sucrose UDP Glucosyltransferase-2 gene (accession No. AC084380) including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.3 (SEQ ID NO:3).
- SEQ ID NO: 85 Sequence of the third intron isolated from the O. sativa Sucrose UDP Glucosyltransferase-2 gene (accession No. AC084380) including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.4 (SEQ ID NO:4)
- SEQ ID NO: 86 Sequence of the eighth intron isolated from the Oryza sativa gene encoding for the Sucrose transporter (GenBank accession No. AF280050) including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.5 (SEQ ID NO:5)
- SEQ ID NO: 87 Sequence of the eighth intron isolated from the Oryza sativa gene encoding for the Sucrose transporter (accession No. AF280050) including modified 5' and 3' splice sites and sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.5 (SEQ ID NO:5)
- SEQ ID NO: 88 Sequence of the fourth intron isolated from the Oryza sativa gene encoding for an unknown protein with homology to the A. thaliana chromosome II sequence from clones T22O13, F12K2 encoding for a putative lipase (AC006233) including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.6 (SEQ ID NO:6)
- SEQ ID NO: 89 Sequence of the fourth intron isolated from the Oryza sativa gene encoding for an unknown protein with homology to the A. thaliana chromosome II sequence from clones T22O13, F12K2 encoding for a putative lipase (AC006233) including modified 5' and 3' splice sites and sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.6 (SEQ ID NO:6)
- SEQ ID NO: 90 Sequence of the fourth intron isolated from the Oryza sativa gene (accession No. BAB90130) encoding for a putative cinnamyl-alcohol dehydrogenase including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.7 (SEQ ID NO:7)
- SEQ ID NO: 91 Sequence of the fourth intron isolated from the Oryza sativa gene (accession No. BAB90130) encoding for a putative cinnamyl-alcohol dehydrogenase including modified 5' and 3' splice sites and sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.7 (SEQ ID NO:7)
- SEQ ID NO: 92 Sequence of the second intron isolated from the Oryza sativa gene (accession No. AC084766) encoding for a putative ribonucleoprotein including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.8 (SEQ ID NO:8)
- SEQ ID NO: 93 Sequence of the second intron isolated from the Oryza sativa gene (accession No. AC084766) encoding for a putative ribonucleoprotein including modified 5' and 3' splice sites and sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.8 (SEQ ID NO:8)
- SEQ ID NO: 94 Sequence of the third intron isolated from the Oryza sativa gene (accession No. AP003300) encoding for a putative protein including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.10 (SEQ ID NO:10)
- SEQ ID NO: 95 Sequence of the third intron isolated from the Oryza sativa gene (accession No. AP003300) encoding for a putative protein including modified 5' and 3' splice sites and sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence
- SEQ ID NO: 98 Sequence of the first intron isolated from the Oryza sativa gene
- SEQ ID NO: 101 Sequence of the first intron isolated from the O. sativa gene (accession No. CA128696) encoding for a putative mannose-binding rice lectin including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.15 (SEQ ID NO:15)
- SEQ ID NO: 102 Sequence of the first intron isolated from the Oryza sativa gene (accession No. BX928664) encoding for a putative Reticulon including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.16 (SEQ ID NO:16)
- SEQ ID NO: 103 Sequence of the first intron isolated from the Oryza sativa gene
- SEQ ID NO: 104 Sequence of the first intron isolated from the Oryza sativa clone GI 34763855 including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.18 (SEQ ID NO:18)
- SEQ ID NO: 105 Sequence of the first intron isolated from the Oryza sativa clone GI 32533738 including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.19 (SEQ ID NO:19)
- SEQ ID NO: 106 Sequence of the first intron isolated from the Oryza sativa gene
- SEQ ID NO: 107 Sequence of the first intron isolated from the O. sativa gene (accession No. CF326058) encoding for a putative membrane transporter including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.21 (SEQ ID NO:21).
- SEQ ID NO: 108 Sequence of the first intron isolated from the O. sativa gene (accession No. C26044) encoding for a putative ACT domain repeat protein including sequences 5' and 3' adjacent to the 5' and 3' splice sites of the intron sequence BPSI.22 (SEQ ID NO:22).
- SEQ ID NO: 109 Binary vector pBPSMM291
- SEQ ID NO: 110 Binary vector pBPSMM305
- SEQ ID NO: 111 Binary vector pBPSMM350
- SEQ ID NO: 112 Binary vector pBPSLM139
- SEQ ID NO: 113 Artificial sequence: cassette from vector pBPSMM355 (OsCP12::BPSI.1) comprising Os CP12 promoter (bp 1 - 854) and BPSI.1 intron (bp 888 - 1470).
- SEQ ID NO: 114 Artificial sequence: cassette from vector pBPSMM355 (ZmHRGP::BPSI.1) comprising maize [HRGP] hydroxyproline-rich glycoprotein (extensin) 5'/UTR promoter (bp 1 - 1184) and Oryza sativa BPSI.1 intron (bp 1217 - 1799)
- SEQ ID NO: 115 Artificial sequence: cassette from vector pBPSMM358 (OsCCoAMT1::BPSI.1) comprising p-caffeoyl-CoA 3-O-methyltransferase [CoA-O-Methyl] promoter (bp 1 - 1034) and BPSI.1 intron (bp 1119 - 1701)
- SEQ ID NO: 116 Artificial sequence: cassette from vector EXS1025 (ZmGlobulin1::BPSI.1) comprising maize Globulin-1 [ZmGlb1] promoter
- ATPase::BPSI.1) comprising putative Rice H+-transporting ATP synthase 5'/UTR promoter (1 - 1589) and BPSI.1 intron (1616 - 2198)
- SEQ ID NO: 118 Artificial sequence: cassette from vector pBPSMM366
- SEQ ID NO: 119 Artificial sequence: cassette from vector pBPSMM357 (ZmLDH::BPSI.1) comprising maize gene Lactate Dehydrogenase 5'/UTR promoter (bp 1 - 1062) and BPSI.1 intron (bp 1095 - 1677).
- SEQ ID NO: 120 Artificial sequence: cassette from vector pBPSLM229 (ZmLDH::BPSI.5) comprising maize gene Lactate Dehydrogenase 5'/UTR promoter (bp 1 - 1062) and BPSI.5 intron (bp 1068
- SEQ ID NO: 121 Artificial sequence: cassette from vector pBPSMM371 (OsLea::BPSI.1) comprising rice Lea (Late Embryogenesis Abundant) promoter (bp 1 - 1386) and BPSI.1 intron (bp 1387 - 2001)
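The base-pair ranges given for the BPSI.1 intron in the cassettes above can be compared with a few lines of code. This is only a bookkeeping sketch that assumes the coordinates are 1-based and inclusive (the listing does not state the convention); the dictionary keys are shorthand labels for the cassettes, not official names.

```python
# Coordinates of the BPSI.1 intron as quoted in the cassette descriptions above.
# Assumption: positions are 1-based and inclusive.
bpsi1_spans = {
    "OsCP12::BPSI.1": (888, 1470),
    "ZmHRGP::BPSI.1": (1217, 1799),
    "OsCCoAMT1::BPSI.1": (1119, 1701),
    "ATPase::BPSI.1": (1616, 2198),
    "ZmLDH::BPSI.1": (1095, 1677),
    "OsLea::BPSI.1": (1387, 2001),
}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based interval."""
    return end - start + 1


lengths = {name: span_length(*span) for name, span in bpsi1_spans.items()}
for name, length in lengths.items():
    print(f"{name}: {length} bp")
```

Under that assumption the annotated BPSI.1 region spans 583 bp in five of the cassettes and 615 bp in the OsLea cassette, whose interval starts directly after the promoter; the longer span may include adjacent sequence.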
- the cloning steps carried out for the purposes of the present invention such as, for example, restriction cleavages, agarose gel electrophoresis, purification of DNA fragments, transfer of nucleic acids to nitrocellulose and nylon membranes, linking DNA fragments, transformation of E. coli cells, growing bacteria, multiplying phages and sequence analysis of recombinant DNA, are carried out as described by Sambrook (1989).
- the sequencing of recombinant DNA molecules is carried out using an ABI laser fluorescence DNA sequencer following the method of Sanger (Sanger 1977).
- Example 1 Identification and characterization of IME-introns in highly expressing genes
- the rice cDNA clone distribution profiles were derived from about 7.6 million rice cDNA clones, which were generated over 23 rice cDNA libraries of different tissues at different developmental stages and biotic/abiotic treatments.
- Methods for the production of cDNA libraries are well known in the art (e.g. Gubler U, and Hoffman BJ. (1983) A simple and very efficient method for generating cDNA libraries. Gene 25(2-3):263-269; Jung-Hwa Oh et al. (2003) An improved method for constructing a full-length enriched cDNA library using small amounts of total RNA as a starting material.
- the normalized cDNA library was produced by first adjusting the original library clone size to the average clone size of all of the 23 libraries, then adjusting the number of clones per variant in that library based on the adjusted total number of clones in that library.
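One way to read the normalization just described is as a per-library scaling: each library is rescaled to the average library size, and the clone count of every variant in that library is multiplied by the same factor. The sketch below follows that reading; it is an assumption about the arithmetic, and the function name and the example numbers are hypothetical.

```python
def normalize_library(counts: dict, library_size: int, mean_library_size: float) -> dict:
    """Scale per-variant clone counts so the library matches the mean library size."""
    scale = mean_library_size / library_size
    return {variant: n * scale for variant, n in counts.items()}


# Hypothetical example with two libraries instead of 23.
library_sizes = {"leaf": 200_000, "root": 400_000}
mean_size = sum(library_sizes.values()) / len(library_sizes)  # 300,000

leaf = normalize_library({"variant_1": 20}, library_sizes["leaf"], mean_size)
root = normalize_library({"variant_1": 40}, library_sizes["root"], mean_size)
```

In this toy case the variant has twice as many clones in the root library only because that library is twice as large; after scaling, both libraries report the same normalized count.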
- Rice clones are selected from the rice clusters for sequencing to generate rice EST data.
- 145 variants were selected because their clone numbers fell within the top 1% of the clone distribution in each of the 23 libraries, and genes were identified using the homologs to the EST sequences derived from the variants. These candidate genes showed strong, constitutive, and ubiquitous expression.
- the rice EST sequences homologous to these candidate genes were mapped to the rice genomic DNA sequences. The top 15 candidates out of 145 were selected based on availability of genomic sequences, annotation, and high level of expression (Table 2).
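The selection of variants that rank in the top 1% of the clone distribution in every library can be sketched as a per-library ranking followed by an intersection. This is one plausible reading of the criterion, not the applicants' code; the function names, the tie handling and the toy data are assumptions.

```python
import math


def top_fraction(counts: dict, fraction: float = 0.01) -> set:
    """Variants whose clone counts rank within the top `fraction` of one library."""
    k = max(1, math.ceil(len(counts) * fraction))
    ranked = sorted(counts, key=counts.get, reverse=True)
    return set(ranked[:k])


def consistently_high(libraries: dict, fraction: float = 0.01) -> set:
    """Variants that are in the top fraction of every library (needs >= 1 library)."""
    per_library = [top_fraction(counts, fraction) for counts in libraries.values()]
    return set.intersection(*per_library)


# Hypothetical toy data: 4 variants, 2 libraries, top 50% instead of top 1%.
libraries = {
    "leaf": {"v1": 90, "v2": 80, "v3": 5, "v4": 1},
    "root": {"v1": 70, "v3": 60, "v2": 10, "v4": 2},
}
selected = consistently_high(libraries, fraction=0.5)
```

Requiring membership in every library is what favours constitutive, ubiquitously expressed genes over genes that are abundant in only one tissue.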
- reaction solution I (1 µg RNA, 2 µL 10x Buffer, 4 µL 25 mM MgCl2, 2 µL 1 mM dNTPs, 2 µL 3.2 µg Random Primers, 1 µL 50 units RNase Inhibitor, 0.8 µL 20 units AMV-RT polymerase, fill to 20 µL with sterile water) under the optimized PCR program (25°C 10 min, 42°C 1 hr, 99°C 5 min, 4°C stop reaction).
- the RT-PCR samples were used for the LightCycler reaction (11.6 µL sterile water, 2.4 µL 25 mM MgCl2, 2 µL SYBR Green Polymerase mix, 2 µL 10 mM Specific Primer Mix, 2 µL RT-PCR reaction product) under the optimized program (95°C 5 min; 95°C 30 sec, 61°C 40 sec, 72°C 40 sec, repeating steps 2-4 for 30 cycles; 72°C 10 min; 4°C stop reaction) provided by Roche (LightCycler FastStart DNA Master SYBR Green I, Cat. No. 3003230).
- Standardizing the concentration of RNA (1 µg) in each of the RT-PCR reactions was sufficient to directly compare the samples if the same primers were used for each LightCycler reaction.
- the output results were a number that corresponds to the cycle of PCR at which the sample reaches the inflection point in the log curve generated. The lower the cycle numbers the higher the concentration of target RNA present in the sample.
- Each sample was repeated in triplicate and an average was generated to produce the sample "crosspoint" value. The lower the crosspoint, the stronger the target gene was expressed in that sample. (Roche Molecular Biochemicals LightCycler System: Reference Guide May 1999 version) Based on the LightCycler results, 11 candidates were selected (Table 4).
- the numbers represent the PCR cycle at which the PCR product reaches the start of the exponential curve. The lower the number, the higher the expression of the endogenous gene.
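The crosspoint comparison described above amounts to averaging the triplicate values of each sample and sorting in ascending order, since a lower crosspoint indicates stronger expression. A minimal sketch follows; the gene labels and crosspoint values are invented for illustration and are not data from Table 4.

```python
from statistics import mean


def rank_by_expression(samples: dict) -> list:
    """Average replicate crosspoints per sample and sort from strongest
    expression (lowest crosspoint) to weakest (highest crosspoint)."""
    averaged = {name: mean(values) for name, values in samples.items()}
    return sorted(averaged.items(), key=lambda item: item[1])


# Hypothetical triplicate crosspoints (PCR cycle numbers).
samples = {
    "gene_B": [25.0, 24.0, 26.0],
    "gene_A": [18.0, 18.5, 19.0],
}
ranking = rank_by_expression(samples)
```

Such a comparison is only meaningful under the conditions stated in the text: the same amount of input RNA and the same primers in each LightCycler reaction.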
- Candidate introns were isolated using the publicly available genomic DNA sequences (e.g. http://www.ncbi.nlm.nih.gov/genomes/PLANTS/PlantList.html), leading to a total of 20 introns, mostly first, second, and/or third introns from the targeted genes. These intron sequences were screened by the following IME criteria:
- CURAY branch point
- Intron size less than 1 kb
- Selected intron candidates can retain up to 50 bp of exon sequence upstream and downstream of the 5' and 3' splice sites, respectively.
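The two criteria that survive in the list above, a CURAY branch point and an intron size below 1 kb, can be checked with a regular expression and a length test. In IUPAC notation R is a purine (A or G) and Y a pyrimidine (C or T/U). The sketch below searches for the motif anywhere in the intron, which is a simplifying assumption because the text here does not say where the branch point must lie; the function names and test sequences are hypothetical.

```python
import re

# CURAY written for DNA or RNA input: C, U/T, purine (A/G), A, pyrimidine (C/T/U).
BRANCH_POINT = re.compile(r"C[TU][AG]A[CTU]")


def has_branch_point(intron: str) -> bool:
    """True if a CURAY-like motif occurs anywhere in the sequence."""
    return BRANCH_POINT.search(intron.upper()) is not None


def passes_screen(intron: str, max_length: int = 1000) -> bool:
    """Apply the size criterion and the branch-point criterion together."""
    return len(intron) < max_length and has_branch_point(intron)


# Hypothetical toy sequences.
print(passes_screen("GTAAGTCTAACTTTTGCAG"))  # contains CTAAC
print(passes_screen("GTAAGTGGGGGGGCAG"))     # no CURAY motif
```

A real screen would combine this with the remaining criteria of the original list, which are not reproduced in the text above.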
- Genomic DNA from rice was extracted using the Qiagen DNeasy Plant Mini Kit (Qiagen). Genomic DNA regions containing introns of interest were isolated using conventional PCR. Approximately 0.1 µg of digested genomic DNA was used for the regular PCR reaction (see below). The primers were designed based on the rice genomic sequences. One µL of the diluted digested genomic DNA was used as the DNA template in the primary PCR reaction. The reaction comprised six sets of primers (Table 6) in a mixture containing Buffer 3 following the protocol outlined by an Expand Long PCR kit (Cat #1681-842, Roche-Boehringer Mannheim). The isolated DNA was employed as template DNA in a PCR amplification reaction using the following primers:
- Amplification was carried out in the PCR reaction (5 µL 10X Advantage PCR Mix [Eppendorf], 5 µL genomic DNA [corresponds to approximately 80 ng], 2.5 mM of each dATP, dCTP, dGTP and dTTP [Invitrogen: dNTP mix], 1 µL of 20 µM 5'-intron-specific primer, 1 µL of 20 µM 3'-intron-specific primer, 1 µL TripleMaster DNA Polymerase mix [Eppendorf], in a final volume of 50 µL) under the optimized PCR program (1 cycle with 15 sec at 94°C and 1 min at 80°C; 35 cycles with 15 sec at 94°C, 1 min at 58°C and 1 min at 72°C) run on a thermocycler (T3 Thermocycler, Biometra).
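The final concentrations in the 50 µL reaction follow from the dilution relation C1·V1 = C2·V2. The short calculation below uses only the quantities quoted above (1 µL of 20 µM primer, about 80 ng genomic DNA, 50 µL final volume); the helper name is hypothetical.

```python
def final_concentration(stock_conc: float, stock_volume: float, final_volume: float) -> float:
    """C2 = C1 * V1 / V2, in the units of the stock concentration."""
    return stock_conc * stock_volume / final_volume


# Each primer: 1 µL of a 20 µM stock in a 50 µL reaction.
primer_um = final_concentration(20.0, 1.0, 50.0)

# Template: about 80 ng of genomic DNA in 50 µL.
template_ng_per_ul = 80.0 / 50.0

print(f"primer: {primer_um} µM, template: {template_ng_per_ul} ng/µL")
```

Each primer is therefore present at about 0.4 µM and the template at about 1.6 ng/µL in the final reaction.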
- the PCR product was applied to a 1% (w/v) agarose gel and separated at 80 V.
- the PCR products were excised from the gel and purified with the aid of the Qiagen Gel Extraction Kit (Qiagen, Hilden, Germany).
- the PCR product can be cloned directly into vector pCR4-TOPO (Invitrogen) following the manufacturer's instructions, i.e. the PCR product obtained, with its A overhangs, was inserted into a vector having T overhangs and a topoisomerase.
- the base vector into which the intron candidates were cloned was pBPSMM267.
- This vector comprises the maize ubiquitin promoter with no intronic sequence, followed by multiple cloning sites (MCS) to be used for addition of introns of interest, then the GUSint ORF (including the potato invertase [PIV]2 intron to prevent bacterial expression), followed by the nopaline synthase (NOS) terminator.
- the target tissues for these experiments can be plant tissues (e.g. leaf or root), cultured cells (e.g. maize BMS), or plant tissues (e.g. immature embryos) for Agrobacterium protocols.
- IME-introns: four introns (BPSI.1, 2, 3, and 4) were tested using microprojectile bombardment.
- the maize ubiquitin promoter (Zm. ubiquitin) without any intronic sequence was used as basal expression (negative control).
- Introns of interest were cloned into the 5' UTR region of the Zm.ubiquitin promoter.
- Maize ubiquitin intron was used as a positive control to measure the relative levels of expression enhanced by introns of interest based on GUS expression. Strong enhancement with BPSI.1 and BPSI.2 introns was detected (Table 8).
- BPSI.3 intron showed medium enhancement levels of GUS expression. No expression was detected with BPSI.4 intron.
- GUS histochemical assays: a range of GUS activities (- no expression to ++++ high expression). Relative GUS expression compared to the expression controlled by the maize ubiquitin promoter fused with the Zm.ubiquitin intron.
- 1.6.2 Analysis of IME-intron candidates in stably transformed maize
- the binary vectors pBPSMM350, pBPSMM353, pBPSMM312, and pBPSMM310 were transformed into maize using Agrobacterium-mediated transformation (Example 4.3).
- the levels and patterns of GUS expression controlled by the BPSI.1, BPSI.2, BPSI.3, or BPSI.4 intron were compared with those controlled by the Zm.ubiquitin intron.
- BPSI.1, BPSI.2 and BPSI.3 introns enhanced expression in roots, leaves, and kernels throughout the various development stages at a similar level to that observed in transient assays (Table 9).
- the in silico intron-screening system for identifying introns that have the functional IME comprises three major components: (1) Generate an intron sequence database and screen for intron candidates using the functional IME criteria (indicated in Example 1.3); (2) Define the expression profiles of the candidate genes from which introns were selected; (3) Further examine the selected gene structures by mapping EST sequences onto the genomic region where the candidate genes reside.
- annotated rice and maize genomic sequences were downloaded from NCBI.
- Intron, 5'- and 3'-UTR, promoter and terminator sequences were isolated (in silico) from those annotated genes and their corresponding sequence databases were generated (Table 10, 11).
- From the generated intron sequence database more than 111,800 introns (i.e., 106,049 rice introns, 4,587 maize introns) were screened for potential intron regulatory enhancement elements based on the functional IME criteria (see 1.3). A total of 108 potential intron candidates have been identified, and the protein sequences of the intron candidate genes were retrieved from NCBI.
- the rice (we do not disclose maize sequences) homolog EST sequences were identified from the cDNA libraries described in Example 1 using the BLASTx algorithm (this program compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against protein sequences) at an E-value of 1.0e-20 against those protein sequences.
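Applying the E-value cutoff of 1.0e-20 to BLASTx results can be sketched as a filter over tab-separated hit lines. The text does not say in which format the results were stored; the sketch assumes the standard 12-column tabular layout (query, subject, percent identity, alignment length, mismatches, gap opens, query start, query end, subject start, subject end, E-value, bit score). The function name and the example hits are hypothetical.

```python
def filter_hits(tabular_text: str, max_evalue: float = 1e-20) -> list:
    """Keep (query, subject, evalue) for hits at or below the E-value cutoff.
    Assumes 12-column tab-separated BLAST output; the E-value is column 11."""
    hits = []
    for line in tabular_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject, evalue = fields[0], fields[1], float(fields[10])
        if evalue <= max_evalue:
            hits.append((query, subject, evalue))
    return hits


# Hypothetical hits: one strong match and one weak match.
example = (
    "est_0001\tprot_A\t92.5\t120\t9\t0\t1\t360\t5\t124\t3e-45\t180\n"
    "est_0002\tprot_B\t41.0\t60\t35\t1\t10\t189\t20\t79\t1e-05\t48.1\n"
)
strong_hits = filter_hits(example)
```

Only `est_0001` passes the cutoff in this toy input; the weak match is discarded.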
- the rice variant expression profiling data (see Example 1)
- the introns whose genes were homologous to the rice genes with desirable expression profiles, such as constitutive or tissue-specific expression patterns, were selected as final in silico identified intron candidates for laboratory testing.
- the rice UniGenes, which were derived from the EST sequence assembly, were updated using the combined public rice EST data and the EST data obtained using the databases described in Example 1, and the UniGene expression profiling data were generated using the rice variant expression profiling data over the 23 different libraries described in Example 1.
- the newly updated rice UniGene expression profiling data were used to help select the final 108-intron candidates.
- Perl scripts have been written to isolate intron, 5'- and 3'-UTR, terminator, and promoter sequences from the entire NCBI rice and maize annotated genomic DNA sequences for creating corresponding sequence databases, to screen for functional IME, and to compare the expression profiling data (see Example 5).
- the introns were retrieved from the CDS (coding sequences) features of the annotated genes.
- a total of 106,049 rice introns and 4,587 maize introns have been retrieved (Table 10) from more than 30,000 annotated genes, as summarized in Tables 11 and 12.
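Retrieving introns from the CDS features of an annotated gene comes down to taking the gaps between consecutive CDS segments. The original work used Perl scripts that are not reproduced in the text; the following Python sketch only illustrates the idea, assuming 1-based inclusive coordinates on the forward strand. Function names and the toy gene are hypothetical.

```python
def introns_from_cds(cds_segments: list) -> list:
    """Return the intervals lying between consecutive CDS segments.
    Segments are (start, end) pairs, 1-based inclusive, forward strand."""
    ordered = sorted(cds_segments)
    introns = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:  # a gap between two coding segments
            introns.append((prev_end + 1, next_start - 1))
    return introns


def extract(genomic: str, interval: tuple) -> str:
    """Slice a 1-based inclusive interval out of a genomic sequence."""
    start, end = interval
    return genomic[start - 1:end]


# Hypothetical toy gene: two coding segments separated by a 4 bp gap.
genomic = "ATGGTAGCAA"
intervals = introns_from_cds([(1, 3), (8, 10)])
intron_sequences = [extract(genomic, iv) for iv in intervals]
```

An approach based on CDS features alone finds only introns that interrupt the coding region; introns lying entirely within annotated UTRs would need the mRNA or exon features instead.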
- Genomic DNA from rice was extracted using the Qiagen DNAeasy Plant Mini Kit (Qiagen). Genonic DNA regions containing introns of interest were isolated using conventional PCR. Approximately 0.1 ⁇ g of digested genomic DNA was used for the regular PCR reaction (see below). The primers were designed based on the rice genomic sequences. Five ⁇ l_ of the diluted digested genomic DNA was used as the DNA template in the PCR reaction. PCR was performed using the TripleMaster PCR System (Eppendorf, Hamburg, Germany) as described by the manufacturer. Table 14. Primers used for amplification of widely expressed intron candidates
- Amplification was carried out in the PCR reaction (5 µL 10X Advantage PCR Mix [Eppendorf], 5 µL genomic DNA [corresponds to approximately 80 ng], 2.5 mM of each dATP, dCTP, dGTP and dTTP [Invitrogen: dNTP mix], 1 µL of 20 µM 5'-intron-specific primer, 1 µL of 20 µM 3'-intron-specific primer, 1 µL TripleMaster DNA Polymerase mix [Eppendorf], in a final volume of 50 µL) under the optimized PCR program (1 cycle with 15 sec at 94°C and 1 min at 80°C; 35 cycles with 15 sec at 94°C, 1 min at 58°C and 1 min at 72°C) run on a T3 Thermocycler (Biometra).
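As a quick check of the reaction set-up above, the final primer concentration, the template concentration and the total programmed hold time can be worked out from the stated volumes and cycle steps. This is only an arithmetic sketch; the variable names are illustrative, and ramping time between temperatures is ignored.

```python
def final_concentration(stock_conc: float, stock_volume: float,
                        final_volume: float) -> float:
    """C1 * V1 = C2 * V2, solved for C2."""
    return stock_conc * stock_volume / final_volume


# 1 µL of a 20 µM primer stock in a 50 µL reaction.
primer_final_uM = final_concentration(20.0, 1.0, 50.0)

# About 80 ng of genomic DNA in a 50 µL reaction.
template_ng_per_uL = 80.0 / 50.0

# Program from the text, as (number of cycles, [hold times in seconds]).
program = [
    (1, [15, 60]),        # 15 sec at 94°C, 1 min at 80°C
    (35, [15, 60, 60]),   # 15 sec at 94°C, 1 min at 58°C, 1 min at 72°C
]
total_hold_seconds = sum(cycles * sum(steps) for cycles, steps in program)
total_hold_minutes = total_hold_seconds / 60
```

Under these assumptions each primer ends up at 0.4 µM, and the programmed holds add up to 80 minutes.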
- a QIAspin column was used to purify the PCR products as directed by the manufacturer (Qiagen, Valencia, CA), and the amplified introns were used directly for cloning into expression vectors, as described below.
- the base expression vector for these experiments was pBPSMM305, which comprises the maize lactate dehydrogenase (LDH) promoter without intron driving expression of the GUSint gene followed by the NOS terminator.
- the LDH promoter has been demonstrated to direct undetectable levels of GUS expression by colorimetric staining in the absence of an intron capable of providing IME.
- Intron PCR products were digested with SacI and BamHI and cloned into pBPSMM305 linearized with SacI and BamHI, generating the following LDH:intron:GUS expression vectors. Table 15. GUS chimeric constructs containing introns in the 5' UTR
- Binary vector pBPSLI017 comprises the expression cassette containing the BPSI.5 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB041 into pBPSLM139 linearized with PmeI and PacI.
- Binary vector pBPSLI018 comprises the expression cassette containing the BPSI.6 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB042 into pBPSLM139 linearized with PmeI and PacI.
- Binary vector pBPSLI019 comprises the expression cassette containing the BPSI.7 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB043 into pBPSLM139 linearized with PmeI and PacI.
- Binary vector pBPSLI020 comprises the expression cassette containing the BPSI.8 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB044 into pBPSLM139 linearized with PmeI and PacI.
- Binary vector pBPSLI021 comprises the expression cassette containing the BPSI.9 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB045 into pBPSLM139 linearized with PmeI and PacI.
- Binary vector pBPSLI022 comprises the expression cassette containing the BPSI.10 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB046 into pBPSLM139 linearized with PmeI and PacI.
- Binary vector pBPSLI023 comprises the expression cassette containing the BPSI.11 intron and was generated by ligating the PmeI-PacI fragment from pBPSJB050 into pBPSLM139 linearized with PmeI and PacI.
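The seven binary vectors listed above follow one pattern, so they can be kept in a small lookup table. The sketch below records only the plasmid names given in the text; the `Construct` type and the `describe` helper are hypothetical conveniences.

```python
from typing import Dict, NamedTuple


class Construct(NamedTuple):
    source_plasmid: str  # plasmid supplying the PmeI-PacI fragment
    binary_vector: str   # resulting binary vector


BACKBONE = "pBPSLM139"
ENZYMES = ("PmeI", "PacI")

CONSTRUCTS: Dict[str, Construct] = {
    "BPSI.5": Construct("pBPSJB041", "pBPSLI017"),
    "BPSI.6": Construct("pBPSJB042", "pBPSLI018"),
    "BPSI.7": Construct("pBPSJB043", "pBPSLI019"),
    "BPSI.8": Construct("pBPSJB044", "pBPSLI020"),
    "BPSI.9": Construct("pBPSJB045", "pBPSLI021"),
    "BPSI.10": Construct("pBPSJB046", "pBPSLI022"),
    "BPSI.11": Construct("pBPSJB050", "pBPSLI023"),
}


def describe(intron: str) -> str:
    """Summarize how the binary vector for a given intron was assembled."""
    c = CONSTRUCTS[intron]
    return (f"{c.binary_vector}: {ENZYMES[0]}-{ENZYMES[1]} fragment of "
            f"{c.source_plasmid} in {BACKBONE} ({intron})")
```

Note that the source plasmid numbering is not contiguous: BPSI.11 comes from pBPSJB050, not pBPSJB047.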
- the target tissues for these experiments can be plant tissues (e.g. leaf or root), cultured cells (e.g. maize BMS), or plant embryos (e.g. immature embryos) for Agrobacterium protocols. Characterization of these introns for their ability to direct IME in conjunction with the LDH promoter was undertaken via transient expression by bombardment of expression vectors into maize leaf tissue and liquid-cultured BMS cells, respectively.
- the maize lactate dehydrogenase promoter (ZmLDH) without any intronic sequence was used to establish basal expression (negative control).
- Introns of interest were cloned into the 5' UTR region of the ZmLDH promoter.
- Maize ubiquitin intron was used as a positive control to measure the relative levels of expression enhanced by introns of interest based on GUS expression. Due to the very low background (no detectable GUS expression) of the ZmLDH promoter in the absence of an intron, the presence of any GUS staining indicates that a particular intron is capable of providing IME.
- BPSI.10 and BPSI.11 introns consistently yielded the highest GUS expression, at a level comparable to the LDH::Zm.ubiquitin intron construct.
- BPSI.5, BPSI.6, and BPSI.7 introns consistently resulted in an intermediate level of GUS expression, between LDH alone and LDH::Zm.ubiquitin intron. Comparable results were obtained in maize leaves and BMS cells, indicating that the tested introns confer IME in green and non-green tissues (Table 16).
- GUS histochemical assays: a range of GUS activities (-: no expression, to ++++: high expression); ND: not determined.
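The scoring legend and the decision rule stated above (any GUS staining over the intron-less ZmLDH background indicates IME) can be expressed as a tiny helper. This is an illustrative sketch; the function names are hypothetical and no values from Table 16 are reproduced here.

```python
from typing import Optional

# Ordinal scale from the legend: "-" (no expression) to "++++" (high expression).
SCALE = ["-", "+", "++", "+++", "++++"]


def rank(score: str) -> Optional[int]:
    """Map a staining score to 0..4, or None when not determined (ND)."""
    if score == "ND":
        return None
    return SCALE.index(score)


def shows_ime(score: str) -> Optional[bool]:
    """Any detectable staining counts as IME; ND stays undecided."""
    r = rank(score)
    return None if r is None else r > 0
```

Because the intron-less promoter gives no detectable staining, a single "+" is already enough to call an intron IME-positive under this rule.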
- the in silico intron screening system for identifying introns located in the 5' UTR that have functional IME comprises three major components: (1) genome mapping of the entire rice CDS, released by The Institute for Genomic Research on October 2, 2003, and of the EST sequence collections; (2) identification and selection of the introns located in the 5' UTR using both the functional IME criteria and the rice cDNA clone distribution profiles; (3) validation of the selected 5' UTR introns by examining the sequence alignments among the genomic DNA, CDS and ESTs, the gene model, the sequence reading frame and the intron splicing sites. A total of 56,056 annotated rice CDS were mapped onto the Japonica rice genome; both the rice CDS and the genomic DNA sequences were obtained from The Institute for Genomic Research.
- extension of the EST sequence alignment beyond the CDS identifies the 5' UTRs, which are not contained in the CDS but are present in the EST sequences.
- the system selects those EST sequences that extend the sequence alignment beyond the CDS along the genome by up to 5 kb for 5' UTR intron screening.
- the last exon in the predicted 5' UTR region must align at the same position as the first exon of the CDS.
- the genome mapping results identified 461 genes whose 5' UTR contains at least one intron.
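The screening rules above can be sketched as a small function over alignment coordinates. This is one possible reading, not the system described in the text: it assumes forward-strand, 1-based inclusive coordinates, treats "aligns at the same position" as "an EST block covers the CDS start", and uses hypothetical names and toy coordinates throughout.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive genomic coordinates
MAX_EXTENSION = 5000        # EST may extend up to 5 kb upstream of the CDS


def utr5_introns(est_blocks: List[Interval], cds_first_exon: Interval,
                 max_extension: int = MAX_EXTENSION) -> List[Interval]:
    """Return introns lying in the 5' UTR implied by an EST alignment."""
    blocks = sorted(est_blocks)
    cds_start, _ = cds_first_exon
    if not blocks or blocks[0][0] >= cds_start:
        return []  # the EST does not extend upstream of the CDS
    if cds_start - blocks[0][0] > max_extension:
        return []  # extension exceeds the 5 kb window
    if not any(start <= cds_start <= end for start, end in blocks):
        return []  # no EST block is anchored on the first CDS exon
    introns = []
    for (_, prev_end), (next_start, _) in zip(blocks, blocks[1:]):
        # keep only gaps that end at or before the CDS start
        if next_start > prev_end + 1 and next_start <= cds_start:
            introns.append((prev_end + 1, next_start - 1))
    return introns


# Toy coordinates for illustration only.
found = utr5_introns([(100, 180), (600, 900)], (650, 900))
none_upstream = utr5_introns([(650, 900), (1000, 1100)], (650, 900))
too_far = utr5_introns([(100, 200), (7000, 7300)], (7050, 7300))
```

In the first toy case the EST starts 550 bp upstream of the CDS and is split once, so a single 5' UTR intron is reported; the other two cases are rejected.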
- Genomic DNA containing introns of interest is isolated using conventional PCR amplification with sequence-specific primers (see 1.4), followed by cloning into a PCR cloning vector known in the art.
- Introns are PCR amplified from rice genomic DNA using primers that engineer a SacI site on the 5' end of the intron and a BamHI site on the 3' end of the sequence.
- the PCR products are digested with SacI and BamHI and ligated into pBPSMM305 linearized with SacI and BamHI to generate pUC-based expression vectors comprising the Zm.
- Binary vectors for stable maize transformation are constructed by digesting the pUC expression vectors with PmeI and PacI and ligating into pBPSLM139 digested with PmeI and PacI.
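The primer design mentioned above, with a SacI site added at the 5' end of the intron and a BamHI site at the 3' end, can be sketched as follows. The recognition sequences (SacI: GAGCTC, BamHI: GGATCC) are standard; the annealing length, the two-base 5' clamp and the toy intron are assumptions made for illustration and are not taken from Table 14.

```python
SACI = "GAGCTC"   # SacI recognition site
BAMHI = "GGATCC"  # BamHI recognition site

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    return seq.translate(_COMPLEMENT)[::-1]


def has_internal_site(intron: str) -> bool:
    """True if the intron itself would be cut by SacI or BamHI."""
    upper = intron.upper()
    # Both sites are palindromic, so checking one strand is sufficient.
    return SACI in upper or BAMHI in upper


def tailed_primers(intron: str, anneal_len: int = 20, clamp: str = "GC"):
    """Return (forward, reverse) primers carrying SacI and BamHI tails."""
    forward = clamp + SACI + intron[:anneal_len]
    reverse = clamp + BAMHI + reverse_complement(intron[-anneal_len:])
    return forward, reverse


# Toy intron with GT...AG borders (illustrative only).
toy_intron = "GTAAGT" + "A" * 30 + "TTGCAG"
fwd, rev = tailed_primers(toy_intron, anneal_len=6)
```

Checking for internal sites first matters because an intron containing GAGCTC or GGATCC would be cut during the SacI/BamHI digest.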
- the target tissues for these experiments can be plant tissues (e.g. leaf or root), cultured cells (e.g. maize BMS), or plant embryos (e.g. immature embryos) for Agrobacterium protocols.
- EXAMPLE 4 Assays for identifying IME-introns
- these experiments are performed by bombardment of plant tissues or cultured cells (Example 4.1), by PEG-mediated (or similar methodology) introduction of DNA into plant protoplasts (Example 4.2), or by Agrobacterium-mediated transformation (Example 4.3).
- the target tissue for these experiments can be plant tissues (e.g. leaf tissue), cultured plant cells (e.g. maize Black Mexican Sweetcorn (BMS)), or plant embryos for Agrobacterium protocols.
- 4.1 Transient assay using microprojectile bombardment
- the plasmid constructs are isolated using a Qiagen plasmid kit (cat# 12143). DNA is precipitated onto 0.6 µm gold particles (Bio-Rad cat# 165-2262) according to the protocol described by Sanford et al. (1993) and accelerated onto target tissues (e.g. two-week-old maize leaves, BMS cultured cells, etc.) using a PDS-1000/He system device (Bio-Rad). All DNA precipitation and bombardment steps are performed under sterile conditions at room temperature.
- BMS cell culture liquid medium [Murashige and Skoog (MS) salts (4.3 g/L), 3% (w/v) sucrose, myo-inositol (100 mg/L), 3 mg/L 2,4-dichlorophenoxyacetic acid (2,4-D), casein hydrolysate (1 g/L), thiamine (10 mg/L) and L-proline (1.15 g/L), pH 5.8]. Every week 10 mL of a culture of stationary cells are transferred to 40 mL of fresh medium and cultured on a rotary shaker operated at 110 rpm at 27°C in a 250 mL flask.
- 60 mg of gold particles in a siliconized Eppendorf tube are resuspended in 100% ethanol followed by centrifugation in a Mini centrifuge C1200 (National Labnet Co., Woodbridge, NJ) for 30 seconds.
- the pellet is rinsed once in 100% ethanol and twice in sterile water with centrifugation after each wash.
- the pellet is finally resuspended in 1 mL sterile 50% glycerol.
- the gold suspension is then divided into 50 µL aliquots and stored at 4°C.
- the following reagents are added to one aliquot: 5 µL of 1 µg/µL total DNA, 50 µL 2.5 M CaCl2, 20 µL 0.1 M spermidine, free base.
- the DNA solution is vortexed for 1 minute and placed at -80°C for 3 min followed by centrifugation for 10 seconds in a Mini centrifuge C1200. The supernatant is removed. The pellet is carefully resuspended in 1 mL 100% ethanol by flicking the tube followed by centrifugation for 10 seconds. The supernatant is removed and the pellet is carefully resuspended in 50 µL of 100% ethanol and placed at -80°C until used (30 min to 4 hr prior to bombardment). If gold aggregates are visible in the solution, the tubes are sonicated for one second in a waterbath sonicator just prior to use.
- two-week-old maize leaves are cut into pieces approximately 1 cm in length and placed adaxial side up on osmotic induction medium M-N6-702 [N6 salts (3.96 g/L), 3% (w/v) sucrose, 1.5 mg/L 2,4-dichlorophenoxyacetic acid (2,4-D), casein hydrolysate (100 mg/L), and L-proline (2.9 g/L), MS vitamin stock solution (1 mL/L), 0.2 M mannitol, 0.2 M sorbitol, pH 5.8]. The pieces are incubated for 1-2 hours.
- in the case of BMS cultured cells, one-week-old suspension cells are pelleted at 1000 g in a Beckman/Coulter Avanti J25 centrifuge and the supernatant is discarded. Cells are placed onto round ash-free No 42 Whatman filters as a 1/16 inch thick layer using a spatula. The filter papers holding the plant materials are placed on osmotic induction media at 27°C in darkness for 1-2 hours prior to bombardment. Just before bombardment the filters are removed from the medium and placed onto a stack of sterile filter paper to allow the callus surface to partially dry.
- Each plate is shot with 6 µL of gold-DNA solution twice, at 1,800 psi for the leaf materials and at 1,100 psi for the BMS cultured cells.
- a sterilized wire mesh screen is laid on top of the sample.
- the filters holding the samples are transferred onto M-N6-702 medium lacking mannitol and sorbitol and incubated for 2 days in darkness at 27°C prior to transient assays.
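The quantities in the gold preparation and bombardment steps above imply a per-shot dose of gold and DNA, which the following arithmetic sketch works out. It assumes that all DNA binds to the gold and that nothing is lost during the washes; the variable names are illustrative.

```python
GOLD_MG = 60.0            # gold resuspended in 1 mL of 50% glycerol
STOCK_VOLUME_UL = 1000.0
ALIQUOT_UL = 50.0         # volume of one gold aliquot
DNA_UG = 5.0 * 1.0        # 5 µL of 1 µg/µL DNA added per aliquot
FINAL_VOLUME_UL = 50.0    # final resuspension in 100% ethanol
SHOT_UL = 6.0             # volume loaded per shot
SHOTS_PER_PLATE = 2

gold_per_aliquot_mg = GOLD_MG * ALIQUOT_UL / STOCK_VOLUME_UL
fraction_per_shot = SHOT_UL / FINAL_VOLUME_UL
gold_per_shot_mg = gold_per_aliquot_mg * fraction_per_shot
dna_per_shot_ug = DNA_UG * fraction_per_shot
dna_per_plate_ug = dna_per_shot_ug * SHOTS_PER_PLATE
max_shots_per_aliquot = int(FINAL_VOLUME_UL // SHOT_UL)
```

Under these assumptions each shot delivers about 0.36 mg of gold carrying about 0.6 µg of DNA, and one aliquot is enough for eight shots, that is, four plates.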
- Transient expression levels of the reporter genes are determined by GUS staining, quantification of luminescence or RT-PCR using the protocols in the art.
- GUS staining is done by incubating the plant materials in GUS solution [100 mM NaHPO4, 10 mM EDTA, 0.05% Triton X100, 0.025% X-Gluc solution (5-bromo-4-chloro-3-indolyl-beta-D-glucuronic acid dissolved in DMSO), 10% methanol, pH 7.0] at 37°C for 16-24 hours. Plant tissues are vacuum-infiltrated 2 times for 15 minutes to aid even staining.
- Transient expression levels of the reporter genes are determined by staining, enzyme assays or RT-PCR using protocols known in the art.
- Isolation of protoplasts is conducted by following the protocol developed by Sheen (1990). Maize seedlings are kept in the dark at 25°C for 10 days and illuminated for 20 hours before protoplast preparation. The middle part of the leaves is cut into 0.5 mm strips (about 6 cm in length) and incubated in an enzyme solution containing 1% (w/v) cellulase RS, 0.1% (w/v) macerozyme R10 (both from Yakult Honsha, Nishinomiya, Japan), 0.6 M mannitol, 10 mM Mes (pH 5.7), 1 mM CaCl2, 1 mM MgCl2, 10 mM β-mercaptoethanol, and 0.1% BSA (w/v) for 3 hr at 23°C followed by gentle shaking at 80 rpm for 10 min to release protoplasts.
- Protoplasts are collected by centrifugation at 100 x g for 2 min, washed once in cold 0.6 M mannitol solution, centrifuged, and resuspended in cold 0.6 M mannitol (2 x 10^6/mL).
- a total of 50 µg plasmid DNA in a total volume of 100 µL sterile water is added to 0.5 mL of a suspension of maize protoplasts (1 x 10^6 cells/mL) and mixed gently.
- 0.5 mL PEG solution (40% PEG 4,000, 100 mM CaNO3, 0.5 M mannitol) is added and pre-warmed at 70°C with gentle shaking, followed by addition of 4.5 mL MM solution (0.6 M mannitol, 15 mM MgCl2, and 0.1% MES). This mixture is incubated for 15 minutes at room temperature.
- the protoplasts are washed twice by pelleting at 600 rpm for 5 min and resuspending in 1.0 mL of MMB solution [0.6 M mannitol, 4 mM Mes (pH 5.7), and brome mosaic virus (BMV) salts (optional)]. After the final wash step, the protoplasts are collected in 3 mL MMB medium and incubated in the dark at 25°C for 48 hr. Transient expression levels of the reporter gene are determined by quantification of reporter gene expression or by RT-PCR using protocols known in the art, in order to identify potential intron candidates that function in intron-mediated enhancement.
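The volumes given for the PEG-mediated transformation above determine the DNA dose per cell and the effective PEG concentration. The sketch below is plain dilution arithmetic; it assumes the volumes are simply additive and that the PEG solution is added before the MM solution, and the variable names are illustrative.

```python
PROTOPLAST_VOLUME_ML = 0.5
PROTOPLAST_DENSITY_PER_ML = 1e6
DNA_UG = 50.0
DNA_VOLUME_ML = 0.1
PEG_VOLUME_ML = 0.5
PEG_STOCK_PERCENT = 40.0
MM_VOLUME_ML = 4.5

cells = PROTOPLAST_VOLUME_ML * PROTOPLAST_DENSITY_PER_ML
dna_ug_per_million_cells = DNA_UG / (cells / 1e6)

# PEG concentration right after adding the PEG solution.
volume_with_peg_ml = PROTOPLAST_VOLUME_ML + DNA_VOLUME_ML + PEG_VOLUME_ML
peg_percent = PEG_STOCK_PERCENT * PEG_VOLUME_ML / volume_with_peg_ml

# PEG concentration after dilution with MM solution.
peg_percent_after_mm = (PEG_STOCK_PERCENT * PEG_VOLUME_ML
                        / (volume_with_peg_ml + MM_VOLUME_ML))
```

With these numbers, 5 x 10^5 protoplasts receive 50 µg of DNA, PEG reaches roughly 18% on addition and drops to roughly 3.6% once the MM solution is added.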
- Agrobacterium tumefaciens (strain C58C1 pGV2260) is transformed with the various vector constructs described above.
- the Agrobacterial strains are subsequently used to generate transgenic plants.
- a single transformed Agrobacterium colony is incubated overnight at 28°C in a 4 mL culture (medium: YEB medium with 50 µg/mL kanamycin and 25 µg/mL rifampicin). This culture is subsequently used to inoculate a 400 mL culture in the same medium, and this is incubated overnight (28°C, 220 rpm) and spun down (GSA rotor, 8,000 rpm, 20 min).
- the pellet is resuspended in infiltration medium (1/2 MS medium; 0.5 g/L MES, pH 5.8; 50 g/L sucrose).
- the suspension is introduced into a plant box (Duchefa), and 100 ml of SILWET L-77 (heptamethyltrisiloxane modified with polyalkylene oxide; Osi Specialties Inc., Cat. P030196) is added to a final concentration of 0.02%.
- the plant box with 8 to 12 plants is exposed to a vacuum for 10 to 15 minutes, followed by spontaneous aeration. This is repeated two or three times. Thereupon, all plants are planted into flowerpots with moist soil and grown under long-day conditions (daytime temperature 22 to 24°C, nighttime temperature 19°C; relative atmospheric humidity 65%). The seeds are harvested after 6 weeks.
- transgenic Arabidopsis plants can be obtained by root transformation.
- White root shoots of plants with a maximum age of 8 weeks are used.
- plants that are kept under sterile conditions in 1 MS medium (1% sucrose; 100 mg/L inositol; 1.0 mg/L thiamine; 0.5 mg/L pyridoxine; 0.5 mg/L nicotinic acid; 0.5 g MES, pH 5.7; 0.8% agar) are used.
- Roots are grown on callus-inducing medium for 3 days (1x Gamborg's B5 medium; 2% glucose; 0.5 g/L mercaptoethanol; 0.8% agar; 0.5 mg/L 2,4-D (2,4-dichlorophenoxyacetic acid); 0.05 mg/L kinetin). Root sections 0.5 cm in length are transferred into 10 to 20 mL of liquid callus-inducing medium (composition as described above, but without agar supplementation), inoculated with 1 mL of the above-described overnight Agrobacterium culture (grown at 28°C, 200 rpm in LB) and shaken for 2 minutes.
- the root explants are transferred to callus-inducing medium with agar, subsequently to callus-inducing liquid medium without agar (with 500 mg/L betabactyl, SmithKline Beecham Pharma GmbH, Munich), incubated with shaking and finally transferred to shoot-inducing medium (5 mg/L 2-isopentenyladenine phosphate; 0.15 mg/L indole-3-acetic acid; 50 mg/L kanamycin; 500 mg/L betabactyl).
- the small green shoots are transferred to germination medium (1 MS medium; 1% sucrose; 100 mg/L inositol; 1.0 mg/L thiamine; 0.5 mg/L pyridoxine; 0.5 mg/L nicotinic acid; 0.5 g MES, pH 5.7; 0.8% agar) and regenerated into plants.
- the Agrobacterium-mediated plant transformation using standard transformation and regeneration techniques may also be carried out for the purposes of transforming crop plants (Gelvin & Schilperoort (1995) Plant Molecular Biology Manual, 2nd Edition, Dordrecht: Kluwer Academic Publ. ISBN 0-7923-2731-4; Glick & Thompson (1993) Methods in Plant Molecular Biology and Biotechnology, Boca Raton: CRC Press, ISBN 0-8493-5164-2).
- oilseed rape can be transformed by cotyledon or hypocotyl transformation (Moloney (1989) Plant Cell Reports 8: 238-242).
- the antibiotics used for the selection of agrobacteria and plants depend on the binary vector and the Agrobacterium strain used for the transformation.
- the selection of oilseed rape is generally carried out using kanamycin as selectable plant marker.
- the Agrobacterium-mediated gene transfer in linseed can be carried out using, for example, a technique described by Mlynarova (1994) Plant Cell Report 13:282-285.
- the transformation of soybean can be carried out using, for example, a technique described in EP A1 0 424 047 or in EP A1 0 397 687, US 5,376,543, US 5,169,770.
- the transformation of maize or other monocotyledonous plants can be carried out using, for example, a technique described in US 5,591,616.
- the transformation of plants using particle bombardment, polyethylene glycol-mediated DNA uptake or the silicon carbide fiber technique is described, for example, by Freeling & Walbot (1993) 'The maize handbook', ISBN 3-540-97826-7, Springer Verlag New York.
- EXAMPLE 5 Computer algorithm for retrieving sequence information from NCBI GenBank files.
- the target feature keys are intron, terminator, promoter, UTR.
- the following script (written in the computer language Perl) gives an example of a computer algorithm of the invention suitable for identifying suitable intron sequences based on database information (see also Fig. 5a-f):
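The Perl script itself is not reproduced in this text. As a rough stand-in, the sketch below shows in Python how the feature table of a GenBank flat file could be scanned for the target feature keys. It is not the algorithm of Fig. 5a-f; it assumes single-line locations, relies on the flat-file convention that feature keys are indented by five spaces, and uses a made-up record for illustration.

```python
import re
from typing import List, Set, Tuple

# Feature keys of interest; the UTR keys are written 5'UTR and 3'UTR in GenBank.
TARGET_KEYS = {"intron", "terminator", "promoter", "5'UTR", "3'UTR"}

# A feature line starts with exactly five spaces, then the key, then the location.
FEATURE_LINE = re.compile(r"^ {5}(\S+)\s+(\S+)")


def scan_features(genbank_text: str,
                  targets: Set[str] = TARGET_KEYS) -> List[Tuple[str, str]]:
    """Return (feature key, location) pairs for the requested feature keys."""
    found = []
    in_features = False
    for line in genbank_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if line.startswith("ORIGIN") or line.startswith("//"):
            in_features = False
            continue
        if not in_features:
            continue
        match = FEATURE_LINE.match(line)  # qualifier lines are indented deeper
        if match and match.group(1) in targets:
            found.append((match.group(1), match.group(2)))
    return found


# Made-up record for illustration only.
SAMPLE = "\n".join([
    "LOCUS       DEMO",
    "FEATURES             Location/Qualifiers",
    "     source          1..60",
    "     promoter        1..10",
    "     5'UTR           11..20",
    "     intron          15..18",
    "                     /number=1",
    "     CDS             join(21..30,41..50)",
    "ORIGIN",
    "        1 acgtacgtac",
    "//",
])
features = scan_features(SAMPLE)
```

Locations that wrap over several lines, as long `join(...)` expressions do, would need to be concatenated before parsing; the sketch does not handle that case.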
- EXAMPLE 6 Expression of tissue-specific promoters in combination with IME-introns
- BPSI.1 and BPSI.5 were fused with various monocot promoters; most of these promoters showed no GUS expression without an IME-intron, whereas the IME-introns enhanced expression.
- pBPSMM355 shows strong leaf-specific expression. This expression was detected in all tested developmental stages. No expression was detected in any other tissue tested.
- pBPSMM370 is strongly expressed in roots. Significant expression was also detected in silk and in the outermost layers of the kernel that include the aleurone layer and seed coat. This expression was strongest around the base of the kernel. Staining in silk was strongest in the region close to the attachment point with the kernel and was detected at very early developmental stages.
- CCoAMTI promoter::BPSI.1 intron::GUS::NOS terminator
- Os.Caffeoyl-CoA-O-methyltransferase (CCoAMTI) promoter in combination with BPSI.1 (pBPSMM358) showed embryo-specific expression in T1 and T2 kernels. The expression level was low but very specific. No expression was detected in any other tissue tested.
- EXS1025 is strongly expressed in the embryo. This expression starts between 5 days after pollination (DAP) and 10 DAP. Expression is strongest in the scutellum and weaker in the embryo axis (plumule with leaves and internodes, primary root).
- pBPSMM369 is strongly expressed in roots. This expression was detected in all tested stages. Significant expression was also detected in all parts of the kernels and in pollen. Weak expression was detected in the leaves at early developmental stages and at flowering. This expression was variable in strength and was in several plants at the detection limit. In general, expression was higher in homozygous T1 plants than in the heterozygous T0.
- 6.6 Zm.LDH promoter::BPSI.1 intron::GUS::NOS terminator (pBPSMM357)
- pBPSMM357 shows weak activity in kernels. Expression in kernels was mainly located in and around the embryo. Very weak expression was also detected in roots.
- Os.C8,7SI promoter::BPSI.1 intron::GUS::NOS terminator pBPSMM366
- Os.C-8,7-sterol-isomerase promoter containing BPSI.1 shows weak activity in roots and good expression in kernels.
- Os.Lea promoter in combination with BPSI.1 (pBPSMM371) showed strong embryo-specific expression in kernels. Some expression could be detected in root tips but no expression was detected in any other tissue tested.
- 6.9 Zm.LDH promoter::BPSI.5 intron::GUS::NOS terminator (pBPSLM229)
- pBPSLM229 shows weak expression in endosperm and aleurone layer, mainly at the top side of the kernel. No expression was detected in any other tissue tested.
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BRPI0609283-7A BRPI0609283A2 (en) | 2005-03-08 | 2006-03-07 | methods for identifying an intron with plant expression enhancing properties, to enrich the number of introns with plant expression enhancing properties, to isolate, supply or produce an intron with plant expression enhancing properties, to provide an expression cassette and to enhance expression of a nucleic acid sequence in a plant or a plant cell, computer algorithm, computer device or data storage device, recombinant DNA expression construct, expression vector, transgenic cell, or nonhuman organism transgenic material, cell culture, propagating parts or material, and, use of a transgenic organism or cell cultures, transgenic propagating material parts derived from these |
AU2006222012A AU2006222012B2 (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
CA002599405A CA2599405A1 (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
US11/885,988 US8088971B2 (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
EP06708664A EP1859037A2 (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
CN2006800076447A CN101137752B (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
US13/241,493 US8759506B2 (en) | 2005-03-08 | 2011-09-23 | Expression enhancing intron sequences |
US14/270,814 US20140237683A1 (en) | 2005-03-08 | 2014-05-06 | Expression enhancing intron sequences |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US65948205P | 2005-03-08 | 2005-03-08 | |
US60/659,482 | 2005-03-08 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/885,988 A-371-Of-International US8088971B2 (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
US13/241,493 Division US8759506B2 (en) | 2005-03-08 | 2011-09-23 | Expression enhancing intron sequences |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2006094976A2 (en) | 2006-09-14 |
WO2006094976A3 (en) | 2006-12-28 |
Family
ID=36593798
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2006/060513 WO2006094976A2 (en) | 2005-03-08 | 2006-03-07 | Expression enhancing intron sequences |
Country Status (9)
Country | Link |
---|---|
US (3) | US8088971B2 (en) |
EP (7) | EP2045327B8 (en) |
CN (2) | CN102925479A (en) |
AT (1) | ATE541043T1 (en) |
AU (1) | AU2006222012B2 (en) |
BR (1) | BRPI0609283A2 (en) |
CA (1) | CA2599405A1 (en) |
WO (1) | WO2006094976A2 (en) |
ZA (1) | ZA200708512B (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008099013A1 (en) | 2007-02-16 | 2008-08-21 | Basf Plant Science Gmbh | Nucleic acid sequences for regulation of embryo-specific expression in monocotyledonous plants |
WO2010122110A1 (en) | 2009-04-22 | 2010-10-28 | Basf Plant Science Company Gmbh | Whole seed specific promoter |
US8129588B2 (en) | 2004-04-20 | 2012-03-06 | Syngenta Participations Ag | Regulatory sequences for expressing gene products in plant reproductive tissue |
WO2012127373A1 (en) | 2011-03-18 | 2012-09-27 | Basf Plant Science Company Gmbh | Promoters for regulating expression in plants |
US20130145502A1 (en) * | 2010-06-09 | 2013-06-06 | Pioneer Hi-Bred International, Inc. | Regulatory sequences for modulating transgene expression in plants |
US8673631B2 (en) | 2006-02-17 | 2014-03-18 | Monsanto Technology Llc | Chimeric regulatory sequences comprising introns for plant gene expression |
US9862977B2 (en) | 2011-10-19 | 2018-01-09 | Massachusetts Institute Of Technology | Engineered microbes and methods for microbial oil production |
EP3760726A1 (en) * | 2010-01-14 | 2021-01-06 | Monsanto Technology LLC | Plant regulatory elements and uses thereof |
WO2021048316A1 (en) * | 2019-09-12 | 2021-03-18 | Basf Se | Regulatory nucleic acid molecules for enhancing gene expression in plants |
Families Citing this family (53)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2006275753B2 (en) | 2005-07-29 | 2012-01-12 | Targeted Growth, Inc. | Dominant negative mutant KRP protein protection of active cyclin-CDK complex inhibition by wild-type KRP |
US8832581B2 (en) * | 2009-03-05 | 2014-09-09 | Ming Zhang | Gene expression browser for web-based search and visualization of characteristics of gene expression |
WO2011023539A1 (en) * | 2009-08-31 | 2011-03-03 | Basf Plant Science Company Gmbh | Regulatory nucleic acid molecules for enhancing seed-specific and/or seed-preferential gene expression in plants |
WO2012065166A2 (en) * | 2010-11-12 | 2012-05-18 | Targeted Growth, Inc. | Dominant negative mutant kip-related proteins (krp) in zea mays and methods of their use |
CN102154289B (en) * | 2011-01-11 | 2012-08-22 | 吉林大学 | Corn drought inducible gene promoters and activity analysis thereof |
WO2012142116A2 (en) | 2011-04-11 | 2012-10-18 | Targeted Growth, Inc. | Identification and use of krp mutants in wheat |
US9556448B2 (en) | 2011-04-11 | 2017-01-31 | Targeted Growth, Inc. | Identification and the use of KRP mutants in plants |
WO2014004638A2 (en) * | 2012-06-29 | 2014-01-03 | Monsanto Technology Llc | Methods and compositions for enhancing gene expression |
BR112015023709A2 (en) | 2013-03-15 | 2017-07-18 | Pioneer Hi Bred Int | phi-4 polypeptide, polynucleotide, composition, method for inhibiting growth, method for controlling a population, plant, seed, expression cassette, method for expressing a polynucleotide in a plant, method for protecting a plant, fusion protein |
US10006045B2 (en) | 2013-08-16 | 2018-06-26 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
BR102014021330A2 (en) * | 2013-08-30 | 2015-09-22 | Dow Agrosciences Llc | constructs for transgene expression using panicum ubiquitin gene regulatory elements |
CA3175984A1 (en) | 2013-09-13 | 2015-03-19 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
CN103555716B (en) * | 2013-11-06 | 2016-01-20 | 北京大北农科技集团股份有限公司 | Intron sequences of Enhanced expressing and uses thereof |
CA2939156A1 (en) | 2014-02-07 | 2015-08-13 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
CN106232620B (en) | 2014-02-07 | 2022-05-13 | 先锋国际良种公司 | Insecticidal proteins and methods of use thereof |
CN104318131A (en) * | 2014-10-11 | 2015-01-28 | 中国农业科学院棉花研究所 | Electric FISH (fluorescence in situ hybridization) method |
CA2963558C (en) | 2014-10-16 | 2023-04-04 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
CA2977026A1 (en) | 2015-03-11 | 2016-09-15 | E.I. Du Pont De Nemours And Company | Insecticidal combinations of pip-72 and methods of use |
BR112017024948A2 (en) | 2015-05-19 | 2018-07-31 | Pioneer Hi Bred Int | insecticide proteins and methods for their use |
EP3331352B1 (en) | 2015-08-06 | 2022-07-06 | Pioneer Hi-Bred International, Inc. | Plant derived insecticidal proteins and methods for their use |
CN108575091A (en) | 2015-12-18 | 2018-09-25 | 先锋国际良种公司 | insecticidal proteins and methods of use thereof |
AU2017249365B2 (en) * | 2016-04-13 | 2023-04-20 | BASF Agricultural Solutions Seed US LLC | Seed- and funiculus-preferential promoters and uses thereof |
MX2018013249A (en) | 2016-05-04 | 2019-02-13 | Pioneer Hi Bred Int | Insecticidal proteins and methods for their use. |
MX2018015906A (en) | 2016-07-01 | 2019-04-04 | Pioneer Hi Bred Int | Insecticidal proteins from plants and methods for their use. |
WO2018111551A1 (en) | 2016-12-14 | 2018-06-21 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
WO2018118811A1 (en) | 2016-12-22 | 2018-06-28 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
WO2018148001A1 (en) | 2017-02-08 | 2018-08-16 | Pioneer Hi-Bred International Inc | Insecticidal combinations of plant derived insecticidal proteins and methods for their use |
CN110621780B (en) | 2017-05-11 | 2024-03-19 | 先锋国际良种公司 | Insecticidal proteins and methods of use thereof |
CA3091439A1 (en) * | 2018-02-21 | 2019-08-29 | Nemametrix Inc. | Transgenic animal phenotyping platform and uses thereof |
WO2019169150A1 (en) | 2018-03-02 | 2019-09-06 | Pioneer Hi-Bred International, Inc. | Plant health assay |
WO2019178042A1 (en) | 2018-03-14 | 2019-09-19 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins from plants and methods for their use |
US11820791B2 (en) | 2018-03-14 | 2023-11-21 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins from plants and methods for their use |
CN111902547B (en) | 2018-03-23 | 2024-06-14 | 先锋国际良种公司 | Methods for identifying, selecting and producing disease-resistant crops |
CN111988988A (en) | 2018-04-18 | 2020-11-24 | 先锋国际良种公司 | Method for identifying, selecting and producing bacterial blight resistant rice |
CN108546670A (en) * | 2018-04-19 | 2018-09-18 | 贵州大学 | A kind of method for transformation of the preparation method of Sorghum Protoplast and the protoplast of preparation |
CN108642064B (en) * | 2018-05-21 | 2021-11-26 | 安徽农业大学 | Wheat seed dormancy duration gene TaCNGC-2A and functional marker thereof |
MX2020013154A (en) | 2018-06-06 | 2021-07-16 | Univ Huazhong Agricultural | Methods of identifying, selecting, and producing southern corn rust resistant crops. |
US20210301285A1 (en) * | 2018-08-02 | 2021-09-30 | Novozymes A/S | Preparation of Combinatorial Libraries of DNA Constructs |
MX2021002290A (en) | 2018-08-29 | 2021-04-28 | Pioneer Hi Bred Int | Insecticidal proteins and methods for their use. |
AU2019346655A1 (en) * | 2018-09-28 | 2021-05-06 | Voyager Therapeutics, Inc. | Frataxin expression constructs having engineered promoters and methods of use thereof |
EP4017252A1 (en) | 2019-08-23 | 2022-06-29 | Pioneer Hi-Bred International, Inc. | Methods of identifying, selecting, and producing anthracnose stalk rot resistant crops |
CN111153974A (en) | 2020-01-15 | 2020-05-15 | 华中农业大学 | Corn disease-resistant gene and molecular marker and application thereof |
CN116096901A (en) | 2020-04-15 | 2023-05-09 | 先锋国际良种公司 | Plant pathogen effector and disease resistance gene identification, compositions and methods of use |
CN111534538B (en) * | 2020-05-11 | 2022-02-01 | 山西大学 | Method for rapidly screening non-transgenic site-directed mutant plants |
WO2022015619A2 (en) | 2020-07-14 | 2022-01-20 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
CN113046373A (en) * | 2021-03-24 | 2021-06-29 | 中国热带农业科学院热带生物技术研究所 | Sugarcane UDP-glycosyltransferase gene ShUGT2 and application thereof |
CN115216554A (en) | 2021-04-16 | 2022-10-21 | 华中农业大学 | Plant pathogen effector and disease resistance gene identification, compositions, and methods of use |
CN113462690B (en) * | 2021-07-02 | 2022-06-21 | 河南大学 | Application of soybean gene promoters pRPS28 and pRPS28-I in soybeans, arabidopsis thaliana and tobaccos |
AU2022324708A1 (en) | 2021-08-06 | 2024-02-15 | Kws Vegetables B.V. | Durable downy mildew resistance in spinach |
CN114875025B (en) * | 2022-03-25 | 2023-09-19 | 广东省科学院南繁种业研究所 | Drought and ABA inducible promoter P SCBV-YZ2060 And applications thereof |
WO2023250325A1 (en) * | 2022-06-21 | 2023-12-28 | The Regents Of The University Of California | Compositions and methods for treating huntington's disease |
WO2023250324A2 (en) * | 2022-06-21 | 2023-12-28 | The Regents Of The University Of California | Compositions and methods for reducing rna levels |
CN116970070A (en) * | 2023-08-02 | 2023-10-31 | 湖南诺合新生物科技有限公司 | Plant expression method of III type human collagen, coding gene containing intron and application thereof |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1134285A1 (en) * | 1999-09-27 | 2001-09-19 | Japan Tobacco Inc. | Nucleic acid fragment, recombinant vector containing the same and method of promoting the expression of structural gene by using the same |
Family Cites Families (97)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US695A (en) | 1838-04-16 | Improved water-wheel | ||
US5527A (en) | 1848-04-25 | Ship's windlass | |
EP0131623B2 (en) | 1983-01-17 | 1999-07-28 | Monsanto Company | Chimeric genes suitable for expression in plant cells |
US5352605A (en) | 1983-01-17 | 1994-10-04 | Monsanto Company | Chimeric genes for transforming plant cells using viral promoters |
NL8300698A (en) | 1983-02-24 | 1984-09-17 | Univ Leiden | METHOD FOR BUILDING FOREIGN DNA INTO THE NAME OF DIABIC LOBAL PLANTS; AGROBACTERIUM TUMEFACIENS BACTERIA AND METHOD FOR PRODUCTION THEREOF; PLANTS AND PLANT CELLS WITH CHANGED GENETIC PROPERTIES; PROCESS FOR PREPARING CHEMICAL AND / OR PHARMACEUTICAL PRODUCTS. |
US5504200A (en) | 1983-04-15 | 1996-04-02 | Mycogen Plant Science, Inc. | Plant gene expression |
US5420034A (en) | 1986-07-31 | 1995-05-30 | Calgene, Inc. | Seed-specific transcriptional regulation |
DK162399C (en) | 1986-01-28 | 1992-03-23 | Danisco | PROCEDURE FOR EXPRESSION OF GENES IN BELGIUM PLANT CELLS, DNA FRAGMENT, RECOMBINED DNA FRAGMENT AND PLASMID FOR USE IN EXERCISING THE PROCEDURE |
US4962028A (en) | 1986-07-09 | 1990-10-09 | Dna Plant Technology Corporation | Plant promotors |
US5290924A (en) | 1987-02-06 | 1994-03-01 | Last David I | Recombinant promoter for gene expression in monocotyledonous plants |
US5202231A (en) | 1987-04-01 | 1993-04-13 | Drmanac Radoje T | Method of sequencing of genomes by hybridization of oligonucleotide probes |
US5525464A (en) | 1987-04-01 | 1996-06-11 | Hyseq, Inc. | Method of sequencing by hybridization of oligonucleotide probes |
EP0397687B1 (en) | 1987-12-21 | 1994-05-11 | The University Of Toledo | Agrobacterium mediated transformation of germinating plant seeds |
US5614395A (en) | 1988-03-08 | 1997-03-25 | Ciba-Geigy Corporation | Chemically regulatable and anti-pathogenic DNA sequences and uses thereof |
NZ228320A (en) | 1988-03-29 | 1991-06-25 | Du Pont | Nucleic acid promoter fragments of the promoter region homologous to the em gene of wheat, dna constructs therefrom and plants thereof |
CA1341467C (en) | 1988-07-29 | 2004-12-07 | John C. Rogers | Producing commercially valuable polypeptides with genetically transformed endosperm tissue |
DE3843628A1 (en) | 1988-12-21 | 1990-07-05 | Inst Genbiologische Forschung | Wound-inducible and potato-tuber-specific transcriptional regulation |
US5364780A (en) | 1989-03-17 | 1994-11-15 | E. I. Du Pont De Nemours And Company | External regulation of gene expression by inducible promoters |
US5501967A (en) | 1989-07-26 | 1996-03-26 | Mogen International, N.V./Rijksuniversiteit Te Leiden | Process for the site-directed integration of DNA into the genome of plants |
AU644097B2 (en) | 1989-08-09 | 1993-12-02 | Monsanto Technology, Llc | Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof |
US5322783A (en) | 1989-10-17 | 1994-06-21 | Pioneer Hi-Bred International, Inc. | Soybean transformation by microparticle bombardment |
ES2163392T3 (en) | 1990-03-16 | 2002-02-01 | Calgene Llc | NEW SEQUENCES EXPRESSED PREFERREDLY IN THE DEVELOPMENT OF PRECED SEEDS, AND PROCEDURES RELATED TO THEMSELVES. |
EP0459643B1 (en) * | 1990-05-18 | 2000-08-16 | Mycogen Plant Science, Inc. | A recombinant promoter for gene expression in monocotyledonous plants |
US5187267A (en) | 1990-06-19 | 1993-02-16 | Calgene, Inc. | Plant proteins, promoters, coding sequences and use |
DK152291D0 (en) | 1991-08-28 | 1991-08-28 | Danisco | PROCEDURE AND CHEMICAL RELATIONS |
AU683011B2 (en) | 1992-01-13 | 1997-10-30 | Duke University | Enzymatic RNA molecules |
IL101119A0 (en) | 1992-03-03 | 1992-11-15 | Univ Ramot | Transgenic wheat |
DE4208050A1 (en) | 1992-03-13 | 1993-09-23 | Bayer Ag | AZOLYL METHYL FLUORCYCLOPROPYL DERIVATIVES |
CA2092069A1 (en) | 1992-03-27 | 1993-09-28 | Asako Iida | An expression plasmid for seeds |
DE69331055T2 (en) | 1992-04-13 | 2002-06-20 | Syngenta Ltd., Haselmere | DNA CONSTRUCTIONS AND PLANTS CONTAINING THEM |
AU4539593A (en) | 1992-06-23 | 1994-01-24 | South Dakota State University | Transformation of plants by direct injection of dna |
EP1983056A1 (en) | 1992-07-07 | 2008-10-22 | Japan Tobacco Inc. | Method for transforming monocotyledons |
JPH0662870A (en) | 1992-08-18 | 1994-03-08 | Mitsui Giyousai Shokubutsu Bio Kenkyusho:Kk | Promoter region of soybean phosphoenolpyruvate carboxylase gene and 5'-nontranslating region |
WO1994012015A1 (en) | 1992-11-30 | 1994-06-09 | Chua Nam Hai | Expression motifs that confer tissue- and developmental-specific expression in plants |
US5527695A (en) | 1993-01-29 | 1996-06-18 | Purdue Research Foundation | Controlled modification of eukaryotic genomes |
CN1061376C (en) | 1993-11-19 | 2001-01-31 | 生物技术研究及发展有限公司 | Chimeric regulatory regions and gene cassettes for expression of genes in plants |
GB9324707D0 (en) | 1993-12-02 | 1994-01-19 | Olsen Odd Arne | Promoter |
CN1048254C (en) | 1993-12-09 | 2000-01-12 | 托马斯杰弗逊大学 | Compounds and methods for site-directed mutations in eukaryotic cells |
US5576198A (en) | 1993-12-14 | 1996-11-19 | Calgene, Inc. | Controlled expression of transgenic constructs in plant plastids |
BR9505691A (en) | 1994-01-21 | 1996-01-16 | Agracetus | Gas powered instrument for transporting people |
GB9403512D0 (en) | 1994-02-24 | 1994-04-13 | Olsen Odd Arne | Promoter |
GB9421286D0 (en) | 1994-10-21 | 1994-12-07 | Danisco | Promoter |
US5689040A (en) | 1995-02-23 | 1997-11-18 | The Regents Of The University Of California | Plant promoter sequences useful for gene expression in seeds and seedlings |
DE69637041D1 (en) * | 1995-03-29 | 2007-06-06 | Japan Tobacco Inc | DNA FRAGMENT, THIS CONTAINING RECOMBINANT VECTOR AND METHOD FOR EXPRESSING FOREIGN GENES USING THEREOF |
EP0781849B1 (en) | 1995-07-05 | 2007-09-19 | Sapporo Breweries Ltd. | Method employing tissue specific promoter |
GB9516241D0 (en) | 1995-08-08 | 1995-10-11 | Zeneca Ltd | Dna constructs |
GEP20012558B (en) | 1995-08-10 | 2001-10-25 | Rutgers Univ | Nuclear-Encoded Transcription System in Plastids of Higher Plants |
JP2000507804A (en) | 1995-08-30 | 2000-06-27 | マックス−プランク−ゲゼルシャフト・ツア・フェルデルング・デア・ヴィッセンシャフテン・アインゲトラーゲナー・フェアアイン | Stimulation of homologous recombination in eukaryotes or cells by recombination promoting enzymes |
US5879941A (en) | 1995-10-06 | 1999-03-09 | University Of Florida | Polypeptides and polynucleotides relating to the α-and β-subunits of a glutamate dehydrogenase and methods of use |
AU724041B2 (en) | 1995-10-12 | 2000-09-07 | Cornell Research Foundation Inc. | Production of water stress or salt stress tolerant transgenic cereal plants |
AR006928A1 (en) | 1996-05-01 | 1999-09-29 | Pioneer Hi Bred Int | AN ISOLATED DNA MOLECULA CODING A GREEN FLUORESCENT PROTEIN AS A TRACEABLE MARKER FOR TRANSFORMATION OF PLANTS, A METHOD FOR THE PRODUCTION OF TRANSGENIC PLANTS, A VECTOR OF EXPRESSION, A TRANSGENIC PLANT AND CELLS OF SUCH PLANTS. |
IN1997CH00924A (en) | 1996-05-03 | 2005-03-04 | Syngenta Mogen Bv | Regulating metabolism by modifying the level of trehalose-6-phosphate |
DE19619353A1 (en) | 1996-05-14 | 1997-11-20 | Bosch Gmbh Robert | Method for producing an integrated optical waveguide component and arrangement |
CN1208437A (en) | 1996-06-21 | 1999-02-17 | 孟山都公司 | Methods for the production of stably-transformed, fertile wheat employing agrobacterium-mediated transformation and compositions derived therefrom |
US5981841A (en) | 1996-08-30 | 1999-11-09 | Monsanto Company | Early seed 5' regulatory sequence |
US6420629B1 (en) | 1996-09-09 | 2002-07-16 | B.C. Research Inc. | Process of increasing plant growth and yield and modifying cellulose production in plants |
DE19644478A1 (en) | 1996-10-25 | 1998-04-30 | Basf Ag | Leaf-specific expression of genes in transgenic plants |
WO1998026045A1 (en) | 1996-12-13 | 1998-06-18 | The General Hospital Corporation | Stress-protected transgenic plants |
US6309824B1 (en) | 1997-01-16 | 2001-10-30 | Hyseq, Inc. | Methods for analyzing a target nucleic acid using immobilized heterogeneous mixtures of oligonucleotide probes |
US5977436A (en) | 1997-04-09 | 1999-11-02 | Rhone Poulenc Agrochimie | Oleosin 5' regulatory region for the modification of plant seed lipid composition |
EP0870836A1 (en) | 1997-04-09 | 1998-10-14 | IPK Gatersleben | 2-Deoxyglucose-6-Phosphate (2-DOG-6-P) Phosphatase DNA sequences for use as selectionmarker in plants |
WO1998050561A1 (en) | 1997-05-02 | 1998-11-12 | Mogen International N.V. | Regulating metabolism by modifying the level of trehalose-6-phosphate by inhibiting endogenous trehalase levels |
ZA986308B (en) | 1997-07-18 | 1999-11-24 | Pioneer Hi Bred Int | The induction of stress-related factors in plants. |
WO1999005902A1 (en) | 1997-07-30 | 1999-02-11 | Purdue Research Foundation | Transgenic plants tolerant of salinity stress |
WO1999006580A2 (en) | 1997-08-01 | 1999-02-11 | Performance Plants, Inc. | Stress tolerance and delayed senescence in plants |
ATE346944T1 (en) | 1997-09-30 | 2006-12-15 | Univ California | PRODUCTION OF PROTEINS IN PLANT SEEDS |
ZA989497B (en) | 1997-10-20 | 2000-04-19 | Roche Diagnostics Gmbh | Positive-negative selection in homologous recombination. |
US6506559B1 (en) | 1997-12-23 | 2003-01-14 | Carnegie Institute Of Washington | Genetic inhibition by double-stranded RNA |
EP1062351B1 (en) | 1998-03-11 | 2006-05-10 | Syngenta Participations AG | Novel plant plastid promoter sequence |
JP5015373B2 (en) | 1998-04-08 | 2012-08-29 | コモンウェルス サイエンティフィック アンド インダストリアル リサーチ オーガニゼイション | Methods and means for obtaining an improved phenotype |
US6429293B1 (en) | 1998-06-26 | 2002-08-06 | Hsc Research And Development Limited Partnership | Sculpin-type antifreeze polypeptides and nucleic acids |
US6555732B1 (en) | 1998-09-14 | 2003-04-29 | Pioneer Hi-Bred International, Inc. | Rac-like genes and methods of use |
TR200200972T2 (en) | 1998-11-03 | 2002-07-22 | Basf Aktiengesellschaft | Substituted 2-phenylbenzimidazoles, their production and uses |
DE19852195C2 (en) | 1998-11-04 | 2000-11-02 | Inst Pflanzengenetik & Kultur | New expression cassette for the expression of any genes in plant seeds |
HUP0104129A3 (en) | 1998-11-17 | 2002-09-30 | Basf Ag | Use of 2-phenylbenzimidazoles and 2-phenylindoles for producing pharmaceutical compositions having parp enzym effect and new 2-phenylbenzimidazol derivatives |
DK1133477T3 (en) | 1998-11-27 | 2004-06-21 | Abbott Gmbh & Co Kg | Substituted benzimidazoles and their use as pair inhibitors |
CA2361201A1 (en) | 1999-01-28 | 2000-08-03 | Medical College Of Georgia Research Institute, Inc. | Composition and method for in vivo and in vitro attenuation of gene expression using double stranded rna |
DE19956568A1 (en) | 1999-01-30 | 2000-08-17 | Roland Kreutzer | Method and medicament for inhibiting the expression of a given gene |
WO2000049035A1 (en) | 1999-02-19 | 2000-08-24 | The General Hospital Corporation | Gene silencing |
US6537753B1 (en) | 1999-02-23 | 2003-03-25 | Health Research Incorporated | CaESS1: a Candida albicans gene, methods for making and using, and targeting it and its expression products for antifungal applications |
JP2002541853A (en) | 1999-04-19 | 2002-12-10 | ロビオ | Plant transformation method |
CA2370628A1 (en) | 1999-04-21 | 2000-10-26 | American Home Products Corporation | Methods and compositions for inhibiting the function of polynucleotide sequences |
DE19918211A1 (en) | 1999-04-22 | 2000-10-26 | Basf Ag | New 2-carbocyclyl-benzimidazole-carboxamide derivatives, are PARP inhibitors useful e.g. for treating neurodegenerative disease, epilepsy, ischemia, tumors, inflammation or diabetes |
DE19920936A1 (en) | 1999-05-07 | 2000-11-09 | Basf Ag | Heterocyclically substituted benzimidazoles, their preparation and use |
CN100420748C (en) | 1999-05-10 | 2008-09-24 | 辛根塔参与股份公司 | Regulation of viral gene expression |
DE19921567A1 (en) | 1999-05-11 | 2000-11-16 | Basf Ag | Use of phthalazine derivatives |
JP2003505076A (en) | 1999-07-23 | 2003-02-12 | ウィスコンシン・アラムナイ・リサーチ・ファウンデイション | Arabidopsis thaliana cyclic nucleotide gated ion channel / DND gene; regulator of plant disease resistance and cell death |
ATE309381T1 (en) | 1999-08-26 | 2005-11-15 | PLANT GENE EXPRESSION UNDER THE CONTROL OF CONSTITUTIVE PLANT V-ATPASE PROMOTERS | |
JP2003510328A (en) | 1999-09-28 | 2003-03-18 | ビーエーエスエフ アクチェンゲゼルシャフト | Azepinoindole derivatives, their preparation and use |
DE19946289A1 (en) | 1999-09-28 | 2001-03-29 | Basf Ag | Benzodiazepine derivatives, their production and use |
WO2001038504A2 (en) | 1999-11-23 | 2001-05-31 | Maxygen, Inc. | Homologous recombination in plants |
US6569681B1 (en) | 2000-03-14 | 2003-05-27 | Transkaryotic Therapies, Inc. | Methods of improving homologous recombination |
EP1149915A1 (en) | 2000-04-28 | 2001-10-31 | Frommer, Wolf-Bernd | Modification of gene expression in transgenic plants |
CA2413425A1 (en) | 2000-06-28 | 2002-12-19 | Sungene Gmbh & Co. Kgaa | Binary vectors for improved transformation of plant systems |
AU2001288765B2 (en) | 2000-09-05 | 2006-04-27 | Board Of Control Of Michigan Technological University | Genetic engineering of syringyl-enriched lignin in plants |
GB0201043D0 (en) | 2002-01-17 | 2002-03-06 | Swetree Genomics Ab | Plants methods and means |
GB0201686D0 (en) * | 2002-01-25 | 2002-03-13 | Dca Design Consultants Ltd | Improvements in and relating to a medicament delivery device |
2006
- 2006-03-07: EP application EP08168818A, publication EP2045327B8; not active (Not-in-force)
- 2006-03-07: WO application PCT/EP2006/060513, publication WO2006094976A2; active (Application Filing)
- 2006-03-07: EP application EP06708664A, publication EP1859037A2; not active (Withdrawn)
- 2006-03-07: CN application CN201210405556XA, publication CN102925479A; active (Pending)
- 2006-03-07: AU application AU2006222012A, publication AU2006222012B2; not active (Ceased)
- 2006-03-07: EP application EP09176172A, publication EP2169058B1; not active (Not-in-force)
- 2006-03-07: AT application AT08168818T, publication ATE541043T1; active
- 2006-03-07: EP application EP09176171A, publication EP2166101B1; not active (Not-in-force)
- 2006-03-07: EP application EP09176173A, publication EP2166102B1; not active (Not-in-force)
- 2006-03-07: BR application BRPI0609283-7A, publication BRPI0609283A2; not active (IP Right Cessation)
- 2006-03-07: CA application CA002599405A, publication CA2599405A1; not active (Abandoned)
- 2006-03-07: US application US11/885,988, publication US8088971B2; not active (Expired - Fee Related)
- 2006-03-07: CN application CN2006800076447A, publication CN101137752B; not active (Expired - Fee Related)
- 2006-03-07: EP application EP09176168A, publication EP2166099B1; not active (Not-in-force)
- 2006-03-07: EP application EP09176170A, publication EP2166100B1; not active (Not-in-force)

2007
- 2007-10-05: ZA application ZA200708512A, publication ZA200708512B; status unknown

2011
- 2011-09-23: US application US13/241,493, publication US8759506B2; not active (Expired - Fee Related)

2014
- 2014-05-06: US application US14/270,814, publication US20140237683A1; not active (Abandoned)
Non-Patent Citations (11)
Title |
---|
BRENDEL V ET AL: "Prediction of locally optimal splice sites in plant pre-mRNA with applications to gene identification in Arabidopsis thaliana genomic DNA." NUCLEIC ACIDS RESEARCH. 15 OCT 1998, vol. 26, no. 20, 15 October 1998 (1998-10-15), pages 4748-4757, XP002387950 ISSN: 0305-1048 * |
BRENDEL VOLKER ET AL: "Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus." BIOINFORMATICS (OXFORD, ENGLAND) 1 MAY 2004, vol. 20, no. 7, 1 May 2004 (2004-05-01), pages 1157-1169, XP002387953 ISSN: 1367-4803 * |
LUEHRSEN K R ET AL: "Intron enhancement of gene expression and the splicing efficiency of introns in maize cells." MOLECULAR & GENERAL GENETICS : MGG. JAN 1991, vol. 225, no. 1, January 1991 (1991-01), pages 81-93, XP001083782 ISSN: 0026-8925 -& DENNIS E S ET AL: "Molecular analysis of the alcohol dehydrogenase (Adh1) gene of maize." NUCLEIC ACIDS RESEARCH. 11 MAY 1984, vol. 12, no. 9, 11 May 1984 (1984-05-11), pages 3983-4000, XP002387949 ISSN: 0305-1048 * |
MASCARENHAS D ET AL: "INTRON-MEDIATED ENHANCEMENT OF HETEROLOGOUS GENE EXPRESSION IN MAIZE" PLANT MOLECULAR BIOLOGY, SPRINGER, DORDRECHT, NL, vol. 15, no. 6, 1990, pages 913-920, XP002933738 ISSN: 0167-4412 cited in the application * |
PAVY NATHALIE ET AL: "Evaluation of gene prediction software using a genomic data set: Application to Arabidopsis thaliana sequences" BIOINFORMATICS (OXFORD), vol. 15, no. 11, November 1999 (1999-11), pages 887-899, XP002387951 ISSN: 1367-4803 * |
PERTEA M ET AL: "GeneSplicer: a new computational method for splice site prediction." NUCLEIC ACIDS RESEARCH. 1 MAR 2001, vol. 29, no. 5, 1 March 2001 (2001-03-01), pages 1185-1190, XP002387952 ISSN: 1362-4962 * |
RETHMEIER N ET AL: "Intron-mediated enhancement of transgene expression in maize is a nuclear, gene-dependent process." THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY. OCT 1997, vol. 12, no. 4, October 1997 (1997-10), pages 895-899, XP002387946 ISSN: 0960-7412 & DATABASE EMBL EBI, Hinxton, UK; 1 September 1993 (1993-09-01), GARCIA A. ET AL.: "O. sativa salT gene" Database accession no. Z25811 * |
ROSE A B ET AL: "Intron-mediated enhancement of gene expression independent of unique intron sequences and splicing." PLANT PHYSIOLOGY. FEB 2000, vol. 122, no. 2, February 2000 (2000-02), pages 535-542, XP002387947 ISSN: 0032-0889 cited in the application * |
ROSE ALAN B: "Requirements for intron-mediated enhancement of gene expression in Arabidopsis." RNA (NEW YORK, N.Y.) NOV 2002, vol. 8, no. 11, November 2002 (2002-11), pages 1444-1453, XP002329166 ISSN: 1355-8382 * |
See also references of EP1859037A2 * |
ZANOR M I ET AL: "Isolation and expression of a barley beta-1,3-glucanase isoenzyme II gene." DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING. 2000, vol. 10, no. 6, 2000, pages 395-398, XP008065988 ISSN: 1042-5179 * |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8129588B2 (en) | 2004-04-20 | 2012-03-06 | Syngenta Participations Ag | Regulatory sequences for expressing gene products in plant reproductive tissue |
US8597913B2 (en) | 2004-04-20 | 2013-12-03 | Syngenta Participations Ag | Method of constructing an expression cassette comprising regulatory sequences of a target gene of a plant for expressing gene products |
US8679844B2 (en) | 2004-04-20 | 2014-03-25 | Syngenta Participations Ag | MADS gene regulatory sequences for expressing gene products in plant reproductive tissue |
US8673631B2 (en) | 2006-02-17 | 2014-03-18 | Monsanto Technology Llc | Chimeric regulatory sequences comprising introns for plant gene expression |
WO2008099013A1 (en) | 2007-02-16 | 2008-08-21 | Basf Plant Science Gmbh | Nucleic acid sequences for regulation of embryo-specific expression in monocotyledonous plants |
AU2008214568B2 (en) * | 2007-02-16 | 2013-04-18 | Basf Plant Science Gmbh | Nucleic acid sequences for regulation of embryo-specific expression in monocotyledonous plants |
WO2010122110A1 (en) | 2009-04-22 | 2010-10-28 | Basf Plant Science Company Gmbh | Whole seed specific promoter |
DE112010003162T5 (en) | 2009-04-22 | 2012-08-16 | Basf Plant Science Company Gmbh | Total seed-specific promoter |
EP3760726A1 (en) * | 2010-01-14 | 2021-01-06 | Monsanto Technology LLC | Plant regulatory elements and uses thereof |
US11981902B2 (en) | 2010-01-14 | 2024-05-14 | Monsanto Technology Llc | Plant regulatory elements and uses thereof |
US20130145502A1 (en) * | 2010-06-09 | 2013-06-06 | Pioneer Hi-Bred International, Inc. | Regulatory sequences for modulating transgene expression in plants |
US11242535B2 (en) | 2010-06-09 | 2022-02-08 | E. I. Du Pont De Nemours And Company | Regulatory sequences for modulating transgene expression in plants |
WO2012127373A1 (en) | 2011-03-18 | 2012-09-27 | Basf Plant Science Company Gmbh | Promoters for regulating expression in plants |
EP3434781A2 (en) | 2011-03-18 | 2019-01-30 | Basf Plant Science Company GmbH | Promoters for regulating expression in plants |
EP3342859A1 (en) | 2011-03-18 | 2018-07-04 | Basf Plant Science Company GmbH | Promoters for regulating expression in plants |
US9862977B2 (en) | 2011-10-19 | 2018-01-09 | Massachusetts Institute Of Technology | Engineered microbes and methods for microbial oil production |
WO2021048316A1 (en) * | 2019-09-12 | 2021-03-18 | Basf Se | Regulatory nucleic acid molecules for enhancing gene expression in plants |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8088971B2 (en) | Expression enhancing intron sequences | |
US8884098B2 (en) | Expression cassettes for regulation of expression in monocotyledonous plants | |
EP1723241B1 (en) | Transgenic expression constructs for vegetative plant tissue specific expression of nucleic acids | |
AU2011202966B2 (en) | Expression enhancing intron sequences | |
CA2521207C (en) | Expression cassettes for guard cell-specific expression in plants | |
CA2524565A1 (en) | Expression cassettes for mesophyll- and/or epidermis-preferential expression in plants | |
CA2602156A1 (en) | Expression cassettes for seed-preferential expression in plants | |
CA2526304A1 (en) | Expression cassettes for vascular tissue-preferential expression in plants | |
JP2004049144A (en) | Infection-inducing pal gene promoter |
Legal Events
Code | Title | Description |
---|---|---|
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (PCT application filed from 20040101) | |
WWE | WIPO information: entry into national phase | Ref document number: 2006708664; Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 2599405; Country of ref document: CA |
WWE | WIPO information: entry into national phase | Ref document number: 12007501922; Country of ref document: PH |
WWE | WIPO information: entry into national phase | Ref document number: 2006222012; Country of ref document: AU |
WWE | WIPO information: entry into national phase | Ref document number: 200680007644.7; Country of ref document: CN |
NENP | Non-entry into the national phase | Ref country code: DE |
ENP | Entry into the national phase | Ref document number: 2006222012; Country of ref document: AU; Date of ref document: 20060307; Kind code of ref document: A |
WWP | WIPO information: published in national office | Ref document number: 2006222012; Country of ref document: AU |
NENP | Non-entry into the national phase | Ref country code: RU |
WWE | WIPO information: entry into national phase | Ref document number: 4448/CHENP/2007; Country of ref document: IN |
WWW | WIPO information: withdrawn in national office | Ref document number: RU |
WWP | WIPO information: published in national office | Ref document number: 2006708664; Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 11885988; Country of ref document: US |
ENP | Entry into the national phase | Ref document number: PI0609283; Country of ref document: BR; Kind code of ref document: A2; Effective date: 20070906 |