WO2001000858A1 - Dna sequences specific to rice centromeres - Google Patents

Dna sequences specific to rice centromeres Download PDF

Info

Publication number
WO2001000858A1
WO2001000858A1 PCT/US2000/017535 US0017535W WO0100858A1 WO 2001000858 A1 WO2001000858 A1 WO 2001000858A1 US 0017535 W US0017535 W US 0017535W WO 0100858 A1 WO0100858 A1 WO 0100858A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
dna
plant
sequence
yeast
Prior art date
Application number
PCT/US2000/017535
Other languages
French (fr)
Inventor
Jiming Jiang
Fenggao Dong
Original Assignee
Wisconsin Alumni Research Foundation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wisconsin Alumni Research Foundation filed Critical Wisconsin Alumni Research Foundation
Priority to AU64888/00A priority Critical patent/AU6488800A/en
Publication of WO2001000858A1 publication Critical patent/WO2001000858A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)

Definitions

  • the present invention relates generally to molecular biology.
  • the present invention relates to nucleic acid sequences which encode a centromere from rice.
  • centromeres Among the most distinguishing and characteristic landmarks of chromosomes of higher eukaryotes is the location of the centromere.
  • the centromeric region is the site for mitoitic and meiotic spindle fiber attachment and is responsible for sister chromatid association.
  • the molecular organization of centromeres has been studied extensively in yeast, Drosophila melanogaster, and humans.
  • Centromeric regions usually consist of heterochromation and are thought to be highly methylated. Miller, J.T., et al., Theor. Appl. Genet., 96:832-839 (1998).
  • centromeres show varying amounts of nontranscribed repetitive sequences, which are referred to as satellite DNAs. Haaf, T., et al., Cell, 70:681-695 (1992).
  • the predominant class of centromeric DNA is the alpha-satellite DNA, which is found in diverged form in all centromeres. Id. To the extent that it is known, alpha-satellite arrays appear to be uninterrupted by other (nonsatellite) DNA sequences.
  • centromeres naturally occurring satellite arrays range in size from several hundred kbs to several megabases in length. Recent studies, however, suggest that as little as 140 kb of alpha satellite DNA may be sufficient to confer centromere function in human cells. Harrington, John, O 01/00858
  • centromeric DNAs originated from the budding yeast S. cerevisiae (See Clarke and Carbon, Ann. Rev. Genet. 19:29-56 (1985)).
  • the DNA region required for centromere function in S. cerevisiae is approximately 120 base pair (hereinafter "bp") long and is composed of three conserved domains: CDEI, an 8 bp element (A/G)TCAC(A/G)TG), CDEII, an extremely (about90%) AT-rich region of approximately 80 bp, and CDEIII, a 25 bp element (TGTTT(A/T)TGNTTTCCGAAANNNNAAA).
  • centromeric DNAs from the fission yeast Schizosaccharomyces pombe have also been characterized.
  • Several classes of 5. pombe moderately repeated DNA elements have been identified which are found only in the centromere regions. These centromere-specific repetitive elements have been designated dg (3.8 kb), dh (4 kb), and yn by Yanagida and co-workers (Nakaseko et al., Embo. J. 5..T011-1021 (1986); Nakaseko et al., Nuc. Acid Res.
  • the dg element has an AT-rich region and a 600 bp domain containing numerous small direct repeat motifs. Similarly, the dh element has an overall AT content approaching 70% and contains many short direct repeats. No nucleotide similarities to the S. cerevisiae CDEs have been found in the S. pombe elements. Attempts to demonstrate that the S.
  • pombe centromere-specific repetitive elements can function individually as centromeres have been unsuccessful.
  • large restriction fragments 65 to 150 kb
  • fission yeast centromere regions of chromosome 1 or 3 function as centromeres when introduced into acentric episomes (Hahnenberger et al.. PNAS USA 86:577-581 (1989)).
  • a 180-bp tandem repeat (pALl family) is the major component of the centromeric region of Arabidopsis thaliana chromosomes. The genomic organization of this repeat family shares similarities to the alpha satellite DNA at the human centromeres (see Martinez-Sapater, J., et al., Mol. Gen. Genet., 204:417-423 (1986);
  • chromosomes are man-made linear or circular DNA molecules constructed from essential cis-acting DNA sequence elements that are responsible for the proper replication and partitioning of natural chromosomes (see. Murray et al., Nature, 305:189-193 (1983)).
  • ARS Autonomous Replication Sequences
  • centromeres site of kinetochore assembly and responsible for proper distribution of replicated chromosomes at mitosis and meiosis
  • telomeres specialized structures at the ends of linear chromosomes that function to stabilize the ends and facilitate the complete replication of the extreme termini of the DNA molecule.
  • the present invention relates to isolated and purified nucleic acids having the nucleotide sequences shown in: SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 SEQ ID NO:6 and SEQ ID NO:7.
  • the present invention also relates to a recombinant DNA construct which contains a rice centromere.
  • the centromere contains a number of highly repetitive regions of DNA that have the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or combinations thereof.
  • the recombinant DNA construct may also contain a yeast autonomous replication sequence, an autonomous replication sequence from a higher eukaryotic organism, a yeast telomere sequence or a telomere sequence from a higher eukaryotic organism and a selectable marker gene.
  • the present invention also relates to a plasmid containing the hereinbefore described DNA construct.
  • This plasmid may contain an origin of replication and a selectable marker which functions in bacteria (such as E. coli) or in yeast (such as S. cerevisiae).
  • the present invention relates to a plant artificial chromosome vector.
  • the plant artificial chromosome vector of the present invention contains an autonomous replication sequence, two telomere sequences, a centromere sequence having the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or a combination thereof, and at least one selectable marker sequence.
  • the autonomous replication sequence may be from yeast or from a higher eukaryotic organism and the telomere sequence may from yeast or from a higher eukaryotic organism, such as, but not limited to, Arabidopsis thaliana.
  • the present invention also relates to a plant cell transformed with the plant artificial chromosome vector hereinbefore described and to transgenic plants containing said plant cell.
  • the plant cell and plant may be from Oryza sativa.
  • the present invention relates to a method of identifying centromeric DNA in a higher eukaryotic organism.
  • the method involves hybridizing an isolated nucleic acid selected from the group consisting of SEQ ID NO:l. SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO: 6 and SEQ ID NO: 7 and combinations thereof with a sample of DNA from a higher eukaryotic organism and then identifying and isolating the centromeric DNA from said sample.
  • FIG. 1 shows the nucleotide sequence of pSau3A9.
  • FIG. 2A - FIG. 2N shows the FISH analysis of rice centromeric DNA elements.
  • the probes were biotinylated and hybridized in situ to rice chromosomes or DNA fibers.
  • the probes were detected by fluorescein isothiocyanate-conjugated antibodies (green color) and the chromosomes were stained with propidium iodide (red color).
  • Probes pRCSl hybridized exclusively to the centromeric regions of the chromosomes from rice (FIG. 2A), rye (FIG.2B), barley (FIG. 2C), sorghum (FIG. 2D), and maize (FIG. 2E).
  • FISH signals also were detected in the centromeric regions of the acrocentric B chromosomes (see arrows) from rye (FIG. 2B) and maize (2E).
  • rice centromeric DNA families RCH2 (FIG. 2F), RCHl (FIG. 2G), RCH3 (FIG. 2H), RCE1 (FIG. 21), RCE2 (FIG. 2J), and RCS2 (FIG. 2K) all were located in the centromere of every rice chromosome.
  • Two pairs of chromosomes with the strongest signals are indicated by arrows and the third pair with the weakest signals by arrowheads (FIG. 2K).
  • the same metaphase cell FIG.
  • FIC. 3 shows a Southern blot of the genomic organization of the RCS1 family.
  • Rice genomic DNA was digested with Sau3 Al (lane 1), Dpnll (lane 2), H ⁇ elll (lane 3), Mspl (lane 4), Hpall (lane 5), Sail (lane 6), BamRl (lane 7), Dral (lane 8), EcoRl (lane 9), and Hindlll (lane 10) and probed with pRCS 1.
  • FIG 4 shows the nucleotide sequence of pRCS2.
  • the 639-bp insert of pRCS2 contains four copies of a tandemly arranged repeat.
  • the four members (A-D) range from 155 to 165 bp and share 84-91 % sequence identify with one another.
  • F represents the consensus sequence of the four members.
  • FIG. 5 shows a Southern blot of the genomic organization of the RCS2 family. Rice genomic DNA was digested with Dpnll (lane 1), Sau3 Al (lane 2), Mspl (lane 3), Hpall (lane 4). Sail (lane 5), and H ⁇ elll (lane 6), and probed with pRCS2.
  • FIG. 6A - FIG. 6B show a Southern blot of the conservation of the RCHl and RCE1 families in Gramineae species. Genomic DNA from sorghum (lane 1), maize (lane 2), sugar cane (lane 3), Ag. intermedium (lane 4), barley (lane 5), oats (lane 6), rye (lane 7), wheat (lane 8), Ae. Squarrosa (lane 9), rice (lane 10), bamboo (lane 1 1), Pharus sp. (lane 12), J. effusus (lane
  • FIG. 7 shows the nucleotide sequence of RCS1.
  • FIG. 8 shows the nucleotide sequence of RCS2.
  • FIG. 9 shows the nucleotide sequence of RCHl.
  • FIG. 10 shows the nucleotide sequence of RCH2.
  • FIG. 11 shows the nucleotide sequence of RCH3.
  • FIG. 12 shows the nucleotide sequence of RCE1.
  • FIG. 13 shows the nucleotide sequence of RCE2.
  • the present invention relates to cloned centromeric DNA from Oryza sativa (rice). More specifically, the inventors of the present invention have discovered that the cloned centromeric DNA of the present invention contains seven (7) different repetitive regions of complex DNA. These seven (7) repetitive regions are referred to herein as follows: RCSl, RCS2, RCHl. RCH2, RCH3, RCE1 and RCE2.
  • the present invention relates to isolated and purified nucleic acids for each of the seven (7) different repetitive regions of centromeric DNA from Oryza sativa.
  • the nucleic acids of the present invention encode a functional centromere from Oryza sativa.
  • the present invention further relates to the use of the nucleic acids of the present invention as primers and probes to identify centromeric DNA from other plants and animals.
  • the nucleic acid sequences disclosed herein can be used to create a plant artificial chromosome vector. Definitions
  • plant includes reference to whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds and plant cells and progeny thereof.
  • plant organs e.g., leaves, stems, roots, etc.
  • the class of plants which can be used in the methods of the present invention are generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledonous and dicotyledonous plants.
  • transformation means the acquisition in cells of new DNA sequences through incorporation of added DNA. This is the process by which naked DNA, DNA coated with protein, or whole artificial chromosomes are introduced into a cell, resulting in a heritable change.
  • the term "host” means any organism that is the recipient of a replicable plasmid or vector comprising a plant artificial chromosome.
  • host strains used for cloning are free of any restriction enzyme activity that might degrade the foreign DNA used.
  • Preferred examples of host cells for cloning which are useful in the present invention are bacteria, such as Escherichia coli, Bacillus subtilis, Pseudomonas, Streptomyces, Salmonella, and yeast cells such as S. cerevisiae.
  • Host cells which can be targeted for expression of a plant artificial chromosome may be plant cells of any source, such as, but not limited to, Arabidopsis, maize, rice, sugarcane, sorghum, barley, soybeans, tobacco, wheat, tomato, potato or citrus.
  • linker means a DNA molecule, generally up to 50 or 60 nucleotides long and synthesized chemically, or cloned from other vectors.
  • plasmid or "vector” (such as a cloning vector or expression vector) refers to a closed covalently circular extrachromosomal DNA or linear DNA which is able to autonomously replicate in a host cell and which is normally nonessential to the survival of the cell.
  • vector such as a cloning vector or expression vector
  • heterologous when used to describe nucleic acids or polypeptides refers to nucleic acids or polypeptides that originate from a foreign species, or, if from the same species, are substantially modified from their original form.
  • a promoter operably linked to a heterologous structural gene is from a species different from that from which the structural gene was derived, or, if from the same species, one or both are substantially modified from their original form.
  • isolated includes reference to material which is substantially or essentially free from components which normally accompany or interact with it as found in its naturally occurring environment.
  • the isolated material optionally comprises material not found with the material in its natural environment.
  • nucleic acid includes reference to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence includes the complementary sequence thereof.
  • operably linked includes reference to a functional linkage between a promoter and a second sequence, wherein the promoter sequence initiates and mediates transcription of the DNA sequence corresponding to the second sequence.
  • operably linked means that the nucleic acid sequences being linked are contiguous and, where necessary to joint two protein coding regions, contiguous and in the same reading frame.
  • recombinant includes reference to a cell, or nucleic acid, or vector, that has been modified by the introduction of a heterologous nucleic acid or the alteration of a native nucleic acid to a form not native to that cell, or that the cell is derived from a cell so modified.
  • recombinant cells express genes that are not found within the native (non- recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all.
  • a "recombinant DNA construct” is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements which permit transcription and translation of a particular nucleic acid in a target cell.
  • the DNA construct can be part of a plasmid, vector, virus, or nucleic acid fragment.
  • the recombinant DNA construct portion of the construct includes a nucleic acid to be transcribed and translated, and a promoter.
  • the recombinant DNA construct can be a plant artificial chromosome.
  • transgenic plant includes reference to a plant modified by introduction of a heterologous nucleic acid.
  • telomere refers to the end of a chromosome comprising a simple repeat DNA. The function of a telomere is to allow the ends of a linear DNA molecule to be replicated.
  • eukaryote refers to living organisms whose cells contain nuclei. A eukaryote may be distinguished from a "prokaryote” which is an organism which lacks nuclei. Prokaryotes and eukaryotes differ fundamentally in the way their genetic information is organized, as well as their patterns of RNA and protein synthesis.
  • lower eukaryote refers to a eukaryote characterized by a comparatively simple physiology and composition, and unicellularity. Examples of lower eukaryotes include flagellates, ciliates. and yeast.
  • higher eukaryote refers to a multicellular eukaryote, characterized by its greater complex physiological mechanisms as well as its ability to interact with its environment in a more sophisticated manner. Generally, more complex organisms such as plants and animals are included in this category.
  • Preferred higher eukaryotes to be transformed by the present invention include, for example, monocot and dicot angiosperm species, gymnosperm species, fern soecies, plant tissue culture cells of these species, and algal cells. It will of course be understood that prokaryotes and eukaryotes alike may be transformed by the methods of this invention.
  • a "selectable marker” is a gene whose presence results in a clear pheontype. and most often a growth advantage for cells that contain the marker. This growth advantage may be present under standard conditions, altered conditions such as elevated temperature, on in presence of certain chemicals such as herbicides or antibiotics.
  • selectable markers include the thymidine kinase gene, the cellular adenine- phosphoriboysltransferase gene and the dihydrylfolate reducast gene, hygromycin phosphotransferase genes, the bar gene and the neomycin phosphotransferase genes, among others.
  • Preferred selectable markers in the present invention include genes whose expression confer antibiotic or herbicide resistance to the host cell, sufficient to enable the maintenance of a vector with a host cell, and which facilitate the manipulation of a plasmid into new host cells.
  • nucleotide refers to one of the monomeric units from which DNA or
  • RN A polymers are constructed, consisting of a purine or pyrimidine base, a pentose, and a phosphoric acid group.
  • the nucleotides of DNA are deoxyadenylic acid, thymidylic acid, deoxyguanilic acid, and deoxycytidylic acid.
  • the corresponding nucleotides of RNA are adenylic acid, uridylic acid, guanylic acid, and cytidylic acid.
  • the present application also contains a sequence listing that contains 8 sequences.
  • the sequence listing contains nucleotide sequences.
  • the base pairs are represented by the following base codes:
  • the present invention relates to isolated and purified nucleic acids which encode a functional centromere from Oryza sativa.
  • a functional centromere refers to the centromere or chromosome site that directs or supports kinetechore formation.
  • the kinetochore is the physical structure that mediates the attachment of the spindle fibers to the chromosome and is therefore responsible for the proper partition of the chromosomes at mitosis and meiosis.
  • the nucleic acids of the present invention encode seven (7) different repetitive regions of centromeric DNA from Oryza sativa.
  • nucleic acids for such centromeres have the nucleotide sequences shown in SEQ ID NO: l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or combinations thereof.
  • SEQ ID NO: 1 is also referred to herein as RCSl .
  • SEQ ID NO:2 is also referred to herein as: RCS2.
  • SEQ ID NO:3 is also referred to herein as RCHl.
  • SEQ ID NO:4 is also referred to herein as RCH2.
  • SEQ ID NO:5 is referred to herein as RCH3.
  • SEQ ID NO:6 is referred to herein as RCEl .
  • SEQ ID NO:7 is referred to herein as RCE2.
  • the present invention also contemplates nucleic acids which hybridize under stringent hybridization conditions to the nucleotide sequences set forth above.
  • stringent conditions are selected to be about 5°C to about 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.
  • Tm is the temperature (under defined ionic strength and pH 7) at which 50% of the target sequence hybridizes to a perfectly matched probe.
  • stringent wash conditions are those in which the salt concentration is about 0.22 molar at pH 7 and the temperature is at least about 50°C.
  • nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if it encodes a substantially identical and functional centromere.
  • the present invention also contemplates naturally occurring allelic variations and mutations of the nucleotide sequences set forth above so long as those variations and mutations code, on expression, for a functional centromere.
  • nucleic acid sequences of the present invention can be used in marker-aided selection using techniques which are well-known in the art. Marker-aided selection does not require the complete sequence of the gene or precise knowledge of which sequence confers which specificity. Instead, partial sequences can be used as hybridization probes or as the basis for oligonucleotide primers to amplify by PCR or other methods to identify nucleic acids specific for functional centromeric DNA in other plants and animals.
  • the present invention relates to a plant artificial chromosome.
  • nucleic acid sequences of the present invention can be used to construct a plant artificial chromosome vector.
  • a plant artificial chromosome must contain the following essential elements: (1) autonomous replication sequences (hereinafter referred to as "ARS"), (2) a centromere which is functional in plants, and (3) telomeres which are functional in plants.
  • ARS autonomous replication sequences
  • centromere which is functional in plants
  • telomeres which are functional in plants.
  • ARSs have been isolated from the unicellular fungi Saccharomyces cerevisiae (brewer's yeast) and Schizosaccharomyces pombe (see Stinchcomb et al.. Nature 282:39- 43 (1979) and Hsiao et al, J. Proc. Natl. Acad. Sci. USA 76:3829-3833 (1979)).
  • ARSs behave like replication origins allowing DNA molecules that contain the ARS to be replicated as an episome after introduction into the cell nuclei of these fungi. Although plasmids containing these sequences replicate, they do not segregate properly.
  • U.S. Patent 5,270,201 discloses a method for isolating ARS sequences for use in higher eukaryotic organisms, by the formation of minichromosomes derivative of natural chromosomes. It has been demonstrated in yeast that inverted repeats of telomeric sequences are "resolved" by an unknown mechanism which results in a double-stranded cleavage between inverted repeats. After an inverted telomere repeat is introduced into a chromosome, a resolution reaction will lead to scission of the chromosome and formation of two chromosomal fragments, each with two telomeres.
  • a second approach for an obtaining ARS is also disclosed in the '201 Patent. This approach is referred to as a "shotgun cloning approach " .
  • Higher eukaryotic organisms have many replication origins distributed throughout their genomes. For example, the A. thaliana genome contains approximately 1000 origins spaced every 70 kb along the chromosome. Therefore, the shotgun cloning approach involves looking for random fragments of genomic
  • Autonomous replication sequences for use in the plant artificial chromosome of the present invention can be obtained using methods which are well known in the art. Autonomous replication sequences from yeast, such as those described above, can be used in the present invention. Moreover, ARS sequences from higher eukaryotic organisms obtained using the methods described in the '201 patent can also be used in plant artificial chromosome of the present invention.
  • Telomeres are believed to be involved in the priming of DNA replication at the chromosome end (see, Blackburn et al., Ann. Rev. Biochem. 53: 163-194 (1984)). This is because conventional DNA polymerases are template dependent, synthesize DNA in the 5' to 3' direction, and require an oligonucleotide primer to donate a 3' OH group. When this primer is removed, unreplicated single-stranded gaps arise; most of these gaps can be filled in by priming from 3'
  • telomeres must provide an alternative priming mechanism.
  • Telomeres are also responsible for the stability of chromosomal termini. Telomeres act as
  • caps suppressing the recombinogenic properties of free, unmodified DNA ends (see Blackburn et al., Ann. Rev. Biochem. 53:163-194 (1984)). This reduces the formation of damaged and rearranged chromosomes which arise as a consequence of recombination-mediated chromosome fusion events.
  • Telomeres may also contribute to the establishment or maintenance of intranuclear chromatin organization through their association with the nuclear envelope (see, for example, Fussell, C. P., Genetica 62:192-201 (1984)).
  • telomeres have been cloned from several lower eukaryotic organisms, principally protozoans and yeast.
  • the ends of the Tetrahymena linear DNA plasmid have been shown to function like a telomere on linear plasmids in S. cerevisiae (see Szostak, J. W., Cold Spring Harbor Symp. Quant. Biol. 47: 1 187-1 194 (1983)).
  • a telomere from the flagellate Trypanosoma has been cloned (see, for example, Blackburn et al., Cell 36:447-457 (1984)).
  • a yeast telomeric sequence has been identified (see, for example, Shampay et al., N ⁇ twre 310:154-157 (1984)).
  • U.S. Patent 5,270,201 disclose a method for obtaining a telomere from a higher eukaryotic organism, specifically, from Arabidopsis thaliana.
  • the telomeric sequences disclosed in the '201 Patent contain a tandem repeat of the sequence 5'-CCCTAAA-3.
  • telomeric sequence which produces a telomere which is functional in plants can be inserted into the plant artificial chromosome of the present invention.
  • the telomeric sequence may be from yeast or from a higher eukaryotic organism as described above.
  • the plant artificial chromosome of the present invention will contain two (2) telomeric sequences.
  • a plant artificial chromosome vector can be constructed using methods which are well-known in the art (see, for example, Maniatis, T.. et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, 1982)).
  • positive and negative selectable plant markers for example, antibiotic or herbicide resistance genes
  • a cloning site for insertion of foreign DNA are preferably included.
  • the artificial plant chromosome may also contain a origin of replication that can function in plants.
  • Artificial plant chromosomes which replicate in yeast also may be constructed to take advantage of the large insert capacity and stability of repetitive DNA inserts afforded by this system (Burke et al., Science, 236:806-812 (1987)). In this case, yeast ARS and centromere sequences are added to the artificial chromosome. The artificial chromosome is maintained in yeast as a circular molecule using a stuffer fragment to separate the teleomeres.
  • Nucleic acids for the essential components of the plant artificial chromosome obtained from any source whatsoever may be purified and inserted into the plant artificial chromosome at any appropriate restriction endonuclease cleavage site.
  • the nucleic acids usually will contain various regulatory signals (for example, promoters, termination segments, enhancers, etc., which are well known in the art) that allow for the expression of proteins encoded by the nucleic acids.
  • regulatory signals residing in the artificial chromosome may be utilized.
  • restriction enzymes the endonucleases which cleave only a specific nucleotide sequence are called restriction enzymes. Restriction endonucleases generally internally cleave DNA molecules at specific recognition sites, making breaks within "recognition" sequences that in many, but not all. cases exhibit twofold symmetry around a given point. Such enzymes typically create double-stranded breaks.
  • Some endonucleases create fragments that have blunt ends, that is, that lack any protruding single strands.
  • An alternative way to create blunt ends is to use a restriction enzyme that leaves overhangs, but to fill in the overhangs with a polymerase, such as klenow, thereby resulting in blunt ends.
  • blunt end ligation can be used to join the fragments directly together. The advantage of this technique is that any pair of ends may be joined together, irrespective of sequence.
  • exonucleases that preferentially break off terminal nucleotides are referred to as exonucleases.
  • small deletions can be produced in any DNA molecule by treatment with an exonuclease which starts from each 3' end of the DNA and chews away single strands in a 3' to 5' direction, creating a population of DNA molecules with single-stranded fragments at each end, some containing terminal nucleotides.
  • exonucleases that digest DNA from the 5' end or enzymes that remove nucleotides from both strands have often been used.
  • Some exonucleases which may be particularly useful in the present invention include Bal31, 51, and ExoIII.
  • nucleolytic reactions can be controlled by varying the time of incubation, the temperature, and the enzyme concentration needed to make deletions.
  • Phosphatases and kinases also may be used to control which fragments have ends which can be joined.
  • useful phosphatases include shrimp alkaline phosphatase and calf intestinal alkaline phosphatase.
  • An example of a useful kinase is T4 polynucleotide kinase.
  • the source DNA sequences and vector sequences have been cleaved and modified to generate appropriate ends they are incubated together with enzymes capable of mediating the ligation of the two DNA molecules.
  • enzymes capable of mediating the ligation of the two DNA molecules include T4 ligase, E. coli ligase, or other similar enzymes.
  • T4 ligase T4 ligase
  • E. coli ligase E. coli ligase
  • the action of these enzymes results in the sealing of the linear DNA to produce a larger DNA molecule containing the desired fragment (see, for example, U.S. Pat. Nos. 4,237,224; 4,264,731; 4,273,875; 4,322,499 and 4,336,336, which are specifically incorporated herein by reference).
  • termini of the linearized plasmid and the termini of the DNA fragment being inserted must be complementary or blunt in order for the ligation reaction to be successful. Suitable complementarity can be achieved by choosing appropriate restriction endonucleases (i.e., if the fragment is produced by the same restriction endonuclease or one that generates the same overhang as that used to linearize the plasmid, then the termini of both molecules will be complementary). As discussed previously, in a preferred embodiment, at least two classes of the vectors used in the present invention are adapted to receive the foreign oligonucleotide fragments in only one orientation. After joining the DNA segment to the vector. the resulting hybrid DNA can then be selected from among the large population of clones or libraries.
  • a method useful for the molecular cloning of DNA sequences includes in vitro joining of DNA segments, fragmented from a source high molecular weight genomic DNA, to vector DNA molecules capable of independent replication.
  • the cloning vector may include plasmid DNA
  • Methods and compositions for transforming a bacterium, a yeast cell, a plant cell, or an entire plant with one or more plant artificial chromosome vectors are further aspects of the present invention.
  • Means for transforming bacteria and yeast cells are well known in the art. Typically, means of transformation are similar to those well known means used to transform other bacteria or yeast such as E. coli or Saccharomyces cerevisiae.
  • Methods for DNA transformation of plant cells include Agrobacterium-mediated plant transformation, protoplast transformation, gene transfer into pollen, injection into reproductive organs, injection into immature embryos and particle bombardment. There are various advantages and disadvantages associated with each of these methods.
  • Methods for transforming plant cells include any method by which DNA can be introduced into a cell, such as by Agrob ⁇ cterium infection, direct delivery of DNA such as, by PEG-mediated transformation of protoplasts, by desiccation/inhibition-mediated DNA uptake, by electroporation, by agitation with silicon carbide fibers, by acceleration of DNA coated particles, etc.
  • Agrobacterium-mediated transformation is a widely applicable system for introducing genes into plant cells because the DNA can be introduced into whole plant tissues.
  • Agrob ⁇ cterium-mediated plant integrating vectors to introduce DNA into plant cells is well known in the art.
  • chromosomal integration is required for stable inheritance of the foreign DNA.
  • the artificial plant chromosome vector described herein may be used for transformation with or without integration, as the centromere function required for stable inheritance is encoded within the plant artificial chromosome.
  • transformation events in which the plant artificial chromosome is not chromosomally integrated may be preferred, in that problems with site- specific variations in expression and insertional mutagenesis may be avoided.
  • the integration of the Ti-DNA is a relatively precise process resulting in few rearrangements.
  • the region of DNA to be transferred is defined by the border sequences, and intervening DNA is usually inserted into the plant genome as described (Shmann et ⁇ l, Mol. Gen. Genet., 205:34 (1986); Jorgensen et ⁇ l, Mol. Gen. Genet., (1987)).
  • Modern Agrob ⁇ cterium transformation vectors are capable of replication in E. coli as well as Agrob ⁇ cterium, allowing for convenient manipulations as described (Klee et ⁇ l., Bio/Technology, 3:637-642(1985)).
  • Agrob ⁇ cterium-mediated transformation of leaf disks and other tissues such as cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects. Agrob ⁇ cterium-mediated transformation is most efficient in dicotyledonous plants. Few monocots appear to be natural hosts for Agrob ⁇ cterium, although transgenic plants have been produced in asparagus and more significantly in maize using Agrob ⁇ cterium vectors as described (Bytebier et ⁇ l., Proc. N ⁇ tl. Ac ⁇ d. Sci. USA, 84:5345 (1987)); U.S. Patent No. 5,591,616, specifically inco ⁇ orated herein by reference). Therefore, commercially important cereal grains such as rice, corn, and wheat must usually be transformed using alternative methods. However, as mentioned above, the transformation of asparagus using Agrob ⁇ cterium also can be achieved
  • Transformation of plant protoplasts can be achieved using methods based on calcium phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of these treatments (see, for example, Potrykus et ⁇ l., Mol. Gen. Genet. 199: 183 (1985); Lorz et ⁇ l., Mol. Gen. Genet., 199: 178 (1985); Fromm et ⁇ l. , Nature, 312:791-793 (1986); Uchimiya et al. , Mol. Gen. Genet., 204:204 (1986); Callis et al, Genes and Development, 1 : 1 183 (1987); Marcotte et al, Nature, 335:454 (1988)).
  • DNA is carried through the cell wall and into the cytoplasm on the surface of small metal particles as described (Klein et al. Nature, 327:70-73 (1987); Klein et al. Proc. Natl. Acad. Sci. USA, 85:8502-8505 (1988); McCabe et al. Biotechnology, 6:923 (1988)).
  • the metal particles penetrate through several layers of cells and thus allow the transformation of cells within tissue explants.
  • a rice BAC library was constructed from an indica rice (Oryza sativa ssp. Indica) line IR-BB21 and consisted of 1 1,000 clones (see Wang, G.-L., et al., Plant J. 7: 525-533 (1995), herein inco ⁇ orated by reference).
  • the cereal centromeric DNA element pSau3A9 (Jiang, J., et al., Proc. Natl,. Acad. Sci. USA 93: 14210-14213 (1996), herein inco ⁇ orated by reference) was used to isolate the rice centromere-specific BAC clones.
  • Rice lines used in this example include a javanica rice (O. sativa ssp. Javanica) line DV85, a japonica rice (O. sativa ssp. Japonica ) line Norin 28, and indica rice line IR72, and four other Oryza species (O. glaberrima, O. rufipogon, O. officinalis, and O. aha).
  • Gramineae species used in conservation studies include two species from the Bambusoideae subfamily [bamboo (Bambusa vulgaris), Pharus sp.], three species from the Panicoideae subfamily [sorghum, maize (Zea mays), and sugar cane (Saccharum officinarum)], six species form the Pooideae subfamily [Agropyron intermedium, barley (Hordeum vulgare), oats (Avena sativa), rye (Secale cereale), wheat (Triticum aestivum), and Aegilops squarrosa ⁇ .
  • Three non-Gramineae species, Juncus effusus, Cyperus alternifolius, and A. thaliana, and rye and maize lines containing B chromosomes also were included.
  • BAC filter preparations and BAC library screening were conducted as described in Wang, G.-L., et al., Plant J. 7:525-533 (1995); Hoheisel, J. D., et al., Cell 73:109-120 (1993), herein inco ⁇ orated by reference.
  • BAC clones were isolated by using pSau3A9 as a probe, and their cytological locations were confirmed by fluorescence in situ hybridization (hereinafter referred to as "FISH").
  • DNA from clone 17p22 was digested with restriction enzymes, BamHl, Dral, EcoRl,
  • Clone pRCSl contains a 877-bp Sau3Al fragment that hybridizes to the pSau3A9 sequence. Sequencing analysis revealed that the 259 bp at the 3' end of pRCSl had 80% sequence identity to the central part (bases 338-602) of the pSau3A9 sequence (see Jiang, J., et al., Proc. Natl,. Acad. Sci. USA 93: 14210-14213 (1996), FIG. 1 and SEQ ID NO:8). The first 95 bp in pRCSl had 76% sequence identity to a Ty3/ 'gypsy class of retrotransposon sequence reported in maize (GenBank Accession No. AF030633).
  • Nucleotides 171-228 of pRCSl had 70% sequence identity to a Ty3/gypsy class of retrotransposon sequence reported in Lilium henryi (XI 3886). It was also discovered that the pSau3A9 sequence in sorghum has similar sequence identities to the Ty3l gypsy class of retrotransposons. These results indicated that both pSau3A9 and pRCSl probably were derived from retrotransposon-related DNA sequences.
  • the RCSl sequence was located in the centromeric regions of all 24 rice chromosomes by FISH (see FIG. 2A). The sizes and intensities of the FISH signals were uniform on different chromosomes, suggesting that all rice chromosomes contain a similar number of copies of this element. Slot blot analysis suggested that ther are about 130 copies of RCSl present in the haploid genome of japonica rice DV85 (see Table 2).
  • Rice genomic DNA was digested with several restriction enzymes and probed with the 259-bp fragment conserved between rice and sorghum. One or few major bands and several minor bands were detected in most of the lanes (see FIG. 3). Fiber-FISH using pRCSl as a probe did not generate clustered signals. These results suggested that the RCS 1 sequence is dispersed in the centromeric regions of rice chromosomes.
  • FISH analysis revealed that pRCSl also hybridized exclusively to centromeric regions of chromosomes from other Gramineae species (see FIG. 2B-E .
  • the FISH results on rye (see FIG. 25) and barley (see FIG. 2C) chromosomes showed that hybridization was exclusive to the primary constrictions.
  • FISH signals also were detected in the centromeres of the supernumerary
  • Clone pRCS2 contains a 639-bp Sau3Al fragment consisting of four copies of a tandemly arranged repeat with a consensus sequence of 168bp (see FIG. 4). The four copies were 84-91% identical with one another. The third copy of the repeat contains a 6-bp insertion (TTGGCC) at base 147.
  • TTGGCC 6-bp insertion
  • Southern hybridization analysis of rice genomic DNA using probe pRCS2 revealed ladder patterns using several restriction enzymes, including Dpnll, Sau3Al. Mspl, Hpall, and Haelll. indicating that the RCS2 family is tandemly arranged in the rice genome (see FIG. 5).
  • Several enzymes produced digestion profiles comprised of monomer and multiples (dimer, trimer, tetramer, etc.) of the 168-bp basic repeat.
  • Probe pRCS2 hybridized only to the centromeric regions on all rice chromosomes (see FIG. 2K). Significant variation in the size and intensity of the FISH signals was detected in different centromeres. Two pairs of chromosomes had strong signals, and a third pair had very faint signals (see FIG. 2K). All of the signals became weaker as the posthybridization washing stringency was increased (see FIG. 2L). However, even after washing in 70% formamide at
  • AA genome Three subspecies of O. sativa (AA genome), together with O. glaberrima (AA), O. rufipogon (AA), O. aha (CCDD), and O. of ⁇ cinalis (CC) were included for FISH analysis. FISH signals were detected in the centromeric regions from all of the chromosomes of these species. Southern hybridization analysis revealed that the RCS2 family is present only in the species within genus Oryza. Homologous sequences could not be detected even at a low stringency in any plant species outside of genus Orya. RCS2 is the most abundant element isolated from BAC 17p22 and has about 1.550 copies, corresponding to 6,200 monomers, in the haploid genome of DV85 (see Table 2 above).
  • BAC 17p22 contains about 46 copies of this element, corresponding to approximately 39% of the BAC insert.
  • Fiber-FISH analysis demonstrated that the RCS2 family is organized into various sizes of uninterrupted arrays in the rice genome.
  • the longest observed block with small interspersed gaps ( ⁇ 2 ⁇ m) was 51 ⁇ m (see FIG. 2N). Based on a 2.96-kb/ m resolution of the Fiber-FISH technique, this block represents approximately 151 kb of uninterrupted RCS2 sequences.
  • the longest observed single Fiber-FISH signal with interspersed gaps larger than 2 ⁇ m was 188 ⁇ m representing approximately 556 kb of centromeric DNA sequences.
  • the other five centromeric DNA elements isolated from rice BAC clone 17p22 were analyzed by FISH, and all of them hybridized exclusively to the centromeric regions of all rice chromosomes (see FIG. 2F-J).
  • One or two pairs of rice metaphase chromosomes showed weak hybridization when pRCH2, pRCH3, and pRCE2 were used as probed. No relationship can be confirmed between signal intensities and the sizes of the chromosomes.
  • cytosine nucleotides especially those in dinucleotide sequence 5'CpG3', are the most common sites for methylation in plant genomes. Methylation occurs at lower frequencies when the C and G are separated by 1-2 A/T nucleotides (see Gruenbaum, T., et al. Nature (London)
  • Enzymes Mspl and Hpall ar isoschizomers that recognize the 5'CCGG3' sequence. Neither enzyme can cut when the 5'C is methylated, and only Mspl can cleave when the internal cytosine is methylated. Though both enzymes produced similar digestion profiles of rice genomic DNA, Mspl generated much smaller-sized hybridization bands from all of the rice centromeric DNA probes than Hpall did (see FIG. 3 for RCS 1 and FIG. 5 for RCS2). For the
  • RCS2 element monomers of the 168-bp basic repeat could be found in Mspl lane, and most of the hybridization was in the fragments smaller than 2 kb. whereas the majority of hybridization in the Hpall lane was larger than 2 kb (see FIG. 5).
  • DNA fragments smaller than 5 kb were not detected in Hpall lanes (see FIG. 3 for RCSl).
  • Restriction enzyme Sail recognizes 5OTCGAC3' and is sentsitive to the methylation of CpG dinucleotides. Small fragments ( ⁇ 10 kb) that hybridized to the centromeric elements were not detected in the Sail lanes (see FIGS. 3 and 5).

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Organic Chemistry (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Cell Biology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The present invention relates to nucleic acids which encode a functional centromere from Oryza sativa. The nucleic acids of the present invention can be also used to create a plant artificial chromosome.

Description

DNA SEQUENCES SPECIFIC TO RICE CENTROMERES
This invention was made with United States government support awarded by the following agencies: USDA HATCH 3935. The United States has certain rights in this invention.
Field of the Invention
The present invention relates generally to molecular biology. In particular, the present invention relates to nucleic acid sequences which encode a centromere from rice.
Background of the Invention
Among the most distinguishing and characteristic landmarks of chromosomes of higher eukaryotes is the location of the centromere. The centromeric region is the site for mitoitic and meiotic spindle fiber attachment and is responsible for sister chromatid association. Jiang, J., et al., Proc. Natl Acad. Sci. USA, 95: 8135-8140 (1998). Therefore, centromeres play a central role in the process of chromosomal segregation and transmission in cell divisions. Id. The molecular organization of centromeres has been studied extensively in yeast, Drosophila melanogaster, and humans.
Centromeric regions usually consist of heterochromation and are thought to be highly methylated. Miller, J.T., et al., Theor. Appl. Genet., 96:832-839 (1998). In addition, centromeres show varying amounts of nontranscribed repetitive sequences, which are referred to as satellite DNAs. Haaf, T., et al., Cell, 70:681-695 (1992). The predominant class of centromeric DNA, is the alpha-satellite DNA, which is found in diverged form in all centromeres. Id. To the extent that it is known, alpha-satellite arrays appear to be uninterrupted by other (nonsatellite) DNA sequences.
In centromeres, naturally occurring satellite arrays range in size from several hundred kbs to several megabases in length. Recent studies, however, suggest that as little as 140 kb of alpha satellite DNA may be sufficient to confer centromere function in human cells. Harrington, John, O 01/00858
J., et al.. Nature Genetics, 15:345-355 (1997). Unfortunately, satellite DNA of this size has proven difficult to clone and propagate stably in microorganisms using conventional cloning vectors. Id. In large part, the difficulty in propagating satellite DNA stems from the tendency of tandemly repetitive DNA to recombine into smaller arrays and this effect increases with the size of the repetitive array. Id.
As briefly mentioned hereinbefore, functional centromeric sequences have been isolated and purified from S. cerevisiae (see Clark et al., Nature, 287:504-509 (1980) and Stinchcomb et al., J. Molec. Biol. 158:157-179 (1982)). Episomes carrying the yeast centromeric sequences display proper segregation into daughter yeast cells during mitosis and meiosis, in contrast to autonomous replication sequences plasmids lacking a centromere.
The best characterized centromeric DNAs originated from the budding yeast S. cerevisiae (See Clarke and Carbon, Ann. Rev. Genet. 19:29-56 (1985)). The DNA region required for centromere function in S. cerevisiae is approximately 120 base pair (hereinafter "bp") long and is composed of three conserved domains: CDEI, an 8 bp element (A/G)TCAC(A/G)TG), CDEII, an extremely (about90%) AT-rich region of approximately 80 bp, and CDEIII, a 25 bp element (TGTTT(A/T)TGNTTTCCGAAANNNNAAA). The molecular structure of centromeric DNAs from the fission yeast Schizosaccharomyces pombe have also been characterized. Several classes of 5. pombe moderately repeated DNA elements have been identified which are found only in the centromere regions. These centromere-specific repetitive elements have been designated dg (3.8 kb), dh (4 kb), and yn by Yanagida and co-workers (Nakaseko et al., Embo. J. 5..T011-1021 (1986); Nakaseko et al., Nuc. Acid Res. 15:4705-4715 (1987)), and K (6.4 kb), L (6 kb), and B (1 kb) by Carbon and his colleagues (Clarke et al., PNAS 83:8253-8257 (1986); Fishel et al., Mol. Cell Biol. 8:754-763 (1988)). The dg element has an AT-rich region and a 600 bp domain containing numerous small direct repeat motifs. Similarly, the dh element has an overall AT content approaching 70% and contains many short direct repeats. No nucleotide similarities to the S. cerevisiae CDEs have been found in the S. pombe elements. Attempts to demonstrate that the S. pombe centromere-specific repetitive elements can function individually as centromeres have been unsuccessful. However, large restriction fragments (65 to 150 kb) carrying the entire fission yeast centromere regions of chromosome 1 or 3 function as centromeres when introduced into acentric episomes (Hahnenberger et al.. PNAS USA 86:577-581 (1989)). These results indicate that either fission yeast centromeres are large composite structures that cannot be subdivided, or the functional fission yeast centromere element has not yet been identified.
In contrast to the detailed studies done in S. cerevisiae and S. pombe, in most eukaryotes. only limited information is available regarding the organization of the centromeres. For example, limited information is known about plant centromeres. Peacock et al., Proc. Natl. Acad. Sci. USA. 78:4490-4494 (1981 ) report the first isolation of a repetitive DNA element from maize knobs. This repetitive DNA element acts as neocentromeres in certain genetic backgrounds. A repetitive DNA element has also been cloned from the centromeres of the supernumerary B chromosomes of maize (see Alfenito, M.R., et al.. Genetics 135:589-597
(1993) and Kaszas, E., et al., EMBO J., 15:5246-5255 (1996)). Part of this B-specific DNA element shows strong homology to the maize sequences. A 180-bp tandem repeat (pALl family) is the major component of the centromeric region of Arabidopsis thaliana chromosomes. The genomic organization of this repeat family shares similarities to the alpha satellite DNA at the human centromeres (see Martinez-Sapater, J., et al., Mol. Gen. Genet., 204:417-423 (1986);
Simoens, C. R., Nucleic Acids Res., 16:6753-6766 (1988); Maluszynska, J., et al.. Plant J., 1 : 159-166 (1991): Round, E.K., et al., Genome Res., 7: 1045-1053 (1997)).
As discussed above, very few putative functional centromeres have been cloned from plants. The cloning of a putative functional centromere from a plant is a necessary first step in the production of artificial chromosomes suitable for use in plants. Artificial chromosomes are man-made linear or circular DNA molecules constructed from essential cis-acting DNA sequence elements that are responsible for the proper replication and partitioning of natural chromosomes (see. Murray et al., Nature, 305:189-193 (1983)). The essential elements of an artificial chromosome are: Autonomous Replication Sequences (ARS) (have properties of replication origins, which are the sites for initiation of DNA replication), (2) centromeres (site of kinetochore assembly and responsible for proper distribution of replicated chromosomes at mitosis and meiosis), and (3) telomeres (specialized structures at the ends of linear chromosomes that function to stabilize the ends and facilitate the complete replication of the extreme termini of the DNA molecule). The use of artificial chromosomes as an alternative to commonly used method of introducing new genetic information into cells is steadily increasing.
Summary of the Invention
The present invention relates to isolated and purified nucleic acids having the nucleotide sequences shown in: SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 SEQ ID NO:6 and SEQ ID NO:7.
The present invention also relates to a recombinant DNA construct which contains a rice centromere. The centromere contains a number of highly repetitive regions of DNA that have the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or combinations thereof. The recombinant DNA construct may also contain a yeast autonomous replication sequence, an autonomous replication sequence from a higher eukaryotic organism, a yeast telomere sequence or a telomere sequence from a higher eukaryotic organism and a selectable marker gene.
The present invention also relates to a plasmid containing the hereinbefore described DNA construct. This plasmid may contain an origin of replication and a selectable marker which functions in bacteria (such as E. coli) or in yeast (such as S. cerevisiae).
The present invention relates to a plant artificial chromosome vector. The plant artificial chromosome vector of the present invention contains an autonomous replication sequence, two telomere sequences, a centromere sequence having the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or a combination thereof, and at least one selectable marker sequence. The autonomous replication sequence may be from yeast or from a higher eukaryotic organism and the telomere sequence may from yeast or from a higher eukaryotic organism, such as, but not limited to, Arabidopsis thaliana.
The present invention also relates to a plant cell transformed with the plant artificial chromosome vector hereinbefore described and to transgenic plants containing said plant cell. The plant cell and plant may be from Oryza sativa.
Finally, the present invention relates to a method of identifying centromeric DNA in a higher eukaryotic organism. The method involves hybridizing an isolated nucleic acid selected from the group consisting of SEQ ID NO:l. SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO: 6 and SEQ ID NO: 7 and combinations thereof with a sample of DNA from a higher eukaryotic organism and then identifying and isolating the centromeric DNA from said sample.
Brief Description of the Drawings
FIG. 1 shows the nucleotide sequence of pSau3A9.
FIG. 2A - FIG. 2N shows the FISH analysis of rice centromeric DNA elements. The probes were biotinylated and hybridized in situ to rice chromosomes or DNA fibers. The probes were detected by fluorescein isothiocyanate-conjugated antibodies (green color) and the chromosomes were stained with propidium iodide (red color). Probes pRCSl hybridized exclusively to the centromeric regions of the chromosomes from rice (FIG. 2A), rye (FIG.2B), barley (FIG. 2C), sorghum (FIG. 2D), and maize (FIG. 2E). FISH signals also were detected in the centromeric regions of the acrocentric B chromosomes (see arrows) from rye (FIG. 2B) and maize (2E). Similarly, rice centromeric DNA families RCH2 (FIG. 2F), RCHl (FIG. 2G), RCH3 (FIG. 2H), RCE1 (FIG. 21), RCE2 (FIG. 2J), and RCS2 (FIG. 2K) all were located in the centromere of every rice chromosome. Two pairs of chromosomes with the strongest signals are indicated by arrows and the third pair with the weakest signals by arrowheads (FIG. 2K). The same metaphase cell (FIG. 2K) was washed under medium (FIG. 2L) and high strigencies (FIG. 2M), and most signals were still discernible (FIG. 2N). The marked array between two arrows is 51 μm long and represents approximately 151 -kb DNA. All bars are 10 μm.
FIC. 3 shows a Southern blot of the genomic organization of the RCS1 family. Rice genomic DNA was digested with Sau3 Al (lane 1), Dpnll (lane 2), Hαelll (lane 3), Mspl (lane 4), Hpall (lane 5), Sail (lane 6), BamRl (lane 7), Dral (lane 8), EcoRl (lane 9), and Hindlll (lane 10) and probed with pRCS 1.
FIG 4 shows the nucleotide sequence of pRCS2. The 639-bp insert of pRCS2 contains four copies of a tandemly arranged repeat. The four members (A-D) range from 155 to 165 bp and share 84-91 % sequence identify with one another. F represents the consensus sequence of the four members.
FIG. 5 shows a Southern blot of the genomic organization of the RCS2 family. Rice genomic DNA was digested with Dpnll (lane 1), Sau3 Al (lane 2), Mspl (lane 3), Hpall (lane 4). Sail (lane 5), and Hαelll (lane 6), and probed with pRCS2.
FIG. 6A - FIG. 6B show a Southern blot of the conservation of the RCHl and RCE1 families in Gramineae species. Genomic DNA from sorghum (lane 1), maize (lane 2), sugar cane (lane 3), Ag. intermedium (lane 4), barley (lane 5), oats (lane 6), rye (lane 7), wheat (lane 8), Ae. Squarrosa (lane 9), rice (lane 10), bamboo (lane 1 1), Pharus sp. (lane 12), J. effusus (lane
13), C. alternifolius (lane 14) and A. thaliana (lane 15) was digested with Hindlll and probed with pRCHl (FIG. 6A) and pRCEl (FIG. 6B).
FIG. 7 shows the nucleotide sequence of RCS1. FIG. 8 shows the nucleotide sequence of RCS2.
FIG. 9 shows the nucleotide sequence of RCHl.
FIG. 10 shows the nucleotide sequence of RCH2.
FIG. 11 shows the nucleotide sequence of RCH3.
FIG. 12 shows the nucleotide sequence of RCE1.
FIG. 13 shows the nucleotide sequence of RCE2.
Detailed Description of the Invention
Background
The present invention relates to cloned centromeric DNA from Oryza sativa (rice). More specifically, the inventors of the present invention have discovered that the cloned centromeric DNA of the present invention contains seven (7) different repetitive regions of complex DNA. These seven (7) repetitive regions are referred to herein as follows: RCSl, RCS2, RCHl. RCH2, RCH3, RCE1 and RCE2.
The present invention relates to isolated and purified nucleic acids for each of the seven (7) different repetitive regions of centromeric DNA from Oryza sativa. The nucleic acids of the present invention encode a functional centromere from Oryza sativa.
The present invention further relates to the use of the nucleic acids of the present invention as primers and probes to identify centromeric DNA from other plants and animals. In addition, the nucleic acid sequences disclosed herein can be used to create a plant artificial chromosome vector. Definitions
Units, prefixes, and symbols can be denoted in the SI accepted form. Numeric ranges are inclusive of the numbers defining the range. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation, respectively. The headings provided herein are not limitations of the various aspects or embodiments of the invention which can be had by reference to the specification as a whole. Accordingly, the terms defined immediately below are more fully defined by reference to the specification as a whole.
As used herein, the term "plant" includes reference to whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds and plant cells and progeny thereof. The class of plants which can be used in the methods of the present invention are generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledonous and dicotyledonous plants.
As used herein, the term "transformation" or "transfection" means the acquisition in cells of new DNA sequences through incorporation of added DNA. This is the process by which naked DNA, DNA coated with protein, or whole artificial chromosomes are introduced into a cell, resulting in a heritable change.
As used herein, the term "host" means any organism that is the recipient of a replicable plasmid or vector comprising a plant artificial chromosome. Preferably, host strains used for cloning are free of any restriction enzyme activity that might degrade the foreign DNA used. Preferred examples of host cells for cloning which are useful in the present invention are bacteria, such as Escherichia coli, Bacillus subtilis, Pseudomonas, Streptomyces, Salmonella, and yeast cells such as S. cerevisiae. Host cells which can be targeted for expression of a plant artificial chromosome may be plant cells of any source, such as, but not limited to, Arabidopsis, maize, rice, sugarcane, sorghum, barley, soybeans, tobacco, wheat, tomato, potato or citrus.
As used herein, the term "linker" means a DNA molecule, generally up to 50 or 60 nucleotides long and synthesized chemically, or cloned from other vectors.
As used herein, the term "plasmid" or "vector" (such as a cloning vector or expression vector)" refers to a closed covalently circular extrachromosomal DNA or linear DNA which is able to autonomously replicate in a host cell and which is normally nonessential to the survival of the cell. A wide variety of plasmids and other vectors are well known and commonly used in the art.
As used herein, "heterologous" when used to describe nucleic acids or polypeptides refers to nucleic acids or polypeptides that originate from a foreign species, or, if from the same species, are substantially modified from their original form. For example, a promoter operably linked to a heterologous structural gene is from a species different from that from which the structural gene was derived, or, if from the same species, one or both are substantially modified from their original form.
As used herein, "isolated" includes reference to material which is substantially or essentially free from components which normally accompany or interact with it as found in its naturally occurring environment. The isolated material optionally comprises material not found with the material in its natural environment.
As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence includes the complementary sequence thereof.
As used herein, "operably linked" includes reference to a functional linkage between a promoter and a second sequence, wherein the promoter sequence initiates and mediates transcription of the DNA sequence corresponding to the second sequence. Generally, operably linked means that the nucleic acid sequences being linked are contiguous and, where necessary to joint two protein coding regions, contiguous and in the same reading frame.
As used herein "recombinant" includes reference to a cell, or nucleic acid, or vector, that has been modified by the introduction of a heterologous nucleic acid or the alteration of a native nucleic acid to a form not native to that cell, or that the cell is derived from a cell so modified. For example, recombinant cells express genes that are not found within the native (non- recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all.
As used herein, a "recombinant DNA construct" is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements which permit transcription and translation of a particular nucleic acid in a target cell. The DNA construct can be part of a plasmid, vector, virus, or nucleic acid fragment. Typically, the recombinant DNA construct portion of the construct includes a nucleic acid to be transcribed and translated, and a promoter. In the present invention, the recombinant DNA construct can be a plant artificial chromosome.
As used herein, "transgenic plant" includes reference to a plant modified by introduction of a heterologous nucleic acid.
As used herein, "telomere" refers to the end of a chromosome comprising a simple repeat DNA. The function of a telomere is to allow the ends of a linear DNA molecule to be replicated.
As used herein, "eukaryote" refers to living organisms whose cells contain nuclei. A eukaryote may be distinguished from a "prokaryote" which is an organism which lacks nuclei. Prokaryotes and eukaryotes differ fundamentally in the way their genetic information is organized, as well as their patterns of RNA and protein synthesis. As used herein, "lower eukaryote" refers to a eukaryote characterized by a comparatively simple physiology and composition, and unicellularity. Examples of lower eukaryotes include flagellates, ciliates. and yeast.
As used herein, "higher eukaryote" refers to a multicellular eukaryote, characterized by its greater complex physiological mechanisms as well as its ability to interact with its environment in a more sophisticated manner. Generally, more complex organisms such as plants and animals are included in this category. Preferred higher eukaryotes to be transformed by the present invention include, for example, monocot and dicot angiosperm species, gymnosperm species, fern soecies, plant tissue culture cells of these species, and algal cells. It will of course be understood that prokaryotes and eukaryotes alike may be transformed by the methods of this invention.
As used herein, a "selectable marker" is a gene whose presence results in a clear pheontype. and most often a growth advantage for cells that contain the marker. This growth advantage may be present under standard conditions, altered conditions such as elevated temperature, on in presence of certain chemicals such as herbicides or antibiotics. Examples of selectable markers include the thymidine kinase gene, the cellular adenine- phosphoriboysltransferase gene and the dihydrylfolate reducast gene, hygromycin phosphotransferase genes, the bar gene and the neomycin phosphotransferase genes, among others. Preferred selectable markers in the present invention include genes whose expression confer antibiotic or herbicide resistance to the host cell, sufficient to enable the maintenance of a vector with a host cell, and which facilitate the manipulation of a plasmid into new host cells.
As used herein, "nucleotide" refers to one of the monomeric units from which DNA or
RN A polymers are constructed, consisting of a purine or pyrimidine base, a pentose, and a phosphoric acid group. The nucleotides of DNA are deoxyadenylic acid, thymidylic acid, deoxyguanilic acid, and deoxycytidylic acid. The corresponding nucleotides of RNA are adenylic acid, uridylic acid, guanylic acid, and cytidylic acid. SEQUENCE LISTING
The present application also contains a sequence listing that contains 8 sequences. The sequence listing contains nucleotide sequences. For the nucleotide sequences, the base pairs are represented by the following base codes:
Symbol Meaning
A A; adenine
C C; cytosine
G G; guanine
T T; thymine u U; uracil
M A or C
R A or G
W A or T/U
S C or G
Symbol Meaning
Y C or T U
K G or T U
V A or C or G; not T/U
H A or C or T U; not G
D A or G or T/U; not C
B C or G or T U; not A
N (A or C or G or T U)
Nucleic Acid Sequences
In one embodiment, the present invention relates to isolated and purified nucleic acids which encode a functional centromere from Oryza sativa. As used herein, the term " a functional centromere" refers to the centromere or chromosome site that directs or supports kinetechore formation. The kinetochore is the physical structure that mediates the attachment of the spindle fibers to the chromosome and is therefore responsible for the proper partition of the chromosomes at mitosis and meiosis. The nucleic acids of the present invention encode seven (7) different repetitive regions of centromeric DNA from Oryza sativa. Exemplary nucleic acids for such centromeres have the nucleotide sequences shown in SEQ ID NO: l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 or combinations thereof. SEQ ID NO: 1 is also referred to herein as RCSl . SEQ ID NO:2 is also referred to herein as: RCS2. SEQ ID NO:3 is also referred to herein as RCHl. SEQ ID NO:4 is also referred to herein as RCH2. SEQ ID NO:5 is referred to herein as RCH3. SEQ ID NO:6 is referred to herein as RCEl . SEQ ID NO:7 is referred to herein as RCE2.
The present invention also contemplates nucleic acids which hybridize under stringent hybridization conditions to the nucleotide sequences set forth above. Generally, stringent conditions are selected to be about 5°C to about 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH 7) at which 50% of the target sequence hybridizes to a perfectly matched probe. Typically, stringent wash conditions are those in which the salt concentration is about 0.22 molar at pH 7 and the temperature is at least about 50°C. However, nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if it encodes a substantially identical and functional centromere. This may occur, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. The present invention also contemplates naturally occurring allelic variations and mutations of the nucleotide sequences set forth above so long as those variations and mutations code, on expression, for a functional centromere.
As is well known in the art, because of the degeneracy of the genetic code, there are numerous other DNA and RNA molecules that can code for the same functional centromere as encoded by SEQ ID NOS: 1-7, or portions thereof. The present invention, therefore, contemplates those other DNA and RNA molecules, which, on expression, encode for a functional centromere encoded by the nucleic acid sequences of SEQ ID NOS: 1-7 or portions thereof. With knowledge of all triplet codons for each particular amino acid residue, it is possible to describe all such encoding RNA and DNA sequences. DNA and RNA molecules other than those specifically disclosed herein and, which molecules are characterized simply by a change in a codon for a particular amino acid are within the scope of this invention. A table of codons representing particular amino acids is set forth below in Table 1.
TABLE 1
First Position S Seeccoonndd PPoossiittiioonn Third Position
(5' end) (3' end)
T/U C A G
P Phhee S Seerr T lyyrr C Cyyss T/U
P Phhee S Seerr T Tyyrr C Cyyss C
T/U L Leeuu S Seerr S Sttoopp S Sttoopp A
L Leeuu S Seerr S Sttoopp S Sttoopp G
L Leeuu P Prroo H Hiiss A Arrgg T/U L Leeuu P Prroo H Hiiss A Arrgg C c L Leeuu P Prroo G Giinn A Arrgg A
L Leeuu P Prroo G Giinn A Arrgg G
lie Thr Asn Ser T/U
He Thr Asn Ser C
A H Hee T Thhrr L Lyyss A Arrgg A
Met Thr Lys Arg G
Val Ala Asp Gly T/U Val Ala Asp Gly C
G Val Ala Glu Gly A
Val Ala Glu Gly G
The nucleic acid sequences of the present invention can be used in marker-aided selection using techniques which are well-known in the art. Marker-aided selection does not require the complete sequence of the gene or precise knowledge of which sequence confers which specificity. Instead, partial sequences can be used as hybridization probes or as the basis for oligonucleotide primers to amplify by PCR or other methods to identify nucleic acids specific for functional centromeric DNA in other plants and animals.
Plant Artificial Chromosome In a second embodiment, the present invention relates to a plant artificial chromosome.
More specifically, the nucleic acid sequences of the present invention can be used to construct a plant artificial chromosome vector. A plant artificial chromosome must contain the following essential elements: (1) autonomous replication sequences (hereinafter referred to as "ARS"), (2) a centromere which is functional in plants, and (3) telomeres which are functional in plants.
Autonomous Replication Sequences
ARSs have been isolated from the unicellular fungi Saccharomyces cerevisiae (brewer's yeast) and Schizosaccharomyces pombe (see Stinchcomb et al.. Nature 282:39- 43 (1979) and Hsiao et al, J. Proc. Natl. Acad. Sci. USA 76:3829-3833 (1979)). ARSs behave like replication origins allowing DNA molecules that contain the ARS to be replicated as an episome after introduction into the cell nuclei of these fungi. Although plasmids containing these sequences replicate, they do not segregate properly.
U.S. Patent 5,270,201 (hereinafter the " '201 Patent"), hereby incoφorated by reference, discloses a method for isolating ARS sequences for use in higher eukaryotic organisms, by the formation of minichromosomes derivative of natural chromosomes. It has been demonstrated in yeast that inverted repeats of telomeric sequences are "resolved" by an unknown mechanism which results in a double-stranded cleavage between inverted repeats. After an inverted telomere repeat is introduced into a chromosome, a resolution reaction will lead to scission of the chromosome and formation of two chromosomal fragments, each with two telomeres. This process generates a minichromosome small enough to be isolated intact allowing further manipulation by in vitro techniques to delimit the sequences required for autonomous replication. A second approach for an obtaining ARS is also disclosed in the '201 Patent. This approach is referred to as a "shotgun cloning approach". Higher eukaryotic organisms have many replication origins distributed throughout their genomes. For example, the A. thaliana genome contains approximately 1000 origins spaced every 70 kb along the chromosome. Therefore, the shotgun cloning approach involves looking for random fragments of genomic
DNA throughout the genome of interest which promote extrachromosomal replication.
Autonomous replication sequences for use in the plant artificial chromosome of the present invention can be obtained using methods which are well known in the art. Autonomous replication sequences from yeast, such as those described above, can be used in the present invention. Moreover, ARS sequences from higher eukaryotic organisms obtained using the methods described in the '201 patent can also be used in plant artificial chromosome of the present invention.
Telomeres
Telomeres are believed to be involved in the priming of DNA replication at the chromosome end (see, Blackburn et al., Ann. Rev. Biochem. 53: 163-194 (1984)). This is because conventional DNA polymerases are template dependent, synthesize DNA in the 5' to 3' direction, and require an oligonucleotide primer to donate a 3' OH group. When this primer is removed, unreplicated single-stranded gaps arise; most of these gaps can be filled in by priming from 3'
OH groups donated by newly replicated strands located at the 5' end of the gap. However, the unreplicated gaps which lie next to the extreme 5' end of the DNA duplex cannot be primed in this manner. Consequently, telomeres must provide an alternative priming mechanism.
Telomeres are also responsible for the stability of chromosomal termini. Telomeres act as
"caps," suppressing the recombinogenic properties of free, unmodified DNA ends (see Blackburn et al., Ann. Rev. Biochem. 53:163-194 (1984)). This reduces the formation of damaged and rearranged chromosomes which arise as a consequence of recombination-mediated chromosome fusion events.
Telomeres may also contribute to the establishment or maintenance of intranuclear chromatin organization through their association with the nuclear envelope (see, for example, Fussell, C. P., Genetica 62:192-201 (1984)).
Telomeric or telomeric-like DNA sequences have been cloned from several lower eukaryotic organisms, principally protozoans and yeast. The ends of the Tetrahymena linear DNA plasmid have been shown to function like a telomere on linear plasmids in S. cerevisiae (see Szostak, J. W., Cold Spring Harbor Symp. Quant. Biol. 47: 1 187-1 194 (1983)). A telomere from the flagellate Trypanosoma has been cloned (see, for example, Blackburn et al., Cell 36:447-457 (1984)). A yeast telomeric sequence has been identified (see, for example, Shampay et al., Nαtwre 310:154-157 (1984)).
U.S. Patent 5,270,201 disclose a method for obtaining a telomere from a higher eukaryotic organism, specifically, from Arabidopsis thaliana. The telomeric sequences disclosed in the '201 Patent contain a tandem repeat of the sequence 5'-CCCTAAA-3.
Any telomeric sequence which produces a telomere which is functional in plants can be inserted into the plant artificial chromosome of the present invention. The telomeric sequence may be from yeast or from a higher eukaryotic organism as described above. Preferably, the plant artificial chromosome of the present invention will contain two (2) telomeric sequences.
Construction of a Plant Artificial Chromosome Once the essential elements of a plant artificial chromosome are obtained (the ARS, centromere and teleomeres), a plant artificial chromosome vector can be constructed using methods which are well-known in the art (see, for example, Maniatis, T.. et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, 1982)). In addition to the essential elements described above, preferably positive and negative selectable plant markers (for example, antibiotic or herbicide resistance genes), and a cloning site for insertion of foreign DNA are preferably included. In order to propagate vectors in E. coli, it is necessary to convert the linear molecule into a circle by the addition of a stuffer fragment between the telomeres. In addition to the stuffer fragment, the artificial plant chromosome may also contain a origin of replication that can function in plants.
Artificial plant chromosomes which replicate in yeast also may be constructed to take advantage of the large insert capacity and stability of repetitive DNA inserts afforded by this system (Burke et al., Science, 236:806-812 (1987)). In this case, yeast ARS and centromere sequences are added to the artificial chromosome. The artificial chromosome is maintained in yeast as a circular molecule using a stuffer fragment to separate the teleomeres.
Nucleic acids for the essential components of the plant artificial chromosome obtained from any source whatsoever, may be purified and inserted into the plant artificial chromosome at any appropriate restriction endonuclease cleavage site. The nucleic acids usually will contain various regulatory signals (for example, promoters, termination segments, enhancers, etc., which are well known in the art) that allow for the expression of proteins encoded by the nucleic acids. Alternatively, regulatory signals residing in the artificial chromosome may be utilized.
The techniques and procedures required to accomplish insertion are well-known in the art (see Maniatis et al. , Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., (1982)). Typically, this is accomplished by incubating a circular plasmid or a linear DNA fragment in the presence of a restriction endonuclease such that the restriction endonuclease cleaves the DNA molecule. Endonucleases preferentially break the internal phosphodiester bonds of polynucleotide chains. They may be relatively unspecific, cutting polynucleotide bonds regardless of the surrounding nucleotide sequence. However, the endonucleases which cleave only a specific nucleotide sequence are called restriction enzymes. Restriction endonucleases generally internally cleave DNA molecules at specific recognition sites, making breaks within "recognition" sequences that in many, but not all. cases exhibit twofold symmetry around a given point. Such enzymes typically create double-stranded breaks.
Many of these enzymes make a staggered cleavage, yielding DNA fragments with protruding single-stranded 5' or 3' termini. Such ends are said to be "sticky" or "cohesive" because they will hydrogen bond to complementary 3' or 5' ends. As a result, the end of any DNA fragment produced by an enzyme, such as EcoRl, can anneal with any other fragment produced by that enzyme. This properly allows splicing of foreign genes into plasmids, for example. Some restriction endonucleases that may be particularly useful with the current invention include Hindlll, Pstl, EcoRl, and BamHl.
Some endonucleases create fragments that have blunt ends, that is, that lack any protruding single strands. An alternative way to create blunt ends is to use a restriction enzyme that leaves overhangs, but to fill in the overhangs with a polymerase, such as klenow, thereby resulting in blunt ends. When DNA has been cleaved with restriction enzymes that cut across both strands at the same position, blunt end ligation can be used to join the fragments directly together. The advantage of this technique is that any pair of ends may be joined together, irrespective of sequence.
Those nucleases that preferentially break off terminal nucleotides are referred to as exonucleases. For example, small deletions can be produced in any DNA molecule by treatment with an exonuclease which starts from each 3' end of the DNA and chews away single strands in a 3' to 5' direction, creating a population of DNA molecules with single-stranded fragments at each end, some containing terminal nucleotides. Similarly, exonucleases that digest DNA from the 5' end or enzymes that remove nucleotides from both strands have often been used. Some exonucleases which may be particularly useful in the present invention include Bal31, 51, and ExoIII. These nucleolytic reactions can be controlled by varying the time of incubation, the temperature, and the enzyme concentration needed to make deletions. Phosphatases and kinases also may be used to control which fragments have ends which can be joined. Examples of useful phosphatases include shrimp alkaline phosphatase and calf intestinal alkaline phosphatase. An example of a useful kinase is T4 polynucleotide kinase.
Once the source DNA sequences and vector sequences have been cleaved and modified to generate appropriate ends they are incubated together with enzymes capable of mediating the ligation of the two DNA molecules. Particularly useful enzymes for this purpose include T4 ligase, E. coli ligase, or other similar enzymes. The action of these enzymes results in the sealing of the linear DNA to produce a larger DNA molecule containing the desired fragment (see, for example, U.S. Pat. Nos. 4,237,224; 4,264,731; 4,273,875; 4,322,499 and 4,336,336, which are specifically incorporated herein by reference).
It is to be understood that the termini of the linearized plasmid and the termini of the DNA fragment being inserted must be complementary or blunt in order for the ligation reaction to be successful. Suitable complementarity can be achieved by choosing appropriate restriction endonucleases (i.e., if the fragment is produced by the same restriction endonuclease or one that generates the same overhang as that used to linearize the plasmid, then the termini of both molecules will be complementary). As discussed previously, in a preferred embodiment, at least two classes of the vectors used in the present invention are adapted to receive the foreign oligonucleotide fragments in only one orientation. After joining the DNA segment to the vector. the resulting hybrid DNA can then be selected from among the large population of clones or libraries.
A method useful for the molecular cloning of DNA sequences includes in vitro joining of DNA segments, fragmented from a source high molecular weight genomic DNA, to vector DNA molecules capable of independent replication. The cloning vector may include plasmid DNA
(see Cohen et al, Proc. Natl. Acad. Sci. USA, 70:3240 (1973)), phage DNA (see Thomas et al, Proc. Natl. Acad. Sci. USA, 71 :4579 (1974)), SV40 DNA (see Nussbaum et al, Proc. Natl. Acad. Sci. USA, 73:1068 (1976)), yeast DNA, E. coli DNA and most significantly, plant DNA. A variety of processes are known which may be utilized to effect transformation; i.e., the inserting of a heterologous DNA sequences into a host cell, whereby the host becomes capable of efficient expression of the inserted sequences.
Transformed Host Cells and Transgenic Plants
Methods and compositions for transforming a bacterium, a yeast cell, a plant cell, or an entire plant with one or more plant artificial chromosome vectors are further aspects of the present invention.
Means for transforming bacteria and yeast cells are well known in the art. Typically, means of transformation are similar to those well known means used to transform other bacteria or yeast such as E. coli or Saccharomyces cerevisiae. Methods for DNA transformation of plant cells include Agrobacterium-mediated plant transformation, protoplast transformation, gene transfer into pollen, injection into reproductive organs, injection into immature embryos and particle bombardment. There are various advantages and disadvantages associated with each of these methods.
Methods for transforming plant cells include any method by which DNA can be introduced into a cell, such as by Agrobαcterium infection, direct delivery of DNA such as, by PEG-mediated transformation of protoplasts, by desiccation/inhibition-mediated DNA uptake, by electroporation, by agitation with silicon carbide fibers, by acceleration of DNA coated particles, etc.
Many methods for delivering genes into cells are known and well described. These methods include: (1) chemical methods (Graham et al., Virology, 54(2):536-539 (1973);
Zatloukal et al. Ann. N Y. Acαd. Sci, 660:136-153 (1992)); (2) physical methods such as micro injection (Capecchi, Cell 22(2):479-488 (1980)), electroporation (Wong et al., Biochim. Biophys. Res. Commun., 107(2):584-587 (1982); Fromm et al, Proc. Nαtl Acαd. Sci. USA, 82(17):5824-5828 (1985); U.S. Patent 5,384,253) and microprojectile bombardment (i.e. the gene gun) (Johnston et al., Methods Cell. Biol, 43(A):353-365 (1994); Fynan et al., Proc. Natl. Acad. Sci. USA, 90(24): 1 1478-1 1482 (1993); (3) viral vectors (Clapp, Clin. Perionatol., 20( 1 ): 155- 168 (1993); Lu et al., J. Exp. Med, 178(6):2089-2096 (1993)); Eglitis et al. Biotechniques, 6(7):608-614 (1988); Eglitis et al. Avd. Exp. Med. Biol, 241 :19-27 (1988); and (4) receptor-mediated mechanisms (Curiel et al., Proc. Natl. Acad. Sci. USA, 88(19):8850-8854
(1991); Curiel et al., Hum. Gen. Ther., 3(2): 147-154 (1992); Wagner et al., Proc. Natl. Acad. Sci. USA 89(13):6099-6103 (1992)).
Agrobacterium-mediated transformation is a widely applicable system for introducing genes into plant cells because the DNA can be introduced into whole plant tissues. The use of
Agrobαcterium-mediated plant integrating vectors to introduce DNA into plant cells is well known in the art. Using conventional transformation vectors, chromosomal integration is required for stable inheritance of the foreign DNA. However, the artificial plant chromosome vector described herein may be used for transformation with or without integration, as the centromere function required for stable inheritance is encoded within the plant artificial chromosome. In particular embodiments, transformation events in which the plant artificial chromosome is not chromosomally integrated may be preferred, in that problems with site- specific variations in expression and insertional mutagenesis may be avoided.
The integration of the Ti-DNA is a relatively precise process resulting in few rearrangements. The region of DNA to be transferred is defined by the border sequences, and intervening DNA is usually inserted into the plant genome as described (Spielmann et αl, Mol. Gen. Genet., 205:34 (1986); Jorgensen et αl, Mol. Gen. Genet., (1987)). Modern Agrobαcterium transformation vectors are capable of replication in E. coli as well as Agrobαcterium, allowing for convenient manipulations as described (Klee et αl., Bio/Technology, 3:637-642(1985)).
Moreover, recent technological advances in vectors for Agrobαcterium-mediated gene transfer have improved the arrangement of genes and restriction sites in the vectors to facilitate construction of vectors capable of expressing various polypeptide coding genes. The vectors described (Rogers et αl., Meth. In Enzymol., 153:253-277 (1987)), have convenient multi-linker regions flanked by a promoter and a polyadenylation site for direct expression of inserted polypeptide coding genes and are suitable for present purposes. In addition, Agrobacterium containing both armed and disarmed Ti genes can be used for the transformations. In those plant strains where Agrobacterium-mediated transformation is efficient, it is the method of choice because of the facile and defined nature of the gene transfer.
Agrobαcterium-mediated transformation of leaf disks and other tissues such as cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects. Agrobαcterium-mediated transformation is most efficient in dicotyledonous plants. Few monocots appear to be natural hosts for Agrobαcterium, although transgenic plants have been produced in asparagus and more significantly in maize using Agrobαcterium vectors as described (Bytebier et αl., Proc. Nαtl. Acαd. Sci. USA, 84:5345 (1987)); U.S. Patent No. 5,591,616, specifically incoφorated herein by reference). Therefore, commercially important cereal grains such as rice, corn, and wheat must usually be transformed using alternative methods. However, as mentioned above, the transformation of asparagus using Agrobαcterium also can be achieved
(see, for example, Bytebier et αl, Proc. Nαtl. Acαd. Sci. USA, 84:5345 (1987)).
Other Transformation Methods
Transformation of plant protoplasts can be achieved using methods based on calcium phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of these treatments (see, for example, Potrykus et αl., Mol. Gen. Genet. 199: 183 (1985); Lorz et αl., Mol. Gen. Genet., 199: 178 (1985); Fromm et αl. , Nature, 312:791-793 (1986); Uchimiya et al. , Mol. Gen. Genet., 204:204 (1986); Callis et al, Genes and Development, 1 : 1 183 (1987); Marcotte et al, Nature, 335:454 (1988)).
Application of these systems to different plant strains for the puφose of making transgenic plants depends upon the ability to regenerate that particular plant strain from protoplasts. Illustrative methods for the regeneration of cereals from protoplasts are described (Fujimura et al. Plant Tissue Culture Letters, 2:74 (1985); Toriyama et al. Theor. Appl. Genet.. 73: 16 (1986); Yamada et al, Plant Cell Rep., 4:85 (1986); Abdullah et al , Biotechnology. 4:1087 (1986)).
To transform plant strains that cannot be successfully regenerated from protoplasts, other ways to introduce DNA into intact cells or tissues can be utilized. For example, regeneration of cereals from immature embryos or explants can be effected as described (Vasil., Biotechnology 6:397 (1988)). In addition, "particle gun" or high- velocity microprojectile technology can be utilized (Vasil et al., Biotechnology, 10:667-674 (1992)).
Using that latter technology, DNA is carried through the cell wall and into the cytoplasm on the surface of small metal particles as described (Klein et al. Nature, 327:70-73 (1987); Klein et al. Proc. Natl. Acad. Sci. USA, 85:8502-8505 (1988); McCabe et al. Biotechnology, 6:923 (1988)). The metal particles penetrate through several layers of cells and thus allow the transformation of cells within tissue explants.
By way of example, and not of limitation, Examples of the present invention will now be given.
EXAMPLE 1: CLONING OF CENTROMERIC DNA FROM ORZYA SATIVA
1. Materials and Methods. a. A rice BAC library was constructed from an indica rice (Oryza sativa ssp. Indica) line IR-BB21 and consisted of 1 1,000 clones (see Wang, G.-L., et al., Plant J. 7: 525-533 (1995), herein incoφorated by reference). The cereal centromeric DNA element pSau3A9 (Jiang, J., et al., Proc. Natl,. Acad. Sci. USA 93: 14210-14213 (1996), herein incoφorated by reference) was used to isolate the rice centromere-specific BAC clones. The DNA sequence of pSau3A9 is shown in Figure 1 and SEQ ID NO:8. Rice lines used in this example include a javanica rice (O. sativa ssp. Javanica) line DV85, a japonica rice (O. sativa ssp. Japonica ) line Norin 28, and indica rice line IR72, and four other Oryza species (O. glaberrima, O. rufipogon, O. officinalis, and O. aha). Gramineae species used in conservation studies include two species from the Bambusoideae subfamily [bamboo (Bambusa vulgaris), Pharus sp.], three species from the Panicoideae subfamily [sorghum, maize (Zea mays), and sugar cane (Saccharum officinarum)], six species form the Pooideae subfamily [Agropyron intermedium, barley (Hordeum vulgare), oats (Avena sativa), rye (Secale cereale), wheat (Triticum aestivum), and Aegilops squarrosa}. Three non-Gramineae species, Juncus effusus, Cyperus alternifolius, and A. thaliana, and rye and maize lines containing B chromosomes also were included.
b. BAC Library Screening. BAC filter preparations and BAC library screening were conducted as described in Wang, G.-L., et al., Plant J. 7:525-533 (1995); Hoheisel, J. D., et al., Cell 73:109-120 (1993), herein incoφorated by reference. BAC clones were isolated by using pSau3A9 as a probe, and their cytological locations were confirmed by fluorescence in situ hybridization (hereinafter referred to as "FISH").
c. Subcloning and Sequencing. DNA fragments recovered from agarose gels were subcloned into pUC18 plasmids as described in Jiang, J., et al., Proc. Natl,. Acad. Sci. USA 93: 14210-14213 (1996), herein incoφorated by reference. Cycle sequencing reactions were performed by using Applied Biosytems AmpliTaq DNA polymerase, FS Dye Terminator Ready Reactions kit, and a Perkin-Elmer Thermocycler (model 2400). Reaction products were analyzed on an Applied Biosystems DNA sequencer (model 373).
d. Southern Blot Hybridization. Plant genomic DNA was isolated as described in Gill, K. S.. et al., Genome 34:362-374 (1991), herein incoφorated by reference. BAC DNA was prepared by using an alkaline lysis method described in Sambrook, J., Fritsch, E. F. & Maniatis,
T. (1989) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, Plainview. NY), 2nd Ed., pp. 1.25-1.26, herein incoφorated by reference, and purified by CsCl ultracentrifugation. Gel transfers, prehybridizations, hybridizations, and posthybridization washing were all as previously described Jiang, J.. et al.. Proc. Natl,. Acad. Sci. USA 93: 14210- 14213 (1996).
e. Slot Blot Hybridization. Copy number of each subclone in rice genome was determined by slot blot hybridization as described Zhao, X., et al., Theor. Appl Genet. 78:201-
209 (1989), herein incoφorated by reference. Band intensities were measured on the autoradiographs by IPLab Spectrum v3.1 software.
f. FISH. Detailed protocols for FISH and Fiber-FISH are described in Jiang, J., et al.. Proc. Natl,. Acad. Sci. USA 93: 14210-14213 (1996) and Fransz, P. F., et al.. Plant J. 9:421 -430
(1996). herein incoφorated by reference. The formamide in the hybridization mixture was 50% and 30%) in regular and low stringency hybridizations, respectively. Washing was conducted at either low [2 x saline sodium citrate (SSC) at 42 °C for 15 minutes], medium (50%> formamide at 45 °C for 15 minutes) or high stringency (70% formamide at 50 °C for 15 minutes).
2. A rice BAC library constructed from indica rice (Oryza sativa ssp. Indica) and described by Wang, G.-L., et al.. Plant J. 7: 525-533 (1995), was screened by using pSau3A9 as a probe. Twenty-two clones showed unambiguous positive hybridizations. Ten of the 22 clones were analyzed cytologically by FISH. Eight clones hybridized to the centromeric or/and paracentromeric regions of all rice chromosomes. Clone 17p22 showed bright and shaφ signals specific to the centromeric regions. At a low hybridization stringency, this clone also hybridized exclusively to the centromeric regions of chromosomes from sorghum, maize, wheat, barley, oats and rye.
DNA from clone 17p22 was digested with restriction enzymes, BamHl, Dral, EcoRl,
Haelll. Hindlll, Mspl, Pstl, Sau3Al and Sail and blotted onto nylon membrane. Small DNA fragments ranging from 0.5 to 3 kb were subcloned, and their distinctiveness was confirmed by Southern hybridization using blots containing 17p22 DNA digested with the above described nine restriction enzymes. Seven different DNA families, including two Sau3Al fragments (subclones pRCSl and pRCS2), three Hindlll fragments (subclones pRCHl, pRCH2 and pRCH3), and two £cøRI fragment (subclones pRCEl and pRCE2), were identified (see below in Table 2). These seven families hybridized to all of the fragments generated by the nine enzymes. FISH and Southern hybridization analysis indicated that all seven elements are repetitive in the rice genome (see below).
Table 2 Summary of the Seven Rice Centromeric Repetitive DNA Families
Figure imgf000029_0001
* Based on the haploid genome of rice as 424 Mb (24).
#The copy number of the 168-bp monomer in the rice genome.
Clone pRCSl contains a 877-bp Sau3Al fragment that hybridizes to the pSau3A9 sequence. Sequencing analysis revealed that the 259 bp at the 3' end of pRCSl had 80% sequence identity to the central part (bases 338-602) of the pSau3A9 sequence (see Jiang, J., et al., Proc. Natl,. Acad. Sci. USA 93: 14210-14213 (1996), FIG. 1 and SEQ ID NO:8). The first 95 bp in pRCSl had 76% sequence identity to a Ty3/ 'gypsy class of retrotransposon sequence reported in maize (GenBank Accession No. AF030633). Nucleotides 171-228 of pRCSl had 70% sequence identity to a Ty3/gypsy class of retrotransposon sequence reported in Lilium henryi (XI 3886). It was also discovered that the pSau3A9 sequence in sorghum has similar sequence identities to the Ty3l gypsy class of retrotransposons. These results indicated that both pSau3A9 and pRCSl probably were derived from retrotransposon-related DNA sequences.
The RCSl sequence was located in the centromeric regions of all 24 rice chromosomes by FISH (see FIG. 2A). The sizes and intensities of the FISH signals were uniform on different chromosomes, suggesting that all rice chromosomes contain a similar number of copies of this element. Slot blot analysis suggested that ther are about 130 copies of RCSl present in the haploid genome of japonica rice DV85 (see Table 2).
Rice genomic DNA was digested with several restriction enzymes and probed with the 259-bp fragment conserved between rice and sorghum. One or few major bands and several minor bands were detected in most of the lanes (see FIG. 3). Fiber-FISH using pRCSl as a probe did not generate clustered signals. These results suggested that the RCS 1 sequence is dispersed in the centromeric regions of rice chromosomes.
FISH analysis revealed that pRCSl also hybridized exclusively to centromeric regions of chromosomes from other Gramineae species (see FIG. 2B-E . The FISH results on rye (see FIG. 25) and barley (see FIG. 2C) chromosomes showed that hybridization was exclusive to the primary constrictions. FISH signals also were detected in the centromeres of the supernumerary
B chromosomes from both rye and maize (see FIG. 2 B and E). Positive Southern hybridization signals were detected in all other Gramineae species analyzed, including bamboo, Pharus sp., oats, wheat, sugar cane, Ae. squarrosa, and Ag. intermedium. However, homologous sequences could not be detected by Southern hybridization analysis in dicot species and any monocot species outside of Gramineae, indicating that the RCSl family is sufficiently conserved only in the grass family Gramineae.
Clone pRCS2 contains a 639-bp Sau3Al fragment consisting of four copies of a tandemly arranged repeat with a consensus sequence of 168bp (see FIG. 4). The four copies were 84-91% identical with one another. The third copy of the repeat contains a 6-bp insertion (TTGGCC) at base 147. A search of the GenBank database found a highly significant match to a repetitive DNA element isolated from O. Sativa (GenBank Accession No. U63977).
Southern hybridization analysis of rice genomic DNA using probe pRCS2 revealed ladder patterns using several restriction enzymes, including Dpnll, Sau3Al. Mspl, Hpall, and Haelll. indicating that the RCS2 family is tandemly arranged in the rice genome (see FIG. 5). Several enzymes produced digestion profiles comprised of monomer and multiples (dimer, trimer, tetramer, etc.) of the 168-bp basic repeat.
Probe pRCS2 hybridized only to the centromeric regions on all rice chromosomes (see FIG. 2K). Significant variation in the size and intensity of the FISH signals was detected in different centromeres. Two pairs of chromosomes had strong signals, and a third pair had very faint signals (see FIG. 2K). All of the signals became weaker as the posthybridization washing stringency was increased (see FIG. 2L). However, even after washing in 70% formamide at
50 °C for 15 minutes, most signals were still discernible (see FIG. 2M), suggesting that the signal disparity reflects difference in copy numbers rather than sequence divergence of the RCS2 family in different rice centromeres. Though the longest chromosome (chromosome 1) had the strongest signals, it was not possible to relate the copy numbers to the chromosome sizes. It was evident that the weakest signals were not on the smallest chromosomes (see FIG. 2K).
Three subspecies of O. sativa (AA genome), together with O. glaberrima (AA), O. rufipogon (AA), O. aha (CCDD), and O. ofβcinalis (CC) were included for FISH analysis. FISH signals were detected in the centromeric regions from all of the chromosomes of these species. Southern hybridization analysis revealed that the RCS2 family is present only in the species within genus Oryza. Homologous sequences could not be detected even at a low stringency in any plant species outside of genus Orya. RCS2 is the most abundant element isolated from BAC 17p22 and has about 1.550 copies, corresponding to 6,200 monomers, in the haploid genome of DV85 (see Table 2 above). BAC 17p22 contains about 46 copies of this element, corresponding to approximately 39% of the BAC insert. Fiber-FISH analysis demonstrated that the RCS2 family is organized into various sizes of uninterrupted arrays in the rice genome. The longest observed block with small interspersed gaps (<2 μm) was 51 μm (see FIG. 2N). Based on a 2.96-kb/ m resolution of the Fiber-FISH technique, this block represents approximately 151 kb of uninterrupted RCS2 sequences. The longest observed single Fiber-FISH signal with interspersed gaps larger than 2 μm was 188 μm representing approximately 556 kb of centromeric DNA sequences.
The other five centromeric DNA elements isolated from rice BAC clone 17p22 were analyzed by FISH, and all of them hybridized exclusively to the centromeric regions of all rice chromosomes (see FIG. 2F-J). One or two pairs of rice metaphase chromosomes showed weak hybridization when pRCH2, pRCH3, and pRCE2 were used as probed. No relationship can be confirmed between signal intensities and the sizes of the chromosomes.
The sequence information for these families is listed in Table 2, above. Searching in the GenBank database did not uncover any significant matches to these sequences except for pRCH2. Bases 39-102 and 204-232 in pRCH2 had sequence identities to the centromeric CCSl sequence isolated from B. sylvaticum (see Aragon- Alcaide, L., et al., Chromosoma 105, 261-268
(1996); Abbo, S., et al., Chromosome Res. 3:5-15 (1995)). Interestingly, about 120 bp (bases 8- 130) of this element had 80% sequence identity to the spacer sequence that separates the rice 5S rRNA genes. The possibility that this element associates with the 5S rDNA locus was excluded because the FISH signals from pRCH2 was located proximal to those from the 5S rDNA locus.
In Southern hybridization analysis, all five elements produced one or few major bands and several minor bands under several restriction enzymes, similar to the RCSl family (see FIG. 3), suggesting that they all are dispersed in the rice centromeric regions. The copy numbers of these elements ranged from 53 to 305 copies per haploid rice genome (see Table 2, above). All five elements were hybridized to various plant species by Southern hybridization. The RCEl family was present only in the species from the Bambusoideae subfamily, including rice, bamboo, and Pharus sp. (See FIG. 6B). whereas RCHl. RCH2, RCH3, and RCE2 all were conserved across the Gramineae species (see FIG. 6A for RCHl). Species from subfamily Panicoideae and Bambusoideae generally had stronger hybridization signals than those from subfamily Pooideae (see FIG. 6A).
The cytosine nucleotides, especially those in dinucleotide sequence 5'CpG3', are the most common sites for methylation in plant genomes. Methylation occurs at lower frequencies when the C and G are separated by 1-2 A/T nucleotides (see Gruenbaum, T., et al. Nature (London)
292: 860-862 (1981)). Enzymes Mspl and Hpall ar isoschizomers that recognize the 5'CCGG3' sequence. Neither enzyme can cut when the 5'C is methylated, and only Mspl can cleave when the internal cytosine is methylated. Though both enzymes produced similar digestion profiles of rice genomic DNA, Mspl generated much smaller-sized hybridization bands from all of the rice centromeric DNA probes than Hpall did (see FIG. 3 for RCS 1 and FIG. 5 for RCS2). For the
RCS2 element, monomers of the 168-bp basic repeat could be found in Mspl lane, and most of the hybridization was in the fragments smaller than 2 kb. whereas the majority of hybridization in the Hpall lane was larger than 2 kb (see FIG. 5). For the other centromeric elements, DNA fragments smaller than 5 kb were not detected in Hpall lanes (see FIG. 3 for RCSl). These results suggest that the cytosine of the CpG dinucleotides are heavily methylated in the rice centromeric DNA sequences. Restriction enzyme Sail recognizes 5OTCGAC3' and is sentsitive to the methylation of CpG dinucleotides. Small fragments (<10 kb) that hybridized to the centromeric elements were not detected in the Sail lanes (see FIGS. 3 and 5).

Claims

What is Claimed Is:
1. An isolated and purified nucleic acid comprising a nucleotide sequence of SEQ ID NO:7.
2. A recombinant DNA construct comprising a centromere, wherein said centromere comprises a number of highly repetitive regions of DNA having a nucleotide sequence of SEQ ID NO:7.
3. The recombinant DNA construct of claim 2, further comprising a yeast autonomous replication sequence.
4. The recombinant DNA construct of claim 2, further comprising an autonomous replication sequence from a higher eukaryotic organism.
5. The recombinant DNA construct of claim 2, further comprising a yeast telomere.
6. The recombinant DNA construct of claim 2, further comprising a telomere from a higher eukaryotic organism.
7. The recombinant DNA construct of claim 2, further comprising a selectable marker gene.
8. A plasmid comprising the DNA construct of claim 2.
9. The plasmid of claim 8, wherein said plasmid further comprises an origin of replication and a selectable marker that functions in bacteria.
10. The plasmid of claim 9, wherein said bacteria is E. coli.
11. The plasmid of claim 8, wherein said plasmid further comprises an origin of replication and a selectable marker that functions in yeast.
12. The plasmid of claim 8, wherein said yeast is S. cerevisiae.
13. A plant artificial chromosome vector comprising an autonomous replication sequence, two telomere sequences, a centromere sequence having the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID
NO: 5, SEQ ID NO: 6 and SEQ ID NO: 7 or combinations thereof, and at least one selectable marker sequence.
14. The plant artificial chromosome vector of claim 13, wherein the autonomous replication sequence if from yeast.
15. The plant artificial chromosome vector of claim 13, wherein the autonomous replication sequence is from a higher eukaryotic organism.
16. The plant artificial chromosome vector of claim 13, wherein the telomere sequences are from a higher eukaryotic organism.
17. The plant artificial chromosome of claim 16, wherein the telomere sequences are from Arabieopsis thaliana.
18. The plant artificial chromosome vector of claim 13, wherein the telomere sequences are from yeast.
19. A plant cell transformed with the plant artificial chromosome vector of claim 13.
20. The transformed plant cell of claim 19, wherein the plant cell is from Oryza sativa.
21. A transgenic plant comprising the transformed plant cell of claim 19.
22. A method of identifying centromeric DNA in a higher eukaryotic organism, the method comprising the steps of: hybridizing an isolated nucleic acid selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 and combinations thereof, with a sample of DNA from a higher eukaryotic organism; and identifying and isolating centromeric DNA from said sample.
PCT/US2000/017535 1999-06-30 2000-06-26 Dna sequences specific to rice centromeres WO2001000858A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU64888/00A AU6488800A (en) 1999-06-30 2000-06-26 Dna sequences specific to rice centromeres

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US34377599A 1999-06-30 1999-06-30
US09/343,775 1999-06-30

Publications (1)

Publication Number Publication Date
WO2001000858A1 true WO2001000858A1 (en) 2001-01-04

Family

ID=23347616

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/017535 WO2001000858A1 (en) 1999-06-30 2000-06-26 Dna sequences specific to rice centromeres

Country Status (2)

Country Link
AU (1) AU6488800A (en)
WO (1) WO2001000858A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1644510A2 (en) * 2003-06-27 2006-04-12 Chromatin, Inc. Plant centromere compositions
EP2295586A3 (en) * 2003-06-27 2011-06-29 Chromatin, Inc. Plant centromere compositions
CN103525859A (en) * 2013-10-09 2014-01-22 中国农业科学院作物科学研究所 Construction method of artificial rice minichromosome and applications
US9096909B2 (en) 2009-07-23 2015-08-04 Chromatin, Inc. Sorghum centromere sequences and minichromosomes
CN114350700A (en) * 2021-10-19 2022-04-15 深圳大学 Saccharomyces cerevisiae vector and construction method and application thereof

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1989009219A1 (en) * 1988-03-24 1989-10-05 The General Hospital Corporation Artificial chromosome vector
WO1998055637A1 (en) * 1997-06-03 1998-12-10 Arch Development Corporation Plant artificial chromosome (plac) compositions and methods

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1989009219A1 (en) * 1988-03-24 1989-10-05 The General Hospital Corporation Artificial chromosome vector
WO1998055637A1 (en) * 1997-06-03 1998-12-10 Arch Development Corporation Plant artificial chromosome (plac) compositions and methods

Non-Patent Citations (12)

* Cited by examiner, † Cited by third party
Title
ARAGON-ALCAIDE LUIS ET AL: "A cereal centromeric sequence.", CHROMOSOMA (BERLIN), vol. 105, no. 5, 1996, pages 261 - 268, XP000965076, ISSN: 0009-5915 *
DATABASE EMBL SEQUENCE DATABASE 20 July 1998 (1998-07-20), DONG, F., ET AL.: "Rice (Oryza sativa) centromeric regions consist of complex DNA.", XP002153049 *
DATABASE EMBL SEQUENCE DATABASE 20 July 1998 (1998-07-20), DONG, F., ET AL.: "Rice (Oryza sativa) centromeric regions consist of complex DNA.", XP002153050 *
DATABASE EMBL SEQUENCE DATABASE 20 July 1998 (1998-07-20), DONG, F., ET AL.: "Rice (Oryza sativa) centromeric regions consist of complex DNA.", XP002153051 *
DATABASE EMBL SEQUENCE DATABASE 20 July 1998 (1998-07-20), DONG, F., ET AL.: "Rice (Oryza sativa) centromeric regions consist of complex DNA.", XP002153052 *
DATABASE EMBL SEQUENCE DATABASE 20 July 1998 (1998-07-20), DONG, F., ET AL.: "Rice (Oryza sativa) centromeric regions consist of complex DNA.", XP002153053 *
DATABASE EMBL SEQUENCE DATABASE 3 December 1998 (1998-12-03), WING, R.A. AND DEAN, R.A.: "a BAC end sequencing framework to sequence the rice genome", XP002153047 *
DATABASE EMBL SEQUENCE DATABASE 31 January 2000 (2000-01-31), SASAKI, T. , ET AL.: "Oryza sativa nipponbare (GA3) genomic DNA, chromosome 1, PAC clone: PO499C11", XP002153054 *
DATABASE EMBL SEQUENCE DATABASE 6 January 1999 (1999-01-06), DONG, F., ET AL.: "Rice (Oryza sativa) centromeric regions consist of complex DNA", XP002153048 *
DONG FENGGAO ET AL: "Rice (Oryza sativa) centromeric regions consist of complex DNA.", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES, vol. 95, no. 14, 7 July 1998 (1998-07-07), July 7, 1998, pages 8135 - 8140, XP002153043, ISSN: 0027-8424 *
JIANG JIMING ET AL: "A conserved repetitive DNA element located in the centromeres of cereal chromosomes.", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES, vol. 93, no. 24, 1996, 1996, pages 14210 - 14213, XP002153045, ISSN: 0027-8424 *
NONOMURA K -I ET AL: "Organization of the 1.9-kb repeat unit REC1 in the centromeric region of rice chromosomes.", MOLECULAR AND GENERAL GENETICS, vol. 261, no. 1, February 1999 (1999-02-01), pages 1 - 10, XP002153044, ISSN: 0026-8925 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1644510A2 (en) * 2003-06-27 2006-04-12 Chromatin, Inc. Plant centromere compositions
EP1644510A4 (en) * 2003-06-27 2007-10-24 Chromatin Inc Plant centromere compositions
EP2295586A3 (en) * 2003-06-27 2011-06-29 Chromatin, Inc. Plant centromere compositions
EP2357240A1 (en) * 2003-06-27 2011-08-17 Chromatin, Inc. Plant centromere compositions
US9096909B2 (en) 2009-07-23 2015-08-04 Chromatin, Inc. Sorghum centromere sequences and minichromosomes
CN103525859A (en) * 2013-10-09 2014-01-22 中国农业科学院作物科学研究所 Construction method of artificial rice minichromosome and applications
CN114350700A (en) * 2021-10-19 2022-04-15 深圳大学 Saccharomyces cerevisiae vector and construction method and application thereof

Also Published As

Publication number Publication date
AU6488800A (en) 2001-01-31

Similar Documents

Publication Publication Date Title
KR102607893B1 (en) Methods and compositions for increasing yield of short stature plants through manipulation of gibberellin metabolism
US5270201A (en) Artificial chromosome vector
Lucas et al. RNA‐mediated transposition of the tobacco retrotransposon Tnt1 in Arabidopsis thaliana.
CA2087610C (en) Biologically safe plant transformation system
US7816506B2 (en) Methods and constructs for producing transgenic plants and methods of recovering expressed proteins
CN107267527B (en) Method for maintaining male fertility and application thereof
EP2018435B1 (en) Artificial plant minichromosomes
EP0338266A2 (en) Artificial chromosome vector
US5073675A (en) Method of introducing spectinomycin resistance into plants
EP0513054B1 (en) Virus/herbicide resistance genes process for producing same and their use
CA3188408A1 (en) Inir12 transgenic maize
US5217902A (en) Method of introducing spectinomycin resistance into plants
WO2001000858A1 (en) Dna sequences specific to rice centromeres
WO1997005260A2 (en) An expression control sequence for general and effective expression of genes in plants
CN102131932A (en) Artificial plant minichromosomes
EP1179003A1 (en) Methods for increasing plant cell proliferation by functionally inhibiting a plant cyclin inhibitor gene
CN112080513A (en) Rice artificial genome editing system with expanded editing range and application thereof
Sam et al. DESIGN AND TRANSFER OF OsSWEET14-EDITING T-DNA CONSTRUCT TO BAC THOM 7 RICE CULTIVAR.
US20030196215A1 (en) Novel class of proteins and uses thereof for plant resistance to various pathogenic agents
US20210054367A1 (en) Methods and compositions for targeted editing of polynucleotides
AU622900C (en) Artificial chromosome vector
Hille et al. Genetic transformation of tomato and prospects for gene transfer
Nuc et al. Nucleotide sequence of nuclear tRNAGly genes and tRNAGly pseudogenes from yellow lupin (Lupinus luteus): Expression ofthe tRNAGly genes in vitro and in vivo
AU749274B2 (en) Methods for obtaining plant varieties
AU705682C (en) An expression control sequence for general and effective expression of genes in plants

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP