WO2007076727A1 - A rice gene, gs3, exerting primary control over grain length and grain weight - Google Patents

A rice gene, gs3, exerting primary control over grain length and grain weight Download PDF

Info

Publication number
WO2007076727A1
WO2007076727A1 PCT/CN2007/000002 CN2007000002W WO2007076727A1 WO 2007076727 A1 WO2007076727 A1 WO 2007076727A1 CN 2007000002 W CN2007000002 W CN 2007000002W WO 2007076727 A1 WO2007076727 A1 WO 2007076727A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
grain
polynucleotide
polypeptide
acid sequence
Prior art date
Application number
PCT/CN2007/000002
Other languages
French (fr)
Other versions
WO2007076727A8 (en
Inventor
Qifa Zhang
Chuchuan Fan
Yongzhong Xing
Original Assignee
Huazhong Agricultural University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong Agricultural University filed Critical Huazhong Agricultural University
Priority to EP07701934.7A priority Critical patent/EP1969124B1/en
Priority to US12/159,964 priority patent/US20100017919A1/en
Publication of WO2007076727A1 publication Critical patent/WO2007076727A1/en
Priority to US14/468,261 priority patent/US20150082496A1/en
Publication of WO2007076727A8 publication Critical patent/WO2007076727A8/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8257Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8201Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8213Targeted insertion of genes into the plant genome by homologous recombination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146Genetically Modified [GMO] plants, e.g. transgenic plants

Definitions

  • the present invention relates to biotechnology of plants. Particularly, the present invention relates to the gene cloning of GS3, a major QTL regulating grain weight and grain length, which is located in the pericentrometric region of chromosome 3 in rice.
  • Grain size of rice is an important economic trait because: (1) grain size is a major determinant of grain weight, which is one of the three components of grain yield, and therefore, grain size is an important trait for yield; (2) grain size is also an important trait of rice appearance because grain weight is positively correlated with several characters including grain length, grain width and grain thickness (Evans, 1972, Rice Breeding, Los Banos, International Rice Research Institute, Manila, pp. 499-511). In China, the USA and some Asian countries, Indica rice with long and slender grain is generally preferred by most consumers. In China, a length/width ratio of 2.8 is adopted as an enforced threshold for a national standard for high quality rice. Thus, understanding the genetic basis and molecular mechanisms of grain size is important in improving both rice yield and quality.
  • grain size also plays an important role in the study of evolution of cultivars. It is generally believed that wild type relatives are usually small and round in shape, and are thus favored under natural selection. After a long-term domestication and selection by humans, the shape of grain particles has changed significantly. Therefore, the study ' of the genetic basis of grain size provides clues for the study of the evolution of cultivars.
  • Grain size is a typical quantitative trait and is complex in its genetic basis. Utilization of molecular marker technology is able to separate and locate QTL (Quantitative Trait Loci) controlling quantitative traits, thus separating the complex quantitative traits to simple Mendelian factors for studies.
  • QTL Quantitative Trait Loci
  • GS3 major QTL located in the pericentrometric region of rice chromosome 3 was detected by many researchers (Huang et al., 1997, MoI. Breed. 3:105-113; Redona and Mackill, 1998, Theor. Appl. Genet.
  • GS3 gene can be stably expressed in various genetic backgrounds and different environments. Therefore, GS3 gene is greatly potent and a good prospect for the improvement of both yield and quality traits of rice.
  • the high resolution mapping of GSS gene and the cloning of the corresponding gene provide new genetic resources for the improvement of both yield and quality in rice breeding.
  • the present inventors Upon aforesaid purposes, the present inventors have isolated and cloned a major gene GS3 regulating grain weight and grain length in rice by the approach of mapping and cloning, therefore providing new genetic resources for the improvement of both yield and quality in rice breeding, and also providing clues for the study on the evolution of cultivars.
  • the present invention relates to the isolation and cloning of the whole DNA fragment encoding a major gene regulating both grain weight and grain length in rice.
  • the present invention also relates to the improvements in both yield and quality of rice using said gene.
  • Said gene is named GS3.
  • the invention established a near isogenic line (NIL) ofGS3.
  • GS3 was finely mapped to a region of 7.9-kb in length using the approach of mapping and cloning. With the help of the sequence information of a whole cDNA in aforesaid region, the invention predicted and analyzed the gene structure of GS3 and the protein encoded by GS3. It was found that GS3 comprises 5 exons and encodes 232 amino acids.
  • said protein contains conserved domains including a PEBP-like domain, a transmembrane domain, a cysteine-rich domain of TNFR (tumour necrosis factor receptor)/NGFR (nerve growth factor receptor) and a VWFC (von Willebrand factor type C) homologous domain.
  • TNFR tumor necrosis factor receptor
  • NGFR nerve growth factor receptor
  • VWFC von Willebrand factor type C homologous domain.
  • the inventors sequenced and compared three large grain species (Minghui 63, a indica rice in China; H94, from Shanghai Agrobiological Gene Center; 93-11, a rice variety that has been completely sequenced) and 3 small grain species (Zhenshan97, a indica rice in China; Chun 7, from Shanghai Agrobiological Gene Center; Niponbare, a rice variety that has been completely sequenced).
  • FIG. 1 shows the technical flowchart of the present invention
  • Figure 2 shows the six cultivars used in sequence alignment
  • Figure 3 shows the frequency distribution of 1,000-grain weight, grain length, grain width and grain thickness in the random BC 3 F 2 subpopulation; wherein (a) frequency distribution of 1,000-grain weight;
  • Figure 4 shows the map of the GS3 locus on the molecular linkage map of chromosome 3 (unit in cM);
  • Figure 5 shows the maps of the gene region, wherein
  • Figure 6 shows the organization of the predicted GS3 protein indicating the localization of the various conserved domains including the PEBP-like domain, the transmembrane domains, the cysteine-rich domain of TNFR/NGFR and the VWFC homologous domain, wherein the PEBP-like domain is located inside the membrane, while the cysteine-rich domain of TNFR and the VWFC homologous domain are located outside the membrane.
  • a near isogenic line of rice GS3 gene was established using Minghui 63 as the recurrent parent and Chuan 7 as the donor parent. Mapping and effect evaluation of GS3 gene were carried out on a random BC 3 F 2 subpopulation consisting of 201 BC 3 F 2 individuals. By further analysis on 1,384 BC 3 F 2 individuals with large grain phenotype using CAPS markers, GS3 was finally mapped between the two CAPS markers GS63 and SF 19 (designed by the inventors; see Table 2), which was approximately 7.9kb in distance. Based on a whole length cDNA sequence, the GS3 gene structure was predicted. In addition, the possible function of GS3 gene was predicted based on the bioinformational technology.
  • Example 1 establishment of a near isogenic lines of rice GS 3 gene
  • BC 3 F 1 -19 One plant (BC 3 F 1 -19) was finally selected, whose genotype in the RM282 and RMl 6 region was Minghui63/Chuan7 heterozygous genotype, while whose genotype in said 125 SSR markers, only about 20% (25 pairs) of the markers were Minghui63/Chuan7 heterozygous genotype, and the rest thereof were all Mingui63 homozygous genotype.
  • the progeny (BC 3 F 2 and BC 3 F 3 ) of said individual plant were used in the following experiments.
  • Grain particles were air-dried and stored at room temperature for at least 3 months before testing in order to make sure the dryness and water contents thereof were relatively identical
  • Ten randomly chosen full filled grains from each plant were lined up closely in a way that each lay head to head, tail to tail, with no overlap and no gap in between Said grains were arranged length-wise to measure the grain length using a vernier caliper, and then were lined up closely side by side, that is, arranged by breadth to measure the grain width using a vernier caliper
  • Grain thickness was determined for each grain individually using a vernier caliper, and the values were averaged and used as the measurements for the plant Grain weight was calculated based on 200 randomly chosen fully filled grains and converted to 1,000-grain weight
  • Table 1 Descriptive statistics of the traits for the two parents and the long grains and short grains in the random subpopulation
  • Minghiu fil Cliuan 7 Mean fc SD Range Mean ⁇ SD Range Mean ⁇ SD Range
  • SSR markers used in the present invention are publicly known
  • 11 pairs of Indel (Insert/Deletion) and CAPS (cleaved amplified polymorphic sequence) markers which shows polymorphism between Minghui 63 and Chuan 7 were designed based on the genome DNA sequence of Japonica Rice, Nipponbare and India Rice, 93-11.
  • Said Indel and CAPS markers were used in the high resolution mapping analysis of GS3.
  • the DNA sequences of said markers are listed in Table 2.
  • Said random subpopulation was subjected to genotype analysis using 6 SSR markers (MRG5959, MRG0164, MRG5881, MRG2646, RM411, RMl 6) and 2 Indel markers (GS06 and GS09).
  • Mapmaker/Exp 3.0 (Lincoln et al.1992, Whitehead Institute Technical Report, Whitehead Institute, Cambridge, Massachusetts, USA) was used to establish a partial genetic linkage map in GS3 region.
  • QTL analysis on the traits including grain length, width, thickness and 1,000-grain weight of the random subpopulation was conducted using the program Mapmaker/QTL 1.1 at a threshold of LOD 3.0.
  • the three distinct phenotypic classes corresponded to the three genotypes of the BC 3 F 2 individuals at the GS3 locus: homozygote for the Minghui 63 allele (long grain), homozygote for the Chuan 7 alleles (short grain), and heterozygote.
  • GS3 was directly mapped into a 1-cM region delimitated by an Indel marker GS09 and SSR marker MRG5881 ( Figure 4).
  • CAPS analysis 9 CAPS markers used for the high resolution locating are listed in Table 2.
  • the amplification product amplified by said markers had a size of around 1-kb.
  • the PCR reaction system was identical with the SSR reaction system mentioned above.
  • the amplification condition of the PCR was as below: 94 0 C predenature 4min; 94 0 C lmin, 53°C ⁇ 57°C lmin, 72 0 C 1.5min, 34 cycles; 72 0 C elongation lOmin.
  • the digestion of the amplicons was carried out in a 20 ⁇ l reaction system containing: lO ⁇ l PCT product, 1 U restriction digestive enzyme (from Takara Ltd., Japan). Additional components were as described in the manual provided by Takara Ltd. After digestion in 37 0 C for 3-5 hours, lO ⁇ l of the digestion product was subjected to separation by electrophoresis in a 1.5% agrose gel, which was then observed using UV after EB staining.
  • 1,384 plants with long grain phenotype (long gram, 9.7 mm or longer) from the BC 3 F 2 population of 5,740 individuals derived from B C 3 F 1 - 19 individual plant were selected for recombinant screening.
  • GS64 and SF 18 were found to co-segregate with the GS3 locus. Therefore, the genomic region containing the GS3 locus was narrowed down to the DNA fragment bounded by GS 63 and SF 19, which corresponded to approximately 7.9-kb in length in the genome sequence ofNipponbare and 93-l l( Figure 5a).
  • a full-length cDNA which is named osigcea013f09t3, from the plumule of an indica cultivar Guangluai 4 (provided by Shanghai Agrobiological Gene Center) was identified in the 7.9-kb fragment (Fig.4b) between GS63 and SF 19 (which were designed by the applicants; refer to Table 2).
  • the nucleotide sequence is shown in SEQ ID NO: 1.
  • the cDNA sequence of the cloned GS3 gene of the present invention is 953bp hi length, which matches well with the region between positions 1.6 and 7.3 kb of the 7.9-kb fragment. Allowing for the regulatory regions on both ends, this is considered as the only candidate gene for GS 3,
  • GS3 gene was obtained by comparing the sequences of said total cDNA with the genomic DNA sequence of Nipponbare.
  • GS3 gene is 5,363bp in length from the translation start codon to the termination codon. It comprises 5 exons and 4 introns.
  • the starting exon is 117bp in length, while the second exon is 53bp, third exon 45bp, fourth exon 54bp, terminal exon 430bp in length, respectively.
  • the first intron is l,472bp in length, while the second intron is l,439bp, third intron 83bp, fourth intron l,671bp in length, respectively (Table 5b). Therefore, the open reading frame of GS3 gene is 699bp in length and encodes 232 amino acids.
  • the sequence of said gene is shown in SEQ ID NO: 1.
  • GS3 protein Prediction for the structure of GS3 protein was carried out with InterProScan. It was revealed that the protein encoded by GS3 gene consists of 232 amino acids and comprises several conserved domains. There is a PEBP (phosphatidylethanolamine- binding protein)-like domain in amino acid 12-65 at the 5' terminus. A transmembrane region is located at amino acid 97-117. The region of amino acid 116-1557 is a TNFR (tumor necrosis factor receptor)/NGFR (nerve growth factor receptor) family cysteine-rich domain.
  • PEBP phosphatidylethanolamine- binding protein
  • the 3' terminal cysteine-rich region shows the characters of the conserved amino acid sequence of the von Willebrand factor type C (VWFC) domain, which is typically 60 -80 amino acid in length and comprises ten cysteines, especially is characterized in that it contains a C2XXC3XC4 sequence located in the middle and a C8C9XXC10 sequence at the 3' terminal end (wherein C represents cysteine; the number represents the order of said conserved cysteines; X indicates any amino acid) ( Figure 5 c and Figure 6).
  • VWFC von Willebrand factor type C
  • VWFC domain is represented in a number of extracellular matrix proteins.
  • T7-R and SP6-F universal primers (Shanghai Sangon Biological Engineering Technology and Services Co. LTD.) and the Big Dye Terminator Cycle Sequencing v3.1 (Applied Biosystems, Foster City, CA, USA) were used for sequencing from both ends of the subclones. Sequence contigs were assembled using the computer program SEQUENCHER 4.1 (Gene Codes Corporation, USA).
  • the size of the PCR products depended on the sequence of Nipponbare and varied among different species.
  • Sequence alignment was carried out on three large grain varieties (Minghui 63, H94, 93-11) and three small grain varieties (Zhenshan97, Chun 7, Niponbare) ( Figure 2).
  • the GS3 region sequences of Nipponbare and 93-11 were from the BAC clone OSJNBa0030J19 and contig Ctg009226, respectively. Sequence alignment was conducted using the computer program Vector NTI 9 (InforMaxTM Corporation,
  • nucleotide mutation was located at the second exon of the GS3 gene, in which a cysteine codon (TGC) in the small-grain group was mutated to a termination codon (TGA) in the large-grain group (Fig.4b).

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Cell Biology (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Botany (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The present invention relates to an isolated major gene GS3 which regulates grain weight and grain length in the rice and the cloning of said gene. The DNA sequence of GS3 gene is as shown in SEQ ID NO. 1 and is 7883bp in length. GS3 gene comprises 5 exons and encodes 232 amino acids. It is predicted based on bioinformatics analysis that said protein contains conserved domains including a PEBP-like domain, a transmembrane domain, a cysteine-rich domain of TNFR/NGFR and a VWFC domain. cDNA sequence of said gene is as shown in SEQ ID NO. 2. By sequence alignment between three large grain species and 3 small grain species of rice, it is revealed there is only one common single nucleotide mutation in a 7.9-kb region between the two different grain-length groups. Said nucleotide mutation is located at the second exon of the GS3 gene, in which a cysteine codon (TGC) in the small-grain group is mutated to a termination codon (TGA) in the large-grain group. This mutation causes a premature termination in the large-grain group, which leads to a 178-amino acids truncation (including part of the PEBP-like domain and all the other three conserved domains). The present invention also provides methods of producing transgenic plants comprising sequences disclosed herein.

Description

A Rice Gene, GS3, Exerting Primary Control over Grain Length and
Grain Weight
Technical Field
The present invention relates to biotechnology of plants. Particularly, the present invention relates to the gene cloning of GS3, a major QTL regulating grain weight and grain length, which is located in the pericentrometric region of chromosome 3 in rice.
Background of Invention
Grain size of rice is an important economic trait because: (1) grain size is a major determinant of grain weight, which is one of the three components of grain yield, and therefore, grain size is an important trait for yield; (2) grain size is also an important trait of rice appearance because grain weight is positively correlated with several characters including grain length, grain width and grain thickness (Evans, 1972, Rice Breeding, Los Banos, International Rice Research Institute, Manila, pp. 499-511). In China, the USA and some Asian countries, Indica rice with long and slender grain is generally preferred by most consumers. In China, a length/width ratio of 2.8 is adopted as an enforced threshold for a national standard for high quality rice. Thus, understanding the genetic basis and molecular mechanisms of grain size is important in improving both rice yield and quality.
In addition, grain size also plays an important role in the study of evolution of cultivars. It is generally believed that wild type relatives are usually small and round in shape, and are thus favored under natural selection. After a long-term domestication and selection by humans, the shape of grain particles has changed significantly. Therefore, the study ' of the genetic basis of grain size provides clues for the study of the evolution of cultivars.
Grain size is a typical quantitative trait and is complex in its genetic basis. Utilization of molecular marker technology is able to separate and locate QTL (Quantitative Trait Loci) controlling quantitative traits, thus separating the complex quantitative traits to simple Mendelian factors for studies. By using aforesaid methods, many QTLs regulating rice grain size were identified in recent years. Among these QTLs3 a major QTL (referred to as GS3 in the present invention) located in the pericentrometric region of rice chromosome 3 was detected by many researchers (Huang et al., 1997, MoI. Breed. 3:105-113; Redona and Mackill, 1998, Theor. Appl. Genet. 96:957-963; Kubo et al., 2001, Rice Genet. Newsl. 18:26-28; Thomson et al., 2003, Theor, Appl. Genet. i 107:479-493; Aluko et al., 2004, Theor. Appl. Genet. 109:630-639). Using F2.3 and a recombinant inbred line population derived from a cross between Zhenshan 97 and Minghui 63, the present invention has detected a major QTL several times that is present in the GS3 locus regulating both the grain length and the grain weight. This QTL explains over 55% of the total variation of grain length, as well as approximately 20% of the total variation of grain weight (Yu et al., 1997, Proc. Natl. Acad. Sci. USA 94:9226-9231; Li et al., 2000, Theor. Appl. Genet. 101:248-254; Tan et al., 2000, Theor. Appl. Genet. 101:823-829; Xing et al., 2001, Acta. Bot. Sin. 43:721-726; Xing et al., 2002, Theor. Appl. Genet. 105:248-257; Hua et al., 2002, Genetics 162:1885-1895). These results indicate that GS3 gene can be stably expressed in various genetic backgrounds and different environments. Therefore, GS3 gene is greatly potent and a good prospect for the improvement of both yield and quality traits of rice. The high resolution mapping of GSS gene and the cloning of the corresponding gene provide new genetic resources for the improvement of both yield and quality in rice breeding.
High resolution mapping of the quantitative traits in common population groups is very difficult because it is not easy to determine whether a major QTL or multiple minor QTL are detected (Yano et al., 1997, Plant Molecular Biology 35:145-153) in said population groups. Secondly, in said population groups, multiple QTLs affecting the same trait are separated, therefore the interference caused and the affects of the environmental factors greatly limit the resolution of the location of a QTL. Special population groups can be established from high resolution mapping of a QTL. A common approach is to establish a near isogenic line (NIL) of an interesting QTL and to eliminate most background differences besides the interesting QTL locus to make said QTL presented as typical Mendelian genetics. Said approach has played an important role in many high resolution mapping and gene cloning of QTLs. Li et al. mapped GS3 to a region of 93.8-kb in length (Li et al., 2004, Genetics 168:2187-2195). Said results have provided a basis for the separation of mapping and the cloning of the GS3 gene.
Upon aforesaid purposes, the present inventors have isolated and cloned a major gene GS3 regulating grain weight and grain length in rice by the approach of mapping and cloning, therefore providing new genetic resources for the improvement of both yield and quality in rice breeding, and also providing clues for the study on the evolution of cultivars.
Detailed Description of the Invention
The present invention relates to the isolation and cloning of the whole DNA fragment encoding a major gene regulating both grain weight and grain length in rice. The present invention also relates to the improvements in both yield and quality of rice using said gene. Said gene is named GS3. The invention established a near isogenic line (NIL) ofGS3. GS3 was finely mapped to a region of 7.9-kb in length using the approach of mapping and cloning. With the help of the sequence information of a whole cDNA in aforesaid region, the invention predicted and analyzed the gene structure of GS3 and the protein encoded by GS3. It was found that GS3 comprises 5 exons and encodes 232 amino acids. It was predicted based on bioinformatics that said protein contains conserved domains including a PEBP-like domain, a transmembrane domain, a cysteine-rich domain of TNFR (tumour necrosis factor receptor)/NGFR (nerve growth factor receptor) and a VWFC (von Willebrand factor type C) homologous domain. In addition, the inventors sequenced and compared three large grain species (Minghui 63, a indica rice in China; H94, from Shanghai Agrobiological Gene Center; 93-11, a rice variety that has been completely sequenced) and 3 small grain species (Zhenshan97, a indica rice in China; Chun 7, from Shanghai Agrobiological Gene Center; Niponbare, a rice variety that has been completely sequenced). It was found that there was only a single base variation in said region of 7.9-kb in length between large grain species and small grain species. Said variation was located in the end of the second exon of GS3, wherein a cysteine codon (TGC) in small grain species is mutated to the terminator codon TGA in large grain species. Said mutation causes GS3 to be prematurely terminated in large grain species and leads to the loss of 178 amino acids, including part of the PEBP-like domain and three other conserved domains. It was found through bioinformatics analysis that the VWFC domain regulates growth and development signals in combination with the growth regulator TGF-β family members. In the large grain species, GS3 lacks the VWFC domain, therefore it loses its capacity to regulate growth and development signaling, which finally leads to the change in grain length and grain weight.
The present invention has the following advantages:
(1) it is the first cloning of a gene that highly affects the grain length and grain weight in rice, therefore providing new gene resources for high yield and good quality in rice breeding, and providing a good technology example for the cloning of homologous genes in other species;
(2) the gene cloned in the present invention provides evidence for the study of domestication and molecular evolution of rice and other species.
Description of Figures
Figure 1 shows the technical flowchart of the present invention;
Figure 2 shows the six cultivars used in sequence alignment;
Figure 3 shows the frequency distribution of 1,000-grain weight, grain length, grain width and grain thickness in the random BC3 F2 subpopulation; wherein (a) frequency distribution of 1,000-grain weight;
(b) frequency distribution of grain length;
(c) frequency distribution of grain width;
(d) frequency distribution of grain thickness;
Figure 4 shows the map of the GS3 locus on the molecular linkage map of chromosome 3 (unit in cM);
Figure 5 shows the maps of the gene region, wherein
(a) The high-resolution map of the GS3 gene. The numbers between molecular markers indicate the numbers of recombination events detected between the GS3 locus and respective markers. Genebank no. OSJNBa0030J19 is a BAC clone of Nipponbare encompassing the GS3 locus;
(b) Organization of the GS3 gene. The positions of exons (black boxes), 5' and 3' UTR (hatched boxes ), translation start codon (ATG), translation stop codon (TGA), and one common single nucleotide mutation in the second exon between the two grain-length groups are indicated, in which a substitution of cysteine (TGC) in the small grain group mutated to the translation stop codon (TGA) in the large grain group;
(c) Predicted sequence of the GS3 gene expression product. The position of the amino acid change in large grain group (cysteine to stop codon) is indicated by an asterisk. The PEBP-like domain is indicated by dashed underline, the transmembrane region by single solid underline, the TNFR/NGFR family cysteine-rich domain by double underline, and the VWFC domain is boxed;
Figure 6 shows the organization of the predicted GS3 protein indicating the localization of the various conserved domains including the PEBP-like domain, the transmembrane domains, the cysteine-rich domain of TNFR/NGFR and the VWFC homologous domain, wherein the PEBP-like domain is located inside the membrane, while the cysteine-rich domain of TNFR and the VWFC homologous domain are located outside the membrane.
Examples
A near isogenic line of rice GS3 gene was established using Minghui 63 as the recurrent parent and Chuan 7 as the donor parent. Mapping and effect evaluation of GS3 gene were carried out on a random BC3 F2 subpopulation consisting of 201 BC3F2 individuals. By further analysis on 1,384 BC3F2 individuals with large grain phenotype using CAPS markers, GS3 was finally mapped between the two CAPS markers GS63 and SF 19 (designed by the inventors; see Table 2), which was approximately 7.9kb in distance. Based on a whole length cDNA sequence, the GS3 gene structure was predicted. In addition, the possible function of GS3 gene was predicted based on the bioinformational technology. It was found by sequence alignment of the 7.9kb fragment that a mutation was presented in all the large grain species which led to deficiency of 178 amino acids. Such deficiency could lead to the loss of the gene function. It implied the essential reason of the change in grain size, and at the same time, proved the correctness of setting GS3 as the target gene (Figure 1).
The following Examples further define the present invention and describe the methods for the isolation and cloning of GS3 gene, as well as the methods for the detection of the base mutation between GS3 alleles by sequence alignment. Based on the following recitation and the Examples, the skilled in the art are able to confirm the essential features of the present invention, and are able to make various changes and adjustments on the present invention to apply it on different uses and conditions without deviating from the concepts and scope of the present invention.
Example 1: establishment of a near isogenic lines of rice GS 3 gene
1. Backcrossing and Screening
As shown in Figure 2, successive crossing was carried out using Minghui 63 (large grain) as the recurrent parent and Chuan 7 (small grain) as the donor parent. Positive selection of GS3 was carried out in F1, BC1F1 and BC2F1, that is, selecting individual plants whose targeting region was Minghui63/Chuan7 heterozygous genotype for the following backcrossing. The targeting region was determined in a region between two known SSR (Simple Sequence Repeat) markers RM282 and RMl 6. In BC3F1, in addition to positive selection, surveillance was also carried out in the genetic backgrounds besides the targeting region. Individual plants with a genetic background closest to Minghui63 were selected for the following experiments. Referring to the published rice genetic linkage map (Temnykh et al., 2000, Theor. Appl. Genet. 100:697-712; Temnykh et al., 2001, Genome Res. 11:1441-1452), 125 SSR markers with known polymorphism in the parents and evenly distributed on the 12 rice chromosomes were selected for surveillance of genetic backgrounds. One plant (BC3F1-19) was finally selected, whose genotype in the RM282 and RMl 6 region was Minghui63/Chuan7 heterozygous genotype, while whose genotype in said 125 SSR markers, only about 20% (25 pairs) of the markers were Minghui63/Chuan7 heterozygous genotype, and the rest thereof were all Mingui63 homozygous genotype. The progeny (BC3F2 and BC3F3) of said individual plant were used in the following experiments.
2. SSR methods
The standard PCR protocol followed the methods taught in Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 3rd ed., Translated by Jin Dong Yan et al., Science Press. In the PCR, a 20μl reaction system was used, which contained: 20-50ng DNA template, 10 mM Tris-HCl, 50 mMKCl, 0.1% Triton X-100, 1.8 niM MgCl2, 0.1 fflM dNTP, 0.2 uM primers (primers of RM282 and RMl 6 as mentioned above) and 1 U Taq DNA polymerase. Conditions for PCR included: 94 °C predenature for 4 min; 94 "C lmin, 55°C lmin, 72 "C lmin, 34 cycles; 72 °C elongation lOmin. PCR products were separated on a 6% acrylamide gel and then silver-stained (Bassam et al , 1991, Anal Biochem 196 80-83)
Example 2 Mapping and effect evaluation of GS3 in the random subpopulation
1 Measurements of traits of large and small grain
Grain particles were air-dried and stored at room temperature for at least 3 months before testing in order to make sure the dryness and water contents thereof were relatively identical Ten randomly chosen full filled grains from each plant were lined up closely in a way that each lay head to head, tail to tail, with no overlap and no gap in between Said grains were arranged length-wise to measure the grain length using a vernier caliper, and then were lined up closely side by side, that is, arranged by breadth to measure the grain width using a vernier caliper Grain thickness was determined for each grain individually using a vernier caliper, and the values were averaged and used as the measurements for the plant Grain weight was calculated based on 200 randomly chosen fully filled grains and converted to 1,000-grain weight
201 individuals of BC3F2 derived from the BC3F1-19 individual plant were randomly selected to form a random subpopulation Distributions of grain weight, grain length, grain width and grain thickness were studied in Minhui 63, Chuan 7 and said random subpopulation of BC3F2 It was found that all the traits were significantly different between the two parents (Table 1) In the random subpopulation, both grain length and 1,000-grain weight expressed a discontinuous distribution The plants were classified as long grain and short grain based on a boundary of 8 50-9 50 mm in grain length or 20 5-21 5g in 1,000-grain weight (Table 1 and Figure 3) Grain length concurred completely with grain weight, such that long grains were heavier than short grains, and vice visa However, grain width and thickness showed normal distributions (Figure 3) For simplicity, in the present invention the large and heavier grains are referred to as long grain, and the opposite type as short grain
Table 1 Descriptive statistics of the traits for the two parents and the long grains and short grains in the random subpopulation
Tiait Parent (mean i SD) MM CC MC
Minghiu fil Cliuan 7 Mean =fc SD Range Mean ± SD Range Mean ± SD Range
1 000-gram weight (g) 2S 6±06 12 5±04 25 6±20 21 5-29 8 175=1=1 3 142-200 190±1 2 140-205 Grain length (mm) 9 91 ) 0099 610 I 00S9 1025 1 029 9 64-10 73 732 J 026 686-7 84 772 J 025 7 24-8 50 Giam width (nun) 2 Ϊ.0+003 248+002 272+0 13 243-2% 2 i>2+0 12 245-λ 06 2ϊ>5+009 2 56-304 Grain thickness (mm) 2 H±006 1 68±003 201 ±009 J 81-221 1 9Id=O 10 1 45-2 10 I 99 ±006 ] 79-2 13
2 Design of molecular markers
Some SSR markers used in the present invention are publicly known In addition, 11 pairs of Indel (Insert/Deletion) and CAPS (cleaved amplified polymorphic sequence) markers which shows polymorphism between Minghui 63 and Chuan 7 were designed based on the genome DNA sequence of Japonica Rice, Nipponbare and India Rice, 93-11. Said Indel and CAPS markers were used in the high resolution mapping analysis of GS3. The DNA sequences of said markers are listed in Table 2.
Table 2. Indel and CAPS markers (primers) developed for the mapping of the GS3 gene locus
Marker Type Forward primer (5'-3') Reverse primer (5'-3') Annealing Restriction
Temp.('C) enzyme
GS06 Indel AGCAAAGCTGGAACGAAGAG TAAATTACGCCGTGTCAACG 55
(SEQ ID NO. 4) (SEQ ID NO. 5)
GS09 Indel GCAACCAAGTCCACGCTAAT TAGCCGAAGATCAGCCTCCT 57
(SEQ ID NO. 6) (SEQ ID NO. 7)
GS47 CAPS GATTATTGGAGACGGGACGA GACGGCATGACCACTCTTTT 55 Hapπ
(SEQ ID NO. 8) (SEQ ID NO. 9)
GS52 CAPS AGCTTTGGTGTCGTTCTGCT CCGACTTGGAGAGAATGGAA 55 BgII
(SEQ ID NO. 10) (SEQ ID NO. 11)
GS56 CAPS GCTGTGTTGTCCTTTGCTGA CCAATAAACCCCACTGCAAC 55 BgII
(SEQ ID NO. I2) (SEQ ID NO. 13)
GS61 CAPS CTTTACAAAACCGGCGGTAA TGAAGCGGACCTAGCATTTT 53 BcII
(SEQ ID NO. 14) (SEQ ID NO. 15)
GS63 CAPS AAGAACGACTACGCGCATCT CCATCGCTCTCTTTCCTCAG 53 Hhal
(SEQID NO. 16) (SEQID NO. 17)
GS64 CAPS CAACACCAGCAACGAACAAC ACGAGGGATTATCAGCCATT 55 EcoRI,
(SEQIDNO. 18) (SEQ IDNO. 19) Hapπ
GS65 CAPS CGGTATGCCAAGTTGAATGA TTGCCGCAGTAAACAAGAAG 55 Hhai
(SEQ ID NC 2O) (SEQ ID NO. 21) HapH
SF18 CAPS CCTTCAGTAAGAGAGATGTG AGTTGATGGTTTTGTGGGAT 57 BcII
(SEQ ID NO. 22) (SEQ ID NO. 23)
SF19 CAPS TCTGCTTGCGGTTATCTGTA TTAGGTCCCTTTTCTCGTCC 57 Sacl
(SEQ ID NO. 24) (SEQ ID NO. 25)
Remark: Markers (primers) GS63 and SF19 were designed by the inventors.
3. QTL mapping and effect evaluation of GS3
Said random subpopulation was subjected to genotype analysis using 6 SSR markers (MRG5959, MRG0164, MRG5881, MRG2646, RM411, RMl 6) and 2 Indel markers (GS06 and GS09). Mapmaker/Exp 3.0 (Lincoln et al.1992, Whitehead Institute Technical Report, Whitehead Institute, Cambridge, Massachusetts, USA) was used to establish a partial genetic linkage map in GS3 region. QTL analysis on the traits including grain length, width, thickness and 1,000-grain weight of the random subpopulation was conducted using the program Mapmaker/QTL 1.1 at a threshold of LOD 3.0. QTL analysis indicated that a QTL found in the interval between GS09 and MRG5881 had effects simultaneously on grain weight, grain length, grain width and grain thickness, and contributed 83.4%, 95.6%, 19.8% and 12.1% of the phenotypic variation on these traits, respectively. The allele from Minghui 63 contributed to the increase of grain weight, grain length and grain thickness, but to the decrease of gram width. Moreover, the QTL also showed different modes of gene actions on the traits, such that partial dominance was observed for 1000-grain weight, grain length and grain thickness, while overdominance was detected for grain width.
Table 3. Effects of the QTL (in the interval GS09-MRG5881) on grain shape and weight
Traits LOD A" Db Var. %c
1,000-grain weight (g) 72.8 -4.08d -2.52d 83.4
Grain length (mm) 129.2 -1.47d -1.06d 95.6
Grain width (mm) 8.9 0.05d 0.08d 19.8
Grain thickness (mm) 5.3 -0.04d 0.004e 12.1 a. Additive effect of Chuan 7 allele b. Dominance effect of Chuan 7 allele c. Percentage of total phenotypic variance explained by the QTL d. Significant at P <0.0001 in t test e. Not significant at P <0.05
4. Progeny test
Each plant in said random subpopulation was bred to 20 families (BC3 F3) which were subjected to progeny test. Progenies of 56 in 201 families were uniformly long grains, while 61 families were uniformly short grains and 84 families had both long and short grains. The ratio of the three groups fit well to the expected ratio (1 :2: 1) of single locus Mendelian segregation (χ2=5.67,P >0.05) in the χ2 test. The results indicated that in this BC3 F2 subpopulation, the grain size was controlled by a major gene, and the small size allele is dominant over the large size allele. The three distinct phenotypic classes corresponded to the three genotypes of the BC3 F2 individuals at the GS3 locus: homozygote for the Minghui 63 allele (long grain), homozygote for the Chuan 7 alleles (short grain), and heterozygote. Using the three phenotypic classes as a marker, GS3 was directly mapped into a 1-cM region delimitated by an Indel marker GS09 and SSR marker MRG5881 (Figure 4).
Example 3. High resolution mapping of GS3
1. CAPS analysis 9 CAPS markers used for the high resolution locating are listed in Table 2. The amplification product amplified by said markers had a size of around 1-kb. The PCR reaction system was identical with the SSR reaction system mentioned above. The amplification condition of the PCR was as below: 940C predenature 4min; 940C lmin, 53°C~57°C lmin, 720C 1.5min, 34 cycles; 720C elongation lOmin. The digestion of the amplicons was carried out in a 20μl reaction system containing: lOμl PCT product, 1 U restriction digestive enzyme (from Takara Ltd., Japan). Additional components were as described in the manual provided by Takara Ltd. After digestion in 370C for 3-5 hours, lOμl of the digestion product was subjected to separation by electrophoresis in a 1.5% agrose gel, which was then observed using UV after EB staining.
2. Analysis of the recombinant plant and high resolution mapping of GS3
To further narrow down the GS3 containing genomic region, 1,384 plants with long grain phenotype (long gram, 9.7 mm or longer) from the BC3 F2 population of 5,740 individuals derived from B C3F1- 19 individual plant were selected for recombinant screening.
All of the 1,384 selected plants were screened using an SSR marker MRG5881 and Indel marker GS09, which identified a total of 55 recombinants which were further confirmed to be very large size singles by progeny test. Using 9 designed CAPS markers to screen the 55 recombinants, it was found that five recombination events were resolved between GS47 and GS3, four identified between GS 52 and GS3, four between GS56 and GS3, three between GS61 and GS3, and two between GS65 and GS3 (Figure 5a). In particular, the assay revealed one recombination event between GS63 and GS3 and two recombination events between SF19 and GS3. In addition, GS64 and SF 18 were found to co-segregate with the GS3 locus. Therefore, the genomic region containing the GS3 locus was narrowed down to the DNA fragment bounded by GS 63 and SF 19, which corresponded to approximately 7.9-kb in length in the genome sequence ofNipponbare and 93-l l(Figure 5a).
Example 4. Gene structure and predicted function analysis of GS 3
(1) Gene structure analysis of GS 3
A full-length cDNA, which is named osigcea013f09t3, from the plumule of an indica cultivar Guangluai 4 (provided by Shanghai Agrobiological Gene Center) was identified in the 7.9-kb fragment (Fig.4b) between GS63 and SF 19 (which were designed by the applicants; refer to Table 2). The nucleotide sequence is shown in SEQ ID NO: 1. The cDNA sequence of the cloned GS3 gene of the present invention is 953bp hi length, which matches well with the region between positions 1.6 and 7.3 kb of the 7.9-kb fragment. Allowing for the regulatory regions on both ends, this is considered as the only candidate gene for GS 3,
The structure of GS3 gene was obtained by comparing the sequences of said total cDNA with the genomic DNA sequence of Nipponbare. GS3 gene is 5,363bp in length from the translation start codon to the termination codon. It comprises 5 exons and 4 introns. The starting exon is 117bp in length, while the second exon is 53bp, third exon 45bp, fourth exon 54bp, terminal exon 430bp in length, respectively. The first intron is l,472bp in length, while the second intron is l,439bp, third intron 83bp, fourth intron l,671bp in length, respectively (Table 5b). Therefore, the open reading frame of GS3 gene is 699bp in length and encodes 232 amino acids. The sequence of said gene is shown in SEQ ID NO: 1.
(2) Function prediction of GS '3
Prediction for the structure of GS3 protein was carried out with InterProScan. It was revealed that the protein encoded by GS3 gene consists of 232 amino acids and comprises several conserved domains. There is a PEBP (phosphatidylethanolamine- binding protein)-like domain in amino acid 12-65 at the 5' terminus. A transmembrane region is located at amino acid 97-117. The region of amino acid 116-1557 is a TNFR (tumor necrosis factor receptor)/NGFR (nerve growth factor receptor) family cysteine-rich domain. The 3' terminal cysteine-rich region shows the characters of the conserved amino acid sequence of the von Willebrand factor type C (VWFC) domain, which is typically 60 -80 amino acid in length and comprises ten cysteines, especially is characterized in that it contains a C2XXC3XC4 sequence located in the middle and a C8C9XXC10 sequence at the 3' terminal end (wherein C represents cysteine; the number represents the order of said conserved cysteines; X indicates any amino acid) (Figure 5 c and Figure 6).
VWFC domain is represented in a number of extracellular matrix proteins. Some studies show that VWFC binds to members of the transforming growth factor TGF-β superfamily, thus disrupting the receptor binding sites of TGF-b superfamily proteins and preventing activation of the TGF-b receptor, such that it regulates the growth factor signaling pathway (Abreu et al., 2002, Gene 287:39-47; O'Leary et al., 2004, J. Biol. Chem. 279:53857-53866).
Example 5: Detection of base pair variation between GS3 alleles by sequence alignment
(1) Sequencing
Two large grain species (Minghui 63 and H94) and two small grain species (Chuan 7 and Zhenshan 97) were sequenced in the target genomic DNA region. DNA fragments from these cultivars were amplified using 10 pairs of primers whose amplicons were partially overlaping with each other. The amplification was carried out with high fidelity LA-Taq (TakaRa, Dalian, China). The PCR products were cloned into pGEM- T vector (Promega, USA) according to the manufacturer's specification. The cloned product was transformed into E. coli DHlOB (Invitrogen, USA) and positive clones were screened with blue-white methods. The T7-R and SP6-F universal primers (Shanghai Sangon Biological Engineering Technology and Services Co. LTD.) and the Big Dye Terminator Cycle Sequencing v3.1 (Applied Biosystems, Foster City, CA, USA) were used for sequencing from both ends of the subclones. Sequence contigs were assembled using the computer program SEQUENCHER 4.1 (Gene Codes Corporation, USA).
Table 4. Primers used in the sequence alignment.
Marker Forward primer (5'-3') Reverse primer (5'-3') Size of Annealing Amplicon Temρ.('C)
GS63 AAGAACGACTACGCGCATCT CCATCGCTCTCTTTCCTCAG 704bp 53
(SEQ ID NO. 16) (SEQID NO. 17) SF26 GTCTGAGGAAAGAGAGCGAT AAGCAAGCCAAGGGAAATGT 1091bp 60
(SEQ ID NO. 26) (SEQ ID NO. 27) SF15 AGCAAAAAAGGTGAAGGACG CAAAGGGAATAACAAGGCAG 1295bp 57
(SEQID NO. 28) (SEQID NO. 29) SF16 CGAATAGGAAGTCAATGGC GTCGTACCCGCCTTAGTTGA 1159bp 55
(SEQ ID NO. 30) (SEQ ID NO. 31) SF28 TGCCCATCTCCCTCGTTTAC TGTTCGTTGCTGGTGTTG 1065bp 55
(SEQ ID NO. 32) (SEQ ID NO.33) GS64 CAACACCAGCAACGAACAAC ACGAGGGATTATCAGCCATT 1155 bp 55
(SEQ ID NO. 18) (SEQID NO. 19) SF18 CCTTCAGTAAGAGAGATGTG AGTTGATGGTTTTGTGGGAT 1245 bp 57
(SEQ ID NO. 22) (SEQ ID NO. 23) SF45 AACCTTCTCTTCCTACCCTT TCAGCAATCACGTACTCATC 1137 bp 55
(SEQ ID NO. 34) (SEQID NO. 35) SF19 TCTGCTTGCGGTTATCTGTA TTAGGTCCCTTTTCTCGTCC 1224 bp 57
(SEQ ID NO. 24) (SEQ ID NO. 25)
The size of the PCR products depended on the sequence of Nipponbare and varied among different species.
(2) Sequence alignment
Sequence alignment was carried out on three large grain varieties (Minghui 63, H94, 93-11) and three small grain varieties (Zhenshan97, Chun 7, Niponbare) (Figure 2). The GS3 region sequences of Nipponbare and 93-11 were from the BAC clone OSJNBa0030J19 and contig Ctg009226, respectively. Sequence alignment was conducted using the computer program Vector NTI 9 (InforMaxTM Corporation,
USA).
Although many nucleotide changes were observed among the six cultivars in the 7.9-kb region, there was only one common single nucleotide mutation detected between these two different grain-length groups, which indicated said mutation was the essential reason causing the grain size change. Further studies showed that said nucleotide mutation was located at the second exon of the GS3 gene, in which a cysteine codon (TGC) in the small-grain group was mutated to a termination codon (TGA) in the large-grain group (Fig.4b). This premature termination resulted in a 178-amino acids truncation in the C-terminus of the predicted protein in the large-grain group, which eliminated part of the PEBP-like domain and all the other three conserved domains. Such nonsense mutation was clearly in agreement with the recessive nature of the long grain phenotype, indicating that long grains resulted from the loss of the function of the protein otherwise producing short grains.

Claims

Claims:What is claimed is:
1. An isolated polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 2 or complements thereof.
2. A recombinant DNA construct comprising a polynucleotide according to claim 1.
3. A recombinant DNA construct according to claim 2, wherein the polynucleotide is operably linked to a promoter functional in a plant cell.
4. A recombinant DNA construct according to claim 2, wherein the polynucleotide is operably linked to a 3' untranslated region functional in a plant cell.
5. A transformed cell or organism comprising a polynucleotide according to claim 1.
6. The transformed cell or organism according to claim 5, wherein the cell is a plant cell or plant.
7. The transformed cell or organism according to claim 6, wherein the organism is a plant selected from the group consisting of cotton, wheat, soybean, maize, rice, and canola.
8. A substantially purified polypeptide comprising the amino acid sequence of SEQ ID NO: 3.
9. An isolated polynucleotide encoding a polypeptide having an amino acid sequence of SEQ ID NO: 3.
10. An isolated polynucleotide encoding a polypeptide having at least 70% amino acid sequence identity with a polypeptide having an amino acid sequence of SEQ ID NO: 3.
11. A recombinant DNA construct comprising a polynucleotide selected from the group consisting of:
(a) a polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 2;
(b) a polynucleotide encoding a polypeptide having an amino acid sequence of SEQ ID NO: 3; (c) a polynucleotide comprising a nucleic acid sequence complementary to a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 2;
(d) a polynucleotide having at least 70% sequence identity to a polynucleotide of (a), (b) or (c);
(e) a polynucleotide encoding a polypeptide having at least 70% sequence identity to a polypeptide having an amino acid sequence of SEQ ID NO: 3;
(f) an oligonucleotide comprising from about 15 to 100 nucleotide bases, wherein the oligonucleotide hybridizes under high stringency conditions to a polynucleotide of (a), (b) or (c);
(g) a polynucleotide comprising a promoter functional in a plant cell, operably joined to a coding sequence for a polypeptide having at least 70% sequence identity to a polypeptide having an amino acid sequence of SEQ ID NO: 3, wherein the encoded polypeptide is a functional homolog of the polypeptide having an amino acid sequence selected of SEQ ID NO: 3; and
(h) a polynucleotide comprising a promoter functional in a plant cell, operably joined to a coding sequence for a polypeptide having an amino acid sequence of SEQ ID NO: 3, wherein transcription of the coding sequence produces an RNA molecule having sufficient complementarity to a polynucleotide encoding the polypeptide to result in decreased expression of the polypeptide when the construct is expressed in a plant cell.
12. A transformed plant comprising a recombinant DNA construct, wherein the construct comprises a promoter region functional in a plant cell operably joined to a polynucleotide comprising a coding sequence for a polypeptide having an amino acid sequence of SEQ ID NO: 3.
13. A transformed plant of claim 12 wherein the polynucleotide is oriented with respect to the promoter such that transcription of the polynucleotide produces an mRNA encoding the polypeptide.
14. A transformed plant of claim 12 wherein the polynucleotide is oriented with respect to the promoter such that transcription from the polynucleotide produces an RNA complementary to the mRNA encoding the polypeptide.
15. The transformed plant according to claim 12, wherein the plant is selected from the group consisting of cotton, wheat, soybean, maize, rice, and canola.
PCT/CN2007/000002 2006-01-05 2007-01-04 A rice gene, gs3, exerting primary control over grain length and grain weight WO2007076727A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP07701934.7A EP1969124B1 (en) 2006-01-05 2007-01-04 A rice gene, gs3, exerting primary control over grain length and grain weight
US12/159,964 US20100017919A1 (en) 2006-01-05 2007-01-04 Rice Gene, GS3, Exerting Primary Control Over Grain Length and Grain Weight
US14/468,261 US20150082496A1 (en) 2006-01-05 2014-08-25 Rice gene, gs3, exerting primary control over grain length and grain weight

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200610018107.4 2006-01-05
CNB2006100181074A CN100554423C (en) 2006-01-05 2006-01-05 A kind of rice grain grain length and heavy major gene GS3 of grain of controlling

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US12/159,964 A-371-Of-International US20100017919A1 (en) 2006-01-05 2007-01-04 Rice Gene, GS3, Exerting Primary Control Over Grain Length and Grain Weight
US14/468,261 Division US20150082496A1 (en) 2006-01-05 2014-08-25 Rice gene, gs3, exerting primary control over grain length and grain weight

Publications (2)

Publication Number Publication Date
WO2007076727A1 true WO2007076727A1 (en) 2007-07-12
WO2007076727A8 WO2007076727A8 (en) 2016-03-17

Family

ID=38227923

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/000002 WO2007076727A1 (en) 2006-01-05 2007-01-04 A rice gene, gs3, exerting primary control over grain length and grain weight

Country Status (4)

Country Link
US (2) US20100017919A1 (en)
EP (1) EP1969124B1 (en)
CN (1) CN100554423C (en)
WO (1) WO2007076727A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2363465A1 (en) * 2008-10-16 2011-09-07 Riken Transgenic plant of which seed has enlarged size
CN110923245A (en) * 2019-12-24 2020-03-27 江西省农业科学院水稻研究所 Rice granule heterosis regulation gene and breeding application thereof

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101880671B (en) * 2010-05-27 2012-05-30 华中农业大学 Cloning and application of major gene GS5 capable of controlling width and weight of rice grain
CN102532289B (en) * 2010-12-31 2014-06-18 中国农业大学 Protein relevant to rice grain weight and encoding gene and application thereof
CN103421802B (en) * 2012-06-18 2015-04-29 华中农业大学 Pleiotropic gene GDS7 for controlling weight and length of paddy rice grain and spikelet number of each panicle
CN103409418B (en) * 2013-08-20 2015-12-23 湖南杂交水稻研究中心 With the closely linked molecule marker of rice big grain gene GS2 and application thereof
CN103882146B (en) * 2014-04-15 2015-04-29 江苏省农业科学院 Four-primer molecular marking method for identifying different genotypes of rice grain length gene GS3
CN105585619B (en) * 2014-11-12 2019-02-26 中国农业大学 With rice grain grain length and grain weight GAP-associated protein GAP and its encoding gene GL3-3 and application
CN104694626B (en) * 2015-01-22 2017-08-11 中国水稻研究所 A kind of method of short grain type paddy rice in utilization molecular marking supplementary breeding
CN106119280B (en) * 2016-07-14 2019-11-05 湖南新春农业生物高科技有限公司 To the long relevant albumen OsJGL2 of rice grain and its encoding gene and application
US11299744B2 (en) 2016-12-12 2022-04-12 Academia Sinica Transgenic plants expressing type 2C protein phosphatase abscisic acid (PP2CABA) proteins and uses thereof
CN109735536A (en) * 2019-03-15 2019-05-10 扬州大学 A kind of wide new gene site qGW2-1 and its molecule labelling method with grain thickness of control rice grain
CN111979233A (en) * 2019-05-22 2020-11-24 江苏省农业科学院 Method for increasing rice grain type and application thereof
CN111849999B (en) * 2019-10-24 2022-11-29 扬州大学 Rice GS3 mutant gene, molecular marker and application thereof
CN111118196B (en) * 2020-01-20 2022-08-26 中国农业科学院油料作物研究所 Molecular marker CNU288 primer of rape grain weight character major gene locus and application thereof
CN111321241B (en) * 2020-03-07 2023-09-26 中国科学院遗传与发育生物学研究所农业资源研究中心 Molecular marker of wheat thousand-grain weight and grain length gene TaGS3-4A and application thereof
CN113817754B (en) * 2021-09-18 2023-03-31 中国水稻研究所 Rice short-grain gene SHG1 and application thereof
CN116162142B (en) * 2022-09-29 2024-02-20 华中农业大学 Application of plant GS3 gene or protein in regulation and control of saline-alkali tolerance of plants

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005070195A1 (en) * 2004-01-26 2005-08-04 Japan As Represented By The President Of National Institute Of Genetics Transgenic plant having increased seed weight and utilization thereof
WO2005094562A1 (en) * 2004-04-02 2005-10-13 Cropdesign N.V. Plants having improved growth characteristics and method for making the same
WO2005097824A2 (en) * 2004-04-02 2005-10-20 Pioneer Hi-Bred International, Inc. Cytokinin oxidase sequences and methods of use

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7834146B2 (en) * 2000-05-08 2010-11-16 Monsanto Technology Llc Recombinant polypeptides associated with plants

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005070195A1 (en) * 2004-01-26 2005-08-04 Japan As Represented By The President Of National Institute Of Genetics Transgenic plant having increased seed weight and utilization thereof
WO2005094562A1 (en) * 2004-04-02 2005-10-13 Cropdesign N.V. Plants having improved growth characteristics and method for making the same
WO2005097824A2 (en) * 2004-04-02 2005-10-20 Pioneer Hi-Bred International, Inc. Cytokinin oxidase sequences and methods of use

Non-Patent Citations (23)

* Cited by examiner, † Cited by third party
Title
ABREU ET AL., GENE, vol. 287, 2002, pages 39 - 47
ALUKO ET AL., THEOR APPL. GENET., vol. 109, 2004, pages 630 - 639
BASSAM ET AL., ANAL, BIOCHEM., vol. 196, 1991, pages 80 - 83
DATABASE GENBANK [online] CHUCHUAN F. ET AL.: "GS3, a major QTL for grain length and weight and minor QLT for grain width and thickness in rice, encodes a putative transmembrane protein", XP003015373, Database accession no. (DQ355996) *
EVANS, RICE BREEDING, BANOS LOS, INTERNATIONAL RICE RESEARCH INSTITUTE, 1972, pages 499 - 511
HUA ET AL., GENETICS, vol. 162, 2002, pages 1885 - 1895
HUANG, MOL. BREED, vol. 3, 1997, pages 105 - 113
KUBO ET AL., RICE GENET. NEWSL., vol. 18, 2001, pages 26 - 28
LI ET AL., APPL. GENET., vol. 101, 2000, pages 248 - 254
LI ET AL., GENETICS, vol. 168, 2004, pages 2187 - 2195
O'LEARY ET AL., J. BIOL. CHEM., vol. 279, 2004, pages 53857 - 53866
REDONA; MACKILL, THEOR. APPL. GENET., vol. 96, 1998, pages 957 - 963
SAMBROOK, J. ET AL.: "Molecular Cloning: A Laboratory Manual", SCIENCE PRESS
See also references of EP1969124A4
TAG THEORETICAL AND APPLIED GENETICS, vol. 112, no. 6, 30 April 2006 (2006-04-30), pages 1164 - 1171 *
TAN ET AL., THEOR. APPL. GENET, vol. 101, 2000, pages 823 - 829
TEMNYKH ET AL., GENOME RES., vol. 11, 2001, pages 1441 - 1452
TEMNYKH ET AL., THEOR, APPL. GENET., vol. 100, 2000, pages 697 - 712
THOMSON ET AL., THEOR. APPL. GENET., vol. 107, 2003, pages 479 - 493
XING ET AL., ACTA. BOT. SIN., vol. 43, 2001, pages 721 - 726
XING ET AL., THEOR. APPL. GENET., vol. 105, 2002, pages 248 - 257
YANO ET AL., PLANT MOLECULAR BIOLOGY, vol. 35, 1997, pages 145 - 153
YU ET AL., PROC. NATL. ACAD. SCI. USA, vol. 94, 1997, pages 9226 - 9231

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2363465A1 (en) * 2008-10-16 2011-09-07 Riken Transgenic plant of which seed has enlarged size
EP2363465A4 (en) * 2008-10-16 2012-05-09 Riken Transgenic plant of which seed has enlarged size
CN110923245A (en) * 2019-12-24 2020-03-27 江西省农业科学院水稻研究所 Rice granule heterosis regulation gene and breeding application thereof

Also Published As

Publication number Publication date
US20150082496A1 (en) 2015-03-19
CN100554423C (en) 2009-10-28
EP1969124A4 (en) 2010-07-14
WO2007076727A8 (en) 2016-03-17
US20100017919A1 (en) 2010-01-21
CN1995347A (en) 2007-07-11
EP1969124A1 (en) 2008-09-17
EP1969124B1 (en) 2016-04-13

Similar Documents

Publication Publication Date Title
EP1969124B1 (en) A rice gene, gs3, exerting primary control over grain length and grain weight
Liu et al. The soybean stem growth habit gene Dt1 is an ortholog of Arabidopsis TERMINAL FLOWER1
EP3018217B1 (en) Maize cytoplasmic male sterility (cms) c-type restorer rf4 gene, molecular markers and their use
US20220186238A1 (en) Diplospory gene
WO2013060136A1 (en) Cloning and application of semi-dominant gene qgl3 capable of controlling grain length and grain weight of rice kernel
Win et al. A single base change explains the independent origin of and selection for the nonshattering gene in African rice domestication
KR20080075908A (en) Nucleic acids and methods for producing seeds having a full diploid complement of the maternal genome in the embryo
KR100990370B1 (en) Genes enhancing resistance to Magnaporthe oryzae and uses thereof
Jiang et al. Mutations in the miRNA165/166 binding site of the HB2 gene result in pleiotropic effects on morphological traits in wheat
Saisho et al. Allelic variation of row type gene Vrs1 in barley and implication of the functional divergence
EP2489738B1 (en) Gene controlling flowering habit/cleistogamy in plants, and use thereof
AU2003259011A1 (en) Nucleic acids from rice conferring resistance to bacterial blight disease caused by xanthomonas spp.
Kerckhoffs et al. Characterization of the gene encoding the apoprotein of phytochrome B2 in tomato, and identification of molecular lesions in two mutant alleles
JP2016507240A (en) Manipulating self-incompatibility in plants
CN103524608B (en) Rice spike neck node regulation gene SUI1 (shorted uppermost Internode 1) and application thereof
WO2013187554A1 (en) In GENE FOR CONTROLLING NUMBER OF SEEDS PER POD IN SOYBEAN AND USES THEREOF
Kim et al. A novel embryo phenotype associated with interspecific hybrid weakness in rice is controlled by the MADS-domain transcription factor OsMADS8
CN114350836B (en) QTL qHD1b for promoting rice heading and application thereof
CN114540366B (en) Rice fertility regulating gene GMS3, mutant and application thereof
CN113817754B (en) Rice short-grain gene SHG1 and application thereof
WO2023168402A2 (en) Rice sequences involved in grain weight under high temperature conditions and methods of making and using
Ding et al. The hairless stem phenotype of cotton (G. barbadense) is linked to a copia-like retrotransposon insertion in homeodomainleucine zipper gene (HD1)
YU et al. Haplotypic structure and allelic variation of rab17, an ABA-responsive gene, in a mini core set of Chinese diversified maize inbred lines
Kobayashi et al. Evidence for an evolutionary force that prevents epigenetic silencing between tail-to-tail rice genes with a short spacer
Park et al. Molecular re-confirmation and floral characteristics of drooping leaf (DL) mutants generated by insertional mutagenesis in rice

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2007701934

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWP Wipo information: published in national office

Ref document number: 2007701934

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 12159964

Country of ref document: US