CN114763375A - Gene for regulating and controlling quality of rice grains and application thereof - Google Patents
Gene for regulating and controlling quality of rice grains and application thereof Download PDFInfo
- Publication number
- CN114763375A CN114763375A CN202011626461.1A CN202011626461A CN114763375A CN 114763375 A CN114763375 A CN 114763375A CN 202011626461 A CN202011626461 A CN 202011626461A CN 114763375 A CN114763375 A CN 114763375A
- Authority
- CN
- China
- Prior art keywords
- grain
- reducing
- increasing
- gene
- plant
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 235000013339 cereals Nutrition 0.000 title claims abstract description 339
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 289
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 132
- 235000009566 rice Nutrition 0.000 title claims abstract description 124
- 230000001105 regulatory effect Effects 0.000 title claims abstract description 43
- 230000001276 controlling effect Effects 0.000 title claims abstract description 17
- 240000007594 Oryza sativa Species 0.000 title abstract description 132
- 241000196324 Embryophyta Species 0.000 claims abstract description 231
- 230000014509 gene expression Effects 0.000 claims abstract description 92
- 238000000034 method Methods 0.000 claims abstract description 60
- 230000009418 agronomic effect Effects 0.000 claims abstract description 48
- 230000001965 increasing effect Effects 0.000 claims description 133
- 229920001184 polypeptide Polymers 0.000 claims description 77
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 77
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 77
- 102000004169 proteins and genes Human genes 0.000 claims description 77
- 230000000694 effects Effects 0.000 claims description 41
- 102000040430 polynucleotide Human genes 0.000 claims description 41
- 108091033319 polynucleotide Proteins 0.000 claims description 41
- 239000002157 polynucleotide Substances 0.000 claims description 41
- 239000003112 inhibitor Substances 0.000 claims description 38
- 150000007523 nucleic acids Chemical group 0.000 claims description 38
- 102000039446 nucleic acids Human genes 0.000 claims description 30
- 108020004707 nucleic acids Proteins 0.000 claims description 30
- 229920002472 Starch Polymers 0.000 claims description 23
- 235000019698 starch Nutrition 0.000 claims description 23
- 239000008107 starch Substances 0.000 claims description 23
- 229920000856 Amylose Polymers 0.000 claims description 21
- 230000035945 sensitivity Effects 0.000 claims description 19
- 239000000126 substance Substances 0.000 claims description 19
- 230000002401 inhibitory effect Effects 0.000 claims description 15
- -1 small molecule compound Chemical class 0.000 claims description 15
- 230000000295 complement effect Effects 0.000 claims description 14
- 238000012217 deletion Methods 0.000 claims description 14
- 230000037430 deletion Effects 0.000 claims description 14
- 238000013518 transcription Methods 0.000 claims description 14
- 230000035897 transcription Effects 0.000 claims description 14
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 13
- 125000000539 amino acid group Chemical group 0.000 claims description 12
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 11
- 108010073032 Grain Proteins Proteins 0.000 claims description 10
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 9
- 230000002222 downregulating effect Effects 0.000 claims description 9
- 239000002773 nucleotide Substances 0.000 claims description 9
- 125000003729 nucleotide group Chemical group 0.000 claims description 9
- 238000006467 substitution reaction Methods 0.000 claims description 8
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 7
- 108700011259 MicroRNAs Proteins 0.000 claims description 6
- 108091030071 RNAI Proteins 0.000 claims description 6
- 230000000692 anti-sense effect Effects 0.000 claims description 6
- 230000009368 gene silencing by RNA Effects 0.000 claims description 6
- 239000002679 microRNA Substances 0.000 claims description 6
- 230000008685 targeting Effects 0.000 claims description 6
- 108020004459 Small interfering RNA Proteins 0.000 claims description 5
- 238000004904 shortening Methods 0.000 claims description 5
- 239000003147 molecular marker Substances 0.000 claims description 4
- 108020003589 5' Untranslated Regions Proteins 0.000 claims description 3
- 241000209094 Oryza Species 0.000 claims description 3
- 238000007792 addition Methods 0.000 claims description 3
- 230000000087 stabilizing effect Effects 0.000 claims description 3
- 230000005764 inhibitory process Effects 0.000 claims description 2
- 230000004952 protein activity Effects 0.000 claims description 2
- 210000004027 cell Anatomy 0.000 description 118
- 235000018102 proteins Nutrition 0.000 description 69
- 238000002474 experimental method Methods 0.000 description 37
- 239000000463 material Substances 0.000 description 32
- 239000013598 vector Substances 0.000 description 31
- 241000746966 Zizania Species 0.000 description 30
- 235000002636 Zizania aquatica Nutrition 0.000 description 30
- 238000004458 analytical method Methods 0.000 description 28
- 230000006870 function Effects 0.000 description 26
- 108091026890 Coding region Proteins 0.000 description 25
- 101001086444 Oryza sativa subsp. japonica Transcription repressor OFP8 Proteins 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 20
- 230000009261 transgenic effect Effects 0.000 description 20
- 102100028914 Catenin beta-1 Human genes 0.000 description 18
- 101000903715 Oryza sativa subsp. japonica Shaggy-related protein kinase GSK2 Proteins 0.000 description 18
- 230000002829 reductive effect Effects 0.000 description 18
- 239000012634 fragment Substances 0.000 description 15
- 108010014223 Armadillo Domain Proteins Proteins 0.000 description 14
- 235000001014 amino acid Nutrition 0.000 description 14
- 229940024606 amino acid Drugs 0.000 description 14
- 210000001519 tissue Anatomy 0.000 description 14
- 150000001413 amino acids Chemical class 0.000 description 13
- 230000033228 biological regulation Effects 0.000 description 13
- 239000005090 green fluorescent protein Substances 0.000 description 13
- 210000000056 organ Anatomy 0.000 description 13
- 230000009466 transformation Effects 0.000 description 13
- 241000208125 Nicotiana Species 0.000 description 12
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 12
- 238000012360 testing method Methods 0.000 description 12
- 241000289632 Dasypodidae Species 0.000 description 11
- 230000035772 mutation Effects 0.000 description 11
- 238000011160 research Methods 0.000 description 11
- 239000002245 particle Substances 0.000 description 10
- 210000001938 protoplast Anatomy 0.000 description 10
- 108091033409 CRISPR Proteins 0.000 description 9
- 238000009395 breeding Methods 0.000 description 9
- 230000001488 breeding effect Effects 0.000 description 9
- 230000008859 change Effects 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 9
- 210000000805 cytoplasm Anatomy 0.000 description 9
- 210000005069 ears Anatomy 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 239000008187 granular material Substances 0.000 description 9
- 238000003780 insertion Methods 0.000 description 9
- 230000037431 insertion Effects 0.000 description 9
- 230000037361 pathway Effects 0.000 description 9
- 238000011282 treatment Methods 0.000 description 9
- 241000589158 Agrobacterium Species 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 8
- 210000000349 chromosome Anatomy 0.000 description 8
- 230000001976 improved effect Effects 0.000 description 8
- 230000002018 overexpression Effects 0.000 description 8
- 101100058970 Arabidopsis thaliana CALS11 gene Proteins 0.000 description 7
- 238000011529 RT qPCR Methods 0.000 description 7
- 101100341076 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) IPK1 gene Proteins 0.000 description 7
- 238000007796 conventional method Methods 0.000 description 7
- 230000002068 genetic effect Effects 0.000 description 7
- 230000019491 signal transduction Effects 0.000 description 7
- 102000015735 Beta-catenin Human genes 0.000 description 6
- 108060000903 Beta-catenin Proteins 0.000 description 6
- 101001121312 Oryza sativa subsp. japonica Serine/threonine protein kinase OSK1 Proteins 0.000 description 6
- 210000000170 cell membrane Anatomy 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 239000005556 hormone Substances 0.000 description 6
- 229940088597 hormone Drugs 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 238000013507 mapping Methods 0.000 description 6
- 230000007246 mechanism Effects 0.000 description 6
- 230000004960 subcellular localization Effects 0.000 description 6
- 238000012795 verification Methods 0.000 description 6
- 235000007340 Hordeum vulgare Nutrition 0.000 description 5
- 240000005979 Hordeum vulgare Species 0.000 description 5
- 240000002582 Oryza sativa Indica Group Species 0.000 description 5
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 5
- 241000209504 Poaceae Species 0.000 description 5
- 244000184734 Pyrus japonica Species 0.000 description 5
- 238000000692 Student's t-test Methods 0.000 description 5
- 238000010459 TALEN Methods 0.000 description 5
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 5
- 241000209140 Triticum Species 0.000 description 5
- 235000021307 Triticum Nutrition 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 231100000221 frame shift mutation induction Toxicity 0.000 description 5
- 230000037433 frameshift Effects 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 230000002452 interceptive effect Effects 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 238000012256 transgenic experiment Methods 0.000 description 5
- 108700028369 Alleles Proteins 0.000 description 4
- 235000007319 Avena orientalis Nutrition 0.000 description 4
- 244000075850 Avena orientalis Species 0.000 description 4
- 102100039556 Galectin-4 Human genes 0.000 description 4
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 4
- 241000209056 Secale Species 0.000 description 4
- 235000007238 Secale cereale Nutrition 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 241001433070 Xiphoides Species 0.000 description 4
- 230000032823 cell division Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 230000008045 co-localization Effects 0.000 description 4
- 235000009508 confectionery Nutrition 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 235000014113 dietary fatty acids Nutrition 0.000 description 4
- 229930195729 fatty acid Natural products 0.000 description 4
- 239000000194 fatty acid Substances 0.000 description 4
- 150000004665 fatty acids Chemical class 0.000 description 4
- 235000013305 food Nutrition 0.000 description 4
- 238000003209 gene knockout Methods 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 230000004807 localization Effects 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 230000004853 protein function Effects 0.000 description 4
- 230000011664 signaling Effects 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000012353 t test Methods 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 238000010200 validation analysis Methods 0.000 description 4
- 210000002417 xiphoid bone Anatomy 0.000 description 4
- 238000010354 CRISPR gene editing Methods 0.000 description 3
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 3
- 101000916173 Homo sapiens Catenin beta-1 Proteins 0.000 description 3
- 244000118056 Oryza rufipogon Species 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 238000003559 RNA-seq method Methods 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- 238000010378 bimolecular fluorescence complementation Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000001086 cytosolic effect Effects 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 210000001161 mammalian embryo Anatomy 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004983 pleiotropic effect Effects 0.000 description 3
- 238000006116 polymerization reaction Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000001172 regenerating effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000035882 stress Effects 0.000 description 3
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 3
- 238000001086 yeast two-hybrid system Methods 0.000 description 3
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 3
- JESCETIFNOFKEU-SJORKVTESA-N (2s,5r)-5-[4-[(2-fluorophenyl)methoxy]phenyl]pyrrolidine-2-carboxamide Chemical compound N1[C@H](C(=O)N)CC[C@@H]1C(C=C1)=CC=C1OCC1=CC=CC=C1F JESCETIFNOFKEU-SJORKVTESA-N 0.000 description 2
- 102100035352 2-oxoisovalerate dehydrogenase subunit alpha, mitochondrial Human genes 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- 241000384062 Armadillo Species 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 208000035240 Disease Resistance Diseases 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 108091006027 G proteins Proteins 0.000 description 2
- 101150086875 GRF4 gene Proteins 0.000 description 2
- 108060006662 GSK3 Proteins 0.000 description 2
- 102000001267 GSK3 Human genes 0.000 description 2
- 102000030782 GTP binding Human genes 0.000 description 2
- 108091000058 GTP-Binding Proteins 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 102100033558 Histone H1.8 Human genes 0.000 description 2
- 101100123312 Homo sapiens H1-8 gene Proteins 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 101150043187 LAX2 gene Proteins 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 101710183958 MADS-box transcription factor 1 Proteins 0.000 description 2
- 240000003433 Miscanthus floridulus Species 0.000 description 2
- 101150080703 OFP8 gene Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 101100230208 Oryza sativa subsp. japonica GSK2 gene Proteins 0.000 description 2
- 101100454022 Oryza sativa subsp. japonica OSH1 gene Proteins 0.000 description 2
- 101100521401 Oryza sativa subsp. japonica PRR37 gene Proteins 0.000 description 2
- 101100466242 Oryza sativa subsp. japonica TUD1 gene Proteins 0.000 description 2
- 101710197096 Protein BZR1 homolog 1 Proteins 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 108700005075 Regulator Genes Proteins 0.000 description 2
- 101100242307 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SWH1 gene Proteins 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 230000000996 additive effect Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 239000005557 antagonist Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 210000002615 epidermis Anatomy 0.000 description 2
- 238000013401 experimental design Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 238000012252 genetic analysis Methods 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- NFVJNJQRWPQVOA-UHFFFAOYSA-N n-[2-chloro-5-(trifluoromethyl)phenyl]-2-[3-(4-ethyl-5-ethylsulfanyl-1,2,4-triazol-3-yl)piperidin-1-yl]acetamide Chemical compound CCN1C(SCC)=NN=C1C1CN(CC(=O)NC=2C(=CC=C(C=2)C(F)(F)F)Cl)CCC1 NFVJNJQRWPQVOA-UHFFFAOYSA-N 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 230000034512 ubiquitination Effects 0.000 description 2
- 238000010798 ubiquitination Methods 0.000 description 2
- 230000003827 upregulation Effects 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 101150084750 1 gene Proteins 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101100112524 Arabidopsis thaliana CYCA2-3 gene Proteins 0.000 description 1
- 101100188600 Arabidopsis thaliana OFP14 gene Proteins 0.000 description 1
- 101100301807 Arabidopsis thaliana RGA gene Proteins 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 101150060196 BZR1 gene Proteins 0.000 description 1
- IXVMHGVQKLDRKH-VRESXRICSA-N Brassinolide Natural products O=C1OC[C@@H]2[C@@H]3[C@@](C)([C@H]([C@@H]([C@@H](O)[C@H](O)[C@H](C(C)C)C)C)CC3)CC[C@@H]2[C@]2(C)[C@@H]1C[C@H](O)[C@H](O)C2 IXVMHGVQKLDRKH-VRESXRICSA-N 0.000 description 1
- 108700020472 CDC20 Proteins 0.000 description 1
- 101150104247 CYCB2-1 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 101710174494 Catenin beta-1 Proteins 0.000 description 1
- 102000016362 Catenins Human genes 0.000 description 1
- 108010067316 Catenins Proteins 0.000 description 1
- 101150023302 Cdc20 gene Proteins 0.000 description 1
- 102100038099 Cell division cycle protein 20 homolog Human genes 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000938605 Crocodylia Species 0.000 description 1
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- MKVKKORBPTUSNX-LPEHRKFASA-N Cys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N MKVKKORBPTUSNX-LPEHRKFASA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 102100039606 DNA replication licensing factor MCM3 Human genes 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010036466 E2F2 Transcription Factor Proteins 0.000 description 1
- 102000012078 E2F2 Transcription Factor Human genes 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108091092584 GDNA Proteins 0.000 description 1
- 101150102561 GPA1 gene Proteins 0.000 description 1
- 108091007911 GSKs Proteins 0.000 description 1
- 102000038624 GSKs Human genes 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- UVUIXIVPKVMONA-CIUDSAMLSA-N His-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CN=CN1 UVUIXIVPKVMONA-CIUDSAMLSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- 101000963174 Homo sapiens DNA replication licensing factor MCM3 Proteins 0.000 description 1
- 101000801619 Homo sapiens Long-chain-fatty-acid-CoA ligase ACSBG1 Proteins 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- 102000010638 Kinesin Human genes 0.000 description 1
- 108010063296 Kinesin Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000878006 Miscanthus sinensis Species 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 102100033615 Nucleoprotein TPR Human genes 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 235000005044 Oryza sativa Indica Group Nutrition 0.000 description 1
- 101000933485 Oryza sativa subsp. japonica Brassinosteroid LRR receptor kinase BRI1 Proteins 0.000 description 1
- 101100223975 Oryza sativa subsp. japonica DLT gene Proteins 0.000 description 1
- 101000766225 Oryza sativa subsp. japonica LRR receptor kinase BAK1 Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000004245 Proteasome Endopeptidase Complex Human genes 0.000 description 1
- 108090000708 Proteasome Endopeptidase Complex Proteins 0.000 description 1
- 101150028245 RGA1 gene Proteins 0.000 description 1
- 108020004412 RNA 3' Polyadenylation Signals Proteins 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 101100010298 Schizosaccharomyces pombe (strain 972 / ATCC 24843) pol2 gene Proteins 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108010074506 Transfer Factor Proteins 0.000 description 1
- 102100038896 Transmembrane protein domain Human genes 0.000 description 1
- 101710141239 Transmembrane protein domain Proteins 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000006137 acetoxylation reaction Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000012271 agricultural production Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000012098 association analyses Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 238000009412 basement excavation Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- IXVMHGVQKLDRKH-KNBKMWSGSA-N brassinolide Chemical compound C1OC(=O)[C@H]2C[C@H](O)[C@H](O)C[C@]2(C)[C@H]2CC[C@]3(C)[C@@H]([C@H](C)[C@@H](O)[C@H](O)[C@@H](C)C(C)C)CC[C@H]3[C@@H]21 IXVMHGVQKLDRKH-KNBKMWSGSA-N 0.000 description 1
- 150000001646 brassinolides Chemical class 0.000 description 1
- 235000021329 brown rice Nutrition 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000003436 cytoskeletal effect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000009025 developmental regulation Effects 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 230000004577 ear development Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006353 environmental stress Effects 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000009313 farming Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 102000054766 genetic haplotypes Human genes 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 230000001279 glycosylating effect Effects 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 238000001033 granulometry Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 239000010903 husk Substances 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 210000004020 intracellular membrane Anatomy 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000000366 juvenile effect Effects 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 238000007481 next generation sequencing Methods 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 230000030648 nucleus localization Effects 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical compound OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N phosphotyrosine Chemical compound OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 229930195732 phytohormone Natural products 0.000 description 1
- 230000004260 plant-type cell wall biogenesis Effects 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 238000000455 protein structure prediction Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000004626 scanning electron microscopy Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000012418 validation experiment Methods 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Botany (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Cell Biology (AREA)
- Mycology (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention relates to a gene for regulating and controlling rice grain quality and application thereof. The invention provides a method for regulating and controlling the agronomic traits and/or the grain quality of plants, which comprises regulating the expression of a GS10 gene in the plants.
Description
Technical Field
The invention belongs to the field of biotechnology and botany; more specifically, the invention relates to a regulatory gene of rice grain quality and application thereof.
Background
Rice is one of the most important food crops for human beings. It is estimated that by 2050, the world population will reach 90 billion. The growing food demand of the rapid population, the reduction of arable area, and the deterioration of farming environment pose serious challenges to human food safety. Therefore, in limited land resources, the cultivation of a new green rice variety with high safety, high yield and high efficiency is an important scientific problem faced by contemporary molecular breeders.
The yield of rice is regulated and controlled by various important complex agronomic traits, such as plant type (plant height, spike shape, leaf inclination angle and the like), ion nutrition absorption, disease resistance, insect resistance, environmental stress resistance and the like. At present, methods for cloning rice related genes mainly comprise QTL, mutant localization and whole Genome association analysis (GWAS). The grain yield of rice is mainly determined by four factors, namely the number of ears per plant or the effective tiller number, the number of ears, the grain weight and the proportion of full grains. In contrast to the most common dicotyledonous model plant Arabidopsis embryos, which occupy most of the space of the mature seed, in rice seeds the endosperm occupies most of the space of the mature seed. In a mature seed, a large fraction of storage compounds, mainly starch, relatively small amounts of storage proteins and lipids and traces of other substances are contained. The rice grain type character is one of important agronomic characters of rice and is an important factor forming the rice yield, so that the excavation and analysis of the rice complex character related genes have certain guiding significance for future rice breeding. In the past research, as more and more rice grain type QTLs are discovered, such as GW2, GS3, GS5 and the like, molecular control network diagrams among the QTLs are continuously perfected, so that more favorable conditions are provided for molecular marker breeding. The regulation mechanism of the granule type molecule mainly comprises ubiquitination mediated proteasome degradation, phytohormones, G-protein signal transduction and the like.
Tandem Repeat proteins such as Armadillo repeats associating protein (ARM protein), Tetratricopeptide Repeat (TPR protein) and WD40 protein are proteins that are commonly present in animals and plants and play important roles in various vital activities. Among them, the Armadillo (ARM) catenin family is a family that is widely found in prokaryotes and eukaryotes and consists of multiple tandem repeat proteins, each repeat consisting of approximately 42 amino acids. First, each repeat forms three α -helical building blocks, and gradually forms a conserved tertiary structure when multiple tandem repeats are formed.
In the past research on rice grain type related genes, more than 20 QTLs for controlling grain types are reported and the mechanism of the QTLs is analyzed one after another, but the regulation network of the grain type gene mechanism and the relation between the grain types and the quality are rarely known. Wild rice is subject to a great deal of spontaneous variation in order to adapt to a variable natural environment, and the gene polymorphism of the wild rice is often more abundant than that of cultivated rice on the genome of the wild rice. Meanwhile, only a few natural variations are fixed in the cultivated rice due to the influence of bottleneck effect in the acclimatization process from the wild rice to the cultivated rice. There are still a large number of variations in the wild rice population, which have great potential for improving the adaptation of cultivation to the natural environment. Few QTLs are known based on current development and utilization of wild rice populations, and the specific function of these environmentally selected mutation sites is largely unknown.
With the discovery of more and more molecular analysis on rice grain type characters, a certain balance mechanism exists between rice grain type and single plant yield. For example, in mutant studies, GSN1 was found to encode a protein that binds OsMKP1 to OsMPK6 and dephosphorylates it, thereby further participating in the cascade signaling of OsMKKK10-OsMKK4-OsMPK 6. There is a balance between GSN1 and GSN1 between grain number per ear and grain size, and the final yield per plant is not necessarily affected. The same balance mechanism can also reflect the balance between grain type, effective tillering, plant height and other agronomic traits, and the plant self has 'source-sink flow'. Although many grain types have been cloned at present, the actual application of the grain type greatly contributes little to the yield.
When the progeny of the hybrid rice population is analyzed for quality-related genes, the most closely related characteristics in the aspect of rice quality regulation are grain type characteristics, particularly aspect ratio, except for the known natural variation sites of wax and talk 5 which mainly regulate the rice quality. In addition, research and grain types affecting rice quality, particularly appearance quality, are the most compact, and currently cloned QTLs mainly comprise GL7/GW7, GLW7, GW8, GS9, GW5 and the like, and the genes show excellent appearance quality when the genes are slender grains, which is probably related to the development and filling rate of glumes. Therefore, to further address the future breeding goal of "high yield and high quality", we need to resort to newer methods and more grain type-related gene cloning and functional studies.
Disclosure of Invention
The invention aims to provide a rice quality related gene and application thereof.
In a first aspect the present invention provides the use of a substance selected from the group consisting of: a GS10 gene or a protein encoding it, or an enhancer or inhibitor thereof, for use in modulating an agronomic trait in a plant selected from one or more of: leaf length, leaf width, single plant yield, cell yield, plant height, tiller number, grain number per ear, ear length, branch number, top awn, heading stage, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content and grain amylose content.
In one or more embodiments, the substance is the GS10 gene or its encoded protein, or its promoter, and the modulating an agronomic trait of a plant is selected from one or more of: reducing the leaf length, increasing the leaf width, reducing the yield of a single plant, reducing the yield of a cell, reducing the plant height, reducing the tillering number, reducing the grain number per spike, reducing the spike length, reducing the number of branches, reducing the awn, shortening the heading stage, increasing the diameter of stalks, increasing the width of glumes cells, reducing the length of glumes cells, increasing the unit area of glumes cells, reducing the inclination angle of leaves, reducing the grain length, increasing the grain width, increasing the grain thickness, increasing the grain weight, increasing the BR sensitivity, negatively regulating and controlling a BR signal path, reducing the transparency of grains, increasing the compactness of starch cells of the grains, reducing the protein content of the grains and reducing the amylose content of the grains.
In one or more embodiments, the substance is an inhibitor of the GS10 gene, and the agronomic trait of the regulatory plant is selected from one or more of: increasing leaf length, reducing leaf width, increasing single plant yield, increasing cell yield, increasing plant height, increasing tiller number, increasing grain number per spike, increasing spike length, increasing branch number, increasing top awn, prolonging heading period, reducing stem diameter, reducing glume cell width, increasing glume cell length, reducing glume cell unit area, increasing leaf inclination angle, increasing grain length, reducing grain width, reducing grain thickness, reducing grain weight, BR sensitivity, positively regulating BR signal path, increasing grain transparency, reducing grain starch cell compactness, increasing grain protein content, and increasing grain amylose content.
In one or more embodiments, the accelerator is selected from the group consisting of: a small molecule compound, a nucleic acid molecule, or a combination thereof. Preferably, the nucleic acid molecule is a nucleic acid construct comprising the coding sequence of GS 10.
In one or more embodiments, the inhibitor is an inhibitory molecule that specifically interferes with the transcription and/or expression of the GS10 gene.
In one or more embodiments, the suppressor molecule targets the GS10 gene or its transcript as a suppressor.
In one or more embodiments, the inhibitory molecule has SEQ ID NO 1 or 2 as the inhibitory target.
In one or more embodiments, the inhibitory molecule is selected from the group consisting of: (1) a small molecule compound, an antisense nucleic acid, a microRNA, a siRNA, an RNAi, a dsRNA, a sgRNA, an antibody, or a combination thereof, and (2) a nucleic acid construct capable of expressing or forming (1). Preferably, the inhibitory molecule is a dsRNA or construct targeting SEQ ID NO 2 or its transcript for silencing.
In one or more embodiments, the inhibitor is an agent that causes a deletion mutation in an exon of the GS10 gene. Preferably, the inhibitor is an agent that deletes position 569-572 or 1209-1213 of the GS10 coding sequence (e.g., SEQ ID NO: 2).
In one or more embodiments, the inhibitor is an agent, such as an sgRNA, that knocks out the GS10 coding sequence (e.g., SEQ ID NO:2) at positions 569-572 or 1209-1213 using a technique selected from the group consisting of ZFN, TALEN, and CRISPR.
In one or more embodiments, the inhibitor is a sgRNA used to knock out the GS10 coding sequence (e.g., SEQ ID NO:2) at positions 569-572 or 1209-1213.
In one or more embodiments, the sgrnas are as set forth in SEQ ID NOs 3 or 4.
In one or more embodiments, the inhibitor further comprises a Cas enzyme (e.g., Cas9), its coding sequence, and/or a nucleic acid construct expressing the Cas enzyme.
In one or more embodiments, the plant is a cereal crop.
In one or more embodiments, the plant is a graminaceous plant.
In one or more embodiments, the gramineae is rice, barley, wheat, oats, rye.
In one or more embodiments, the rice comprises indica, japonica, or a combination thereof.
In one or more embodiments, the rice is nipponbare, DJ.
In one or more embodiments, the number of branches includes a primary number of branches and/or a secondary number of branches.
In one or more embodiments, the leaves include xiphoid leaves.
In one or more embodiments, the GS10 gene includes a cDNA sequence, a genomic sequence, or a combination thereof.
In one or more embodiments, the GS10 gene is from a plant of the family poaceae, preferably from rice.
In one or more embodiments, the amino acid sequence of the GS10 gene is selected from the group consisting of:
(a) polypeptide with a sequence shown as SEQ ID NO. 1;
(b) a polypeptide which is formed by substituting, deleting or adding one or more (such as 1-20; preferably 1-10; more preferably 1-5) amino acid residues of the sequence shown in SEQ ID NO. 1, has the functions of the polypeptide (a) and is derived from the polypeptide (a); or
(c) A polypeptide derived from (a) having more than 90% (preferably 93%; more preferably 95% or 98%) homology to the polypeptide sequence of (a) and having the function of the polypeptide of (a).
In one or more embodiments, the nucleic acid sequence of the GS10 gene is selected from the group consisting of:
(1) a polynucleotide encoding a polypeptide as shown in SEQ ID NO. 1;
(2) 2 or a polynucleotide having 80% (preferably 90%; more preferably 95% or 98%) or more homology thereto;
(3) a polynucleotide in which 1 to 60, preferably 1 to 30, more preferably 1 to 10) nucleotides are truncated or added at the 5 'end and/or 3' end of the polynucleotide shown in SEQ ID NO. 2;
(4) a polynucleotide complementary to the polynucleotide of any one of (1) to (3).
In a second aspect, the present invention provides a method of modulating an agronomic trait in a plant, the method comprising: regulating the expression or activity of the GS10 gene in the plant, thereby regulating the agronomic traits of the plant. Preferably, the agronomic trait is selected from one or more of: leaf length, leaf width, single plant yield, cell yield, plant height, tillering number, grain number per spike, spike length, branch number, top awn, heading period, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content and grain amylose content.
In one or more embodiments, the plant is a cereal crop.
In one or more embodiments, the plant is a graminaceous plant.
In one or more embodiments, the gramineae is rice, barley, wheat, oats, rye.
In one or more embodiments, the rice comprises indica, japonica, or a combination thereof.
In one or more embodiments, the rice is nipponica.
In one or more embodiments, the amino acid sequence of the GS10 gene is selected from the group consisting of seq id no:
(a) polypeptide with a sequence shown as SEQ ID NO. 1;
(b) a polypeptide which is formed by substituting, deleting or adding one or more (such as 1-20; preferably 1-10; more preferably 1-5) amino acid residues of the sequence shown in SEQ ID NO. 1, has the functions of the polypeptide (a) and is derived from the polypeptide (a); or
(c) A polypeptide derived from (a) having more than 90% (preferably 93%; more preferably 95% or 98%) homology to the polypeptide sequence of (a) and having the function of the polypeptide of (a).
In one or more embodiments, the nucleic acid sequence of the GS10 gene is selected from the group consisting of:
(1) a polynucleotide encoding a polypeptide as shown in SEQ ID NO. 1;
(2) 2 or a polynucleotide having 80% (preferably 90%; more preferably 95% or 98%) or more homology thereto;
(3) a polynucleotide in which 1 to 60, preferably 1 to 30, more preferably 1 to 10) nucleotides are truncated or added at the 5 'end and/or 3' end of the polynucleotide shown in SEQ ID NO. 2;
(4) a polynucleotide complementary to any one of the polynucleotides described in (1) to (3).
In a preferred embodiment, the method of modulating an agronomic trait in a plant comprises: up-regulating the expression or activity of the GS10 gene in a plant; thereby reducing the leaf length, increasing the leaf width, reducing the yield of a single plant, reducing the yield of a cell, reducing the plant height, reducing the tillering number, reducing the grain number per spike, reducing the spike length, reducing the number of branches and stalks, reducing the awn, shortening the heading stage, increasing the diameter of the stems, increasing the width of glume cells, reducing the length of glume cells, increasing the unit area of glume cells, reducing the leaf inclination angle, reducing the grain length, increasing the grain width, increasing the grain thickness, increasing the grain weight, increasing the BR sensitivity, negatively regulating and controlling the BR signal path, reducing the transparency of grains, increasing the compactness of starch cells of the grains, reducing the protein content of the grains and reducing the amylose content of the grains.
In one or more embodiments, the upregulating expression of the GS10 gene in a plant comprises: the GS10 gene is transferred into plants to obtain transformed plants.
In one or more embodiments, expression of the GS10 gene is driven by the Ubi promoter.
In one or more embodiments, the method of upregulating expression of the GS10 gene in a plant comprises:
(1) providing an Agrobacterium harboring a nucleic acid construct comprising the GS10 gene,
(2) contacting a cell or tissue or organ of a plant with the Agrobacterium of step (1), thereby transferring the nucleic acid construct into the plant tissue or organ.
In one or more embodiments, the nucleic acid construct is an expression vector or a recombinant vector.
In one or more embodiments, the method of upregulating expression of the GS10 gene in a plant further comprises:
(3) selecting plant tissues, organs or seeds into which the GS10 gene is transferred; and
(4) regenerating the plant tissue, organ or seed of step (3) into a plant.
In another preferred embodiment, the method of modulating an agronomic trait in a plant comprises: down-regulating expression or activity of GS10 in a plant; thereby increasing the leaf length, reducing the leaf width, increasing the yield of a single plant, increasing the yield of a cell, increasing the plant height, increasing the tillering number, increasing the grain number per spike, increasing the spike length, increasing the number of branches and stalks, increasing the awn, prolonging the heading period, reducing the diameter of the stalks, reducing the width of glume cells, increasing the length of glume cells, reducing the unit area of glume cells, increasing the inclination angle of the leaves, increasing the grain length, reducing the grain width, reducing the grain thickness, reducing the grain weight, BR sensitivity, positively regulating a BR signal channel, increasing the transparency of grains, reducing the compactness of starch cells of the grains, increasing the protein content of the grains and increasing the amylose content of the grains.
In one or more embodiments, the downregulating expression or activity of a GS10 gene in a plant comprises: an inhibitor that down-regulates GS10 gene transcription, protein expression, or protein activity is transferred into a plant.
In one or more embodiments, the inhibitor is an inhibitory molecule that specifically interferes with the transcription and/or expression of the GS10 gene.
In one or more embodiments, the suppressor molecule targets the GS10 gene or its transcript as a suppressor.
In one or more embodiments, the inhibitory molecule has SEQ ID NO 1 or 2 as the inhibitory target.
In one or more embodiments, the inhibitory molecule is selected from the group consisting of: (1) a small molecule compound, an antisense nucleic acid, a microRNA, a siRNA, an RNAi, a dsRNA, a sgRNA, an antibody, or a combination thereof, and (2) a nucleic acid construct capable of expressing or forming (1). Preferably, the inhibitory molecule is a dsRNA or construct targeting SEQ ID NO 2 or its transcript for silencing.
In one or more embodiments, the inhibitor is an agent that causes a deletion mutation in an exon of the GS10 gene. Preferably, the inhibitor is an agent that deletes the GS10 coding sequence (e.g., SEQ ID NO:2) at positions 569-572 or 1209-1213.
In one or more embodiments, the inhibitor is an agent, e.g., an sgRNA, that knocks out the GS10 coding sequence (e.g., SEQ ID NO:2) at positions 569-572 or 1209-1213 using a technique selected from the group consisting of ZFNs, TALENs, and CRISPR.
In one or more embodiments, the inhibitor is a sgRNA for the knock-out of the GS10 coding sequence (e.g., SEQ ID NO:2) at positions 569-572 or 1209-1213.
In one or more embodiments, the sgRNA is as set forth in SEQ ID NOs 3 or 4. In one or more embodiments, the inhibitor further comprises a Cas enzyme (e.g., Cas9), its coding sequence, and/or a nucleic acid construct expressing the Cas enzyme.
In one or more embodiments, the method of down-regulating expression of a GS10 gene in a plant comprises:
(i) providing an agrobacterium carrying a nucleic acid construct that can interfere with gene expression, said nucleic acid construct containing or producing said inhibitor;
(ii) (ii) contacting a cell or tissue or organ of the plant with the Agrobacterium of step (i) thereby transferring the nucleic acid construct into the plant tissue or organ.
In one or more embodiments, the nucleic acid construct is an expression vector or a recombinant vector.
In one or more embodiments, the method of down-regulating expression of a GS10 gene in a plant further comprises:
(iii) selecting a plant tissue, organ or seed into which the nucleic acid construct has been transferred; and
(iv) (iv) regenerating the plant tissue, organ or seed of step (iii) into a plant.
In another aspect of the invention, there is provided the use of the GS10 gene as a molecular marker for identifying agronomic traits in plants.
In one or more embodiments, the agronomic trait includes: leaf length, leaf width, single plant yield, cell yield, plant height, tillering number, grain number per spike, spike length, branch number, top awn, heading period, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content and grain amylose content.
In one or more embodiments, the plant is a cereal crop.
In one or more embodiments, the plant is a graminaceous plant.
In one or more embodiments, the gramineae is rice, barley, wheat, oats, rye.
In one or more embodiments, the rice comprises indica, japonica, or a combination thereof.
In one or more embodiments, the rice is nipponica.
In one or more embodiments, the number of branches includes a primary number of branches and/or a secondary number of branches.
In one or more embodiments, the leaves include xiphoid leaves.
In one or more embodiments, the GS10 gene comprises a cDNA sequence, a genomic sequence, or a combination thereof.
In one or more embodiments, the GS10 gene is from a plant of the family poaceae, preferably from rice.
In one or more embodiments, the amino acid sequence of the GS10 gene is selected from the group consisting of seq id no:
(a) polypeptide with a sequence shown as SEQ ID NO. 1;
(b) a polypeptide which is formed by substituting, deleting or adding one or more (such as 1-20; preferably 1-10; more preferably 1-5) amino acid residues of the sequence shown in SEQ ID NO. 1, has the functions of the polypeptide (a) and is derived from the polypeptide (a); or
(c) A polypeptide derived from (a) having more than 90% (preferably 93%; more preferably 95% or 98%) homology to the polypeptide sequence of (a) and having the function of the polypeptide of (a).
In one or more embodiments, the nucleic acid sequence of the GS10 gene is selected from the group consisting of:
(1) a polynucleotide encoding a polypeptide as shown in SEQ ID NO. 1;
(2) 2 or a polynucleotide having 80% (preferably 90%; more preferably 95% or 98%) or more homology thereto;
(3) 2 at the 5 'end and/or 3' end of the polynucleotide shown in SEQ ID NO. 2, or 1-60, preferably 1-30, more preferably 1-10) nucleotides;
(4) a polynucleotide complementary to the polynucleotide of any one of (1) to (3).
The invention also provides an expression cassette for expressing the GS10 gene, wherein the expression cassette comprises the following elements from 5 'to 3': a 5' UTR region, an ORF sequence of the GS10 gene, and a terminator,
in one or more embodiments, the ORF sequence of the GS10 gene encodes:
(a) polypeptide with a sequence shown as SEQ ID NO. 1;
(b) a polypeptide which is formed by substituting, deleting or adding one or more (such as 1-20; preferably 1-10; more preferably 1-5) amino acid residues of the sequence shown in SEQ ID NO. 1, has the functions of the polypeptide (a) and is derived from the polypeptide (a); or
(c) A polypeptide derived from (a) having more than 90% (preferably 93%; more preferably 95% or 98%) homology to the polypeptide sequence of (a) and having the function of the polypeptide of (a).
In one or more embodiments, the ORF sequence of the GS10 gene comprises a nucleic acid sequence selected from the group consisting of:
(1) a polynucleotide encoding a polypeptide as shown in SEQ ID NO. 1;
(2) 2 or a polynucleotide having 80% (preferably 90%; more preferably 95% or 98%) or more homology thereto;
(3) a polynucleotide in which 1 to 60, preferably 1 to 30, more preferably 1 to 10) nucleotides are truncated or added at the 5 'end and/or 3' end of the polynucleotide shown in SEQ ID NO. 2;
(4) a polynucleotide complementary to the polynucleotide of any one of (1) to (3).
The invention also provides nucleic acid constructs comprising an expression cassette as described herein or a complement thereof.
In one or more embodiments, the nucleic acid construct is an expression vector or a recombinant vector.
The invention also provides a host cell (1) comprising a nucleic acid construct comprising an expression cassette described herein, or a complement thereof, or (2) having an expression cassette described herein integrated into the chromosome.
In one or more embodiments, the host cell is a plant cell, preferably a graminaceous plant cell, more preferably a rice cell.
In another aspect of the invention, an inhibitor targeting the GS10 gene is provided.
In one or more embodiments, the inhibitor is selected from the group consisting of: (1) a small molecule compound, an antisense nucleic acid, a microRNA, a siRNA, an RNAi, a dsRNA, a sgRNA, a specific antibody, or a combination thereof, and (2) a nucleic acid construct capable of expressing or forming (1). The inhibitor specifically interferes with the transcription and/or expression of the GS10 gene. Preferably, the inhibitor is an inhibitor targeting SEQ ID NO 1 or 2.
In one or more embodiments, the inhibitor further comprises a Cas enzyme (e.g., Cas9), its coding sequence, and/or a nucleic acid construct expressing the Cas enzyme.
In one or more embodiments, the sgrnas are as set forth in SEQ ID NOs 3 or 4.
The present invention also provides the use of an expression cassette as described herein for improving an agronomic trait in a crop selected from the group consisting of:
reducing the leaf length, increasing the leaf width, reducing the yield of a single plant, reducing the yield of a cell, reducing the plant height, reducing the tillering number, reducing the grain number per spike, reducing the spike length, reducing the number of branches, reducing the awn, shortening the heading stage, increasing the diameter of stalks, increasing the width of glumes cells, reducing the length of glumes cells, increasing the unit area of glumes cells, reducing the inclination angle of leaves, reducing the grain length, increasing the grain width, increasing the grain thickness, increasing the grain weight, increasing the BR sensitivity, negatively regulating and controlling a BR signal path, reducing the transparency of grains, increasing the compactness of starch cells of the grains, reducing the protein content of the grains and reducing the amylose content of the grains; or
Increasing leaf length, reducing leaf width, increasing single plant yield, increasing cell yield, increasing plant height, increasing tiller number, increasing grain number per spike, increasing spike length, increasing branch number, increasing top awn, prolonging heading period, reducing stem diameter, reducing glume cell width, increasing glume cell length, reducing glume cell unit area, increasing leaf inclination angle, increasing grain length, reducing grain width, reducing grain thickness, reducing grain weight, BR sensitivity, positively regulating BR signal path, increasing grain transparency, reducing grain starch cell compactness, increasing grain protein content, and increasing grain amylose content.
The invention also provides the use of the promoter element of the GS10 gene for the space-time specific expression of foreign proteins, wherein the space-time specific expression refers to the specific expression in the cytoplasm or cell membrane of cells at seedling and/or young ear stage.
The invention also provides the application of the substance for down-regulating the expression or activity of the GS10 gene and the substance for up-regulating the expression or activity of one or two genes selected from GW5 and GW7(GL7) in stabilizing the grain type, improving the appearance quality of rice and improving the aspect ratio of the rice.
In one or more embodiments, the agent that down-regulates expression or activity of the GS10 gene is an inhibitor of the GS10 gene. Preferably, the inhibitor of the GS10 gene is as described in the first aspect herein.
In one or more embodiments, the substance that upregulates the expression or activity of a gene selected from one or both of GW5 and GW7(GL7) is a gene selected from one or both of GW5 and GW7(GL7) or a protein encoded thereby, or a promoter therefor. The accelerator is selected from the following group: a small molecule compound, a nucleic acid molecule, or a combination thereof.
The invention also provides a complex formed by binding GS10 protein to a protein selected from SnRK1A, OsOFP8 and OSH 1.
In one or more embodiments, the GS10 protein is from rice.
In one or more embodiments, the amino acid sequence of the GS10 protein is selected from the group consisting of:
(a) a polypeptide with a sequence shown in SEQ ID NO. 1;
(b) a polypeptide which is formed by substituting, deleting or adding one or more (such as 1-20; preferably 1-10; more preferably 1-5) amino acid residues of the sequence shown in SEQ ID NO. 1, has the functions of the polypeptide (a) and is derived from the polypeptide (a); or
(c) A polypeptide derived from (a) having more than 90% (preferably 93%; more preferably 95% or 98%) homology to the polypeptide sequence of (a) and having the function of the polypeptide of (a).
Drawings
FIG. 1 is a rice grain type QTL mapping chart. A, based on W1943 and GLA4 recombinant inbred line; b, mapping of rice grain type related genes.
FIG. 2 is a LOD plot of rice grain type QTL GS10 location. TGW, thousand kernel weight; GL, grain length; GW, grain width; GS, aspect ratio.
FIG. 3 is a fine mapping and phenotypic analysis of GS 10. A, GLA4 and CSSL 50. Scale, 5 cm. B, comparison of the particle types of GLA4 and CSSL 50. Scale, 1 cm. C, initial positioning of GS 10. D, fine positioning of GS 10. E, the gene structure of GS 10. F-I, GLA4 and NIL-gs10 were compared for differences in grain length (F), grain width (G), thousand kernel weight (H) and grain thickness (I). Values show mean ± standard deviation (n ═ 10), differential significance test was performed using Student's t-test (P <0.05, t test).
FIG. 4 is the identification and screening of the GS10 near isogenic line. Background characterization of A-B, recombinants 2017#44-6-6(A) and NIL-gs10 (B).
FIG. 5 is a comparison of other important agronomic traits for GLA4 and NIL-gs 10. A-J, GLA4 and NIL-gs10 for comparison of differences in ear length (A), first-order branch (B), second-order branch (C), grain per ear (D), plant height (E), sword leaf length (F), sword leaf width (G), effective tiller number (H), individual plant yield (I) and plot yield (J). Values show mean ± standard deviation (n ═ 10), with Student's t-test for differential significance (P <0.05, t test).
FIG. 6 is an analysis of natural variation at the level of GS10 DNA in GLA4 and W1943. Yellow part, CDS. Grey part, UTR.
FIG. 7 is a protein alignment of GS10 in GLA4 and W1943. A-B, comparison of the differences between the GS10 gene sequence (A) and the protein sequence (B).
FIG. 8 shows transgene validation of GS 10. A, GLA4, NIL-GS10, CP-1(pGLA4:: gGS10GLA4), GS10-CR, OX (Ubi:: GS10GLA4) and CP-2(pW1943:: gGS10GLA4) were compared in grain and ear type between different rice lines. Granular Scale bar,1 cm. Spike type Scale bar,5 cm. And B-E, GLA4, NIL-GS10, CP-1, GS10-CR, OX, CP-2 and the like, and the difference statistics of grain length (B), grain width (C), thousand kernel weight (D) and ear length (E) among different rice lines. CP, complementary line; OX, overlay line. The values show the mean ± standard deviation (n ═ 20), with different letters representing significant differences between the 0.05 levels (Duncan's multiple range test).
FIG. 9 shows a comparison of grain width differences between two different complementation lines CP-1 and CP-2. A-B, comparison of grain width differences between two different complementation lines CP-1(A) and CP-2 (B). 10 strains with phenotype were selected from each strain T0 generation and subjected to T1 generation statistics (n-12).
FIG. 10 shows the plant type and expression level analysis of transgenic plants. And comparing the differences of plant types among different rice strains, such as A, GLA4, NIL-gs10, CP-1, OX, CP-2 and the like. B, analyzing the expression quantity among different rice strains such as GLA4, NIL-gs10, CP-1, OX and CP-2.
FIG. 11 shows a comparison of the differences in leaf inclination and flag leaf between strains GLA4, NIL-GS10, GS10-CR and OX. A, GLA4, NIL-GS10, GS10-CR and OX leaf inclination map. Scale, 10 cm. B, GLA4, NIL-GS10, GS10-CR and OX xiphoid map. Scale, 5 cm. C-D, GLA4, NIL-GS10, GS10-CR and OX leaf inclination and flag leaf difference data statistics. The values show the mean ± standard deviation (n ═ 20), with different letters representing significant differences between the 0.05 levels (Duncan's multiple range test).
FIG. 12 is a knockout experiment in Nipponbare using CRISPR-Cas 9. A, NIP and CR strain patterns. Scale, 10 cm. Spike patterns of NIP and CR. Scale, 5 cm. C, particle pattern of NIP and CR. Scale, 1 cm. Statistics of particle length (D) and particle width (E) for D-E, NIP and CR. CR, CRISPR-Cas 9. The values show the mean ± standard deviation (n ═ 10) and the different letters represent significant differences between 0.05 levels (Duncan's multiple range test).
FIG. 13 shows a T-DNA mutant validation experiment for GS 10. Strain diagrams of the A, HY and T-DNA mutants of gs 10. HY, hwayuong. Scale, 10 cm. B, DNA structure diagram of T-DNA mutant gs 10. C, HY, gs10 and the complementary plant HY-CP-1. Scale, 1 cm. The expression level of GS10 in the context of D, HY and GS10 varied. Data statistics for grain length (E) and grain width (F) for E-F, HY, and gs 10. The values show the mean ± standard deviation (n ═ 10), with different letters representing significant differences between the 0.05 levels (Duncan's multiple range test).
FIG. 14 is an analysis of the expression level of GS10 in rice at each stage. PA, pre-anthesis spikelet; FL, xiphoid leaf; LS, leaf sheath; TB, tillering nodules.
FIG. 15 is the subcellular localization of GS 10. Subcellular localization of GS10 in tobacco and rice protoplasts. Rice protoplast ruler, 10 μm; tobacco cell ruler, 10 μm and 20 μm. 35S GFP was used as a control.
Fig. 16 is the co-location of GS10 and GW 5. The rice protoplast scale was 20 μm.
Figure 17 shows that GS10 regulates cell size rather than cell number in glumes. Semi-thin sections of the glumes in the non-flowering spikelets in a, GLA4 and NIL-gs 10. The scale bar is, in order, 0.5cm and 50 μm. B, GLA4 and NIL-gs10 glume cell outer epidermis electron scanning microscope observation. Scale, 1mm (top) and 100 μm (bottom). C, GLA4 and NIL-gs10 statistics on the number of glume cell lengths. D, statistics of cell numbers at the GLA4 and NIL-gs10 grain length levels. E, analyzing the difference of expression quantity of related cell cycle and division genes in GLA4 and NIL-gs10 young ear stage (3-5 cm). OsActin was used as an internal reference gene, and the expression level of GLA4 was set to 1. Values show mean ± standard deviation (n ═ 3), with Student's t-test for differential significance (P <0.05, t test).
FIG. 18 shows a comparison of the protein structures of GS10, arm and CTNNB 1.
FIG. 19 shows the genetic relationship of GS10 and its homologous gene, GSL 1. A, GS10 and its homologous genes. B, GS10 and the knockout strain type of the homologous gene GSL1 in the rice. Scale, 10 cm. C, single and double knockout grain type difference comparison of GS10 and GSL1 based on ZH11 background. Scale, 1 cm. D-E, statistics of grain length and grain width of single-knockout and double-knockout plants of GS10 and GSL1 based on the background of ZH 11. The values show the mean ± standard deviation (n ═ 10) and the different letters represent significant differences between 0.05 levels (Duncan's multiple range test).
FIG. 20 is a graph showing the measurement of the expression levels of the related differentially expressed genes in the young ears of GLA4 and OX. Differential expression detection of genes related to GS10 development regulation in A-J, GLA4 and OX scion. OsActin is used as an internal reference gene, and values are shown as mean ± standard deviation (n ═ 3).
FIG. 21 is a GS10 yeast Sieve library experiment. A, GS10 and BR related gene yeast interaction verification. B, GS10 was validated for yeast interaction with OsGSK2, SnRK1A, GS9 and OsOFP 8.
FIG. 22 is a genetic analysis of plant type and grain type between GS10 and OsGSK2 transgenic lines. A, plant type comparison of different transgenic plants GS10-CR-ZH11, GS10-OX-ZH11, GS10-OsGSK2-CR, OsGSK2-CR, OsGSK2-OX and OsGSK2-RNAi based on the background of ZH 11. Scale, 10 cm. B, comparing the grain types of different transgenic plants based on the ZH11 background. Scale, 1 cm. C-D, statistics of grain width (C) and grain length (D) among different transgenic plants based on ZH11 background. The values show the mean ± standard deviation (n ═ 10) and the different letters represent significant differences between 0.05 levels (Duncan's multiple range test).
FIG. 23 is a comparison of the difference in leaf inclination between transgenic lines GS10 and OsGSK 2. A, comparison of leaf inclination differences based on the background of ZH11 for GS10-CR-ZH11, GS10-OX-ZH11, GS10-OsGSK2-CR, OsGSK2-CR, OsGSK2-OX and OsGSK 2-RNAi. Scale, 10 cm. B, leaf inclination data statistics of GS10-CR-ZH11, GS10-OX-ZH11, GS10-OsGSK2-CR, OsGSK2-CR, OsGSK2-OX and OsGSK2-RNAi based on the background of ZH 11. Analysis of GS10 expression level in C, ZH11 and GS10-OX-ZH 11. D, plant height comparison of ZH11 and GS10-OX-ZH 11.
FIG. 24 is a demonstration of the in vivo and in vitro interaction of GS10 and OsOFP 8. A, GS10 and OsOFP 8. And B, verifying the interaction of GS10 and OsOFP8 proteins in tobacco bodies based on the YFP system. The scale bars are 20 μm and 5 μm in this order. And C, GS10 and OsOFP8 protein interactive verification in tobacco bodies based on the YFP system. D, GS10 and OsOFP8 protein validation of Co-IP level in plants.
FIG. 25 is the co-localization of GS10 and OsOFP8 in plants. A, co-localization of GS10 and OsOFP8 in tobacco bodies. Scale, 40 nm. B, co-localization of rice protoplasts GS10 and OsOFP 8. Scale, 40 nm.
FIG. 26 shows that GS10 may be involved in negative regulator of rice BR signal. A, GAL4 and NIL-gs10 in the background of 0 and 1mg BL hormone treatment after 3 days of seedling growth status comparison. Scale, 1 cm. Statistics of epicotyls of 0 and 1mg BL hormone treated seedlings against a background of GAL4 and NIL-gs 10. C, GS10 protein degradation in response to BL in OX background. CBB, coomassie brilliant blue staining. Changes in the inclination of posterior leaflet treated with 0 and 1mg BL hormone in the context of D-E, GAL4 and NIL-gs 10. Statistics of the inclination of the posterior leaflet treated with 0. mu.g, 10. mu.g and 1000. mu.g BL hormones against the background of GAL4 and NIL-gs 10. The values show the mean ± standard deviation (n ═ 3). t-test significant differences: p <0.05, P <0.01, P < 0.001; NS, no significant difference.
FIG. 27 is the genetic relationship of gs10 and other major-granule QTLs. And A, comparing the grain types of different polymeric breeding materials. The polymeric breeding materials NIL-gs10/GL7, NIL-gs10/GW5, NIL-GL7/GW5 and NIL-gs10/GW5/GL7 are polymerized from NIL-gs10, NIL-GW5 and NIL-GL7/GW7 respectively. B-D, NIL-gs10, NIL-GW5, NIL-GL7/GW7, NIL-gs10/GL7, NIL-gs10/GW5, NIL-GL7/GW5 and NIL-gs10/GW5/GL7 (n-10). The values show the mean ± standard deviation, with different letters representing significant differences between the 0.05 levels (Duncan's multiple range test).
Fig. 28 shows that the polymerization of gs10 and GW5 can greatly improve the appearance quality of rice kernels.
Fig. 29 is a comparison of grain taste value, protein, fatty acid and amylose content differences based on gs10 and GW5 polymeric materials. A, GLA4, NIL-gs10, NIL-GW5 and NIL-gs10/GW5 are observed by electronic scanning of endosperm cells of different materials. Scale, 10 μm. B-E, GLA4, NIL-gs10, NIL-GW5 and NIL-gs10/GW5, the food taste value (B), the protein content (C), the amylose content (D) and the fatty acid content (E) are respectively taken as data statistics of the intrinsic quality of different material seeds. The values show the mean ± standard deviation, with different letters representing significant differences between the 0.05 levels (Duncan's multiple range test).
Fig. 30 is CRISPR-Cas9 knock-out experiment of GS10 based on Dongjing background. Genotypes of A, DJ and CR plants. B, DJ and CR. Scale, 10 cm. Spike type of C, DJ and CR. Particle type comparison of D, DJ and CR. Scale, 1 cm. Statistics of grain length (E) and grain width (F) for E-F, DJ and CR. CR, CRISPR-Cas 9. The values show the mean ± standard deviation (n ═ 10), with different letters representing significant differences between the 0.05 levels (Duncan's multiple range test).
Detailed Description
The inventor firstly discloses that the agronomic traits of plants can be obviously regulated by directionally regulating the expression level of GS10 gene in cereal plants (crops), thereby achieving the purposes of improving the quality of the cereal crops, increasing the yield and the like. The agronomic trait is selected from one or more of the group consisting of: leaf length, leaf width, single plant yield, cell yield, plant height, tiller number, grain number per ear, ear length, branch number, top awn, heading stage, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content and grain amylose content.
As used herein, a "cereal crop" may be a graminaceous plant or a miscanthus (crop). Preferably, the gramineous plant is rice, barley, wheat, oats, rye. Miscanthus sinensis refers to a plant having needles on the seed husk. As used herein, the term "crop" or "crop" is not particularly limited, including but not limited to: rice, wheat, barley, etc.
As used herein, the polypeptide encoded by the GS10 gene is designated "GS 10". In the present invention, the term "GS 10" refers to a polypeptide of SEQ ID NO. 1 having GS10 activity. The term also includes variants of the sequence of SEQ ID NO:1 having the same function as GS 10. These variants include (but are not limited to): deletion, insertion and/or substitution of several (usually 1 to 50, preferably 1 to 30, 1 to 20, 1 to 10, 1 to 8, 1 to 5) amino acids, and addition or deletion of one or several (usually up to 20, preferably up to 10, more preferably up to 5) amino acids at the C-terminal and/or N-terminal. For example, in the art, substitutions with amino acids of similar or similar properties will not generally alter the function of the protein. Amino acids with similar properties are often referred to in the art as families of amino acids with similar side chains, which are well defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, lactic acid, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine tryptophan, histidine). Also, for example, the addition of one or more amino acids at the amino-and/or carboxy-terminus will not generally alter the function of the polypeptide or protein. Conservative amino acid substitutions for many commonly known non-genetically encoded amino acids are known in the art. Conservative substitutions of other non-coding amino acids may be determined based on a comparison of their physical properties with those of genetically coded amino acids.
Variants of the polypeptides include: homologous sequences, conservative variants, allelic variants, natural mutants, induced mutants.
Any polypeptide having high homology to GS10 (e.g., 70% or greater homology to the sequence shown in SEQ ID NO: 1; preferably 80% or greater homology; more preferably 90% or greater homology, e.g., 95%, 98% or 99%) and having a similar or identical function to GS10 is also encompassed by the present invention. The same or similar functions are mainly used for regulating and controlling the agronomic traits of crops (such as rice).
The invention also includes analogs of the claimed polypeptides. These analogs may differ from the native SEQ ID NO:1 by amino acid sequence differences, by modifications that do not affect the sequence, or by both. Analogs of these proteins include natural or induced genetic variants. Induced variants can be obtained by various techniques, such as random mutagenesis by irradiation or exposure to mutagens, site-directed mutagenesis, or other well-known biological techniques. Analogs also include analogs having residues other than the natural L-amino acids (e.g., D-amino acids), as well as analogs having non-naturally occurring or synthetic amino acids (e.g., beta, gamma-amino acids). It is to be understood that the proteins of the present invention are not limited to the representative proteins exemplified above.
Modified (generally without altering primary structure) forms include: chemically derivatized forms of the protein in vivo or in vitro such as acetoxylation or carboxylation. Modifications also include glycosylation, such as those performed during protein synthesis and processing. Such modification may be accomplished by exposing the protein to an enzyme that performs glycosylation, such as mammalian glycosylating or deglycosylating enzymes. Modified forms also include sequences having phosphorylated amino acid residues (e.g., phosphotyrosine, phosphoserine, phosphothreonine).
The polypeptide fragment, derivative or analogue of the invention may be: (i) polypeptides in which one or more conserved or non-conserved amino acid residues (preferably conserved amino acid residues) are substituted, and such substituted amino acid residues may or may not be encoded by the genetic code; or (ii) a polypeptide having a substituent group in one or more amino acid residues; or (iii) a polypeptide formed by fusing the mature polypeptide to another compound, such as a compound that increases the half-life of the polypeptide, e.g., polyethylene glycol; or (iv) a polypeptide formed by fusing an additional amino acid sequence to the polypeptide sequence (e.g., a leader or secretory sequence or a sequence used to purify the polypeptide or a proprotein sequence, or a fusion protein). Such fragments, derivatives and analogs are within the scope of those skilled in the art as defined herein.
In addition, any biologically active fragment of GS10 can be used in the present invention. Herein, a biologically active fragment of GS10 is meant to be a polypeptide that still retains all or part of the function of full-length GS 10. Typically, the biologically active fragment retains at least 50% of the activity of full-length GS 10. Under more preferred conditions, the active fragment is capable of retaining 60%, 70%, 80%, 90%, 95%, 99%, or 100% of the activity of full-length GS 10.
The invention also relates to polynucleotide sequences encoding the GS10 of the invention or variants, analogs, derivatives thereof. The polynucleotide may be in the form of DNA or RNA. The form of DNA includes cDNA, genomic DNA or artificially synthesized DNA. The DNA may be single-stranded or double-stranded. The DNA may be the coding strand or the non-coding strand. The sequence of the coding region encoding the mature polypeptide may be identical to the sequence of the coding region shown in SEQ ID NO. 2 or may be a degenerate variant.
The present invention also relates to variants of the above polynucleotides encoding fragments, analogs and derivatives of the polypeptides having the same amino acid sequence as the present invention. The variant of the polynucleotide may be a naturally occurring allelic variant or a non-naturally occurring variant. These nucleotide variants include substitution variants, deletion variants and insertion variants. As is known in the art, an allelic variant is a substitution of a polynucleotide, which may be a substitution, deletion, or insertion of one or more nucleotides, without substantially altering the function of the polypeptide encoded thereby. As used herein, degenerate variants refer in the present invention to nucleic acid sequences which encode a protein having SEQ ID NO. 1, but differ from the sequence of the coding region shown in SEQ ID NO. 2. A "polynucleotide encoding a polypeptide" may be a polynucleotide comprising a sequence encoding the polypeptide, or may further comprise additional coding and/or non-coding sequences.
The present invention also relates to polynucleotides which hybridize to the sequences described above and which have at least 50%, preferably at least 70%, more preferably at least 80% identity between the two sequences. The present invention particularly relates to polynucleotides which hybridize under stringent conditions to the polynucleotides of the present invention. In the present invention, "stringent conditions" refer to (1) hybridization and elution at a lower ionic strength and a higher temperature, such as 0.2 XSSC, 0.1% SDS, 60 ℃; or (2) adding denaturant during hybridization, such as 50% (v/v) formamide, 0.1% calf serum/0.1% Ficoll, 42 deg.C, etc.; or (3) hybridization occurs only when the identity between two sequences is at least 90% or more, preferably 95% or more. And, the polypeptides encoded by the hybridizable polynucleotides have the same biological functions and activities as the mature polypeptide of SEQ ID NO. 1.
It is to be understood that although the genes provided in the examples of the present invention are derived from rice, the gene sequences of GS10 derived from other similar plants (particularly plants belonging to the same family or genus as rice) and having a certain homology (e.g., greater than 70%, such as 80%, 85%, 90%, 95%, or even 98% sequence identity) with the sequence of the present invention (preferably, the sequence is shown in SEQ ID NO: 1) are also included in the scope of the present invention, as long as the sequence can be easily isolated from other plants by one skilled in the art after reading the present application, based on the information provided herein. Methods and means for aligning sequence identity are also well known in the art, for example BLAST.
The full-length GS10 nucleotide sequence or its fragment of the present invention can be obtained by PCR amplification, recombination or artificial synthesis. For PCR amplification, primers can be designed based on the nucleotide sequences disclosed herein, particularly open reading frame sequences, and the sequences can be amplified using a commercially available DNA library or a cDNA library prepared by conventional methods known to those skilled in the art as a template. When the sequence is long, two or more PCR amplifications are often required, and then the amplified fragments are spliced together in the correct order. Once the sequence of interest has been obtained, it can be obtained in large quantities by recombinant methods. Usually, it is cloned into a vector, transferred into a cell, and then isolated from the propagated host cell by a conventional method to obtain the relevant sequence.
In addition, the sequence of interest can be synthesized by artificial synthesis, especially when the fragment length is short. Generally, fragments with long sequences are obtained by first synthesizing a plurality of small fragments and then ligating them. At present, DNA sequences encoding the proteins of the present invention (or fragments or derivatives thereof) have been obtained completely by chemical synthesis. The DNA sequence may then be introduced into various existing DNA molecules (or vectors, for example) and cells known in the art. Furthermore, mutations can also be introduced into the protein sequences of the invention by chemical synthesis.
The invention also provides a recombinant vector comprising the gene of the invention. In a preferred embodiment, the promoter downstream of the recombinant vector comprises a multiple cloning site or at least one cleavage site. When it is desired to express the target gene of the present invention, the target gene is ligated into a suitable multiple cloning site or restriction enzyme site, thereby operably linking the target gene with the promoter. As another preferred mode, the recombinant vector comprises (in the 5 'to 3' direction): a promoter, a gene of interest, and a terminator. If desired, the recombinant vector may further comprise an element selected from the group consisting of: a 3' polyadenylation signal; an untranslated nucleic acid sequence; transport and targeting nucleic acid sequences; resistance selection markers (dihydrofolate reductase, neomycin resistance, hygromycin resistance, green fluorescent protein, etc.); an enhancer; or operator. Exemplary vectors are for example pNCGR-OX.
Methods for preparing recombinant vectors are well known to those of ordinary skill in the art. The expression vector may be a bacterial plasmid, a bacteriophage, a yeast plasmid, a plant cell virus, a mammalian cell virus, or other vector. In general, any plasmid and vector may be used as long as it can replicate and is stable in the host.
One of ordinary skill in the art can construct expression vectors containing the genes described herein using well known methods. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, in vivo recombinant techniques, and the like. When the gene of the invention is used for constructing a recombinant expression vector, any one of enhanced, constitutive, tissue-specific or inducible promoters can be added in front of the transcription initiation nucleotide.
Vectors comprising the gene, expression cassette or gene of the invention may be used to transform appropriate host cells to allow the host to express the protein. The host cell may be a prokaryotic cell, such as E.coli, Streptomyces, Agrobacterium; or lower eukaryotic cells, such as yeast cells; or higher eukaryotic cells, such as plant cells. It will be clear to one of ordinary skill in the art how to select an appropriate vector and host cell. Transformation of a host cell with recombinant DNA may be carried out using conventional techniques well known to those skilled in the art. When the host is a prokaryote (e.g., Escherichia coli), the cells may be treated by the CaCl2 method or may be electroporated. When the host is a eukaryote, the following DNA transfection methods may be used: calcium phosphate coprecipitation, conventional mechanical methods (e.g., microinjection, electroporation, liposome encapsulation, etc.). The transformed plant may be transformed by methods such as Agrobacterium transformation or biolistic transformation, for example, leaf disc method, immature embryo transformation, flower bud soaking method, etc. The transformed plant cells, tissues or organs can be regenerated into plants by conventional methods to obtain transgenic plants. When expressed in higher eukaryotic cells, the polynucleotide will provide enhanced transcription when an enhancer sequence is inserted into the vector. Enhancers are cis-acting elements of DNA, usually about 10 to 300 bp in length, that act on a promoter to increase gene transcription.
It will be clear to one of ordinary skill in the art how to select appropriate vectors, promoters, enhancers and host cells.
The polypeptides described herein may be expressed intracellularly, or on the cell membrane, or secreted extracellularly. If necessary, the recombinant protein can be isolated and purified by various separation methods using its physical, chemical and other properties. These methods are well known to those skilled in the art. Examples of such methods include (but are not limited to): conventional renaturation treatment, treatment with a protein precipitant (such as salt precipitation), centrifugation, cell lysis by osmosis, sonication, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, High Performance Liquid Chromatography (HPLC), and other various liquid chromatography techniques and combinations thereof.
Transformation of a host with recombinant DNA may be carried out by conventional techniques well known to those skilled in the art. The transformed plant may be transformed by methods such as Agrobacterium transformation or particle gun transformation, for example, spray method, leaf disk method, rice immature embryo transformation method, etc. The transformed plant tissue or organ can be regenerated into a plant by a conventional method, thereby obtaining a plant with modified traits.
The invention provides application of the GS10 gene to regulation and control of plant agronomic traits; or for screening substances useful for regulating agronomic traits in plants (i.e., the substances regulate agronomic traits in plants by modulating the expression of the GS10 gene). Preferably, the GS10 gene can be used for increasing agronomic traits.
The invention also relates to GS10 up-regulation agents or inhibitors and uses thereof. Because the up-regulator or the inhibitor of GS10 can regulate the expression and/or the activity and the like of GS10, the up-regulator or the inhibitor can also regulate and control the agronomic traits of plants by influencing GS10, thereby achieving the aim of improving the plants.
In one aspect, any substance that increases the activity of GS10, increases its stability, promotes its expression, prolongs its effective duration, or promotes its gene transcription and translation can be used in the present invention as an "enhancer" of the GS10 gene for the regulation of agronomic traits in plants. For example, an expression vector which enhances the transcription, expression or activity of the GS10 gene.
In another aspect, any substance that reduces the activity, reduces the stability, inhibits the expression, reduces the duration of action, or reduces the transcription and translation of GS10 can be used in the present invention as a down-regulator, antagonist, or inhibitor of GS10, such as an interfering molecule that interferes with the expression of the GS10 gene (e.g., an interfering molecule that can form microRNA). The inhibitor, antagonist or inhibitor can be used for regulating and controlling the agronomic traits of plants. Methods for making interfering molecules that interfere with the expression of a particular gene, once the target sequence is known, are well known to those skilled in the art.
Furthermore, to down-regulate GS10 gene expression or activity, a gene knockout vector can be introduced into the cell and/or the gene can be knocked out or knocked down using gene editing techniques such as ZFNs, TALENs, or CRISPR/Cas 9. ZFN, TALEN and CRISPR/Cas9 technologies suitable for use in the present invention are well known in the art. Each technique realizes the knockout of a target gene through the coaction of a DNA recognition domain and an endonuclease.
The invention also relates to a method for modulating an agronomic trait in a plant comprising modulating expression or activity of a GS10 gene in said plant. The up-regulation of the expression or activity of the GS10 gene can reduce the leaf length, increase the leaf width, reduce the yield of a single plant, reduce the cell yield, reduce the plant height, reduce the tillering number, reduce the grain number per spike, reduce the spike length, reduce the stem number, reduce the top awn, shorten the heading period, increase the stem diameter, increase the glume cell width, reduce the glume cell length, increase the unit area of the glume cell, reduce the leaf inclination angle, reduce the grain length, increase the grain width, increase the grain thickness, increase the grain weight, BR sensitivity, negatively regulate and control a BR signal path, reduce the grain transparency, increase the grain starch cell compactness, reduce the grain protein content and reduce the grain amylose content. The expression or activity of the GS10 gene is reduced, so that the leaf length can be increased, the leaf width can be reduced, the yield of a single plant can be increased, the yield of a cell can be increased, the plant height can be increased, the tillering number can be increased, the grain number per spike can be increased, the spike length can be increased, the number of branches can be increased, the awn can be increased, the heading stage can be prolonged, the diameter of a stem can be reduced, the width of glume cells can be reduced, the length of glume cells can be increased, the unit area of the glume cells can be reduced, the inclination angle of the leaves can be increased, the grain length can be increased, the grain width can be reduced, the grain thickness can be reduced, the grain weight can be reduced, the BR sensitivity can be improved, the BR signal channel can be positively regulated, the grain transparency can be increased, the grain starch cell compactness can be reduced, the protein content of the grain can be increased, and the amylose content of the grain can be increased.
In one aspect, the present invention provides a method of modulating an agronomic trait in a plant (such as a crop), the method comprising: over-expressing the GS10 gene in the plant to increase the agronomic trait. After knowing the use of the GS10 gene, various methods well known to those skilled in the art can be used to modulate the expression of the GS10 gene. For example, GS10, which can deliver an expression unit (e.g., an expression vector, a virus, etc.) carrying the GS10 gene to a target and allow expression of the activity, can be achieved by a method known to those skilled in the art.
In one embodiment of the present invention, the GS10 gene is cloned into an appropriate vector by a conventional method, and the recombinant vector containing the foreign gene is introduced into a plant tissue or organ to express the GS10 gene in the plant. Plants overexpressing the GS10 gene can be obtained by regenerating the plant tissues or organs into plants.
In another aspect, the present invention provides another method of modulating an agronomic trait in a plant (e.g., a crop), the method comprising: reducing expression of a GS10 gene in the plant (including making no or low expression of a GS10 gene); thereby reducing the agronomic characters.
Various methods well known to those skilled in the art can be used to reduce or delete the expression of the GS10 gene, such as delivering an expression unit (e.g., expression vector or virus, etc.) carrying the antisense GS10 gene to a target such that the cells or plant tissues do not express or express GS10 is reduced. Alternatively, the GS10 gene can be knocked out by approaches known to those skilled in the art, and/or the GS10 gene can be knocked out or knocked out using gene editing techniques such as ZFNs, TALENs, or CRISPR/Cas 9.
The invention also provides the use of the promoter element of the GS10 gene for the spatiotemporal specific expression of foreign proteins, wherein the spatiotemporal specific expression refers to the specific expression in the cytoplasm or cell membrane of cells at seedling and/or young ear stage.
The invention also provides the application of the substance for down-regulating the expression or activity of the GS10 gene and the substance for up-regulating the expression or activity of one or two genes selected from GW5 and GL7 in stabilizing the grain type, improving the appearance quality of rice and improving the length-width ratio of the rice. In one or more embodiments, the agent that down-regulates the expression or activity of the GS10 gene is an inhibitor of the GS10 gene. Preferably, the inhibitor of the GS10 gene is as described in the first aspect herein. In one or more embodiments, the substance that upregulates the expression or activity of one or both of the genes selected from GW5 and GL7/GW7 is a gene selected from one or both of GW5 and GL7/GW7 or a protein encoded thereby, or a promoter therefor. Methods for up-regulating the expression or activity of one or both of the genes for GW5 and GL7/GW7 are the same as those described elsewhere herein, e.g., vectors for expression of GW5 or GL7/GW7 are transferred or integrated into plants.
The invention also provides a complex formed by binding GS10 protein to a protein selected from SnRK1A, OsOFP8 and OSH 1. In one or more embodiments, the GS10 protein is from rice. In one or more embodiments, the amino acid sequence of the GS10 protein is selected from the group consisting of seq id no: (a) polypeptide with a sequence shown as SEQ ID NO. 1; (b) a polypeptide which is formed by substituting, deleting or adding one or more (such as 1-20; preferably 1-10; more preferably 1-5) amino acid residues of the sequence shown in SEQ ID NO. 1, has the functions of the polypeptide (a) and is derived from the polypeptide (a); or (c) a polypeptide derived from (a) which has more than 90% (preferably 93%; more preferably 95% or 98%) homology to the polypeptide sequence of (a) and which has the function of the polypeptide of (a).
Other aspects of the invention will be apparent to those skilled in the art in view of the disclosure herein. The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Experimental procedures without specific conditions noted in the following examples, molecular cloning is generally performed according to conventional conditions such as Sambrook et al: the conditions described in the Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,1989), or according to the manufacturer's recommendations.
Examples
Materials and methods
1 Rice Material
The gene mapping of GS10 in this experiment was from phenotypic studies and data analysis of recombinant Inbred Lines (BILs, Backcross infected Lines) of oryza sativa indica, europa japonica l.ssp GLA4 and oryza sativa (o.rufipogon W1943). Both the primary and fine positioning originate from the single-Segment replacement system CSSL50(CSSLs, Chromosome Segment localization Lines). The basic transformation materials referred to herein are the Asian cultivated rice Nipponbare (Nipponbare), Dongjin and Hwayoung, respectively. Both wild rice and oryza sativa material involved in population genetic analysis were collected, stored and subjected to low abundance sequencing by the inventors.
2 Experimental methods
The DNA extraction methods referred to in this paper are mainly DNA crude extraction methods (TPS method and CTAB method) and plasmid major extraction. The conventional Trizol method was used for RNA extraction, and the ReverTra Ace qPCR RT Master Mix with gDNA Remover kit from TOYOBO was used for RNA inversion into cDNA.
The real-time quantitative PCR instrument selected in the quantitative experiment is an Applied Biosystems Q5 real time PCR instrument, and the reagent is THUNDERBIRD from TOYOBOThe qPCR Mix kit and OsActin in the paper are used as sample internal references.
In the examples, the following experimental methods, paraffin section, seedling hormone treatment, preparation of rice protoplast, PEG-mediated transformation of rice protoplast, RACE experiment, Co-IP experiment, construction and transformation of rice embryo callus transgenic material, statistical measurement of rice glume cell and rice endosperm quality by scanning electron microscopy, subcellular localization experiment, bimolecular fluorescence complementation experiment (BiFC & BiLC), yeast sieve bank and yeast two-hybrid experiment were also used. The above experiments were performed according to conventional conditions such as Sambrook et al, molecular cloning: the conditions described in the laboratory manual, or according to the manufacturer's recommendations.
3 data statistics and analysis
All data analyses were based on setting up multiple replicates and differential test analyses were performed using Duncan's multiple range test and Student's t-test. The mapping software referred to in this paper is mainly Microsoft Excle and Graphpad Prism 8.0. In this, specific SNP calling data were referenced to laboratory published data when analyzing wild and cultivated rice populations for GS10 (Huang et al, 2012). Protein evolutionary tree analysis was generated in MEGA6 software based on NCBI database selection of protein sequences for specific rice genes. The DNA, RNA and protein reference sequences of The relevant genes involved in this study were referenced to IRGSP 1.0(The Rice mutation Project Database) and MSU (Rice Genome mutation Release 7) databases.
Example 1, identification of genotype-associated genes in the recombinant inbred line populations of W1943 and GLA4
Common wild rice o.rufipogon and oryza sativa o.sativa l. are the most typical resources in AA genes in classified populations of rice genera to study rice domestication and evolution (Chen et al, 2019). In the past, polymorphic domestication analysis of 446 wild rice and 950 oryza sativa worldwide located 55 naturally selected domesticated regions (Huang et al, 2012 a). In order to verify and analyze selective sweet, a set of low abundance sequenced wild rice with indica rice background Guangdong dwarf 4(GLA4) and wild rice with japonica rice background (W1943) which are available in laboratories are utilized to generate a recombinant inbred line after hybridization, and a series of QTLs (shown in figure 1 and A) related to the grain type trait are positioned through a whole genome SNP marker. Among these candidate QTLs, GW5/qSW5/GSE5, GL7/GW7, and GL6 were cloned in conventional studies, and among them, GW5, which is the major gene controlling the GRAIN width of CHROMOSOME 5 and has the highest contribution rate, was selected and studied at site GRAIN SIZE ON CHROMOSOME 10(GS10) having a phenotype contribution rate of about 10% (fig. 1, B).
Example 2 Primary and Fine positioning of GS10
In the population of recombinant inbred lines BILs, we initially selected, by examining the grain type traits (grain length, grain width and aspect ratio), a single-fragment replacement line CSSL50 which was replaced by wild rice W1943 only at 16Mb-23.66Mb (IRGSP-1.0) in the context of chromosome 10. In the BIL population, the characteristics such as grain length, grain width and length-width ratio are combined with WinQTL and the like by combining with related molecular markers, mainly insertion and deletion molecular markers (Indel markers), and linkage analysis is carried out, so that a grain type gene locus exists on a chromosome 10, and the association degree between the grain type gene locus and the grain width and thousand grain weight is highest (figure 2). We observed the agronomic traits of CSSL50 rice related grain type, and found that the replacement line showed significant differences in both grain length and grain width compared to the parent GLA4 (FIG. 3, A). Compared to the parental GLA4, CSSL59 increased grain length by 4%, but decreased grain width and thousand kernel weight by about 9% and 10%, respectively (fig. 3, B). In 6 months of 2015, GLA4 is used as a recurrent hybrid male parent and CSSL50 is used as a female parent to obtain hybrid F1And (4) generation. In 2015 winter, we will F1Under generation of seed, F is obtained2Segregating population seeds of generations and seeding F in summer 20162Seed generation, and 155 strains F2The population was initially mapped (FIG. 3, C).
During initial localization, we initially placed one deletion and insertion molecular marker (In-del marker) approximately every 500kb, and finally a total of 17 valid markers. The gene location was located between OP10-25i (19.44Mb) and OP10-11(22.05Mb) by phenotypic and genotypic correlations, still within the selective sweet range (Huang et al, 2012). Subsequently, In-del and SNP marker mapping were continued for the important recombinants 2016#26-1-4, 2016#27-2-3, 2016#27-4-5, and 2016# 27-5-4. Due to the limited number of recombinants and recombination exchange conditions, the genes are preliminarily positioned between 20.576Mb and 21.33Mb, and are deviated from a selection range (selective sweet), and the domesticated genes related to the selective sweet are predicted to be probably not used for controlling the grain width isophenotype. In pair F2The analysis of the grain phenotype of the population revealed that the genotype was classified into 91 strain GLA4 genotype and 71 strain W1943 genotypeTaking the progeny as an example, the grain type controlled by GS10 has the advantages that the grain length is lengthened by 3%, the grain width is narrowed by 8%, the grain weight is reduced by 9% and the grain thickness is reduced by 4% in wild rice and cultivated rice, and the analysis results show that GS10 is a QTL for negative regulation of the grain type.
After obtaining the precise position of the grain width gene, 5250 effective groups are planted in the summer of 2017 by using a method of using recombinants to assist a large group, and finally the gene is positioned between two important recombinants of 2017#19-3-10 and 2017#26-1-4 by continuously reducing the genetic distance. We annotated at BC by combining RAP-DB IRGSP 1.0(The Rice Annotation Project Database) and MSU genes2F4GS10 was further localized in SNP10-6 and SNP10-3, approximately in the range of 10kb, and the gene showed semi-dominant on the grain type (FIG. 3, D). Only one gene in this 10kb range, including the coding region of the gene and the range of about 2kb before the start of transcription, was used as a candidate gene for GS 10. The progeny of two important recombinants, which were planted and the phenotype of the grain type-related trait of the segregating population of the progeny, were not phenotyped, and the result confirmed that the candidate gene was located in the 10kb range. We refer to database analysis that there are only 1 gene annotated locus within the range of candidate genes and this locus encodes a novel unknown armadillo (arm) protein with 6 tandem repeats as candidate genes (fig. 3, E).
Example 3 phenotypic Observation analysis of agronomic traits
In 2017 winter, to further access the small segment near isogenic lines, we screened 4100 individuals in the greenhouse through 288-well seedling culture trays (tray), and screened 14 recombinants using Indel molecular markers G10-20.401, G10-20.576 and G10-20.838 as molecular markers. Among these recombinants, we used G10-20.401 and G10-20.838 as GLA4 type and G10-20.576 as heterozygote type as ideal for the final recombinants, and carried out next generation sequencing with low abundance of 2 individuals containing about 220kb heterozygote type among 14 recombinants (FIG. 4). We planted these two individuals in the greenhouse and performed progeny seed collection, planting and SNP sequencing. Finally, we have obtained a near isogenic line containing approximately the fragment from W1943170 kb, which contains 8 genes and contains GS 10.
Meanwhile, in summer of 2018, a near isogenic line NIL-GS10 containing a GS10 fragment and having an approximate 170kb W1943 homozygous mutation type is selected, wild type GLA4 is used as a contrast, and important agronomic characters such as sword leaf length, sword leaf width, single plant yield, cell yield, plant height, effective tiller number, grain number per spike, spike length, primary branch stalk, secondary branch stalk and the like are observed and counted respectively. Interestingly, NIL-gs10 showed a marked increase in yield per plant but no change in yield per plot compared to GLA4 with top miscanthus, 4 days later in heading, reduced stalk diameter and longer long ear type. The above results indicate that GS10 is a "pleiotropic" QTL (FIG. 5).
Example 4 polymorphism analysis of GS10
After the candidate gene was obtained, the full-length cDNA was then analyzed for polymorphisms. We combined the database predicted sequence comparison analysis and the determination of GS10 transcripts by 5 'RACE and 3' RACE, and found that there were 2 In-del markers and 3 SNP changes In both parental cDNAs, using GLA4 as reference (FIG. 6). The results showed that a deletion of one 3 bases was present in the wild rice 5' UTR, and that there were 2 SNP changes in the Coding region Coding sequence (CDS) of the gene which did not result in amino acid changes, located at positions +432 and +1599, respectively. Notably, in NIL-gs10, there was a five base deletion in the sequence (CCGCC), and protein translation was predicted to cause a frameshift mutation in wild rice (FIG. 7).
Example 5 verification of transgenic experiments
5.1 transgene validation against GLA4
As we did not exclude the influence of promoter region around 2kb on the phenotype of the granule type in the previous mapping, we first constructed the complementary vector CP-1(pGLA4:: gGS 10) in order to further verify whether the candidate gene GS10 is a QTL producing granule type changes or not (pGLA4:: gGS 10)GLA4) And CP-2(pW1943:: gGS 10)GLA4) By selecting from different parents GLA4And the Promoter of W1943 were fused to pCAMBIA1300 from genomic DNA derived from GLA4, respectively, and transformed into NIL-gs10 (FIG. 8, A). Each complementary line is at T0At least 30 plants are obtained in generation, the grain phenotype of each individual plant is preliminarily counted, and 10 individual plants are selected for T1The agronomic traits were carefully counted and the results showed that both of these complementary lines were able to complement the grain type of NIL plants compared to wild-type GLA4, with lines in the aspect ratio even exceeding the phenotype of GLA4 and appearing as short rounded grains (FIG. 9). In addition, we made statistics on the panicle type and the plant type of CP-1 and CP-2, and the results showed that GS10 shows a tendency to shorten with increasing dose, and the plant type becomes more compact (FIG. 10).
At the same time, we made Over-expression (vector: pNCGR-OX) Over-expression (OX, Ubi:: GS10) with GLA4 as background materialGLA4) And knockout experiments (GS10-CR) (FIG. 8, A). In overexpression experiments, we used ubiquitin (ubi) as a promoter to drive the CDS of the GLA4 coding region at T0In the generation, 13 plants are obtained. According to phenotype statistics of OX transgenic material offspring, the expression level of GS10 is greatly improved compared with wild type GLA4 (figure 10, B), but the change on the grain type level is weak, the change on the grain length is only 3%, and the plant type and the spike type do not change obviously (figure 8, B). In GS10-CR transgenic experiment, the coding region of the gene designs double targets to obtain large-fragment-deleted homozygous transgenic plants. Compared with the wild-type GLA4, GS10-CR has significant changes in both grain type (grain length and grain width) and plant type levels (FIG. 8, B-E), and particularly GS10-CR can be reduced by about 10% in grain width, which is similar to the grain type and spike type of NIL-GS10 (FIG. 10). In addition, we also performed overexpression experiments with the green fluorescent protein GFP tag in the context of NIL-GS10 Ubi:GS 10-GFP, which obtained transformation of 11 individuals in total, and each individual showed a tendency of short round grain type, indicating that GS10 had a certain dose effect on grain type regulation. As described above, the site where GS10 really functions in regulatory grain type is not involved in promoter and UTR changes but in coding region CDS, and allele from wild rice W1943 causes frame shiftAnd (4) mutation.
We analyzed the characteristics of inclination angle and length of leaves of main sword-shaped spikes after GLA4, NIL-GS10, GS10-CR and OX grouting, and found that the inclination angle of the leaves of NIL-GS10 and GS10-CR transgenic plants is increased in comparison with GLA4 in leaf inclination angle phenotype and is looser in plant type (FIG. 11, A); NIL-GS10 and GS10-CR transgenic plants showed greater length in comparison to GLA4 in relation to the length of the flag leaf (FIG. 11, B). In contrast, over-expressed material OX showed a tendency to decrease in either leaf inclination or flag compared to wild-type GLA 4. Taken together, the above results indicate that GS10 regulates leaf inclination and flag leaf length in addition to yield traits such as grain length, grain width and grain weight, presumably to be involved in plant type regulation (fig. 11, C-D).
5.2 CRISPR-Cas9 knockout validation under Nippon weather background
The CRISPR-Cas9 experiment is carried out on japonica rice material Nip by using the same knockout target point, and the PAM site selected in the experiment is consistent with GS 10-CR. We initially selected 2 independent T1Homozygous lines NIP-CR-1 and NIP-CR-2 (FIG. 12, A), in which T is expressed2Phenotype statistics was performed on the progeny of homozygous line, and the results showed that the panicle type of the knocked-out plants showed a tendency to lengthen, which is consistent with the knocking-out results in the background of indica rice (fig. 12, B). On the other hand, both knockout lines showed a tendency of longer grain length, narrower grain width and smaller grain thickness in grain type trait (FIG. 12, C-E). From the above results, we can find that GS10 has great variation in japonica rice material regardless of grain width, thousand grain weight and plant type, and is more obvious than the related phenotype of indica rice material GLA 4.
5.3 reverse complementation verification of T-DNA insertion mutants
We obtained Hwayoung (HY) rice mutants of T-DNA in the T-DNA (transfer DNA) mutant pool of the laboratory of professor Ann Korea (Gene An) (FIG. 13, A), and isolated F progeny after crossing the mutants2Phenotypic studies and linkage analysis were performed to find that the mutant insertion was indeed linked to the grain width phenotype. We design primers at both ends of T-DNA and combine one generationAfter sequencing, the T-DNA was found to be inserted 1480bp upstream of the CDS of GS10 (FIG. 13, B). Compared with wild type HY, GS10 has significant changes in grain length, grain width and plant type, grain length increased by 15%, grain width decreased by 6% and loose plant type shown (FIG. 13, C), which are very similar to GS10-CR and NIL-GS 10. The expression level of GS10 was reduced by 60% in the insertion mutant plants compared to the wild type HY (FIG. 13, D). At the same time, we combined the complementation experiment with pGLA4 with the GLA4 genotype gGS10GLA4The vector was introduced into a mutant gs10 on a p2300 vector to obtain 15 positive Ts in total0And (4) single plants. We are at T for complementary materials1Statistics of the phenotype of transgenic plants in generations revealed that the phenotype of wild type could not be fully restored in both grain length and grain width levels in progeny of transgenic plants represented by HY-CP-1 compared with wild type (FIG. 13, C). The identification and reverse complementation of this T-DNA insertion mutant further confirmed that GS10 plays an important role in both grain and plant type traits (FIG. 13, E-F). By combining a transgenic experiment taking GLA4 as background and a gene knockout experiment in Nipponbare background, we can conclude that GS10 can cause the change of rice grain type and plant type no matter under the regulation of a promoter or a coding region.
Example 6 subcellular localization and expression profiling
To further explore the changes in expression of GS10, we sampled the GLA4 parent material at the 2-week seedling stage of leaf, flag leaf, leaf sheath, scion and stalk node, respectively. After RNA extraction, we then performed qRT-PCR experiments, which showed no specific expression of GS10 from juvenile to reproductive phase. Wherein, the expression quantity of GS10 in the seedling stage, the sword leaf stage and the young ear stage of each stage is higher than that of the leaf sheath, the endosperm and the stalk node. At the same time, we also constructed a transformation vector with the self-promoter from GLA4 driving the reporter gene GUS (beta-glucuronidase, beta-glucuronide) after transformation with ZH11 at T2After homozygous plants are obtained in generation, GS10 is highly expressed in seedlings and young ears by color development through reaction with substrates in a reaction solution (figure 14).
In subcellular localization experiments, GLA4-GS10-CDS was ligated to the N-and C-termini of pCAMBIA1300-GFP, respectively, and these two vectors were transferred into Agrobacterium GV3101-P19, respectively, and the tobacco leaf lower epidermis was injected, and signals were observed under fluorescence 72 hours later. The experimental results showed that, regardless of whether GS10 was linked to the N-terminus or C-terminus of GFP, GS10 was localized in the cytoplasm or on the cell membrane of tobacco cells without being affected by the difference in GFP protein structure (fig. 15). We also constructed gs10 from W1943 allele, and linked GFP in front of the protein gs10 (GFP-gs10), and found that the protein distribution in plant leaves is diffuse, and the position of protein expression cannot be traced.
Protein structure prediction showed no transmembrane protein domain, and we combined with its homologous protein position to predict that GS10 might be localized inside the cell membrane, as well as binding to cytoskeletal or intracellular membrane proteins as the homologous protein in animal cells. We also verified that GS10 fused to the N-terminus of GFP in rice protoplasts, and marked NLS-RFP as a nuclear localization marker, indicating that GS10 might localize in the cytoplasm. We then co-localized with the key granulocytic gene GW5, which is known to be localized to the cell membrane. In experimental design, GW5 fused RFP protein and GS10 fused GFP protein were introduced into rice protoplasts together, and it was shown that GS10 and GW5 had membrane localization signals partially overlapping and expression signals were present in the cytoplasm (fig. 16).
Example 7 GS10 influences the granulotype by regulating cell shape rather than cell number
Previous grain type studies have found that changes in grain type are primarily limited by changes in the size resulting from glume cell division and changes in the number resulting from cell proliferation. To further clarify how GS10 was regulated at the glume cell level, we selected non-flowering rice grains from young ears 1 day before ear emergence and performed semi-thin section experiments in GLA4 and NIL-GS10, respectively. As shown, there was no significant difference in the number of cell layers of the palea compared to NIL-gs10 for GLA4, whereas the cells of GLA4 were larger than those of NIL-gs10 at the cell size level (FIG. 17, A). We preliminarily speculate that the main cause of the changes in grain type may be due to the fact that the cell width and thus the size of the cell is further altered in NIL without altering the number of palea cells.
Based on this hypothesis, we first performed scanning electron microscope observations on the cells of the palea and lemma of the mature grain glume cells, and in the statistics we used the cells at the longest transverse axis and the widest longitudinal axis of the grain length as statistical criteria, and the results showed that there was no difference in the cell numbers of NIL-gs10 and GLA4, but there was a difference in the length and width of the cells, both at the grain length and grain width levels (fig. 17, B-D). We combined the horizontal and vertical axis of the cells and found that GLA4 cells had a larger unit area than NIL-gs 10. To further validate the correlation with cell size, we selected some genes closely related to cell division, such as: CDC20, CDK family genes, E2F2, H1, KN, and the like. We select 1-3cm young ears and combine qRT-PCR experiments, and found that the expression amounts of genes such as CDKA2, CYCA2-3, CYCB2-1, CYCD4, CYCLA, H1, KN, MCM3 and the like are remarkably different between GLA4 and NIL-gs10 (figure 17, E). These more diverse genes were expressed in the NIL at a lower level than GLA4, and we preliminarily speculated that GS10 mainly positively regulates some genes involved in cell division during ear development and thus participates in changes in the granulometry.
Example 8 homologous protein and evolutionary Tree analysis
GS10 encodes 6 ARM repeats of armadillo, and a gene family was found to have 158 members in total in studies of their homologous genes (Tewari et al, 2010). In the past research on Arabidopsis thaliana, the gene family with ARM domain may be involved in different pathways in plants depending on whether the motif carried before or after, such as U-box, kinesin, LRR, etc. In the rice research report, the Spotted leaf 11(spl11) encodes a protein carrying U-box and ARM repeat domain, and participates in ubiquitination and protein-protein binding processes (Zeng et al, 2004), respectively. The frame shift mutation of the spl11 protein can cause cell death and weakened defense function of plants, thereby being more susceptible to diseases. It was found to be more similar to the β -Catenin (β -Catenin) family protein by comparing homologous proteins between similar species, as in human cell studies comparing extensive Catenin beta 1(CTNNB1) and arm in drosophila cells, but the mechanism of β -Catenin in rice and arabidopsis is rarely reported (fig. 18). In the Wnt/beta-catenin signaling pathway, GSK3 and AtBIN2/OsGSK2 in plant cells are homologous proteins (Sharma et al, 2014a), which play an important regulatory role in downstream pathways, and we speculate that GSK3 beta phosphorylated beta-catenin may also phosphorylate GS10 in plant cells by OsGSK 2.
Through the evolutionary tree analysis of related homologous PROTEINs of rice GS10, LOC _ Os03g02580(GSL1, GS10-LIKE1 PROTEIN) is found to be the most recent related PROTEIN in the family, and the functions of most genes in the family are not excavated (FIG. 19, A). GSL1 also encoded 6 ARM tandem repeats in MSU gene annotation, which we randomly knocked out in the ZH11 background (fig. 19, B). The results showed that there was no significant difference in plant type between GSL1 and GS10, and there was a difference in plant height and inclination of flag leaf (fig. 19, B). In the regulation of grain shape, GSL1-CR and GSL1-GS10-CR have no significant difference in grain length and grain width, and both can make the grain shape thin and long (FIG. 19, C-D). Combining the above results with the changes in grain and plant types of GS10-CR-ZH11, we speculate that GS10 may have some redundancy with GSL1 in protein functions, and they may play the same role in gene pathways.
Example 9, GS10 RNA-seq analysis
To further explore the changes in GS10 at the transcriptome level. We selected three kinds of materials such as GLA4, GS10-CR and OX as the flowering stage of the sword leaf to sample and extract the total RNA for RNA sequencing analysis. During the experimental design, we set up 2 biological replicates of each different combination, totaling 9 transcriptome data. Analysis results show that in comparison with GLA4, 2476 Differentially Expressed Genes (DEGs) are found in total, 1838 genes are up-regulated, and 638 genes are down-regulated. In contrast, CR found 346 differentially expressed genes compared to GLA4, of which 177 were up-regulated and 169 were down-regulated. The GO (gene ontology) analysis of the two types of differentially expressed genes shows that GS10 is mainly involved in certain metabolic processes, particularly catalysis and hydrolysis reaction, so as to regulate and control stress reaction in life activities, such as cell wall synthesis and certain hydrolytic metabolic processes. These results are consistent with the previous researches on the ARM tandem repeat protein family gene participating in stress reaction processes such as disease resistance and stress resistance. Because fewer DEGs were produced in GS10-CR compared to GLA4, we mined downstream regulatory genes in the data for OX and GLA 4.
As shown by the analysis of the differentially expressed genes of OX and WT, GS10 regulates some genes related to the development of flower organs, grain types and plant types, such as OsMADS1, LAX2, GL7, DTH7, OsBZR1, OsIAA family genes, OsJAZ family genes and the like. We speculate that GS10 may be involved in the regulation of BR as well as its homologous gene TUD1, thereby affecting changes in grain and plant types. It is worth noting that we preliminarily selected some reported known genes related to the grain type and the plant type, and combined with qRT-PCR, re-detected the expression level of these DEGs. As a result, it was found that GS10 can negatively regulate the granule type gene GL7 and the BR pathway receptor gene OsBR1, and positively regulate OsMADS1, BG1, LAX2, OsBZR1, OsAP2-39, OsGA2OX2, DTH7, OsGA2OX5, GH3.8 and the like. (FIG. 20) by combining the above results, it can be concluded that GS10 shows "pleiotropic effect" in its agronomic trait, which is likely to participate in and regulate multiple gene pathways of rice.
Example 10 GS10 binding to OsOFP8 regulates Rice grain and plant type
OsGSK2 is a most important transkinase in the BR signal transduction pathway, and many kernel-type-associated nuclear genes, such as GRF4, OFP8 and GS9 (Tong and Chu,2018), have been regulated in past studies. The overexpression plant OsGSK2-OX (go) of the negative regulatory factor GSK2 of BR shows that the plant is short, compact and small in grain shape on phenotype; and the RNAi plant OsGSK2-RNAi (Gi) shows that the plant is thick, the plant type is loose and the grain type is large. We wanted to go further to explore whether GS10 and OsGSK2 have a connection on the BR pathway. In previous studies, the subcellular localization of OsGSK2 was either cytoplasmic or nuclear, whereas GS10 was expressed only cytoplasmic. To further explore how GS10 participates in and exerts regulatory BR signals in the cytoplasm, we initially used the yeast two-hybrid screening library approach. In later experimental analysis, the method for screening the interacting protein by the yeast bank is not smooth, and only the gene SnRK1A is screened by the two screening results. Past researches show that SnRK1A is mainly involved in ABA signals and hardly involved in the research of rice yield regulation such as grain type and the like. Meanwhile, GS10 is connected to a yeast BD vector, and some key proteins related to BR signal transduction are selected and connected to an AD vector, wherein the key proteins comprise receptor proteins OsBRI1 and OsBAK1, GW5 inhibiting GSK2 activity, and transcription factors regulated by OsGSK2 at the downstream of BR, including DLT, GRF4, BZR1, OSH1, OFP8, OFP14, GS9 and the like. Experimental results show that GS10 is able to interact with SnRK1A, OsOFP8 and OSH1 in vitro (fig. 21, a).
We first ligated OsGSK2 to AD vector and GS10 to BD vector, respectively, by yeast double-hybrid experiments, using two-plate-lacking control and three-plate-lacking and four-plate-lacking experiments to show that there was no interaction between them (FIG. 21, B).
We also combined some genetic experiments to verify whether there is a certain relation between GS10 and OsGSK2, an important transgene in BR pathway. Interestingly, we designed a double knockout experiment of OsGSK2 and GS10 with ZH11 as a background, and used Go (OsGSK2-OX) and Gi (OsGSK2-RNAi) as controls to obtain transgenic lines such as GS10-CR-ZH11, GS10-OX-ZH11, GS10-OsGSK2-CR and OsGSK2-CR (FIG. 22, A). T in these transgenic plants2As shown by the examination of grain type and plant type in the generation homozygous plants, the grain type trait of GS10-OsGSK2-CR is compared with that of OsGSK2-CR and GS10-CR-ZH11, and OsGSK2 can cause grains to be enlarged without changing grain width; while GS10-OsGSK2-CR and GS10-CR-ZH11 do not have any difference in grain length and grain width; this suggests that even though OsGSK2 does not interact with GS10, they all affect grain type in BR pathway regulation, and that GS10 has some epistasis in BR pathway to OsGSK2 (fig. 22, B-D).
In the above transgenic plant types, GS10-OsGSK2-CR and GS10-CR showed looser plant types and a tendency of increased inclination angle of flag leaf compared with ZH11 (FIG. 23, A-B); however, OsGSK2-CR did not show the same drastic BR phenotype as OsGSK2-RNAi, indicating that some redundancy may exist in the rice GSK family (FIG. 23, A). Meanwhile, related overexpression experiments of GS10 in ZH11 are combined, and it can be seen that after the expression level of GS10 in japonica rice material ZH11 is greatly improved (fig. 23, C), the plant type is more compact, the stems are thickened, the inclination angles of the sword leaves are upright, the spike is thickened and shortened, and the number of grains without spikes is not significantly changed (fig. 23, A). It was noted that its plant height showed a decrease as low as OsGSK2, and the degree of change could reach around 25% (FIG. 23, D).
OsOFP8 is phosphorylated by OsGSK2, and the protein is phosphorylated to cytoplasm and is aggregated and finally degraded by protease, which is a transfer factor for positively regulating BR signals (Yang et al, 2016). It is used as a more important positive regulatory factor in BR signal path, and under the condition of overexpression, the inclination angle of plant leaf is increased, and the grain shape is narrowed and lengthened. We preliminarily predicted that OsOFP8 may act as a mediating protein of GS10 in response to BR signaling to regulate expression of downstream BR-associated genes. First, we performed BiFC experiments in tobacco, and by fusing GS10 and OsOFP8 to the N-terminus and C-terminus of yellow fluorescent protein YFP, respectively, the experimental results showed that GS10 and OsOFP8 are interactive in plants and cytoplasmic. Meanwhile, the verification is carried out in tobacco bodies by combining the BiLC experiment, GS10 and OsOFP8 are respectively fused with the N end and the C end of the fluorescein protein luciferase, and the same experiment result is also obtained by combining the results of a positive control and a negative control. We used Co-IP method to verify protein interaction at protein level, and the 35S::6xMyc-OsOFP8 and no-load 35S::6xMyc and 35S::3Flag-GS10 were transformed to express protein transiently in tobacco. After the protein extracts were adsorbed by Flag beads for 3-4h, the presence of in vivo interaction was checked by the western blotting experiment using Flag and Myc antibodies. Final experimental results showed that GS10 and OsOFP8 did interact in vivo (fig. 24).
OsOFP8 acts as a transcription factor downstream of OsGSK2, is phosphorylated by the latter and degraded in the cytoplasm. But we found that the region where OsOFP8 and GS10 interact is located in the cytoplasm, how do GS10 affect the role of OsOFP 8? We first performed co-localization experiments in plants (tobacco and rice protoplasts), using OsOFP8 fused with mCherry as a control, GS10 fused with GFP and then transferring GS10-GFP and OsOFP8-mCherry into tobacco and rice protoplasts together, and the results showed that GS10 indeed caused OsOFP8 coring (FIG. 25). Meanwhile, the qRT-PCR experiments are combined to find that the expression level of OsOFP8 is reduced in young ears of GS10-OX-ZH11 compared with wild type ZH11, which shows that GS10 has a certain inhibition effect on OsOFP8 at the transcription level.
Example 11, GS10 is involved in BR Signal transduction
In the verification of the transgenic experiment, GS10 is found to regulate and control the plant type besides the grain type and the panicle type. In the background of GLA4, NIL-gs10 showed significant differences in plant height and in the degree of looseness of plant type compared to wild type. Then, statistics is carried out on the leaf inclination angle of the transgenic material, and the result shows that NIL-gs10 and CR are both larger than GLA4, and the plant types are scattered; compared with NIL-gs10, CP-1, CP-2 showed a tendency to become smaller, and the plant type was more compact. This indicates that GS10 is a positive regulator in the regulation of rice plant type, especially leaf inclination.
In past studies, it was found that only one homologous gene TUD1 is currently cloned from members of the ARM family, which binds to RGA1(D1) and regulates changes in grain and plant types by participating in signal transduction between G-protein and BR (Hu et al, 2013). Meanwhile, in protein prediction, GS10 was found to encode 6 armadillo reptiles, which have certain homology with CTNNB1 in human and arm in Drosophila. The latter two proteins are mainly involved in the classical Wnt/beta-catenin signal transduction pathway. Therefore, we preliminarily predicted that GS10 may also have similar function of BR signaling.
BR signals are mainly used in rice research to regulate grain type and plant type (plant height and leaf inclination) (Tong and Chu, 2018). To further study the gene function of GS10 in rice, we first processed it with brassinolide response to see if it responds to BR. We grown them by treating GLA4 and NIL-gs10 (seeds soaked in advance until white) in 0.3% agar medium containing 0. mu.g, 500. mu.g, 1mg of the brassinolide analogue 2,4-epiBL, respectively, protected from light in a 30 ℃ incubator. The results of the treatments showed that the epicotyls became longer with increasing hormone concentrations in NIL plants compared to GLA4 under the same conditions (FIG. 26, A-B). We also used protein-tagged HA-bearing OX material against GLA4 as treatment material, with nodes every 30min and 1mM BL and MG132 as treatment conditions, respectively. The experimental results show that GS10 was gradually degraded over time (fig. 26, C). We also added different concentrations of BL to GLA4 and NIL plants at their leaf inclination at the 10-day seedling stage, followed by treatment with light at 30 degrees for 2 days, and the results showed that CR plants and WT showed a greater change in leaf inclination compared to WT under the 1mg concentration treatment (FIG. 26, D-F). It was demonstrated that mutant GS10 could increase sensitivity of BR, and GS10 could negatively regulate BR signaling during normal life activities.
Example 12 relationship of GS10 to other major granule type genes
In previous studies, it was found that there is a large correlation between the traits related to grain type and quality, for example, grain length, grain width, and aspect ratio (Gong et al, 2017). The appearance quality is enhanced by the long and narrow grains which are finally produced when the aspect ratio of the rice grain type is increased, and the appearance quality is deteriorated (Wang et al, 2012; Wang et al, 2015 b; Liu et al, 2018). To further explore the relationship of GS10 to other grain type genes, we purposefully combined three major grain type genes, GW5, GL7/GW7 and GS10, located in this set of recombinant inbred lines by means of molecular polymerization (fig. 27, a). Among them, GW5 and GL7/GW7 have been reported as excellent genes for excellent quality control and grain type. GW5 from wild rice W1943 controls elongated granules, with a phenotype very similar to GS10 on granule type, and regulates granule type by binding protein function and inhibiting kinase activity of OsGSK 2. GL7/GW7 is a QTL site with a relatively major effect on grain length, interestingly, GW8 binds to a promoter region of GW7 and inhibits the expression of the promoter region so as to regulate grain type, and the two genes also have certain improvement on rice quality.
In the progeny seeds bred by the polymeric materials, compared with NIL-GW5/gs10, NIL-GL7/gs10 and NIL-gs10, gs10 can increase the grain length of GW5 and GL7 in the grain length property and has a certain additive effect (figure 27, B). In the grain width trait, gs10 polymerized with either GW5 or GL7, also made the progeny seeds finer (fig. 27, C). The above results indicate that gs10 has some additive effect on the grain type trait for both major genes, GW5 and GL 7. It is noteworthy that the polymeric material NIL-GW5/GL7/gs10 is wider than the latter two in terms of particle width as compared to NIL-GW5/GL7 and NIL-GL7/gs 10. The above results further demonstrate that the aggregation of two major genes, GW5 and GL7, can greatly change the particle width, and that after being incorporated into gs10, a balance exists in the particle shape, and the particle shape is not drastically changed.
At the thousand kernel weight level we can see a tendency for both GW5 and gs10 to reduce the kernel weight, in addition to the significant contribution of GL7 to the kernel weight (fig. 27, D). NIL-GW5/GL7 did not show any significant difference in thousand kernel weight compared to NIL-GW 5. The thousand kernel weight of NIL-GW5/gs10 decreased when GW5 polymerized gs10, the same result was also seen in NIL-GL7/gs 10. While when the three genes were aggregated together, thousand grain weight of NIL-GW5/GL7/gs10 was lowest compared to the other materials, indicating that gs10 had a negative regulatory effect at the grain weight level (FIG. 27, D).
Example 13 polymerization of gs10 with GW5 improved the appearance and quality of rice kernels
It is noted that GLA4 is a high-yielding and poor-looking base material, and it is clear that the appearance clarity in brown rice is improved somewhat in NIL-gs10 and NIL-GW5 as compared to wild-type GLA 4. NIL-GW5/gs10 showed a great improvement in seed transparency, enabling a substantial reduction in chalkiness. Meanwhile, the scanning of an electron microscope is carried out on the arrangement degree of the starch at the fault level of the rice grains, and the result shows that the arrangement of the starch at the fault level of GLA4 is looser, and NIL-gs10 is followed, while the arrangement of starch cells of NIL-GW5/gs10 and NIL-GW5 is tighter, which is consistent with the result on the appearance quality of the grains (figure 29, A). The above results indicate that gs10 can improve the appearance quality of rice seeds by affecting the close arrangement of starch cells, and further, by fusing with GW5, the rice quality can be greatly improved in future molecular breeding (fig. 28).
We further measured the intrinsic quality of rice kernels for the above polymeric materials by a rice kernel taste instrument. As a result, it was found that NIL-gs10 and NIL-GW5 were only slightly increased compared to the wild-type GLA4, both in protein and amylose content. But if they are aggregated together, the intrinsic quality is greatly improved. It is noted that NIL-GW5 increased the fatty acid content in the fatty acid content, and these results indicate that gs10 is an allelic variation that does not combine quality with grain weight, and that the aggregation of GW5 and gs10 significantly improved the rice appearance quality. If the quality is pursued, the application can greatly improve the rice quality in future breeding (figure 29, B).
Example 14 analysis of Natural variation and acclimatization of GS10
By combining the low-abundance sequencing data of wild rice and cultivated rice in a laboratory, the polymorphism of the region is analyzed after SNP calling is carried out on 20.1-20.5Mb (IRGSP 1.0) of a No. 10 chromosome of rice, and the polymorphism of the wild rice is higher than that of the cultivated rice, and the polymorphism of the indica is higher than that of the japonica. This indicates that during the evolution, this region was strongly selected and that GS10, which is normally protein functional, was gradually immobilized in oryza sativa. As can be seen from our information on haplotyping of wild rice, GS10 still has a great deal of variation in the wild rice population, which may be due to partial deletion of the base and thus may also generate the same frame shift mutation and also bring about variation in grain type and plant type as allele of W1943. In which we used Indel3+569 as an example, GS10 deleted four bases of CGGC in part of wild rice and cultivated rice. In order to create similar genotypes in rice, a PAM locus as GS10-CR is designed and verified by Dongjing (DJ) of japonica rice background, so that two transgenic lines DJ-CR-1 and DJ-CR-2 are obtained. The results are shown in FIG. 30.
In both transgenic lines, they generated deletion mutations of a partial base and both contained Indel3+569(-CGGC) as the mutation site. At T2In the offspring, the homozygous plant phenotype is statistically analyzed, and the ears of DJ-CR-1 and DJ-CR-2 are longer and the ears of the plants are second-level compared with DJThe tendency of reduction in the number of peduncles was consistent with gene knockouts in NIP. In addition, the grain shape traits of the three materials are analyzed, and DJ-CR-1, DJ-CR-2 and DJ show the tendency that the plant types of plants become loose, the inclination angle of the sword leaves becomes larger, and the grain shape becomes thinner and longer compared with DJ. Especially at the grain width level, CRISPR-produced mutants can cause up to 15% changes. We speculate that other alleles produced by wild rice also produce a long-grain phenotype in rice due to mutations in the gene, as does W1943.
Summary of the invention
Ordinary wild rice (Oryza rufipogon) and Oryza sativa (Oryza sativa L.) are the most typical genetic resource materials for studying rice domestication and evolution in the AA genome of the classified population of Oryza. The grain type is a complex agronomic shape closely related to the yield and quality of rice, and is controlled by multiple genes. Meanwhile, with the rise of hybrid rice breeding and the great increase of rice yield, the rice quality is more and more paid attention by breeders and consumers. At present, research on Quantitative Trait Loci (QTLs) of grain types in natural populations has been greatly advanced, but QTLs that regulate quality and are grain types are still poorly understood in the evolution process. The scientific problem concerned by the research is to excavate and analyze new genes influencing yield and quality and an evolution mechanism thereof in wild rice with abundant genetic variation, perfect a granular gene molecular regulation network and evaluate the agricultural production application value of the genes.
A set of low-abundance sequenced wild rice material (W1943) with indica rice background, Guang-Lu-ai 4(GLA4) and ordinary wild rice material with a non-glutinous rice background, which are constructed in a laboratory, are hybridized to generate a recombinant inbred line, so that QTLs (quantitative trait loci) related to yield traits (grain type, plant type and seed setting rate) are positioned. In this study, we selected the recombinant inbred line and mapped a GRAIN SIZE candidate gene GS10(GRAIN SIZE ON CHROMOSOME 10) by combining phenotypic study and map-based cloning. This gene encodes a novel armadillo (arm) subfamily protein with 6 tandem repeats. The gs10 derived from wild rice has a frame shift mutation due to deletion of 5 nucleotides, resulting in loss of protein function, resulting in a narrow and long grain shape, a reduced grain thickness, a reduced thousand grain weight, and a larger leaf inclination angle, but the appearance quality of the grains is excellent. The agronomic trait regulatory analysis of GLA4 and the near isogenic line NIL-GS10 revealed that GS10 has a pleiotropic effect and that mutated GS10 did not affect individual and cell yields. Transgenic experiments such as complementation, overexpression, gene knockout and the like further prove that GS10 can positively regulate grain width, but negatively regulate properties such as grain length, panicle length and the like. In addition, by integrating the size and number statistical analysis of the glume cells of the mature seeds and the observation of the semi-thin slices of the immature glumes, the fact that the gene mainly influences the size of the glume cells is revealed, and the number is not influenced, so that the grain type change is regulated. Experiments such as RNA-seq and yeast two-hybrid combination prove that GS10 can be combined with OsOFP8 to participate in BR signal path, but does not interact with OsGSK2 in protein function but is positioned in OsGSK2 in grain type character. Meanwhile, the aggregate breeding of gs10 and GW5 with the main effect of controlling grain type and quality under the background of GLA4 can further improve the aspect ratio in cultivated rice and further improve the appearance quality. Based on the sequencing of the coding regions of 114 parts of wild rice and 997 parts of oryza sativa GS10, 16 haplotypes were obtained, and these results indicate that the allelic variation site of GS10, which functions normally in wild rice and has compact plant type and increased thousand-kernel weight, is essentially fixed in the oryza sativa population. Notably, gs10 from W1943 still has many allelic variations in the wild rice population that produce frameshift mutations in the coding region, and a few in the aus and aromatic populations, suggesting that dysfunctional gs10 has great potential for future rice quality improvement.
Sequences as referred to herein
GS10 protein | SEQ ID NO:1 |
GS10 Gene coding sequence | SEQ ID NO:2 |
U3-sgRNA | SEQ ID NO:3 |
U6a-sgRNA | SEQ ID NO:4 |
Sequence listing
<110> prominent innovation center of molecular plant science of Chinese academy of sciences
<120> gene for regulating and controlling quality of rice grains and application thereof
<130> 20A631
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 560
<212> PRT
<213> Artificial Sequence
<400> 1
Met Ser Leu His Cys Cys Pro Ala Ala Gly Asp Pro Pro Ala Pro Ala
1 5 10 15
Gly Thr Ala Glu Glu Leu Leu Glu Arg Ala Arg Ser Leu Val Pro Ala
20 25 30
Ala Leu Asp Ala Ala Arg Ala Ala Thr Gly Phe Gly Gly Arg Trp Lys
35 40 45
Val Ile Ala Ala Arg Leu Glu Arg Val Pro Pro Cys Leu Ser Asp Leu
50 55 60
Ser Ser His Pro Cys Phe Ser Lys Asn Ser Leu Cys Arg Glu Leu Leu
65 70 75 80
Gln Ser Val Ala Ala Thr Leu Ala Glu Ala Ala Glu Leu Gly Ala Arg
85 90 95
Cys Arg Glu Pro Pro Arg Ala Gly Lys Leu Gln Met Gln Ser Asp Leu
100 105 110
Asp Ala Leu Ala Gly Lys Leu Asp Leu Asn Leu Arg Asp Cys Ala Leu
115 120 125
Leu Ile Lys Thr Gly Val Leu Ser Asp Ala Thr Val Pro Pro Val Ala
130 135 140
Pro Ala Ala Glu Ala Ala Ala Gln Thr Asp Val Arg Glu Leu Leu Ala
145 150 155 160
Arg Leu Gln Ile Gly His Ala Glu Ala Lys His Arg Ala Val Asp Gly
165 170 175
Leu Leu Asp Ala Leu Arg Glu Asp Glu Lys Ser Val Leu Ser Ala Leu
180 185 190
Gly Arg Gly Asn Val Ala Ala Leu Val Gln Leu Leu Thr Ala Thr Ala
195 200 205
Pro Lys Ile Arg Glu Lys Ala Ala Thr Val Leu Cys Leu Leu Ala Glu
210 215 220
Ser Gly Ser Cys Glu Cys Leu Leu Val Ser Glu Gly Ala Leu Pro Pro
225 230 235 240
Leu Ile Arg Leu Val Glu Ser Gly Ser Leu Val Gly Arg Glu Lys Ala
245 250 255
Val Ile Thr Leu Gln Arg Leu Ser Met Ser Pro Asp Ile Ala Arg Ala
260 265 270
Ile Val Gly His Ser Gly Val Arg Pro Leu Ile Asp Ile Cys Gln Thr
275 280 285
Gly Asp Ser Ile Ser Gln Ser Ala Ala Ala Gly Ala Leu Lys Asn Leu
290 295 300
Ser Ala Val Pro Glu Val Arg Gln Ala Leu Ala Glu Glu Gly Ile Val
305 310 315 320
Arg Val Met Val Asn Leu Leu Asp Cys Gly Val Val Leu Gly Cys Lys
325 330 335
Glu Tyr Ala Ala Glu Cys Leu Gln Ser Leu Thr Ser Ser Asn Asp Gly
340 345 350
Leu Arg Arg Ala Val Val Ser Glu Gly Gly Leu Arg Ser Leu Leu Ala
355 360 365
Tyr Leu Asp Gly Pro Leu Pro Gln Glu Ser Ala Val Gly Ala Leu Arg
370 375 380
Asn Leu Val Ser Ser Ala Ile Ser Pro Asp Ser Leu Val Ser Leu Gly
385 390 395 400
Val Leu Pro Arg Leu Val His Val Leu Arg Glu Gly Ser Val Gly Ala
405 410 415
Gln Gln Ala Ala Ala Ala Ala Ile Cys Arg Val Ser Ser Ser Ser Glu
420 425 430
Met Lys Arg Leu Val Gly Glu His Gly Cys Met Pro Leu Leu Val Arg
435 440 445
Leu Leu Glu Ala Lys Ser Asn Gly Ala Arg Glu Val Ala Ala Gln Ala
450 455 460
Val Ala Ser Leu Met Ser Cys Leu Ala Asn Ala Arg Asp Ile Lys Lys
465 470 475 480
Asp Glu Lys Ser Val Pro Asn Leu Val Gln Leu Leu Glu Pro Ser Pro
485 490 495
Gln Asn Thr Ala Lys Lys Tyr Ala Ile Ser Cys Leu Leu Thr Leu Ser
500 505 510
Ala Ser Lys Arg Cys Lys Lys Leu Met Ile Ser His Gly Ala Ile Gly
515 520 525
Tyr Leu Lys Lys Leu Ser Glu Met Asp Val Ala Gly Ala Lys Lys Leu
530 535 540
Leu Glu Lys Leu Glu Arg Gly Lys Leu Arg Asn Leu Phe Ser Arg Lys
545 550 555 560
<210> 2
<211> 1683
<212> DNA
<213> Artificial Sequence
<400> 2
atgagcttgc attgctgtcc ggcggccggc gacccgccgg cgccggccgg gacggcggag 60
gagctgctgg agcgggcgcg gtcgctggtg ccggcggcgc tggacgcggc gcgcgcggcg 120
accggcttcg gcggccggtg gaaggttatc gcggcgaggc tggagagggt gccgccgtgc 180
ctgtccgacc tgtccagcca cccgtgcttc tccaagaact cgctctgccg cgagcttctg 240
cagtcggtgg ccgccacgct cgccgaggcc gccgagctcg gggcgcgctg ccgcgagccg 300
ccgagggccg ggaagctgca gatgcagagt gacctcgacg cgctcgccgg gaagctcgac 360
ctgaacctcc gggattgcgc gctgcttatc aagaccggtg tgctgtccga cgcgaccgtg 420
ccaccggtgg cgccggcggc tgaggcggcg gcgcagacgg atgtgcggga gctgcttgcg 480
aggcttcaga tcgggcacgc ggaggcgaag caccgggcgg tggatgggct cctcgacgcg 540
ctacgcgagg acgagaagag cgtgctgtcg gcgctcggcc gcggcaacgt ggcggcgctg 600
gtgcagctgc tgacggcgac ggcgcccaag atcagggaga aggcggccac cgtcctctgc 660
ttgctggccg agtccggcag ctgcgagtgc ttgctggtgt cggagggcgc gctgccgccg 720
ctcatccggc tggtcgagtc cggcagcctg gtcggccggg agaaggccgt gatcacgctg 780
cagcggctgt ccatgtcgcc cgacattgcc cgcgccatcg tcggccacag cggcgtccgc 840
ccgctgatcg acatctgcca gaccggggac tccatctcgc agtccgcggc ggccggcgcg 900
ctcaagaacc tctccgcggt gcccgaggtg cgccaagcgc tggcggagga agggatcgtg 960
cgcgtcatgg tcaacctgct cgactgcggc gtcgtgctcg gctgcaagga gtacgccgcg 1020
gagtgcctcc agagcctcac gtcgagcaac gacggcctcc gccgcgccgt cgtgtccgag 1080
ggcggcctcc gcagcctgct cgcctacctc gacggcccgc tgccgcagga gtccgccgtg 1140
ggcgcgctcc gcaacctggt gagcagcgcc atctcgccgg acagcctggt gtcgctgggc 1200
gtgctccccc gcctcgtcca cgtgctccgc gagggctccg tgggcgcgca gcaggcggcc 1260
gcggcggcga tctgcagggt gtcgagctcg tcggagatga agcgcctggt cggcgagcac 1320
gggtgcatgc cgctgctggt gcggctgctg gaggcgaagt cgaacggcgc gcgcgaggtg 1380
gcggcgcagg cggtggcgag cctgatgagc tgcctggcga acgccaggga catcaagaag 1440
gacgagaaga gcgtccccaa cctggtgcag ctgctggagc cgagccccca gaacacggcc 1500
aagaagtacg ccatctcctg cctcctcacg ctgtcggcga gcaagcgctg caagaagctg 1560
atgatctcgc acggcgccat tggctacctc aagaagctct ccgagatgga cgtcgccggc 1620
gccaagaagc tgctcgagaa gcttgagcga ggcaagctgc gcaacctctt cagtaggaag 1680
taa 1683
<210> 3
<211> 23
<212> DNA
<213> Artificial Sequence
<400> 3
aagagcgtgc tgtcggcgct cgg 23
<210> 4
<211> 24
<212> DNA
<213> Artificial Sequence
<400> 4
tgatcgacat ctgccagacc gggg 24
Claims (10)
1. Use of a substance for modulating an agronomic trait in a plant, the substance selected from the group consisting of: GS10 gene or its coding protein, or its promoter or inhibitor, wherein the agronomic trait is selected from one or more of the following: leaf length, leaf width, single plant yield, cell yield, plant height, tillering number, grain number per ear, spike length, branch number, top awn, heading period, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content, grain amylose content,
preferably, the plant is a cereal crop.
2. The use according to claim 1,
the GS10 gene codes an amino acid sequence selected from the group consisting of:
(a) polypeptide with a sequence shown as SEQ ID NO. 1;
(b) 1 through one or more amino acid residue substitution, deletion or addition, and (a) derived polypeptide with (a) polypeptide function; or
(c) A polypeptide derived from (a) having more than 90% homology with the polypeptide sequence of (a) and having the function of the polypeptide of (a);
and/or the presence of a gas in the atmosphere,
the nucleic acid sequence of the GS10 gene is selected from the group consisting of:
(1) a polynucleotide encoding a polypeptide as shown in SEQ ID NO. 1;
(2) 2 or a polynucleotide having more than 80% homology with the polynucleotide shown in SEQ ID NO;
(3) 2, truncating or adding 1-60 nucleotides to the 5 'end and/or the 3' end of the polynucleotide shown in SEQ ID NO;
(4) a polynucleotide complementary to any one of the polynucleotides described in (1) to (3).
3. The use according to claim 1,
the accelerator is selected from the group consisting of: a small molecule compound, a nucleic acid molecule, or a combination thereof;
the inhibitor is an inhibitor molecule which specifically interferes with the transcription and/or expression of the GS10 gene, and preferably the inhibitor molecule takes the GS10 gene or a transcript or an encoded protein thereof as an inhibition target; more preferably, the inhibitory molecule is selected from the group consisting of: (1) a small molecule compound, an antisense nucleic acid, a microRNA, a siRNA, an RNAi, a dsRNA, a sgRNA, an antibody, or a combination thereof, and (2) a nucleic acid construct capable of expressing or forming (1).
4. A method of modulating an agronomic trait in a plant, the method comprising: modulating expression or activity of a GS10 gene in a plant, thereby modulating an agronomic trait in the plant, the agronomic trait being selected from one or more of the following: leaf length, leaf width, single plant yield, cell yield, plant height, tillering number, grain number per ear, spike length, branch number, top awn, heading period, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content, grain amylose content,
preferably, the method of modulating an agronomic trait in a plant comprises: up-regulating the expression or activity of the GS10 gene in a plant; thereby reducing the leaf length, increasing the leaf width, reducing the yield of a single plant, reducing the yield of a cell, reducing the plant height, reducing the tillering number, reducing the grain number per spike, reducing the spike length, reducing the number of branches and stalks, reducing the awn, shortening the heading stage, increasing the diameter of the stems, increasing the width of glume cells, reducing the length of glume cells, increasing the unit area of glume cells, reducing the leaf inclination angle, reducing the grain length, increasing the grain width, increasing the grain thickness, increasing the grain weight, increasing the BR sensitivity, negatively regulating and controlling a BR signal path, reducing the transparency of grains, increasing the compactness of starch cells of the grains, reducing the protein content of the grains and reducing the amylose content of the grains; more preferably, said up-regulating expression of the GS10 gene in a plant comprises: transferring GS10 gene into plant to obtain transformed plant,
the method for regulating and controlling the agronomic traits of the plants comprises the following steps: down-regulating expression or activity of GS10 in a plant; thereby increasing the leaf length, reducing the leaf width, increasing the yield of a single plant, increasing the yield of a cell, increasing the plant height, increasing the tillering number, increasing the grain number per spike, increasing the spike length, increasing the number of branches and stalks, increasing the awn, prolonging the heading period, reducing the diameter of the stalks, reducing the width of glume cells, increasing the length of glume cells, reducing the unit area of glume cells, increasing the inclination angle of the leaves, increasing the grain length, reducing the grain width, reducing the grain thickness, reducing the grain weight, BR sensitivity, positively regulating a BR signal path, increasing the transparency of grains, reducing the compactness of starch cells of the grains, increasing the protein content of the grains and increasing the amylose content of the grains; more preferably, said down-regulating the expression or activity of the GS10 gene in the plant comprises: an inhibitor that down-regulates GS10 gene transcription, protein expression, or protein activity is transferred into the plant.
5. The method of claim 4, wherein the inhibitor is an inhibitory molecule that specifically interferes with the transcription and/or expression of the GS10 gene,
preferably, the suppressor molecule targets the GS10 gene or its transcript or encoded protein as a suppressor,
more preferably, the inhibitory molecule is selected from the group consisting of: (1) a small molecule compound, an antisense nucleic acid, a microRNA, a siRNA, an RNAi, a dsRNA, a sgRNA, an antibody, or a combination thereof, and (2) a nucleic acid construct capable of expressing or forming (1).
6. Use of the GS10 gene as a molecular marker for identifying agronomic traits in plants, the agronomic traits comprising: leaf length, leaf width, single plant yield, cell yield, plant height, tiller number, grain number per ear, ear length, branch number, top awn, heading stage, stem diameter, glume cell width, glume cell length, glume cell unit area, leaf inclination angle, grain length, grain width, grain weight, BR sensitivity, BR signal path, grain transparency, grain starch cell compactness, grain protein content and grain amylose content.
7. An expression cassette for expressing the GS10 gene, which has the following elements in order from 5 'to 3': the 5' UTR region, the ORF sequence of the GS10 gene, and a terminator.
8. Use of the expression cassette of claim 7 for improving an agronomic trait in a crop selected from the group consisting of: reducing the leaf length, increasing the leaf width, reducing the yield of a single plant, reducing the yield of a cell, reducing the plant height, reducing the tillering number, reducing the grain number per spike, reducing the spike length, reducing the number of branches, reducing the awn, shortening the heading stage, increasing the diameter of stems, increasing the width of glume cells, reducing the length of glume cells, increasing the unit area of glume cells, reducing the leaf inclination angle, reducing the grain length, increasing the grain width, increasing the grain thickness, increasing the grain weight, increasing the BR sensitivity, negatively regulating and controlling a BR signal path, reducing the transparency of grains, increasing the compactness of starch cells of the grains, reducing the protein content of the grains and reducing the amylose content of the grains.
9. A sgRNA targeting the GS10 gene or a nucleic acid construct capable of producing the sgRNA,
preferably, the sgRNA is shown in SEQ ID NO 3 or 4.
10. The application of a substance for down-regulating the expression or activity of GS10 gene and a substance for up-regulating the expression or activity of one or two genes selected from GW5 and GW7(GL7) in stabilizing grain shape, improving appearance quality of rice and improving aspect ratio of rice.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011626461.1A CN114763375B (en) | 2020-12-31 | 2020-12-31 | Gene for regulating quality of rice grains and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011626461.1A CN114763375B (en) | 2020-12-31 | 2020-12-31 | Gene for regulating quality of rice grains and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114763375A true CN114763375A (en) | 2022-07-19 |
CN114763375B CN114763375B (en) | 2024-06-21 |
Family
ID=82362994
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011626461.1A Active CN114763375B (en) | 2020-12-31 | 2020-12-31 | Gene for regulating quality of rice grains and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114763375B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115960859A (en) * | 2022-09-26 | 2023-04-14 | 中国科学院遗传与发育生物学研究所 | Rape green revolution gene bgr and application thereof |
CN116286878A (en) * | 2023-05-17 | 2023-06-23 | 三亚中国农业科学院国家南繁研究院 | Application of paddy rice OsJAZ5 gene |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111961673A (en) * | 2020-08-20 | 2020-11-20 | 中国水稻研究所 | Rice grain type gene GS10 and application thereof |
-
2020
- 2020-12-31 CN CN202011626461.1A patent/CN114763375B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111961673A (en) * | 2020-08-20 | 2020-11-20 | 中国水稻研究所 | Rice grain type gene GS10 and application thereof |
Non-Patent Citations (4)
Title |
---|
BIOPROJECT: PRJNA122,自动计算预测: "XM_015759176.2,PREDICTED: Oryza sativa Japonica Group U-box domain-containing protein 15 (LOC9268059), transcript variant X4, mRNA", 《NCBI GENBANK》, pages 1 - 2 * |
HU, XM等: "The U-Box E3 Ubiquitin Ligase TUD1 Functions with a Heterotrimeric G alpha Subunit to Regulate Brassinosteroid-Mediated Growth in Rice", 《PLOS GENETICS》, vol. 9, no. 3, pages 1 - 13 * |
SAKAMOTO, TOMOAKI等: "An E3 ubiquitin ligase, ERECT LEAF1, functions in brassinosteroid signaling of rice", 《PLANT SIGNALING & BEHAVIOR》, vol. 8, no. 11, pages 1 - 9 * |
SHARMA, M等: "Comprehensive Expression Analysis of Rice Armadillo Gene Family During Abiotic Stress and Development", 《DNA RESEARCH》, vol. 2020, no. 3, pages 267 - 283 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115960859A (en) * | 2022-09-26 | 2023-04-14 | 中国科学院遗传与发育生物学研究所 | Rape green revolution gene bgr and application thereof |
CN116286878A (en) * | 2023-05-17 | 2023-06-23 | 三亚中国农业科学院国家南繁研究院 | Application of paddy rice OsJAZ5 gene |
CN116286878B (en) * | 2023-05-17 | 2023-07-21 | 三亚中国农业科学院国家南繁研究院 | Application of paddy rice OsJAZ5 gene |
Also Published As
Publication number | Publication date |
---|---|
CN114763375B (en) | 2024-06-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sun et al. | Natural selection of a GSK3 determines rice mesocotyl domestication by coordinating strigolactone and brassinosteroid signaling | |
Zhao et al. | GS9 acts as a transcriptional activator to regulate rice grain shape and appearance quality | |
US20230365987A1 (en) | Transcription factors to improve resistance to environmental stress in plants | |
US11873499B2 (en) | Methods of increasing nutrient use efficiency | |
Hussin et al. | SiMADS34, an E-class MADS-box transcription factor, regulates inflorescence architecture and grain yield in Setaria italica | |
US10647990B2 (en) | Rice high temperature resistance gene and use in crop breeding resistance to high temperature thereof | |
EP3169785B1 (en) | Methods of increasing crop yield under abiotic stress | |
Fang et al. | Modulation of evening complex activity enables north-to-south adaptation of soybean | |
Xu et al. | A C-terminal encoded peptide, ZmCEP1, is essential for kernel development in maize | |
CN114763375B (en) | Gene for regulating quality of rice grains and application thereof | |
Xiong et al. | Small EPIDERMAL PATTERNING FACTOR-LIKE2 peptides regulate awn development in rice | |
Sui et al. | A new GA-insensitive semidwarf mutant of rice (Oryza sativa L.) with a missense mutation in the SDG gene | |
US11939588B2 (en) | Elevated resistance to insects and plant pathogens without compromising seed production | |
CN110092819B (en) | Corn bract width regulating protein ARF2, and coding gene and application thereof | |
WO2022257697A1 (en) | Cll1 gene for regulating and controlling semi-dwarf plant type and leaf ratio of plant and use of leguminous orthologous gene of same | |
Qin et al. | PH13 improves soybean shade traits and enhances yield for high-density planting at high latitudes | |
Lyu et al. | The TaSnRK1‐TabHLH489 module integrates brassinosteroid and sugar signalling to regulate the grain length in bread wheat | |
CN111533807A (en) | Application of AET1-RACK1A-eIF3h complex in plant environmental temperature adaptability | |
CN113862291B (en) | Corn leaf senescence regulation gene ZmUPF1, and identification primer and application thereof | |
Karakoc | Reverse Genetic Analysis of EHD1 for an Association Between Flowering Time and Seed Dormancy in Rice (Oryza sativa L.) | |
Chiou | Genetic analysis of grain size using DNA transposon-tagged lines in rice | |
Wang et al. | Variation in TaSPL6-D confers salinity tolerance in bread wheat by activating TaHKT1; 5-D while preserving yield-related traits | |
Liu et al. | DDG1 and G Protein a Subunit RGA1 Interaction Regulates Plant Height and Senescence in Rice (Oryza sativa). | |
Wang et al. | CsHLS1‐CsSCL28 module regulates compact plant architecture in cucumber | |
CN118255854A (en) | O2 mutant obtained through phosphorylation and dephosphorylation modification and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |