CN114214340A - 水稻粒型粒重相关基因、蛋白、分子标记及应用 - Google Patents
水稻粒型粒重相关基因、蛋白、分子标记及应用 Download PDFInfo
- Publication number
- CN114214340A CN114214340A CN202111606443.1A CN202111606443A CN114214340A CN 114214340 A CN114214340 A CN 114214340A CN 202111606443 A CN202111606443 A CN 202111606443A CN 114214340 A CN114214340 A CN 114214340A
- Authority
- CN
- China
- Prior art keywords
- gene
- grain
- rice
- weight
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 235000013339 cereals Nutrition 0.000 title claims abstract description 100
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 91
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 67
- 235000009566 rice Nutrition 0.000 title claims abstract description 60
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 23
- 239000003147 molecular marker Substances 0.000 title claims description 18
- 240000007594 Oryza sativa Species 0.000 title abstract description 68
- 241000196324 Embryophyta Species 0.000 claims abstract description 22
- 239000002773 nucleotide Substances 0.000 claims abstract description 18
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 18
- 230000001105 regulatory effect Effects 0.000 claims abstract description 15
- 230000001276 controlling effect Effects 0.000 claims abstract description 8
- 239000008187 granular material Substances 0.000 claims abstract description 4
- 108700028369 Alleles Proteins 0.000 claims abstract description 3
- 239000013598 vector Substances 0.000 claims description 20
- 230000000295 complement effect Effects 0.000 claims description 8
- 150000001413 amino acids Chemical class 0.000 claims description 7
- 238000003208 gene overexpression Methods 0.000 claims description 5
- 238000009394 selective breeding Methods 0.000 claims description 3
- 241000209094 Oryza Species 0.000 claims 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 17
- 230000002018 overexpression Effects 0.000 description 12
- 108020004414 DNA Proteins 0.000 description 10
- 230000009261 transgenic effect Effects 0.000 description 9
- 230000004807 localization Effects 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 229940024606 amino acid Drugs 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 230000014509 gene expression Effects 0.000 description 4
- 239000003999 initiator Substances 0.000 description 4
- 238000000034 method Methods 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 3
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 240000007377 Petunia x hybrida Species 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000010230 functional analysis Methods 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 230000000306 recurrent effect Effects 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 1
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- 241000743776 Brachypodium distachyon Species 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- PFOUFRJYHWZJKW-NKIYYHGXSA-N His-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O PFOUFRJYHWZJKW-NKIYYHGXSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 101000690100 Homo sapiens U1 small nuclear ribonucleoprotein 70 kDa Proteins 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- 102100033603 Kelch domain-containing protein 4 Human genes 0.000 description 1
- 101710116444 Kelch domain-containing protein 4 Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 240000002582 Oryza sativa Indica Group Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101100029173 Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 / FGSC 10173) SNP2 gene Proteins 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 101100094821 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SMX2 gene Proteins 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 1
- 102100024121 U1 small nuclear ribonucleoprotein 70 kDa Human genes 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- -1 aromatic amino acids Chemical class 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 235000015241 bacon Nutrition 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 150000002333 glycines Chemical class 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/11—Protein-serine/threonine kinases (2.7.11)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本发明公开了基因在调控植物粒型、粒重中的应用,所述基因具有如(a)、(b)、(c)所示的序列:(a)Seq ID No:1所示的基因组核苷酸序列;(b)Seq ID No:2所示的cDNA核苷酸序列;(c)在(a)、(b)所示的核苷酸序列的中添加和/或取代和/或缺失一个或几个核苷酸而生成的可编码具有调控粒型粒重功能的蛋白质的突变基因、等位基因或衍生物。所述基因正调控植物(水稻)粒型、粒重。
Description
技术领域
本发明涉及植物基因工程领域,具体地,涉及一种水稻粒型粒重相关基因、蛋白、分子标记及应用。
背景技术
水稻不仅是单子叶模式植物,而且是我国重要的粮食作物。水稻籽粒的大小是影响稻米产量和品质的重要因素,在长期的水稻育种过程中一直备受关注。水稻粒型和粒重是影响产量的直接因素,千粒重是稻谷产量三要素之一,而粒重主要由粒型决定。同时,粒型对稻米品质也有重要影响,尤其是稻米的外观品质和碾磨加工品质等。控制粒型的4个关键因素分别为粒长、粒宽、粒厚和长宽比。
但是,籽粒大小是由多个遗传位点控制的复杂数量性状,至今,科学家们利用图位克隆方法已成功克隆到一系列调控水稻粒型变异的数量性状遗传位点(QTL)。这些QTL主要通过调控水稻颖壳的大小来影响籽粒的形成,其中与粒长和粒重相关的基因有如下所示:
一、GS3
Fan C,Xing Y,Mao H,Lu T,Han B,Xu C,Li X,Zhang Q.GS3,a major QTL forgrain length and weight and minor QTL for grain width and thickness in rice,encodes a putative transmembrane protein.Theor Appl Genet.2006,112(6):1164-1171(Fan Chuchuan,Xing Yongzhong,Mao Hailiang,Lu Tingting,Han Bin,Xu Caiguo,Li Xianghua,Zhang Qifa.GS3基因,水稻粒长粒重主效QTL、粒宽粒厚微效QTL,编码一推测的跨膜蛋白.理论和应用遗传学.2006,112(6):1164-1171)。
二、qGL3/qGL3.1
Hu Z,He H,Zhang S,Sun F,Xin X,Wang W,Qian X,Yang J,Luo X.A Kelchmotif-containing serine/threonine protein phosphatase determines the largegrain QTL trait in rice.J Integr Plant Biol.2012,54(12):979-990(Hu Zejun,HeHaohua,Zhang Shiyong,Sun Fan,Xin Xiaoyun,Wang Wenxiang,Qian Xi,Yang Jingshui,Luo Xiaojin.一包含Kelch结构域的丝氨酸/苏氨酸蛋白磷酸酶决定水稻大粒QTL性状.植物学报.2012,54(12):979-990);
Qi P,Lin YS,Song XJ,Shen JB,Huang W,Shan JX,Zhu MZ,Jiang L,Gao JP,LinHX.The novel quantitative trait locus GL3.1 controls rice grain size andyield by regulating Cyclin-T1;3.Cell Res.2012,22(12):1666-1680(Qi Peng,LinYou-Shun,Song Xian-Jun,Shen Jin-Bo,Huang Wei,Shan Jun-Xiang,Zhu Mei-Zhen,Jiang Liwen,Gao Ji-Ping,Lin Hong-Xuan.新的数量性状位点GL3.1通过调控细胞周期蛋白-T1;3控制水稻籽粒大小和产量.细胞研究.2012,22(12):1666-1680);
Zhang X,Wang J,Huang J,Lan H,Wang C,Yin C,Wu Y,Tang H,Qian Q,Li J,Zhang H.Rare allele of OsPPKL1 associated with grain length causes extra-large grain and a significant yield increase in rice.Proc Natl Acad SciUSA.2012,109(52):21534-21539(Zhang Xiaojun,Wang Jianfei,Huang Ji,Lan Hongxia,Wang Cailin,Yin Congfei,Wu Yunyu,Tang Haijuan,Qian Qian,Li Jiayang,ZhangHongsheng.与粒长相关的OsPPKL1基因的稀有等位基因产生了特大谷粒的增产水稻.美国科学院院刊.2012,109(52):21534-21539)。
三、GL7
Wang Y,Xiong G,Hu J,Jiang L,Yu H,Xu J,Fang Y,Zeng L,Xu E,Xu J,Ye W,Meng X,Liu R,Chen H,Jing Y,Wang Y,Zhu X,Li J,Qian Q.Copy number variation atthe GL7 locus contributes to grain size diversity in rice.Nat Genet.2015,47(8):944-948(Wang Yuexing,Xiong Guosheng,Hu Jiang,Jiang Liang,Yu Hong,Xu Jie,Fang Yunxia,Zeng Longjun,Xu Erbo,Xu Jing,Ye Weijun,Meng Xiangbing,LiuRuifang,Chen Hongqi,Jing Yanhui,Wang Yonghong,Zhu Xudong,Li Jiayang,QianQian.GL7位点的拷贝数变异引起了水稻籽粒大小的多样性.自然·遗传.2015,47(8):944-948)。
四、GLW7
Si L,Chen J,Huang X,Gong H,Luo J,Hou Q,Zhou T,Lu T,Zhu J,Shangguan Y,Chen E,Gong C,Zhao Q,Jing Y,Zhao Y,Li Y,Cui L,Fan D,Lu Y,Weng Q,Wang Y,ZhanQ,Liu K,Wei X,An K,An G,Han B.OsSPL13 controls grain size in cultivatedrice.Nat Genet.2016,48(4):447-456(Si Lizhen,Chen Jiaying,Huang Xuehui,GongHao,Luo Jianghong,Hou Qingqing,Zhou Taoying,Lu Tingting,Zhu Jingjie,ShangguanYingying,Chen Erwang,Gong Chengxiang,Zhao Qiang,Jing Yufeng,Zhao Yan,Li Yan,Cui Lingling,Fan Danlin,Lu Yiqi,Weng Qijun,Wang Yongchun,Zhan Qilin,LiuKunyan,Wei Xinghua,An Kyungsook,An Gynheung,Han Bin.OsSPL13基因控制栽培稻的籽粒大小.自然·遗传.2016,48(4):447-456)。
五、qTGW3
Hu Z,Lu SJ,Wang MJ,He H,Sun L,Wang H,Liu XH,Jiang L,Sun JL,Xin X,KongW,Chu C,Xue HW,Yang J,Luo X,Liu JX.A novel QTL qTGW3 encodes the GSK3/SHAGGY-like kinase OsGSK5/OsSK41 that interacts with OsARF4 to negatively regulategrain size and weight in rice.Mol Plant.2018,11(5):736-749(Hu Zejun,Lu Sun-Jie,Wang Mei-Jing,He Haohua,Sun Le,Wang Hongru,Liu Xue-Huan,Jiang Ling,SunJing-Liang,Xin Xiaoyun,Kong Wei,Chu Chengcai,Xue Hong-Wei,Yang Jinshui,LuoXiaojin,Liu Jian-Xiang.一新的QTL qTGW3编码类GSK3/SHAGGY激酶OsGSK5/OsSK41与OsARF4互作负调控水稻籽粒大小和粒重.分子植物.2018,11(5):736-749)。
因此,克隆水稻粒型基因,开发相应的分子标记,通过分子育种改良水稻粒型,可以同时提高水稻产量和改良水稻品质。
水稻GL4基因编码蛋白含有3个Kelch重复结构域。Kelch结构域的一级结构存在8个关键保守位点,包括4个疏水氨基酸、2个紧接连续的甘氨酸和隔开一段序列后的2个有固定间隔的芳香族氨基酸(Goebel SJ,Johnson GP,Perkus ME,Davis SW,Winslow JP,Paoletti E.The complete DNA sequence of vaccinia virus.Virology.1990,179(1):247-266)。该GL4蛋白与二穗短柄草的含Kelch结构域蛋白4同源性高(88%相似度)。但其功能至今尚不清楚。
发明内容
本发明要解决的技术问题在于提供一种调控水稻粒型和粒重的水稻粒型粒重基因GL4及其编码的蛋白质,并基于此开发其相关应用。
为解决上述技术问题,本发明提供基因在调控植物粒型、粒重中的应用,所述基因具有如(a)、(b)、(c)所示的序列:
(a)Seq ID No:1所示的基因组核苷酸序列;
(b)Seq ID No:2所示的cDNA核苷酸序列;
(c)在(a)、(b)所示的核苷酸序列的中添加和/或取代和/或缺失一个或几个核苷酸而生成的可编码具有调控粒型粒重功能的蛋白质的突变基因、等位基因或衍生物;
所述粒型为粒长。
作为本发明应用的改进:正调控植物(水稻)粒型、粒重,即,使得粒长增加、千粒重增加。
本发明还同时提供了上述基因编码的蛋白在调控植物粒型粒重中的应用,所述蛋白如(A)或(B)所示的序列:
(A)Seq ID No:3所示的氨基酸序列;
(B)在(A)所限定的氨基酸序列中添加和/或取代和/或缺失一个或几个氨基酸且具有相同功能的由(A)衍生的蛋白质。
其中的Seq ID No:3所示的蛋白质具有662个氨基酸,属于富含Kelch结构域(即Seq ID No:3中的第128~172位、第180~233位、第236~288位)的新蛋白。
其中Seq ID No:1所示的93-11基因组核苷酸序列共有6164个核苷酸,Seq ID No:2所示的93-11cDNA序列共有1989个核苷酸(包括终止子TAG)。
为了验证候选基因GL4的功能,构建了基因互补载体;所述基因互补载体为pCAMBIA1300载体;所述用于构建互补载体的基因引物为:
com-F:aataagcttACTAATTACATGGAATGCGTGTAAATTG;
com-R:agaagcttCATTTCGTACGGAGGAGAACAACTG;
进一步地,由于上述基因属于正调控基因,因此可通过过表达来提高水稻粒长粒重;所述基因过表达载体为pCAMBIA1300S载体;所述用于构建过表达载体的基因引物为:
over-F:aagggtaccATGGGGAAGAAGCAGAAGAAGCC;
over-R:tgctctagaTCGAACCAAAGCTATCTCATGCC;
此处的应用,主要是将基因过表达通过转基因的方法转入水稻中以提高水稻的粒长粒重。
另一方面,本发明提供了一种与水稻粒型粒重基因紧密连锁的分子标记,所述分子标记为P4-1、P4-2、P4-3、P4-4、P4-5、P4-6、P4-7、4-8或P4-9;所述各分子标记对应的引物序列分别为:
P4-1:
F,5’-TGGGTCTTCAAAAAATGTTCAGTGG-3’
R,5’-ACCCCGCCTAAACTCCATGAATC-3’;
P4-2:
F,5’-TATAGATTCATCGTACTAAGGC-3’
R,5’-TGAGATTTATTGTTTTGTGTG-3’;
P4-3:
F,5’-AGAAGTAGTGCAGAGTACAGTC-3’
R,5’-AGTACTCCTATCCTTTAATAATATG-3’;
P4-4:
F,5’-ATACGTAGCGTTTGGTTATAGC-3’
R,5’-TTCGGTTTTGAACTCAACTTC-3’;
P4-5:
F,5’-TGCTTGAAGAGGAGAATGGTGG-3’
R,5’-AGCTCCTGAGTTCCTTGCGTC-3’;
P4-6:
F,5’-TTACATTATCGAATTATGCACGATAC-3’
R,5’-TGATACCCGAACTTCCTGACTG-3’;
P4-7:
F,5’-TGTGGCAGAATTGTGGGACAC-3’
R,5’-ACTTTATATCTGATTTAGGCACGTTTAC-3’;
P4-8:
F,5’-TAGATTGGTTTTTATGAAACG-3’
R,5’-TGCTGTCACAGTTTATCACAC-3’;
P4-9:
F,5’-TCCTACGATTTCTCAATCCTG-3’
R,5’-TGAATTCCTTCAATTTTAGAGC-3’。
再一方面,本发明提供了上述与水稻粒型粒重基因紧密连锁的分子标记在分子标记辅助选择育种中的应用,所述应用为辅助选择与水稻粒型粒重相关性状。
上述分子标记是在进行水稻粒型粒重基因定位克隆过程中得到的与其紧密连锁的分子标记,因此,可通过上述获得的分子标记进行分子标记辅助选择育种,如筛选、鉴定与水稻粒型粒重相关性状。
实现本发明的具体技术步骤如下:
一、GL4基因的精细定位和候选基因确定:
利用粒长和粒重上存在显著差异的亲本培矮64s与93-11(图1,图2)构建了大规模的重组自交系(RIL)群体。结合RIL核心群体的高密度遗传图谱,我们检测到水稻第4号染色体上新的控制稻谷粒长和粒重的QTL——qGL4(图3a)。利用培矮64s为轮回亲本的大规模BC4F2群体将该QTL进一步精细定位于两个插入缺失(INDEL)标记P4-7和P4-8之间约9.3kb的物理距离内(图3b),其中只有1个开放阅读框(ORF),编码产物与含Kelch结构域的未知功能蛋白同源性高,暂命名为GL4。DNA测序发现,亲本培矮64s和93-11的GL4编码区1989bp内存在2个引起氨基酸改变的单核苷酸多态(SNP)(图3c)。
二、GL4基因的鉴定和功能分析:
通过转基因技术,结果表明本发明获得了粒长和千粒重显著增加的转基因互补培矮64s水稻植株和过表达培矮64s水稻植株(图5–图8),并且过表达培矮64s水稻植株的基因转录水平表达量显著升高(图9),证明了本发明正确克隆了GL4基因。氨基酸序列分析表明GL4编码包含Kelch结构域蛋白,对93-11背景的GL4基因为培矮64s类型的近等基因系NIL-GL4培矮64s和93-11的籽粒观察发现,粒长和千粒重也显著减小(图10;图11),证明该基因编码蛋白调控水稻粒型。
本发明采用图位克隆技术利用水稻BC4F2群体首次在水稻中克隆到粒型粒重基因GL4并通过转基因互补实验和过表达实验以及近等基因系鉴定了基因的功能。GL4基因的克隆和应用,可有效调控水稻粒型粒重和产量,GL4基因也可应用于其它单子叶植物中用于调控粒型粒重和产量。同时本发明有助于水稻粒型粒重调控机理的阐明,并为水稻高产优质育种奠定扎实的理论基础。
综上,公开了水稻粒型粒重基因GL4(Seq ID No:1、2)、其编码的蛋白质(Seq IDNo:3)及上述基因或蛋白质在调控植物粒型和粒重中的应用。本发明还公开了与上述基因紧密连锁的分子标记及其应用。GL4基因的克隆和应用,可通过调控水稻粒型粒重,有效提高水稻的产量。
附图说明
下面结合附图对本发明的具体实施方式作进一步详细说明。
图1是水稻品种培矮64s和93-11的粒型表型;其中a为培矮64s的籽粒,b为93-11的籽粒,c-1cm标尺;
图2是水稻品种培矮64s和93-11的粒长和千粒重比较;a平均值±标准差(n=100),b平均值±标准差(n=3);
图3GL4基因的定位图;其中竖线上方标注分子标记,竖线下方数字代表交换个体数,ATG和TAG分别表示GL4基因的起始子和终止子;
图4pCAMBIA1300-GL4互补载体图谱(a)和pCAMBIA1300S-GL4过表达载体图谱(b);
图5是培矮64s和GL4互补T0代转基因水稻的粒型表型;其中a为培矮64s的籽粒,b、c、d为转GL4互补株系的籽粒,e-1cm标尺;
图6是培矮64s和GL4过表达T0代转基因水稻的粒型表型;其中a为培矮64s的籽粒,b、c、d为转GL4过表达株系的籽粒,c-1cm标尺;
图7是培矮64s和转GL4互补载体株系的粒长和千粒重比较;a平均值±标准差(n=100),b平均值±标准差(n=3);
图8是培矮64s与转GL4过表达载体株系的粒长和千粒重比较;a平均值±标准差(n=100),b平均值±标准差(n=3);
图9是培矮64s与转GL4过表达载体株系的GL4转录水平的表达比较;平均值±标准差(n=3);
图10是93-11和近等基因系NIL-GL4培矮64s的粒型表型;其中a为93-11的籽粒,b为NIL-GL4培矮64s的籽粒,c-1cm标尺;
图11是93-11和近等基因系NIL-GL4培矮64s的粒长和千粒重比较;a平均值±标准差(n=100),b平均值±标准差(n=3)。
具体实施方式
下面结合具体实施例对本发明进行进一步描述,但本发明的保护范围并不仅限于此:
实施例1:
1、水稻材料
籼稻品种为(Oryza sativa L.indica)“93-11”和“培矮64s”,93-11背景的GL4基因替换为培矮64s类型的近等基因系NIL-GL4培矮64s。
近等基因系NIL-GL4培矮64s(93-11遗传背景下INDEL标记P4-6~P4-8之间被培矮64s片段代换)由染色体片段代换系CSSL-qGL4与93-11连续回交4代并自交结合INDEL标记P4-1~P4-9辅助筛选得到。染色体片段代换系CSSL-qGL4是由培矮64s与93-11连续回交5代并自交的BC5F2筛选得到,步骤详见Zhang B,Shang L,Ruan B,Zhang A,Yang S,Jiang H,Liu C,Hong K,Lin H,Gao Z,Hu J,Zeng D,Guo L,Qian Q.Development of three sets ofhigh-throughput genotyped rice chromosome segment substitution lines and QTLmapping for eleven traits.Rice.2019,12(1):33。
2、INDEL标记精细定位GL4基因
采用水稻微量DNA的快速提取方法从水稻叶片中提取用于基因定位的基因组DNA。取0.2g水稻叶片,液氮冷冻研磨成粉状,转移至1.5ml离心管里提取DNA,获得的DNA沉淀溶解于150μl超纯水中。每一PCR反应用2μl DNA样品。
GL4基因的初定位:将培矮64s与93-11杂交,F1代开始自交单粒传13代产生RIL群体。结合RIL核心群体的高密度遗传图谱,检测到水稻第4号染色体上新的控制稻谷粒型和粒重的QTL——qGL4(图3a)。
GL4基因的精细定位:利用培矮64s为轮回亲本的大规模BC4F2群体,同时在目标区间根据培矮64s和93-11的参考序列设计引物,筛选出具有多态的引物(表1),将该QTL进一步精细定位于两个INDEL标记P4-7和P4-8之间约9.3kb的物理距离内(图3b)。
3、基因预测和比较分析:
通过RGAP网站(http://rice.plantbiology.msu.edu/)分析,该区间内存在1个ORF,基因组测序发现,亲本培矮64s的如Seq ID No:4的基因组核苷酸序列与93-11的如SeqID No:1的基因组核苷酸序列间共存在11个SNP和1个INDEL,包括如Seq ID No:2的编码区1989bp内的2个引起氨基酸改变的SNP:SNP1为93-11的A175变为培矮64s的G175(由异亮氨酸变为缬氨酸),SNP2为93-11的C841变为培矮64s的T841(由精氨酸变为半胱氨酸)(图3c)。因此,该ORF可能是该性状的候选基因。
表1定位发展的分子标记
*F:正向引物;R:反向引物。
本发明的序列表中:
Seq ID No:1为93-11的GL4基因的基因组核苷酸序列(6164bp);
Seq ID No:2为93-11的GL4基因的cDNA核苷酸序列(1989bp);
Seq ID No:3为93-11的GL4基因编码的蛋白质的氨基酸序列(662aa);
Seq ID No:4为培矮64s的GL4基因的基因组核苷酸序列(6156bp);
Seq ID No:5为93-11的GL4基因起始子ATG前的包含启动子的序列(2237bp);
Seq ID No:6为93-11的GL4基因终止子TAG后的序列(1318bp)。
实施例2:
植物转化和功能分析:
为了验证该ORF为候选基因,用PCR扩增93-11的包含Seq ID No:5所示的起始子ATG前2237个核苷酸、Seq ID No:1所示的93-11基因组核苷酸序列共6164个核苷酸、Seq IDNo:6所示的终止子TAG后1318个核苷酸共9719个核苷酸,将该序列连接到常规双元表达载体pCAMBIA1300中(图4a),所得命名为pCAMBIA1300-GL4;
用PCR扩增93-11的如Seq ID No:2所示的cDNA核苷酸序列(从起始子ATG至终止子TAG共1989bp),然后将该序列连接到常规双元表达载体pCAMBIA1300S中(图4b),所得命名为pCAMBIA1300S-GL4。
通过农杆菌介导的植物转化方法将此载体(即,上述pCAMBIA1300-GL4、pCAMBIA1300S-GL4)分别转入培矮64s,各得到转基因植株3株,pCAMBIA1300-GL4载体对应所得称为基因互补系植株,pCAMBIA1300S-GL4载体对应所得称为基因过表达系植株。可委托武汉伯远生物科技有限公司完成。
按照常规水稻栽培方式在转基因圃内进行种植直至收获。所有转基因互补T0代植株和过表达T0代植株的表型均为粒长和千粒重显著增加(图5,图6,图7,图8),并且过表达T0代植株的基因转录水平的表达量显著升高(图9)。由此说明,该ORF就是控制粒型和粒重性状的候选基因GL4。
最后,还需要注意的是,以上列举的仅是本发明的若干个具体实施例。显然,本发明不限于以上实施例,还可以有许多变形。本领域的普通技术人员能从本发明公开的内容直接导出或联想到的所有变形,均应认为是本发明的保护范围。
序列表
<110> 中国水稻研究所
<120> 水稻粒型粒重相关基因、蛋白、分子标记及应用
<160> 6
<170> SIPOSequenceListing 1.0
<210> 1
<211> 6164
<212> DNA
<213> 水稻(Oryza sativa)
<400> 1
atggggaaga agcagaagaa gcccaggaag gggaaggaga agacggagcg gaagacggcc 60
aagggcgagg agaagcgcgc ccgccgcgag gcccggaagg tcggcgagga ggacgacatc 120
gacgccatcc tcgtacgtgt gctccctccc tcccgccccc ctcctctgcg tcagctcttc 180
acgctcgctc agtgcgctct agctcgatcg gcgtctccat gtgcggtttt gtttgctcac 240
caccaccttc tgcgtctcgt gatgcactcc ggtggctgaa aaattggaag cgattttcgc 300
actggctcac ctttttccct tcacatttcg ttgcgtagca gctatggatt ttagatgagt 360
ttggtgttgc tatgtgtgct gtttgaactt tttttttttg ctgatttatc tatttgctct 420
ggttattttt cattgcagag gagcatacaa aaggaggagg ctaagaagaa ggaggtacat 480
atagatgaga atgtccctgc accatctccc cggtccaatt gctcggtaag acattttaga 540
gcaagtgcca gctaaagaag ttaatctttc ggtattcttg tgattatgta cttgaagtgg 600
actaggtatc atttttatgc ttgtggtggt catgtttgca tgcttagaat ttatatactc 660
cagtaggatt aataaatttc ttcaggcaag atggattttt ttatgaagta atttgatcaa 720
atatgatgat cttttggtga tactgaaccg attggttttt cagttccgga taggtgatgt 780
caaattacaa ttcaggggca aaatgtgtta aaagatagat gttcgttttg tttttttttt 840
tgttaactgt tggaaaaagt ttttgatgtt gtacagagat cctccctcta aatgttaact 900
ataacaaata aacctgcata tgatcttcca gctgaccaag ttaattcctt cttaaatgca 960
gcttacaata aatcccctga aagatacaga attggttctg tatggaggag agttctacaa 1020
tggcagcaag gtgggagaca tatccttcac gatttcactg ctggttgaat gagatatgtt 1080
catctagtgt tctttttgcc tctattctgc ttctagaggt ttttgagtac tgaaatagta 1140
tttcttttcc tcgccacaat ctatttctgc agacctttgt ttatggtgat ctttatcgct 1200
acgatgtaga gaaaaatgag tggaagttgg tatctagtcc taacagtcct cctccacgaa 1260
gtgctcacca aacagttgcc tggaagaata atatatacat gtttggtaac ataacttaac 1320
tttgggaagg cattctatga gttaaaatgc tttttcagta tgtatgattt agttttttat 1380
tctgtgttga tgtttcaggt ggggaattca cttcgccaaa ccaagaacgt tttcatcatt 1440
acaaggtaga agactaattt tgtcagtcta tttcttgtga ctgttttgag tatcttctct 1500
tggcacaagg catttgcaca attcgtatat aagcagtagc ttcataagca atacatatct 1560
ggcatgattt tttttcaatt ttaaaccaag agataggtac tgattcccat gttcttacat 1620
aattataagt tgaaatatga ctaatggaga ctaatatgtg cagttcttct gctatttatt 1680
gatttaagat taagatgtag aggatgcggg gttcacctgc ccttttattt ggatttgcac 1740
atataacatc actttttatt cctgattgcc tttttttttc ctttctgttg tgatattaat 1800
ggggttattt cgagattagc tatcttagat ggaaaactca ttattatggt tctattcaaa 1860
tttctggtct gattttgtac tgtaggactt ttggtcattg gatctaaaaa caaatcaatg 1920
ggagcaaatt cttgcgaagg gttgtccaag tgcacgttca gggcacagga tggttagtgg 1980
ttttacatta aaatcagtca tcaactattc tgctcccctc ttttcacctt aatatttctg 2040
tattatgagc agtagtaaat tgtgttcttt ttcctacagg tcctctataa gcacaagatc 2100
gtgctatttg gtggttttta tgacactctt agggaagtga ggttagtaca gttacatttt 2160
atatgactct accctggtaa tcttgttatt agagtaacat ttatatttgg accatgtcta 2220
gaagtagtgg agtcattatg tccaaactac aaattatgca ctacctgaat aacagtgtag 2280
cactcttaca gctgatcatc cgcaaagaat gaaatggtgt ggcgtagagt acgttctgaa 2340
taaatagtgt gacatgacat gatctgatct gcattttttt tataatatcc ttctgcagat 2400
actacaatga cttacatgtt tttgatttag ataatttcaa ggtgagtaca ccatgttaat 2460
attttgttta atactgttag tagtaacaca tgaagtcatt tattttaata ctcttactgg 2520
gaatatttgt atttcagtgg gaggagatca agcctcgccc tgggtgcttg tggccaagtc 2580
caagaagtgg ctttcagcta atggtatacc aagatcaggt aggtcttttt ggatttaaag 2640
ctaggacatt gatacttcat aaaaagagtt taaattaact ataaaccaac cttgtcttcg 2700
actacttttg tttacaagag tattaatggc ccttatttct gtagatatat ctgtatggcg 2760
gatattttaa agaagtagtt tcttctgaca aatctgcatc agaaaaagga acagttcatg 2820
cagatatgtg gactcttgat cctcgtactt gggagtggaa taaggtgatc tcttgcaatt 2880
ttttagaaca ttgtatcaac ttccatcatg atagtgtatc gagttttact ttaagccata 2940
tatccactga gtgatttgca tattattacc ttcacttgat ttcttaatag gttaagaaaa 3000
ctgggatgcc acctggcccc agagctgggt tttctatgtg cgttcacaag aaaagggctg 3060
ttcttttcgg tggtgtggta gatatggaaa ttgaaggtta ttttcagctc aattttgctc 3120
tgtgcatagc tacttaggtt attttactaa gtatttgaaa taccacgcgt gtcaagtttg 3180
ttccttttct gtagaagttc tcaaggccta actgtagaac ccaattttgt gattgcaggg 3240
gatgtcatta tgagcatgtt tatgaatgag ctctatggtt tccagctgga caaccatcgc 3300
tggtatactt caatactcca tttgagtatc ttgtgttttt aagtaacaca ggctagtttt 3360
atccctgctt attttgttcc ttgctatttt tgtgtattgt tgcaactttc tttattatta 3420
tttaacagtg cagctaacta atgtcacttc ctattccgat atgcaaactg cttctaacta 3480
ggtttaaaat tattaaaggt ctaaatcttt ctcggctgat tgatggttat gttcatctat 3540
cgtcttagga tgaactttgt ttgtaatctt gtggttatct agataacata actactttga 3600
gaattgttca gtgatattat tgtttactct tgggatccct tcgtagcata ttattgttta 3660
cttcagcgga attgttcagc gatgttcatt ttggttgaaa ctactggtcc acggctcaca 3720
tattctccaa tttcaattgt ccacactggt gcatagatga gaaatgttat gctttcttat 3780
ttagcttcat ttttgtgtgt ctatgttcag attattttcc ttgctgtttc aggtatcctt 3840
tagagctcag gaaagacaag cctgctaaaa ataaggtgat tcctcgaatt caatcatatg 3900
accagccatc tcatgtttac agtttgtatt ttttaggtac ttgatacaca gtctactctg 3960
caggaatcta tttcattagt tcagacataa agactaagca tgtagtatag ggtatgcctg 4020
tttttctttt taataagcat atacgatatg tacatattct catatactat tatactgtta 4080
ttttcttaat aagcatatgc gatctgtcga catattcaca ttgactatta tatagtaata 4140
tttggatcat tgcttaaaag cagtttgtgc ttcttttttt ttttttgggt gccatgtgta 4200
gacaaaggac atcaaaagaa aagaaccatc gaacaatgtg gaagataatc ttggtaatga 4260
ggaggatgag atcatggagg actcagaaac tactggaggg caatccgaag tccatggggt 4320
ttcgaatcac ttgaccaaga gtctaacctt aaataaagct ggctcaggca atagctctga 4380
tattctctct gattcgacaa cacaagaagt actcccagag gtattgcagc tgttctttta 4440
gatgttgaca tttacattct aatgatcttt tgtttctcat tagcatttgc tgcttacagg 4500
cagtgaaacc cggtggtcgg atcaatgcat gcttggctgt agggaaagat acactctatt 4560
tatatggagg aatgatggaa ttgaaagata gagaaattac tcttgatgat atgtattcac 4620
ttaaccttag caaactagat gagtggaagt gtatcatacc ggtcagttgc agattggccc 4680
cttctttttg ccattttgtt gtttaactga tagtgttgtt tatttcaatc agagataaca 4740
gaaaattatc tgttatcatt ttttactatt cacattggtt tctgaacttg ccttactcac 4800
ctttcttatg caggcatctg aatctgaatg gctagaaatt tctgaagatg aggatgatga 4860
agatgatgat gatgatgata atgagaatga tagcgaggat gacgctaatc agaccgatga 4920
agatgatgaa gaggtatgca aaattatttt aggtttggtc acacattttt gggatttata 4980
tcttgctaag ttcatgatta atggctgtac tagatagaat ctttctaagt tcgcgtgggc 5040
gaggacttta tatctgattt aggcacgttt acattttctc tacaaattag aacagatttt 5100
caaaaaatgt tttttaagaa aatggggaag ataatgttga cttgatgtgt cccacaattc 5160
tgccacaaac caaacatcct gtctctggcc tgtttgtttc tagttgaatc ttggtgtttg 5220
accaaaatac tgcatgatgg ttcatcttct attacggata ctgtatacgt taatatgaag 5280
tccatggttc tcatggcatc cttctgagat ttatagctat tgtgtatctt tcatttcctc 5340
tcaatcatac tgtgtggtta ttaatctgta atcctaaaac tgttttcata gtctgatgaa 5400
gatgccgaga agaatgtcga tatgtccact gctgtatcgc taataaaggg tgaacgtaag 5460
aacttgcgaa gaaaagagaa gcgtgctcgg atagagcaaa ttcgggttat gctcggtctt 5520
tctgattctc aaaggactcc aatggtgatg ttgtaatcaa catttttttt gttctaaatt 5580
tgtttgaagt tgttccgaca aagtacatat actttgttta ctcagaggaa ctcttggctg 5640
ataatttgtt acacacagtt aacaattaaa accatatatc actaattccc atattcacac 5700
ttttaaagcc aggagagtca ctaaaagatt tctacaagag aacggatatg tactggcaga 5760
tggctgcata tgagcacact caacacactg gaaaggttag tttctgctcc ttaagtatct 5820
tcacccgtca tacctgttat catattctct aggttgctgg cagtatgagt ttgctgtatt 5880
tattcgtgct catccaatgc caggagctcc gcaaagatgg ttttgatctt gccgaaactc 5940
gatataagga actgaaaccc atactcgacg aggtaaaatt gtcatgttgt gtcccctttg 6000
agacaaaacg gtatttctga cttggtacat attaactgac tcttacacgc cctcttcagc 6060
tggctgtgct cgaggctgaa cagaaagctg aggaagaggc tagtgcttcc actagttcca 6120
agaaagacac gaagaaaagc aagcagaaga gtggcatgag atag 6164
<210> 2
<211> 1989
<212> DNA
<213> 水稻(Oryza sativa)
<400> 2
atggggaaga agcagaagaa gcccaggaag gggaaggaga agacggagcg gaagacggcc 60
aagggcgagg agaagcgcgc ccgccgcgag gcccggaagg tcggcgagga ggacgacatc 120
gacgccatcc tcaggagcat acaaaaggag gaggctaaga agaaggaggt acatatagat 180
gagaatgtcc ctgcaccatc tccccggtcc aattgctcgc ttacaataaa tcccctgaaa 240
gatacagaat tggttctgta tggaggagag ttctacaatg gcagcaagac ctttgtttat 300
ggtgatcttt atcgctacga tgtagagaaa aatgagtgga agttggtatc tagtcctaac 360
agtcctcctc cacgaagtgc tcaccaaaca gttgcctgga agaataatat atacatgttt 420
ggtggggaat tcacttcgcc aaaccaagaa cgttttcatc attacaagga cttttggtca 480
ttggatctaa aaacaaatca atgggagcaa attcttgcga agggttgtcc aagtgcacgt 540
tcagggcaca ggatggtcct ctataagcac aagatcgtgc tatttggtgg tttttatgac 600
actcttaggg aagtgagata ctacaatgac ttacatgttt ttgatttaga taatttcaag 660
tgggaggaga tcaagcctcg ccctgggtgc ttgtggccaa gtccaagaag tggctttcag 720
ctaatggtat accaagatca gatatatctg tatggcggat attttaaaga agtagtttct 780
tctgacaaat ctgcatcaga aaaaggaaca gttcatgcag atatgtggac tcttgatcct 840
cgtacttggg agtggaataa ggttaagaaa actgggatgc cacctggccc cagagctggg 900
ttttctatgt gcgttcacaa gaaaagggct gttcttttcg gtggtgtggt agatatggaa 960
attgaagggg atgtcattat gagcatgttt atgaatgagc tctatggttt ccagctggac 1020
aaccatcgct ggtatccttt agagctcagg aaagacaagc ctgctaaaaa taagacaaag 1080
gacatcaaaa gaaaagaacc atcgaacaat gtggaagata atcttggtaa tgaggaggat 1140
gagatcatgg aggactcaga aactactgga gggcaatccg aagtccatgg ggtttcgaat 1200
cacttgacca agagtctaac cttaaataaa gctggctcag gcaatagctc tgatattctc 1260
tctgattcga caacacaaga agtactccca gaggcagtga aacccggtgg tcggatcaat 1320
gcatgcttgg ctgtagggaa agatacactc tatttatatg gaggaatgat ggaattgaaa 1380
gatagagaaa ttactcttga tgatatgtat tcacttaacc ttagcaaact agatgagtgg 1440
aagtgtatca taccggcatc tgaatctgaa tggctagaaa tttctgaaga tgaggatgat 1500
gaagatgatg atgatgatga taatgagaat gatagcgagg atgacgctaa tcagaccgat 1560
gaagatgatg aagagtctga tgaagatgcc gagaagaatg tcgatatgtc cactgctgta 1620
tcgctaataa agggtgaacg taagaacttg cgaagaaaag agaagcgtgc tcggatagag 1680
caaattcggg ttatgctcgg tctttctgat tctcaaagga ctccaatgcc aggagagtca 1740
ctaaaagatt tctacaagag aacggatatg tactggcaga tggctgcata tgagcacact 1800
caacacactg gaaaggagct ccgcaaagat ggttttgatc ttgccgaaac tcgatataag 1860
gaactgaaac ccatactcga cgagctggct gtgctcgagg ctgaacagaa agctgaggaa 1920
gaggctagtg cttccactag ttccaagaaa gacacgaaga aaagcaagca gaagagtggc 1980
atgagatag 1989
<210> 3
<211> 662
<212> PRT
<213> 水稻(Oryza sativa)
<400> 3
Met Gly Lys Lys Gln Lys Lys Pro Arg Lys Gly Lys Glu Lys Thr Glu
1 5 10 15
Arg Lys Thr Ala Lys Gly Glu Glu Lys Arg Ala Arg Arg Glu Ala Arg
20 25 30
Lys Val Gly Glu Glu Asp Asp Ile Asp Ala Ile Leu Arg Ser Ile Gln
35 40 45
Lys Glu Glu Ala Lys Lys Lys Glu Val His Ile Asp Glu Asn Val Pro
50 55 60
Ala Pro Ser Pro Arg Ser Asn Cys Ser Leu Thr Ile Asn Pro Leu Lys
65 70 75 80
Asp Thr Glu Leu Val Leu Tyr Gly Gly Glu Phe Tyr Asn Gly Ser Lys
85 90 95
Thr Phe Val Tyr Gly Asp Leu Tyr Arg Tyr Asp Val Glu Lys Asn Glu
100 105 110
Trp Lys Leu Val Ser Ser Pro Asn Ser Pro Pro Pro Arg Ser Ala His
115 120 125
Gln Thr Val Ala Trp Lys Asn Asn Ile Tyr Met Phe Gly Gly Glu Phe
130 135 140
Thr Ser Pro Asn Gln Glu Arg Phe His His Tyr Lys Asp Phe Trp Ser
145 150 155 160
Leu Asp Leu Lys Thr Asn Gln Trp Glu Gln Ile Leu Ala Lys Gly Cys
165 170 175
Pro Ser Ala Arg Ser Gly His Arg Met Val Leu Tyr Lys His Lys Ile
180 185 190
Val Leu Phe Gly Gly Phe Tyr Asp Thr Leu Arg Glu Val Arg Tyr Tyr
195 200 205
Asn Asp Leu His Val Phe Asp Leu Asp Asn Phe Lys Trp Glu Glu Ile
210 215 220
Lys Pro Arg Pro Gly Cys Leu Trp Pro Ser Pro Arg Ser Gly Phe Gln
225 230 235 240
Leu Met Val Tyr Gln Asp Gln Ile Tyr Leu Tyr Gly Gly Tyr Phe Lys
245 250 255
Glu Val Val Ser Ser Asp Lys Ser Ala Ser Glu Lys Gly Thr Val His
260 265 270
Ala Asp Met Trp Thr Leu Asp Pro Arg Thr Trp Glu Trp Asn Lys Val
275 280 285
Lys Lys Thr Gly Met Pro Pro Gly Pro Arg Ala Gly Phe Ser Met Cys
290 295 300
Val His Lys Lys Arg Ala Val Leu Phe Gly Gly Val Val Asp Met Glu
305 310 315 320
Ile Glu Gly Asp Val Ile Met Ser Met Phe Met Asn Glu Leu Tyr Gly
325 330 335
Phe Gln Leu Asp Asn His Arg Trp Tyr Pro Leu Glu Leu Arg Lys Asp
340 345 350
Lys Pro Ala Lys Asn Lys Thr Lys Asp Ile Lys Arg Lys Glu Pro Ser
355 360 365
Asn Asn Val Glu Asp Asn Leu Gly Asn Glu Glu Asp Glu Ile Met Glu
370 375 380
Asp Ser Glu Thr Thr Gly Gly Gln Ser Glu Val His Gly Val Ser Asn
385 390 395 400
His Leu Thr Lys Ser Leu Thr Leu Asn Lys Ala Gly Ser Gly Asn Ser
405 410 415
Ser Asp Ile Leu Ser Asp Ser Thr Thr Gln Glu Val Leu Pro Glu Ala
420 425 430
Val Lys Pro Gly Gly Arg Ile Asn Ala Cys Leu Ala Val Gly Lys Asp
435 440 445
Thr Leu Tyr Leu Tyr Gly Gly Met Met Glu Leu Lys Asp Arg Glu Ile
450 455 460
Thr Leu Asp Asp Met Tyr Ser Leu Asn Leu Ser Lys Leu Asp Glu Trp
465 470 475 480
Lys Cys Ile Ile Pro Ala Ser Glu Ser Glu Trp Leu Glu Ile Ser Glu
485 490 495
Asp Glu Asp Asp Glu Asp Asp Asp Asp Asp Asp Asn Glu Asn Asp Ser
500 505 510
Glu Asp Asp Ala Asn Gln Thr Asp Glu Asp Asp Glu Glu Ser Asp Glu
515 520 525
Asp Ala Glu Lys Asn Val Asp Met Ser Thr Ala Val Ser Leu Ile Lys
530 535 540
Gly Glu Arg Lys Asn Leu Arg Arg Lys Glu Lys Arg Ala Arg Ile Glu
545 550 555 560
Gln Ile Arg Val Met Leu Gly Leu Ser Asp Ser Gln Arg Thr Pro Met
565 570 575
Pro Gly Glu Ser Leu Lys Asp Phe Tyr Lys Arg Thr Asp Met Tyr Trp
580 585 590
Gln Met Ala Ala Tyr Glu His Thr Gln His Thr Gly Lys Glu Leu Arg
595 600 605
Lys Asp Gly Phe Asp Leu Ala Glu Thr Arg Tyr Lys Glu Leu Lys Pro
610 615 620
Ile Leu Asp Glu Leu Ala Val Leu Glu Ala Glu Gln Lys Ala Glu Glu
625 630 635 640
Glu Ala Ser Ala Ser Thr Ser Ser Lys Lys Asp Thr Lys Lys Ser Lys
645 650 655
Gln Lys Ser Gly Met Arg
660
<210> 4
<211> 6156
<212> DNA
<213> 水稻(Oryza sativa)
<400> 4
atggggaaga agcagaagaa gcccaggaag gggaaggaga agacggagcg gaagacggcc 60
aagggcgagg agaagcgcgc ccgccgcgag gcccggaagg tcggcgagga ggacgacatc 120
gacgccatcc tcgtacgtgt gctccctccc tcccgccccc ctcctctgcg tcagctcttc 180
acgctcgctc agtgcgctct agctcgatcg gcgtctccat gtgcggtttt gtttgctcac 240
caccaccttc tgcgtctcgt gatgcactcc ggtggctgaa aaattggaag cgattttcgc 300
actggctcac ctttttccct tcacatttcg ttgcgtagca gctatggatt ttagatgagt 360
ttggtgttgc tatgtgtgct gtttgaactt tttttttttg ctgatttatc tatttgctct 420
ggttattttt cattgcagag gagcatacaa aaggaggagg ctaagaagaa ggaggtacat 480
gtagatgaga atgtccctgc accatctccc cggtccaatt gctcggtaag acattttaga 540
gcaagtgcca gctaaagaag ttaatctttc ggtattcttg tgattatgta cttgaagtgg 600
actaggtatc atttttatgc ttgtggtggt catgtttgca tgcttagaat ttatatactc 660
cagtaggatt aataaatttc ttcaggcaag atggattttt ttatgaagta atttgatcaa 720
atatgatgat cttttggtga tactgaaccg attggttttt cagttccgga taggtgatgt 780
caaattacaa ttcaggggca aaatgtgtta aaagatagat gttcgttttg tttttttttt 840
tgttaactgt tggaaaaagt ttttgatgtt gtacagagat cctccctcta aatgttaact 900
ataacaaata aacctgcata tgatcttcca gctgaccaag ttaattcctt cttaaatgca 960
gcttacaata aatcccctga aagatacaga attggttctg tatggaggag agttctacaa 1020
tggcagcaag gtgggagaca tatccttcac gatttcactg ctggttgaat gagatatgtt 1080
catctagtgt tctttttgcc tctattctgc ttctagaggt ttttgagtac tgaaatagta 1140
tttcttttcc tcgccacaat ctatttctgc agacctttgt ttatggtgat ctttatcgct 1200
acgatgtaga gaaaaatgag tggaagttgg tatctagtcc taacagtcct cctccacgaa 1260
gtgctcacca aacagttgcc tggaagaata atatatacat gtttggtaac ataacttaac 1320
tttgggaagg cattctatga gttaaaatgc tttttcagta tgtatgattt agttttttat 1380
tctgtgttga tgtttcaggt ggggaattca cttcgccaaa ccaagaacgt tttcatcatt 1440
acaaggtaga agactaattt tgtcagtcta tttcttgtga ctgttttgag tatcttctct 1500
tggcacaagg catttgcaca attcgtatat aagcagtagc ttcataagca atacatatct 1560
ggcatgattt tttttcaatt ttaaaccaag agataggtac tgattcccat gttcttacat 1620
aattataagt tgaaatatga ctaatggaga ctaatatgtg cagttcttct gctatttatt 1680
gatttaagat taagatgtag aggatgcggg gttcacctgc ccttttattt ggatttgcac 1740
atataacatc actttttatt cctgattgcc tttttttttc ctttctgttg tgatattaat 1800
ggggttattt cgagattagc tatcttagat ggaaaactca ttattatggt tctattcaaa 1860
tttctggtct gattttgtac tgtaggactt ttggtcattg gatctaaaaa caaatcaatg 1920
ggagcaaatt cttgcgaagg gttgtccaag tgcacgttca gggcacagga tggttagtgg 1980
ttttacatta aaatcagtca tcaactattc tgctcccctc ttttcacctt aatatttctg 2040
tattatgagc agtagtaaat tgtgttcttt ttcctacagg tcctctataa gcacaagatc 2100
gtgctatttg gtggttttta tgacactctt agggaagtga ggttagtaca gttacatttt 2160
atatgactct accctggtaa tcttgttatt agagtaacat ttatatttgg accatgtcta 2220
gaagtagtgg agtcattatg tccaaactac aaattatgca ctacctgaat aacagtgtag 2280
cactcttaca gctgatcatc cgcaaagaat gaaatggtgt ggcgtagagt acgttctgaa 2340
taaatagtgt gacatgacat gatctgatct gcattttttt tataatatcc ttctgcagat 2400
actacaatga cttacatgtt tttgatttag ataatttcaa ggtgagtaca ccatgttaat 2460
attttgttta atactgttag tagtaacaca tgaagtcatt tattttaata ctcttactgg 2520
gaatatttgt atttcagtgg gaggagatca agcctcgccc tgggtgcttg tggccaagtc 2580
caagaagtgg ctttcagcta atggtatacc aagatcaggt aggtcttttt ggatttaaag 2640
ctaggacatt gatacttcat aaaaagagtt taaattaact ataaaccaac cttgtcttcg 2700
actacttttg tttacaagag tattaatggc ccttatttct gtagatatat ctgtatggcg 2760
gatattttaa agaagtagtt tcttctgaca aatctgcatc agaaaaagga acagttcatg 2820
cagatatgtg gactcttgat ccttgtactt gggagtggaa taaggtgatc tcttgcaatt 2880
ttttagaaca ttgtatcaac ttccatcatg atagtgtatc gagttttact ttaagccata 2940
tatccactga gtgatttgca tattattacc ttcacttgat ttcttaatag gttaagaaaa 3000
ctgggatgcc acctggcccc agagctgggt tttctatgtg cgttcacaag aaaagggctg 3060
ttcttttcgg tggtgtggta gatatggaaa ttgaaggtta ttttcagctc aattttgctc 3120
tgtgcatagc tacttaggtt attttactaa gtatttgaaa taccacgtgt gtcaagtttg 3180
ttccttttct gtagaagttc tcaaggccta actgtagaac ccaattttgt gattgcaggg 3240
gatgtcatta tgagcatgtt tatgaatgag ctctatggtt tccagctgga caaccatcgc 3300
tggtatactt caatactcca tttgagtatc ttgtgttttt aagtaacaca ggctagtttt 3360
atccctgctt attttgttcc ttgctatttt tgtgtattgt tgcaactttc tttattatta 3420
tttaacagtg cagctaacta atgtcacttc ctattccgac atgcaaactg cttctaacta 3480
ggtttaaaat tattaaaggt ctaaatcttt ctcggctgat tgatggttat gttcatctat 3540
cgtcttagga tgaactttgt ttgtaatctt gtggttatct agataacata actactttga 3600
gaattgttca gtgatattat tgtttactct tgggatccct tcgtagcata ttattgttta 3660
cttcagcgga attgttcagc gatgttggtt ttggttgaaa ctactggtcc acggctcaca 3720
tattctccaa tttcaattgt ccacactggt gcatagatga gaaatgttat gctttcttat 3780
ttagcttcat ttttgtgtgt ctatgttcag attattttcc ttgctgtttc aggtatcctt 3840
tagagctcag gaaagacaag cctgctaaaa ataaggtgat tcctcgaatt caatcatatg 3900
accagccatc tcatgtttat agtttgtatt ttttaggtac ttgatacaca gtctactctg 3960
caggaatcta tttcattagt tcagacataa agactaagca tgtagtatag ggtatgcctg 4020
tttttctttt taataagcat atatgatatg tacatattct catatactat tatactgtta 4080
ttttcttaat aagcatatgc gatctgtcga catattcaca ttgactatta tatagtaata 4140
tttggatcat tgcttaaaag cagtttgtgc ttcttttttt ttttttgggt gccatgtgta 4200
gacaaaggac atcaaaagaa aagaaccatc gaacaatgtg gaagataatc ttggtaatga 4260
ggaggatgag atcatggagg actcagaaac tactggaggg caatccgaag tccatggggt 4320
ttcgaatcac ttgaccaaga gtctaacctt aaataaagct ggctcaggca atagctctga 4380
tattctctct gattcgacaa cacaagaagt actcccagag gtattgcagc tgttctttta 4440
gatgttgaca tttacattct aatgatcttt tgtttctcat tagcatttgc tgcttacagg 4500
cagtgaaacc cggtggtcgg atcaatgcat gcttggctgt agggaaagat acactctatt 4560
tatatggagg aatgatggaa ttgaaagata gagaaattac tcttgatgat atgtattcac 4620
ttaaccttag caaactagat gagtggaagt gtatcatacc ggtcagttgc agattggccc 4680
cttctttttg ccattttgtt gtttaactaa tagtgttgtt tatttcaatc agagataaca 4740
gaaaattatc tgttatcatt tttgactatt cacatttgtt tctgaacttg ccttactcac 4800
ctttcttatg caggcatctg aatctgaatg gctagaaatt tctgaagatg aggatgatga 4860
agatgatgat gatgatgata atgagaatga tagcgaggat gacgctaatc agaccgatga 4920
agatgatgaa gaggtatgca aaattatttt aggtttggtc acacattttt gggatttata 4980
tcttgctaag ttcatgatta atggctgtac tagatagaat ctttctaagt tcgcgtgggc 5040
gaggacttta tatctgattt aggcacgttt acattttctc tacaaattag aacagatttt 5100
caaaaaatgt tttttaagaa aatggggaag ataatgatgt gtcccacaat tctgccacaa 5160
accaaacatc ctgtctctgg cctgtttgtt tctagttgaa tcttggtgtt tgaccaaaat 5220
actgcatgat ggttcatctt ctattacgga tactgtatac gttaatatga agtccatggt 5280
tctcatggca tccttctgag atttatagct attgtgtatc tttcatttcc tctcaatcat 5340
actgtgtggt tattaatctg taatcctaaa actgttttca tagtctgatg aagatgccga 5400
gaagaatgtc gatatgtcca ctgctgtatc gctaataaag ggtgaacgta agaacttgcg 5460
aagaaaagag aagcgtgctc ggatagagca aattcgggtt atgctcggtc tttctgattc 5520
tcaaaggact ccaatggtaa tgttgtaatc aacatttttt ttgttctaaa tttgtttgaa 5580
gttgttccga caaagtacat atactttgtt tactcagagg aactcttggc tgataatttg 5640
ttacacacag ttaacaatta aaaccatata tcactaattc ccatattcac acttttaaag 5700
ccaggagagt cactaaaaga tttctacaag agaacggata tgtactggca gatggctgca 5760
tatgagcaca ctcaacacac tggaaaggtt agtttctgct ccttaagtat cttcacccgt 5820
catacctgtt atcatattct ctaggttgct ggcagtatga gtttgctgta tttattcgtg 5880
ctcatccaat gccaggagct ccgcaaagat ggttttgatc ttgccgaaac tcgatataag 5940
gaactgaaac ccatactcga cgaggtaaaa ttgtcatgtt gtgtcccctt tgagacaaaa 6000
cggtatttct gacttggtac atattaactg actcttacac gccctcttca gctggctgtg 6060
ctcgaggctg aacagaaagc tgaggaagag gctagtgctt ccactagttc caagaaagac 6120
acgaagaaaa gcaagcagaa gagtggcatg agatag 6156
<210> 5
<211> 2237
<212> DNA
<213> 水稻(Oryza sativa)
<400> 5
actaattaca tggaatgcgt gtaaattgtg agatgaatct tttaagtcta attgcgccat 60
gatttgacaa tgtggtgaca gtaaacattt gctaatgacg gattaattag gcttaataaa 120
ttcgtctcgc ggtttacaga cagattctgt aatttatttt attattagac tacgtttaat 180
acttcaaatg tgtgtccgta tatccgatgt gacacgccaa aacttttaca cctcttgata 240
taaacacagc gtagctttct tgctaactcg atattttctt accgtagtca catgtcacgt 300
ctccgatacc atctcaataa ttgcttttga agttattctc taatttaata gtcgaagaag 360
tcgatatacc ctatttattg ttgagggata tgaagtaaat ctgacccata gttgaggtag 420
tcaggactgt aggtttcaaa gtacgattta gcccatttag gttgttgggc caaacgctct 480
tccgttttag aagagacgtg gtcactctgc caaaaatagg aaaaagtaca cccaaggtcc 540
ctcaacttgt catagggata aaaaacgtcc tcaaatcaca aaaccagata tacggggtct 600
attaattata taaaaccggt cattagaggt ccttcggcgg tcttgaaccc ggttttatct 660
gacgtagcgg ctaaatcagt gcgggacccg cgtgggcccc acatgtcagc tggccacgtc 720
atcaaactcc tctctctttt cccctcctct ctctcttcct catctctctc ccttctctgc 780
cgccggcagt gcctcggcgg cgggcatcgg cggtgatggc ggcggcggtg gggggagcgg 840
catacgggct ccccggcggt cgccgtccac gcatagctcc tccccccgca gcttgcgccc 900
accgcctccg gtgcgccgcc gcctccccgt cggcgaggac gcggtgcttg aggttggata 960
tgtcgaagca gcagaggaag catgtaggct tggtgcagct ggggatgtag tcgcggagca 1020
tgtcagcgac gccgcggtag cggcggaacg cgaggagaga gcacgcccgg agctcgagag 1080
cggcctccat gcgcggcgac aactctagca ctgcttccac tagtccaagc gccgtcgtgg 1140
ccgctgagtg gtcgccacat tccccattga gcggcgcggc ggcagcgaaa gcagcccgtg 1200
cctcaatgag gtagtcgccg atgatctgct gcacaatgca gagcaccaac aaatcagttg 1260
ctgaagccaa aggcaaaggc aaatctaaat tgggagcaaa accgcccaaa ttgccatcat 1320
tttttgtggc acattagatg aagaattaaa gagggagcaa gaaacaacct ttcggtgtcc 1380
gcggagccaa atccttctct tgtcaggagg ggaagaagac ggcgaggaca ccgtcgccga 1440
ccaccgccgt tcgtccgcaa gccggctgtc gacgagatgt ccccgcgcag cctcgcacac 1500
ctgcagtgcc tcctcacgct cgggctccat cgccgacgag tggcggctgg gacgggctac 1560
tcgacccgct cgaccagaac ctccgccgcg aggtcctccg ctacggcgac ttcgtgcagg 1620
ccgcatacac agcgttccat tccatgctgt cggcggcggc ggcgtcgcag cacatctcgg 1680
gtgggcgcac cggacgctcg tgctccccga cctgcggcgg cgtcgcagca cagccagaag 1740
ggagagagag atgaggaaga gagagaagag gggaaaagag agaggagttt gatgacgtgg 1800
ccagctgaca tgtggggccc acgtgggtcc cgcgccgact cagccgccac gtcggataaa 1860
accggattca gaaccaccga aggacctcgg tgaccggttt tgtataatta agggaccccg 1920
tatatctggt tttgtggttc gaggatgttt ttttatcccc atgacaagtt gagggacctt 1980
cggtgtactt tttccaatgg agggagtagt atccaacccc agcgtccgac tccgactccc 2040
gcttgcacgc gttcgtcacg gcccgttgag gcccaactaa gtccaagatg ggccgtcggc 2100
ccacggtgcg aacgccggcg ccaccgttgt ccctgctgct ccccacgagg gttttaggca 2160
cgcctccgcc tccgcctccg cctccggcag caagtgagcg cggggagaga gacagaagcc 2220
ggcggcggcg gggcgag 2237
<210> 6
<211> 1318
<212> DNA
<213> 水稻(Oryza sativa)
<400> 6
ctttggttcg aaatatcgaa atagagagtg gtttaggcca atgctttaag ccatctggga 60
ttttttcctt tctgagccat tggtcagctc gatcgattaa tccacacgag gtgcctataa 120
ctacctgtgt aggttggctt aatccatggt taaaattttc caacgtactg tatttgcgat 180
tcctcggaag cattcatgga gagattgtac acagttctat tcatcggaaa ttcgttacta 240
cgtacaagtt ttgagttctc tactgttcag tgttcacagc ctgcgatgta acacggtaca 300
ccctaacagt atctgggtgc ggtgtaaatt cactccatcc gtttcaggtt accagacgtt 360
tatagacaaa ctattttaag tttgactaaa tttatagata aatatagtaa tatttataat 420
actaaattag tgtcatcaaa tcaataatcg aatatatttt cataataaat ttgtcttggg 480
tgaaaaatgt tgctattttt ttctataaac ttgatcaaac ttaaatcaat ttgagtttga 540
cgaaggttaa aacgttttat ggttgaaacg gagggagtac tcttctgtac acatattgtt 600
tttttcttct cgagtatatc gatcttgttc cagaaaaaaa aaaagagtat atcgatcaag 660
ttttctcccc atacgctcgt gctactgttc ttaggtcgaa acgtccatct tgtacctttg 720
tacggtggcc catggcgcca ctgctccagt gatgggtact atactacctg cggtgaggtg 780
atgcaaccgt gatggtggtg agcgggtggt ggtgggcgag caaccgatcg tccagcgaca 840
gctaaccaca ataacgaagc gagtacgcgc ttcacctcac caacagcgag ctcccgtttt 900
gttcgcacga aaagaagctg agctcgcgtg cgtgcggtgc gttttgtgtg cgctgcgatg 960
tgcgatggtt gattgtgtgg tgtgcgagga gaggagtaga aatcaggcgg gggggctttt 1020
aatatcctgc cgtgcccgtc atctgtgtgg gactttgacc acaccattta cttcagctca 1080
tcaaacctca acaaccataa ctgcaccacc tgtccgccca acggcccaac ccctgctggt 1140
cccggttcac cacgcctccg gtgaaccata catgcaacct aatgtgttca tggcacctaa 1200
tgtgccgttg gcaccgaaat tgaacggtac gtgcacccaa ggtgaaggtg aggcaaggtg 1260
ctcatggctg cagagtttag actttagagc aagcagttgt tctcctccgt acgaaatg 1318
Claims (7)
1.基因在调控植物粒型、粒重中的应用,其特征在于:所述基因具有如(a)、(b)、(c)所示的序列:
(a)Seq ID No:1所示的基因组核苷酸序列;
(b)Seq ID No:2所示的cDNA核苷酸序列;
(c)在(a)、(b)所示的核苷酸序列的中添加和/或取代和/或缺失一个或几个核苷酸而生成的可编码具有调控粒型粒重功能的蛋白质的突变基因、等位基因或衍生物;
所述粒型为粒长。
2.根据权利要求1所述的应用,其特征在于:正调控植物粒型、粒重。
3.如权利要求1或2所述基因编码的蛋白在调控植物粒型粒重中的应用,其特征在于,所述蛋白具有如(A)或(B)所示的序列:
(A)Seq ID No:3所示的氨基酸序列;
(B)在(A)所限定的氨基酸序列中添加和/或取代和/或缺失一个或几个氨基酸且具有相同功能的由(A)衍生的蛋白质。
4.一种基因互补载体,其特征在于:所述基因互补载体为pCAMBIA1300载体。
5.一种基因过表达载体,其特征在于:所述基因过表达载体为pCAMBIA1300S载体。
6.一种与水稻粒型粒重基因紧密连锁的分子标记,其特征在于,所述分子标记为P4-1、P4-2、P4-3、P4-4、P4-5、P4-6、P4-7、4-8或P4-9;所述各分子标记对应的引物序列分别为:
P4-1:
F,5’-TGGGTCTTCAAAAAATGTTCAGTGG-3’
R,5’-ACCCCGCCTAAACTCCATGAATC-3’;
P4-2:
F,5’-TATAGATTCATCGTACTAAGGC-3’
R,5’-TGAGATTTATTGTTTTGTGTG-3’;
P4-3:
F,5’-AGAAGTAGTGCAGAGTACAGTC-3’
R,5’-AGTACTCCTATCCTTTAATAATATG-3’;
P4-4:
F,5’-ATACGTAGCGTTTGGTTATAGC-3’
R,5’-TTCGGTTTTGAACTCAACTTC-3’;
P4-5:
F,5’-TGCTTGAAGAGGAGAATGGTGG-3’
R,5’-AGCTCCTGAGTTCCTTGCGTC-3’;
P4-6:
F,5’-TTACATTATCGAATTATGCACGATAC-3’
R,5’-TGATACCCGAACTTCCTGACTG-3’;
P4-7:
F,5’-TGTGGCAGAATTGTGGGACAC-3’
R,5’-ACTTTATATCTGATTTAGGCACGTTTAC-3’;
P4-8:
F,5’-TAGATTGGTTTTTATGAAACG-3’
R,5’-TGCTGTCACAGTTTATCACAC-3’;
P4-9:
F,5’-TCCTACGATTTCTCAATCCTG-3’
R,5’-TGAATTCCTTCAATTTTAGAGC-3’。
7.权利要求6所述的与水稻粒型粒重基因紧密连锁的分子标记在分子标记辅助选择育种中的应用,其特征在于:所述应用为辅助选择与植物粒型粒重相关性状。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111606443.1A CN114214340B (zh) | 2021-12-26 | 水稻粒型粒重相关基因、蛋白、分子标记及应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111606443.1A CN114214340B (zh) | 2021-12-26 | 水稻粒型粒重相关基因、蛋白、分子标记及应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114214340A true CN114214340A (zh) | 2022-03-22 |
CN114214340B CN114214340B (zh) | 2024-06-11 |
Family
ID=
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103882145A (zh) * | 2014-04-15 | 2014-06-25 | 江苏省农业科学院 | 一种鉴定水稻粒长基因qGL3等位基因变异的PCR分子标记方法 |
CN106754967A (zh) * | 2017-01-19 | 2017-05-31 | 南京农业大学 | 一种水稻粒型基因OsLG1及其编码蛋白质和应用 |
CN109575114A (zh) * | 2019-01-30 | 2019-04-05 | 中国水稻研究所 | 一种水稻粒形粒重相关基因、蛋白、分子标记及应用 |
US20210180078A1 (en) * | 2017-11-29 | 2021-06-17 | The University Of Hong Kong | Transgenic rice plants overexpressing acyl-coa-binding protein2 show enhanced grain size |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103882145A (zh) * | 2014-04-15 | 2014-06-25 | 江苏省农业科学院 | 一种鉴定水稻粒长基因qGL3等位基因变异的PCR分子标记方法 |
CN106754967A (zh) * | 2017-01-19 | 2017-05-31 | 南京农业大学 | 一种水稻粒型基因OsLG1及其编码蛋白质和应用 |
US20210180078A1 (en) * | 2017-11-29 | 2021-06-17 | The University Of Hong Kong | Transgenic rice plants overexpressing acyl-coa-binding protein2 show enhanced grain size |
CN109575114A (zh) * | 2019-01-30 | 2019-04-05 | 中国水稻研究所 | 一种水稻粒形粒重相关基因、蛋白、分子标记及应用 |
Non-Patent Citations (3)
Title |
---|
GENBANK: "CM000129.1", NCBI * |
GENBANK: "EEC77505.1", NCBI * |
XIAOJUN ZHANG等: "Rare allele of OsPPKL1 associated with grain length causes extra-large grain and a significant yield increase in rice", PNAS, vol. 109, no. 52, pages 21534 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sun et al. | OsGRF4 controls grain shape, panicle length and seed shattering in rice | |
CN108239647B (zh) | 一种控制油菜株型的基因、分子标记及应用 | |
US6313375B1 (en) | Maize aquaporins and uses thereof | |
US20110010799A1 (en) | Floral Transition Genes in Maize and Uses Thereof | |
CN101627125A (zh) | 具有增强的产量相关性状的植物及其制备方法 | |
CN108822194B (zh) | 一个植物淀粉合成相关蛋白OsFLO10及其编码基因与应用 | |
CN109575114B (zh) | 一种水稻粒形粒重相关基因、蛋白、分子标记及应用 | |
CN113874388A (zh) | 孤雌生殖基因 | |
US8716553B2 (en) | NAC transcriptional activators involved in abiotic stress tolerance | |
CN109721649B (zh) | 一种水稻株型调控相关基因、蛋白质与应用 | |
US7754945B2 (en) | Generation of plants with improved drought tolerance | |
EP1685242B1 (en) | Generation of plants with improved drought tolerance | |
WO2023221826A1 (zh) | 玉米穗粒重和产量调控基因 KWE2 、其编码蛋白、InDel1标记、表达载体及其在植物性状改良中的应用 | |
CN109797158B (zh) | 基因OsNTL3在改良水稻高温抗性方面的应用及获得的水稻高温抗性基因 | |
CN110484555B (zh) | 具有多籽粒簇生性状的转基因水稻的构建方法 | |
CN110777150B (zh) | 蛋白GmPLATZ在调控植物种子产量中的应用 | |
CN111826391A (zh) | 一种nhx2-gcd1双基因或其蛋白的应用 | |
CN114214340B (zh) | 水稻粒型粒重相关基因、蛋白、分子标记及应用 | |
CN111304219B (zh) | 一种分离自水稻wz1中的gl1基因及其在增加水稻粒长中的应用 | |
CN114214340A (zh) | 水稻粒型粒重相关基因、蛋白、分子标记及应用 | |
CN109112137B (zh) | 一种控制水稻谷粒大小和粒重的基因sng1及其应用 | |
CN113929756A (zh) | Gl11蛋白和编码gl11蛋白的基因在调控水稻粒形和千粒重中的应用 | |
CN114149995A (zh) | 水稻粒型相关基因drog1及其用途 | |
CN112745376A (zh) | 一种转录抑制因子lip1调控水稻产量的功能及应用 | |
CN110862441B (zh) | 拟南芥ppd1和ppd2基因在调控种子大小中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |