CN114574500A - Clone and application of rice sword-leaf sheath and panicle whitening trait gene OsWSSP - Google Patents
Clone and application of rice sword-leaf sheath and panicle whitening trait gene OsWSSP Download PDFInfo
- Publication number
- CN114574500A CN114574500A CN202210280493.3A CN202210280493A CN114574500A CN 114574500 A CN114574500 A CN 114574500A CN 202210280493 A CN202210280493 A CN 202210280493A CN 114574500 A CN114574500 A CN 114574500A
- Authority
- CN
- China
- Prior art keywords
- seq
- sequence
- rice
- primer
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 61
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 48
- 235000009566 rice Nutrition 0.000 title claims abstract description 47
- 230000002087 whitening effect Effects 0.000 title description 4
- 240000007594 Oryza sativa Species 0.000 title description 2
- 241000209094 Oryza Species 0.000 claims abstract description 46
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 16
- 108020004414 DNA Proteins 0.000 claims description 28
- 238000011144 upstream manufacturing Methods 0.000 claims description 18
- 238000000034 method Methods 0.000 claims description 12
- 150000007523 nucleic acids Chemical group 0.000 claims description 12
- 210000003763 chloroplast Anatomy 0.000 claims description 9
- 238000011161 development Methods 0.000 claims description 8
- 108020004707 nucleic acids Proteins 0.000 claims description 8
- 102000039446 nucleic acids Human genes 0.000 claims description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 4
- 238000009395 breeding Methods 0.000 claims description 4
- 230000001488 breeding effect Effects 0.000 claims description 4
- 238000012408 PCR amplification Methods 0.000 claims description 3
- 229930002875 chlorophyll Natural products 0.000 claims description 3
- 235000019804 chlorophyll Nutrition 0.000 claims description 3
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 claims description 3
- 230000001276 controlling effect Effects 0.000 claims description 3
- 230000001105 regulatory effect Effects 0.000 claims description 3
- 241001261506 Undaria pinnatifida Species 0.000 claims description 2
- 230000015572 biosynthetic process Effects 0.000 claims description 2
- 238000003786 synthesis reaction Methods 0.000 claims description 2
- 230000035772 mutation Effects 0.000 abstract description 9
- 238000012216 screening Methods 0.000 abstract description 7
- 239000003147 molecular marker Substances 0.000 abstract description 5
- 230000014509 gene expression Effects 0.000 abstract description 3
- 239000003550 marker Substances 0.000 abstract description 3
- 238000010367 cloning Methods 0.000 abstract description 2
- 230000018109 developmental process Effects 0.000 description 7
- 238000011160 research Methods 0.000 description 6
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 230000000243 photosynthetic effect Effects 0.000 description 5
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 4
- 206010039509 Scab Diseases 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000009456 molecular mechanism Effects 0.000 description 3
- 206010001557 Albinism Diseases 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 241000894007 species Species 0.000 description 2
- JBFQOLHAGBKPTP-NZATWWQASA-N (2s)-2-[[(2s)-4-carboxy-2-[[3-carboxy-2-[[(2s)-2,6-diaminohexanoyl]amino]propanoyl]amino]butanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)C(CC(O)=O)NC(=O)[C@@H](N)CCCCN JBFQOLHAGBKPTP-NZATWWQASA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- CKIBTNMWVMKAHB-RWGOJESNSA-N Ala-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 CKIBTNMWVMKAHB-RWGOJESNSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- YFKWIIRWHGKSQQ-WFBYXXMGSA-N Cys-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N YFKWIIRWHGKSQQ-WFBYXXMGSA-N 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 241000209117 Panicum Species 0.000 description 1
- 235000006443 Panicum miliaceum subsp. miliaceum Nutrition 0.000 description 1
- 235000009037 Panicum miliaceum subsp. ruderale Nutrition 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- HRKOLWXWQSDMSK-XIRDDKMYSA-N Trp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HRKOLWXWQSDMSK-XIRDDKMYSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000009402 cross-breeding Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000009025 developmental regulation Effects 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6858—Allele-specific amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Botany (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Mycology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The application relates to cloning and application of a rice scabbard and panicle albino trait gene OsWSSP. The application discovers for the first time that the expression quantity of the gene coding protein is reduced due to the inverted mutation of the OsWSSP promoter, so that the albino phenotype appears on the rice scabbard and the ear. Meanwhile, the application also develops and verifies a molecular marker tightly linked with the target gene OsWSSP so as to be used as a screening marker of different rice lines and hybrids derived from the rice lines.
Description
Technical Field
The application relates to cloning and application of rice scabbard and panicle albino character gene OsWSSP, and belongs to the technical field of molecular biology.
Background
Rice is one of the important food crops in China. The rice photosynthetic efficiency is related to the rice yield, and the photosynthetic efficiency is closely related to the chloroplast development and the photosynthetic pigment content. Therefore, the deep analysis of the molecular mechanism of chloroplast development in the green tissues of rice has important significance for improving the photosynthetic efficiency of rice. Related researches in rice mainly focus on the development and control of chloroplast of leaves, and the development of chloroplast of panicle parts and sword-leaved sheaths and photosynthetic efficiency of chloroplast are deficient in research on the influence of rice yield.
At present, related research in rice mainly focuses on the chloroplast development regulation of leaves, people utilize leaf color mutation as a screening marker of sterile lines and pure varieties of two-line hybrid rice, but light green and light yellow leaves are difficult to distinguish, so that the efficiency of using the leaf color mutation as a screening tool is very low. Therefore, if the related genes influencing the whitening of the rice panicle and the scab of the sword-shaped leaf can be found, the molecular mechanism research of the development of rice chloroplast is facilitated, and the application research of the cob and the scab whitening regulation gene in rice crossbreeding is facilitated to be expanded.
In addition, in variety identification, although the field planting identification has the advantages of wide application range, reliable identification result and the like, the method has long time, low efficiency, large workload and lagged result. Generally speaking, the target characters to be identified are mostly quantitative characters, which are greatly influenced by the environment, and people with certain field inspection experience are needed to ensure the detection accuracy, while the DNA fingerprint spectrum technology has the advantages of high efficiency, accuracy, simple experimental operation, no influence by the environment and the like, and is widely applied to the authenticity identification and purity detection research of plant varieties.
Mature and widely used DNA fingerprinting techniques including RAPD, ISSR, InDel, etc. are known, each of which has advantages. InDel refers to the phenomenon that insertion or deletion of nucleotide fragments with different sizes occurs in sequences of the same site of genome between related species or different individuals of the same species, and gaps (gap) are generated by homologous sequence alignment. The InDel molecular marker has the characteristics of rich quantity, high polymorphism, high precision, short detection time, mature technology and the like, and has wide application prospect in the aspects of plant variety identification and purity detection. However, InDel production is mainly related to genomic features and DNA replication errors, and the mechanism of generation of most insertion/deletion mutations is still unknown at present. Since the ratio of insertions and deletions increases due to sequencing errors and the true InDel polymorphism rate also increases in these sequencing error regions, the challenge is presented to obtain effective molecular marker primers for tens of thousands of insertions and deletions in the enormous genome sequencing data, which locations are operational problems, which locations are sequencing errors, which locations are true InDel variations, and so on.
Disclosure of Invention
The application discovers a gene OsWSSP for controlling the panicle part of rice to show an obvious albino phenotype in the booting stage, the flowering stage and the seed filling stage for the first time, and a protein coded by CDS of the gene, and further determines a molecular mechanism of the gene or the protein for regulating the phenotype.
The inventor finds that the promoter of the gene has inversion mutation, and the mutation causes the expression level of target protein to be reduced, thereby causing the scab of the tassel during the booting stage of the rice and the albino phenotype of the tassel during the flowering stage and the filling stage of seeds.
Meanwhile, the optimal identifying primer is screened out to improve the efficiency of molecular assisted breeding. Specifically, the method comprises the following steps:
the application provides an OsWSSP gene, the nucleic acid sequence of which is shown in SEQ ID No.1 or SEQ ID No. 2.
Wherein, SEQ ID.NO.1 is the complete sequence of the genomic DNA of the OsWSSP gene; SEQ ID No.2 is the genomic DNA complete sequence of the mutated OsWSSP gene.
The application also provides a promoter, and the nucleic acid sequence of the promoter is shown in SEQ ID No.5 or SEQ ID No. 6.
Wherein, SEQ ID.NO.5 is a promoter sequence in the genomic DNA of the OsWSSP gene; SEQ ID No.6 is the promoter sequence in the genomic DNA of the mutated OsWSSP gene.
The difference of the promoters causes the corresponding protein expression quantity of the OsWSSP gene to be different.
The application also provides the application of the gene or the protein with the sequence shown in SEQ ID No.4 or the coding nucleic acid thereof or the promoter in identifying rice varieties or rice breeding.
The application also provides application of the gene or the protein with the sequence shown in SEQ ID No.4 or the coding nucleic acid thereof or the promoter in regulating and controlling chloroplast development and/or chlorophyll synthesis of rice scabbard and spike.
In one embodiment, the sequence of the nucleic acid encoding the above protein is as shown in SEQ id No. 3.
Wherein, SEQ ID No.3 is CDS sequence in OsWSSP gene genome DNA, and its coded protein sequence is shown in SEQ ID No. 4.
In some embodiments, the protein is expressed in rice in which albino is present in the scabbard and ear of the rice, in a lower amount than in normal rice.
The present application further provides a primer set, comprising at least the following 2 primer pairs:
1, the sequence of an upstream primer is shown as SEQ ID.NO.7, and the sequence of a downstream primer is shown as SEQ ID.NO. 8;
in the 2 nd pair of primers, the sequence of the upstream primer is shown in SEQ ID.NO.9, and the sequence of the downstream primer is shown in SEQ ID.NO. 10.
The application also provides a primer group, which at least comprises the following 4 primer pairs:
1, the sequence of an upstream primer is shown as SEQ ID.NO.11, and the sequence of a downstream primer is shown as SEQ ID.NO. 12;
in the 2 nd pair of primers, the sequence of the upstream primer is shown in SEQ ID.NO.13, and the sequence of the downstream primer is shown in SEQ ID.NO. 14;
the 3 rd pair of primers has an upstream primer sequence shown in SEQ ID No.15 and a downstream primer sequence shown in SEQ ID No. 16;
in the 4 th pair of primers, the sequence of the upstream primer is shown in SEQ ID.NO.17, and the sequence of the downstream primer is shown in SEQ ID.NO. 18.
The present application provides a kit comprising the above-described gene, and/or the above-described promoter, and/or the above-described primer set.
The application provides a method for identifying rice varieties, which comprises the step of identifying the genes in a rice genome, and/or the promoters, and/or the proteins with the sequences shown in SEQ ID No.4 or the coding nucleic acids thereof.
The application provides a method for identifying rice varieties, which comprises the following steps: (1) extracting the genome DNA of the rice sample; (2) carrying out PCR amplification on the genome DNA by using the primer group; (3) analyzing the size of the amplified product.
The application also provides the application of the gene, the protein or the coding nucleic acid thereof, the promoter, the primer group, the kit or the method in identifying rice varieties or rice breeding.
In some embodiments, the rice comprises wakame, nippon, a light-or temperature-sensitive sterile line, and/or hybrids derived therefrom.
In some embodiments, the rice has a sheath and ear that is normally non-albino, or has a albino phenotype.
In the present application, a primer set or a primer pair may refer to the same concept.
Drawings
FIG. 1 comparison of the phenotypes of conventional oryza sativa HHZ (wild type) and the near isogenic line NIL (mutant).
FIG. 2 Fine localization of OsWSSP gene of interest.
FIG. 3 is a PCR verification of the OsWSSP inversion mutation breakpoint.
FIG. 4 overexpression of the WSSP gene restores the cob albino phenotype of the mutant oswssp.
FIG. 5 polymorphism detection of closely linked markers in the target region.
FIG. 6 OsWSSP flanking markers of target genes assist in identifying heterozygous individuals of BC3F 1.
Detailed Description
The present application is further illustrated by the following examples, but any examples or combination thereof should not be construed as limiting the scope or implementation of the present invention. The experimental procedures used in the following examples are all conventional procedures unless otherwise specified. Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
Example 1 identification of Rice cob and Sword sheath albino phenotype
A mutant showing a significantly albino phenotype in panicle stage, flowering stage and seed filling stage is selected from natural population materials of rice and named oswssp (white Sword leaf shear and Panicum). To determine the genetic basis of this trait, we constructed F by crossing oswssp with the albino normal variety Huanghuazhan (HHZ)2Isolating the population.
Further, the target gene OsWSSP was initially located within a physical interval of about 1.61Mb in the short arm of chromosome 2 by BSA-seq analysis. Near Isogenic Lines (NILs) with HHZ background were selected using parental deep resequencing (>30X), developed in this interval and using InDels markers to assist backcrossing for 5 generations.
Compared with the conventional cultivar Huanghuazhan (HHZ), the near isogenic line NIL is mainly characterized by comprising the following components in percentage by weight: the scabbard at the booting stage and the ear at the flowering and seed filling stages showed a significantly albino phenotype (fig. 1), while the chlorophyll content of these two sites was reduced (fig. 1).
Example 2 localization and albinism mechanism of cob and scabbard albinism Gene OsWSSP
After the target gene was localized to a physical region of 1.61Mb using BSA-seq, the region was further encrypted with a molecular marker, and F was used2The target gene OsWSSP is finely positioned, and finally the target gene is determined in a 146kb physical interval (figure 2).
Using IGV (Integrated Genomics viewer) software, visually comparing Paired-end reads sequences of candidate genes with parents covering the interval, and searching potential sense mutation sites in the candidate genes so as to search potential candidate genes. As a result, it was found that: one candidate gene promoter region has inverted variation.
To confirm this conclusion, the inventors designed 2 pairs of identifying primers for this inversion site (table 1):
the 1 st pair of primers (identification 1), the sequence of the upstream primer is shown as SEQ ID.NO.7 (UP _ F), and the sequence of the downstream primer is shown as SEQ ID.NO.8 (UP _ R);
in the 2 nd primer pair (identification 2), the sequence of the upstream primer is shown as SEQ ID.NO.9 (Down _ F), and the sequence of the downstream primer is shown as SEQ ID.NO.10 (Down _ R).
The authenticity of the inversions was finally confirmed by PCR amplification (FIG. 3).
Furthermore, a binary overexpression vector proUBI of a CDS sequence of a wild-type Nipponbare WSSP gene driven by a maize Ubiquitin promoter is constructed, WSSP is genetically transformed by taking a cob albino mutant ospsp as a receptor, and the cob albino phenotype of positive transgenic materials (com _ #1, com _ #2 and com _ #3) is restored to the wild-type phenotype (as shown in figure 4). As can be seen, these results all substantiate that the mutation of the promoter of the candidate gene of interest is a direct cause of oswssp encapsium and panicle albino phenotype.
TABLE 1 primer sequences
Example 3 development and application of marker closely linked with target gene OsWSSP
Using the data of the parental sequencing, InDel molecular marker primers linked to the target gene were designed in all regions of the target region, and after a large number of preliminary screens, 19 pairs of primer sets were selected, and polymorphisms between wild type HHZ and mutant oswssp could be resolved by 2.5% agarose horizontal gel using these primers (FIG. 5). It can be seen that in the 19 primer pairs (table 1) of the present application, there are still some bands amplified by the primers with defects such as deletion, unclear or indistinguishable (e.g. 930269, 938043, 941198, etc.), and therefore, the inventors further screened and finally confirmed through experiments that 4 pairs of InDel labeled primers with the best specificity and the strongest stability (fig. 6, table 1):
the 1 st primer (screening 4), the sequence of the upstream primer is shown as SEQ ID.NO.11 (827382_ F), and the sequence of the downstream primer is shown as SEQ ID.NO.12 (827382_ R);
the 2 nd primer (screening 6), the sequence of the upstream primer is shown as SEQ ID.NO.13 (894514_ F), and the sequence of the downstream primer is shown as SEQ ID.NO.14 (894514_ R);
the 3 rd pair of primers (screening 12), wherein the sequence of the upstream primer is shown as SEQ ID No.15 (973145_ F), and the sequence of the downstream primer is shown as SEQ ID No.16 (973145_ R);
in the 4 th primer pair (screening 13), the sequence of the upstream primer is shown as SEQ ID.NO.17 (998331_ F), and the sequence of the downstream primer is shown as SEQ ID.NO.18 (998331_ R).
These labeled primers can aid in the selection of BC3F1For example, the 4 th, 6 th, 8 th, 17 th, 19 th, 21 th in FIG. 6, and used for the creation of a rice near isogenic line.
In summary, the applicant has endeavored to describe the idea and proof of the effect of the invention. The scope of the invention is defined by the appended claims, and it will be apparent to those skilled in the art from this description and general knowledge in the art that the scope of the invention is defined by the claims. Those skilled in the art can make any modification or change to the technical solution of the present invention without departing from the spirit and scope of the present invention, and such modifications and changes are also included in the scope of the present invention.
SEQUENCE LISTING
<110> Shenzhen agricultural genome institute of Chinese agricultural science institute
<120> clone and application of rice Jianyebao and panicle albino character gene OsWSSP
<130> 20220310
<160> 18
<170> PatentIn version 3.3
<210> 1
<211> 4677
<212> DNA
<213> Artificial sequence
<400> 1
tttggcgaga agtgggtcca gtctgtagga ttatatcttg ggccgtgcca ggacagggca 60
gcaatcgact gcagcaacag taaaaccgct tgtcaaacaa ccccaataga agttatagcc 120
gcttattaca aaattctttg ataaactaaa tgcaaatgat aaaaaaaagt atgatgaaaa 180
gatagacgcc ggaaaaaaac atattcggca actaataatt ttactgctaa atcttcatct 240
ataggttgat aaaaatttat aaaaccgttg tctttaaatg caaaatgtag gagcttatta 300
tcttcatcat attttagttg gacccaaaat cgaaaacatt aactagtatt gcctgcatat 360
agaatttagt aagagctcta taaaaaaaat gtaccattgc tactaagccg aaatactcaa 420
cgtaatttct agtaaggtta ggctcggtag aaccatatgt gtaatgatat aatatctaga 480
gtatatatat tgcacatgtc gcgacgttga tgactggcta tgtgagctta atctcaatta 540
atttccaaga aaaaaaatac ctagagtata tatattgcac atgttgcgac gttgataacg 600
ggctatgcga gccttatctt aattaatgtc caagaaaaaa ccgacctagt aattatcgtc 660
ggtgacaaga agtttgcttg tgtcaaaact attgggccag gcatgggtta cccgttggta 720
aaaattgatt cccaattttt gcccactgcg ctttcggttt cggacggacg tgtccacatg 780
aataccccgg acacatgggt aaaattgcca tccctacata catacataca tacatatata 840
tatatacata catatacata tatatacata tgtatatata catatatata tatatatata 900
tatatatata tatatatata tatatatata gacacacaca tcaacaacaa caaaaaccct 960
aacctaatac atatccctcc atccgcttca aggcggcgac ggctggcagc tggctctgag 1020
ggcaacgcgg ccgagcatgg cgccgaggcg gactcaggca gtgagcggcg agcagactcc 1080
acgtcgtcac caacttcctt cctcctggtc gccgagtccg gcccagaggc cgccgtcgag 1140
ctccgcttcg cttcctcgcg tcctggtcca aggccaaggc cctcctcctc caccatcgcc 1200
cggcagctgg tgacacggcg gcgcacggca ggcaagccca gtgagtagca agcacgcatc 1260
gtgatgtgac attctagctt agcttccgtc gtcttcttct tcggctgtgt ttagatttaa 1320
agtttggatc caaacttcag tcttttttat cacatcaact tatcatacac acataatttt 1380
tcatttttca gtcacatcat ctccaatttc aaccaaaatt caaactttgc gctaaactaa 1440
acacagcctt cgtcaggtca tcgccgatca gttaatactc ctactaggct actactccca 1500
gctgaattag ctgcacaatt aaatatcgga gaagacgacg agggtgactt gttgggccta 1560
agtgggccgt gtgccgaaca aatggcccaa gtctaggaca atattttttg ggccgtgata 1620
cgacattttg ccacgagggc ccaatatgga tattttttgc attttgctcc ctgatttgaa 1680
ctgtatttct gcattttttg acggcatctc atccaggagc ccaggtcggc gccgccgcac 1740
acgcgtggcg gcctcgccgg cgactcggct gctcagagcg gcgcgaagaa aggccgcgct 1800
ggagtctgca gccgctgcgt gtggcagagt agagaccagc cagggagagc gcagctgtca 1860
gcaacctata ccagccagtg agcttgctcg cgccgttcgc cggggcgggg gaagagaacc 1920
atctggcgac aaagcacagg tgaaatccag tctctgccaa gcaatctcta attcccccca 1980
tggttcccta attgtgtgtg tatttcccat cattagtgct gttctaattc atcatttgtg 2040
ctgtttgttc tcaagcgtat tgctgagcga atagaatgcc caaagtgctt ctaatttgta 2100
tgcataggaa ttcttgacct gtcaagttac acactgcatg agttgatatg ggaacgtgcg 2160
gatagaaaaa ttcgtatttt ttggcgaaat ttcatgaaac ttgattgcat gttttgttct 2220
ctgcgtttca ttttggaatt tggggttcaa ctgcagttga aacgagtcat ctcaaattcg 2280
ttctgaacaa ttcccttcac gctgttcaaa cgaaagcaaa gagcctctct tataaacatg 2340
tttgcaatcc ggtggggatg atggcaattt caaagcttcc gaaacctgca aggcaaaagt 2400
aaaaaatttc acattggaaa gctggcttga tttgaaagcc ctataatttt ttttagtata 2460
aaatctctgt ttcgaagtca aacaattctg atcctgagta ggaatcactg atttgctaag 2520
ctgaaatctg aatttgtttt tgcactgaaa ttttatttgg aaattgctca aaaagaaagt 2580
ccagtacatg tttagaaaat tcctgatttc aggaaacttt taagccgaaa acctgagttg 2640
gaagttgttc ttgggaattt accaattagt gtttttttgt aattataaat acaaatagta 2700
aggacaacaa aatcctcccg ctactttgtg acctaactgt tggtttgcat attagaaaca 2760
cccctcgtgc taaatccagt agtttatgtt gtctgttagt ttttgaggca aacttctata 2820
tttgtgcagg tggctgtgat gggaacataa gagaggctct tatgaacttc taattcatac 2880
tagtagtata tttatgctta ttatacatgc tgacttgcaa ccttttgctt tatcatgcta 2940
ctatctggaa tcaatattgt tctaagtaac aacattttgc ctaatcttaa aacaggaagc 3000
atgtcactgg ttaaccggcc gctccctgca ctgtatggaa tttgcacggc ctcggcgaag 3060
accagagctt ggtggctctg cagagatggc aatctcccat caacaagtcg aatttcttgc 3120
acagagccat cgaatggggg ttctgtcatg gagttggagg taatggaccg caatgagcag 3180
acataccatg agaattcttc tgcttctgag gatgaggacg atgatgagga agaggctgta 3240
gagtggagca aagatgagct tgatgccatc tcggcactct ttgaccggcc gatgcgccag 3300
aagccaccaa agccgccaaa cccggtgagg cagcgacctc tcccgctgcc actgcctcac 3360
aagacacgac tgcccaatgc tcctgcacca aagcagcata tcaggctggc agcaagggct 3420
gcactttctt cacgctcttc attcagtgac caggtgtgca agaacccgga ggtcctcatt 3480
gggattgccc gggagattgc agcgctgcca ccggagtctg acgtgtccat tgtgctcgat 3540
cggtgggttc ggttcctccg gaaaggatcc ttgtcaatga ccattcggga gcttgggcac 3600
atggggctcc cagagagggc attgcagaca ctttgctggg ctcagaggca gactgtcgtg 3660
ccactcttcc ccgacgaccg gattctcgct tcgacaattg aggtcttggc acgcttcgac 3720
cagctgaaga tggaggatgc gctagagcag tgcgtccctt cagcgagccg tgctgttctt 3780
gaagccatgg tcagtgggtt catcagggca ggtaaagtag gccttgcccg aaaactcctc 3840
gagttcgcca cgatcaacaa gaggacactg agcccgagcg tccacgtgaa gctgatgctg 3900
gaggccgtcc gtacgcctga gggctacggg ctcgccgcgg cgctgctgga cgagctcggc 3960
gagaggccgg agctgcacgt tcggcagcag gactgcacgg ccgtgatgaa ggtgtgcgtg 4020
aagctccggc ggtacgccgc cgtggagagc ctgttcggct ggttcaggga caccggcggg 4080
aggcccacgg tggtgatgta cacggcggtg atccacagcc ggtgccgcga cgggcggcac 4140
cgggaggcgc tgtccctggc gtgggagatg gagcggcacg ccggcggcct cctcgacctg 4200
ccggcgtacc gggtgctggt gaagctgtgc gtggcgttgc gcgaccacga gaggggcgtc 4260
cggtacttgg cgaggatgaa ggacgccggg ttcgtcccga ccggcgacat gtacggcggc 4320
ctgatcggag gctacgcggc ggaggggagg atgggcaggt gccggaggct gatcagggag 4380
gcggagttgg ccggcgtgaa gctggagagg aggctgcttt cgcgcttgtc tgagatgggg 4440
gttgagcatt ctcagctctg aaagaaagaa actgatcttg ctgttggcat ctgactagca 4500
tggacattgc acagaaattg aactgtaaaa ttaggctctg cgtaattgcc tttaacctgt 4560
agtaacagaa cattacatta tgaatatatt tgttttgttg ttggcttgtt atcctctatt 4620
acactgtcca aaaaaaaatc tagctacgaa gtttgtagtt agaattgagt ttttttt 4677
<210> 2
<211> 4677
<212> DNA
<213> Artificial sequence
<400> 2
tgccatcgat agcttagctc atcccctttc tcttgatcgg tcggccaaag tttgacatgc 60
ttgcagttgc agctaacgtc tctgcatttt ctttcttgtt tctcagatta atttcaggct 120
gcttaattat tctgtaagaa cgaggtgctc tttgtgctag ctagctatgg ctcagctaca 180
ccctcgttta tcataaacat acacgtttct caaactgtaa gctataaaca gtgtgtttcg 240
tacaaaatct ttttatacta tataaaagaa tttaatggtg atggtgaatc taccatcatt 300
tagagttggt gagatcttct aatgctacat aaatacatct ttgtgtgtgg tttggttttc 360
tattggatgt ttgattgcaa ttatgtttga caaatacaat ttctaaatca agaaattcac 420
ataaatcttt taaaatccgt ccgttgcaga gcacgttact ttgctagtct ttaaaatttt 480
aaataaattt agttttcaag tttataatag gtaatactga gtttatcatg tactattcac 540
ctgttccggt tttcatgcca ctaactagct cagtcaaact cctttatcga acacagccca 600
gccagaatgc acccttgtct ttttacctag tgctataaga ggaaccaatt caagatgatg 660
atgattgatt taccttgtag cattactacg cagttcaaat tactaacctt tcagattagg 720
tacggtctga atatgtgatg gaagatcgaa gcagctgtat ctcataaaat aaattacaga 780
ttccgtctgg aaactgcaag acgaatttat taagcctaat taatctatta ttagcaaata 840
tttattgtag caccaaattg tcaaatcatg gcgcaattag gcttaaaaga tttgtctcgt 900
aatttcaagg cgaaatgttt aattgttttt tttctacatt tagtactcta tgcatgtgtc 960
caaacatcct atgtaacagc gtgaaaaatt ttgtttggga actaaacaga ccctgattgt 1020
gatgcaaccc ataagtaatt gcaaaactag ctagctatct acagcatcat cagcatatat 1080
cttggtggtc taaacttctc aggaaacaaa agatgagaaa ttgttctgtt cttagtccac 1140
gctgatgata acgataggtg tgctgtggtg ggcacaagct tttttctttc tttttttttt 1200
tgaggggaaa gttgcttaag atgaagtgac ataaacaaga ttaattcgag gaaaatgtgc 1260
gcatctcaga tgtgtactcc atttatacaa aacccagcaa attaagaagg tacaattatc 1320
tgacagacta gcgaacaaac tgaaaattat agtcgtagta ctacaaattt gcagtttgtg 1380
gctgagggga atctatccca tctggtatat gctgtcactt tgtgctagta ctgtactatg 1440
tactagcctg tagcctctac ttttgtaatt gaatgagaat atttgaatgt tttacaccac 1500
ataatggaac agctcaaatc tcataagatc ttggagagat attcttgatt actctgatta 1560
atgggtcatg tgttgatgca tatatatata tatatatata tatatatata tatatatata 1620
tatatatata tatatatatg ttaatgaatc aaaacgctgc attttgctcc ctgatttgaa 1680
ctgtatttct gcattttttg acggcatctc atccaggagc ccaggtcggc gccgccgcac 1740
acgcgtggcg gcctcgccgg cgactcggct gctcagagcg gcgcgaagaa aggccgcgct 1800
ggagtctgca gccgctgcgt gtggcagagt agagaccagc cagggagagc gcagctgtca 1860
gcaacctata ccagccagtg agcttgctcg cgccgttcgc cggggcgggg gaagagaacc 1920
atctggcgac aaagcacagg tgaaatccag tctctgccaa gcaatctcta attcccccca 1980
tggttcccta attgtgtgtg tatttcccat cattagtgct gttctaattc atcatttgtg 2040
ctgtttgttc tcaagcgtat tgctgagcga atagaatgcc caaagtgctt ctaatttgta 2100
tgcataggaa ttcttgacct gtcaagttac acactgcatg agttgatatg ggaacgtgcg 2160
gatagaaaaa ttcgtatttt ttggcgaaat ttcatgaaac ttgattgcat gttttgttct 2220
ctgcgtttca ttttggaatt tggggttcaa ctgcagttga aacgagtcat ctcaaattcg 2280
ttctgaacaa ttcccttcac gctgttcaaa cgaaagcaaa gagcctctct tataaacatg 2340
tttgcaatcc ggtggggatg atggcaattt caaagcttcc gaaacctgca aggcaaaagt 2400
aaaaaatttc acattggaaa gctggcttga tttgaaagcc ctataatttt ttttagtata 2460
aaatctctgt ttcgaagtca aacaattctg atcctgagta ggaatcactg atttgctaag 2520
ctgaaatctg aatttgtttt tgcactgaaa ttttatttgg aaattgctca aaaagaaagt 2580
ccagtacatg tttagaaaat tcctgatttc aggaaacttt taagccgaaa acctgagttg 2640
gaagttgttc ttgggaattt accaattagt gtttttttgt aattataaat acaaatagta 2700
aggacaacaa aatcctcccg ctactttgtg acctaactgt tggtttgcat attagaaaca 2760
cccctcgtgc taaatccagt agtttatgtt gtctgttagt ttttgaggca aacttctata 2820
tttgtgcagg tggctgtgat gggaacataa gagaggctct tatgaacttc taattcatac 2880
tagtagtata tttatgctta ttatacatgc tgacttgcaa ccttttgctt tatcatgcta 2940
ctatctggaa tcaatattgt tctaagtaac aacattttgc ctaatcttaa aacaggaagc 3000
atgtcactgg ttaaccggcc gctccctgca ctgtatggaa tttgcacggc ctcggcgaag 3060
accagagctt ggtggctctg cagagatggc aatctcccat caacaagtcg aatttcttgc 3120
acagagccat cgaatggggg ttctgtcatg gagttggagg taatggaccg caatgagcag 3180
acataccatg agaattcttc tgcttctgag gatgaggacg atgatgagga agaggctgta 3240
gagtggagca aagatgagct tgatgccatc tcggcactct ttgaccggcc gatgcgccag 3300
aagccaccaa agccgccaaa cccggtgagg cagcgacctc tcccgctgcc actgcctcac 3360
aagacacgac tgcccaatgc tcctgcacca aagcagcata tcaggctggc agcaagggct 3420
gcactttctt cacgctcttc attcagtgac caggtgtgca agaacccgga ggtcctcatt 3480
gggattgccc gggagattgc agcgctgcca ccggagtctg acgtgtccat tgtgctcgat 3540
cggtgggttc ggttcctccg gaaaggatcc ttgtcaatga ccattcggga gcttgggcac 3600
atggggctcc cagagagggc attgcagaca ctttgctggg ctcagaggca gactgtcgtg 3660
ccactcttcc ccgacgaccg gattctcgct tcgacaattg aggtcttggc acgcttcgac 3720
cagctgaaga tggaggatgc gctagagcag tgcgtccctt cagcgagccg tgctgttctt 3780
gaagccatgg tcagtgggtt catcagggca ggtaaagtag gccttgcccg aaaactcctc 3840
gagttcgcca cgatcaacaa gaggacactg agcccgagcg tccacgtgaa gctgatgctg 3900
gaggccgtcc gtacgcctga gggctacggg ctcgccgcgg cgctgctgga cgagctcggc 3960
gagaggccgg agctgcacgt tcggcagcag gactgcacgg ccgtgatgaa ggtgtgcgtg 4020
aagctccggc ggtacgccgc cgtggagagc ctgttcggct ggttcaggga caccggcggg 4080
aggcccacgg tggtgatgta cacggcggtg atccacagcc ggtgccgcga cgggcggcac 4140
cgggaggcgc tgtccctggc gtgggagatg gagcggcacg ccggcggcct cctcgacctg 4200
ccggcgtacc gggtgctggt gaagctgtgc gtggcgttgc gcgaccacga gaggggcgtc 4260
cggtacttgg cgaggatgaa ggacgccggg ttcgtcccga ccggcgacat gtacggcggc 4320
ctgatcggag gctacgcggc ggaggggagg atgggcaggt gccggaggct gatcagggag 4380
gcggagttgg ccggcgtgaa gctggagagg aggctgcttt cgcgcttgtc tgagatgggg 4440
gttgagcatt ctcagctctg aaagaaagaa actgatcttg ctgttggcat ctgactagca 4500
tggacattgc acagaaattg aactgtaaaa ttaggctctg cgtaattgcc tttaacctgt 4560
agtaacagaa cattacatta tgaatatatt tgttttgttg ttggcttgtt atcctctatt 4620
acactgtcca aaaaaaaatc tagctacgaa gtttgtagtt agaattgagt ttttttt 4677
<210> 3
<211> 1461
<212> DNA
<213> Artificial sequence
<400> 3
atgtcactgg ttaaccggcc gctccctgca ctgtatggaa tttgcacggc ctcggcgaag 60
accagagctt ggtggctctg cagagatggc aatctcccat caacaagtcg aatttcttgc 120
acagagccat cgaatggggg ttctgtcatg gagttggagg taatggaccg caatgagcag 180
acataccatg agaattcttc tgcttctgag gatgaggacg atgatgagga agaggctgta 240
gagtggagca aagatgagct tgatgccatc tcggcactct ttgaccggcc gatgcgccag 300
aagccaccaa agccgccaaa cccggtgagg cagcgacctc tcccgctgcc actgcctcac 360
aagacacgac tgcccaatgc tcctgcacca aagcagcata tcaggctggc agcaagggct 420
gcactttctt cacgctcttc attcagtgac caggtgtgca agaacccgga ggtcctcatt 480
gggattgccc gggagattgc agcgctgcca ccggagtctg acgtgtccat tgtgctcgat 540
cggtgggttc ggttcctccg gaaaggatcc ttgtcaatga ccattcggga gcttgggcac 600
atggggctcc cagagagggc attgcagaca ctttgctggg ctcagaggca gactgtcgtg 660
ccactcttcc ccgacgaccg gattctcgct tcgacaattg aggtcttggc acgcttcgac 720
cagctgaaga tggaggatgc gctagagcag tgcgtccctt cagcgagccg tgctgttctt 780
gaagccatgg tcagtgggtt catcagggca ggtaaagtag gccttgcccg aaaactcctc 840
gagttcgcca cgatcaacaa gaggacactg agcccgagcg tccacgtgaa gctgatgctg 900
gaggccgtcc gtacgcctga gggctacggg ctcgccgcgg cgctgctgga cgagctcggc 960
gagaggccgg agctgcacgt tcggcagcag gactgcacgg ccgtgatgaa ggtgtgcgtg 1020
aagctccggc ggtacgccgc cgtggagagc ctgttcggct ggttcaggga caccggcggg 1080
aggcccacgg tggtgatgta cacggcggtg atccacagcc ggtgccgcga cgggcggcac 1140
cgggaggcgc tgtccctggc gtgggagatg gagcggcacg ccggcggcct cctcgacctg 1200
ccggcgtacc gggtgctggt gaagctgtgc gtggcgttgc gcgaccacga gaggggcgtc 1260
cggtacttgg cgaggatgaa ggacgccggg ttcgtcccga ccggcgacat gtacggcggc 1320
ctgatcggag gctacgcggc ggaggggagg atgggcaggt gccggaggct gatcagggag 1380
gcggagttgg ccggcgtgaa gctggagagg aggctgcttt cgcgcttgtc tgagatgggg 1440
gttgagcatt ctcagctctg a 1461
<210> 4
<211> 486
<212> PRT
<213> Artificial sequence
<400> 4
Met Ser Leu Val Asn Arg Pro Leu Pro Ala Leu Tyr Gly Ile Cys Thr
1 5 10 15
Ala Ser Ala Lys Thr Arg Ala Trp Trp Leu Cys Arg Asp Gly Asn Leu
20 25 30
Pro Ser Thr Ser Arg Ile Ser Cys Thr Glu Pro Ser Asn Gly Gly Ser
35 40 45
Val Met Glu Leu Glu Val Met Asp Arg Asn Glu Gln Thr Tyr His Glu
50 55 60
Asn Ser Ser Ala Ser Glu Asp Glu Asp Asp Asp Glu Glu Glu Ala Val
65 70 75 80
Glu Trp Ser Lys Asp Glu Leu Asp Ala Ile Ser Ala Leu Phe Asp Arg
85 90 95
Pro Met Arg Gln Lys Pro Pro Lys Pro Pro Asn Pro Val Arg Gln Arg
100 105 110
Pro Leu Pro Leu Pro Leu Pro His Lys Thr Arg Leu Pro Asn Ala Pro
115 120 125
Ala Pro Lys Gln His Ile Arg Leu Ala Ala Arg Ala Ala Leu Ser Ser
130 135 140
Arg Ser Ser Phe Ser Asp Gln Val Cys Lys Asn Pro Glu Val Leu Ile
145 150 155 160
Gly Ile Ala Arg Glu Ile Ala Ala Leu Pro Pro Glu Ser Asp Val Ser
165 170 175
Ile Val Leu Asp Arg Trp Val Arg Phe Leu Arg Lys Gly Ser Leu Ser
180 185 190
Met Thr Ile Arg Glu Leu Gly His Met Gly Leu Pro Glu Arg Ala Leu
195 200 205
Gln Thr Leu Cys Trp Ala Gln Arg Gln Thr Val Val Pro Leu Phe Pro
210 215 220
Asp Asp Arg Ile Leu Ala Ser Thr Ile Glu Val Leu Ala Arg Phe Asp
225 230 235 240
Gln Leu Lys Met Glu Asp Ala Leu Glu Gln Cys Val Pro Ser Ala Ser
245 250 255
Arg Ala Val Leu Glu Ala Met Val Ser Gly Phe Ile Arg Ala Gly Lys
260 265 270
Val Gly Leu Ala Arg Lys Leu Leu Glu Phe Ala Thr Ile Asn Lys Arg
275 280 285
Thr Leu Ser Pro Ser Val His Val Lys Leu Met Leu Glu Ala Val Arg
290 295 300
Thr Pro Glu Gly Tyr Gly Leu Ala Ala Ala Leu Leu Asp Glu Leu Gly
305 310 315 320
Glu Arg Pro Glu Leu His Val Arg Gln Gln Asp Cys Thr Ala Val Met
325 330 335
Lys Val Cys Val Lys Leu Arg Arg Tyr Ala Ala Val Glu Ser Leu Phe
340 345 350
Gly Trp Phe Arg Asp Thr Gly Gly Arg Pro Thr Val Val Met Tyr Thr
355 360 365
Ala Val Ile His Ser Arg Cys Arg Asp Gly Arg His Arg Glu Ala Leu
370 375 380
Ser Leu Ala Trp Glu Met Glu Arg His Ala Gly Gly Leu Leu Asp Leu
385 390 395 400
Pro Ala Tyr Arg Val Leu Val Lys Leu Cys Val Ala Leu Arg Asp His
405 410 415
Glu Arg Gly Val Arg Tyr Leu Ala Arg Met Lys Asp Ala Gly Phe Val
420 425 430
Pro Thr Gly Asp Met Tyr Gly Gly Leu Ile Gly Gly Tyr Ala Ala Glu
435 440 445
Gly Arg Met Gly Arg Cys Arg Arg Leu Ile Arg Glu Ala Glu Leu Ala
450 455 460
Gly Val Lys Leu Glu Arg Arg Leu Leu Ser Arg Leu Ser Glu Met Gly
465 470 475 480
Val Glu His Ser Gln Leu
485
<210> 5
<211> 3000
<212> DNA
<213> Artificial sequence
<400> 5
tttggcgaga agtgggtcca gtctgtagga ttatatcttg ggccgtgcca ggacagggca 60
gcaatcgact gcagcaacag taaaaccgct tgtcaaacaa ccccaataga agttatagcc 120
gcttattaca aaattctttg ataaactaaa tgcaaatgat aaaaaaaagt atgatgaaaa 180
gatagacgcc ggaaaaaaac atattcggca actaataatt ttactgctaa atcttcatct 240
ataggttgat aaaaatttat aaaaccgttg tctttaaatg caaaatgtag gagcttatta 300
tcttcatcat attttagttg gacccaaaat cgaaaacatt aactagtatt gcctgcatat 360
agaatttagt aagagctcta taaaaaaaat gtaccattgc tactaagccg aaatactcaa 420
cgtaatttct agtaaggtta ggctcggtag aaccatatgt gtaatgatat aatatctaga 480
gtatatatat tgcacatgtc gcgacgttga tgactggcta tgtgagctta atctcaatta 540
atttccaaga aaaaaaatac ctagagtata tatattgcac atgttgcgac gttgataacg 600
ggctatgcga gccttatctt aattaatgtc caagaaaaaa ccgacctagt aattatcgtc 660
ggtgacaaga agtttgcttg tgtcaaaact attgggccag gcatgggtta cccgttggta 720
aaaattgatt cccaattttt gcccactgcg ctttcggttt cggacggacg tgtccacatg 780
aataccccgg acacatgggt aaaattgcca tccctacata catacataca tacatatata 840
tatatacata catatacata tatatacata tgtatatata catatatata tatatatata 900
tatatatata tatatatata tatatatata gacacacaca tcaacaacaa caaaaaccct 960
aacctaatac atatccctcc atccgcttca aggcggcgac ggctggcagc tggctctgag 1020
ggcaacgcgg ccgagcatgg cgccgaggcg gactcaggca gtgagcggcg agcagactcc 1080
acgtcgtcac caacttcctt cctcctggtc gccgagtccg gcccagaggc cgccgtcgag 1140
ctccgcttcg cttcctcgcg tcctggtcca aggccaaggc cctcctcctc caccatcgcc 1200
cggcagctgg tgacacggcg gcgcacggca ggcaagccca gtgagtagca agcacgcatc 1260
gtgatgtgac attctagctt agcttccgtc gtcttcttct tcggctgtgt ttagatttaa 1320
agtttggatc caaacttcag tcttttttat cacatcaact tatcatacac acataatttt 1380
tcatttttca gtcacatcat ctccaatttc aaccaaaatt caaactttgc gctaaactaa 1440
acacagcctt cgtcaggtca tcgccgatca gttaatactc ctactaggct actactccca 1500
gctgaattag ctgcacaatt aaatatcgga gaagacgacg agggtgactt gttgggccta 1560
agtgggccgt gtgccgaaca aatggcccaa gtctaggaca atattttttg ggccgtgata 1620
cgacattttg ccacgagggc ccaatatgga tattttttgc attttgctcc ctgatttgaa 1680
ctgtatttct gcattttttg acggcatctc atccaggagc ccaggtcggc gccgccgcac 1740
acgcgtggcg gcctcgccgg cgactcggct gctcagagcg gcgcgaagaa aggccgcgct 1800
ggagtctgca gccgctgcgt gtggcagagt agagaccagc cagggagagc gcagctgtca 1860
gcaacctata ccagccagtg agcttgctcg cgccgttcgc cggggcgggg gaagagaacc 1920
atctggcgac aaagcacagg tgaaatccag tctctgccaa gcaatctcta attcccccca 1980
tggttcccta attgtgtgtg tatttcccat cattagtgct gttctaattc atcatttgtg 2040
ctgtttgttc tcaagcgtat tgctgagcga atagaatgcc caaagtgctt ctaatttgta 2100
tgcataggaa ttcttgacct gtcaagttac acactgcatg agttgatatg ggaacgtgcg 2160
gatagaaaaa ttcgtatttt ttggcgaaat ttcatgaaac ttgattgcat gttttgttct 2220
ctgcgtttca ttttggaatt tggggttcaa ctgcagttga aacgagtcat ctcaaattcg 2280
ttctgaacaa ttcccttcac gctgttcaaa cgaaagcaaa gagcctctct tataaacatg 2340
tttgcaatcc ggtggggatg atggcaattt caaagcttcc gaaacctgca aggcaaaagt 2400
aaaaaatttc acattggaaa gctggcttga tttgaaagcc ctataatttt ttttagtata 2460
aaatctctgt ttcgaagtca aacaattctg atcctgagta ggaatcactg atttgctaag 2520
ctgaaatctg aatttgtttt tgcactgaaa ttttatttgg aaattgctca aaaagaaagt 2580
ccagtacatg tttagaaaat tcctgatttc aggaaacttt taagccgaaa acctgagttg 2640
gaagttgttc ttgggaattt accaattagt gtttttttgt aattataaat acaaatagta 2700
aggacaacaa aatcctcccg ctactttgtg acctaactgt tggtttgcat attagaaaca 2760
cccctcgtgc taaatccagt agtttatgtt gtctgttagt ttttgaggca aacttctata 2820
tttgtgcagg tggctgtgat gggaacataa gagaggctct tatgaacttc taattcatac 2880
tagtagtata tttatgctta ttatacatgc tgacttgcaa ccttttgctt tatcatgcta 2940
ctatctggaa tcaatattgt tctaagtaac aacattttgc ctaatcttaa aacaggaagc 3000
<210> 6
<211> 3000
<212> DNA
<213> Artificial sequence
<400> 6
tgccatcgat agcttagctc atcccctttc tcttgatcgg tcggccaaag tttgacatgc 60
ttgcagttgc agctaacgtc tctgcatttt ctttcttgtt tctcagatta atttcaggct 120
gcttaattat tctgtaagaa cgaggtgctc tttgtgctag ctagctatgg ctcagctaca 180
ccctcgttta tcataaacat acacgtttct caaactgtaa gctataaaca gtgtgtttcg 240
tacaaaatct ttttatacta tataaaagaa tttaatggtg atggtgaatc taccatcatt 300
tagagttggt gagatcttct aatgctacat aaatacatct ttgtgtgtgg tttggttttc 360
tattggatgt ttgattgcaa ttatgtttga caaatacaat ttctaaatca agaaattcac 420
ataaatcttt taaaatccgt ccgttgcaga gcacgttact ttgctagtct ttaaaatttt 480
aaataaattt agttttcaag tttataatag gtaatactga gtttatcatg tactattcac 540
ctgttccggt tttcatgcca ctaactagct cagtcaaact cctttatcga acacagccca 600
gccagaatgc acccttgtct ttttacctag tgctataaga ggaaccaatt caagatgatg 660
atgattgatt taccttgtag cattactacg cagttcaaat tactaacctt tcagattagg 720
tacggtctga atatgtgatg gaagatcgaa gcagctgtat ctcataaaat aaattacaga 780
ttccgtctgg aaactgcaag acgaatttat taagcctaat taatctatta ttagcaaata 840
tttattgtag caccaaattg tcaaatcatg gcgcaattag gcttaaaaga tttgtctcgt 900
aatttcaagg cgaaatgttt aattgttttt tttctacatt tagtactcta tgcatgtgtc 960
caaacatcct atgtaacagc gtgaaaaatt ttgtttggga actaaacaga ccctgattgt 1020
gatgcaaccc ataagtaatt gcaaaactag ctagctatct acagcatcat cagcatatat 1080
cttggtggtc taaacttctc aggaaacaaa agatgagaaa ttgttctgtt cttagtccac 1140
gctgatgata acgataggtg tgctgtggtg ggcacaagct tttttctttc tttttttttt 1200
tgaggggaaa gttgcttaag atgaagtgac ataaacaaga ttaattcgag gaaaatgtgc 1260
gcatctcaga tgtgtactcc atttatacaa aacccagcaa attaagaagg tacaattatc 1320
tgacagacta gcgaacaaac tgaaaattat agtcgtagta ctacaaattt gcagtttgtg 1380
gctgagggga atctatccca tctggtatat gctgtcactt tgtgctagta ctgtactatg 1440
tactagcctg tagcctctac ttttgtaatt gaatgagaat atttgaatgt tttacaccac 1500
ataatggaac agctcaaatc tcataagatc ttggagagat attcttgatt actctgatta 1560
atgggtcatg tgttgatgca tatatatata tatatatata tatatatata tatatatata 1620
tatatatata tatatatatg ttaatgaatc aaaacgctgc attttgctcc ctgatttgaa 1680
ctgtatttct gcattttttg acggcatctc atccaggagc ccaggtcggc gccgccgcac 1740
acgcgtggcg gcctcgccgg cgactcggct gctcagagcg gcgcgaagaa aggccgcgct 1800
ggagtctgca gccgctgcgt gtggcagagt agagaccagc cagggagagc gcagctgtca 1860
gcaacctata ccagccagtg agcttgctcg cgccgttcgc cggggcgggg gaagagaacc 1920
atctggcgac aaagcacagg tgaaatccag tctctgccaa gcaatctcta attcccccca 1980
tggttcccta attgtgtgtg tatttcccat cattagtgct gttctaattc atcatttgtg 2040
ctgtttgttc tcaagcgtat tgctgagcga atagaatgcc caaagtgctt ctaatttgta 2100
tgcataggaa ttcttgacct gtcaagttac acactgcatg agttgatatg ggaacgtgcg 2160
gatagaaaaa ttcgtatttt ttggcgaaat ttcatgaaac ttgattgcat gttttgttct 2220
ctgcgtttca ttttggaatt tggggttcaa ctgcagttga aacgagtcat ctcaaattcg 2280
ttctgaacaa ttcccttcac gctgttcaaa cgaaagcaaa gagcctctct tataaacatg 2340
tttgcaatcc ggtggggatg atggcaattt caaagcttcc gaaacctgca aggcaaaagt 2400
aaaaaatttc acattggaaa gctggcttga tttgaaagcc ctataatttt ttttagtata 2460
aaatctctgt ttcgaagtca aacaattctg atcctgagta ggaatcactg atttgctaag 2520
ctgaaatctg aatttgtttt tgcactgaaa ttttatttgg aaattgctca aaaagaaagt 2580
ccagtacatg tttagaaaat tcctgatttc aggaaacttt taagccgaaa acctgagttg 2640
gaagttgttc ttgggaattt accaattagt gtttttttgt aattataaat acaaatagta 2700
aggacaacaa aatcctcccg ctactttgtg acctaactgt tggtttgcat attagaaaca 2760
cccctcgtgc taaatccagt agtttatgtt gtctgttagt ttttgaggca aacttctata 2820
tttgtgcagg tggctgtgat gggaacataa gagaggctct tatgaacttc taattcatac 2880
tagtagtata tttatgctta ttatacatgc tgacttgcaa ccttttgctt tatcatgcta 2940
ctatctggaa tcaatattgt tctaagtaac aacattttgc ctaatcttaa aacaggaagc 3000
<210> 7
<211> 20
<212> DNA
<213> Artificial sequence
<400> 7
<210> 8
<211> 20
<212> DNA
<213> Artificial sequence
<400> 8
<210> 9
<211> 20
<212> DNA
<213> Artificial sequence
<400> 9
<210> 10
<211> 27
<212> DNA
<213> Artificial sequence
<400> 10
ttatcacatc aacttatcat acacaca 27
<210> 11
<211> 20
<212> DNA
<213> Artificial sequence
<400> 11
<210> 12
<211> 23
<212> DNA
<213> Artificial sequence
<400> 12
ttggtcgatt atcagaaact tcc 23
<210> 13
<211> 22
<212> DNA
<213> Artificial sequence
<400> 13
tggaatcatg tttggtcttc ac 22
<210> 14
<211> 20
<212> DNA
<213> Artificial sequence
<400> 14
<210> 15
<211> 20
<212> DNA
<213> Artificial sequence
<400> 15
<210> 16
<211> 20
<212> DNA
<213> Artificial sequence
<400> 16
<210> 17
<211> 20
<212> DNA
<213> Artificial sequence
<400> 17
<210> 18
<211> 20
<212> DNA
<213> Artificial sequence
<400> 18
Claims (10)
1. A nucleic acid sequence of the OsWSSP gene is shown in SEQ ID No.1 or SEQ ID No. 2.
2. A promoter has a nucleic acid sequence shown as SEQ ID No.5 or SEQ ID No. 6.
3. The gene or the protein with the sequence shown in SEQ ID No.4 or the coding nucleic acid thereof according to claim 1 or the promoter according to claim 2 is used for regulating and controlling chloroplast development and/or chlorophyll synthesis of rice scabbard and panicle.
4. A primer set, comprising at least the following 2 primer pairs:
1, the sequence of an upstream primer is shown as SEQ ID.NO.7, and the sequence of a downstream primer is shown as SEQ ID.NO. 8;
in the 2 nd pair of primers, the sequence of the upstream primer is shown in SEQ ID.NO.9, and the sequence of the downstream primer is shown in SEQ ID.NO. 10.
5. A primer set comprising at least 4 primer pairs selected from the group consisting of:
1, the sequence of an upstream primer is shown as SEQ ID.NO.11, and the sequence of a downstream primer is shown as SEQ ID.NO. 12;
in the 2 nd pair of primers, the sequence of the upstream primer is shown in SEQ ID.NO.13, and the sequence of the downstream primer is shown in SEQ ID.NO. 14;
the 3 rd pair of primers has an upstream primer sequence shown in SEQ ID No.15 and a downstream primer sequence shown in SEQ ID No. 16;
in the 4 th pair of primers, the sequence of the upstream primer is shown in SEQ ID No.17, and the sequence of the downstream primer is shown in SEQ ID No. 18.
6. A kit, characterized in that: comprising the gene of claim 1, and/or the promoter of claim 2, and/or the primer set of claim 4 or 5.
7. A method for identifying a rice variety, comprising the steps of identifying the gene of claim 1, and/or the promoter of claim 2, and/or the protein of SEQ id No.4 or the nucleic acid encoding the protein in the genome of rice; or:
the method comprises the following steps: (1) extracting the genome DNA of the rice sample; (2) performing PCR amplification on the genomic DNA by using the primer set of claim 4 or 5; (3) analyzing the size of the amplified product.
8. Use of the gene of claim 1, or a protein having the sequence shown in SEQ ID No.4 or a nucleic acid encoding the same, or the promoter of claim 2, or the primer set of claim 4 or 5, or the kit of claim 6, or the method of claim 7 for identifying rice varieties or in rice breeding.
9. The use according to claim 3, or the method according to claim 7, or the use according to claim 8, wherein the rice comprises wakame, nipponlily, a light-or temperature-sensitive sterile line, and/or hybrids derived therefrom.
10. The use, method or use according to claim 9 wherein the rice has a sheath and panicle albino phenotype.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210280493.3A CN114574500B (en) | 2022-03-22 | 2022-03-22 | Clone and application of rice sword-leaf sheath and panicle whitening trait gene OsWSSP |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210280493.3A CN114574500B (en) | 2022-03-22 | 2022-03-22 | Clone and application of rice sword-leaf sheath and panicle whitening trait gene OsWSSP |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114574500A true CN114574500A (en) | 2022-06-03 |
CN114574500B CN114574500B (en) | 2022-11-22 |
Family
ID=81777434
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210280493.3A Active CN114574500B (en) | 2022-03-22 | 2022-03-22 | Clone and application of rice sword-leaf sheath and panicle whitening trait gene OsWSSP |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114574500B (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1505639A (en) * | 2001-04-25 | 2004-06-16 | ������ѧ��ʽ���� | Protein relating to restoration from cytoplasmic male sterility to fertility and gene coding the protein |
WO2009033369A1 (en) * | 2007-09-12 | 2009-03-19 | Huazhong Agricultural University | Cloning and application of a pleiotropic gene ghd7 that controls grains yield, heading date and plant height of rice |
CN101724031A (en) * | 2009-12-29 | 2010-06-09 | 中国科学院遗传与发育生物学研究所 | Protein related to rice panicle type and encoding gene and application thereof |
CN103497939A (en) * | 2013-09-04 | 2014-01-08 | 中国水稻研究所 | Protein capable of regulating and controlling growth and photosynthetic rate of chloroplast, and gene and application thereof |
CN103613649A (en) * | 2013-10-30 | 2014-03-05 | 中国水稻研究所 | Paddy rice leaf color control gene OscpSRP54 and protein encoded by same |
CN109456396A (en) * | 2018-12-29 | 2019-03-12 | 中国水稻研究所 | A kind of protein, molecular labeling and the application of Senescence of Rice and fringe type controlling gene HK73 and its coding |
CN113957082A (en) * | 2021-12-07 | 2022-01-21 | 南京农业大学 | Gene TSA for protecting rice chloroplast development at low temperature and encoded protein and application thereof |
-
2022
- 2022-03-22 CN CN202210280493.3A patent/CN114574500B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1505639A (en) * | 2001-04-25 | 2004-06-16 | ������ѧ��ʽ���� | Protein relating to restoration from cytoplasmic male sterility to fertility and gene coding the protein |
WO2009033369A1 (en) * | 2007-09-12 | 2009-03-19 | Huazhong Agricultural University | Cloning and application of a pleiotropic gene ghd7 that controls grains yield, heading date and plant height of rice |
CN101724031A (en) * | 2009-12-29 | 2010-06-09 | 中国科学院遗传与发育生物学研究所 | Protein related to rice panicle type and encoding gene and application thereof |
CN103497939A (en) * | 2013-09-04 | 2014-01-08 | 中国水稻研究所 | Protein capable of regulating and controlling growth and photosynthetic rate of chloroplast, and gene and application thereof |
CN103613649A (en) * | 2013-10-30 | 2014-03-05 | 中国水稻研究所 | Paddy rice leaf color control gene OscpSRP54 and protein encoded by same |
CN109456396A (en) * | 2018-12-29 | 2019-03-12 | 中国水稻研究所 | A kind of protein, molecular labeling and the application of Senescence of Rice and fringe type controlling gene HK73 and its coding |
CN113957082A (en) * | 2021-12-07 | 2022-01-21 | 南京农业大学 | Gene TSA for protecting rice chloroplast development at low temperature and encoded protein and application thereof |
Non-Patent Citations (3)
Title |
---|
GENBANK: ""pentatricopeptide repeat-containing protein At2g01860[Oryza sativa Japonica Group]",Accession Number:XP_015623222.1", 《GENBANK》 * |
王中豪 等: ""水稻白化转绿和穗顶端退化突变体vpa1的遗传分析和基因定位"", 《中国水稻科学》 * |
陈云 等: ""水稻白穗突变体wp7的鉴定及基因定位"", 《分子植物育种》 * |
Also Published As
Publication number | Publication date |
---|---|
CN114574500B (en) | 2022-11-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Motamayor et al. | The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color | |
US6484105B2 (en) | Method for obtaining a plant with a genetic lesion in a gene sequence | |
WO2017092110A1 (en) | Sesamum indicum inflorescence definite gene sidt1 and snp marker thereof | |
CN108165554B (en) | Corn leaf width control gene ZmNL4 and application thereof | |
CN112375130A (en) | Corn ear length gene and molecular marker and application thereof | |
CN111235180A (en) | Method for shortening flowering phase of corn | |
CN108796107B (en) | SNP molecular marker coseparated with cucumber spur hardness gene Hard and application thereof | |
CN113337636A (en) | SNP (Single nucleotide polymorphism) site and KASP (Kaposi-phosphate) molecular marker for identifying leaf vein traits of tomato and application | |
CN112521471A (en) | Gene and molecular marker for controlling water content of corn kernels and application thereof | |
CN114574500B (en) | Clone and application of rice sword-leaf sheath and panicle whitening trait gene OsWSSP | |
US20210388375A1 (en) | Genes associated with resistance to wheat yellow rust | |
CN108220466A (en) | For the molecular labeling and its application of Early Identification pawpaw gender and gender correlated traits | |
CN114457091B (en) | Gene Taxip affecting quality of wheat grains and application thereof | |
CN113151560B (en) | Molecular marker for screening poplar with high pore density and high photosynthetic efficiency as well as method and application thereof | |
CN111363751B (en) | Clone and application of rice grain width and grain weight gene GW5.1 | |
CN108441572A (en) | The identification method of DCIPThe chloroplast of maize cytoplasm type based on KASP technologies | |
CN114395580A (en) | Gene for controlling plant height of corn | |
CN112662687A (en) | Method, kit and gene for postponing maize florescence | |
US20130004951A1 (en) | Bulked mutant analysis | |
CN110407922A (en) | Rice cold tolerance gene qSCT1 and its application | |
CN114891800B (en) | Corn ear length gene and application thereof | |
CN112646013B (en) | Corn flowering phase gene and application thereof | |
CN112626085B (en) | Rice narrow leaf gene NAL13 and application thereof | |
WO2005003349A1 (en) | Rice transposon gene | |
Prasad et al. | Advancement in molecular tools of plant population genetics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |