CN112760304A - 基于基因编辑技术的gbssi突变型蛋白及其在植物育种中的应用 - Google Patents
基于基因编辑技术的gbssi突变型蛋白及其在植物育种中的应用 Download PDFInfo
- Publication number
- CN112760304A CN112760304A CN202110239257.2A CN202110239257A CN112760304A CN 112760304 A CN112760304 A CN 112760304A CN 202110239257 A CN202110239257 A CN 202110239257A CN 112760304 A CN112760304 A CN 112760304A
- Authority
- CN
- China
- Prior art keywords
- rice
- gene
- ala
- gly
- gbssi
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010021466 Mutant Proteins Proteins 0.000 title claims abstract description 25
- 102000008300 Mutant Proteins Human genes 0.000 title claims abstract description 25
- 108010004047 granule-bound starch synthase I Proteins 0.000 title claims abstract description 25
- 238000010362 genome editing Methods 0.000 title claims abstract description 24
- 238000005516 engineering process Methods 0.000 title claims abstract description 18
- 238000003976 plant breeding Methods 0.000 title description 4
- 241000209094 Oryza Species 0.000 claims abstract description 66
- 235000007164 Oryza sativa Nutrition 0.000 claims abstract description 62
- 235000009566 rice Nutrition 0.000 claims abstract description 62
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 52
- 108091033409 CRISPR Proteins 0.000 claims abstract description 51
- 229920000856 Amylose Polymers 0.000 claims abstract description 39
- 230000035772 mutation Effects 0.000 claims abstract description 36
- 238000009395 breeding Methods 0.000 claims abstract description 26
- 150000001413 amino acids Chemical class 0.000 claims abstract description 18
- 230000001488 breeding effect Effects 0.000 claims abstract description 18
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 13
- 238000000034 method Methods 0.000 claims abstract description 12
- 102000003960 Ligases Human genes 0.000 claims abstract description 3
- 108090000364 Ligases Proteins 0.000 claims abstract description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 6
- 241000196324 Embryophyta Species 0.000 claims description 57
- 239000013598 vector Substances 0.000 claims description 21
- 108700028369 Alleles Proteins 0.000 claims description 13
- 102000004169 proteins and genes Human genes 0.000 claims description 11
- 101150047832 hpt gene Proteins 0.000 claims description 10
- 239000010413 mother solution Substances 0.000 claims description 8
- 230000009261 transgenic effect Effects 0.000 claims description 8
- 230000000694 effects Effects 0.000 claims description 7
- 239000002773 nucleotide Substances 0.000 claims description 7
- 125000003729 nucleotide group Chemical group 0.000 claims description 7
- 238000000137 annealing Methods 0.000 claims description 6
- 239000012634 fragment Substances 0.000 claims description 6
- 238000012163 sequencing technique Methods 0.000 claims description 6
- 241000589158 Agrobacterium Species 0.000 claims description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 5
- 150000007523 nucleic acids Chemical group 0.000 claims description 5
- 230000001131 transforming effect Effects 0.000 claims description 5
- 241000894006 Bacteria Species 0.000 claims description 4
- 239000012154 double-distilled water Substances 0.000 claims description 4
- 239000013604 expression vector Substances 0.000 claims description 4
- 239000003550 marker Substances 0.000 claims description 4
- 239000013612 plasmid Substances 0.000 claims description 4
- 241000588724 Escherichia coli Species 0.000 claims description 3
- 238000001816 cooling Methods 0.000 claims description 2
- 238000012217 deletion Methods 0.000 claims description 2
- 230000037430 deletion Effects 0.000 claims description 2
- 238000007865 diluting Methods 0.000 claims description 2
- 239000007788 liquid Substances 0.000 claims description 2
- 230000035939 shock Effects 0.000 claims description 2
- 238000006467 substitution reaction Methods 0.000 claims description 2
- 239000000463 material Substances 0.000 abstract description 13
- 238000012216 screening Methods 0.000 abstract description 9
- 230000009418 agronomic effect Effects 0.000 abstract description 8
- 230000006872 improvement Effects 0.000 abstract description 4
- 230000008569 process Effects 0.000 abstract description 4
- 238000009402 cross-breeding Methods 0.000 abstract description 3
- 239000000126 substance Substances 0.000 abstract description 3
- 108020004414 DNA Proteins 0.000 description 18
- 235000001014 amino acid Nutrition 0.000 description 9
- 229940024606 amino acid Drugs 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 7
- 238000001514 detection method Methods 0.000 description 6
- 235000013339 cereals Nutrition 0.000 description 5
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 210000004027 cell Anatomy 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 244000184734 Pyrus japonica Species 0.000 description 3
- 108010039811 Starch synthase Proteins 0.000 description 3
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 3
- 239000004473 Threonine Substances 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 2
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 2
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 2
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 2
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 2
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 2
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 2
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 2
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- IHZFGJLKDYINPV-XIRDDKMYSA-N Asp-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(O)=O)N)C(O)=O)C1=CN=CN1 IHZFGJLKDYINPV-XIRDDKMYSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 2
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 2
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 2
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 2
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 2
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 2
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 2
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 2
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 2
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 2
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 2
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 2
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 2
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 238000012214 genetic breeding Methods 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000012452 mother liquor Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8202—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation by biological means, e.g. cell mediated or natural vector
- C12N15/8205—Agrobacterium mediated transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8218—Antisense, co-suppression, viral induced gene silencing [VIGS], post-transcriptional induced gene silencing [PTGS]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8245—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified carbohydrate or sugar alcohol metabolism, e.g. starch biosynthesis
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/01—Hexosyltransferases (2.4.1)
- C12Y204/01021—Starch synthase (2.4.1.21)
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Nutrition Science (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明公开了水稻直链淀粉合成酶GBSSI突变型蛋白、其编码基因及在育种中的应用,所述GBSSI突变型蛋白的氨基酸序列存在以下突变:其对应的水稻GBSSI氨基酸序列的第178位氨基酸发生突变,本发明还公开了利用基因编辑创制直链淀粉含量5%左右水稻的育种方法。本发明利用CRISPR/Cas9基因编辑技术,对Waxy基因进行定点突变,通过后代筛选,在T1代可以获得剔除Cas9元件的新材料,而新材料的基本农艺性状与野生型相比无明显改变。相对于传统的化学诱变育种和杂交育种等手段,基因编辑定向改良分子育种技术具有高效、精准等优点,极大提高育种效率、加快育种进程。
Description
技术领域
本发明涉及水稻遗传育种、糯性品质改良邻域,具体地,本发明涉及基于基因编辑技术的GBSSI突变型蛋白及其在植物育种中的应用。
背景技术
水稻是重要的粮食作物,随着人们生活水平的提高,对优质水稻品种的需求进一步增大。水稻直链淀粉含量影响水稻糯性,进而影响稻米的品种和口感。高直链淀粉稻米的糊化温度较高,胶稠度降低,煮成米饭的黏性、光泽度及柔软性都比较差;低直链淀粉含量的稻米,煮成饭后米粒晶莹、口感软糯,从而备受人们青睐。因而,通过降低水稻中的直链淀粉含量从而来改善稻米品质一直以来是育种家的目标。
水稻中直链淀粉主要由水稻蜡质基因Waxy(Wx)编码的颗粒结合淀粉合成酶(GBSSI)催化合成。研究表明Waxy是控制稻米直链淀粉含量的主效基因,目前大部分基于水稻品质的分子遗传育种都是围绕该基因进行的。在栽培稻中存在三个主要的Wx等位基因,Wxa、Wxb和wx。wx存在于糯稻品种中,是Wx基因的功能缺失形式,wx稻米中没有直链淀粉或者直链淀粉含量非常低(小于2%)。Wxa等位基因存在于大多数籼稻品种中,而大多数粳稻品种含有Wxb等位基因。这两个等位基因之间的主要差异是G/T多态性,该多态性产生的差异剪接会影响Wx mRNA稳定性,导致Wxa等位基因产生的mRNA和蛋白质水平比Wxb高10倍,从而使籼稻产生更多的直链淀粉(一般大于25%,有的在30%以上),而粳稻中直链淀粉含量一般为15-18%。除了这三个等位基因,近年来通过对一些自然存在的一些直链淀粉含量变化的品种的研究,也发现了其它一些复等位基因,如Wxin、Wxop、Wxmq及Wxmp等。这些等位基因一般在序列上有一个或几个碱基的差异,这些差异会影响Wx基因的表达及GBSS的活性,从而导致直链淀粉含量及稻米品质的不同。
通过传统的育种方法也可以这些优良性状转育到当地品种中,但这种方式耗时、费力、不确定因素较多,很难在短期内对本地品种进行改良。因而需要发展新的技术和方法来加速优良品种的培育。近年来,基因编辑技术,特别是CRISPR/Cas9技术的出现,为快速培育优良新品种提供了可能。本发明通过CRISPR/Cas9技术对水稻Waxy基因进行了人工突变,并获得了直链淀粉含量5%左右的新材料,为水稻新品种改良提供基础材料。
发明内容
本发明的第一个目的是提供一种水稻GBSSI突变型蛋白,所述GBSSI突变型蛋白的氨基酸序列存在以下突变:其对应于水稻GBSSI氨基酸序列的第178位氨基酸发生突变。
本发明首次发现了该178位氨基酸由苏氨酸突变为异亮氨酸并改变稻米直链淀粉含量。本发明的第178位氨基酸的突变还可以包括谷氨酸、甘氨酸、色氨酸、天冬氨酸、色氨酸、丙氨酸、缬氨酸、亮氨酸、脯氨酸、苯丙氨酸、酪氨酸、丝氨酸、半胱氨酸、蛋氨酸、天冬酰胺、谷氨酰胺、赖氨酸、精氨酸、组氨酸以及终止密码子等19种变异类型。而关于上述的氨基酸的其他变异或提前终止是否影响淀粉合成酶活性、生理功能以及是否改变稻米直链淀粉含量,还有待进一步研究证实。
进一步地,本发明所述的水稻GBSSI突变型蛋白,其特征在于,包括:
(a)其氨基酸序列如SEQ ID NO.2所示;或
(b)在(a)中的氨基酸序列经过取代和/或缺失和/或添加一个或几个氨基酸且具有直链淀粉合成酶活性的由(a)衍生的蛋白质。
本发明第二个目的是提供编码上述突变型蛋白的基因。所述基因可以包括:在严格条件下与编码上述突变型蛋白的核苷酸序列杂交且编码具有直链淀粉合成酶活性的蛋白质的核苷酸序列。
进一步地,所述基因是:
a)其核苷酸序列如SEQ ID NO.1所示;或
b)在严格条件下与a)限定的核酸序列杂交且编码具有直链淀粉合成酶活性蛋白质的核酸序列;
本发明第三个目的是提供一种表达盒、重组载体或细胞,所述表达盒、重组载体或细胞含有编码上述突变型蛋白的基因。除重组载体外,还可以是表达盒,细胞等。
本发明第四个目的是提供上述水稻GBSSI突变型蛋白、基因、表达载体在水稻育种中的应用,特别是水稻糯性品质方面的应用。
本发明第五个目的是提供一种获得低直链淀粉含量(本发明所述低直链淀粉是指直链淀粉含量为5%左右)的水稻的方法,包括如下步骤:
1)使水稻植株包含上述编码GBSSI突变型蛋白的基因;或
2)使水稻植株表达上述水稻GBSSI突变型蛋白。
本发明的最后一个目的是提供一种利用基因编辑技术低直链淀粉含量水稻的育种方法,包括以下步骤:
1)设计Waxy基因定点编辑的靶位点:基因编辑的靶位点核苷酸序列如SEQ IDNO.5所示;
2)构建含有目的片段的CRISPR/Cas9基因编辑载体:
A)靶点接头制备:采用接头引物用双蒸水溶解成母液,母液稀释后90℃30s(second,秒),移至室温冷却完成退火,即获得靶点接头;
B)将退火获得的靶点片段连接到CRISPR/Cas9表达载体上获得连接产物;
C)将步骤B)的连接产物进行热激法转化大肠杆菌获得重组菌,提取通过验证的含目的条带的菌液的阳性质粒;
3)将阳性质粒转化农杆菌EHA105,获得T0代转基因植株,以引物Waxy TXT-F和Waxy TXT-R对T0代转基因植株进行扩增并测序鉴定获得具备权利要求1或2所述突变型蛋白的植株。
进一步地,所述育种方法还包括将具有突变型蛋白的T0代转基因植株的含有目标等位基因纯合突变的T1代植株的T-DNA载体的剔除,所述T-DNA载体包括筛选标记HPT基因和Cas9元件。
其中,所述T-DNA载体的剔除通过对含有目标等位基因纯合突变的T1代植株的HPT基因和Cas9元件的同时检测,重复多次,筛选得到不携带HPT基因和Cas9元件的T1代单株即为目标植株。
其中,所述HPT基因基因检测方法通过以目标等位基因纯合突变的T1代植株的基因组DNA为模板,以hyg283-F和hyg283-R为引物进行PCR扩增,同时,所述Cas9元件的检测方法通过以目标等位基因纯合突变的T1代植株的基因组DNA为模板,以Cas9 TXT-F和Cas9TXT-F为引物进行PCR扩增,当未同时检测到HPT基因和Cas9元件,表明这成功剔除了T-DNA载体。
有益效果:相对于现有技术,本发明具备以下优点:
1)本发明利用CRISPR/Cas9基因定点编辑技术,对Waxy基因进行编辑,通过后代的筛选,在T1代就可以获得剔除Cas9元件的稳定遗传的材料,而新材料的基本农艺性状无明显改变。相对于化学诱变、杂交转育等育种,基因编辑定向改良分子育种技术具有快速、精准、高效等优点,利用基因功能标记进行基因型选择,将会极大提高育种效率,大大加快育种进程。
2)本发明的基因编辑技术育种,通过对水稻GBSSI蛋白第178位氨基酸的改变,得到的水稻品种的稻米经测定淀粉含量为5%左右。
附图说明
图1显示基因编辑载体Anc689BEmax-WaxysgRNA-Cas9的整个图谱。
图2显示秀水134和基因编辑株系Waxy基因编辑位点测序图。
图3显示秀水134和基因编辑株系GBSSI氨基酸比较图。
图4显示稻米外观示意图。
图5显示秀水134和waxy-m突变体植株的表型观察;其中,图5A为秀水134和waxy-m突变体植株整株表型;图5B为秀水134和waxy-m突变体植株株高统计;图5C为秀水134和waxy-m突变体植株分蘖数统计结果;图5D为秀水134和waxy-m突变体植株穗形;图5E为秀水134和waxy-m突变体植株粒宽;图5F为秀水134和waxy-m突变体植株粒长。
具体实施方式
以下参照具体的实施例来说明本发明。本领域技术人员能够理解,这些实施例仅用于说明本发明,其不以任何方式限制本发明的范围。下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的材料、试剂等,如无特殊说明,均为市售购买产品。
本发明选取的背景材料为秀水134(是常规水稻品种,存在于本实验室),该品种是由嘉兴市农业科学研究院、中国科学院遗传与发育生物学研究所、浙江嘉兴农作物高新技术育种中心和余姚市种子管理站所选育的迟熟中粳新品种,全生育期152.2天左右,适宜上海地区种植,具有优良的综合农艺性状,已在生产上大面积推广应用,深受市场欢迎。秀水134株型较紧凑,茎秆粗壮,生育期适中,穗型较大,抗稻瘟病,直链淀粉含量为14%-16%左右,稻米外观透明。本发明通过CRISPR/Cas9基因编辑技术对秀水134的Waxy基因进行定点编辑,获得直链淀粉含量5%左右的突变体,为培育不同水稻糯性品种提供材料。
实施例1:5%左右直链淀粉含量的水稻突变体waxy-m获取过程
1、CRISPR/Cas9修饰靶点的选择
针对Waxy全基因组序列进行扫描,Waxy基因共有13个外显子,根据已有研究得知Waxy单碱基突变位点主要集中在4-6外显子,基于此,我们将突变位点选择限定在第4个外显子上,借助CRISPR-GE(http://skl.scau.edu.cn)以及CRISPR-P(http://crispr.hzau.edu.cn/CRISPR)网站筛选到了第4外显子上的靶向序列:AAGACCGGTGAGAAGATCTA(SEQ ID NO.5),位于第4外显子中的第10-29位碱基。
2、CRISPR/Cas9载体构建
针对该靶向序列合成以下寡核苷酸:sgRNA-Waxy-F:5’-TGTGTGAGACCGGTGAGAAGATCTA-3’(SEQ ID NO.6);sgRNA-Waxy-R:5’-AAACTAGATC TTCTCACCGGTCTCA-3’(SEQ ID NO.7);将合成的引物sgRNA-Waxy-F和sgRNA-Waxy-R用双蒸水溶解,浓度为10μmol母液。将溶解的引物按如下比例混合:8μL双蒸水+1μL sgRNA-F+1μL sgRNA-F。并进行退火反应,获得带接头的双链DNA片段。用BsaI酶切Anc689BEmax-sgRNA-Cas9载体(参见MuguiWang等人,Optimizing base editors for improved efficiency and expanded editingscope in rice,Plant Biotechnology Journal(2019)17:1697-1699),琼脂糖凝胶电泳并回收酶切的线性载体片段。将酶切回收的载体与退火形成的带粘性末端的双链DNA进行连接反应,连接产物转化大肠杆菌,鉴定阳性克隆并测序验证,获得构建正确的Anc689BEmax-WaxysgRNA-Cas9载体(图1)。
3、CRISPR/Cas9 T0突变体的获得
将该正确的表达载体转化农杆菌EHA105中,以水稻秀水134为受体进行农杆菌转化,经过筛选、分化、再生过程获得T0代转基因植株。将T0代植株分单株种植,单株取叶片提取基因组DNA。针对第4外显子靶位点附近设计引物并进行PCR检测,检测引物为:CRISPRTXT-F:5’-ACCAGTACAAGGACG CTTGG-3’(SEQ ID NO.8)和CRISPR TXT-R:5’-TGCCTGCAGGTAGCATCAAT-3’(SEQ ID NO.9),将PCR产物进行测序,确定T0代植株发生Waxy纯合突变,突变类型为第849和第850位C突变位T(图2)。
实施例2:直链淀粉含量5%左右的水稻突变体Waxy-m基因克隆
对上述实施例1的纯合突变植株的T1代单株分别取叶片,提取基因组DNA,以Waxy基因全长的特异引物Waxy-F和Waxy-R进行PCR扩增。扩增产物进行测序。测序结果与秀水134野生型Waxy基因(核酸序列如SEQ ID NO.3所示,氨基酸序列如SEQ ID NO.4所示)比较,发现该基因的第849和第850位C突变为T的纯合突变,与T0代植株相同,进一步分析,Waxy基因该位点的突变导致其编码的GBSSI蛋白序列与秀水134GBSSI蛋白的第178位氨基酸由苏氨酸突变为异亮氨酸,如图3所示。综合该类纯合突变稻米直链淀粉含量均为5%左右,推测该突变为产生5%左右直链淀粉含量的品质的关键突变位点。
直链淀粉含量5%左右水稻突变体的Waxy基因的核苷酸序列如SEQID NO.1所示,其编码的GBSSI蛋白的氨基酸序列如SEQ ID NO.2所示,将克隆得到的新基因命名为Waxy-m。
本发明鉴定到的Waxy-m基因第849和第850位由野生型C到T的变异,以及因此引起的第178位氨基酸由苏氨酸突变为异亮氨酸的变异均为首次报道。
实施例3去除T-DNA载体的稳定突变体植株的获得
本发明构建的定向编辑Waxy基因的载体,含有T-DNA载体,本发明涉及的T-DNA主要包含了筛选标记HPT基因和Cas9元件,由于潮霉素磷酸转移酶HPT基因和Cas9元件的主要作用是用于转化阳性植株的筛选和完成对目标基因的定点突变,并且这两个基因相对于水稻基因组来说是外源基因,且HPT为抗生素筛选标记,需要剔除,Cas9元件如果保留在植株中有可能继续编辑突变位点,产生其他的Waxy等位基因,而且,T-DNA随机插入还可能造成非预期的基因突变,所以在其完成基因编辑任务后,需要清除。通过农杆菌介导转化秀水134,在转基因过程中,T-DNA序列会随机插入水稻的染色体中,可能以单拷贝或多拷贝插入。由于T-DNA插入位点与其靶位点一般不连锁,因此有望通过转基因植株的后代分离获得不携带T-DNA的植株。为了获得不含上述Cas9元件的植株,本发明人通过对目标基因纯合突变的T1代植株的HYP基因和Cas9元件进行检测,重复3次,筛选不携带HYP基因和Cas9元件的植株,即为剔除HYP基因和Cas9元件的T1代单株。
取上述实施例2的18个单株的基因组DNA,以引物hyg283-F:
TCCGGAAGTGCTTGACATT(SEQ ID NO.10)和hyg283-R:
GTCGTCCATCACAGTTTGC(SEQ ID NO.11)对HPT基因进行PCR扩增;以Cas9元件PCR检测引物,引物序列为:CAS9 TXT-F:5’-GTAAAACGACGGCCAGT-3’(SEQ ID NO.12)和CAS9 TXT-R:5’-TCTAGAGAGGGGCACGACC-3(SEQ ID NO.13),对Cas9元件进行PCR扩增,筛选出突变类型为纯合突变且HYP基因和Cas9元件检测为阴性的植株。对T1代检测没有HYP基因和Cas9元件的纯合突变体植株进行收种,进行T2代种植,随机筛选部分植株进行基因型检测验证。证明T2代植株中存在的突变类型的纯合突变体,且已经检测不到HYP基因和Cas9元件。
实施例4直链淀粉含量测定
对获得的没有HYP基因和Cas9元件的T2代纯合突变植株,单株收种,随机挑选3株,检测其稻米外观如图4,测定其种子中直链淀粉含量。
应用上海三黍生物科技有限公司提供的直链淀粉含量测定方法,结果见下表:
编号 | 生物学重复 | 直链淀粉含量(%) |
秀水134(对照) | 1 | 14.53 |
秀水134(对照) | 2 | 14.95 |
秀水134(对照) | 3 | 15.09 |
含有Waxy-m的植株 | 1 | 5.10 |
含有Waxy-m的植株 | 2 | 4.96 |
含有Waxy-m的植株 | 3 | 5.05 |
发现Waxy-m突变类型植株的直链淀粉含量都在5%左右,属于软米特征,由此证明本发明提供的Waxy基因人工定点突变体能够将水稻直链淀粉含量改变为5%左右。
实施例5:突变体农艺性状调查
将秀水134与实施例3中鉴定到的剔除T-DNA的T2代纯合突变体种子种植于上海市试验基地,秀水134与waxy-m突变体种植小区,每小区100苗,三次重复。对农艺性状进行观察分析,结果表明,waxy-m突变体与秀水134的株型、叶片颜色、株高、分蘖数、穗型、粒长、粒宽等农艺性状进行比较,均无明显差异。因此,编辑获得的新材料保留了秀水134材料的丰产、优质等重要农艺性状。
通过上述具体实施例表明,利用CRISPR/Cas9基因编辑技术,对Waxy基因进行编辑,通过后代的筛选,在T2代就可以获得剔除Cas9元件,特性稳定遗传的新材料,而新材料的基本农艺性状无明显改变。相对于化学诱变、杂交转育等育种方法,基因编辑定向改良分子育种技术具有快速、精准、高效等优点,将会极大提高育种效率,大大加快育种进。
以上对本发明具体实施方式的描述并不限制本发明,本领域技术人员可以根据本发明做出各种改变或者变形,只要不脱离本发明的精神,均应属于本发明所附权利要求的范围。
序列表
<110> 上海师范大学
<120> 基于基因编辑技术的GBSSI突变型蛋白及其在植物育种中的应用
<160> 13
<170> SIPOSequenceListing 1.0
<210> 1
<211> 3480
<212> DNA
<213> 未知(Unknown)
<400> 1
atgtcggctc tcaccacgtc ccagctcgcc acctcggcca ccggcttcgg catcgccgac 60
aggtcggcgc cgtcgtcgct gctccgccac gggttccagg gcctcaagcc ccgcagcccc 120
gccggcggcg acgcgacgtc gctcagcgtg acgaccagcg cgcgcgcgac gcccaagcag 180
cagcggtcgg tgcagcgtgg cagccggagg ttcccctccg tcgtcgtgta cgccaccggc 240
gccggcatga acgtcgtgtt cgtcggcgcc gagatggccc cctggagcaa gaccggcggc 300
ctcggtgacg tcctcggtgg cctcccccct gccatggctg taagcacaca caaacttcga 360
tcgctcgtcg tcgctgaccg tcgtcgtctt caactgttct tgatcatcgc attggatgga 420
tgtgtaatgt tgtgttcttg tgttctttgc aggcgaatgg ccacagggtc atggtgatct 480
ctcctcggta cgaccagtac aaggacgctt gggataccag cgttgtggct gaggtaggag 540
catatgcgtg atcagatcat cacaagatcg attagcttta gatgatttgt tacatttcgc 600
aagattttaa cccaagtttt tgtggtgcaa ttcattgcag atcaaggttg cagacaggta 660
cgagagggtg aggtttttcc attgctacaa gcgtggagtc gaccgtgtgt tcatcgacca 720
tccgtcattc ctggagaagg tggagtcatc attagtttac cttttttgtt tttactgaat 780
tattaacagt gcatttagca gttggactga gcttagcttc cactggtgat ttcaggtttg 840
gggaaagatt ggtgagaaga tctacggacc tgacactgga gttgattaca aagacaacca 900
gatgcgtttc agccttcttt gccaggtcag tgattacttc tatctgatga tggttggaag 960
catcacgagt ttaccatagt atgtatggat tcataactaa ttcgtgtatt gatgctacct 1020
gcaggcagca ctcgaggctc ctaggatcct aaacctcaac aacaacccat acttcaaagg 1080
aacttatggt gagttacaat tgatctcaag atcttataac tttcttcgaa ggaatccatg 1140
atgatcagac taattccttc cggtttgtta ctgacaacag gtgaggatgt tgtgttcgtc 1200
tgcaacgact ggcacactgg cccactggcg agctacctga agaacaacta ccagcccaat 1260
ggcatctaca ggaatgcaaa ggtctatgct tgttcttgcc ataccaactc aaatctgcat 1320
gcacactgca ttctgttcag aaactgactg tctgaatctt tttcactgca ggttgctttc 1380
tgcatccaca acatctccta ccagggccgt ttcgctttcg aggattaccc tgagctgaac 1440
ctctccgaga ggttcaggtc atccttcgat ttcatcgacg ggtatgagta agattctaag 1500
agtaacttac tgtcaattcg ccatatatcg attcaatcca agatcctttt gagctgacaa 1560
ccctgcacta ctgtccatcg ttcaaatccg gttaaatttc aggtatgaca cgccggtgga 1620
gggcaggaag atcaactgga tgaaggccgg aatcctggaa gccgacaggg tgctcaccgt 1680
gagcccgtac tacgccgagg agctcatctc cggcatcgcc aggggatgcg agctcgacaa 1740
catcatgcgg ctcaccggca tcaccggcat cgtcaacggc atggacgtca gcgagtggga 1800
tcctagcaag gacaagtaca tcaccgccaa gtacgacgca accacggtaa gaacgaatgc 1860
attcttcaca agatatgcaa tctgaatttt ctttgaaaaa gaaattatca tctgtcactt 1920
cttgattgat tctgacaagg caagaatgag tgacaaattt caggcaatcg aggcgaaggc 1980
gctgaacaag gaggcgttgc aggcggaggc gggtcttccg gtcgacagga aaatcccact 2040
gatcgcgttc atcggcaggc tggaggaaca gaagggccct gacgtcatgg ccgccgccat 2100
cccggagctc atgcaggagg acgtccagat cgttcttctg gtataatata atacactaca 2160
agacacactt gcacgatatg ccaaaaattc agaacaaatt cagtggcaaa aaaaaaactc 2220
gaatattagg gaaggaccta ataatatcaa ataattagaa ggggtgaggc tttgaaccca 2280
gatcgtctag tccaccacct tgtggagtta gccggaagac ctctgagcat ttctcaattc 2340
agtggcaaat gatgtgtata attttgatcc gtgtgtgttt cagggtactg gaaagaagaa 2400
gttcgagaag ctgctcaaga gcatggagga gaagtatccg ggcaaggtga gggccgtggt 2460
gaagttcaac gcgccgcttg ctcatctcat catggccgga gccgacgtgc tcgccgtccc 2520
cagccgcttc gagccctgtg gactcatcca gctgcagggg atgagatacg gaacggtata 2580
caatttccat ctatcaattc gattgttcga tttcatcttt gtgcaatgca atgcaattgc 2640
aaatgcaaat gcatgatgat tttccttgtt gatttctcca gccctgtgct tgcgcgtcca 2700
ccggtgggct cgtggacacg gtcatcgaag gcaagactgg tttccacatg ggccgtctca 2760
gcgtcgacgt aagcctatac atttacataa caatcagata tgacacatcc taataccgat 2820
aagtcggtac actactacac atttacatgg ttgctggtta tatggttttt ttggcagtgc 2880
aaggtggtgg agccaagcga cgtgaagaag gtggcggcca ccctgaagcg cgccatcaag 2940
gtcgtcggca cgccggcgta cgaggagatg gtcaggaact gcatgaacca ggacctctcc 3000
tggaaggtat aaattacgaa acaaatttaa cccaaacata tactatatac tccctccgct 3060
tctaaatatt caacgccgtt gtctttttta aatatgtttg accattcgtc ttattaaaaa 3120
aattaaataa ttataaattc ttttcctatc atttgattca ttgttaaata tacttatatg 3180
tatacatata gttttacata tttcataaaa ttttttgaac aagacgaacg gtcaaacatg 3240
tgctaaaaag ttaacggtgt cgaatattca gaaacggagg gagtataaac gtcttgttca 3300
gaagttcaga gattcacctg tctgatgctg atgatgatta attgtttgca acatggattt 3360
caggggcctg cgaagaactg ggagaatgtg ctcctgggcc tgggcgtcgc cggcagcgcg 3420
ccggggatcg aaggcgacga gatcgcgccg ctcgccaagg agaacgtggc tgctccttga 3480
<210> 2
<211> 609
<212> PRT
<213> 未知(Unknown)
<400> 2
Met Ser Ala Leu Thr Thr Ser Gln Leu Ala Thr Ser Ala Thr Gly Phe
1 5 10 15
Gly Ile Ala Asp Arg Ser Ala Pro Ser Ser Leu Leu Arg His Gly Phe
20 25 30
Gln Gly Leu Lys Pro Arg Ser Pro Ala Gly Gly Asp Ala Thr Ser Leu
35 40 45
Ser Val Thr Thr Ser Ala Arg Ala Thr Pro Lys Gln Gln Arg Ser Val
50 55 60
Gln Arg Gly Ser Arg Arg Phe Pro Ser Val Val Val Tyr Ala Thr Gly
65 70 75 80
Ala Gly Met Asn Val Val Phe Val Gly Ala Glu Met Ala Pro Trp Ser
85 90 95
Lys Thr Gly Gly Leu Gly Asp Val Leu Gly Gly Leu Pro Pro Ala Met
100 105 110
Ala Ala Asn Gly His Arg Val Met Val Ile Ser Pro Arg Tyr Asp Gln
115 120 125
Tyr Lys Asp Ala Trp Asp Thr Ser Val Val Ala Glu Ile Lys Val Ala
130 135 140
Asp Arg Tyr Glu Arg Val Arg Phe Phe His Cys Tyr Lys Arg Gly Val
145 150 155 160
Asp Arg Val Phe Ile Asp His Pro Ser Phe Leu Glu Lys Val Trp Gly
165 170 175
Lys Ile Gly Glu Lys Ile Tyr Gly Pro Asp Thr Gly Val Asp Tyr Lys
180 185 190
Asp Asn Gln Met Arg Phe Ser Leu Leu Cys Gln Ala Ala Leu Glu Ala
195 200 205
Pro Arg Ile Leu Asn Leu Asn Asn Asn Pro Tyr Phe Lys Gly Thr Tyr
210 215 220
Gly Glu Asp Val Val Phe Val Cys Asn Asp Trp His Thr Gly Pro Leu
225 230 235 240
Ala Ser Tyr Leu Lys Asn Asn Tyr Gln Pro Asn Gly Ile Tyr Arg Asn
245 250 255
Ala Lys Val Ala Phe Cys Ile His Asn Ile Ser Tyr Gln Gly Arg Phe
260 265 270
Ala Phe Glu Asp Tyr Pro Glu Leu Asn Leu Ser Glu Arg Phe Arg Ser
275 280 285
Ser Phe Asp Phe Ile Asp Gly Tyr Asp Thr Pro Val Glu Gly Arg Lys
290 295 300
Ile Asn Trp Met Lys Ala Gly Ile Leu Glu Ala Asp Arg Val Leu Thr
305 310 315 320
Val Ser Pro Tyr Tyr Ala Glu Glu Leu Ile Ser Gly Ile Ala Arg Gly
325 330 335
Cys Glu Leu Asp Asn Ile Met Arg Leu Thr Gly Ile Thr Gly Ile Val
340 345 350
Asn Gly Met Asp Val Ser Glu Trp Asp Pro Ser Lys Asp Lys Tyr Ile
355 360 365
Thr Ala Lys Tyr Asp Ala Thr Thr Ala Ile Glu Ala Lys Ala Leu Asn
370 375 380
Lys Glu Ala Leu Gln Ala Glu Ala Gly Leu Pro Val Asp Arg Lys Ile
385 390 395 400
Pro Leu Ile Ala Phe Ile Gly Arg Leu Glu Glu Gln Lys Gly Pro Asp
405 410 415
Val Met Ala Ala Ala Ile Pro Glu Leu Met Gln Glu Asp Val Gln Ile
420 425 430
Val Leu Leu Gly Thr Gly Lys Lys Lys Phe Glu Lys Leu Leu Lys Ser
435 440 445
Met Glu Glu Lys Tyr Pro Gly Lys Val Arg Ala Val Val Lys Phe Asn
450 455 460
Ala Pro Leu Ala His Leu Ile Met Ala Gly Ala Asp Val Leu Ala Val
465 470 475 480
Pro Ser Arg Phe Glu Pro Cys Gly Leu Ile Gln Leu Gln Gly Met Arg
485 490 495
Tyr Gly Thr Pro Cys Ala Cys Ala Ser Thr Gly Gly Leu Val Asp Thr
500 505 510
Val Ile Glu Gly Lys Thr Gly Phe His Met Gly Arg Leu Ser Val Asp
515 520 525
Cys Lys Val Val Glu Pro Ser Asp Val Lys Lys Val Ala Ala Thr Leu
530 535 540
Lys Arg Ala Ile Lys Val Val Gly Thr Pro Ala Tyr Glu Glu Met Val
545 550 555 560
Arg Asn Cys Met Asn Gln Asp Leu Ser Trp Lys Gly Pro Ala Lys Asn
565 570 575
Trp Glu Asn Val Leu Leu Gly Leu Gly Val Ala Gly Ser Ala Pro Gly
580 585 590
Ile Glu Gly Asp Glu Ile Ala Pro Leu Ala Lys Glu Asn Val Ala Ala
595 600 605
Pro
<210> 3
<211> 3480
<212> DNA
<213> 未知(Unknown)
<400> 3
atgtcggctc tcaccacgtc ccagctcgcc acctcggcca ccggcttcgg catcgccgac 60
aggtcggcgc cgtcgtcgct gctccgccac gggttccagg gcctcaagcc ccgcagcccc 120
gccggcggcg acgcgacgtc gctcagcgtg acgaccagcg cgcgcgcgac gcccaagcag 180
cagcggtcgg tgcagcgtgg cagccggagg ttcccctccg tcgtcgtgta cgccaccggc 240
gccggcatga acgtcgtgtt cgtcggcgcc gagatggccc cctggagcaa gaccggcggc 300
ctcggtgacg tcctcggtgg cctcccccct gccatggctg taagcacaca caaacttcga 360
tcgctcgtcg tcgctgaccg tcgtcgtctt caactgttct tgatcatcgc attggatgga 420
tgtgtaatgt tgtgttcttg tgttctttgc aggcgaatgg ccacagggtc atggtgatct 480
ctcctcggta cgaccagtac aaggacgctt gggataccag cgttgtggct gaggtaggag 540
catatgcgtg atcagatcat cacaagatcg attagcttta gatgatttgt tacatttcgc 600
aagattttaa cccaagtttt tgtggtgcaa ttcattgcag atcaaggttg cagacaggta 660
cgagagggtg aggtttttcc attgctacaa gcgtggagtc gaccgtgtgt tcatcgacca 720
tccgtcattc ctggagaagg tggagtcatc attagtttac cttttttgtt tttactgaat 780
tattaacagt gcatttagca gttggactga gcttagcttc cactggtgat ttcaggtttg 840
gggaaagacc ggtgagaaga tctacggacc tgacactgga gttgattaca aagacaacca 900
gatgcgtttc agccttcttt gccaggtcag tgattacttc tatctgatga tggttggaag 960
catcacgagt ttaccatagt atgtatggat tcataactaa ttcgtgtatt gatgctacct 1020
gcaggcagca ctcgaggctc ctaggatcct aaacctcaac aacaacccat acttcaaagg 1080
aacttatggt gagttacaat tgatctcaag atcttataac tttcttcgaa ggaatccatg 1140
atgatcagac taattccttc cggtttgtta ctgacaacag gtgaggatgt tgtgttcgtc 1200
tgcaacgact ggcacactgg cccactggcg agctacctga agaacaacta ccagcccaat 1260
ggcatctaca ggaatgcaaa ggtctatgct tgttcttgcc ataccaactc aaatctgcat 1320
gcacactgca ttctgttcag aaactgactg tctgaatctt tttcactgca ggttgctttc 1380
tgcatccaca acatctccta ccagggccgt ttcgctttcg aggattaccc tgagctgaac 1440
ctctccgaga ggttcaggtc atccttcgat ttcatcgacg ggtatgagta agattctaag 1500
agtaacttac tgtcaattcg ccatatatcg attcaatcca agatcctttt gagctgacaa 1560
ccctgcacta ctgtccatcg ttcaaatccg gttaaatttc aggtatgaca cgccggtgga 1620
gggcaggaag atcaactgga tgaaggccgg aatcctggaa gccgacaggg tgctcaccgt 1680
gagcccgtac tacgccgagg agctcatctc cggcatcgcc aggggatgcg agctcgacaa 1740
catcatgcgg ctcaccggca tcaccggcat cgtcaacggc atggacgtca gcgagtggga 1800
tcctagcaag gacaagtaca tcaccgccaa gtacgacgca accacggtaa gaacgaatgc 1860
attcttcaca agatatgcaa tctgaatttt ctttgaaaaa gaaattatca tctgtcactt 1920
cttgattgat tctgacaagg caagaatgag tgacaaattt caggcaatcg aggcgaaggc 1980
gctgaacaag gaggcgttgc aggcggaggc gggtcttccg gtcgacagga aaatcccact 2040
gatcgcgttc atcggcaggc tggaggaaca gaagggccct gacgtcatgg ccgccgccat 2100
cccggagctc atgcaggagg acgtccagat cgttcttctg gtataatata atacactaca 2160
agacacactt gcacgatatg ccaaaaattc agaacaaatt cagtggcaaa aaaaaaactc 2220
gaatattagg gaaggaccta ataatatcaa ataattagaa ggggtgaggc tttgaaccca 2280
gatcgtctag tccaccacct tgtggagtta gccggaagac ctctgagcat ttctcaattc 2340
agtggcaaat gatgtgtata attttgatcc gtgtgtgttt cagggtactg gaaagaagaa 2400
gttcgagaag ctgctcaaga gcatggagga gaagtatccg ggcaaggtga gggccgtggt 2460
gaagttcaac gcgccgcttg ctcatctcat catggccgga gccgacgtgc tcgccgtccc 2520
cagccgcttc gagccctgtg gactcatcca gctgcagggg atgagatacg gaacggtata 2580
caatttccat ctatcaattc gattgttcga tttcatcttt gtgcaatgca atgcaattgc 2640
aaatgcaaat gcatgatgat tttccttgtt gatttctcca gccctgtgct tgcgcgtcca 2700
ccggtgggct cgtggacacg gtcatcgaag gcaagactgg tttccacatg ggccgtctca 2760
gcgtcgacgt aagcctatac atttacataa caatcagata tgacacatcc taataccgat 2820
aagtcggtac actactacac atttacatgg ttgctggtta tatggttttt ttggcagtgc 2880
aaggtggtgg agccaagcga cgtgaagaag gtggcggcca ccctgaagcg cgccatcaag 2940
gtcgtcggca cgccggcgta cgaggagatg gtcaggaact gcatgaacca ggacctctcc 3000
tggaaggtat aaattacgaa acaaatttaa cccaaacata tactatatac tccctccgct 3060
tctaaatatt caacgccgtt gtctttttta aatatgtttg accattcgtc ttattaaaaa 3120
aattaaataa ttataaattc ttttcctatc atttgattca ttgttaaata tacttatatg 3180
tatacatata gttttacata tttcataaaa ttttttgaac aagacgaacg gtcaaacatg 3240
tgctaaaaag ttaacggtgt cgaatattca gaaacggagg gagtataaac gtcttgttca 3300
gaagttcaga gattcacctg tctgatgctg atgatgatta attgtttgca acatggattt 3360
caggggcctg cgaagaactg ggagaatgtg ctcctgggcc tgggcgtcgc cggcagcgcg 3420
ccggggatcg aaggcgacga gatcgcgccg ctcgccaagg agaacgtggc tgctccttga 3480
<210> 4
<211> 609
<212> PRT
<213> 未知(Unknown)
<400> 4
Met Ser Ala Leu Thr Thr Ser Gln Leu Ala Thr Ser Ala Thr Gly Phe
1 5 10 15
Gly Ile Ala Asp Arg Ser Ala Pro Ser Ser Leu Leu Arg His Gly Phe
20 25 30
Gln Gly Leu Lys Pro Arg Ser Pro Ala Gly Gly Asp Ala Thr Ser Leu
35 40 45
Ser Val Thr Thr Ser Ala Arg Ala Thr Pro Lys Gln Gln Arg Ser Val
50 55 60
Gln Arg Gly Ser Arg Arg Phe Pro Ser Val Val Val Tyr Ala Thr Gly
65 70 75 80
Ala Gly Met Asn Val Val Phe Val Gly Ala Glu Met Ala Pro Trp Ser
85 90 95
Lys Thr Gly Gly Leu Gly Asp Val Leu Gly Gly Leu Pro Pro Ala Met
100 105 110
Ala Ala Asn Gly His Arg Val Met Val Ile Ser Pro Arg Tyr Asp Gln
115 120 125
Tyr Lys Asp Ala Trp Asp Thr Ser Val Val Ala Glu Ile Lys Val Ala
130 135 140
Asp Arg Tyr Glu Arg Val Arg Phe Phe His Cys Tyr Lys Arg Gly Val
145 150 155 160
Asp Arg Val Phe Ile Asp His Pro Ser Phe Leu Glu Lys Val Trp Gly
165 170 175
Lys Thr Gly Glu Lys Ile Tyr Gly Pro Asp Thr Gly Val Asp Tyr Lys
180 185 190
Asp Asn Gln Met Arg Phe Ser Leu Leu Cys Gln Ala Ala Leu Glu Ala
195 200 205
Pro Arg Ile Leu Asn Leu Asn Asn Asn Pro Tyr Phe Lys Gly Thr Tyr
210 215 220
Gly Glu Asp Val Val Phe Val Cys Asn Asp Trp His Thr Gly Pro Leu
225 230 235 240
Ala Ser Tyr Leu Lys Asn Asn Tyr Gln Pro Asn Gly Ile Tyr Arg Asn
245 250 255
Ala Lys Val Ala Phe Cys Ile His Asn Ile Ser Tyr Gln Gly Arg Phe
260 265 270
Ala Phe Glu Asp Tyr Pro Glu Leu Asn Leu Ser Glu Arg Phe Arg Ser
275 280 285
Ser Phe Asp Phe Ile Asp Gly Tyr Asp Thr Pro Val Glu Gly Arg Lys
290 295 300
Ile Asn Trp Met Lys Ala Gly Ile Leu Glu Ala Asp Arg Val Leu Thr
305 310 315 320
Val Ser Pro Tyr Tyr Ala Glu Glu Leu Ile Ser Gly Ile Ala Arg Gly
325 330 335
Cys Glu Leu Asp Asn Ile Met Arg Leu Thr Gly Ile Thr Gly Ile Val
340 345 350
Asn Gly Met Asp Val Ser Glu Trp Asp Pro Ser Lys Asp Lys Tyr Ile
355 360 365
Thr Ala Lys Tyr Asp Ala Thr Thr Ala Ile Glu Ala Lys Ala Leu Asn
370 375 380
Lys Glu Ala Leu Gln Ala Glu Ala Gly Leu Pro Val Asp Arg Lys Ile
385 390 395 400
Pro Leu Ile Ala Phe Ile Gly Arg Leu Glu Glu Gln Lys Gly Pro Asp
405 410 415
Val Met Ala Ala Ala Ile Pro Glu Leu Met Gln Glu Asp Val Gln Ile
420 425 430
Val Leu Leu Gly Thr Gly Lys Lys Lys Phe Glu Lys Leu Leu Lys Ser
435 440 445
Met Glu Glu Lys Tyr Pro Gly Lys Val Arg Ala Val Val Lys Phe Asn
450 455 460
Ala Pro Leu Ala His Leu Ile Met Ala Gly Ala Asp Val Leu Ala Val
465 470 475 480
Pro Ser Arg Phe Glu Pro Cys Gly Leu Ile Gln Leu Gln Gly Met Arg
485 490 495
Tyr Gly Thr Pro Cys Ala Cys Ala Ser Thr Gly Gly Leu Val Asp Thr
500 505 510
Val Ile Glu Gly Lys Thr Gly Phe His Met Gly Arg Leu Ser Val Asp
515 520 525
Cys Lys Val Val Glu Pro Ser Asp Val Lys Lys Val Ala Ala Thr Leu
530 535 540
Lys Arg Ala Ile Lys Val Val Gly Thr Pro Ala Tyr Glu Glu Met Val
545 550 555 560
Arg Asn Cys Met Asn Gln Asp Leu Ser Trp Lys Gly Pro Ala Lys Asn
565 570 575
Trp Glu Asn Val Leu Leu Gly Leu Gly Val Ala Gly Ser Ala Pro Gly
580 585 590
Ile Glu Gly Asp Glu Ile Ala Pro Leu Ala Lys Glu Asn Val Ala Ala
595 600 605
Pro
<210> 5
<211> 20
<212> DNA
<213> 未知(Unknown)
<400> 5
aagaccggtg agaagatcta 20
<210> 6
<211> 25
<212> DNA
<213> 未知(Unknown)
<400> 6
tgtgtgagac cggtgagaag atcta 25
<210> 7
<211> 25
<212> DNA
<213> 未知(Unknown)
<400> 7
aaactagatc ttctcaccgg tctca 25
<210> 8
<211> 20
<212> DNA
<213> 未知(Unknown)
<400> 8
accagtacaa ggacgcttgg 20
<210> 9
<211> 20
<212> DNA
<213> 未知(Unknown)
<400> 9
tgcctgcagg tagcatcaat 20
<210> 10
<211> 19
<212> DNA
<213> 未知(Unknown)
<400> 10
tccggaagtg cttgacatt 19
<210> 11
<211> 19
<212> DNA
<213> 未知(Unknown)
<400> 11
gtcgtccatc acagtttgc 19
<210> 12
<211> 17
<212> DNA
<213> 未知(Unknown)
<400> 12
gtaaaacgac ggccagt 17
<210> 13
<211> 19
<212> DNA
<213> 未知(Unknown)
<400> 13
tctagagagg ggcacgacc 19
Claims (10)
1.一种水稻GBSSI突变型蛋白,其特征在于,所述GBSSI突变型蛋白的氨基酸序列存在以下突变:其对应于水稻GBSSI氨基酸序列的第178位氨基酸发生突变。
2.根据权利要求1所述的水稻GBSSI突变型蛋白,其特征在于,包括:
(a)其氨基酸序列如SEQ ID NO.2所示;或
(b)在(a)中的氨基酸序列经过取代和/或缺失和/或添加一个或几个氨基酸且具有直链淀粉合成酶活性的由(a)衍生的蛋白质。
3.一种编码权利要求1或2所述突变型蛋白的基因。
4.根据权利要求3所述的基因,其特征在于:
a)其核苷酸序列如SEQ ID NO.1所示;或
b)在严格条件下与a)限定的核酸序列杂交且编码具有直链淀粉合成酶活性蛋白质的核酸序列。
5.表达盒、重组载体或细胞,其特征在于,所述表达盒、重组载体或细胞含有权利要求3或4所述基因。
6.权利要求1或2所述水稻GBSSI突变型蛋白在水稻育种中的应用。
7.获得低直链淀粉含量的水稻的方法,其特征在于,包括如下步骤:
1)使水稻植株包含权利要求3或4所述的基因;或
2)使水稻植株表达权利要求1或2所述的水稻GBSSI突变型蛋白。
8.一种利用基因编辑技术低直链淀粉含量水稻的育种方法,其特征在于,包括以下步骤:
1)设计Waxy基因定点编辑的靶位点:基因编辑的靶位点核苷酸序列如SEQ ID NO.5所示;
2)构建含有目的片段的CRISPR/Cas9基因编辑载体:
A)靶点接头制备:采用接头引物用双蒸水溶解成母液,母液稀释后90℃30s(second,秒),移至室温冷却完成退火,即获得靶点接头;所述引物为Waxy TXT-F和Waxy TXT-R,序列如SEQ ID NO.8和SEQ ID NO.9所示;
B)将退火获得的靶点片段连接到CRISPR/Cas9表达载体上获得连接产物;
C)将步骤B)的连接产物进行热激法转化大肠杆菌获得重组菌,提取通过验证的含目的条带的菌液的阳性质粒;
3)将阳性质粒转化农杆菌EHA105,获得T0代转基因植株,以引物Waxy TXT-F和WaxyTXT-R对T0代转基因植株进行扩增并测序鉴定获得具备权利要求1或2所述突变型蛋白的植株。
9.根据权利要求8所述的育种方法,其特征在于,所述育种方法还包括将具有突变型蛋白的T0代转基因植株的含有目标等位基因纯合突变的T1代植株的T-DNA载体的剔除,所述T-DNA载体包括筛选标记HPT基因和Cas9元件。
10.根据权利要求9所述的育种方法,其特征在于,所述T-DNA载体的剔除通过对含有目标等位基因纯合突变的T1代植株的HPT基因和Cas9元件同时检测,重复多次,筛选得到不携带这两个基因的T1代单株即为目标植株。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110239257.2A CN112760304A (zh) | 2021-03-04 | 2021-03-04 | 基于基因编辑技术的gbssi突变型蛋白及其在植物育种中的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110239257.2A CN112760304A (zh) | 2021-03-04 | 2021-03-04 | 基于基因编辑技术的gbssi突变型蛋白及其在植物育种中的应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112760304A true CN112760304A (zh) | 2021-05-07 |
Family
ID=75690800
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110239257.2A Pending CN112760304A (zh) | 2021-03-04 | 2021-03-04 | 基于基因编辑技术的gbssi突变型蛋白及其在植物育种中的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112760304A (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111197034A (zh) * | 2020-01-08 | 2020-05-26 | 江苏省农业科学院 | 基于基因编辑技术的Wx突变型蛋白及其基因在植物育种中的应用 |
CN113564197A (zh) * | 2021-07-08 | 2021-10-29 | 上海师范大学 | 一种CRISPR/Cas9介导的植物多基因编辑载体的构建方法和应用 |
CN114085853A (zh) * | 2021-11-25 | 2022-02-25 | 湖南省核农学与航天育种研究所 | Waxy突变体及其筛选方法和用途 |
CN115197952A (zh) * | 2022-05-17 | 2022-10-18 | 重庆市农业科学院 | 水稻蜡质基因Wx的突变基因及其应用 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030106099A1 (en) * | 1998-06-19 | 2003-06-05 | Northwest Plant Breeding Co. | Unusual wheat starch mutants and amylopectin starch therefrom |
KR20120004844A (ko) * | 2010-07-07 | 2012-01-13 | 대한민국(관리부서:농촌진흥청장) | 거대배 유전자가 포함된 흑찰거대배 벼 식물체‘밀양 263호’및 이의 육종방법 |
CN107119071A (zh) * | 2017-06-07 | 2017-09-01 | 江苏三黍生物科技有限公司 | 一种降低植物直链淀粉含量的方法及应用 |
CN109097346A (zh) * | 2018-09-06 | 2018-12-28 | 江苏省农业科学院 | 基于基因编辑技术的als突变型蛋白及其基因在植物育种中的应用 |
CN111197034A (zh) * | 2020-01-08 | 2020-05-26 | 江苏省农业科学院 | 基于基因编辑技术的Wx突变型蛋白及其基因在植物育种中的应用 |
CN111996177A (zh) * | 2020-08-17 | 2020-11-27 | 北京市农林科学院 | 一种玉米waxy基因突变体及其分子标记和应用 |
-
2021
- 2021-03-04 CN CN202110239257.2A patent/CN112760304A/zh active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030106099A1 (en) * | 1998-06-19 | 2003-06-05 | Northwest Plant Breeding Co. | Unusual wheat starch mutants and amylopectin starch therefrom |
KR20120004844A (ko) * | 2010-07-07 | 2012-01-13 | 대한민국(관리부서:농촌진흥청장) | 거대배 유전자가 포함된 흑찰거대배 벼 식물체‘밀양 263호’및 이의 육종방법 |
CN107119071A (zh) * | 2017-06-07 | 2017-09-01 | 江苏三黍生物科技有限公司 | 一种降低植物直链淀粉含量的方法及应用 |
CN109097346A (zh) * | 2018-09-06 | 2018-12-28 | 江苏省农业科学院 | 基于基因编辑技术的als突变型蛋白及其基因在植物育种中的应用 |
CN111197034A (zh) * | 2020-01-08 | 2020-05-26 | 江苏省农业科学院 | 基于基因编辑技术的Wx突变型蛋白及其基因在植物育种中的应用 |
CN111996177A (zh) * | 2020-08-17 | 2020-11-27 | 北京市农林科学院 | 一种玉米waxy基因突变体及其分子标记和应用 |
Non-Patent Citations (4)
Title |
---|
HIRANO,H.Y. ET AL.: "O.sativa waxy locus for ADP(UDP)-glucose starch glycosyl transferase,X58228.1", 《GENBANK》 * |
何风华等: "不同 W axy 基因型水稻的遗传多样性", 《分子植物育种》 * |
唐湘如: "《作物栽培学》", 广东高等教育出版社 * |
蔡海亚等: "水稻 Waxy 基因型鉴定及其对稻米品质性状的影响", 《湖北农业科学》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111197034A (zh) * | 2020-01-08 | 2020-05-26 | 江苏省农业科学院 | 基于基因编辑技术的Wx突变型蛋白及其基因在植物育种中的应用 |
CN113564197A (zh) * | 2021-07-08 | 2021-10-29 | 上海师范大学 | 一种CRISPR/Cas9介导的植物多基因编辑载体的构建方法和应用 |
CN113564197B (zh) * | 2021-07-08 | 2024-03-15 | 上海师范大学 | 一种CRISPR/Cas9介导的植物多基因编辑载体的构建方法和应用 |
CN114085853A (zh) * | 2021-11-25 | 2022-02-25 | 湖南省核农学与航天育种研究所 | Waxy突变体及其筛选方法和用途 |
CN114085853B (zh) * | 2021-11-25 | 2024-05-17 | 湖南省核农学与航天育种研究所 | Waxy突变体及其筛选方法和用途 |
CN115197952A (zh) * | 2022-05-17 | 2022-10-18 | 重庆市农业科学院 | 水稻蜡质基因Wx的突变基因及其应用 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107164347B (zh) | 控制水稻茎秆粗度、分蘖数、穗粒数、千粒重和产量的理想株型基因npt1及其应用 | |
CN112760304A (zh) | 基于基因编辑技术的gbssi突变型蛋白及其在植物育种中的应用 | |
CN108165554B (zh) | 控制玉米叶宽基因ZmNL4及其应用 | |
CN110903368B (zh) | 一种控制玉米雌性性状的基因以及用于创制玉米雌性不育系的试剂盒、突变基因型和方法 | |
CN110862993B (zh) | 控制玉米株高和穗位高基因zkm89及其应用 | |
CN111197034A (zh) | 基于基因编辑技术的Wx突变型蛋白及其基因在植物育种中的应用 | |
CN114990139B (zh) | CsHLS1基因或其编码的蛋白在调控黄瓜植株器官大小中的应用 | |
CN113265422B (zh) | 靶向敲除水稻粒型调控基因slg7的方法、水稻粒型调控基因slg7突变体及其应用 | |
CN116463356B (zh) | 大豆GmSPA3a变体及其在育种中的应用 | |
CN108395472A (zh) | 一种控制稻类粒长和粒重的基因及其应用 | |
CN115552038A (zh) | 通过4号染色体上的qtl增强玉米对玉米大斑病的抗病性 | |
CN109486830A (zh) | 水稻snb基因及应用、调控籽粒大小的方法 | |
CN107326035B (zh) | 一种调控水稻粒型和叶色的去泛素化酶基因ubp5及其应用 | |
CN112662687A (zh) | 推迟玉米花期的方法、试剂盒、基因 | |
CN115125262B (zh) | 一种水稻垩白相关基因及其编码蛋白和应用 | |
CN114395580B (zh) | 用于控制玉米株高的基因 | |
CN116694799A (zh) | 水稻OsAUX5基因中与稻米必需氨基酸积累相关InDel的位点及应用 | |
CN110407922B (zh) | 水稻耐冷基因qSCT11及其应用 | |
KR20230010678A (ko) | 표적화된 돌연변이유발에 의해 돌연변이체 식물을 수득하는 방법 | |
CN108148846B (zh) | 水稻叶型突变基因zy103及其应用 | |
CN113215187A (zh) | 一种利用CRISPR/Cas9技术快速获得香稻材料的方法 | |
CN111218434B (zh) | 小麦籽粒多酚氧化酶基因Ppo1突变体及其应用 | |
CN112646013B (zh) | 玉米开花期基因及其应用 | |
CN112194713B (zh) | 一种与水稻胚乳淀粉颗粒发育相关的蛋白fse5及其编码基因和应用 | |
CN112725359B (zh) | 水稻穗型调控基因sdp1及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210507 |