CN115216488B - 创制水稻大长粒型新种质或大长粒型矮杆新种质的方法及其应用 - Google Patents
创制水稻大长粒型新种质或大长粒型矮杆新种质的方法及其应用 Download PDFInfo
- Publication number
- CN115216488B CN115216488B CN202110432092.0A CN202110432092A CN115216488B CN 115216488 B CN115216488 B CN 115216488B CN 202110432092 A CN202110432092 A CN 202110432092A CN 115216488 B CN115216488 B CN 115216488B
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- leu
- val
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 115
- 235000009566 rice Nutrition 0.000 title claims abstract description 106
- 238000000034 method Methods 0.000 title claims abstract description 49
- 240000007594 Oryza sativa Species 0.000 title description 10
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 217
- 108091033409 CRISPR Proteins 0.000 claims abstract description 107
- 241000209094 Oryza Species 0.000 claims abstract description 105
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 44
- 235000013339 cereals Nutrition 0.000 claims abstract description 41
- 239000013598 vector Substances 0.000 claims abstract description 22
- 230000008685 targeting Effects 0.000 claims abstract description 14
- 238000009395 breeding Methods 0.000 claims abstract description 8
- 230000001488 breeding effect Effects 0.000 claims abstract description 7
- 108091027544 Subgenomic mRNA Proteins 0.000 claims abstract description 6
- 239000002773 nucleotide Substances 0.000 claims description 59
- 125000003729 nucleotide group Chemical group 0.000 claims description 59
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 20
- 230000004048 modification Effects 0.000 claims description 18
- 238000012986 modification Methods 0.000 claims description 18
- 108020005004 Guide RNA Proteins 0.000 claims description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 9
- 244000184734 Pyrus japonica Species 0.000 claims description 6
- 230000006870 function Effects 0.000 abstract description 43
- 241000196324 Embryophyta Species 0.000 abstract description 17
- 230000006872 improvement Effects 0.000 abstract description 7
- 230000008859 change Effects 0.000 abstract description 2
- 238000010362 genome editing Methods 0.000 description 36
- 102000004169 proteins and genes Human genes 0.000 description 31
- 108020004414 DNA Proteins 0.000 description 26
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 19
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 16
- 108010078144 glutaminyl-glycine Proteins 0.000 description 16
- 108010026333 seryl-proline Proteins 0.000 description 15
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 14
- 238000005516 engineering process Methods 0.000 description 14
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 14
- 108010047495 alanylglycine Proteins 0.000 description 13
- 108010016616 cysteinylglycine Proteins 0.000 description 13
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 12
- 238000012216 screening Methods 0.000 description 12
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 10
- 241000880493 Leptailurus serval Species 0.000 description 10
- 108010041407 alanylaspartic acid Proteins 0.000 description 10
- 150000001413 amino acids Chemical class 0.000 description 10
- 108010047857 aspartylglycine Proteins 0.000 description 10
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- 206010020649 Hyperkeratosis Diseases 0.000 description 9
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 9
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- 108010029020 prolylglycine Proteins 0.000 description 9
- 241000589158 Agrobacterium Species 0.000 description 8
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 8
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 8
- 108700019146 Transgenes Proteins 0.000 description 8
- 108010087924 alanylproline Proteins 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 108010077112 prolyl-proline Proteins 0.000 description 8
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 7
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 7
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 7
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 7
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 7
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 7
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 7
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 7
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 7
- 240000002582 Oryza sativa Indica Group Species 0.000 description 7
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 7
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 7
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 7
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 7
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 7
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 7
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 7
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 7
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 7
- 230000002441 reversible effect Effects 0.000 description 7
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 7
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 6
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 6
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 6
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 6
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 6
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 6
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 6
- 230000001965 increasing effect Effects 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 5
- 108700004991 Cas12a Proteins 0.000 description 5
- 108700024394 Exon Proteins 0.000 description 5
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 5
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 5
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 5
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 5
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 5
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 5
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 5
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 5
- KVMZNMYZCKORIG-UBHSHLNASA-N Trp-Cys-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KVMZNMYZCKORIG-UBHSHLNASA-N 0.000 description 5
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 5
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 5
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010020688 glycylhistidine Proteins 0.000 description 5
- 108010040030 histidinoalanine Proteins 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 4
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 4
- 101710197633 Actin-1 Proteins 0.000 description 4
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 4
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 4
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 4
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 4
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 4
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 4
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 4
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 4
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 4
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 4
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 4
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 4
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 4
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 4
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 4
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 4
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 4
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 4
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 4
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 4
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 4
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 4
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 4
- MCGOGXFMKHPMSQ-AVGNSLFASA-N His-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MCGOGXFMKHPMSQ-AVGNSLFASA-N 0.000 description 4
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 4
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 4
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 4
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 4
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 4
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 4
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 4
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 4
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 4
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 4
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 4
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 4
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 4
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 4
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 4
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 4
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 4
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 4
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 4
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 4
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 4
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 4
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 4
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 4
- 238000012300 Sequence Analysis Methods 0.000 description 4
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 4
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 4
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 4
- 238000010459 TALEN Methods 0.000 description 4
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 4
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 4
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 4
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 4
- DPMVSFFKGNKJLQ-VJBMBRPKSA-N Trp-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N DPMVSFFKGNKJLQ-VJBMBRPKSA-N 0.000 description 4
- YHRCLOURJWJABF-WDSOQIARSA-N Trp-His-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N YHRCLOURJWJABF-WDSOQIARSA-N 0.000 description 4
- OSMTVLSRTQDWHJ-JBACZVJFSA-N Tyr-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 OSMTVLSRTQDWHJ-JBACZVJFSA-N 0.000 description 4
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 4
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 4
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 4
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 4
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 4
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 4
- 241000746966 Zizania Species 0.000 description 4
- 235000002636 Zizania aquatica Nutrition 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 4
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 4
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 230000005782 double-strand break Effects 0.000 description 4
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- WXPZDDCNKXMOMC-AVGNSLFASA-N (2s)-1-[(2s)-2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@H](C(O)=O)CCC1 WXPZDDCNKXMOMC-AVGNSLFASA-N 0.000 description 3
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 3
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 3
- QNYWYYNQSXANBL-WDSOQIARSA-N Arg-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QNYWYYNQSXANBL-WDSOQIARSA-N 0.000 description 3
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 3
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 3
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 3
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 3
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 3
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 3
- -1 CRISPR/Cpf1 Proteins 0.000 description 3
- 238000010443 CRISPR/Cpf1 gene editing Methods 0.000 description 3
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 3
- CYHMMWIOEUVHHZ-IHRRRGAJSA-N Cys-Met-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CYHMMWIOEUVHHZ-IHRRRGAJSA-N 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 102000004533 Endonucleases Human genes 0.000 description 3
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 3
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 3
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 3
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 3
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 3
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 3
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 3
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 3
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 3
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 3
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 3
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 3
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 3
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 3
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 3
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 3
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 3
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 3
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 3
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 3
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 3
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 3
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 3
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 3
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 3
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 3
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 3
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 3
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 3
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 3
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 3
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 3
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 3
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 3
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 3
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 3
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 3
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 3
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 3
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 3
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 3
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 3
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 3
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 3
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 3
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 3
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 3
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 3
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 3
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 3
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 3
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 3
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 3
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 101150038500 cas9 gene Proteins 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 238000012246 gene addition Methods 0.000 description 3
- 238000012224 gene deletion Methods 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010017446 glycyl-prolyl-arginyl-proline Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010028295 histidylhistidine Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 238000003753 real-time PCR Methods 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 2
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 2
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 2
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 2
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 2
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- LRZPRGJXAZFXCR-DCAQKATOSA-N Cys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N LRZPRGJXAZFXCR-DCAQKATOSA-N 0.000 description 2
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 2
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 2
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 2
- XWTGTTNUCCEFJI-UBHSHLNASA-N Cys-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N XWTGTTNUCCEFJI-UBHSHLNASA-N 0.000 description 2
- YTMBNLHIDIKJIU-HCXYKTFWSA-N D-Arginyl-L-arginyl-D-glutaminyl-L-phenylalanine Chemical compound NC(=N)NCCC[C@@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](CCC(O)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YTMBNLHIDIKJIU-HCXYKTFWSA-N 0.000 description 2
- 230000007018 DNA scission Effects 0.000 description 2
- 108091029865 Exogenous DNA Proteins 0.000 description 2
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 2
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 2
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 2
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 2
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 2
- SRGRINJFBHKHAC-NAKRPEOUSA-N Ile-Cys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N SRGRINJFBHKHAC-NAKRPEOUSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 2
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 2
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 2
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 2
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 2
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- 240000000111 Saccharum officinarum Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 2
- SUGLEXVWEJOCGN-ONUFPDRFSA-N Trp-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)O SUGLEXVWEJOCGN-ONUFPDRFSA-N 0.000 description 2
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 2
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 235000013361 beverage Nutrition 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 235000021329 brown rice Nutrition 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 150000007523 nucleic acids Chemical group 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 230000009437 off-target effect Effects 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 238000010008 shearing Methods 0.000 description 2
- 238000011895 specific detection Methods 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- IAOXXKYIZHCAQJ-ACZMJKKPSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2,4-diamino-4-oxobutanoyl]amino]propanoyl]amino]acetyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O IAOXXKYIZHCAQJ-ACZMJKKPSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- IXVMHGVQKLDRKH-VRESXRICSA-N Brassinolide Natural products O=C1OC[C@@H]2[C@@H]3[C@@](C)([C@H]([C@@H]([C@@H](O)[C@H](O)[C@H](C(C)C)C)C)CC3)CC[C@@H]2[C@]2(C)[C@@H]1C[C@H](O)[C@H](O)C2 IXVMHGVQKLDRKH-VRESXRICSA-N 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- SCOPAVYZWHPDBA-DCAQKATOSA-N Cys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N SCOPAVYZWHPDBA-DCAQKATOSA-N 0.000 description 1
- GQNZIAGMRXOFJX-GUBZILKMSA-N Cys-Val-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O GQNZIAGMRXOFJX-GUBZILKMSA-N 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 1
- SYIPVNMWBZXKMU-HJPIBITLSA-N His-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N SYIPVNMWBZXKMU-HJPIBITLSA-N 0.000 description 1
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- OAQJOXZPGHTJNA-NGTWOADLSA-N Ile-Trp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N OAQJOXZPGHTJNA-NGTWOADLSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 1
- AFVOKRHYSSFPHC-STECZYCISA-N Met-Ile-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFVOKRHYSSFPHC-STECZYCISA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 108020005093 RNA Precursors Proteins 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- 101150076211 TH gene Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 1
- FRQRWAMUESPWMT-HSHDSVGOSA-N Thr-Trp-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N)O FRQRWAMUESPWMT-HSHDSVGOSA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- VOCHZIJXPRBVSI-XIRDDKMYSA-N Trp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VOCHZIJXPRBVSI-XIRDDKMYSA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- RSEIVHMDTNNEOW-JYJNAYRXSA-N Val-Trp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N RSEIVHMDTNNEOW-JYJNAYRXSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- IXVMHGVQKLDRKH-KNBKMWSGSA-N brassinolide Chemical compound C1OC(=O)[C@H]2C[C@H](O)[C@H](O)C[C@]2(C)[C@H]2CC[C@]3(C)[C@@H]([C@H](C)[C@@H](O)[C@H](O)[C@@H](C)C(C)C)CC[C@H]3[C@@H]21 IXVMHGVQKLDRKH-KNBKMWSGSA-N 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 101150059443 cas12a gene Proteins 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000000408 embryogenic effect Effects 0.000 description 1
- 230000002616 endonucleolytic effect Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000012882 rooting medium Substances 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8218—Antisense, co-suppression, viral induced gene silencing [VIGS], post-transcriptional induced gene silencing [PTGS]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8202—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation by biological means, e.g. cell mediated or natural vector
- C12N15/8205—Agrobacterium mediated transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Botany (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Virology (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本申请公开了创制水稻大长粒型新种质或大长粒矮杆新种质的方法,其包括同时修饰水稻中的OsPPKL1基因和OsPPKL3基因,使OsPPKL1基因和OsPPKL3基因的结构或功能发生变化。本申请还公开了通过所述方法创制的水稻用于育种或改良种质资源的用途,能够靶向水稻OsPPKL1基因和/或OsPPKL3基因的sgRNA,以及能够靶向水稻OsPPKL1基因和/或OsPPKL3基因的CRISPR/Cas9编辑载体。本申请为水稻育种改良提供了一种新的方法,特别是对于一些其它性状良好,粒型尚需改良的品种,提供了一种便捷高效的改良策略。同时通过对OsPPKL1基因进行不同结构改变,可以获得株高不变或者矮杆的表型,对需要同时改良株高和粒型的品种,也提供一种便捷高效的改良策略。
Description
技术领域
本申请涉及基因编辑技术领域,尤其涉及一种以CRISPR/Cas9基因编辑技术创制OsPPKL1及OsPPKL3基因突变体来改变水稻粒型及株高的方法。
背景技术
水稻是中国主要的粮食作物,全国60%以上的人口以稻米为主食。水稻籽粒形状是影响产量的重要性状之一,粒型增大与水稻增产直接相关。同时籽粒形状也影响稻米的外观品质,随着人民生活的日益改善,对米质要求越来越高。在目前国内餐饮市场上,长粒米广受青睐。有一些水稻品种其它性状很好,却因为粒型不够长而限制了其市场份额。另外株高也是一个很重要的农艺性状,对于抗倒伏起到很重要的作用。
水稻OsPPKL1基因与油菜内酯调控途径有关,该基因含有Kelch结构域,已有研究表明该Kelch结构域调控水稻的籽粒大小,OsPPKL1基因中Kelch结构域的缺失导致了大粒的表型,而OsPPKL3是OsPPKL1的同源基因。
以Cas9为代表的基因编辑技术是近年来分子育种领域的热点技术,它能够实现亲本目标性状的快速精准改良,且不涉及转基因安全问题。该技术在育种中的应用大多数是通过对目标基因进行碱基的添加或删除,从而使基因功能失活,获得相应的目标性状。目前还没有利用CRISPR技术编辑OsPPKL1和OsPPKL3基因,创制粒型增大且矮杆的水稻种质的相关研究。
发明内容
本申请涉及一种以CRISPR/Cas9基因编辑技术创制OsPPKL1基因及OsPPKL3基因突变体来改良水稻粒型和/或株型的方法。最终获得了粒型明显增长和/或矮杆的优质水稻品种。为实现上述目的,本申请采用如下技术方案。
在第一方面,本申请提供了创制水稻大长粒型新种质或大长粒型矮杆新种质的方法,其包括修饰水稻中的OsPPKL1基因及OsPPKL3基因,使OsPPKL1基因和OsPPKL3基因的结构或功能发生变化。
在一些实施方案中,所述修饰导致OsPPKL1基因发生缺失、插入、取代或以上的组合。
在一些实施方案中,所述修饰导致OsPPKL3基因发生缺失、插入、取代或以上的组合。
在一些具体的实施方案中,所述修饰导致OsPPKL1基因缺失或添加3的倍数数量的碱基,例如3bp、6bp、9bp。
在一些实施方案中,所述OsPPKL1基因包含以下序列或由以下序列组成:SEQ IDNO:1所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:2所示的氨基酸序列或与其具有至少90%、95%、99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在一些实施方案中,所述OsPPKL3基因包含以下序列或由以下序列组成:SEQ IDNO:3所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:4所示的氨基酸序列或与其具有至少90%、95%、99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在优选的实施方案中,所述水稻为籼稻或粳稻。
在一些实施方案中,所述修饰通过基因编辑进行。
在一些实施方案中,所述基因编辑通过选自以下的序列特异性核酸酶中的一种或多种进行:CRISPR/Cas9、CRISPR/Cpf1、CRISPR/Cas12a、TALEN、大范围核酸酶和ZFN。优选地,所述基因编辑通过CRISPR/Cas9进行。
在一些实施方案中,所述CRISPR/Cas9包含Cas9,其中所述Cas9包含SEQ ID NO:9所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列。
在一些实施方案中,所述CRISPR/Cas9包含导向RNA(sgRNA),其中所述sgRNA靶向OsPPKL1基因和/或OsPPKL3基因。
在一些具体的实施方案中,所述sgRNA靶向SEQ ID NO:5和/或SEQ ID NO:6所示的核苷酸序列。
在一些具体的实施方案中,所述sgRNA包含SEQ ID NO:7和/或SEQ ID NO:8所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性的保留其功能的活性变体序列。
在第二方面,本申请提供了通过第一方面所述的方法创制的水稻用于育种的用途。
在第三方面,本申请提供了通过第一方面所述的方法创制的水稻用于改良种质资源的用途。
在第四方面,本申请提供了通过第一方面所述的方法创制的水稻的种子制成的制品,其选自食品、饮品、饲料或工业原料。
在第五方面,本申请提供了能够靶向水稻OsPPKL1基因的sgRNA,其包含SEQ IDNO:7所示的核苷酸序列或由SEQ ID NO:7所示的核苷酸序列组成。
在第六方面,本申请提供了能够靶向水稻OsPPKL3基因的sgRNA,其包含SEQ IDNO:8所示的核苷酸序列或由SEQ ID NO:8所示的核苷酸序列组成。
在第七方面,本申请提供了能够靶向水稻OsPPKL1基因和/或OsPPKL3基因基因的CRISPR/Cas9编辑载体,其包含表达Cas9的第一表达盒和表达sgRNA的第二表达盒,其中所述Cas9包含SEQ ID NO:9所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,并且所述sgRNA包含SEQ ID NO:7和/或SEQ ID NO:8所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性的保留其功能的活性变体序列。
在第八方面,本申请提供了创制水稻大长粒型新种质或大长粒型矮杆新种质的方法,其包括将第七方面所述的CRISPR/Cas9编辑载体导入含有OsPPKL1基因和OsPPKL3基因的水稻中。
在一些实施方案中,所述OsPPKL1基因包含以下序列或由以下序列组成:SEQ IDNO:1所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:2所示的氨基酸序列或与其具有至少90%、95%、99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在一些实施方案中,所述OsPPKL3基因包含以下序列或由以下序列组成:SEQ IDNO:3所示的核苷酸序列或与其具有至少90%、95%、99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:4所示的氨基酸序列或与其具有至少90%、95%、99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在优选的实施方案中,所述水稻为籼稻或粳稻。
本申请提出了一种基因编辑技术的巧妙运用,利用其定向打靶功能来实现基因突变,并筛选出特定突变类型。该方法简单易行、成本低、并且效率高。本申请利用基因编辑技术对OsPPKL1基因及OsPPKL3基因功能的定点编辑,成功地获得了粒型显著增大的矮杆籼稻品种,也为利用CRISPR/Cas9基因编辑技术快速改良水稻品种的粒型提供一种有效的策略。
附图说明
图1显示了野生型水稻和编辑株系中OsPPKL1基因序列的序列关系。其中WT:野生型;418C037003a、418C026036a、418C026014a、407C008004a和418F003063a:编辑株系;下划线表示PAM序列;虚线表示缺失序列;方框表示外显子;斜体字表示插入序列。
图2显示了野生型水稻和编辑株系中OsPPKL3基因序列的序列关系。其中WT:野生型;418C037003a、418C026036a、418C026014a、407C008004a和418F003063a:编辑株系;下划线表示PAM序列;虚线表示缺失序列;方框表示外显子;斜体字表示插入序列。
图3显示了野生型株系(X32或X12)和编辑株系OsPPKL1基因编码的氨基酸序列。其中WT:野生型(X32或X12);418C037003a、418C026036a、407C008004a和418F003063a:编辑株系;虚线表示缺失的氨基酸,斜体字表示无法与野生型匹配的氨基酸。
图4显示了野生型株系(X32或X12)和编辑株系OsPPKL3基因编码的氨基酸序列。其中WT:野生型(X32或X12);418C037003a、418C026014a和407C008004a:编辑株系;虚线表示缺失的氨基酸,斜体字表示无法与野生型匹配的氨基酸。
图5显示了OsPPKL1基因和OsPPKL3基因编辑株系与野生型株系(X32)的籽粒比较,其中WT(X32):野生型;418C037003a、418C026036a和418C026014a:编辑株系。图5中的上图为稻谷的照片,下图为米粒的照片。
图6显示了OsPPKL1基因和OsPPKL3基因编辑株系与野生型株系(X12)的株型比较,其中WT(X12):野生型;407C008004a和418F003063a:编辑株系。
图7显示了本申请的编辑载体的结构示意图。
详细描述
提供以下定义和方法以更好地界定本申请以及在本申请实践中指导本领域普通技术人员。除非另作说明,本申请的术语按照相关领域普通技术人员的常规用法理解。
定义
本文使用的术语“基因编辑(gene editing)”或“基因组编辑(genome edting)”,是一种新兴的比较精确的能对生物体基因组特定目标基因进行修饰的一种基因工程技术。基因编辑指能够对目标基因进行定点“编辑”,实现对特定DNA片段的修饰。基因编辑依赖于经过基因工程改造的核酸酶,也称“分子剪刀”,在基因组中特定位置产生基因特异性双链断裂(DSB),诱导生物体通过非同源末端连接(NHEJ)或同源重组(HR)来修复DSB,因为这个修复过程容易出错,从而导致靶向突变。
本文使用的术语“CRISPR/Cas9”是指使用RNA引导链靶向内切核酸酶切割基因的内切核酸酶。参见,Jinek等人,Science 337:816-821(2013);Cong等人,Science(2013年1月3日);和Mali等人,Science(2013年1月3日)。目前发现的CRISPR/Cas9系统有三种不同类型即I型、II型和III型,它们存在于大约40%已测序的真细菌和90%已测序的古细菌中。其中II型的组成较为简单,以Cas9蛋白以及向导RNA(gRNA)为核心组成,也是目前研究得最深入的类型。当细菌抵御噬菌体等外源DNA入侵时,在前导区的调控下,CRISPR被转录为长的RNA前体(pre-crRNA),然后加工成一系列短的含有保守重复序列和间隔区的成熟crRNA,最终识别并结合到与其互补的外源DNA序列上发挥剪切作用。CRISPR/Cas9的剪切基因位于crRNA互补序列下游临近的PAM区(Protospacer Adjacent Motif)的5’-GG-N18-NGG-3’特征区域中的NGG基因,而这种特征的序列在每128bp的随机DNA序列中就重复出现一次。
本文使用的术语“CRISPR/Cpf1”是一类新型的CRISPR-Cas系统。目前得到广泛应用的CRISPR/Cas9系统经常会发生脱靶效应。作为CRISPR/Cas9系统的一种潜在的对手,CRISPR-Cpf1系统作为基因编辑的新工具,进一步扩大了基因编辑靶基因的选择范围,同时几乎没有脱靶效应。Cpf1是一种新型CRISPR效应蛋白,具有许多与Cas9不同的特性,有利于克服CRISPR/Cas9应用中的一些限制。
本文使用的术语“CRISPR/Cas12a”也是一类新型的CRISPR-Cas系统。相较于Cas9蛋白来说,Cas12a蛋白更具准确度,同时也更安全。CRISPR/Cas9工作时,Cas9蛋白识别PAM序列(RNA编写的遗传密码),通过gRNA解开部分双螺旋,在这个过程中,Cas9蛋白一旦找到匹配较好的序列时,便会紧密贴合在该段DNA上,而在这个过程中,也许会出现一些错配的现象,但这种结合是不可逆的。而在这一方面,Cas12a就显得机智许多,它在寻找“靶标”时,会对沿途的DNA序列进行单碱基的识别,如若发现有匹配不好的碱基,便会离开重新寻找,在寻找到PAM序列时,Cas12a蛋白会与PAM序列形成一种半封闭的R-环(R-loop),当识别到正确的序列时,才会完全结合成封闭的R-环,所以这种结合是可逆的,这也体现出了它更安全的一面。
本文使用的术语“转录激活子样效应器核苷酶”或“TAL效应器核苷酶”或“TALEN”是指一类通过使TAL效应器DNA结合结构域与DNA切割结构域融合而产生的人工限制内切核酸酶。
本文使用的术语“锌指核酸酶(ZFN)”由一个DNA识别域以及一个DNA剪切域组成。DNA识别域为3~4个ZF串联结构,每个ZF约含30个氨基酸,被1个锌离子所固定,可识别并结合1个特异的三联体碱基,DNA剪切域由非特异性核酸内切酶Fok I羧基端的96个氨基酸残基组成。每个Fok I单体与1个ZFP相连构成1个ZFN,并识别特定的基因,当2个识别基因相距恰当的距离时(6~8bp),2个单体ZFN相互作用产生酶切功能,形成双链断裂,从而介导DNA定点剪切。
本文使用的术语“大范围核酸酶(meganuclease)”是指能够识别14-40碱基长度的核酸序列的归巢核酸内切酶。一些大范围核酸酶可以容忍小的归巢基因序列差异,大的识别区域仍然能够保证这些酶的高度特异性,而这反过来又可保持低水平的基因组内非特异性裂解和较低的毒性。大范围核酸酶由自我剪接RNA内含子或自我剪接蛋白内含子序列的移动序列内的开放阅读框架编码。
本文使用的术语“同一性”,在两个或更多个核酸或多肽的情况下,指相同或具有相同核苷酸或氨基酸的指定百分比(即在指定区域内,当在比较窗或指定区域内比较和比对最大一致性时,约60%的同一性,优选65%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或更高的同一性)的两条或更多条序列或亚序列,如利用BLAST或BLAST 2.0序列比较算法以默认参数或通过手工比对和可视检查(参阅如NCBI网站等)所测量的。
具体实施方式
本申请提出了一种以CRISPR/Cas9基因编辑技术创制OsPPKL1基因结构发生变化并且OsPPKL3基因功能丧失来改良水稻粒型及株高的方法,其适用于所有含有功能型OsPPKL1基因和OsPPKL3基因的水稻品种中,包括以下步骤:
(1)拟改良水稻品种OsPPKL1基因和OsPPKL3基因的序列分析,并据其设计靶标;
(2)构建编辑载体,将拟改良水稻品种的愈伤组织作为遗传转化的受体材料,用农杆菌介导法将编辑载体导入愈伤组织细胞,并再生成水稻株系;
(3)对OsPPKL1基因和OsPPKL3基因编辑株系的基因型检测和分析,选取目标基因编辑株系进行性状分析,考察粒型、株高等情况。
(4)经过传代分离、鉴定筛选,得到编辑纯合且非转基因的大粒长粒矮杆表型水稻种质。
本申请的创制水稻大长粒型矮杆新种质的方法,具体包括以下步骤:
(1)水稻中OsPPKL1基因和OsPPKL3基因的检测;
(2)水稻中OsPPKL1基因和OsPPKL3基因CRISPR/cas9编辑载体的构建;
(3)将籼稻的愈伤组织作为遗传转化的受体材料,用农杆菌介导法将编辑载体导入,鉴定编辑株系中OsPPKL1基因和OsPPKL3基因的编辑情况,获得不同编辑类型的OsPPKL1基因和OsPPKL3基因等位基因。经过传代分离、鉴定筛选,得到编辑纯合且非转基因的株系,最终获得粒型增大增长矮杆的改良品系。
本申请具体涉及如下技术方案。
在第一方面,本申请提供了创制水稻大长粒型新种质或大长粒型矮杆新种质的方法,其包括修饰水稻中的OsPPKL1基因和OsPPKL3基因,使其基因功能丧失或结构变化。
在一些具体的实施方案中,所述修饰使OsPPKL1基因结构发生变化,同时使OsPPKL3基因功能丧失。
在一些实施方案中,所述修饰导致OsPPKL1基因发生缺失、插入、取代或以上的组合。
在一些实施方案中,所述修饰导致OsPPKL3基因发生缺失、插入、取代或以上的组合。
在一些具体的实施方案中,所述修饰导致OsPPKL1基因缺失或添加3的倍数数量的碱基,例如3bp、6bp、9bp。
在一些具体的实施方案中,所述修饰导致OsPPKL3基因功能丧失。优选地,所述修饰导致OsPPKL3基因缺失或添加1或2个碱基。
在一些实施方案中,所述OsPPKL1基因包含以下序列或由以下序列组成:SEQ IDNO:1所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:2所示的氨基酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在一些实施方案中,所述OsPPKL3基因包含以下序列或由以下序列组成:SEQ IDNO:3所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:4所示的氨基酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
本文所用的术语“活性变体序列”指基本上相似的序列。对于核苷酸序列,活性变体序列包括由于遗传密码子简并性而编码相同功能的蛋白质的那些序列。诸如天然存在的等位基因变体可以使用公知的分子生物学技术例如聚合酶链式反应(PCR)和杂交技术来鉴定。例如,在本申请中,与籼稻X12或X32的OsPPKL1基因具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的同一性并且也编码功能性OsPPKL1蛋白的来自其他水稻品种的基因,包括在本申请定义的“活性变体序列”中。在本申请中,与籼稻X12的OsPPKL3基因具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的同一性并且也编码功能性OsPPKL3蛋白的来自其他水稻品种的基因,包括在本申请定义的“活性变体序列”中。同一性的确定是通过本文描述的序列比对程序,所述程序使用默认参数。核苷酸的活性变体序列与该核苷酸序列的差异可以少至1-15个核苷酸、少至1-10个(例如6-10个),少至5个,少至4,3,2或甚至1个核苷酸。
对于蛋白质序列,术语“活性变体序列”包括衍生自天然蛋白的多肽,所述衍生是通过缺失(所谓的截短)天然蛋白N末端和/或C末端的一个或多个氨基酸或将一个或多个氨基酸添加至所述天然蛋白的N末端和/或C末端;在天然蛋白的一个或多个基因缺失或添加一个或多个氨基酸;或者在天然蛋白的一个或多个基因取代一个或多个氨基酸。因而,就蛋白质而言,术语“活性变体序列”包括天然蛋白的生物活性片段,其包含保留天然蛋白生物学活性,例如具有OsPPKL1或OsPPKL3蛋白功能的足够数目的连续氨基酸残基。这样的功能相对天然蛋白可以是不同的或者是改良的,或者可以是不变的,只要保留了OsPPKL1或OsPPKL3蛋白功能。同一性的确定是通过本文描述的序列比对程序,所述程序使用默认参数。蛋白的活性变体序列与该蛋白的差异可以少至1-15个氨基酸残基、少至1-10个(例如6-10个),少至5个,少至4,3,2或甚至1个氨基酸残基。
在优选的实施方案中,所述水稻为籼稻或粳稻。
在一些实施方案中,所述修饰通过基因编辑进行。
在一些实施方案中,所述基因编辑通过选自以下的序列特异性核酸酶中的一种或多种进行:CRISPR/Cas9、CRISPR/Cpf1、CRISPR/Cas12a、TALEN、大范围核酸酶和ZFN。优选地,所述基因编辑通过CRISPR/Cas9进行。
在一些实施方案中,所述CRISPR/Cas9包含Cas9,其中所述Cas9包含SEQ ID NO:9所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列。
在一些实施方案中,所述CRISPR/Cas9包含导向RNA(sgRNA),其中所述sgRNA靶向OsPPKL1基因。
在一些实施方案中,所述CRISPR/Cas9包含导向RNA(sgRNA),其中所述sgRNA靶向OsPPKL3基因。
在一些实施方案中,所述CRISPR/Cas9包含导向RNA(sgRNA),其中所述sgRNA靶向OsPPKL1基因和OsPPKL3基因。
在一些具体的实施方案中,所述sgRNA靶向SEQ ID NO:5所示的核苷酸序列。
在一些具体的实施方案中,所述sgRNA靶向SEQ ID NO:6所示的核苷酸序列。
在一些具体的实施方案中,所述sgRNA靶向SEQ ID NO:5和SEQ ID NO:6所示的核苷酸序列。
在一些具体的实施方案中,所述sgRNA包含SEQ ID NO:7所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列。
在一些具体的实施方案中,所述sgRNA包含SEQ ID NO:8所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列。
在一些具体的实施方案中,所述sgRNA包含SEQ ID NO:7和SEQ ID NO:8所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列。
在第二方面,本申请提供了通过第一方面所述的方法创制的水稻用于育种的用途。
在第三方面,本申请提供了通过第一方面所述的方法创制的水稻用于改良种质资源的用途。
在第四方面,本申请提供了通过第一方面所述的方法创制的水稻的种子制成的制品,其选自食品、饮品、饲料或工业原料。
在第五方面,本申请提供了能够靶向水稻OsPPKL1基因的sgRNA,其包含SEQ IDNO:7所示的核苷酸序列或由SEQ ID NO:7所示的核苷酸序列组成。
在第六方面,本申请提供了能够靶向水稻OsPPKL3基因的sgRNA,其包含SEQ IDNO:8所示的核苷酸序列或由SEQ ID NO:8所示的核苷酸序列组成。
在第七方面,本申请提供了能够靶向水稻OsPPKL1基因的CRISPR/Cas9编辑载体,其包含表达Cas9的第一表达盒和表达sgRNA的第二表达盒,其中所述Cas9包含SEQ ID NO:9所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,并且所述sgRNA包含SEQ ID NO:7所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列。
在第八方面,本申请提供了能够靶向水稻OsPPKL3的CRISPR/Cas9编辑载体,其包含表达Cas9的第一表达盒和表达sgRNA的第二表达盒,其中所述Cas9包含SEQ ID NO:9所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,并且所述sgRNA包含SEQ ID NO:8所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列。
在第九方面,本申请提供了能够靶向水稻OsPPKL1基因及OsPPKL3的CRISPR/Cas9编辑载体,其包含表达Cas9的第一表达盒和表达sgRNA的第二表达盒,其中所述Cas9包含SEQ ID NO:9所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,并且所述sgRNA包含SEQ ID NO:7和SEQ ID NO:8所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列。
在第十方面,本申请提供了创制水稻大长粒型新种质或大长粒型矮杆新种质的方法,其包括将第七方面至第九方面中的任一方面所述的CRISPR/Cas9编辑载体导入含有OsPPKL1基因及OsPPKL3基因的水稻中。
在一些实施方案中,所述OsPPKL1基因包含以下序列或由以下序列组成:SEQ IDNO:1所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:2所示的氨基酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在一些实施方案中,所述OsPPKL3基因包含以下序列或由以下序列组成:SEQ IDNO:3所示的核苷酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性并编码相同功能的蛋白质的活性变体序列,或者编码SEQ ID NO:4所示的氨基酸序列或与其具有至少约70%,75%,80%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高的序列同一性的保留其功能的活性变体序列的核苷酸序列。
在优选的实施方案中,所述水稻为一些其它性状良好,粒型或株型尚需改良的品种,例如籼稻或粳稻。在优选的实施方案中,所述水稻为籼稻X12或X32。
本说明书和权利要求书中,词语“包括”、“包含”和“含有”意指“包括但不限于”,且并非意图排除其他部分、添加物、组分、或步骤。
应该理解,在本申请的特定方面、实施方案或实施例中描述的特征、特性、组分或步骤,可适用于本文所描述的任何其他的方面、实施方案或实施例,除非与之矛盾。
实施例
以下实施例用于说明本申请,但不用来限制本申请的范围。在不背离本申请精神和实质的情况下,对本申请方法、步骤或条件所作的修改或替换,均属于本申请的范围。
若未特别指明,实施例中所用的化学试剂均为常规市售试剂,实施例中所用的技术手段为本领域技术人员所熟知的常规手段。
以下实施例中使用的水稻品种为籼稻X12和X32,其由中国种子集团有限公司提供。
实施例1CRISPR/Cas9编缉载体的构建
水稻OsPPKL1基因及OsPPKL3基因的序列分析及基因编辑靶点设计
水稻OsPPKL1基因的序列如SEQ ID NO:1所示。水稻OsPPKL3基因的序列如SEQ IDNO:3所示。序列分析显示,OsPPKL1基因含有21个外显子,外显子为OsPPKL1基因的第1-415位碱基,第1322-1382位碱基,第1546-1624位碱基,第1801-1880位碱基,第1960-2054位碱基,第2161-2260位碱基,第2601-2670位碱基,第3548-3649位碱基,第3734-3815位碱基,第4325-4504位碱基,第4813-5086位碱基,第5201-5441位碱基,第5613-5783位碱基,第5948-6186位碱基,第6674-6758位碱基,第6853-6951位碱基,第7036-7235位碱基,第7326-7401位碱基,第7492-7627位碱基,第7707-7846位碱基和第7941-8027位碱基。OsPPKL3基因含有21个外显子,外显子为OsPPKL3基因的第1-430位碱基,第1359-1419位碱基,第1590-1668位碱基,第1750-1829位碱基,第1913-2007位碱基,第2109-2208位碱基,第2732-2801位碱基,第3381-3482位碱基,第3587-3668位碱基,第4076-4255位碱基,第4757-5030位碱基,第5162-5405位碱基,第5607-5777位碱基,第5986-6224位碱基,第6531-6615位碱基,第6713-6811位碱基,第6892-7091位碱基,第7180-7255位碱基,第7347-7482位碱基,第7569-7708位碱基和第7805-7892位碱基。本实施例所涉及的靶序列在籼稻X12及X32材料中OsPPKL1基因第10个外显子的1-18位碱基(AGTGCTAGACACAGCTGC(SEQ ID NO:5))如图1所示;OsPPKL3基因第10个外显子的103-123位碱基(TAGAGCTTACACGACGGTGC(SEQ ID NO:6)),如图2所示。
CRISPR/Cas9编缉载体的构建
本实施例采用的基因编辑技术为第三代基因编辑技术CRISPR/cas9,所使用的载体根据Liu等人(Hao Liu,Yuduan Ding,Yanqing Zhou,Wenqi Jin,Kabin Xie*,Ling-LingChen*.CRISPR-P 2.0:an improved CRISPR/Cas9 tool for genome editing inplants.Mol Plant,2017,10(3):530-532)的方法设计,利用CRISPR-P2.0(http://crispr.hzau.edu.cn/CRISPR2/)在线工具设计了导向RNA(sgRNA)(sgRNA的序列如SEQ IDNO:7或SEQ ID NO:8所示)。sgRNA是由引导RNA(gRNA)和骨架RNA(gRNA scaffold)核苷酸序列连接起来的一个整体分子。植物表达载体包含甘蔗Ubi4启动子(即甘蔗泛素4启动子)驱动的Cas9(Cas9的序列如SEQ ID NO:9所示)表达盒,以及水稻U3启动子驱动的sgRNA表达盒。构建的CRISPR/Cas9编缉载体的图谱请参见图7,编辑载体包含两个sgRNA,即SEQ IDNO:7及SEQ ID NO:8所示的序列。
实施例2籼稻X12和X32的遗传转化
将实施例1构建的编辑载体转入农杆菌菌株EHA105(华中农业大学馈赠)中并进行测序鉴定。由籼稻X12和X32种子诱导产生胚性愈伤组织,与含有上述编缉载体的农杆菌EHA105共同孵育,经过筛选再生等一系列步骤完成遗传转化。农杆菌侵染水稻愈伤组织及筛选、分化流程参照Nishimura等的文献报道(A protocol for Agrobacterium-mediatedtransformation in rice.Nature protocols,2006,6:2 796-2 802)。具体流程包括,水稻愈伤组织与农杆菌共培养3天后,将愈伤组织于250mg/LCarbenicillin的ddH2O中浸泡30min,去除多余的农杆菌后置于含250mg/L Carbenicillin的筛选培养基上筛选3轮,然后将抗性愈伤转移至再生培养基上培养,待分化出2~3cm的小苗后,将其转移至生根培养基中继续培养2周。
实施例3T0代OsPPKL1基因及OsPPKL3基因编辑水稻的筛选和鉴定
将实施例2中的再生植株送至温室种植,取再生的T0代小苗叶片,通过CTAB法提取植物基因组DNA作为模板。DNA样品用荧光定量PCR方法进行阳性检测。荧光定量PCR方法选择筛选标记基因CP4作为目标基因和水稻ACTIN1基因作为内参基因,于荧光定量PCR仪上进行扩增和荧光值检测,根据RQ值筛选出基因编辑阳性的植株(CP4基因RQ值>0.1,结果未示出)。用于扩增筛选标记基因CP4的引物序列为csp356:CAGCACAGGTTAAGTCTG(SEQ ID NO:10)和csp357:GTCTGTCTCAACGGTAAG(SEQ ID NO:11);用于扩增ACTIN1基因的引物序列为csp106:TGCTATGTACGTCGCCATCCAG(SEQ ID NO:12)和csp107:AATGAGTAACCACGCTCCGTCA(SEQ ID NO:13)。
用OsPPKL1基因靶基因特异性检测引物(正向引物为5’端含有FAM荧光修饰的csp4199:5’-CATCACTCTGTGATGACATTGCCAG-3’(SEQ ID NO:14);反向引物csp4062:5’-AAAGGCTCAAAGCCTATAGC-3’(SEQ ID NO:15))及OsPPKL3基因靶基因特异性检测引物(正向引物为5’端含有FAM荧光修饰的csp7268:5’-GCCAGTTGGTCTTAGTATG-3’(SEQ ID NO:16);反向引物csp7247:5’-CAGCTGAACAGGACTCATAGG-3’(SEQ ID NO:17))和Q5超保真DNA聚合酶进行PCR扩增。待反应结束后,利用琼脂糖电泳检测PCR扩增成功与否。之后利用ABI3730XL测序仪进行毛细管电泳,之后用软件分析扩增片段的碱基长度。
结果显示在29株T0代X32水稻中有28株的OsPPKL1基因和OsPPKL3基因发生插入或缺失,突变率为96.6%。其中有8个株系OsPPKL1基因缺失为3的倍数,这是我们的目标编辑结果,即同时编辑OsPPKL1基因及OsPPKL3基因,且OsPPKL1基因的缺失为3的倍数。另外,所有发生编辑的株系中,OsPPKL1基因及OsPPKL3基因的两个拷贝都被成功编辑。这些数据表明,编辑载体能够高效地编辑OsPPKL1基因及OsPPKL3基因。
结果显示在38株T0代X12水稻中有37株的OsPPKL1基因和OsPPKL3基因发生插入或缺失,突变率为97.3%。其中有11个株系OsPPKL1基因缺失为3的倍数,这是我们的目标编辑结果,即同时编辑OsPPKL1基因及OsPPKL3基因,且OsPPKL1基因的缺失为3的倍数。另外,所有发生编辑的株系中,OsPPKL1基因及OsPPKL3基因的两个拷贝都被成功编辑。这些数据表明,编辑载体能够高效地编辑OsPPKL1基因及OsPPKL3基因。
实施例4T1代OsPPKL1基因及OsPPKL3基因编辑纯合非转基因水稻的筛选和鉴定
将实施例3中鉴定的阳性水稻株系移入温室进行移栽种植,收取其自交T1代种子。选取3株编辑结果为大片段缺失的T1代X12种子(300粒),进行发芽、育苗,提取幼苗的DNA进行转基因成分检测及编辑基因PCR检测,去掉含有转基因成分、没有编辑或者编辑基因杂合的株系,保留不携带转基因成分,并且编辑基因纯合的株系,其编辑情况主要为2种类型,选取2种编辑情况各一个株系将其分别命名为407C008004a和418F003063a。转基因成分检测方法请参见《基因编辑后代材料GMO检测流程》,其是根据中国国家标准农业部953号公告6-2007号公告GB/T19495—2004转基因产品检测标准要求进行。为确认407C008004a和418F003063a编辑株系中OsPPKL1基因及OsPPKL3基因的编辑情况,对其靶基因区域进行PCR扩增和测序。结果显示,407C008004a的OsPPKL1基因缺失了3bp核苷酸,OsPPKL3基因有一个碱基插入。418F003063a的OsPPKL1基因缺失了6bp核苷酸,OsPPKL3基因有一个碱基插入(参见图1和图2)。
选取4株编辑结果为大片段缺失的T1代X32种子(400粒),进行发芽、育苗,提取幼苗的DNA进行转基因成分检测及编辑基因PCR检测,去掉含有转基因成分、没有编辑或者编辑基因杂合的株系,保留不携带转基因成分,并且编辑基因纯合的株系,OsPPKL1编辑情况主要为2种类型,OsPPKL3编辑情况主要为2种类型。选取2个基因都编辑的不同编辑情况组合各一个株系将其分别命名为418C037003a、418C026036a和418C026014a。为确认418C037003a、418C026036a和418C026014a编辑株系中OsPPKL1基因及OsPPKL3基因的编辑情况,对其靶基因区域进行PCR扩增和测序。结果显示,418C037003a的OsPPKL1缺失了3bp核苷酸,另外有1bp核苷酸替换,OsPPKL3有1bp核苷酸插入;418C026036a的OsPPKL1缺失了3bp核苷酸,OsPPKL3有1bp核苷酸插入;418C026014a的OsPPKL1缺失了3bp核苷酸,OsPPKL3有2bp核苷酸插入(参见图1和图2)。
实施例5编辑株系的籽粒、株高表型鉴定
将OsPPKL1基因及OsPPKL3基因同时编辑的T2代纯合编辑株系及其对应的对照种子(WT-X12或WT-X32)进行表型拍照(图5)。从图中可以看出,无论是稻谷还是米粒,粒长方面,编辑材料都明显大于对照。
用近红外仪对籽粒糙米的长、宽进行测量,并算出长宽比(表1)。
表1 OsPPKL1基因及OsPPKL3基因编辑株系与野生型株系的籽粒糙米的长、宽及长宽比
由于粒型明显增大,编辑材料稻谷的千粒重也明显增大(表2)。
表2 OsPPKL1基因及OsPPKL3基因编辑株系与野生型株系的籽粒稻谷千粒重
材料 | 千粒重(g) |
407C008004a | 26.7 |
418F003063a | 26.2 |
WT(X12) | 23.3 |
418C037003a | 20.13 |
418C026036a | 22.87 |
418C026014a | 22.44 |
WT(X32) | 18.27 |
另外X12背景材料中OsPPKL1基因的不同编辑情况,其株高表现出不同表型,其中OsPPKL1基因缺失6bp的材料(即418F003063a)表现出株型比对照矮的表型(图6),其他编辑材料的株高与对应的对照相比,无显著差异。
表3 OsPPKL1基因及OsPPKL3基因编辑株系与野生型株系的株高
从上述结果中可以看出,本申请利用CRISPR/Cas9基因编辑技术获得了OsPPKL1基因及OsPPKL3基因编辑突变体。与TALEN和ZFN技术相比,CRISPR/Cas9基因编辑技术操作过程简单方便,节约成本,一般实验室都可操作,是一种快速获得籽粒粒型增大增长和/或矮杆水稻品种或种质资源的生物技术育种方法。
虽然,上文中已经用一般性说明及具体实施方案对本申请的技术方案作了详尽的描述,但在这些技术方案的基础上,可以对之作一些修改或改进,这对本领域技术人员而言是显而易见的。因此,在不偏离本申请精神的基础上所做的这些修改或改进,均属于本申请要求保护的范围。
序列表
SEQ ID NO:1:野生型水稻OsPPKL1基因的核苷酸序列
ATGGACGTGGACTCCCGCATGACGACGGAGTCGGACTCCGACTCCGACGCCGCCGCCACCGCCGCCGCCTCCGCCTCGGTTGCTGCGCAAGGAGGCTTAGCGAGCGAGACCTCGTCGTCGTCGTCGGCCTCGGCCCCGTCCACGCCAGGGACGCCCACGGTAGCTCCGGCTCCTGCGGCCGCGGGAGCCACGGGGCCCAGGCCGGCGCCGGGGTACACGGCGGTCAGCGCTGTGATCGAGAAGAAGGAGGACGGGCCGGGCTGCCGCTGCGGGCACACGCTCACGGCGGTGCCGGCCGTCGGGGAGGAAGGCACGCCGGGGTACATCGGCCCACGCCTCATCCTCTTCGGGGGCGCCACGGCGCTCGAGGGCAACTCCGCCACGCCACCCTCCTCGGCCGGCAGTGCTGGGATCCGTACGCTTTCTATCTAGTATCATATATATGCCTAATAATTTCTGCTTTTTTCAAATGTCGATTGGTCGTGTCTCGTGGATATTAAGCGACGCTCTTGGGTGATCATGATGGCAACCTGGAGCTATTTTTATTCCCAGTTTGCAATTTCATTTTGAAATGAATTTGGAGGCTTCAGGAGTTTTACTGAGAAATCATTTGCTATTTTGTGGGTAATTCGGGTCGGTCTATACCTCGATGGCTGGAAGCTCATAGCTGGCTTGAGCCTTTACCATCAGAGGATTTAAATAGGCATAGCATCATTTCAGACACCATTTTGACTTAGGAATTGCGCTTGAGCTGTTTCTGGAACCGAATATCATTGGTTGAATTAGTGAAGATGGGTGCTGATGGCTGATTCCAGATGCCATGGTCATGGGAGCTGAAATGTTCGAACTGTTTCATATATCAGTCGTATTGCATAATTTTGAATTCTTATTTTTGAACGTATCCCCTGATTCACTGGGCATGCGCTTTCATGGACCTCTCATCTAAAGTTTCTAACTATCTTCAATGTTATGCTTAGCAAGTTGGAAGGAACATTTACTGCATTCTAGATTTTGTGTCATCGTGTTGCACATTGTATGACAGTAGGAGAAACCTTTTTTTTTTTTGTTTTTCCTTTTGGCAAGTTAAAGTAGGTGAGCGGAACTTCTAAGGCACGACTCTGATATGAAATTGAAGGTTTTCTTTTTTGGGAGCTATATTATTGCTGATGTAATCTATTTAAAGCATATATACAAAACTAGTTGCTCCATTGTGTACAAAATTGTTTTCGGTTAAAAAAATGTTATCTGGTATACAATGCTATTTTTACTCTCCAGTTAGTCACAAAGGTGGATTATTCATTGATCTCTCCTTGGTTTATAGGTCTTGCCGGTGCCACAGCAGATGTCCACTGTTATGATGTGTTATCAAATAAGTGGAGCAGGTAGGATATTCCATAAATGGTTTCCACCTTCTTTTGAGCATTTAATCGTTGCGTTGCTACTTTGGTTGACTTTAGCAACTTTTGTTTTAGAGTAACACGATTATTTTGTTGGTAGCATAAAGCAACTCAGCAAACTTATTTAATTGCGCCACCACTTTGGCAGGCTTACTCCACAAGGTGAGCCTCCTTCACCAAGAGCCGCACATGTAGCAACTGCAGTTGGAACCATGGTTGTCATCCAGGTATTTCTATATTTTACCTGTGGTTTCTGAGATTTATTACCTAGTGCTTAATTAGTATCAGAAACAATTACTTCCCAAGTAACCTGTTTTATGTACTCTTTGATTGTTTACATAAATTTTATGGAGGTAACTTCTAAATATGGTTTGTTTTTGTTTCATCATTCGTTGCAATGCAGGGTGGCATTGGCCCTGCTGGTTTATCTGCCGAGGATCTTCATGTTCTAGACCTTACACAACAACGACCGCGATGGCACAGGTAAACTATTATTCTGTACTGTGAGTGTGGAGTTGCATTTCTTCAGCACATCTTCTGAGGTGCTAATGTTTCTATGTAGAGTGGTGGTTCAAGGACCTGGTCCAGGTCCACGATATGGGCATGTCATGGCCTTGGTTGGACAGAGGTTTTTGTTGACAATTGGTGGAAATGATGGTAAGCAATTGCTCCCAGATGCACCTGTCAAATTCAAACTAGAGAACCTTTGCACTTGGATCTATGCTGAAATGGTTTCTATAATGGGATAATTTTTATTCTCTAGGGAAACGGCCGCTGGCAGATGTATGGGCTCTAGATACTGCAGCTAAGCCATACGAATGGAGGAAACTTGAACCAGAAGGTGAAGGACCGCCACCATGCATGTAAGTTTGCTAGCTGAAACACATTACAGTAATTTAATTATTGAGGATGGTGACGTAAAGTTTAATAGCAATTCCTGGGGAGGTATCCATTATATTACATTACTTACCATGTGTTGCATGTTACTTCATTAATTTAATATCAAATTATCCACAATGCAACCAGCTATTTACTTTCTTGGTTCAGAATAACATACGGTAGCACAGTTAACCCACCATATTTCCCAATTAGTTGTCAGGCATGACCCTCAGGTTATTTTCCTTTATTTTTTTTACTGGAAGTAGAGCTAGAACTAGGTTTGGCATCCTTTTGCTAAGTATACTGCATGGTGTTGTTGTTCAGGTATGCCACTGCCAGTGCACGCTCTGATGGTCTTCTTTTACTTTGTGGTGGGAGGGATGCCAACAGTGTGGTAAGTAAAATAATCAACTTATATAAATAAAGATAATAAATAGTCTAGTAAATATTTCGCTGATCAAAATATGTAAAGAACTAAAACTAATGGAATTAGAGACTTGAATGTATAACACTTGGCTGTCAATTTTACCATCTCAGTGATTGGATGTCATACTGTTGTAACAAGGTTCTATATAGCCTGTGATGTGAGGACTGCAATTCTCTCTAGATTACACCTTCAAATTTTGTATCTAAAATATTGAAACAGCAGTGATGTGGCAATTGGTGGATTTCCATGATCTGAAAAATAAGAAAGCTATTACTTTGTTATCAGTTTCATCTCATGTAGATCAAGAGAAAATGAACATAATTATGCATGGCATATGGGCATCTTGTCATGTAACTTACACAGACATCAGTTGCCAAGTAGCAAATATTGAGTGACTGGAACTACAAGTAGCAAATATTTGTTTTCTATAACAAAGATGTCTAGTTATCCTTATATTCGGCAAGTAAAATGATACTCCCCCTGTTTCACATTATAAGTTTATAGAAAAATTTAACAACATCTGCAACACAAAATTAGTTTAATTCAAGCTAACATTTAATATATTTTGATAATATATTTGTTTTGTGTCAAAAATATTGCTAATCTTTTTTTTATAAACTTGTTCAAACTTAAAAAAAGTTTGATTAGGGAAAAAAAACTCAAAATGACTTATAGTATGAAACGGAGTATATTTTAGTCACACCATACTAGTTGTACATCAAGCTATTTTCATTTGAGATGTCAATTTGATTTAAATTACTAATAATTACCTGTAGAAGTTCAATATCAATAATAGGATTTCTCTAATGTCTTTGTATACATGTTACTTCTGTAAACCCATCAGCCGCTGGCAAGTGCATATGGTCTTGCAAAACACAGAGATGGGCGATGGGAGTGGGCAATCGCCCCTGGTGTTTCTCCATCACCCAGATATCAACATGCAGCTGTAAGCTGTTCTTTCATTGTTTTTTCATCTAAAAGGTGTACCCACTTTGGCTCTTATACAATGCTTGATTCATCTGTTGTTTAGGTTTTTGTGAATGCACGTCTCCATGTATCAGGAGGAGCTCTTGGAGGTGGTCGGATGGTAGAAGACTCCTCAAGTGTTGCAGGTTCTAGTTTTAATCTTGAAATAACTGACATATTTCTTAAGGTATTCATGAAATACCTTAAGAAATATCTTGCACTGACATTAACCAGAAGAATAATCTTGAGATGAAATAATGTAGCTCTTTTAGTTTGCTGGATGCTCAGCATATAAAATACTTAGTTTTTGTTGGTGTATGTGCAATATTTGTTATCTTAATAAGATCTAGATTGCAGAAAAGCACTCAGATACTTGAATAGTTTAGTTGCTTTCAGCATTGGCCACTCATGCACCATAACTACACTTGCTAATTATGTTAATCTATATGACTGCATGTGAGTATTACAAAAGCATTTCCTGTCTTTAGCCTTTTTTTACTCATCACTCTGTGATGACATTGCCAGATTTTCTTCCTTTTTTTGGTAGCATTGAGATTCTCTTGTGTTGAGGTATAGGTAATTGTTGGTTTCCTCTAGAATGACAATATGCACTCCTATAAATTTCTTATAAGTTGCACGATTCTATCTGGTTCAGTGCTAGACACAGCTGCTGGAGTCTGGTGCGACACAAAGTCAGTTGTCACAACTCCCAGGATAGGAAGATATAGTGCAGATGCAGCAGGGGGTGATGCTGCTGTTGAACTTACACGGCGGTGCAGGCATGCAGCTGCTGCTGTTGGTGATCAAATATTCATTTATGGAGGTCTACGAGGAGGTAAGAAAACCTGTTTGTTTAGTATTATGCTATAGGCTTTGAGCCTTTGCTGGGCCATCAAGTCATCATTGTACAGTTCAGATTCAAGTTATTTGTGGTGTTATTTCTGATTTGATGGGGATAACTTGGCCCCACTGCCACATGATATGCAACTTGGATGCCACATGATATATATTATAGTTGGCATACTTTTCGTTTGGATGAAAACTTTTAGTGGACTGGAAATTGTCAAGTTTGACATTTGTTCTCACCGTTGGATTTTATGCACCTTCAATTTGCTTTCTAACATCAAACGACTGCCATTGTAGGGGTACTGCTAGATGATCTTCTTGTAGCTGAAGATCTTGCTGCTGCTGAAACGACAACCGCTGCTAATCATGCAGCGGCAAGTGCAGCAGCCACTAATGTACAAAGCGGAAGAACACCCGGAAGATATGCTTATAATGATGAGCGAGCGAGACAAACAGCTCCAGAATCAGCTCAGGATGGATCTGTAGTTCTTGGAACTCCAGTTGCCCCTCCTGTCAACGGTGACATGTACACTGATATCAGCCCTGAGAATGCTGTGCTTCAGGGACAGAGGTTATTTTCAATTTTTTTTCCCTATGAATATGTCCTGGTGATATTCCTATGGCAAGTTACTATTAGTTTTGTAATTGCTATTTTCCTAAGGTTTCAATATTAATTTCTCTACAGGAGATTAAGCAAAGGTGTCGATTACTTAGTTGAAGCATCAGCAGCAGAGGCAGAGGCTATTAGTGCAACTTTAGCTGCAGTGAAGGCTAGGCAGGTTAACGGTGAGATGGAGCAATTGCCTGACAAGGAGCAATCTCCAGATTCCGCATCGACCAGCAAGCATTCAAGCCTCATCAAACCAGACAGTATTCTCTCTAACAACATGACGCCTCCACCTGGGGTTCGGTTGCATCATAGAGCTGTGAGTAGTCTGTATGCAGTTTGTTACATTGTCTGCAGTTCATTTGTCAGTTTTTTTTTATGTTATAATATTGCTGTAGATACTTGTCTTTCTAGCGACTAAAAACAATTGTATTATCTCTTCATTTTGCATCACATGAGGTGATATCCTCTATTTTTCCTTTTACCTCAGGTTGTAGTGGCTGCAGAAACTGGAGGTGCCTTAGGTGGCATGGTCAGGCAGCTGTCAATTGACCAATTTGAAAATGAGGGAAGAAGAGTCAGCTATGGCACCCCTGAGAATGCAACTGCTGCAAGAAAGTTGCTAGATCGACAGATGTCCATTAATAGTGTTCCTAAAAAGGTATCATGGAGAATTTGGTCCATTCAACTGCTTTGGCACTAGGATTTAGTGATGCAGAAGCTGAATGCACAATATAGCCATGATTGAGAAAAGACTTAGCTAGTTACTACGCGTATTCTGAACTACTTAATTACTTTGTTCTATCTGCTGCGATACTTATCCAGGTGATTGCATCTCTATTGAAACCTCGTGGTTGGAAGCCCCCTGTGCGAAGACAGTTCTTCTTGGACTGCAATGAGATTGCAGATCTATGTGATAGTGCTGAGAGAATATTTTCAAGTGAACCAAGTGTTCTGCAACTTAAAGCTCCTGTTAAGATATTTGGTGATTTACATGGTCAATTTGGTGACCTCATGCGATTGTTTGATGAGTATGGTGCTCCTTCAACAGCAGGAGACATTGCGTGAGTTCTTGCTTTGAGAAACTAGTAATCACACACTTTTTAAATTTTGCTTCAATTGTGATATTTTACTTGCCAAATTAACTTGAAAAGCCTTGTTATTAATGTGTTTAATGAATTATCTTAGGCTATCTATACATACTCCCTCCGTACTCGTAAAGGAAGTCGTTTAGGACAATATTTAAGTCAAACCTTGGGAATATAAATCATGAATAACTCTCAAGTTGTTGAGTTTGAAAATGTAAAAATTATATGAATAGATTTGTCTTGAAAAATATTTTCATAAAAGTATACATATATCACTTTTCAATAAATATTATTATAGAAGCAATAAGTCAAAGTTGTGTTTTGGAGACCGTGTTGCTGTCCAAAACGACTTCCTTTACGAGTATGGAGGGAGTATGTACTTTTCTTTTCTTTTTTGGTTCAGAAGGTATTTCCCCACCTTTCTTGGTCTTGTGCTGATCCATAATTTTGTATGAACCTGCAGTTACATTGATTATCTCTTCTTGGGTGATTATGTGGATCGTGGCCAGCATAGCTTAGAAACAATGACTCTTCTTCTTGCATTGAAGGTATTAAGCTGCCTACTTACCTTGATGCCACATATACTTACTGTGTTGTGACGATTTATTAAAATCGTAATATATAAACAATTGTACATTTCAGGTTGAGTATCCTCAGAATGTACATTTGATTCGTGGAAATCATGAAGCTGCAGATATTAATGCTTTGTTTGGCTTCCGAATAGAGTGCATAGAGCGAATGGTAATCTATTTTATTCTGCATCTCACGTTACAACAAAACCTTTTCCTCTTTTGCTTATGAGGATGTTATTTGCTCAAAAAACAGGGTGAGAGAGATGGTATTTGGACATGGCACCGAATGAATAGATTATTTAACTGGCTTCCTTTGGCTGCACTTATCGAAAAGAAAATTATATGTATGCATGGTGGAATTGGTCGGTCAATCAACCATGTTGAGCAAATTGAGAATCTTCAGAGACCAATTACCATGGAAGCAGGCTCTGTTGTCCTTATGGATCTTCTATGGTAAAACATTTCAACAATAATTGCTATTTAACCTTCCATGAGCATTTATTTATGTGCCATGTTCAGATGTTCTCTTTATGATGCTTATAGGTCTGATCCAACCGAGAATGACAGTGTTGAAGGATTGAGACCAAATGCTCGAGGCCCTGGCCTTGTTACGTTTGGGGTTTGTGTTCCCTAACCCTAACCTTCTGAATTCGTCTTCCCTTTGTTGACCCCCTTTGTTCTCTGAAGCTAACATTTGCTGTTCATACAGCCTGATCGTGTTATGGAGTTTTGCAACAATAATGATCTTCAACTAATTGTGCGAGCGCATGAGTGTGTGATGGATGGCTTTGAGCGCTTTGCTCAAGGTCACCTGATCACTCTTTTCTCTGCAACAAACTATTGTGGTATGAATTATTCTAAAATATTCTTTTTTCAAACTTTTTTGCTGGATTTCTACGATTGCTTGACATGGTATGCTCACAGGTACTGCAAATAATGCCGGTGCTATCTTAGTTTTGGGCAGAGATCTTGTGGTCGTTCCAAAACTGATTCATCCTTTGCCCCCGGCAATCACATCACCTGAGACCTCTCCGGAGCATCATATTGAGGACACATGGATGCAGGTAATACTATTTTGTTGCAAAGATATTCCTGTTTGTAAACTAACTGGTACTACCACCTTCCTCGTAACACGGTAACATTTGAAATGAAATTCAGGAGCTGAATGCAAACAGACCACCGACTCCAACAAGGGGCCGCCCCCAAGTAGCAGCTAACGATCGAGGTTCTCTTGCCTGGATATAG
SEQ ID NO:2:野生型水稻OsPPKL1基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAATAAASASVAAQGGLASETSSSSSASAPSTPGTPTVAPAPAAAGATGPRPAPGYTAVSAVIEKKEDGPGCRCGHTLTAVPAVGEEGTPGYIGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVLSNKWSRLTPQGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTAAGVWCDTKSVVTTPRIGRYSADAAGGDAAVELTRRCRHAAAAVGDQIFIYGGLRGGVLLDDLLVAEDLAAAETTTAANHAAASAAATNVQSGRTPGRYAYNDERARQTAPESAQDGSVVLGTPVAPPVNGDMYTDISPENAVLQGQRRLSKGVDYLVEASAAEAEAISATLAAVKARQVNGEMEQLPDKEQSPDSASTSKHSSLIKPDSILSNNMTPPPGVRLHHRAVVVAAETGGALGGMVRQLSIDQFENEGRRVSYGTPENATAARKLLDRQMSINSVPKKVIASLLKPRGWKPPVRRQFFLDCNEIADLCDSAERIFSSEPSVLQLKAPVKIFGDLHGQFGDLMRLFDEYGAPSTAGDIAYIDYLFLGDYVDRGQHSLETMTLLLALKVEYPQNVHLIRGNHEAADINALFGFRIECIERMGERDGIWTWHRMNRLFNWLPLAALIEKKIICMHGGIGRSINHVEQIENLQRPITMEAGSVVLMDLLWSDPTENDSVEGLRPNARGPGLVTFGPDRVMEFCNNNDLQLIVRAHECVMDGFERFAQGHLITLFSATNYCGTANNAGAILVLGRDLVVVPKLIHPLPPAITSPETSPEHHIEDTWMQELNANRPPTPTRGRPQVAANDRGSLAWI*
SEQ ID NO:3:野生型水稻OsPPKL3基因的核苷酸序列ATGGACGTGGACTCGAGGATGACGACGGAGTCGGACTCCGACTCGGACGCCGCGGCGCAGGGGGGAGGAGGAGGAGGGTTCGGGAGCGAGACCTCCTCGGCGTCGCCCTCGGCGCCCGGGACGCCGACGGCTATGGGGGCAGGAGGGGGAGCTGCTCCTATCGCTGCTGCTGCTATCGCTGCCGCCGCGTCGGCGGCGGTGGTGGCGGGCCCGAGGCCCGCGCCGGGGTACACGGTGGTGAACGCGGCGATGGAGAAGAAGGAGGACGGGCCCGGGTGCCGGTGCGGCCACACGCTCACCGCGGTGCCGGCCGTCGGGGAGGAGGGCGCGCCGGGGTACGTGGGGCCGCGGCTGATCCTCTTCGGCGGTGCCACCGCGCTCGAGGGGAACTCCGCCACGCCGCCATCCTCGGCCGGGAGCGCCGGAATCCGTAATGCCCTTTTAAACCTCAGCTGCTCTTGTTTTCATGTATGGTTTGGGGGGATGGTGTTGGCGTGCTAATTGTTTAGTTTGCTGTCGACGAGAGCCAAACTCCATGGGAGTTGGAATACTTGGGTTTGTACTGAATCTTGCTCGGGACTTTGGAGTGATATGCTGGTTCGCGTACCGACTGATTTAGGGATATAACTTACTGTTCGGGATAGCAGCTAGAGTTGATTGTAGCTCTGGTGTTTCAGTGTCTTAGTTAGAGGTAGAATTCGGAGCATCTGCTGATGGTGGTATTGCTAGTGTAGGCTGGATTAGGTTAAAAACTCGTACTGGGAGGGATTTTCATGTTCTGCCTGGTTAGGAAAGCGATTATGAGTGGCACGTTATGACAATATATGCAGTGTAACTCATGTGCACATGCTGTTAATTGGAGGGCCAAATATGGTGCTCTTTTATGATTTGAGGCACAAGATTTATTTAATGAACAACGATACTCGCATACTTCTGTTGCGGTGAAGTACCAGTTCCTTTTAATGTTTGTTGGTTGTTTCCTTTGGATTATGATTGTCTTTGTATGTTAGAAGAAGCACTCAAAGGTTTCAAATTGGAAAAACATTTATGTGCTTAGACTGTACTGATGCCCTTGGATACCCTAGGGCCTAGGTTAAGGCGTATGGGTTAATTAATTAGGTGAACTATGCCGTTTCAAAAGTTAAGCCTGAAATGCTATTATACGCATTCCGGTTCTCCAAGTCTTATTGGGGGATTAGACGTTTTGGTAGACCGGGTGCCAAATGTATTGTTTCAAAACTGAGCTTAGAAATGTTGGAGGTTTTGCTTGTGATGAAAGGGAAAGTTTATCTTCTTTATAGGCAAATCAGCAAAATTGACAAGATCTTACATAAAATTATCAATCTCCTTTCTTGCAGGTCTTGCTGGTGCTACTGCGGATGTGCACTGCTACGATGTTTCATCGAATAAGTGGAGCAGGTATGTTCTTTTTAGTAAGAGCCTTTTTTAGTAAATGTTTTAATTCCTTTGGATATTATGCTTGGATGCCATGTGTAAGTTTTTCTTTAGATCCCATGAAATTTTATCTCAATTGCACTTGTTAGTTCTGTGCACTACTACATTTGATTGACCACAGTAAATGTTGTCAGGCTTACTCCAGTTGGTGAACCTCCTTCACCAAGAGCGGCACATGTAGCGACTGCTGTTGGCACCATGGTGGTCATTCAGGTAGCTTTAATCTTTTTGCCTTAGAGTAGTAGCTATGTCAGTTTTATCGTCAGGCCAGCACTAACCTGCTCTCAACTACAGGGTGGGATAGGTCCTGCTGGTTTATCTGCAGAAGACCTTCATGTTCTGGATCTTACACAACAACGACCGCGATGGCACAGGTGGCTGCTGCTCTAATTTCGTTTGGGTTTAGAATATGCTTTTGCTGCTATCCCTAGTGAGCATTATCATGTCTTGTATGTAGAGTGGTGGTTCAAGGGCCTGGCCCTGGTCCTCGATATGGACATGTGATGGCTTTGGTTGGGCAGCGTTTCTTGTTGACAATCGGTGGAAATGATGGTAGGAGATTGCTTATGTGCCTATGAAACCTAGATTTTGGCTACTTAAGATATTGTTCCATCCCAAATGATGTTGATACTAAGAGTGAATAAATGTTTCAGGGAAGAGGCCCCTGGCTGATGTGTGGGCCCTTGATACAGCAGCTAAGCCATATGAGTGGAGGAAACTTGAACCAGAAGGTGAAGGACCACCCCCATGCATGTAAGTGTGCAGATAATAAAAAAGGAAACATCTCCCCTTTGAAATCTAGATCAACGAGGTTCTTTGGTGATGTTGAGTTTAAATCTTGCTATTTTCTTGGCAAGATTCCCCAGCTACATCCATGAGCCAACATATGATAATGCCTATATTTTCTAATGAAGCTTTCAGTGATTGTACTTTTGTTTGATGCCCATATCATATATAGTAACCTGTTTTGGCACCATTAGACTACTTTTAGGGCTCCACTTTCTTTTGTTGGCCAAGAAGTTACAGAAAACTTTGGTTATTTACAACTCAATATTGATAGTTTGTTCAAAAGCAACTGCCAATTAAGAACAAGCTACAAGTGTTTCGTTAAGTCTACTTGGATGCAGAACACAGTAAAGTGTCAAACCTAATATCCTAGGGTAACCCACCACTAGCTATTTTCTGTAATTTCACTTACGCTTTTCAATAGATTGCTAGAAGTTGAGATAGTTTTTGTATACTTCTAATAGTCTGTGCCTCTGGTAATGTTGCTCAGGTATGCAACTGCAAGTGCCCGTTCTGATGGACTTCTATTACTCTGTGGTGGGAGGGATGCTAATAGTGTGGTAAGTACCATTCTATTTGTGAATAGCTTTTGCTGAGGAAATTTTGTTCTAATAGGAATATAGGGTTGAAATTCTCTGTTCTTAGTTTTTAGATTTGAATCTGCTCTGAAGTCATAACACGAGAATGGTTCATATAGCTTTTTTGGTAAGTGATATTTATAAGCATTGCATGCTTGTGCCAACAAAGCATATTTATTTTTTCTCATGTTACATCTGCTTCTTTGGATGAGTGAGGAATTGTGATGTTAAGTGCACTGAATCCATATGGCATGCTTGAAAGGCCTGTTTTGTTCAATTTCACTTGTAAAGCCATGCCCTGCAAATAAAAAAAATAGAATCGTCTGGGCTGGAAACTGAATATGGATTCAACCGATTAGTTTTCATTAGATGGTAAGTGATTAGCATTTTGTAGGTGAATTGAAATAAGGATATATATGGACTTTTCTTTTTGGTAATTACGATTTTGGAGAGGCAGTGACTTCTATTTGTATGTCCTACACTGTTTTACTCCAAGGCTCAAGCATATTTGTCAAAGCTAACTATGTAGTTTTTCCTTTTCTTTTGGGCTAAAACCATCAGCCCCTAGCAAGTGCATATGGCCTTGCAAAACACAGGGATGGACGCTGGGAGTGGGCAATCGCCCCTGGTGTCTCTCCATCACCCAGATATCAACATGCAGCTGTGAGTTGTGAATATCCTTTCCCTTCAATTTTCTTTATTTTTCATCTTGAGATGGATCCCACTGCATGGTTTGTGTCATGTTTTATTTCCTTTTGTTTCTTTAGGTTTTCGTCAATGCACGACTTCATGTCTCAGGAGGGGCCCTTGGAGGCGGTCGCATGGTTGAAGACTCCTCAAGTGTTGCAGGTTCCTTACTATTATTTGCTTTGTATACTTTGCACCTGTTAGTTCATGTGGCATCCTTTTGAAAAATCTCCCACCTTTTGTGGCAACTCGAAACATTTCAAGCAATTTCTTTTGATTGTAAGTAGGATAAAGACCATATATGATTAATTATTTCAATTACTAAGTTCAATATATTGTTTTCTGTTTGCTAAATTACCCATTTACAATTCATCAGCAGAGAACAAGATGCATGCTGTTATTGATTGTCTCTGCAGTTCATATATCTTGTATTATTGAAAAAAGTAAATGGTCTGAATAATACTGGAACACAGTCGGTAGTTTAAGCATTGGCCAGTTGGTCTTAGTATGCATGTATATTTTCTTGTTTCCCTAGGTTCTTATTTTTAAGCCATGCTGCCCTTTTTCAGTGTTGGATACTGCTGCTGGAGTGTGGTGT GATACAAAATCAGTAGTTACTACTCCAAGGACAGGAAGATATAGTGCAGATGCTGCAGGCGGTGATGCTTCTGTAGAGCTTACACGACGGTGCAGGCATGCAGCTGCTGCAGTTGGTGATATGATATATGTTTATGGAGGTCTACGGGGAGGTAAGACACTTCAGTTTCTTAGCCTCATCTGATTTCTGACACCAATTCTCATGTTTGAATTGCGATCCGCTGTCCTATGAGTCCTGTTCAGCTGTATTCAGCAGCACACGTTTTTTAAGTAGTAAAGATGTTATCCGATGAGGAACATGTATATTCTTTCTGGGAAGATAAGGAATTTAGGATATTAAGATATCAGTACAAATTTAATTAGCATTGACTATGTTTACATAGCAGATCCAAAAGGTTTTTAAGGTGATCCGGATTTCAGCTTAATGGATTTCCAATTAACATTAGTATTATTACAATATCCTTATCTGAATGAAAAACTACTCCTGGAGCTAACTGTTAACTAACTTTTTACACTTGCATCACTTTTTTGAGGGTTGACTTTGGGTTATCAGAGCGAAATTGCCTCCTATTCAAATTAGGATATATTCATATTTTTGTTGATGCTAAGTTTTGTATAGTTGCAATTCTAAATTCCCTTTATTGTTACTCCAGGTGTGCTGCTAGATGACCTTCTTGTTGCCGAAGATCTTGCT GCTGCTGAAACAACAAATGCAGCAAATCAGGCAGCTGCAATTGCTGCAGCCTCTGACATACAAGCTGGAAGAGAACCTGGTAGGTATGCCTACAATGATGAACAAACAGGGCAGCCAGCTACAATAACATCTCCTGATGGAGCTGTGGTTCTTGGAACTCCAGTTGCTGCTCCCGTTAATGGGGACATGTATACTGATATTAGCCCTGAGAACGCTGTGATCCAGGGACAAAGGTTTGTAAAATTTCGTAGACATGTTCCGATAGTTTTTATTTTATTTTATTTTATTTTTTTTGCATATCTATCAGTCTATTGAGTAGAAGCTACATGCACTGAAATGTGTCTTTTGGTCTTGATTTGCACAGGAGAATGAGCAAAGGTGTTGATTATTTGGTTGAAGCATCTGCTGCAGAGGCTGAGGCAATCAGTGCTACTTTAGCTGCTGTAAAAGCTCGGCAAGTTA ATGGTGAGGCAGAGCATTCACCTGACAGGGAGCAGTCTCCAGATGCCACACCAAGTGTGAAGCAAAATGCAAGCCTGATAAAACCAGACTATGCTCTTTCAAATAATTCAACACCACCTCCTGGGGTTCGGTTACACCATAGAGCAGTAAGTAGAAAATGCAGTTAGTTTTACCGAGTTAATTCGTTACTTTATTTGCTTCTCTAGTTAATTGATAAATTCAAAACCAGATCATGATCTTTCCTTGTTCTACTGTTGAAATATATTGTATTACAGAAGCTGTCCCTGAACAACCTAGCTCAAGTTAGTTACAGCATTGTGAACTTTATTTTTTTCTTGCTCTTGTAGGTTGTGGTAGCAGCAGAAACCGGAGGTGCCTTAGGTGGCATGGTTAGGCAGCTCTCAATTGACCAGTTTGAAAATGAAGGAAGAAGGGTCATCTATGGCACTCCTGAGAGTGCAACTGCAGCAAGGAAGTTGCTAGATCGGCAAATGTCTATTAATAGTGTACCTAAAAAGGTATCTTACTTTGCAGATGTAATTTGACAAAAAAAAACAGCAAACAATTGCATGATTATCTGCTAAAACACTAAGGCAATGGCATACACTGGTGTGTGTTGTTGTCAGGGGGCATAAATGGAAGTCATGATAACCTTTGCCTTTAAACTTCTATGCTATTGTGCTATTAAGTACTCACATATTTCTATTTGATGACAAAATTATTCAGGTAATTGCATCCCTCTTGAAACCTCGTGGTTGGAAGCCCCCTGTGCGAAGGCAGTTCTTTTTGGACTGCAATGAGATTGCAGACCTATGTGATAGTGCGGAAAGAATATTTTCAAGTGAACCAAGTGTCCTACAGCTTAAAGCTCCAATTAAGATATTTGGTGATTTGCACGGTCAATTTGGTGACCTAATGCGCTTGTTTGATGAGTACGGTGCTCCTTCGACAGCTGGAGACATTGCGTGAGTATTTTCTGATTTTGAACTGCCCTGATTCATTTACTTATTCATTCCAATTTTGGGAACTTTTATGGCATTTTATTGGATGACAAAGCTGAAAAACCTTTTTTTTTTCAAAAAAGAACCTCTAGAACACTTTATGTTAAGTTGTTTTATTTTCAGTTATATTATTGATACTTTATCTGAACTTCTTGATTTGCGTATATAGGCTTTGGGTTAGAAAATGTTCGGTTAGAGGAGATATTTTCAATCGATCTTTCCTGACTTATATGTTTTCCTTTTGTTTTTTTTGTTCAAAAAACTTTGCAGTTACATTGATTACCTCTTCTTGGGAGATTACGTGGATCGTGGTCAGCATAGTTTGGAGACAATCACTCTTCTCCTTGCACTTAAGGTAAAGTTGTGAACAACTGAGCGAACCCTGAAAAAGTGCTATTTCAGCAGACAGTTCTTTTTCTCTCAAAAATGTAACAATTGATCTTATAATTCAGGTTGAATATCCTCTTAATGTACATTTGATTCGAGGGAATCATGAGGCCGCTGATATCAATGCTTTGTTTGGGTTCCGAATAGAGTGCATAGAGCGAATGGTAATGACCATGTTGCATTCTGACATTCTTATTAGGCTGTTTTTTATCTCTTGCTTATCACTGTTCTTTGCTAAAAACAGGGTGAGCGTGATGGGATCTGGACCTGGCATCGCATGAATAGGCTATTTAATTGGCTTCCTTTGGCTGCCCTAATAGAGAAGAAAATAATATGTATGCATGGTGGTATTGGCCGGTCTATCAATCATGTTGAACAAATTGAGAATCTTCAAAGACCAATTACCATGGAAGCAGGCTCGGTTGTTCTAATGGATCTTCTGTGGTCAGCATTTCAATAGCTCTTTGATACTACTATTGCAAGATTGTTCTTTATTTTTCCCTGATATAACCTATTTTTTAAATCTTAAAAGGTCTGATCCAACTGAAAATGATAGTGTTGAAGGACTTAGACCAAATGCCCGTGGCCCAGGCCTTGTTACCTTTGGGGTTAGTATTCTCTGTGTAATGTTGTGTCCAGCTCTTTTCTCCCTGTTGGTTCGTGTGGTTTCTGTCAAAATAACACGTATTGTAACTGCAGCCGGATCGTGTTATGGAGTTCTGCAACAACAATGACTTACAGTTAATTGTGCGAGCACATGAGTGTGTGATGGATGGTTTTGAGCGTTTTGCTCAAGGTCACCTAATCACTCTCTTCTCAGCAACAAATTACTGTGGTAATGATATGCTATTCTGACTTGGTCTGTTCGCGATTATCTCTTCAAGTTTGATTGTCCTGTTACTTAATATGATATGTATGCAGGTACGGCAAACAATGCGGGTGCAATCTTGGTTCTAGGAAGAGATCTTGTAGTAGTTCCGAAACTCATCCATCCTTTGCCGCCTGCCATTACATCTCCCGAGACCTCTCCAGAGCATCATCTTGAGGACACATGGATGCAGGTAACTCCTCTTTTATCATGTGATGCTCTTGGATTTGCAATAGGGTATTCCTGTCTATACGCTTCTAAGTTCTAACACCATCTTGTGTTAAAATCAGGAACTAAATGCCAACAGGCCGCCAACTCCAACAAGGGGCCGCCCTCAAGCAGCAAACAATGACCGAGGCTCTCTTGCATGGATATAG
SEQ ID NO:4:野生型水稻OsPPKL3基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAAQGGGGGGFGSETSSASPSAPGTPTAMGAGGGAAPIAAAAIAAAASAAVVAGPRPAPGYTVVNAAMEKKEDGPGCRCGHTLTAVPAVGEEGAPGYVGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVSSNKWSRLTPVGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTAAGVWCDTKSVVTTPRTGRYSADAAGGDASVELTRRCRHAAAAVGDMIYVYGGLRGGVLLDDLLVAEDLAAAETTNAANQAAAIAAASDIQAGREPGRYAYNDEQTGQPATITSPDGAVVLGTPVAAPVNGDMYTDISPENAVIQGQRRMSKGVDYLVEASAAEAEAISATLAAVKARQVNGEAEHSPDREQSPDATPSVKQNASLIKPDYALSNNSTPPPGVRLHHRAVVVAAETGGALGGMVRQLSIDQFENEGRRVIYGTPESATAARKLLDRQMSINSVPKKVIASLLKPRGWKPPVRRQFFLDCNEIADLCDSAERIFSSEPSVLQLKAPIKIFGDLHGQFGDLMRLFDEYGAPSTAGDIAYIDYLFLGDYVDRGQHSLETITLLLALKVEYPLNVHLIRGNHEAADINALFGFRIECIERMGERDGIWTWHRMNRLFNWLPLAALIEKKIICMHGGIGRSINHVEQIENLQRPITMEAGSVVLMDLLWSDPTENDSVEGLRPNARGPGLVTFGPDRVMEFCNNNDLQLIVRAHECVMDGFERFAQGHLITLFSATNYCGTANNAGAILVLGRDLVVVPKLIHPLPPAITSPETSPEHHLEDTWMQELNANRPPTPTRGRPQAANNDRGSLAWI*
SEQ ID NO:5:水稻OsPPKL1基因中的靶序列
AGTGCTAGACACAGCTGC
SEQ ID NO:6:水稻OsPPKL3基因中的靶序列
TAGAGCTTACACGACGGTGC
SEQ ID NO:7:sgRNA的核苷酸序列1
AGTGCTAGACACAGCTGCGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTGA AAAAGTGGCACCGAGTCGGTGCT
SEQ ID NO:8:sgRNA的核苷酸序列2
TAGAGCTTACACGACGGTGCGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTG AAAAAGTGGCACCGAGTCGGTGCT
SEQ ID NO:9:Cas9的核苷酸序列
ATGCCGAAGAAGCGCCGCCGCGTGGACAAGAAGTACTCCATCGGCCTCGACATCGGCACCAACTCCGTGGGCTGGGCCGTGATCACCGACGAGTACAAGGTGCCGTCCAAGAAGTTCAAGGTGCTCGGCAACACCGACCGCCACTCCATCAAGAAGAACCTCATCGGCGCCCTCCTCTTCGACTCCGGCGAGACCGCCGAGGCCACCCGCCTCAAGCGCACCGCCCGCCGCCGCTACACCCGCCGCAAGAACCGCATCTGCTACCTCCAGGAGATCTTCTCCAACGAGATGGCCAAGGTGGACGACTCCTTCTTCCACCGCCTCGAGGAGTCCTTCCTCGTGGAGGAGGACAAGAAGCACGAGCGCCACCCGATCTTCGGCAACATCGTGGACGAGGTGGCCTACCACGAGAAGTACCCGACCATCTACCACCTCCGCAAGAAGCTCGTGGACTCCACCGACAAGGCCGACCTCCGCCTCATCTACCTCGCCCTCGCCCACATGATCAAGTTCCGCGGCCACTTCCTCATCGAGGGCGACCTCAACCCGGACAACTCCGACGTGGACAAGCTCTTCATCCAGCTCGTGCAGACCTACAACCAGCTCTTCGAGGAGAACCCGATCAACGCCTCCGGCGTGGACGCCAAGGCCATCCTCTCCGCCCGCCTCTCCAAGTCCCGCCGCCTCGAGAACCTCATCGCCCAGCTCCCGGGCGAGAAGAAGAACGGCCTCTTCGGCAACCTCATCGCCCTCTCCCTCGGCCTCACCCCGAACTTCAAGTCCAACTTCGACCTCGCCGAGGACGCCAAGCTCCAGCTCTCCAAGGACACCTACGACGACGACCTCGACAACCTCCTCGCCCAGATCGGCGACCAGTACGCCGACCTCTTCCTCGCCGCCAAGAACCTCTCCGACGCCATCCTCCTCTCCGACATCCTCCGCGTGAACACCGAGATCACCAAGGCCCCGCTCTCCGCCTCCATGATCAAGCGCTACGACGAGCACCACCAGGACCTCACCCTCCTCAAGGCCCTCGTGCGCCAGCAGCTCCCGGAGAAGTACAAGGAGATCTTCTTCGACCAGTCCAAGAACGGCTACGCCGGCTACATCGACGGCGGCGCCTCCCAGGAGGAGTTCTACAAGTTCATCAAGCCGATCCTCGAGAAGATGGACGGCACCGAGGAGCTCCTCGTGAAGCTCAACCGCGAGGACCTCCTCCGCAAGCAGCGCACCTTCGACAACGGCTCCATCCCGCACCAGATCCACCTCGGCGAGCTCCACGCCATCCTCCGCCGCCAGGAGGACTTCTACCCGTTCCTCAAGGACAACCGCGAGAAGATCGAGAAGATCCTCACCTTCCGCATCCCGTACTACGTGGGCCCGCTCGCCCGCGGCAACTCCCGCTTCGCCTGGATGACCCGCAAGTCCGAGGAGACCATCACCCCGTGGAACTTCGAGGAGGTGGTGGACAAGGGCGCCTCCGCCCAGTCCTTCATCGAGCGCATGACCAACTTCGACAAGAACCTCCCGAACGAGAAGGTGCTCCCGAAGCACTCCCTCCTCTACGAGTACTTCACCGTGTACAACGAGCTCACCAAGGTGAAGTACGTGACCGAGGGCATGCGCAAGCCGGCCTTCCTCTCCGGCGAGCAGAAGAAGGCCATCGTGGACCTCCTCTTCAAGACCAACCGCAAGGTGACCGTGAAGCAGCTCAAGGAGGACTACTTCAAGAAGATCGAGTGCTTCGACTCCGTGGAGATCTCCGGCGTGGAGGACCGCTTCAACGCCTCCCTCGGCACCTACCACGACCTCCTCAAGATCATCAAGGACAAGGACTTCCTCGACAACGAGGAGAACGAGGACATCCTCGAGGACATCGTGCTCACCCTCACCCTCTTCGAGGACCGCGAGATGATCGAGGAGCGCCTCAAGACCTACGCCCACCTCTTCGACGACAAGGTGATGAAGCAGCTCAAGCGCCGCCGCTACACCGGCTGGGGCCGCCTCTCCCGCAAGCTCATCAACGGCATCCGCGACAAGCAGTCCGGCAAGACCATCCTCGACTTCCTCAAGTCCGACGGCTTCGCCAACCGCAACTTCATGCAGCTCATCCACGACGACTCCCTCACCTTCAAGGAGGACATCCAGAAGGCCCAGGTGTCCGGCCAGGGCGACTCCCTCCACGAGCACATCGCCAACCTCGCCGGCTCCCCGGCCATCAAGAAGGGCATCCTCCAGACCGTGAAGGTGGTGGACGAGCTCGTGAAGGTGATGGGCCGCCACAAGCCGGAGAACATCGTGATCGAGATGGCCCGCGAGAACCAGACCACCCAGAAGGGCCAGAAGAACTCCCGCGAGCGCATGAAGCGCATCGAGGAGGGCATCAAGGAGCTCGGCTCCCAGATCCTCAAGGAGCACCCGGTGGAGAACACCCAGCTCCAGAACGAGAAGCTCTACCTCTACTACCTCCAGAACGGCCGCGACATGTACGTGGACCAGGAGCTCGACATCAACCGCCTCTCCGACTACGACGTGGACCACATCGTGCCGCAGTCCTTCCTCAAGGACGACTCCATCGACAACAAGGTGCTCACCCGCTCCGACAAGAACCGCGGCAAGTCCGACAACGTGCCGTCCGAGGAGGTGGTGAAGAAGATGAAGAACTACTGGCGCCAGCTCCTCAACGCCAAGCTCATCACCCAGCGCAAGTTCGACAACCTCACCAAGGCCGAGCGCGGCGGCCTCTCCGAGCTCGACAAGGCCGGCTTCATCAAGCGCCAGCTCGTGGAGACCCGCCAGATCACCAAGCACGTGGCCCAGATCCTCGACTCCCGCATGAACACCAAGTACGACGAGAACGACAAGCTCATCCGCGAGGTGAAGGTGATCACCCTCAAGTCCAAGCTCGTGTCCGACTTCCGCAAGGACTTCCAGTTCTACAAGGTGCGCGAGATCAACAACTACCACCACGCCCACGACGCCTACCTCAACGCCGTGGTGGGCACCGCCCTCATCAAGAAGTACCCGAAGCTCGAGTCCGAGTTCGTGTACGGCGACTACAAGGTGTACGACGTGCGCAAGATGATCGCCAAGTCCGAGCAGGAGATCGGCAAGGCCACCGCCAAGTACTTCTTCTACTCCAACATCATGAACTTCTTCAAGACCGAGATCACCCTCGCCAACGGCGAGATCCGCAAGCGCCCGCTCATCGAGACCAACGGCGAGACCGGCGAGATCGTGTGGGACAAGGGCCGCGACTTCGCCACCGTGCGCAAGGTGCTCTCCATGCCGCAGGTGAACATCGTGAAGAAGACCGAGGTGCAGACCGGCGGCTTCTCCAAGGAGTCCATCCTCCCGAAGCGCAACTCCGACAAGCTCATCGCCCGCAAGAAGGACTGGGACCCGAAGAAGTACGGCGGCTTCGACTCCCCGACCGTGGCCTACTCCGTGCTCGTGGTGGCCAAGGTGGAGAAGGGCAAGTCCAAGAAGCTCAAGTCCGTGAAGGAGCTCCTCGGCATCACCATCATGGAGCGCTCCTCCTTCGAGAAGAACCCGATCGACTTCCTCGAGGCCAAGGGCTACAAGGAGGTGAAGAAGGACCTCATCATCAAGCTCCCGAAGTACTCCCTCTTCGAGCTCGAGAACGGCCGCAAGCGCATGCTCGCCTCCGCCGGCGAGCTCCAGAAGGGCAACGAGCTCGCCCTCCCGTCCAAGTACGTGAACTTCCTCTACCTCGCCTCCCACTACGAGAAGCTCAAGGGCTCCCCGGAGGACAACGAGCAGAAGCAGCTCTTCGTGGAGCAGCACAAGCACTACCTCGACGAGATCATCGAGCAGATCTCCGAGTTCTCCAAGCGCGTGATCCTCGCCGACGCCAACCTCGACAAGGTGCTCTCCGCCTACAACAAGCACCGCGACAAGCCGATCCGCGAGCAGGCCGAGAACATCATCCACCTCTTCACCCTCACCAACCTCGGCGCCCCGGCCGCCTTCAAGTACTTCGACACCACCATCGACCGCAAGCGCTACACCTCCACCAAGGAGGTGCTCGACGCCACCCTCATCCACCAGTCCATCACCGGCCTCTACGAGACCCGCATCGACCTCTCCCAGCTCGGCGGCGAC
SEQ ID NO:10:用于扩增筛选标记基因CP4的正向引物序列
CAGCACAGGTTAAGTCTG
SEQ ID NO:11:用于扩增筛选标记基因CP4的反向引物序列
GTCTGTCTCAACGGTAAG
SEQ ID NO:12:用于扩增ACTIN1基因的正向引物序列
TGCTATGTACGTCGCCATCCAG
SEQ ID NO:13:用于扩增ACTIN1基因的反向引物序列
AATGAGTAACCACGCTCCGTCA
SEQ ID NO:14:用于扩增OsPPKL1基因编辑基因的正向引物
CATCACTCTGTGATGACATTGCCAG
SEQ ID NO:15:用于扩增OsPPKL1基因编辑基因的反向引物
AAAGGCTCAAAGCCTATAGC
SEQ ID NO:16:用于扩增OsPPKL3基因编辑基因的正向引物
GCCAGTTGGTCTTAGTATG
SEQ ID NO:17:用于扩增OsPPKL3基因编辑基因的反向引物
CAGCTGAACAGGACTCATAGG
SEQ ID NO:18:编辑株系407C008004a的PPKL3基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAAQGGGGGGFGSETSSASPSAPGTPTAMGAGGGAAPIAAAAIAAAASAAVVAGPRPAPGYTVVNAAMEKKEDGPGCRCGHTLTAVPAVGEEGAPGYVGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVSSNKWSRLTPVGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTAAGVWCDTKSVVTTPRTGRYSADAAGGDASVELTRRLQACSCCSW
SEQ ID NO:19:编辑株系418C026014a的PPKL3基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAAQGGGGGGFGSETSSASPSAPGTPTAMGAGGGAAPIAAAAIAAAASAAVVAGPRPAPGYTVVNAAMEKKEDGPGCRCGHTLTAVPAVGEEGAPGYVGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVSSNKWSRLTPVGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTAAGVWCDTKSVVTTPRTGRYSADAAGGDASVELTRRFAGMQLLQLVI
SEQ ID NO:20:编辑株系418C037003a、418C026036a和418F003063a的PPKL3基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAAQGGGGGGFGSETSSASPSAPGTPTAMGAGGGAAPIAAAAIAAAASAAVVAGPRPAPGYTVVNAAMEKKEDGPGCRCGHTLTAVPAVGEEGAPGYVGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVSSNKWSRLTPVGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTAAGVWCDTKSVVTTPRTGRYSADAAGGDASVELTRRVQACSCCSW
SEQ ID NO:21:编辑株系418C026036a、418C026014a、418C037003a和418C026014a的PPKL1基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAATAAASASVAAQGGLASETSSSSSASAPSTPGTPTVAPAPAAAGATGPRPAPGYTAVSAVIEKKEDGPGCRCGHTLTAVPAVGEEGTPGYIGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVLSNKWSRLTPQGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTAGVWCDTKSVVTTPRIGRYSADAAGGDAAVELTRRCRHAAAAVGDQIFIYGGLRGGVLLDDLLVAEDLAAAETTTAANHAAASAAATNVQSGRTPGRYAYNDERARQTAPESAQDGSVVLGTPVAPPVNGDMYTDISPENAVLQGQRRLSKGVDYLVEASAAEAEAISATLAAVKARQVNGEMEQLPDKEQSPDSASTSKHSSLIKPDSILSNNMTPPPGVRLHHRAVVVAAETGGALGGMVRQLSIDQFENEGRRVSYGTPENATAARKLLDRQMSINSVPKKVIASLLKPRGWKPPVRRQFFLDCNEIADLCDSAERIFSSEPSVLQLKAPVKIFGDLHGQFGDLMRLFDEYGAPSTAGDIAYIDYLFLGDYVDRGQHSLETMTLLLALKVEYPQNVHLIRGNHEAADINALFGFRIECIERMGERDGIWTWHRMNRLFNWLPLAALIEKKIICMHGGIGRSINHVEQIENLQRPITMEAGSVVLMDLLWSDPTENDSVEGLRPNARGPGLVTFGPDRVMEFCNNNDLQLIVRAHECVMDGFERFAQGHLITLFSATNYCGTANNAGAILVLGRDLVVVPKLIHPLPPAITSPETSPEHHIEDTWMQELNANRPPTPTRGRPQVAANDRGSLAWI
SEQ ID NO:22:编辑株系418F003063a的PPKL1基因编码的氨基酸序列
MDVDSRMTTESDSDSDAAATAAASASVAAQGGLASETSSSSSASAPSTPGTPTVAPAPAAAGATGPRPAPGYTAVSAVIEKKEDGPGCRCGHTLTAVPAVGEEGTPGYIGPRLILFGGATALEGNSATPPSSAGSAGIRLAGATADVHCYDVLSNKWSRLTPQGEPPSPRAAHVATAVGTMVVIQGGIGPAGLSAEDLHVLDLTQQRPRWHRVVVQGPGPGPRYGHVMALVGQRFLLTIGGNDGKRPLADVWALDTAAKPYEWRKLEPEGEGPPPCMYATASARSDGLLLLCGGRDANSVPLASAYGLAKHRDGRWEWAIAPGVSPSPRYQHAAVFVNARLHVSGGALGGGRMVEDSSSVAVLDTGVWCDTKSVVTTPRIGRYSADAAGGDAAVELTRRCRHAAAAVGDQIFIYGGLRGGVLLDDLLVAEDLAAAETTTAANHAAASAAATNVQSGRTPGRYAYNDERARQTAPESAQDGSVVLGTPVAPPVNGDMYTDISPENAVLQGQRRLSKGVDYLVEASAAEAEAISATLAAVKARQVNGEMEQLPDKEQSPDSASTSKHSSLIKPDSILSNNMTPPPGVRLHHRAVVVAAETGGALGGMVRQLSIDQFENEGRRVSYGTPENATAARKLLDRQMSINSVPKKVIASLLKPRGWKPPVRRQFFLDCNEIADLCDSAERIFSSEPSVLQLKAPVKIFGDLHGQFGDLMRLFDEYGAPSTAGDIAYIDYLFLGDYVDRGQHSLETMTLLLALKVEYPQNVHLIRGNHEAADINALFGFRIECIERMGERDGIWTWHRMNRLFNWLPLAALIEKKIICMHGGIGRSINHVEQIENLQRPITMEAGSVVLMDLLWSDPTENDSVEGLRPNARGPGLVTFGPDRVMEFCNNNDLQLIVRAHECVMDGFERFAQGHLITLFSATNYCGTANNAGAILVLGRDLVVVPKLIHPLPPAITSPETSPEHHIEDTWMQELNANRPPTPTRGRPQVAANDRGSLAWI
序列表
<110> 中国种子集团有限公司
<120> 创制水稻大长粒型矮杆新种质的方法及其应用
<160> 22
<170> SIPOSequenceListing 1.0
<210> 1
<211> 8027
<212> DNA/RNA
<213> Oryza sativa
<400> 1
atggacgtgg actcccgcat gacgacggag tcggactccg actccgacgc cgccgccacc 60
gccgccgcct ccgcctcggt tgctgcgcaa ggaggcttag cgagcgagac ctcgtcgtcg 120
tcgtcggcct cggccccgtc cacgccaggg acgcccacgg tagctccggc tcctgcggcc 180
gcgggagcca cggggcccag gccggcgccg gggtacacgg cggtcagcgc tgtgatcgag 240
aagaaggagg acgggccggg ctgccgctgc gggcacacgc tcacggcggt gccggccgtc 300
ggggaggaag gcacgccggg gtacatcggc ccacgcctca tcctcttcgg gggcgccacg 360
gcgctcgagg gcaactccgc cacgccaccc tcctcggccg gcagtgctgg gatccgtacg 420
ctttctatct agtatcatat atatgcctaa taatttctgc ttttttcaaa tgtcgattgg 480
tcgtgtctcg tggatattaa gcgacgctct tgggtgatca tgatggcaac ctggagctat 540
ttttattccc agtttgcaat ttcattttga aatgaatttg gaggcttcag gagttttact 600
gagaaatcat ttgctatttt gtgggtaatt cgggtcggtc tatacctcga tggctggaag 660
ctcatagctg gcttgagcct ttaccatcag aggatttaaa taggcatagc atcatttcag 720
acaccatttt gacttaggaa ttgcgcttga gctgtttctg gaaccgaata tcattggttg 780
aattagtgaa gatgggtgct gatggctgat tccagatgcc atggtcatgg gagctgaaat 840
gttcgaactg tttcatatat cagtcgtatt gcataatttt gaattcttat ttttgaacgt 900
atcccctgat tcactgggca tgcgctttca tggacctctc atctaaagtt tctaactatc 960
ttcaatgtta tgcttagcaa gttggaagga acatttactg cattctagat tttgtgtcat 1020
cgtgttgcac attgtatgac agtaggagaa accttttttt tttttgtttt tccttttggc 1080
aagttaaagt aggtgagcgg aacttctaag gcacgactct gatatgaaat tgaaggtttt 1140
cttttttggg agctatatta ttgctgatgt aatctattta aagcatatat acaaaactag 1200
ttgctccatt gtgtacaaaa ttgttttcgg ttaaaaaaat gttatctggt atacaatgct 1260
atttttactc tccagttagt cacaaaggtg gattattcat tgatctctcc ttggtttata 1320
ggtcttgccg gtgccacagc agatgtccac tgttatgatg tgttatcaaa taagtggagc 1380
aggtaggata ttccataaat ggtttccacc ttcttttgag catttaatcg ttgcgttgct 1440
actttggttg actttagcaa cttttgtttt agagtaacac gattattttg ttggtagcat 1500
aaagcaactc agcaaactta tttaattgcg ccaccacttt ggcaggctta ctccacaagg 1560
tgagcctcct tcaccaagag ccgcacatgt agcaactgca gttggaacca tggttgtcat 1620
ccaggtattt ctatatttta cctgtggttt ctgagattta ttacctagtg cttaattagt 1680
atcagaaaca attacttccc aagtaacctg ttttatgtac tctttgattg tttacataaa 1740
ttttatggag gtaacttcta aatatggttt gtttttgttt catcattcgt tgcaatgcag 1800
ggtggcattg gccctgctgg tttatctgcc gaggatcttc atgttctaga ccttacacaa 1860
caacgaccgc gatggcacag gtaaactatt attctgtact gtgagtgtgg agttgcattt 1920
cttcagcaca tcttctgagg tgctaatgtt tctatgtaga gtggtggttc aaggacctgg 1980
tccaggtcca cgatatgggc atgtcatggc cttggttgga cagaggtttt tgttgacaat 2040
tggtggaaat gatggtaagc aattgctccc agatgcacct gtcaaattca aactagagaa 2100
cctttgcact tggatctatg ctgaaatggt ttctataatg ggataatttt tattctctag 2160
ggaaacggcc gctggcagat gtatgggctc tagatactgc agctaagcca tacgaatgga 2220
ggaaacttga accagaaggt gaaggaccgc caccatgcat gtaagtttgc tagctgaaac 2280
acattacagt aatttaatta ttgaggatgg tgacgtaaag tttaatagca attcctgggg 2340
aggtatccat tatattacat tacttaccat gtgttgcatg ttacttcatt aatttaatat 2400
caaattatcc acaatgcaac cagctattta ctttcttggt tcagaataac atacggtagc 2460
acagttaacc caccatattt cccaattagt tgtcaggcat gaccctcagg ttattttcct 2520
ttattttttt tactggaagt agagctagaa ctaggtttgg catccttttg ctaagtatac 2580
tgcatggtgt tgttgttcag gtatgccact gccagtgcac gctctgatgg tcttctttta 2640
ctttgtggtg ggagggatgc caacagtgtg gtaagtaaaa taatcaactt atataaataa 2700
agataataaa tagtctagta aatatttcgc tgatcaaaat atgtaaagaa ctaaaactaa 2760
tggaattaga gacttgaatg tataacactt ggctgtcaat tttaccatct cagtgattgg 2820
atgtcatact gttgtaacaa ggttctatat agcctgtgat gtgaggactg caattctctc 2880
tagattacac cttcaaattt tgtatctaaa atattgaaac agcagtgatg tggcaattgg 2940
tggatttcca tgatctgaaa aataagaaag ctattacttt gttatcagtt tcatctcatg 3000
tagatcaaga gaaaatgaac ataattatgc atggcatatg ggcatcttgt catgtaactt 3060
acacagacat cagttgccaa gtagcaaata ttgagtgact ggaactacaa gtagcaaata 3120
tttgttttct ataacaaaga tgtctagtta tccttatatt cggcaagtaa aatgatactc 3180
cccctgtttc acattataag tttatagaaa aatttaacaa catctgcaac acaaaattag 3240
tttaattcaa gctaacattt aatatatttt gataatatat ttgttttgtg tcaaaaatat 3300
tgctaatctt ttttttataa acttgttcaa acttaaaaaa agtttgatta gggaaaaaaa 3360
actcaaaatg acttatagta tgaaacggag tatattttag tcacaccata ctagttgtac 3420
atcaagctat tttcatttga gatgtcaatt tgatttaaat tactaataat tacctgtaga 3480
agttcaatat caataatagg atttctctaa tgtctttgta tacatgttac ttctgtaaac 3540
ccatcagccg ctggcaagtg catatggtct tgcaaaacac agagatgggc gatgggagtg 3600
ggcaatcgcc cctggtgttt ctccatcacc cagatatcaa catgcagctg taagctgttc 3660
tttcattgtt ttttcatcta aaaggtgtac ccactttggc tcttatacaa tgcttgattc 3720
atctgttgtt taggtttttg tgaatgcacg tctccatgta tcaggaggag ctcttggagg 3780
tggtcggatg gtagaagact cctcaagtgt tgcaggttct agttttaatc ttgaaataac 3840
tgacatattt cttaaggtat tcatgaaata ccttaagaaa tatcttgcac tgacattaac 3900
cagaagaata atcttgagat gaaataatgt agctctttta gtttgctgga tgctcagcat 3960
ataaaatact tagtttttgt tggtgtatgt gcaatatttg ttatcttaat aagatctaga 4020
ttgcagaaaa gcactcagat acttgaatag tttagttgct ttcagcattg gccactcatg 4080
caccataact acacttgcta attatgttaa tctatatgac tgcatgtgag tattacaaaa 4140
gcatttcctg tctttagcct ttttttactc atcactctgt gatgacattg ccagattttc 4200
ttcctttttt tggtagcatt gagattctct tgtgttgagg tataggtaat tgttggtttc 4260
ctctagaatg acaatatgca ctcctataaa tttcttataa gttgcacgat tctatctggt 4320
tcagtgctag acacagctgc tggagtctgg tgcgacacaa agtcagttgt cacaactccc 4380
aggataggaa gatatagtgc agatgcagca gggggtgatg ctgctgttga acttacacgg 4440
cggtgcaggc atgcagctgc tgctgttggt gatcaaatat tcatttatgg aggtctacga 4500
ggaggtaaga aaacctgttt gtttagtatt atgctatagg ctttgagcct ttgctgggcc 4560
atcaagtcat cattgtacag ttcagattca agttatttgt ggtgttattt ctgatttgat 4620
ggggataact tggccccact gccacatgat atgcaacttg gatgccacat gatatatatt 4680
atagttggca tacttttcgt ttggatgaaa acttttagtg gactggaaat tgtcaagttt 4740
gacatttgtt ctcaccgttg gattttatgc accttcaatt tgctttctaa catcaaacga 4800
ctgccattgt aggggtactg ctagatgatc ttcttgtagc tgaagatctt gctgctgctg 4860
aaacgacaac cgctgctaat catgcagcgg caagtgcagc agccactaat gtacaaagcg 4920
gaagaacacc cggaagatat gcttataatg atgagcgagc gagacaaaca gctccagaat 4980
cagctcagga tggatctgta gttcttggaa ctccagttgc ccctcctgtc aacggtgaca 5040
tgtacactga tatcagccct gagaatgctg tgcttcaggg acagaggtta ttttcaattt 5100
tttttcccta tgaatatgtc ctggtgatat tcctatggca agttactatt agttttgtaa 5160
ttgctatttt cctaaggttt caatattaat ttctctacag gagattaagc aaaggtgtcg 5220
attacttagt tgaagcatca gcagcagagg cagaggctat tagtgcaact ttagctgcag 5280
tgaaggctag gcaggttaac ggtgagatgg agcaattgcc tgacaaggag caatctccag 5340
attccgcatc gaccagcaag cattcaagcc tcatcaaacc agacagtatt ctctctaaca 5400
acatgacgcc tccacctggg gttcggttgc atcatagagc tgtgagtagt ctgtatgcag 5460
tttgttacat tgtctgcagt tcatttgtca gttttttttt atgttataat attgctgtag 5520
atacttgtct ttctagcgac taaaaacaat tgtattatct cttcattttg catcacatga 5580
ggtgatatcc tctatttttc cttttacctc aggttgtagt ggctgcagaa actggaggtg 5640
ccttaggtgg catggtcagg cagctgtcaa ttgaccaatt tgaaaatgag ggaagaagag 5700
tcagctatgg cacccctgag aatgcaactg ctgcaagaaa gttgctagat cgacagatgt 5760
ccattaatag tgttcctaaa aaggtatcat ggagaatttg gtccattcaa ctgctttggc 5820
actaggattt agtgatgcag aagctgaatg cacaatatag ccatgattga gaaaagactt 5880
agctagttac tacgcgtatt ctgaactact taattacttt gttctatctg ctgcgatact 5940
tatccaggtg attgcatctc tattgaaacc tcgtggttgg aagccccctg tgcgaagaca 6000
gttcttcttg gactgcaatg agattgcaga tctatgtgat agtgctgaga gaatattttc 6060
aagtgaacca agtgttctgc aacttaaagc tcctgttaag atatttggtg atttacatgg 6120
tcaatttggt gacctcatgc gattgtttga tgagtatggt gctccttcaa cagcaggaga 6180
cattgcgtga gttcttgctt tgagaaacta gtaatcacac actttttaaa ttttgcttca 6240
attgtgatat tttacttgcc aaattaactt gaaaagcctt gttattaatg tgtttaatga 6300
attatcttag gctatctata catactccct ccgtactcgt aaaggaagtc gtttaggaca 6360
atatttaagt caaaccttgg gaatataaat catgaataac tctcaagttg ttgagtttga 6420
aaatgtaaaa attatatgaa tagatttgtc ttgaaaaata ttttcataaa agtatacata 6480
tatcactttt caataaatat tattatagaa gcaataagtc aaagttgtgt tttggagacc 6540
gtgttgctgt ccaaaacgac ttcctttacg agtatggagg gagtatgtac ttttcttttc 6600
ttttttggtt cagaaggtat ttccccacct ttcttggtct tgtgctgatc cataattttg 6660
tatgaacctg cagttacatt gattatctct tcttgggtga ttatgtggat cgtggccagc 6720
atagcttaga aacaatgact cttcttcttg cattgaaggt attaagctgc ctacttacct 6780
tgatgccaca tatacttact gtgttgtgac gatttattaa aatcgtaata tataaacaat 6840
tgtacatttc aggttgagta tcctcagaat gtacatttga ttcgtggaaa tcatgaagct 6900
gcagatatta atgctttgtt tggcttccga atagagtgca tagagcgaat ggtaatctat 6960
tttattctgc atctcacgtt acaacaaaac cttttcctct tttgcttatg aggatgttat 7020
ttgctcaaaa aacagggtga gagagatggt atttggacat ggcaccgaat gaatagatta 7080
tttaactggc ttcctttggc tgcacttatc gaaaagaaaa ttatatgtat gcatggtgga 7140
attggtcggt caatcaacca tgttgagcaa attgagaatc ttcagagacc aattaccatg 7200
gaagcaggct ctgttgtcct tatggatctt ctatggtaaa acatttcaac aataattgct 7260
atttaacctt ccatgagcat ttatttatgt gccatgttca gatgttctct ttatgatgct 7320
tataggtctg atccaaccga gaatgacagt gttgaaggat tgagaccaaa tgctcgaggc 7380
cctggccttg ttacgtttgg ggtttgtgtt ccctaaccct aaccttctga attcgtcttc 7440
cctttgttga ccccctttgt tctctgaagc taacatttgc tgttcataca gcctgatcgt 7500
gttatggagt tttgcaacaa taatgatctt caactaattg tgcgagcgca tgagtgtgtg 7560
atggatggct ttgagcgctt tgctcaaggt cacctgatca ctcttttctc tgcaacaaac 7620
tattgtggta tgaattattc taaaatattc ttttttcaaa cttttttgct ggatttctac 7680
gattgcttga catggtatgc tcacaggtac tgcaaataat gccggtgcta tcttagtttt 7740
gggcagagat cttgtggtcg ttccaaaact gattcatcct ttgcccccgg caatcacatc 7800
acctgagacc tctccggagc atcatattga ggacacatgg atgcaggtaa tactattttg 7860
ttgcaaagat attcctgttt gtaaactaac tggtactacc accttcctcg taacacggta 7920
acatttgaaa tgaaattcag gagctgaatg caaacagacc accgactcca acaaggggcc 7980
gcccccaagt agcagctaac gatcgaggtt ctcttgcctg gatatag 8027
<210> 2
<211> 1003
<212> PRT
<213> 水稻(Oryza sativa)
<400> 2
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Thr Ala Ala Ala Ser Ala Ser Val Ala Ala Gln Gly Gly
20 25 30
Leu Ala Ser Glu Thr Ser Ser Ser Ser Ser Ala Ser Ala Pro Ser Thr
35 40 45
Pro Gly Thr Pro Thr Val Ala Pro Ala Pro Ala Ala Ala Gly Ala Thr
50 55 60
Gly Pro Arg Pro Ala Pro Gly Tyr Thr Ala Val Ser Ala Val Ile Glu
65 70 75 80
Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly His Thr Leu Thr Ala
85 90 95
Val Pro Ala Val Gly Glu Glu Gly Thr Pro Gly Tyr Ile Gly Pro Arg
100 105 110
Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu Gly Asn Ser Ala Thr
115 120 125
Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg Leu Ala Gly Ala Thr
130 135 140
Ala Asp Val His Cys Tyr Asp Val Leu Ser Asn Lys Trp Ser Arg Leu
145 150 155 160
Thr Pro Gln Gly Glu Pro Pro Ser Pro Arg Ala Ala His Val Ala Thr
165 170 175
Ala Val Gly Thr Met Val Val Ile Gln Gly Gly Ile Gly Pro Ala Gly
180 185 190
Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu Thr Gln Gln Arg Pro
195 200 205
Arg Trp His Arg Val Val Val Gln Gly Pro Gly Pro Gly Pro Arg Tyr
210 215 220
Gly His Val Met Ala Leu Val Gly Gln Arg Phe Leu Leu Thr Ile Gly
225 230 235 240
Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val Trp Ala Leu Asp Thr
245 250 255
Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu Pro Glu Gly Glu Gly
260 265 270
Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala Arg Ser Asp Gly Leu
275 280 285
Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser Val Pro Leu Ala Ser
290 295 300
Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg Trp Glu Trp Ala Ile
305 310 315 320
Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln His Ala Ala Val Phe
325 330 335
Val Asn Ala Arg Leu His Val Ser Gly Gly Ala Leu Gly Gly Gly Arg
340 345 350
Met Val Glu Asp Ser Ser Ser Val Ala Val Leu Asp Thr Ala Ala Gly
355 360 365
Val Trp Cys Asp Thr Lys Ser Val Val Thr Thr Pro Arg Ile Gly Arg
370 375 380
Tyr Ser Ala Asp Ala Ala Gly Gly Asp Ala Ala Val Glu Leu Thr Arg
385 390 395 400
Arg Cys Arg His Ala Ala Ala Ala Val Gly Asp Gln Ile Phe Ile Tyr
405 410 415
Gly Gly Leu Arg Gly Gly Val Leu Leu Asp Asp Leu Leu Val Ala Glu
420 425 430
Asp Leu Ala Ala Ala Glu Thr Thr Thr Ala Ala Asn His Ala Ala Ala
435 440 445
Ser Ala Ala Ala Thr Asn Val Gln Ser Gly Arg Thr Pro Gly Arg Tyr
450 455 460
Ala Tyr Asn Asp Glu Arg Ala Arg Gln Thr Ala Pro Glu Ser Ala Gln
465 470 475 480
Asp Gly Ser Val Val Leu Gly Thr Pro Val Ala Pro Pro Val Asn Gly
485 490 495
Asp Met Tyr Thr Asp Ile Ser Pro Glu Asn Ala Val Leu Gln Gly Gln
500 505 510
Arg Arg Leu Ser Lys Gly Val Asp Tyr Leu Val Glu Ala Ser Ala Ala
515 520 525
Glu Ala Glu Ala Ile Ser Ala Thr Leu Ala Ala Val Lys Ala Arg Gln
530 535 540
Val Asn Gly Glu Met Glu Gln Leu Pro Asp Lys Glu Gln Ser Pro Asp
545 550 555 560
Ser Ala Ser Thr Ser Lys His Ser Ser Leu Ile Lys Pro Asp Ser Ile
565 570 575
Leu Ser Asn Asn Met Thr Pro Pro Pro Gly Val Arg Leu His His Arg
580 585 590
Ala Val Val Val Ala Ala Glu Thr Gly Gly Ala Leu Gly Gly Met Val
595 600 605
Arg Gln Leu Ser Ile Asp Gln Phe Glu Asn Glu Gly Arg Arg Val Ser
610 615 620
Tyr Gly Thr Pro Glu Asn Ala Thr Ala Ala Arg Lys Leu Leu Asp Arg
625 630 635 640
Gln Met Ser Ile Asn Ser Val Pro Lys Lys Val Ile Ala Ser Leu Leu
645 650 655
Lys Pro Arg Gly Trp Lys Pro Pro Val Arg Arg Gln Phe Phe Leu Asp
660 665 670
Cys Asn Glu Ile Ala Asp Leu Cys Asp Ser Ala Glu Arg Ile Phe Ser
675 680 685
Ser Glu Pro Ser Val Leu Gln Leu Lys Ala Pro Val Lys Ile Phe Gly
690 695 700
Asp Leu His Gly Gln Phe Gly Asp Leu Met Arg Leu Phe Asp Glu Tyr
705 710 715 720
Gly Ala Pro Ser Thr Ala Gly Asp Ile Ala Tyr Ile Asp Tyr Leu Phe
725 730 735
Leu Gly Asp Tyr Val Asp Arg Gly Gln His Ser Leu Glu Thr Met Thr
740 745 750
Leu Leu Leu Ala Leu Lys Val Glu Tyr Pro Gln Asn Val His Leu Ile
755 760 765
Arg Gly Asn His Glu Ala Ala Asp Ile Asn Ala Leu Phe Gly Phe Arg
770 775 780
Ile Glu Cys Ile Glu Arg Met Gly Glu Arg Asp Gly Ile Trp Thr Trp
785 790 795 800
His Arg Met Asn Arg Leu Phe Asn Trp Leu Pro Leu Ala Ala Leu Ile
805 810 815
Glu Lys Lys Ile Ile Cys Met His Gly Gly Ile Gly Arg Ser Ile Asn
820 825 830
His Val Glu Gln Ile Glu Asn Leu Gln Arg Pro Ile Thr Met Glu Ala
835 840 845
Gly Ser Val Val Leu Met Asp Leu Leu Trp Ser Asp Pro Thr Glu Asn
850 855 860
Asp Ser Val Glu Gly Leu Arg Pro Asn Ala Arg Gly Pro Gly Leu Val
865 870 875 880
Thr Phe Gly Pro Asp Arg Val Met Glu Phe Cys Asn Asn Asn Asp Leu
885 890 895
Gln Leu Ile Val Arg Ala His Glu Cys Val Met Asp Gly Phe Glu Arg
900 905 910
Phe Ala Gln Gly His Leu Ile Thr Leu Phe Ser Ala Thr Asn Tyr Cys
915 920 925
Gly Thr Ala Asn Asn Ala Gly Ala Ile Leu Val Leu Gly Arg Asp Leu
930 935 940
Val Val Val Pro Lys Leu Ile His Pro Leu Pro Pro Ala Ile Thr Ser
945 950 955 960
Pro Glu Thr Ser Pro Glu His His Ile Glu Asp Thr Trp Met Gln Glu
965 970 975
Leu Asn Ala Asn Arg Pro Pro Thr Pro Thr Arg Gly Arg Pro Gln Val
980 985 990
Ala Ala Asn Asp Arg Gly Ser Leu Ala Trp Ile
995 1000
<210> 3
<211> 7892
<212> DNA/RNA
<213> 水稻(Oryza sativa)
<400> 3
atggacgtgg actcgaggat gacgacggag tcggactccg actcggacgc cgcggcgcag 60
gggggaggag gaggagggtt cgggagcgag acctcctcgg cgtcgccctc ggcgcccggg 120
acgccgacgg ctatgggggc aggaggggga gctgctccta tcgctgctgc tgctatcgct 180
gccgccgcgt cggcggcggt ggtggcgggc ccgaggcccg cgccggggta cacggtggtg 240
aacgcggcga tggagaagaa ggaggacggg cccgggtgcc ggtgcggcca cacgctcacc 300
gcggtgccgg ccgtcgggga ggagggcgcg ccggggtacg tggggccgcg gctgatcctc 360
ttcggcggtg ccaccgcgct cgaggggaac tccgccacgc cgccatcctc ggccgggagc 420
gccggaatcc gtaatgccct tttaaacctc agctgctctt gttttcatgt atggtttggg 480
gggatggtgt tggcgtgcta attgtttagt ttgctgtcga cgagagccaa actccatggg 540
agttggaata cttgggtttg tactgaatct tgctcgggac tttggagtga tatgctggtt 600
cgcgtaccga ctgatttagg gatataactt actgttcggg atagcagcta gagttgattg 660
tagctctggt gtttcagtgt cttagttaga ggtagaattc ggagcatctg ctgatggtgg 720
tattgctagt gtaggctgga ttaggttaaa aactcgtact gggagggatt ttcatgttct 780
gcctggttag gaaagcgatt atgagtggca cgttatgaca atatatgcag tgtaactcat 840
gtgcacatgc tgttaattgg agggccaaat atggtgctct tttatgattt gaggcacaag 900
atttatttaa tgaacaacga tactcgcata cttctgttgc ggtgaagtac cagttccttt 960
taatgtttgt tggttgtttc ctttggatta tgattgtctt tgtatgttag aagaagcact 1020
caaaggtttc aaattggaaa aacatttatg tgcttagact gtactgatgc ccttggatac 1080
cctagggcct aggttaaggc gtatgggtta attaattagg tgaactatgc cgtttcaaaa 1140
gttaagcctg aaatgctatt atacgcattc cggttctcca agtcttattg ggggattaga 1200
cgttttggta gaccgggtgc caaatgtatt gtttcaaaac tgagcttaga aatgttggag 1260
gttttgcttg tgatgaaagg gaaagtttat cttctttata ggcaaatcag caaaattgac 1320
aagatcttac ataaaattat caatctcctt tcttgcaggt cttgctggtg ctactgcgga 1380
tgtgcactgc tacgatgttt catcgaataa gtggagcagg tatgttcttt ttagtaagag 1440
ccttttttag taaatgtttt aattcctttg gatattatgc ttggatgcca tgtgtaagtt 1500
tttctttaga tcccatgaaa ttttatctca attgcacttg ttagttctgt gcactactac 1560
atttgattga ccacagtaaa tgttgtcagg cttactccag ttggtgaacc tccttcacca 1620
agagcggcac atgtagcgac tgctgttggc accatggtgg tcattcaggt agctttaatc 1680
tttttgcctt agagtagtag ctatgtcagt tttatcgtca ggccagcact aacctgctct 1740
caactacagg gtgggatagg tcctgctggt ttatctgcag aagaccttca tgttctggat 1800
cttacacaac aacgaccgcg atggcacagg tggctgctgc tctaatttcg tttgggttta 1860
gaatatgctt ttgctgctat ccctagtgag cattatcatg tcttgtatgt agagtggtgg 1920
ttcaagggcc tggccctggt cctcgatatg gacatgtgat ggctttggtt gggcagcgtt 1980
tcttgttgac aatcggtgga aatgatggta ggagattgct tatgtgccta tgaaacctag 2040
attttggcta cttaagatat tgttccatcc caaatgatgt tgatactaag agtgaataaa 2100
tgtttcaggg aagaggcccc tggctgatgt gtgggccctt gatacagcag ctaagccata 2160
tgagtggagg aaacttgaac cagaaggtga aggaccaccc ccatgcatgt aagtgtgcag 2220
ataataaaaa aggaaacatc tcccctttga aatctagatc aacgaggttc tttggtgatg 2280
ttgagtttaa atcttgctat tttcttggca agattcccca gctacatcca tgagccaaca 2340
tatgataatg cctatatttt ctaatgaagc tttcagtgat tgtacttttg tttgatgccc 2400
atatcatata tagtaacctg ttttggcacc attagactac ttttagggct ccactttctt 2460
ttgttggcca agaagttaca gaaaactttg gttatttaca actcaatatt gatagtttgt 2520
tcaaaagcaa ctgccaatta agaacaagct acaagtgttt cgttaagtct acttggatgc 2580
agaacacagt aaagtgtcaa acctaatatc ctagggtaac ccaccactag ctattttctg 2640
taatttcact tacgcttttc aatagattgc tagaagttga gatagttttt gtatacttct 2700
aatagtctgt gcctctggta atgttgctca ggtatgcaac tgcaagtgcc cgttctgatg 2760
gacttctatt actctgtggt gggagggatg ctaatagtgt ggtaagtacc attctatttg 2820
tgaatagctt ttgctgagga aattttgttc taataggaat atagggttga aattctctgt 2880
tcttagtttt tagatttgaa tctgctctga agtcataaca cgagaatggt tcatatagct 2940
tttttggtaa gtgatattta taagcattgc atgcttgtgc caacaaagca tatttatttt 3000
ttctcatgtt acatctgctt ctttggatga gtgaggaatt gtgatgttaa gtgcactgaa 3060
tccatatggc atgcttgaaa ggcctgtttt gttcaatttc acttgtaaag ccatgccctg 3120
caaataaaaa aaatagaatc gtctgggctg gaaactgaat atggattcaa ccgattagtt 3180
ttcattagat ggtaagtgat tagcattttg taggtgaatt gaaataagga tatatatgga 3240
cttttctttt tggtaattac gattttggag aggcagtgac ttctatttgt atgtcctaca 3300
ctgttttact ccaaggctca agcatatttg tcaaagctaa ctatgtagtt tttccttttc 3360
ttttgggcta aaaccatcag cccctagcaa gtgcatatgg ccttgcaaaa cacagggatg 3420
gacgctggga gtgggcaatc gcccctggtg tctctccatc acccagatat caacatgcag 3480
ctgtgagttg tgaatatcct ttcccttcaa ttttctttat ttttcatctt gagatggatc 3540
ccactgcatg gtttgtgtca tgttttattt ccttttgttt ctttaggttt tcgtcaatgc 3600
acgacttcat gtctcaggag gggcccttgg aggcggtcgc atggttgaag actcctcaag 3660
tgttgcaggt tccttactat tatttgcttt gtatactttg cacctgttag ttcatgtggc 3720
atccttttga aaaatctccc accttttgtg gcaactcgaa acatttcaag caatttcttt 3780
tgattgtaag taggataaag accatatatg attaattatt tcaattacta agttcaatat 3840
attgttttct gtttgctaaa ttacccattt acaattcatc agcagagaac aagatgcatg 3900
ctgttattga ttgtctctgc agttcatata tcttgtatta ttgaaaaaag taaatggtct 3960
gaataatact ggaacacagt cggtagttta agcattggcc agttggtctt agtatgcatg 4020
tatattttct tgtttcccta ggttcttatt tttaagccat gctgcccttt ttcagtgttg 4080
gatactgctg ctggagtgtg gtgtgataca aaatcagtag ttactactcc aaggacagga 4140
agatatagtg cagatgctgc aggcggtgat gcttctgtag agcttacacg acggtgcagg 4200
catgcagctg ctgcagttgg tgatatgata tatgtttatg gaggtctacg gggaggtaag 4260
acacttcagt ttcttagcct catctgattt ctgacaccaa ttctcatgtt tgaattgcga 4320
tccgctgtcc tatgagtcct gttcagctgt attcagcagc acacgttttt taagtagtaa 4380
agatgttatc cgatgaggaa catgtatatt ctttctggga agataaggaa tttaggatat 4440
taagatatca gtacaaattt aattagcatt gactatgttt acatagcaga tccaaaaggt 4500
ttttaaggtg atccggattt cagcttaatg gatttccaat taacattagt attattacaa 4560
tatccttatc tgaatgaaaa actactcctg gagctaactg ttaactaact ttttacactt 4620
gcatcacttt tttgagggtt gactttgggt tatcagagcg aaattgcctc ctattcaaat 4680
taggatatat tcatattttt gttgatgcta agttttgtat agttgcaatt ctaaattccc 4740
tttattgtta ctccaggtgt gctgctagat gaccttcttg ttgccgaaga tcttgctgct 4800
gctgaaacaa caaatgcagc aaatcaggca gctgcaattg ctgcagcctc tgacatacaa 4860
gctggaagag aacctggtag gtatgcctac aatgatgaac aaacagggca gccagctaca 4920
ataacatctc ctgatggagc tgtggttctt ggaactccag ttgctgctcc cgttaatggg 4980
gacatgtata ctgatattag ccctgagaac gctgtgatcc agggacaaag gtttgtaaaa 5040
tttcgtagac atgttccgat agtttttatt ttattttatt ttattttttt tgcatatcta 5100
tcagtctatt gagtagaagc tacatgcact gaaatgtgtc ttttggtctt gatttgcaca 5160
ggagaatgag caaaggtgtt gattatttgg ttgaagcatc tgctgcagag gctgaggcaa 5220
tcagtgctac tttagctgct gtaaaagctc ggcaagttaa tggtgaggca gagcattcac 5280
ctgacaggga gcagtctcca gatgccacac caagtgtgaa gcaaaatgca agcctgataa 5340
aaccagacta tgctctttca aataattcaa caccacctcc tggggttcgg ttacaccata 5400
gagcagtaag tagaaaatgc agttagtttt accgagttaa ttcgttactt tatttgcttc 5460
tctagttaat tgataaattc aaaaccagat catgatcttt ccttgttcta ctgttgaaat 5520
atattgtatt acagaagctg tccctgaaca acctagctca agttagttac agcattgtga 5580
actttatttt tttcttgctc ttgtaggttg tggtagcagc agaaaccgga ggtgccttag 5640
gtggcatggt taggcagctc tcaattgacc agtttgaaaa tgaaggaaga agggtcatct 5700
atggcactcc tgagagtgca actgcagcaa ggaagttgct agatcggcaa atgtctatta 5760
atagtgtacc taaaaaggta tcttactttg cagatgtaat ttgacaaaaa aaaacagcaa 5820
acaattgcat gattatctgc taaaacacta aggcaatggc atacactggt gtgtgttgtt 5880
gtcagggggc ataaatggaa gtcatgataa cctttgcctt taaacttcta tgctattgtg 5940
ctattaagta ctcacatatt tctatttgat gacaaaatta ttcaggtaat tgcatccctc 6000
ttgaaacctc gtggttggaa gccccctgtg cgaaggcagt tctttttgga ctgcaatgag 6060
attgcagacc tatgtgatag tgcggaaaga atattttcaa gtgaaccaag tgtcctacag 6120
cttaaagctc caattaagat atttggtgat ttgcacggtc aatttggtga cctaatgcgc 6180
ttgtttgatg agtacggtgc tccttcgaca gctggagaca ttgcgtgagt attttctgat 6240
tttgaactgc cctgattcat ttacttattc attccaattt tgggaacttt tatggcattt 6300
tattggatga caaagctgaa aaaccttttt tttttcaaaa aagaacctct agaacacttt 6360
atgttaagtt gttttatttt cagttatatt attgatactt tatctgaact tcttgatttg 6420
cgtatatagg ctttgggtta gaaaatgttc ggttagagga gatattttca atcgatcttt 6480
cctgacttat atgttttcct tttgtttttt ttgttcaaaa aactttgcag ttacattgat 6540
tacctcttct tgggagatta cgtggatcgt ggtcagcata gtttggagac aatcactctt 6600
ctccttgcac ttaaggtaaa gttgtgaaca actgagcgaa ccctgaaaaa gtgctatttc 6660
agcagacagt tctttttctc tcaaaaatgt aacaattgat cttataattc aggttgaata 6720
tcctcttaat gtacatttga ttcgagggaa tcatgaggcc gctgatatca atgctttgtt 6780
tgggttccga atagagtgca tagagcgaat ggtaatgacc atgttgcatt ctgacattct 6840
tattaggctg ttttttatct cttgcttatc actgttcttt gctaaaaaca gggtgagcgt 6900
gatgggatct ggacctggca tcgcatgaat aggctattta attggcttcc tttggctgcc 6960
ctaatagaga agaaaataat atgtatgcat ggtggtattg gccggtctat caatcatgtt 7020
gaacaaattg agaatcttca aagaccaatt accatggaag caggctcggt tgttctaatg 7080
gatcttctgt ggtcagcatt tcaatagctc tttgatacta ctattgcaag attgttcttt 7140
atttttccct gatataacct attttttaaa tcttaaaagg tctgatccaa ctgaaaatga 7200
tagtgttgaa ggacttagac caaatgcccg tggcccaggc cttgttacct ttggggttag 7260
tattctctgt gtaatgttgt gtccagctct tttctccctg ttggttcgtg tggtttctgt 7320
caaaataaca cgtattgtaa ctgcagccgg atcgtgttat ggagttctgc aacaacaatg 7380
acttacagtt aattgtgcga gcacatgagt gtgtgatgga tggttttgag cgttttgctc 7440
aaggtcacct aatcactctc ttctcagcaa caaattactg tggtaatgat atgctattct 7500
gacttggtct gttcgcgatt atctcttcaa gtttgattgt cctgttactt aatatgatat 7560
gtatgcaggt acggcaaaca atgcgggtgc aatcttggtt ctaggaagag atcttgtagt 7620
agttccgaaa ctcatccatc ctttgccgcc tgccattaca tctcccgaga cctctccaga 7680
gcatcatctt gaggacacat ggatgcaggt aactcctctt ttatcatgtg atgctcttgg 7740
atttgcaata gggtattcct gtctatacgc ttctaagttc taacaccatc ttgtgttaaa 7800
atcaggaact aaatgccaac aggccgccaa ctccaacaag gggccgccct caagcagcaa 7860
acaatgaccg aggctctctt gcatggatat ag 7892
<210> 4
<211> 1009
<212> PRT
<213> 水稻(Oryza sativa)
<400> 4
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Gln Gly Gly Gly Gly Gly Gly Phe Gly Ser Glu Thr Ser
20 25 30
Ser Ala Ser Pro Ser Ala Pro Gly Thr Pro Thr Ala Met Gly Ala Gly
35 40 45
Gly Gly Ala Ala Pro Ile Ala Ala Ala Ala Ile Ala Ala Ala Ala Ser
50 55 60
Ala Ala Val Val Ala Gly Pro Arg Pro Ala Pro Gly Tyr Thr Val Val
65 70 75 80
Asn Ala Ala Met Glu Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly
85 90 95
His Thr Leu Thr Ala Val Pro Ala Val Gly Glu Glu Gly Ala Pro Gly
100 105 110
Tyr Val Gly Pro Arg Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu
115 120 125
Gly Asn Ser Ala Thr Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg
130 135 140
Leu Ala Gly Ala Thr Ala Asp Val His Cys Tyr Asp Val Ser Ser Asn
145 150 155 160
Lys Trp Ser Arg Leu Thr Pro Val Gly Glu Pro Pro Ser Pro Arg Ala
165 170 175
Ala His Val Ala Thr Ala Val Gly Thr Met Val Val Ile Gln Gly Gly
180 185 190
Ile Gly Pro Ala Gly Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu
195 200 205
Thr Gln Gln Arg Pro Arg Trp His Arg Val Val Val Gln Gly Pro Gly
210 215 220
Pro Gly Pro Arg Tyr Gly His Val Met Ala Leu Val Gly Gln Arg Phe
225 230 235 240
Leu Leu Thr Ile Gly Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val
245 250 255
Trp Ala Leu Asp Thr Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu
260 265 270
Pro Glu Gly Glu Gly Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala
275 280 285
Arg Ser Asp Gly Leu Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser
290 295 300
Val Pro Leu Ala Ser Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg
305 310 315 320
Trp Glu Trp Ala Ile Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln
325 330 335
His Ala Ala Val Phe Val Asn Ala Arg Leu His Val Ser Gly Gly Ala
340 345 350
Leu Gly Gly Gly Arg Met Val Glu Asp Ser Ser Ser Val Ala Val Leu
355 360 365
Asp Thr Ala Ala Gly Val Trp Cys Asp Thr Lys Ser Val Val Thr Thr
370 375 380
Pro Arg Thr Gly Arg Tyr Ser Ala Asp Ala Ala Gly Gly Asp Ala Ser
385 390 395 400
Val Glu Leu Thr Arg Arg Cys Arg His Ala Ala Ala Ala Val Gly Asp
405 410 415
Met Ile Tyr Val Tyr Gly Gly Leu Arg Gly Gly Val Leu Leu Asp Asp
420 425 430
Leu Leu Val Ala Glu Asp Leu Ala Ala Ala Glu Thr Thr Asn Ala Ala
435 440 445
Asn Gln Ala Ala Ala Ile Ala Ala Ala Ser Asp Ile Gln Ala Gly Arg
450 455 460
Glu Pro Gly Arg Tyr Ala Tyr Asn Asp Glu Gln Thr Gly Gln Pro Ala
465 470 475 480
Thr Ile Thr Ser Pro Asp Gly Ala Val Val Leu Gly Thr Pro Val Ala
485 490 495
Ala Pro Val Asn Gly Asp Met Tyr Thr Asp Ile Ser Pro Glu Asn Ala
500 505 510
Val Ile Gln Gly Gln Arg Arg Met Ser Lys Gly Val Asp Tyr Leu Val
515 520 525
Glu Ala Ser Ala Ala Glu Ala Glu Ala Ile Ser Ala Thr Leu Ala Ala
530 535 540
Val Lys Ala Arg Gln Val Asn Gly Glu Ala Glu His Ser Pro Asp Arg
545 550 555 560
Glu Gln Ser Pro Asp Ala Thr Pro Ser Val Lys Gln Asn Ala Ser Leu
565 570 575
Ile Lys Pro Asp Tyr Ala Leu Ser Asn Asn Ser Thr Pro Pro Pro Gly
580 585 590
Val Arg Leu His His Arg Ala Val Val Val Ala Ala Glu Thr Gly Gly
595 600 605
Ala Leu Gly Gly Met Val Arg Gln Leu Ser Ile Asp Gln Phe Glu Asn
610 615 620
Glu Gly Arg Arg Val Ile Tyr Gly Thr Pro Glu Ser Ala Thr Ala Ala
625 630 635 640
Arg Lys Leu Leu Asp Arg Gln Met Ser Ile Asn Ser Val Pro Lys Lys
645 650 655
Val Ile Ala Ser Leu Leu Lys Pro Arg Gly Trp Lys Pro Pro Val Arg
660 665 670
Arg Gln Phe Phe Leu Asp Cys Asn Glu Ile Ala Asp Leu Cys Asp Ser
675 680 685
Ala Glu Arg Ile Phe Ser Ser Glu Pro Ser Val Leu Gln Leu Lys Ala
690 695 700
Pro Ile Lys Ile Phe Gly Asp Leu His Gly Gln Phe Gly Asp Leu Met
705 710 715 720
Arg Leu Phe Asp Glu Tyr Gly Ala Pro Ser Thr Ala Gly Asp Ile Ala
725 730 735
Tyr Ile Asp Tyr Leu Phe Leu Gly Asp Tyr Val Asp Arg Gly Gln His
740 745 750
Ser Leu Glu Thr Ile Thr Leu Leu Leu Ala Leu Lys Val Glu Tyr Pro
755 760 765
Leu Asn Val His Leu Ile Arg Gly Asn His Glu Ala Ala Asp Ile Asn
770 775 780
Ala Leu Phe Gly Phe Arg Ile Glu Cys Ile Glu Arg Met Gly Glu Arg
785 790 795 800
Asp Gly Ile Trp Thr Trp His Arg Met Asn Arg Leu Phe Asn Trp Leu
805 810 815
Pro Leu Ala Ala Leu Ile Glu Lys Lys Ile Ile Cys Met His Gly Gly
820 825 830
Ile Gly Arg Ser Ile Asn His Val Glu Gln Ile Glu Asn Leu Gln Arg
835 840 845
Pro Ile Thr Met Glu Ala Gly Ser Val Val Leu Met Asp Leu Leu Trp
850 855 860
Ser Asp Pro Thr Glu Asn Asp Ser Val Glu Gly Leu Arg Pro Asn Ala
865 870 875 880
Arg Gly Pro Gly Leu Val Thr Phe Gly Pro Asp Arg Val Met Glu Phe
885 890 895
Cys Asn Asn Asn Asp Leu Gln Leu Ile Val Arg Ala His Glu Cys Val
900 905 910
Met Asp Gly Phe Glu Arg Phe Ala Gln Gly His Leu Ile Thr Leu Phe
915 920 925
Ser Ala Thr Asn Tyr Cys Gly Thr Ala Asn Asn Ala Gly Ala Ile Leu
930 935 940
Val Leu Gly Arg Asp Leu Val Val Val Pro Lys Leu Ile His Pro Leu
945 950 955 960
Pro Pro Ala Ile Thr Ser Pro Glu Thr Ser Pro Glu His His Leu Glu
965 970 975
Asp Thr Trp Met Gln Glu Leu Asn Ala Asn Arg Pro Pro Thr Pro Thr
980 985 990
Arg Gly Arg Pro Gln Ala Ala Asn Asn Asp Arg Gly Ser Leu Ala Trp
995 1000 1005
Ile
<210> 5
<211> 18
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 5
agtgctagac acagctgc 18
<210> 6
<211> 20
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 6
tagagcttac acgacggtgc 20
<210> 7
<211> 95
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 7
agtgctagac acagctgcgt tttagagcta gaaatagcaa gttaaaataa ggctagtccg 60
ttatcaactt gaaaaagtgg caccgagtcg gtgct 95
<210> 8
<211> 97
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 8
tagagcttac acgacggtgc gttttagagc tagaaatagc aagttaaaat aaggctagtc 60
cgttatcaac ttgaaaaagt ggcaccgagt cggtgct 97
<210> 9
<211> 4125
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 9
atgccgaaga agcgccgccg cgtggacaag aagtactcca tcggcctcga catcggcacc 60
aactccgtgg gctgggccgt gatcaccgac gagtacaagg tgccgtccaa gaagttcaag 120
gtgctcggca acaccgaccg ccactccatc aagaagaacc tcatcggcgc cctcctcttc 180
gactccggcg agaccgccga ggccacccgc ctcaagcgca ccgcccgccg ccgctacacc 240
cgccgcaaga accgcatctg ctacctccag gagatcttct ccaacgagat ggccaaggtg 300
gacgactcct tcttccaccg cctcgaggag tccttcctcg tggaggagga caagaagcac 360
gagcgccacc cgatcttcgg caacatcgtg gacgaggtgg cctaccacga gaagtacccg 420
accatctacc acctccgcaa gaagctcgtg gactccaccg acaaggccga cctccgcctc 480
atctacctcg ccctcgccca catgatcaag ttccgcggcc acttcctcat cgagggcgac 540
ctcaacccgg acaactccga cgtggacaag ctcttcatcc agctcgtgca gacctacaac 600
cagctcttcg aggagaaccc gatcaacgcc tccggcgtgg acgccaaggc catcctctcc 660
gcccgcctct ccaagtcccg ccgcctcgag aacctcatcg cccagctccc gggcgagaag 720
aagaacggcc tcttcggcaa cctcatcgcc ctctccctcg gcctcacccc gaacttcaag 780
tccaacttcg acctcgccga ggacgccaag ctccagctct ccaaggacac ctacgacgac 840
gacctcgaca acctcctcgc ccagatcggc gaccagtacg ccgacctctt cctcgccgcc 900
aagaacctct ccgacgccat cctcctctcc gacatcctcc gcgtgaacac cgagatcacc 960
aaggccccgc tctccgcctc catgatcaag cgctacgacg agcaccacca ggacctcacc 1020
ctcctcaagg ccctcgtgcg ccagcagctc ccggagaagt acaaggagat cttcttcgac 1080
cagtccaaga acggctacgc cggctacatc gacggcggcg cctcccagga ggagttctac 1140
aagttcatca agccgatcct cgagaagatg gacggcaccg aggagctcct cgtgaagctc 1200
aaccgcgagg acctcctccg caagcagcgc accttcgaca acggctccat cccgcaccag 1260
atccacctcg gcgagctcca cgccatcctc cgccgccagg aggacttcta cccgttcctc 1320
aaggacaacc gcgagaagat cgagaagatc ctcaccttcc gcatcccgta ctacgtgggc 1380
ccgctcgccc gcggcaactc ccgcttcgcc tggatgaccc gcaagtccga ggagaccatc 1440
accccgtgga acttcgagga ggtggtggac aagggcgcct ccgcccagtc cttcatcgag 1500
cgcatgacca acttcgacaa gaacctcccg aacgagaagg tgctcccgaa gcactccctc 1560
ctctacgagt acttcaccgt gtacaacgag ctcaccaagg tgaagtacgt gaccgagggc 1620
atgcgcaagc cggccttcct ctccggcgag cagaagaagg ccatcgtgga cctcctcttc 1680
aagaccaacc gcaaggtgac cgtgaagcag ctcaaggagg actacttcaa gaagatcgag 1740
tgcttcgact ccgtggagat ctccggcgtg gaggaccgct tcaacgcctc cctcggcacc 1800
taccacgacc tcctcaagat catcaaggac aaggacttcc tcgacaacga ggagaacgag 1860
gacatcctcg aggacatcgt gctcaccctc accctcttcg aggaccgcga gatgatcgag 1920
gagcgcctca agacctacgc ccacctcttc gacgacaagg tgatgaagca gctcaagcgc 1980
cgccgctaca ccggctgggg ccgcctctcc cgcaagctca tcaacggcat ccgcgacaag 2040
cagtccggca agaccatcct cgacttcctc aagtccgacg gcttcgccaa ccgcaacttc 2100
atgcagctca tccacgacga ctccctcacc ttcaaggagg acatccagaa ggcccaggtg 2160
tccggccagg gcgactccct ccacgagcac atcgccaacc tcgccggctc cccggccatc 2220
aagaagggca tcctccagac cgtgaaggtg gtggacgagc tcgtgaaggt gatgggccgc 2280
cacaagccgg agaacatcgt gatcgagatg gcccgcgaga accagaccac ccagaagggc 2340
cagaagaact cccgcgagcg catgaagcgc atcgaggagg gcatcaagga gctcggctcc 2400
cagatcctca aggagcaccc ggtggagaac acccagctcc agaacgagaa gctctacctc 2460
tactacctcc agaacggccg cgacatgtac gtggaccagg agctcgacat caaccgcctc 2520
tccgactacg acgtggacca catcgtgccg cagtccttcc tcaaggacga ctccatcgac 2580
aacaaggtgc tcacccgctc cgacaagaac cgcggcaagt ccgacaacgt gccgtccgag 2640
gaggtggtga agaagatgaa gaactactgg cgccagctcc tcaacgccaa gctcatcacc 2700
cagcgcaagt tcgacaacct caccaaggcc gagcgcggcg gcctctccga gctcgacaag 2760
gccggcttca tcaagcgcca gctcgtggag acccgccaga tcaccaagca cgtggcccag 2820
atcctcgact cccgcatgaa caccaagtac gacgagaacg acaagctcat ccgcgaggtg 2880
aaggtgatca ccctcaagtc caagctcgtg tccgacttcc gcaaggactt ccagttctac 2940
aaggtgcgcg agatcaacaa ctaccaccac gcccacgacg cctacctcaa cgccgtggtg 3000
ggcaccgccc tcatcaagaa gtacccgaag ctcgagtccg agttcgtgta cggcgactac 3060
aaggtgtacg acgtgcgcaa gatgatcgcc aagtccgagc aggagatcgg caaggccacc 3120
gccaagtact tcttctactc caacatcatg aacttcttca agaccgagat caccctcgcc 3180
aacggcgaga tccgcaagcg cccgctcatc gagaccaacg gcgagaccgg cgagatcgtg 3240
tgggacaagg gccgcgactt cgccaccgtg cgcaaggtgc tctccatgcc gcaggtgaac 3300
atcgtgaaga agaccgaggt gcagaccggc ggcttctcca aggagtccat cctcccgaag 3360
cgcaactccg acaagctcat cgcccgcaag aaggactggg acccgaagaa gtacggcggc 3420
ttcgactccc cgaccgtggc ctactccgtg ctcgtggtgg ccaaggtgga gaagggcaag 3480
tccaagaagc tcaagtccgt gaaggagctc ctcggcatca ccatcatgga gcgctcctcc 3540
ttcgagaaga acccgatcga cttcctcgag gccaagggct acaaggaggt gaagaaggac 3600
ctcatcatca agctcccgaa gtactccctc ttcgagctcg agaacggccg caagcgcatg 3660
ctcgcctccg ccggcgagct ccagaagggc aacgagctcg ccctcccgtc caagtacgtg 3720
aacttcctct acctcgcctc ccactacgag aagctcaagg gctccccgga ggacaacgag 3780
cagaagcagc tcttcgtgga gcagcacaag cactacctcg acgagatcat cgagcagatc 3840
tccgagttct ccaagcgcgt gatcctcgcc gacgccaacc tcgacaaggt gctctccgcc 3900
tacaacaagc accgcgacaa gccgatccgc gagcaggccg agaacatcat ccacctcttc 3960
accctcacca acctcggcgc cccggccgcc ttcaagtact tcgacaccac catcgaccgc 4020
aagcgctaca cctccaccaa ggaggtgctc gacgccaccc tcatccacca gtccatcacc 4080
ggcctctacg agacccgcat cgacctctcc cagctcggcg gcgac 4125
<210> 10
<211> 18
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 10
cagcacaggt taagtctg 18
<210> 11
<211> 18
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 11
gtctgtctca acggtaag 18
<210> 12
<211> 22
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 12
tgctatgtac gtcgccatcc ag 22
<210> 13
<211> 22
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 13
aatgagtaac cacgctccgt ca 22
<210> 14
<211> 25
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 14
catcactctg tgatgacatt gccag 25
<210> 15
<211> 20
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 15
aaaggctcaa agcctatagc 20
<210> 16
<211> 19
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 16
gccagttggt cttagtatg 19
<210> 17
<211> 21
<212> DNA/RNA
<213> 人工序列(Artificial Sequence)
<400> 17
cagctgaaca ggactcatag g 21
<210> 18
<211> 415
<212> PRT
<213> 水稻(Oryza sativa)
<400> 18
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Gln Gly Gly Gly Gly Gly Gly Phe Gly Ser Glu Thr Ser
20 25 30
Ser Ala Ser Pro Ser Ala Pro Gly Thr Pro Thr Ala Met Gly Ala Gly
35 40 45
Gly Gly Ala Ala Pro Ile Ala Ala Ala Ala Ile Ala Ala Ala Ala Ser
50 55 60
Ala Ala Val Val Ala Gly Pro Arg Pro Ala Pro Gly Tyr Thr Val Val
65 70 75 80
Asn Ala Ala Met Glu Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly
85 90 95
His Thr Leu Thr Ala Val Pro Ala Val Gly Glu Glu Gly Ala Pro Gly
100 105 110
Tyr Val Gly Pro Arg Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu
115 120 125
Gly Asn Ser Ala Thr Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg
130 135 140
Leu Ala Gly Ala Thr Ala Asp Val His Cys Tyr Asp Val Ser Ser Asn
145 150 155 160
Lys Trp Ser Arg Leu Thr Pro Val Gly Glu Pro Pro Ser Pro Arg Ala
165 170 175
Ala His Val Ala Thr Ala Val Gly Thr Met Val Val Ile Gln Gly Gly
180 185 190
Ile Gly Pro Ala Gly Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu
195 200 205
Thr Gln Gln Arg Pro Arg Trp His Arg Val Val Val Gln Gly Pro Gly
210 215 220
Pro Gly Pro Arg Tyr Gly His Val Met Ala Leu Val Gly Gln Arg Phe
225 230 235 240
Leu Leu Thr Ile Gly Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val
245 250 255
Trp Ala Leu Asp Thr Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu
260 265 270
Pro Glu Gly Glu Gly Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala
275 280 285
Arg Ser Asp Gly Leu Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser
290 295 300
Val Pro Leu Ala Ser Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg
305 310 315 320
Trp Glu Trp Ala Ile Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln
325 330 335
His Ala Ala Val Phe Val Asn Ala Arg Leu His Val Ser Gly Gly Ala
340 345 350
Leu Gly Gly Gly Arg Met Val Glu Asp Ser Ser Ser Val Ala Val Leu
355 360 365
Asp Thr Ala Ala Gly Val Trp Cys Asp Thr Lys Ser Val Val Thr Thr
370 375 380
Pro Arg Thr Gly Arg Tyr Ser Ala Asp Ala Ala Gly Gly Asp Ala Ser
385 390 395 400
Val Glu Leu Thr Arg Arg Leu Gln Ala Cys Ser Cys Cys Ser Trp
405 410 415
<210> 19
<211> 417
<212> PRT
<213> 水稻(Oryza sativa)
<400> 19
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Gln Gly Gly Gly Gly Gly Gly Phe Gly Ser Glu Thr Ser
20 25 30
Ser Ala Ser Pro Ser Ala Pro Gly Thr Pro Thr Ala Met Gly Ala Gly
35 40 45
Gly Gly Ala Ala Pro Ile Ala Ala Ala Ala Ile Ala Ala Ala Ala Ser
50 55 60
Ala Ala Val Val Ala Gly Pro Arg Pro Ala Pro Gly Tyr Thr Val Val
65 70 75 80
Asn Ala Ala Met Glu Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly
85 90 95
His Thr Leu Thr Ala Val Pro Ala Val Gly Glu Glu Gly Ala Pro Gly
100 105 110
Tyr Val Gly Pro Arg Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu
115 120 125
Gly Asn Ser Ala Thr Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg
130 135 140
Leu Ala Gly Ala Thr Ala Asp Val His Cys Tyr Asp Val Ser Ser Asn
145 150 155 160
Lys Trp Ser Arg Leu Thr Pro Val Gly Glu Pro Pro Ser Pro Arg Ala
165 170 175
Ala His Val Ala Thr Ala Val Gly Thr Met Val Val Ile Gln Gly Gly
180 185 190
Ile Gly Pro Ala Gly Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu
195 200 205
Thr Gln Gln Arg Pro Arg Trp His Arg Val Val Val Gln Gly Pro Gly
210 215 220
Pro Gly Pro Arg Tyr Gly His Val Met Ala Leu Val Gly Gln Arg Phe
225 230 235 240
Leu Leu Thr Ile Gly Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val
245 250 255
Trp Ala Leu Asp Thr Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu
260 265 270
Pro Glu Gly Glu Gly Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala
275 280 285
Arg Ser Asp Gly Leu Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser
290 295 300
Val Pro Leu Ala Ser Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg
305 310 315 320
Trp Glu Trp Ala Ile Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln
325 330 335
His Ala Ala Val Phe Val Asn Ala Arg Leu His Val Ser Gly Gly Ala
340 345 350
Leu Gly Gly Gly Arg Met Val Glu Asp Ser Ser Ser Val Ala Val Leu
355 360 365
Asp Thr Ala Ala Gly Val Trp Cys Asp Thr Lys Ser Val Val Thr Thr
370 375 380
Pro Arg Thr Gly Arg Tyr Ser Ala Asp Ala Ala Gly Gly Asp Ala Ser
385 390 395 400
Val Glu Leu Thr Arg Arg Phe Ala Gly Met Gln Leu Leu Gln Leu Val
405 410 415
Ile
<210> 20
<211> 415
<212> PRT
<213> 水稻(Oryza sativa)
<400> 20
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Gln Gly Gly Gly Gly Gly Gly Phe Gly Ser Glu Thr Ser
20 25 30
Ser Ala Ser Pro Ser Ala Pro Gly Thr Pro Thr Ala Met Gly Ala Gly
35 40 45
Gly Gly Ala Ala Pro Ile Ala Ala Ala Ala Ile Ala Ala Ala Ala Ser
50 55 60
Ala Ala Val Val Ala Gly Pro Arg Pro Ala Pro Gly Tyr Thr Val Val
65 70 75 80
Asn Ala Ala Met Glu Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly
85 90 95
His Thr Leu Thr Ala Val Pro Ala Val Gly Glu Glu Gly Ala Pro Gly
100 105 110
Tyr Val Gly Pro Arg Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu
115 120 125
Gly Asn Ser Ala Thr Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg
130 135 140
Leu Ala Gly Ala Thr Ala Asp Val His Cys Tyr Asp Val Ser Ser Asn
145 150 155 160
Lys Trp Ser Arg Leu Thr Pro Val Gly Glu Pro Pro Ser Pro Arg Ala
165 170 175
Ala His Val Ala Thr Ala Val Gly Thr Met Val Val Ile Gln Gly Gly
180 185 190
Ile Gly Pro Ala Gly Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu
195 200 205
Thr Gln Gln Arg Pro Arg Trp His Arg Val Val Val Gln Gly Pro Gly
210 215 220
Pro Gly Pro Arg Tyr Gly His Val Met Ala Leu Val Gly Gln Arg Phe
225 230 235 240
Leu Leu Thr Ile Gly Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val
245 250 255
Trp Ala Leu Asp Thr Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu
260 265 270
Pro Glu Gly Glu Gly Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala
275 280 285
Arg Ser Asp Gly Leu Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser
290 295 300
Val Pro Leu Ala Ser Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg
305 310 315 320
Trp Glu Trp Ala Ile Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln
325 330 335
His Ala Ala Val Phe Val Asn Ala Arg Leu His Val Ser Gly Gly Ala
340 345 350
Leu Gly Gly Gly Arg Met Val Glu Asp Ser Ser Ser Val Ala Val Leu
355 360 365
Asp Thr Ala Ala Gly Val Trp Cys Asp Thr Lys Ser Val Val Thr Thr
370 375 380
Pro Arg Thr Gly Arg Tyr Ser Ala Asp Ala Ala Gly Gly Asp Ala Ser
385 390 395 400
Val Glu Leu Thr Arg Arg Val Gln Ala Cys Ser Cys Cys Ser Trp
405 410 415
<210> 21
<211> 1002
<212> PRT
<213> 水稻(Oryza sativa)
<400> 21
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Thr Ala Ala Ala Ser Ala Ser Val Ala Ala Gln Gly Gly
20 25 30
Leu Ala Ser Glu Thr Ser Ser Ser Ser Ser Ala Ser Ala Pro Ser Thr
35 40 45
Pro Gly Thr Pro Thr Val Ala Pro Ala Pro Ala Ala Ala Gly Ala Thr
50 55 60
Gly Pro Arg Pro Ala Pro Gly Tyr Thr Ala Val Ser Ala Val Ile Glu
65 70 75 80
Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly His Thr Leu Thr Ala
85 90 95
Val Pro Ala Val Gly Glu Glu Gly Thr Pro Gly Tyr Ile Gly Pro Arg
100 105 110
Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu Gly Asn Ser Ala Thr
115 120 125
Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg Leu Ala Gly Ala Thr
130 135 140
Ala Asp Val His Cys Tyr Asp Val Leu Ser Asn Lys Trp Ser Arg Leu
145 150 155 160
Thr Pro Gln Gly Glu Pro Pro Ser Pro Arg Ala Ala His Val Ala Thr
165 170 175
Ala Val Gly Thr Met Val Val Ile Gln Gly Gly Ile Gly Pro Ala Gly
180 185 190
Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu Thr Gln Gln Arg Pro
195 200 205
Arg Trp His Arg Val Val Val Gln Gly Pro Gly Pro Gly Pro Arg Tyr
210 215 220
Gly His Val Met Ala Leu Val Gly Gln Arg Phe Leu Leu Thr Ile Gly
225 230 235 240
Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val Trp Ala Leu Asp Thr
245 250 255
Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu Pro Glu Gly Glu Gly
260 265 270
Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala Arg Ser Asp Gly Leu
275 280 285
Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser Val Pro Leu Ala Ser
290 295 300
Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg Trp Glu Trp Ala Ile
305 310 315 320
Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln His Ala Ala Val Phe
325 330 335
Val Asn Ala Arg Leu His Val Ser Gly Gly Ala Leu Gly Gly Gly Arg
340 345 350
Met Val Glu Asp Ser Ser Ser Val Ala Val Leu Asp Thr Ala Gly Val
355 360 365
Trp Cys Asp Thr Lys Ser Val Val Thr Thr Pro Arg Ile Gly Arg Tyr
370 375 380
Ser Ala Asp Ala Ala Gly Gly Asp Ala Ala Val Glu Leu Thr Arg Arg
385 390 395 400
Cys Arg His Ala Ala Ala Ala Val Gly Asp Gln Ile Phe Ile Tyr Gly
405 410 415
Gly Leu Arg Gly Gly Val Leu Leu Asp Asp Leu Leu Val Ala Glu Asp
420 425 430
Leu Ala Ala Ala Glu Thr Thr Thr Ala Ala Asn His Ala Ala Ala Ser
435 440 445
Ala Ala Ala Thr Asn Val Gln Ser Gly Arg Thr Pro Gly Arg Tyr Ala
450 455 460
Tyr Asn Asp Glu Arg Ala Arg Gln Thr Ala Pro Glu Ser Ala Gln Asp
465 470 475 480
Gly Ser Val Val Leu Gly Thr Pro Val Ala Pro Pro Val Asn Gly Asp
485 490 495
Met Tyr Thr Asp Ile Ser Pro Glu Asn Ala Val Leu Gln Gly Gln Arg
500 505 510
Arg Leu Ser Lys Gly Val Asp Tyr Leu Val Glu Ala Ser Ala Ala Glu
515 520 525
Ala Glu Ala Ile Ser Ala Thr Leu Ala Ala Val Lys Ala Arg Gln Val
530 535 540
Asn Gly Glu Met Glu Gln Leu Pro Asp Lys Glu Gln Ser Pro Asp Ser
545 550 555 560
Ala Ser Thr Ser Lys His Ser Ser Leu Ile Lys Pro Asp Ser Ile Leu
565 570 575
Ser Asn Asn Met Thr Pro Pro Pro Gly Val Arg Leu His His Arg Ala
580 585 590
Val Val Val Ala Ala Glu Thr Gly Gly Ala Leu Gly Gly Met Val Arg
595 600 605
Gln Leu Ser Ile Asp Gln Phe Glu Asn Glu Gly Arg Arg Val Ser Tyr
610 615 620
Gly Thr Pro Glu Asn Ala Thr Ala Ala Arg Lys Leu Leu Asp Arg Gln
625 630 635 640
Met Ser Ile Asn Ser Val Pro Lys Lys Val Ile Ala Ser Leu Leu Lys
645 650 655
Pro Arg Gly Trp Lys Pro Pro Val Arg Arg Gln Phe Phe Leu Asp Cys
660 665 670
Asn Glu Ile Ala Asp Leu Cys Asp Ser Ala Glu Arg Ile Phe Ser Ser
675 680 685
Glu Pro Ser Val Leu Gln Leu Lys Ala Pro Val Lys Ile Phe Gly Asp
690 695 700
Leu His Gly Gln Phe Gly Asp Leu Met Arg Leu Phe Asp Glu Tyr Gly
705 710 715 720
Ala Pro Ser Thr Ala Gly Asp Ile Ala Tyr Ile Asp Tyr Leu Phe Leu
725 730 735
Gly Asp Tyr Val Asp Arg Gly Gln His Ser Leu Glu Thr Met Thr Leu
740 745 750
Leu Leu Ala Leu Lys Val Glu Tyr Pro Gln Asn Val His Leu Ile Arg
755 760 765
Gly Asn His Glu Ala Ala Asp Ile Asn Ala Leu Phe Gly Phe Arg Ile
770 775 780
Glu Cys Ile Glu Arg Met Gly Glu Arg Asp Gly Ile Trp Thr Trp His
785 790 795 800
Arg Met Asn Arg Leu Phe Asn Trp Leu Pro Leu Ala Ala Leu Ile Glu
805 810 815
Lys Lys Ile Ile Cys Met His Gly Gly Ile Gly Arg Ser Ile Asn His
820 825 830
Val Glu Gln Ile Glu Asn Leu Gln Arg Pro Ile Thr Met Glu Ala Gly
835 840 845
Ser Val Val Leu Met Asp Leu Leu Trp Ser Asp Pro Thr Glu Asn Asp
850 855 860
Ser Val Glu Gly Leu Arg Pro Asn Ala Arg Gly Pro Gly Leu Val Thr
865 870 875 880
Phe Gly Pro Asp Arg Val Met Glu Phe Cys Asn Asn Asn Asp Leu Gln
885 890 895
Leu Ile Val Arg Ala His Glu Cys Val Met Asp Gly Phe Glu Arg Phe
900 905 910
Ala Gln Gly His Leu Ile Thr Leu Phe Ser Ala Thr Asn Tyr Cys Gly
915 920 925
Thr Ala Asn Asn Ala Gly Ala Ile Leu Val Leu Gly Arg Asp Leu Val
930 935 940
Val Val Pro Lys Leu Ile His Pro Leu Pro Pro Ala Ile Thr Ser Pro
945 950 955 960
Glu Thr Ser Pro Glu His His Ile Glu Asp Thr Trp Met Gln Glu Leu
965 970 975
Asn Ala Asn Arg Pro Pro Thr Pro Thr Arg Gly Arg Pro Gln Val Ala
980 985 990
Ala Asn Asp Arg Gly Ser Leu Ala Trp Ile
995 1000
<210> 22
<211> 1001
<212> PRT
<213> 水稻(Oryza sativa)
<400> 22
Met Asp Val Asp Ser Arg Met Thr Thr Glu Ser Asp Ser Asp Ser Asp
1 5 10 15
Ala Ala Ala Thr Ala Ala Ala Ser Ala Ser Val Ala Ala Gln Gly Gly
20 25 30
Leu Ala Ser Glu Thr Ser Ser Ser Ser Ser Ala Ser Ala Pro Ser Thr
35 40 45
Pro Gly Thr Pro Thr Val Ala Pro Ala Pro Ala Ala Ala Gly Ala Thr
50 55 60
Gly Pro Arg Pro Ala Pro Gly Tyr Thr Ala Val Ser Ala Val Ile Glu
65 70 75 80
Lys Lys Glu Asp Gly Pro Gly Cys Arg Cys Gly His Thr Leu Thr Ala
85 90 95
Val Pro Ala Val Gly Glu Glu Gly Thr Pro Gly Tyr Ile Gly Pro Arg
100 105 110
Leu Ile Leu Phe Gly Gly Ala Thr Ala Leu Glu Gly Asn Ser Ala Thr
115 120 125
Pro Pro Ser Ser Ala Gly Ser Ala Gly Ile Arg Leu Ala Gly Ala Thr
130 135 140
Ala Asp Val His Cys Tyr Asp Val Leu Ser Asn Lys Trp Ser Arg Leu
145 150 155 160
Thr Pro Gln Gly Glu Pro Pro Ser Pro Arg Ala Ala His Val Ala Thr
165 170 175
Ala Val Gly Thr Met Val Val Ile Gln Gly Gly Ile Gly Pro Ala Gly
180 185 190
Leu Ser Ala Glu Asp Leu His Val Leu Asp Leu Thr Gln Gln Arg Pro
195 200 205
Arg Trp His Arg Val Val Val Gln Gly Pro Gly Pro Gly Pro Arg Tyr
210 215 220
Gly His Val Met Ala Leu Val Gly Gln Arg Phe Leu Leu Thr Ile Gly
225 230 235 240
Gly Asn Asp Gly Lys Arg Pro Leu Ala Asp Val Trp Ala Leu Asp Thr
245 250 255
Ala Ala Lys Pro Tyr Glu Trp Arg Lys Leu Glu Pro Glu Gly Glu Gly
260 265 270
Pro Pro Pro Cys Met Tyr Ala Thr Ala Ser Ala Arg Ser Asp Gly Leu
275 280 285
Leu Leu Leu Cys Gly Gly Arg Asp Ala Asn Ser Val Pro Leu Ala Ser
290 295 300
Ala Tyr Gly Leu Ala Lys His Arg Asp Gly Arg Trp Glu Trp Ala Ile
305 310 315 320
Ala Pro Gly Val Ser Pro Ser Pro Arg Tyr Gln His Ala Ala Val Phe
325 330 335
Val Asn Ala Arg Leu His Val Ser Gly Gly Ala Leu Gly Gly Gly Arg
340 345 350
Met Val Glu Asp Ser Ser Ser Val Ala Val Leu Asp Thr Gly Val Trp
355 360 365
Cys Asp Thr Lys Ser Val Val Thr Thr Pro Arg Ile Gly Arg Tyr Ser
370 375 380
Ala Asp Ala Ala Gly Gly Asp Ala Ala Val Glu Leu Thr Arg Arg Cys
385 390 395 400
Arg His Ala Ala Ala Ala Val Gly Asp Gln Ile Phe Ile Tyr Gly Gly
405 410 415
Leu Arg Gly Gly Val Leu Leu Asp Asp Leu Leu Val Ala Glu Asp Leu
420 425 430
Ala Ala Ala Glu Thr Thr Thr Ala Ala Asn His Ala Ala Ala Ser Ala
435 440 445
Ala Ala Thr Asn Val Gln Ser Gly Arg Thr Pro Gly Arg Tyr Ala Tyr
450 455 460
Asn Asp Glu Arg Ala Arg Gln Thr Ala Pro Glu Ser Ala Gln Asp Gly
465 470 475 480
Ser Val Val Leu Gly Thr Pro Val Ala Pro Pro Val Asn Gly Asp Met
485 490 495
Tyr Thr Asp Ile Ser Pro Glu Asn Ala Val Leu Gln Gly Gln Arg Arg
500 505 510
Leu Ser Lys Gly Val Asp Tyr Leu Val Glu Ala Ser Ala Ala Glu Ala
515 520 525
Glu Ala Ile Ser Ala Thr Leu Ala Ala Val Lys Ala Arg Gln Val Asn
530 535 540
Gly Glu Met Glu Gln Leu Pro Asp Lys Glu Gln Ser Pro Asp Ser Ala
545 550 555 560
Ser Thr Ser Lys His Ser Ser Leu Ile Lys Pro Asp Ser Ile Leu Ser
565 570 575
Asn Asn Met Thr Pro Pro Pro Gly Val Arg Leu His His Arg Ala Val
580 585 590
Val Val Ala Ala Glu Thr Gly Gly Ala Leu Gly Gly Met Val Arg Gln
595 600 605
Leu Ser Ile Asp Gln Phe Glu Asn Glu Gly Arg Arg Val Ser Tyr Gly
610 615 620
Thr Pro Glu Asn Ala Thr Ala Ala Arg Lys Leu Leu Asp Arg Gln Met
625 630 635 640
Ser Ile Asn Ser Val Pro Lys Lys Val Ile Ala Ser Leu Leu Lys Pro
645 650 655
Arg Gly Trp Lys Pro Pro Val Arg Arg Gln Phe Phe Leu Asp Cys Asn
660 665 670
Glu Ile Ala Asp Leu Cys Asp Ser Ala Glu Arg Ile Phe Ser Ser Glu
675 680 685
Pro Ser Val Leu Gln Leu Lys Ala Pro Val Lys Ile Phe Gly Asp Leu
690 695 700
His Gly Gln Phe Gly Asp Leu Met Arg Leu Phe Asp Glu Tyr Gly Ala
705 710 715 720
Pro Ser Thr Ala Gly Asp Ile Ala Tyr Ile Asp Tyr Leu Phe Leu Gly
725 730 735
Asp Tyr Val Asp Arg Gly Gln His Ser Leu Glu Thr Met Thr Leu Leu
740 745 750
Leu Ala Leu Lys Val Glu Tyr Pro Gln Asn Val His Leu Ile Arg Gly
755 760 765
Asn His Glu Ala Ala Asp Ile Asn Ala Leu Phe Gly Phe Arg Ile Glu
770 775 780
Cys Ile Glu Arg Met Gly Glu Arg Asp Gly Ile Trp Thr Trp His Arg
785 790 795 800
Met Asn Arg Leu Phe Asn Trp Leu Pro Leu Ala Ala Leu Ile Glu Lys
805 810 815
Lys Ile Ile Cys Met His Gly Gly Ile Gly Arg Ser Ile Asn His Val
820 825 830
Glu Gln Ile Glu Asn Leu Gln Arg Pro Ile Thr Met Glu Ala Gly Ser
835 840 845
Val Val Leu Met Asp Leu Leu Trp Ser Asp Pro Thr Glu Asn Asp Ser
850 855 860
Val Glu Gly Leu Arg Pro Asn Ala Arg Gly Pro Gly Leu Val Thr Phe
865 870 875 880
Gly Pro Asp Arg Val Met Glu Phe Cys Asn Asn Asn Asp Leu Gln Leu
885 890 895
Ile Val Arg Ala His Glu Cys Val Met Asp Gly Phe Glu Arg Phe Ala
900 905 910
Gln Gly His Leu Ile Thr Leu Phe Ser Ala Thr Asn Tyr Cys Gly Thr
915 920 925
Ala Asn Asn Ala Gly Ala Ile Leu Val Leu Gly Arg Asp Leu Val Val
930 935 940
Val Pro Lys Leu Ile His Pro Leu Pro Pro Ala Ile Thr Ser Pro Glu
945 950 955 960
Thr Ser Pro Glu His His Ile Glu Asp Thr Trp Met Gln Glu Leu Asn
965 970 975
Ala Asn Arg Pro Pro Thr Pro Thr Arg Gly Arg Pro Gln Val Ala Ala
980 985 990
Asn Asp Arg Gly Ser Leu Ala Trp Ile
995 1000
Claims (14)
1.创制水稻大长粒型新种质或大长粒型矮杆新种质的方法,其包括修饰水稻中的OsPPKL1和OsPPKL3基因,使OsPPKL1基因和OsPPKL3基因的结构或功能发生变化,所述修饰通过CRISPR/Cas9进行,所述CRISPR/Cas9包含Cas9和两个导向RNA(sgRNA),其中所述两个sgRNA分别靶向OsPPKL1基因的SEQ ID NO:5和OsPPKL3基因的SEQ ID NO:6所示的核苷酸序列。
2.根据权利要求1所述的方法,其中所述OsPPKL1基因包含以下序列或由以下序列组成:SEQ ID NO:1所示的核苷酸序列,或者编码SEQ ID NO:2所示的氨基酸序列的核苷酸序列。
3.根据权利要求1或2所述的方法,其中所述OsPPKL3基因包含以下序列或由以下序列组成:SEQ ID NO:3所示的核苷酸序列,或者编码SEQ ID NO:4所示的氨基酸序列的核苷酸序列。
4.根据权利要求1或2所述的方法,其中所述水稻为籼稻或粳稻。
5.根据权利要求1所述的方法,其中所述Cas9由SEQ ID NO:9所示的核苷酸序列组成,和/或所述两个sgRNA分别由SEQ ID NO:7和SEQ ID NO:8所示的核苷酸序列组成。
6.通过权利要求1-5中任一项所述的方法创制的水稻用于育种的用途。
7.通过权利要求1-5中任一项所述的方法创制的水稻用于改良种质资源的用途。
8.能够靶向水稻OsPPKL1基因的sgRNA,其由SEQ ID NO:7所示的核苷酸序列组成。
9.能够靶向水稻OsPPKL3基因的sgRNA,其由SEQ ID NO:8所示的核苷酸序列组成。
10.能够靶向水稻OsPPKL1基因和OsPPKL3基因的CRISPR/Cas9编辑载体,其包含表达Cas9的第一表达盒和表达sgRNA的第二表达盒,其中所述Cas9由SEQ ID NO:9所示的核苷酸序列组成,所述表达sgRNA的第二表达盒包含SEQ ID NO:7和SEQ ID NO:8所示的核苷酸序列。
11.创制水稻大长粒型新种质或大长粒型矮杆新种质的方法,其包括将权利要求10所述的CRISPR/Cas9编辑载体导入含有OsPPKL1基因和OsPPKL3基因的水稻中。
12.根据权利要求11所述的方法,其中所述OsPPKL1基因包含以下序列或由以下序列组成:SEQ ID NO:1所示的核苷酸序列,或者编码SEQ ID NO:2所示的氨基酸序列的核苷酸序列。
13.根据权利要求11或12所述的方法,其中所述OsPPKL3基因包含以下序列或由以下序列组成:SEQ ID NO:3所示的核苷酸序列,或者编码SEQ ID NO:4所示的氨基酸序列的核苷酸序列。
14.根据权利要求11或12所述的方法,其中所述水稻为籼稻或粳稻。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110432092.0A CN115216488B (zh) | 2021-04-21 | 2021-04-21 | 创制水稻大长粒型新种质或大长粒型矮杆新种质的方法及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110432092.0A CN115216488B (zh) | 2021-04-21 | 2021-04-21 | 创制水稻大长粒型新种质或大长粒型矮杆新种质的方法及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115216488A CN115216488A (zh) | 2022-10-21 |
CN115216488B true CN115216488B (zh) | 2024-08-02 |
Family
ID=83605537
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110432092.0A Active CN115216488B (zh) | 2021-04-21 | 2021-04-21 | 创制水稻大长粒型新种质或大长粒型矮杆新种质的方法及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115216488B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116240236A (zh) * | 2022-12-30 | 2023-06-09 | 电子科技大学 | 一种编辑水稻OsD18基因的启动子调控水稻株型的方法 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112239761A (zh) * | 2019-07-19 | 2021-01-19 | 南京农业大学 | 一个水稻粒长基因的基因工程应用 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102352367B (zh) * | 2011-10-24 | 2014-01-08 | 南京农业大学 | 一种控制水稻籽粒粒长和粒重的半显性基因qGL3的克隆与应用 |
CN103289972B (zh) * | 2012-03-02 | 2016-08-17 | 中国科学院上海生命科学研究院 | 一种稻类长粒相关基因及其应用 |
CN106011146B (zh) * | 2016-06-13 | 2019-06-21 | 中国农业科学院作物科学研究所 | OsMADS47基因在调控水稻粒型中的应用 |
CN106754967B (zh) * | 2017-01-19 | 2020-04-28 | 南京农业大学 | 一种水稻粒型基因OsLG1及其编码蛋白质和应用 |
US20220307006A1 (en) * | 2019-07-23 | 2022-09-29 | Pioneer Hi-Bred International, Inc. | Donor design strategy for crispr-cas9 genome editing |
-
2021
- 2021-04-21 CN CN202110432092.0A patent/CN115216488B/zh active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112239761A (zh) * | 2019-07-19 | 2021-01-19 | 南京农业大学 | 一个水稻粒长基因的基因工程应用 |
Non-Patent Citations (1)
Title |
---|
Rare allele of OsPPKL1 associated with grain length causes extra-large grain and a significant yield increase in rice;Zhang Xiaojun等;《PNAS》;第109卷(第52期);第21534-21539页 * |
Also Published As
Publication number | Publication date |
---|---|
CN115216488A (zh) | 2022-10-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA3047163A1 (en) | Genome editing-based crop engineering and production of brachytic plants | |
WO2021147401A1 (zh) | 调控玉米叶夹角的dna序列及其突变体、分子标记、检测引物和应用 | |
CN110218810B (zh) | 调控玉米雄穗构型的启动子、分子标记及其应用 | |
CN111763682A (zh) | ZmSBP12基因在调控玉米抗旱性、株高及穗位高中的用途 | |
JP2022511508A (ja) | ゲノム編集による遺伝子サイレンシング | |
CN115605500A (zh) | 控制分生组织大小以改良作物的方法 | |
WO2019129145A1 (en) | Flowering time-regulating gene cmp1 and related constructs and applications thereof | |
CN109912702B (zh) | 蛋白质OsARE1在调控植物抗低氮性中的应用 | |
US20210087557A1 (en) | Methods and compositions for targeted genomic insertion | |
CN115811937A (zh) | 杂合的cenh3单子叶植物及其用于单倍体诱导和同时基因组编辑的方法 | |
CN113265422A (zh) | 靶向敲除水稻粒型调控基因slg7的方法、水稻粒型调控基因slg7突变体及其应用 | |
CN115216488B (zh) | 创制水稻大长粒型新种质或大长粒型矮杆新种质的方法及其应用 | |
CA3089886A1 (en) | Compositions and methods for improving crop yields through trait stacking | |
KR102516522B1 (ko) | 반수체 식물을 유도하는 pPLAⅡη 유전자 및 이의 용도 | |
CN114875062B (zh) | 一种通过基因组编辑提高小麦赤霉病抗性的方法 | |
KR20190122595A (ko) | 식물의 염기 교정용 유전자 구조체, 이를 포함하는 벡터 및 이를 이용한 염기 교정 방법 | |
CN112980839B (zh) | 创制水稻高直链淀粉型新种质的方法及其应用 | |
CN110452914B (zh) | 一个调控油菜素内酯信号转导的基因BnC04BIN2-like1及其应用 | |
CN113999871B (zh) | 创制矮杆直立株型的水稻种质的方法及其应用 | |
CN116134143A (zh) | 多种抗病基因及其基因组堆叠件 | |
CN115697043A (zh) | 通过靶向诱变获得突变植物的方法 | |
JP2022549430A (ja) | Dna塩基編集のための方法及び組成物 | |
CN112522259A (zh) | 单倍体介导培育具有Oslg1突变体表型的株型改良水稻材料的方法 | |
WO2018228348A1 (en) | Methods to improve plant agronomic trait using bcs1l gene and guide rna/cas endonuclease systems | |
CN112980870A (zh) | 创制水稻大长粒型新种质的方法及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Country or region after: China Address after: Room 3610, 6th Floor, Building 3, Yabulun Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya City, Hainan Province, 572025 Applicant after: CHINA NATIONAL SEED GROUP Corp.,Ltd. Address before: 15 / F, Sinochem building, A2 Fuxingmenwai street, Xicheng District, Beijing 100045 Applicant before: CHINA NATIONAL SEED GROUP Corp.,Ltd. Country or region before: China |
|
GR01 | Patent grant | ||
GR01 | Patent grant |