CN114181948A - 用于调控植物多细胞表皮毛分化的方法 - Google Patents
用于调控植物多细胞表皮毛分化的方法 Download PDFInfo
- Publication number
- CN114181948A CN114181948A CN202111506682.XA CN202111506682A CN114181948A CN 114181948 A CN114181948 A CN 114181948A CN 202111506682 A CN202111506682 A CN 202111506682A CN 114181948 A CN114181948 A CN 114181948A
- Authority
- CN
- China
- Prior art keywords
- gene
- leu
- ser
- asn
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 210000004209 hair Anatomy 0.000 title claims abstract description 108
- 238000000034 method Methods 0.000 title claims abstract description 73
- 230000004069 differentiation Effects 0.000 title claims abstract description 41
- 230000001105 regulatory effect Effects 0.000 title abstract description 16
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 64
- 230000003248 secreting effect Effects 0.000 claims abstract description 39
- 101001128393 Homo sapiens Interferon-induced GTP-binding protein Mx1 Proteins 0.000 claims abstract description 31
- 102100025055 NTF2-related export protein 1 Human genes 0.000 claims abstract description 29
- 101000721712 Homo sapiens NTF2-related export protein 1 Proteins 0.000 claims abstract description 28
- 101150007249 LFS gene Proteins 0.000 claims abstract description 27
- 101001055984 Homo sapiens Exosome complex component MTR3 Proteins 0.000 claims abstract description 24
- 101000844521 Homo sapiens Transient receptor potential cation channel subfamily M member 5 Proteins 0.000 claims abstract description 24
- 102100031215 Transient receptor potential cation channel subfamily M member 5 Human genes 0.000 claims abstract description 23
- 230000030279 gene silencing Effects 0.000 claims abstract description 22
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 20
- 102100031802 Interferon-induced GTP-binding protein Mx1 Human genes 0.000 claims abstract description 14
- 238000011161 development Methods 0.000 claims abstract description 13
- 101100511055 Allium cepa LFS gene Proteins 0.000 claims abstract description 5
- 241000196324 Embryophyta Species 0.000 claims description 92
- 239000002773 nucleotide Substances 0.000 claims description 30
- 125000003729 nucleotide group Chemical group 0.000 claims description 30
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 29
- 230000009466 transformation Effects 0.000 claims description 26
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 15
- 240000003768 Solanum lycopersicum Species 0.000 claims description 15
- 230000014509 gene expression Effects 0.000 claims description 15
- 101150112867 MX1 gene Proteins 0.000 claims description 14
- 101100185352 Trypanosoma brucei brucei (strain 927/4 GUTat10.1) MTR1 gene Proteins 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 12
- 102100026063 Exosome complex component MTR3 Human genes 0.000 claims description 11
- 101150019437 MTR2 gene Proteins 0.000 claims description 11
- 101150075880 MTR3 gene Proteins 0.000 claims description 11
- 108700028369 Alleles Proteins 0.000 claims description 9
- 244000025254 Cannabis sativa Species 0.000 claims description 8
- 235000012766 Cannabis sativa ssp. sativa var. sativa Nutrition 0.000 claims description 7
- 235000012765 Cannabis sativa ssp. sativa var. spontanea Nutrition 0.000 claims description 7
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 6
- 235000009120 camo Nutrition 0.000 claims description 6
- 235000005607 chanvre indien Nutrition 0.000 claims description 6
- 239000011487 hemp Substances 0.000 claims description 6
- 241000218231 Moraceae Species 0.000 claims description 5
- 241000208292 Solanaceae Species 0.000 claims description 5
- 238000003208 gene overexpression Methods 0.000 claims description 5
- 238000003209 gene knockout Methods 0.000 claims description 4
- 230000035772 mutation Effects 0.000 claims description 4
- 101150006159 3b gene Proteins 0.000 claims description 2
- -1 Slwox3b Proteins 0.000 claims description 2
- 238000012226 gene silencing method Methods 0.000 claims description 2
- 230000002401 inhibitory effect Effects 0.000 claims description 2
- 230000001737 promoting effect Effects 0.000 claims description 2
- 108700026220 vif Genes Proteins 0.000 claims description 2
- 244000061176 Nicotiana tabacum Species 0.000 claims 1
- 230000003779 hair growth Effects 0.000 claims 1
- 210000002768 hair cell Anatomy 0.000 abstract description 20
- 239000002207 metabolite Substances 0.000 abstract description 9
- 230000002708 enhancing effect Effects 0.000 abstract description 6
- 230000002018 overexpression Effects 0.000 abstract description 4
- 230000002194 synthesizing effect Effects 0.000 abstract description 4
- 239000013598 vector Substances 0.000 description 41
- 108091033409 CRISPR Proteins 0.000 description 35
- 108020004414 DNA Proteins 0.000 description 29
- 230000002068 genetic effect Effects 0.000 description 21
- 238000010354 CRISPR gene editing Methods 0.000 description 17
- 238000001976 enzyme digestion Methods 0.000 description 16
- 239000012634 fragment Substances 0.000 description 15
- 239000000463 material Substances 0.000 description 13
- 108091027544 Subgenomic mRNA Proteins 0.000 description 12
- 230000009261 transgenic effect Effects 0.000 description 12
- 210000004027 cell Anatomy 0.000 description 11
- 238000010276 construction Methods 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- 230000001965 increasing effect Effects 0.000 description 9
- 238000001000 micrograph Methods 0.000 description 9
- 238000012163 sequencing technique Methods 0.000 description 9
- 238000005516 engineering process Methods 0.000 description 8
- 210000002615 epidermis Anatomy 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 230000001404 mediated effect Effects 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 230000035939 shock Effects 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- 241000208125 Nicotiana Species 0.000 description 5
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 5
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 241000589158 Agrobacterium Species 0.000 description 4
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 4
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 4
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 3
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 3
- 241000193755 Bacillus cereus Species 0.000 description 3
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 3
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 3
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 3
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 3
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 3
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 3
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 3
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 3
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 3
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000001802 infusion Methods 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 230000037039 plant physiology Effects 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- UZKQTCBAMSWPJD-UQCOIBPSSA-N trans-Zeatin Natural products OCC(/C)=C\CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-UQCOIBPSSA-N 0.000 description 3
- UZKQTCBAMSWPJD-FARCUNLSSA-N trans-zeatin Chemical compound OCC(/C)=C/CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-FARCUNLSSA-N 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 229940023877 zeatin Drugs 0.000 description 3
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 2
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 2
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 2
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 2
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 2
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 2
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 2
- 235000001405 Artemisia annua Nutrition 0.000 description 2
- 240000000011 Artemisia annua Species 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 2
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 2
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 2
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 2
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 2
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 2
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 2
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 2
- VTBGVPWSWJBERH-DCAQKATOSA-N Cys-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N VTBGVPWSWJBERH-DCAQKATOSA-N 0.000 description 2
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 2
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 2
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 2
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 2
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 2
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 2
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 2
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 2
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 2
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- OAQJOXZPGHTJNA-NGTWOADLSA-N Ile-Trp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N OAQJOXZPGHTJNA-NGTWOADLSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- BQVJARUIXRXDKN-DCAQKATOSA-N Met-Asn-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BQVJARUIXRXDKN-DCAQKATOSA-N 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 2
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 2
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 2
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 2
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- 108091030071 RNAI Proteins 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 2
- ZHDQRPWESGUDST-JBACZVJFSA-N Trp-Phe-Gln Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ZHDQRPWESGUDST-JBACZVJFSA-N 0.000 description 2
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 2
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 2
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 2
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000003803 hair density Effects 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 240000004308 marijuana Species 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 2
- 229960001225 rifampicin Drugs 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- SJUXYGVRSGTPMC-IMJSIDKUSA-N Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O SJUXYGVRSGTPMC-IMJSIDKUSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- DHVMIHWNDBFTHB-FXQIFTODSA-N Asn-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N DHVMIHWNDBFTHB-FXQIFTODSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- CGYKCTPUGXFPMG-IHPCNDPISA-N Asn-Tyr-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CGYKCTPUGXFPMG-IHPCNDPISA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 1
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 108010031540 CCGCGG-specific type II deoxyribonucleases Proteins 0.000 description 1
- 235000008697 Cannabis sativa Nutrition 0.000 description 1
- 102100040501 Contactin-associated protein 1 Human genes 0.000 description 1
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- MTNUYDIILCWPEP-GUBZILKMSA-N Cys-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CS MTNUYDIILCWPEP-GUBZILKMSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- WVWZIPOJECFDAG-AVGNSLFASA-N Glu-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N WVWZIPOJECFDAG-AVGNSLFASA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- 240000004153 Hibiscus sabdariffa Species 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- 101100114028 Homo sapiens CNTNAP1 gene Proteins 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- WEWCEPOYKANMGZ-MMWGEVLESA-N Ile-Cys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WEWCEPOYKANMGZ-MMWGEVLESA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- VAGCEUUEMMXFEX-GUBZILKMSA-N Met-Met-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O VAGCEUUEMMXFEX-GUBZILKMSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- HQCSLJFGZYOXHW-KKUMJFAQSA-N Phe-His-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N HQCSLJFGZYOXHW-KKUMJFAQSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- OLZVAVSJEUAOHI-UNQGMJICSA-N Phe-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O OLZVAVSJEUAOHI-UNQGMJICSA-N 0.000 description 1
- IPVPGAADZXRZSH-RNXOBYDBSA-N Phe-Tyr-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IPVPGAADZXRZSH-RNXOBYDBSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 1
- CSRCUZAVBSEDMB-FDARSICLSA-N Trp-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CSRCUZAVBSEDMB-FDARSICLSA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- HGEHWFGAKHSIDY-SRVKXCTJSA-N Tyr-Asp-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O HGEHWFGAKHSIDY-SRVKXCTJSA-N 0.000 description 1
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 1
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- 150000001273 acylsugars Chemical class 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 229930101531 artemisinin Natural products 0.000 description 1
- BLUAFEHZUWYNDE-NNWCWBAJSA-N artemisinin Chemical compound C([C@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2OC(=O)[C@@H]4C BLUAFEHZUWYNDE-NNWCWBAJSA-N 0.000 description 1
- 229960004191 artemisinin Drugs 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 229930003827 cannabinoid Natural products 0.000 description 1
- 239000003557 cannabinoid Substances 0.000 description 1
- 229940065144 cannabinoids Drugs 0.000 description 1
- 230000036978 cell physiology Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 108010045673 endodeoxyribonuclease XBAI Proteins 0.000 description 1
- 230000004049 epigenetic modification Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000001254 nonsecretory effect Effects 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 239000012882 rooting medium Substances 0.000 description 1
- 229930000044 secondary metabolite Natural products 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 210000002955 secretory cell Anatomy 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本发明提供了用于调控植物多细胞表皮毛分化的方法。通过突变/沉默/超量表达一个或多个表皮毛发育相关基因,调控植物不同类型多细胞表皮毛细胞的分化。超量表达LFS;或者敲除/沉默Slwox3b、MX1基因中的一个或两个;或同时敲除/沉默Slwox3b、MX1、MTR1、MTR2和MTR3基因;或者敲除/沉默Slwox3b、MX1基因的同时超量表达WoP635R和/或WoI692RD695Y基因能促进植物盾状分泌型表皮毛的形成。敲除/沉默LFS基因或者MTR1、MTR2、MTR3基因中的一个或多个,促进植物指状多细胞表皮毛分化。该方法对于增强植物的抗逆性,增强植物合成分泌代谢产物能力具有重要意义。
Description
技术领域
本发明属于基因工程技术领域,具体涉及用于调控植物多细胞表皮毛分化的方法。
背景技术
植物表皮毛能够增强植物的抗逆能力(Li, L.. et al. The Plant cell, 2004,16, 126-143.)。多数陆生植物的地上组织表皮上都具有多细胞表皮毛。按照形态特征,将多细胞表皮毛分为指状多细胞表皮毛(Digitate trichome)和盾状分泌型多细胞表皮毛(Peltate trichome)(图1)。
因为植物表皮毛是植物表皮上独立的细胞群体,存储和分泌代谢物对植物其它组织的生长发育没有影响(Sirikanta ramas et al. Plant Cell Physiol, 2005, 46:1578-1582),因此分泌型多细胞表皮毛被认为是植物的“代谢物合成工厂”。在一些植物(如烟草、番茄和大麻)的叶子上的分泌型表皮毛上合成和累计的代谢物可占其叶干重的多达10-15%(Wagner et al. Ann.Bot. 2004, 93:3-11)。在不同类型表皮毛中往往可以合成和分泌不同的次级代谢产物(Schilmiller et al. Plant Journal, 2008, 54, 702-711)。如番茄指状多细胞表皮毛种的I和IV型毛可以合成和分泌酰基糖(McDowell et al.Plant physiology, 2011, 155, 523-439; Schilmiller et al. PNAS, 2012, 109,16377-16382),番茄的盾状分泌型表皮毛VI型毛可以合成大量烯萜类代谢产物(Schilmiller et al. Plant physiology, 2010, 153, 1212-1223; Bergau et al.,BMC plant biology, 2015, 15);青蒿(Artemisia annua)的盾状分泌型表皮毛是合成和分泌的青蒿素重要组织(Graham et al. Science, 2010, 327, 328-331);同样的,大麻(Cannabis sativa L .)盾状分泌型表皮毛也是是合成和分泌的大麻素重要组织(Brenneisen R. Chemistry and analysis of phytocannabinoids and other Cannabisconstituents. Marijuana and the Cannabinoids: Springer; 2007. p. 17–49.)。为此,控制植物不同类型多细胞表皮毛的分化,对于增强植物的抗逆能力,以及增强植物合成分泌代谢产物能力具有重要意义。
发明内容
本发明提供了用于调控植物多细胞表皮毛分化的方法。
为实现上述目的,本发明采用如下技术方案:
用于调控植物多细胞表皮毛分化的方法为突变植物细胞中的表皮毛发育相关基因;所述表皮毛发育相关基因包括:MTR1基因、MTR2基因、MTR3基因、LFS基因、Slwox3b基因、MX1基因、WoP635R基因和WoI692RD695Y;
所述MTR1基因的核苷酸序列如SEQ ID NO.1所示;氨基酸序列如SEQ ID NO.2所示;
所述MTR2基因的核苷酸序列如SEQ ID NO.3所示;氨基酸序列如SEQ ID NO.4所示;
所述MTR3基因的核苷酸序列如SEQ ID NO.5所示;氨基酸序列如SEQ ID NO.6所示;
所述LFS基因的核苷酸序列如SEQ ID NO.7所示;氨基酸序列如SEQ ID NO.8所示;
所述Slwox3b基因的核苷酸序列如SEQ ID NO.9所示;氨基酸序列如SEQ ID NO.10所示;
所述MX1基因的核苷酸序列如SEQ ID NO.11所示;氨基酸序列如SEQ ID NO.12所示;
所述WoP635R基因的核苷酸序列如SEQ ID NO.13所示;氨基酸序列如SEQ ID NO.14所示;
所述WoI692RD695Y基因的核苷酸序列如SEQ ID NO.15所示;氨基酸序列如SEQ IDNO.16所示;
所述方法为通过基因敲除方法,或基因沉默方法(RNAi),或基因超表达方法。
进一步的,所述表皮毛发育相关基因具有与上述表皮毛发育相关基因MTR1基因、MTR2基因、MTR3基因、LFS基因、Slwox3b基因、MX1基因和WoP635R基因至少70%相似度的核苷酸序列或者氨基酸序列。
进一步的,用于调控植物多细胞表皮毛分化的方法:敲除/沉默LFS基因,或者敲除/沉默MTR1、MTR2、MTR3基因中的一个或多个,促进植物指状多细胞表皮毛分化,同时抑制盾状分泌型表皮毛的形成。
进一步的,用于调控植物多细胞表皮毛分化的方法:超表达LFS基因,或者敲除/沉默Slwox3b基因和MX1基因中的一个或两个,促进盾状分泌型表皮毛的形成。
进一步的,用于调控植物多细胞表皮毛分化的方法:同时敲除/沉默Slwox3b基因、MX1基因、MTR1基因、MTR2基因和MTR3基因,促进植物盾状分泌型表皮毛的形成。
进一步的,用于一种调控植物多细胞表皮毛分化的方法:敲除/沉默Slwox3b基因、MX1基因的同时超表达Woolly基因功能获得型突变等位基因WoP635R和/或WoI692RD695Y,促进植物盾状分泌型表皮毛的形成。
进一步的,一种表达载体:所述表达载体含有上述方法中的表皮毛发育相关基因。所述植物表达载体包括重组转化所形成的突变体,以及由此产生的外源基因表达产物;所述突变体包括茄科、桑科和其他具有多细胞表皮毛的植物,包括但不限于烟草、番茄、大麻。
上述用于调控植物多细胞表皮毛分化的方法在促进或抑制植物多细胞表皮毛分化中的应用。
本发明的有益之处在于:本发明提供了多种调控植物多细胞表皮毛分化的方法。本发明提供了MTR1基因、MTR2基因、MTR3基因、LFS基因、Slwox3b基因、MX1基因、WoP635R基因和WoI692RD695Y基因,共8个表皮毛发育相关基因;通过改变植物细胞中的一个或多个表皮毛发育相关基因的表达调控植物植物不同类型多细胞表皮毛的分化。(1)敲除/沉默LFS基因或者敲除/沉默MTR1、MTR2、MTR3基因中的一个或多个,促进植物指状多细胞表皮毛分化,同时抑制盾状分泌型表皮毛的形成;(2)超量表达LFS基因或者敲除/沉默Slwox3b基因和MX1基因中的一个或两个,促进盾状分泌型表皮毛的形成。(3)同时敲除/沉默Slwox3b基因、MX1基因、MTR1基因、MTR2基因和MTR3基因等基因,促进植物盾状分泌型表皮毛的形成。(4)敲除/沉默Slwox3b基因和MX1基因的同时超量表达Woolly基因功能获得型突变等位基因WoP635R和/或WoI692RD695Y,促进植物盾状分泌型表皮毛的形成。本发明的方法对于增强植物的抗逆能力,以及增强植物合成分泌代谢产物能力具有重要意义。本发明的方法适用于与番茄具有类似表皮毛结构的植物中使用,其中包括烟草和大麻,以及具有相似表皮毛细胞结构的茄科、桑科植物。
附图说明
图1:盾状表皮毛细胞和指状表皮毛细胞的分类。I、II、III、IV、V 5种表皮毛细胞归类为指状多细胞表皮毛,VI、VII这两种具有盾状分泌细胞的分泌型表皮毛归类为盾状分泌型表皮毛。
图2:敲除MTR1、MTR2、MTR3基因可以增加指状表皮毛细胞的密度。(A) MTR1、MTR2、MTR3基因多靶点CRISPR/CAS9敲除载体示意图;(B) 野生型(WT)、单独突变MTR1、MTR2、MTR3基因转基因植株叶片表皮毛扫描电镜图;(C) 同时突变MTR1、MTR2、MTR3基因转基因植株叶片表皮毛扫描电镜图;标尺为500μm。
图3:敲除LFS基因可以抑制分泌型表皮毛细胞的形成。(A) LFS基因CRISPR/CAS9敲除载体示意图;(B)野生型(WT)、LFS基因突变转基因植株叶片表皮毛扫描电镜图;标尺为500μm。
图4:敲除Slwox3b和MX1基因可以促进盾状表皮毛细胞的分化。(A) Slwox3b、MX1基因多靶点CRISPR/CAS9敲除载体示意图;(B) 野生型(WT)、单独突变Slwox3b、MX1基因转基因植株叶片表皮毛扫描电镜图;(C) 同时突变Slwox3b、MX1基因转基因植株叶片表皮毛扫描电镜图;标尺为500μm。
图5:超量表达LFS基因可以促进盾状表皮毛细胞的形成。(A) LFS基因超量表达载体示意图;(B) 野生型(WT)、LFS基因超量表达转基因植株叶片表皮毛扫描电镜图;标尺为200μm。
图6:同时敲除MTR1、MTR2、MTR3、Slwox3b和MX1等基因可以极显著增加状表皮毛细胞的数量。野生型(WT)、CR-MTR1/2/3、CR-Slwox3b/MX1、CR-MTR1/2/3/Slwox3b/MX1等转基因植株叶片表皮毛扫描电镜图,标尺为500μm。
图7:在同时敲除Slwox3b和MX1基因的材料中表达WoP635R基因可以极显著增加状表皮毛细胞的数量。(A)Woolly基因启动子序列驱动WoP635R表达的载体示意图;(B)野生型(WT)、pwo: Wo P635R 、CR-Slwox3b/MX1、pwo: Wo P635R ×Slwox3b/MX1等转基因植株叶片表皮毛扫描电镜图,标尺为500μm。
具体实施方式
为了使本发明所述的内容更加便于理解,下面结合具体实施方式对本发明所述的技术方案做进一步的说明,但是下述的实例仅仅是本发明其中的例子而已,并不代表本发明所限定的权利保护范围,本发明的权利保护范围以权利要求书为准。
盾状不同类型的多细胞表皮毛往往具有不同的功能。本发明中公开的方法中,同时沉默或者敲除在SEQ ID NO.1-2,3-4,5-6所列出的MTR1、MTR2、MTR3基因,可以显著增加指状多细胞表皮毛的数量,同时抑制盾状分泌型表皮毛的形成,这对于指状多细胞表皮毛的功能研究和运用将会有很大帮助。
沉默或者敲除在SEQ ID NO.7-8所列出的LFS基因,同样可以极显著的减少包括盾状多细胞表皮毛在内的所有分泌型表皮毛的形成,而促进指状非分泌型多细胞表皮毛分化,这对于指状多细胞表皮毛的功能研究和运用将会有很大帮助。
盾状分泌型表皮毛是植物合成和分泌代谢产物的重要场所,因此,增加盾状分泌型多细胞表皮毛的分化和密度,相当于增加了合成和分泌植物代谢产物的产量。而且,本文公开的增加盾状分泌型多细胞表皮毛分化的方法对植物自身的生长发育几乎没有影响,这极大地增加了本发明的实用性。
其中,在沉默或者敲除在SEQ ID NO.9-12所列出的Slwox3b和MX1基因核苷酸和蛋白氨基酸序列,以及与SEQ ID NO.9-12所列序列具有至少约70%相似度的核苷酸或者氨基酸序列基因,可以极显著增加头状分泌型表皮毛的密度。
或者,通过组成型表达或者诱导表达或者在表皮中特异表达如SEQ ID NO.10所列LFS基因,或者与SEQ ID NO.10所列核苷酸序列具有至少约70%相似度的核苷酸序列,可以极显著增加头状分泌型表皮毛的密度。
或者,同时沉默或者敲除SEQ ID NO.1-2,3-4,5-6,9-12所列出的MTR1、MTR2、MTR3、Slwox3b和MX1基因核苷酸和蛋白氨基酸序列,以及与SEQ ID NO.1-2,3-4,5-6,9-12所列序列具有至少约70%相似度的核苷酸或者氨基酸序列,也可以极显著增加头状分泌型表皮毛的密度。
或者,在沉默或者敲除在SEQ ID NO.9-12所列出的Slwox3b和MX1基因核苷酸和蛋白氨基酸序列,以及与SEQ ID NO.9-12所列序列具有至少约70%相似度的核苷酸或者氨基酸序列的遗传材料中,再组成型表达或者诱导表达或者在植物表皮上特异表达SEQ IDNO.13-14 所列的已公开的Woolly基因功能获得型突变等位基因WoP635R或者SEQ ID NO.15-16所列已公开的Woolly 基因功能获得型突变等位基因WoI692RD695Y。或者在SEQ ID NO.13-16所列Woolly 基因功能获得型突变等位基因WoP635R和WoI692RD695Y相同或者序列具有至少约70%相似度的核苷酸或者氨基酸序突变体材料中,再沉默或者敲除在SEQ ID NO.9-12所列出的Slwox3b和MX1基因核苷酸和蛋白氨基酸序列,以及与SEQ ID NO.9-12所列序列具有至少约70%相似度的核苷酸或者氨基酸序列,都可以极显著增加头状分泌型表皮毛的密度。
说明
仅以说明的方式而不是以限制性的方式提供以下实施案例。本领域技术人员将容易地认识到可以改变或修改各种非关键参数以产生基本相同或相似结果。这些实施案例绝不应被解释为限制由所附权利要求书限定的本技术的范围。
在一些近源种植物中,它们的多细胞表皮毛的调控机制往往具有很高的相似性,比如烟草、番茄、大麻等植物。本发明要求的遗传工程转化植物或植物细胞包括茄科、桑科和其他具有上述表皮毛的植物,这其中包括了烟草、番茄、大麻等常见植物。本文提供实施案例中仅展示在番茄中的遗传转化结果,这只是为了展示我们的发明方法的可行性,而不是限制本发明的实施,也不应被解释为限制由所附权利要求书限定的本技术的范围。
本发明所描述的“I、II、III、IV、V、VI、VII”,“指状多细胞表皮毛”(Digitatetrichome),“盾状多细胞表皮毛”(Peltate trichome)等术语是其它文献中已经报道的多细胞表皮毛分类(Simmons,A.T. and Gurr, G.M. Agr Forest Entomol, 2005, 7, 265–276.)。为了方便展示本发明的成果,我们也在本发明附图1中定义了“指状多细胞表皮毛”和“盾状多细胞表皮毛”,在不同茄科和桑科等植物中,其多细胞表皮毛亚型和形态可能与文中所列表皮毛细胞存在部分差异,但是这不应被解释为限制由所附权利要求书限定的本技术的范围。
敲除或者沉默一个基因的方法有很多,其目的是为了让目的基因的无法表达或者表达量降低,只要能够达到抑制目的基因表达的方法都可以视为具有敲除或者沉默目的基因表达作用。这里面包括RNAi技术、CASPR/CAS9技术、诱变或者表观遗传修饰等。本发明中只展示CRISPR/CAS9敲除用于敲除目的基因,是为了证明本发明方法的可靠性,而不是限制本发明的实施,也不应被解释为限制由所附权利要求书限定的本技术的范围。
目前敲除或者沉默载体构建:敲除或者沉默载体的构建已经是公开多年的成熟技术,本文提供实施案例中仅使用CRISPR/CAS9敲除载体对靶标基因进行敲除。一个基因往往具有多个CRISPR/CAS9敲除靶位点,本文只选择其中的一个位点进行敲除实验展示。这些是为了展示说明我们的发明方法的结果,而不是限制本发明的实施,也不应被解释为限制由所附权利要求书限定的本技术的范围。
同时,组成型表达或者诱导表达或者再植物表皮上特异表达载体有很多,其构建技术也是已经公开的技术,本文提供实施案例中仅使用烟草花叶病毒35S启动子,以及woolly基因启动子载体是用于展示我们的发明方法的结果,而不是限制本发明的实施,也不应被解释为限制由所附权利要求书限定的本技术的范围。
植物的遗传转化:目前已公开的植物遗传转化方法有很多种。本文提供实施案例中仅使用农杆菌介导的遗传转化方法,这只是为了展示我们的发明方法的可行性,而不是限制本发明的实施,也不应被解释为限制由所附权利要求书限定的本技术的范围。
实施例1用于调控植物多细胞表皮毛分化的方法
(1)载体构建:以番茄基因组中的MTR1、MTR2、MTR3基因为例,从公开的CRISPR/CAS9靶点设计在线网站上(CRISPR-P v2.0 (hzau.edu.cn))设计SEQ ID NO.1,3,5所列出的MTR1、MTR2、MTR3基因的敲除靶位点扩增引物 (Hao Liu, et al. Mol Plant, 2017, 10(3): 530-532.),其引物分别为CR-MTR1-F: 5’-GATTATGAGCAGGAGAAATGGAAA-3’, CR-MTR1-R: 5’-AAACTTTCCATTTCTCCTGCTCAT-3’;CR-MTR2-F: 5’ -GATTGTTCAAAATGAACTACAGGA-3,CR-MTR2-R: 5’- AAACTCCTGTAGTTCATTTTGAAC-3’;CR-MTR3-F: 5’-GATTATGAGTAGAAGGAATGGAAA -3’, CR-MTR3-R: 5’ -AAACTTTCCATTCCTTCTACTCAT-3’。通过退火延伸将引物聚合成引物二聚体。再通过酶切链接(BsaI酶切位点)将靶序列插入到pU6-M载体中的sgRNA骨架上,sgRNA骨架的启动子为番茄U6启动子。用热激法将连接产物转入DH5α感受态细胞中,并用含有100mg/L氨苄霉素的LB培养基进行筛选,在在37℃培养箱中过夜培养后,第二天挑单菌落进行PCR鉴定,并将阳性克隆在含有相应抗性的LB液体培养基中大摇16小时,提质粒送测序公司进行测序分析。将测序正确的阳性质粒进行酶切(SpeI和SalI),并将切下来的sgRNA片段切胶回收,最后再通过酶切连接(SpeI和SalI酶切位点)插入35S启动子驱动的CRISPR/CAS9终载中。用热激法将连接产物转入DH5α感受态细胞中,并用含有100mg/L卡那霉素的LB培养基进行筛选,经过PCR鉴定后,将阳性克隆在含有相应抗性的LB液体培养基中大摇16小时,提质粒送测序公司进行测序分析。
本实施例中还使用了CRISPR/CAS9多基因敲除方法。实施步骤如下:先分别构建分别单独敲除MTR1、MTR2、MTR3基因的pU6-M载体。再用NheI和SalI酶对单独敲除MTR1基因的pU6-M载体进行酶切,并通过电泳切胶回收切开的载体骨架备用。同时使用SpeI和SalI酶对单独敲除MTR2基因的pU6-M载体进行酶切,并回收切下来的sgRNA-MTR2片段。通过酶切连接将单独敲除MTR2的sgRNA-MTR2片段插入单独敲除MTR1基因的pU6-M载体中,构建完成同时敲除MTR2和MTR2的pU6-M载体。使用NheI和SalI酶对同时敲除MTR1和MTR2的pU6-M载体进行酶切,并切胶回收作为骨架载体备用。同时使用SpeI和SalI酶对单独敲除MTR3基因的pU6-M载体进行酶切,并回收切下来的sgRNA-MTR3片段。通过酶切连接将单独敲除MTR3的sgRNA-MTR3片段插入到同时敲除MTR1和MTR2基因的pU6-M载体中,构建完成同时敲除MTR1、MTR2和MTR3的pU6-M载体。最后再通过酶切连接(SpeI和SalI酶切位点)将3个串联了sgRNA的DNA片段插入35S启动子驱动的CRISPR/CAS9终载中,完成多基因敲除载体构建(图2A)。
(2)遗传转化:通过浓杆菌介导的遗传转化方法进行遗传转化,操作步骤参考前人已报道的方法(Hyeon-Jin Sun, Sayaka Uchii, Shin Watanabe, Hiroshi Ezura. Plantand Cell Physiology, March 2006, Volume 47, Issue 3, Pages 426–431.)。步骤如下:首先,将阳性克隆质粒通过热激法转入C58感受态细胞中,通过含有卡那霉素(100mg/L)和利福平(50mg/L)的固体LB筛选培养基进行筛选,在28℃培养箱中培养48小时后。将经过PCR鉴定的阳性克隆,使用含有卡那霉素(100mg/L)和利福平(50mg/L)的LB液体培养基进行扩繁。待菌浓度OD值达到0.6时,将菌体离心后收集后用KC液体培养基(含1.5 mg/L zeatin的MS培养基)重悬,继而进行农杆菌侵染番茄子叶(时间5分钟)。侵染结束后,将子叶转移到KC固体培养基中,黑暗室温条件下共培养48小时,再将子叶转移至筛选培养基(含1.5 mg/Lzeatin和300mg/L的特美丁,植物筛选抗生素根据使用载体的植物抗性决定,如果使用载体的植物抗性基因为潮霉素基因,则加入10 mg/L的潮霉素,如果是卡那霉素抗性,则再加入100 mg/L的卡那霉素)进行筛选,温度26℃,光照16小时,黑暗8小时。生长2-3周后,转入含1mg/L zeatin的筛选培养基(其它成分不变)继续筛选;待转基因植株幼芽长出来后,将幼芽用手术刀切下来,并转移到生根培养基中进行生根培养(1/2MS,300mg/L的特美丁以及植物抗性10 mg/L的潮霉素或者100 mg/L的卡那霉素)。最终获得了单独突变或者同时突变MTR1、MTR2、MTR3基因的突变材料。从扫描电镜图片可以清楚的看到,单独突变MTR1、MTR2、MTR3基因就可以引起植物指状多细胞表皮毛密度的增加,以及盾状分泌型表皮毛细胞的减少(图2B)。同时突变MTR1、MTR2、MTR3基因则可以极显著增加指状多细胞表皮毛的密度,同时抑制盾状分泌型多细胞表皮毛的形成(图2C)。
实施例2用于调控植物多细胞表皮毛分化的方法
(1)载体构建:以番茄基因组中的LFS基因为例,从公开的CRISPR/CAS9靶点设计在线网站上(CRISPR-P v2.0 (hzau.edu.cn))设计SEQ ID NO.7所列出的LFS基因的敲除靶位点扩增引物,其引物分别为CR-LFS-F: 5’-GATTCCCTCCGCGGTGCGATACCG-3’, CR-LFS-R:5’-AAACCGGTATCGCACCGCGGAGGG-3’。通过通过退火延伸将引物聚合成引物二聚体。再通过酶切链接(BsaI酶切位点)将靶靶序列插入到pU6-M载体中的sgRNA骨架上。并将测序正确的阳性质粒进行酶切(SpeI和SalI),并将切下来的sgRNA片段切胶回收,最后再通过酶切连接(SpeI和SalI酶切位点)插入,将sgRNA片段插入最终的CRISPR/CAS9载体中(图3A)。用热激法将连接产物转入DH5α感受态细胞中,并用含有100mg/L卡那霉素的LB培养基进行筛选,经过PCR鉴定后,提取阳性克隆质粒送测序公司进行测序分析。
(2)遗传转化:通过浓杆菌介导的遗传转化方法进行遗传转化(转化步骤参考具体实施案例1)。经过筛选,最终获得了敲除LFS基因的突变材料。从扫描电镜图片可以清楚地看到,敲除LFS基因就可以引起植物包括盾状多细胞表皮毛细胞在内的所有分泌型表皮毛细胞消失,同时指状多细胞表皮毛密度增加(图3B)。
实施例3用于调控植物多细胞表皮毛分化的方法
(1)载体构建:以番茄基因组中的Slwox3b和MX1基因为例,从公开的CRISPR/CAS9靶点设计在线网站上(CRISPR-P v2.0 (hzau.edu.cn))设计SEQ ID NO.9、11所列出的Slwox3b和MX1基因的敲除靶位点扩增引物。其引物序列如下:CR-Slwox3b-F: 5’ -GATTGTTTCAAAACCATAAAGCTA-3’,CR-Slwox3b-R:5’-AAACTAGCTTTATGGTTTTGAAAC-3’;CR-MX1-F:5’-GATTGTGCTTTGCCAACTAAAGCCGG-3’,CR-MX1-R:5’-AAACCCGGCTTTAGTTGGCAAAGCAC-3’。通过退火延伸将引物聚合成引物二聚体,再通过酶切链接(BsaI酶切位点)将靶靶序列插入到pU6-M载体中的sgRNA骨架上。用热激法将连接产物转入DH5α感受态细胞中,初步筛选阳性sgRNA骨架载体。并将测序正确的阳性质粒进行酶切(SpeI和SalI),并将切下来的sgRNA片段切胶回收,最后再通过酶切连接(SpeI和SalI酶切位点)插入35S启动子驱动的CRISPR/CAS9终载中。用热激法将连接产物转入DH5α感受态细胞中,并用含有100mg/L卡那霉素的LB培养基进行筛选,经过PCR鉴定后,将阳性克隆质粒送测序公司进行测序分析。同时敲除Slwox3b和MX1基因基因的CRISPR/CAS9载体构建参考实施列1中的方法,构建所得载体示意图如图4A。
(2)遗传转化:通过浓杆菌介导的遗传转化方法进行遗传转化(转化步骤参考具体实施案例1)。最终获得敲除Slwox3b和MX1基因的突变材料(图4B)。从扫描电镜图片可以清楚地看到,单独敲除Slwox3b和MX1就可以引起指状多细胞表皮毛细胞密度减少,盾状多细胞表皮毛细胞增加的表型。同时敲除Slwox3b和MX1基因则可以完全抑制指状多细胞表皮毛细胞的分化,导致所以表皮毛细胞都转变为盾状多细胞表皮毛细胞(图4C)。
实施例4用于调控植物多细胞表皮毛分化的方法
(1)载体构建:以番茄基因组中的LFS基因为例。使用高保真酶,PCR扩增将SEQ IDNO.7所列出的LFS基因核苷酸片段。其扩增引物为:LFS-EcoRI-F: 5’-ACCGGTACCAAGCTTGAATTCATGGAAGATGCAATGAGAAGACTTA-3’,LFS-EcoRI-R: 5’-CCCTTGCTCACCATGAATTCAGCATTTTGCAGCTTGGCTGCAAAA-3’。切胶回收PCR产物,再通过infusion连接方法,将回收到的LFSDNA片段插入到由烟草花叶病毒35S启动子驱动的表达载体中(图5A)。
(2)遗传转化:通过浓杆菌介导的遗传转化方法进行遗传转化(转化步骤参考具体实施案例1)。最终获得了LFS基因超量表达转基因植株。从扫描电镜图片可以清楚的看到,超量表达LFS基因,可以极显著地增加植物表皮上盾状多细胞表皮毛细胞的密度(图5B)。
实施例5用于调控植物多细胞表皮毛分化的方法
(1)载体构建:与实施案例1和3中相同的方法构建载体,获得MTR1、MTR2、MTR3、Slwox3b和MX1基因的多靶点CRISPR/CAS9敲除载体。
(2)遗传转化:以实施案例3中同时敲除Slwox3b和MX1基因的不带CRISPR/CAS9载体的突变体材料为遗传转化背景材料。通过农杆菌介导的遗传转化(转化步骤参考具体实施案例1),筛选同时敲除MTR1、MTR2、MTR3基因的转基因植株,最终获得了MTR1、MTR2、MTR3、Slwox3b和MX1基因的多突变材料。从扫描电镜图片可以清楚地看到,同时敲除MTR1、MTR2、MTR3、Slwox3b和MX1基因的突变体材料上的盾状表皮毛细胞极显著高于Slwox3b和MX1基因的双突变材料上的盾状表皮毛细胞密度(图6)。
实施例6用于调控植物多细胞表皮毛分化的方法
(1)载体构建:
用pWo-SacI-F: 5'-AACGCGTTGGGAGCTCTACATACAGAACTTATGAGGGAAT-3'和pWo-XbaI-R: 5'-CATTAAAGCAGGACTCTAGACTTGAATACCTTCTCGATCTTCTT-3'引物扩增Woolly基因启动子(pWo)序列(SEQ ID NO. 17),并将目的条带切胶回收。同时通过SacI和EcoRI内切酶将表达载体上的35S启动子切除,再将切胶回收的Woolly基因启动子序列通过infusion方法插入表达载体中,构建Woolly基因启动子pWo驱动的表达载体。再使用Wo-XbaI-F: 5' -AGAAGGTATTCAAGTCTAGAATGTTTAATAACCACCAGCACTTGC-3'和Wo-XbaI-R: 5'-CATTAAAGCAGGACTCTAGATCACTGCATTTGCGGAAGTTACAGCA-3'引物扩增Woolly基因功能获得型突变等位基因WoP635R,切胶回收目的片段。同时用XbaI内切酶将pWo驱动的表达载体切开,再使用infusion方法将WoP635R基因片段插入到pWo启动子后面,最终构建形成pWo:WoP635R的表达载体(图7A)。
(2)遗传转化:以实施案例3中获得的Slwox3b和MX1双突变体为背景材料,通过农杆菌介导的遗传转化(转化步骤参考具体实施案例1),筛选获得Woolly基因功能获得型突变等位基因WoP635R的转基因植株。从扫描电镜图片可以清楚的看到,在Slwox3b和MX1的双突变体材料上,用Woolly基因启动子驱动Woolly基因功能获得型突变等位基因WoP635R(基因序列如SEQ ID NO.13所示,氨基酸序列如SEQ ID NO.14 所示)的表达,可以极显著促进盾状多细胞表皮毛在Slwox3b和MX1基因的双突变材料上的分化(图7B)。
以上所述仅为本发明的较佳实施例,凡依本发明申请专利范围所做的均等变化与修饰,皆应属本发明的涵盖范围。
SEQUENCE LISTING
<110> 福建农林大学
<120> 用于调控植物多细胞表皮毛分化的方法
<130> 35
<160> 35
<170> PatentIn version 3.3
<210> 1
<211> 321
<212> DNA
<213> SEQ ID NO.1
<400> 1
atgagcagga gaaatggaaa tggttcaagg cttgacctaa aattgaattt gtcaccacca 60
cctagagtga accacgtggg gccacaagtg gagtcgccga caaggtcgct gacaatgtcc 120
ccgacgaatt catgtgtatc atctgataat tcgtcggggg gacaacgtgg ttccaacagc 180
cccgaagcgg tgacttcgat gctgttggtt ggttgtcaaa ggtgtctaat gtatgttatg 240
gtatcggaag ttaatgatcc accaaggtgc ccaaaatgca agagtacttg tttgcttgaa 300
gttatctatg agaaaaaata a 321
<210> 2
<211> 106
<212> PRT
<213> SEQ ID NO.2
<400> 2
Met Ser Arg Arg Asn Gly Asn Gly Ser Arg Leu Asp Leu Lys Leu Asn
1 5 10 15
Leu Ser Pro Pro Pro Arg Val Asn His Val Gly Pro Gln Val Glu Ser
20 25 30
Pro Thr Arg Ser Leu Thr Met Ser Pro Thr Asn Ser Cys Val Ser Ser
35 40 45
Asp Asn Ser Ser Gly Gly Gln Arg Gly Ser Asn Ser Pro Glu Ala Val
50 55 60
Thr Ser Met Leu Leu Val Gly Cys Gln Arg Cys Leu Met Tyr Val Met
65 70 75 80
Val Ser Glu Val Asn Asp Pro Pro Arg Cys Pro Lys Cys Lys Ser Thr
85 90 95
Cys Leu Leu Glu Val Ile Tyr Glu Lys Lys
100 105
<210> 3
<211> 360
<212> DNA
<213> SEQ ID NO.3
<400> 3
atgaactaca ggagggagag tattcatgat ccaaagcttg atttgaagct gaatttatca 60
ccccctaggg aaaataataa tatgtttcca atattatcac caagcagatc atcgtcttcg 120
tatacttctt caatggagag atcaccagca agttcatgtg tttcaaggga accaactcca 180
gatcatcaag aattttttgg tagccctgaa gcaacaacaa cttctatgat gttagtagga 240
tgtccaagat gcttaatgta tgttatgtta tcagcggaag atccaaaatg ccccaaatgt 300
aagagctctg ttttgcttga ttttttccgc gaggatcgaa tcaataatcc aaagaattga 360
<210> 4
<211> 119
<212> PRT
<213> SEQ ID NO.4
<400> 4
Met Asn Tyr Arg Arg Glu Ser Ile His Asp Pro Lys Leu Asp Leu Lys
1 5 10 15
Leu Asn Leu Ser Pro Pro Arg Glu Asn Asn Asn Met Phe Pro Ile Leu
20 25 30
Ser Pro Ser Arg Ser Ser Ser Ser Tyr Thr Ser Ser Met Glu Arg Ser
35 40 45
Pro Ala Ser Ser Cys Val Ser Arg Glu Pro Thr Pro Asp His Gln Glu
50 55 60
Phe Phe Gly Ser Pro Glu Ala Thr Thr Thr Ser Met Met Leu Val Gly
65 70 75 80
Cys Pro Arg Cys Leu Met Tyr Val Met Leu Ser Ala Glu Asp Pro Lys
85 90 95
Cys Pro Lys Cys Lys Ser Ser Val Leu Leu Asp Phe Phe Arg Glu Asp
100 105 110
Arg Ile Asn Asn Pro Lys Asn
115
<210> 5
<211> 381
<212> DNA
<213> SEQ ID NO.5
<400> 5
atgagtagaa ggaatggaaa tggtcctaaa cttgacttga ggttgaattt atcaccacca 60
agggttgtta accatcagga ccaacaccag caccagtacc agtaccacat tgatcaacaa 120
gagtcgttat cgcctaatag atcattggca acatcaccaa caagctcatg tgtgtcctca 180
gaggattcgc catctcatca acgccgcgat cctaatagcc ctgaaacaac aacatccatg 240
gtgctagttg gatgtccaag gtgtatgatg tatgtcatgt tagctgaggt tgatccaaga 300
tgccctaaat gcaaaagcac ttgtttgctt gatgtaattc atgaatccaa caattgtact 360
actaggagaa ggatcactta a 381
<210> 6
<211> 126
<212> PRT
<213> SEQ ID NO.6
<400> 6
Met Ser Arg Arg Asn Gly Asn Gly Pro Lys Leu Asp Leu Arg Leu Asn
1 5 10 15
Leu Ser Pro Pro Arg Val Val Asn His Gln Asp Gln His Gln His Gln
20 25 30
Tyr Gln Tyr His Ile Asp Gln Gln Glu Ser Leu Ser Pro Asn Arg Ser
35 40 45
Leu Ala Thr Ser Pro Thr Ser Ser Cys Val Ser Ser Glu Asp Ser Pro
50 55 60
Ser His Gln Arg Arg Asp Pro Asn Ser Pro Glu Thr Thr Thr Ser Met
65 70 75 80
Val Leu Val Gly Cys Pro Arg Cys Met Met Tyr Val Met Leu Ala Glu
85 90 95
Val Asp Pro Arg Cys Pro Lys Cys Lys Ser Thr Cys Leu Leu Asp Val
100 105 110
Ile His Glu Ser Asn Asn Cys Thr Thr Arg Arg Arg Ile Thr
115 120 125
<210> 7
<211> 1161
<212> DNA
<213> SEQ ID NO.7
<400> 7
atggaagatg caatgagaag acttaatcaa gaaactgatg ttccactcca aaacactaca 60
aactcttcca ctgttaataa aactagatct tcttgttcat caaacaaaag atcactcaaa 120
gatactgcta ctggtccctc cgcggtgcga taccgcggag ttcgtcgtcg tccttgggga 180
cgttatgcag ctgaaattcg tgaccctcaa tcaaaagaaa gaaggtggct cggtactttt 240
gataccgctg aagaagctgc ttgtgcttat gattgcgctg ctagagcaat gcgcggtgtt 300
aaagcacgta ctaatttcgt ctacccacct tgtcctactc ctacccaacc cacttctacc 360
aatgacgctt tgtttaatat tcattcgtct tataaattaa catcgcctta ctatcatcag 420
agttctaata ctcttaagga tttgtctaat agaccctttt ttcattcctc cagtccttat 480
ggttctacgg ggcgtgctca tgtgattggt cagaaaagca atgattcttt gaatatgctc 540
ctgtttcgtg aactcctgag ttcaaattcc agcaacaata ctaataattt gaacgtgact 600
agtagtatga acatgcctaa tttgtacgaa cagctgccga atttcacgat gaatcgtaat 660
accaacagct tcggttcgta tttgcctaat agcagcaacc ccgtgattcc cagttcttct 720
gtcatgacta ctcaggtccc caaattcgat aacactgtcc actgtactat caacaacaat 780
aacagtagta gtggtgctac cgcaaatgat gattctgctg cagggatgga ttttttccca 840
tctgagtcat ctgattctgg attattagaa gaggctttaa atggtttttt cccaaagccc 900
aagcccatta aatctgtacc atcatcatta cctaattacg aattttgcaa tattttctct 960
caacaaccgc aacaacagga gcagattaat aatggtggtt taaacagtga ttttggatta 1020
ttatcatcgt cattgtcatc atttccagtg gattattttc caggaaatct tcaggtagca 1080
ccaggagaca acataatggg agacatattt cagtatccag atcttctcag tatttttgca 1140
gccaagctgc aaaatgctta a 1161
<210> 8
<211> 386
<212> PRT
<213> SEQ ID NO.8
<400> 8
Met Glu Asp Ala Met Arg Arg Leu Asn Gln Glu Thr Asp Val Pro Leu
1 5 10 15
Gln Asn Thr Thr Asn Ser Ser Thr Val Asn Lys Thr Arg Ser Ser Cys
20 25 30
Ser Ser Asn Lys Arg Ser Leu Lys Asp Thr Ala Thr Gly Pro Ser Ala
35 40 45
Val Arg Tyr Arg Gly Val Arg Arg Arg Pro Trp Gly Arg Tyr Ala Ala
50 55 60
Glu Ile Arg Asp Pro Gln Ser Lys Glu Arg Arg Trp Leu Gly Thr Phe
65 70 75 80
Asp Thr Ala Glu Glu Ala Ala Cys Ala Tyr Asp Cys Ala Ala Arg Ala
85 90 95
Met Arg Gly Val Lys Ala Arg Thr Asn Phe Val Tyr Pro Pro Cys Pro
100 105 110
Thr Pro Thr Gln Pro Thr Ser Thr Asn Asp Ala Leu Phe Asn Ile His
115 120 125
Ser Ser Tyr Lys Leu Thr Ser Pro Tyr Tyr His Gln Ser Ser Asn Thr
130 135 140
Leu Lys Asp Leu Ser Asn Arg Pro Phe Phe His Ser Ser Ser Pro Tyr
145 150 155 160
Gly Ser Thr Gly Arg Ala His Val Ile Gly Gln Lys Ser Asn Asp Ser
165 170 175
Leu Asn Met Leu Leu Phe Arg Glu Leu Leu Ser Ser Asn Ser Ser Asn
180 185 190
Asn Thr Asn Asn Leu Asn Val Thr Ser Ser Met Asn Met Pro Asn Leu
195 200 205
Tyr Glu Gln Leu Pro Asn Phe Thr Met Asn Arg Asn Thr Asn Ser Phe
210 215 220
Gly Ser Tyr Leu Pro Asn Ser Ser Asn Pro Val Ile Pro Ser Ser Ser
225 230 235 240
Val Met Thr Thr Gln Val Pro Lys Phe Asp Asn Thr Val His Cys Thr
245 250 255
Ile Asn Asn Asn Asn Ser Ser Ser Gly Ala Thr Ala Asn Asp Asp Ser
260 265 270
Ala Ala Gly Met Asp Phe Phe Pro Ser Glu Ser Ser Asp Ser Gly Leu
275 280 285
Leu Glu Glu Ala Leu Asn Gly Phe Phe Pro Lys Pro Lys Pro Ile Lys
290 295 300
Ser Val Pro Ser Ser Leu Pro Asn Tyr Glu Phe Cys Asn Ile Phe Ser
305 310 315 320
Gln Gln Pro Gln Gln Gln Glu Gln Ile Asn Asn Gly Gly Leu Asn Ser
325 330 335
Asp Phe Gly Leu Leu Ser Ser Ser Leu Ser Ser Phe Pro Val Asp Tyr
340 345 350
Phe Pro Gly Asn Leu Gln Val Ala Pro Gly Asp Asn Ile Met Gly Asp
355 360 365
Ile Phe Gln Tyr Pro Asp Leu Leu Ser Ile Phe Ala Ala Lys Leu Gln
370 375 380
Asn Ala
385
<210> 9
<211> 630
<212> DNA
<213> SEQ ID NO.9
<400> 9
atgggaaggc caacaagatg gagtccaaca ccagaacaac ttatgttctt agaagaaatg 60
tatagaaaag gtttaagaaa tccaaatgct acacaaatac aaagtattac ttgtcatttg 120
tcttcatttg gtaaaattga agggaaaaat gtgttttatt ggtttcaaaa ccataaagct 180
agggatagac aaaaacttaa gaaaaaactt cttgcacaaa tgaatcaaca acaaatatta 240
gctcaatatc ctattgatgc tcatagtaca actactacta ctaccaactc caataacaat 300
accctatttc attgccccac tactgatcag taccaaatat gtcctttgac atctactact 360
gcccttcttc aagagggtga aatcaaagaa gcatcatctc aagtaatgac ttatctatat 420
ccaatggatc tctcaaaacc agctgatcaa aatatggaaa attgcatgat aagaccctat 480
ggaaaagatt ggatcgtgat gatgaatatt aatccaaata acttaccata ttgtgtcaat 540
cgacccctca aaaccctacc actttttcct ataacaacaa ccgatgacct caaagaccaa 600
acaacatctt ccactagttt aagtctttga 630
<210> 10
<211> 209
<212> PRT
<213> SEQ ID NO.10
<400> 10
Met Gly Arg Pro Thr Arg Trp Ser Pro Thr Pro Glu Gln Leu Met Phe
1 5 10 15
Leu Glu Glu Met Tyr Arg Lys Gly Leu Arg Asn Pro Asn Ala Thr Gln
20 25 30
Ile Gln Ser Ile Thr Cys His Leu Ser Ser Phe Gly Lys Ile Glu Gly
35 40 45
Lys Asn Val Phe Tyr Trp Phe Gln Asn His Lys Ala Arg Asp Arg Gln
50 55 60
Lys Leu Lys Lys Lys Leu Leu Ala Gln Met Asn Gln Gln Gln Ile Leu
65 70 75 80
Ala Gln Tyr Pro Ile Asp Ala His Ser Thr Thr Thr Thr Thr Thr Asn
85 90 95
Ser Asn Asn Asn Thr Leu Phe His Cys Pro Thr Thr Asp Gln Tyr Gln
100 105 110
Ile Cys Pro Leu Thr Ser Thr Thr Ala Leu Leu Gln Glu Gly Glu Ile
115 120 125
Lys Glu Ala Ser Ser Gln Val Met Thr Tyr Leu Tyr Pro Met Asp Leu
130 135 140
Ser Lys Pro Ala Asp Gln Asn Met Glu Asn Cys Met Ile Arg Pro Tyr
145 150 155 160
Gly Lys Asp Trp Ile Val Met Met Asn Ile Asn Pro Asn Asn Leu Pro
165 170 175
Tyr Cys Val Asn Arg Pro Leu Lys Thr Leu Pro Leu Phe Pro Ile Thr
180 185 190
Thr Thr Asp Asp Leu Lys Asp Gln Thr Thr Ser Ser Thr Ser Leu Ser
195 200 205
Leu
<210> 11
<211> 1005
<212> DNA
<213> SEQ ID NO.11
<400> 11
atgggaagat caccatgttg tgagaaagtg ggattgaaga aaggaccatg gactcctgaa 60
gaagaccaaa aactcatgga ttacattgaa aaaaatggat gtggtagttg gcgtgctttg 120
ccaactaaag ccggactcaa gagatgtgga aaaagttgca gattaagatg gataaattat 180
ttaagacctg atattaagag aggaaaattc agtttgcaag aagaacagac aatcattcaa 240
ctccatgccc ttcttggtaa cagatggtca gctattgcga ctcatttggc taacagaacg 300
gacaatgaaa ttaagaatta ttggaacacg cacttgaaga agaggttaac caagatgggc 360
attgatccaa atactcataa accaaaatcc aacatctttg gttccgcaaa cctaagccac 420
atggctcagt gggaaaaggc tcgtttagaa gctgaagctc gacttgttcg cgaatccaaa 480
aaacaacacc aacaaattat ttcgaacaat aacaataaca ttaataatta taataatatc 540
cactttagtc ctaataattt gactactact acaacaacaa atgttttacc accccttcaa 600
acaaaattac cctctccacc ttgtcttgat gtgttaaaag catggcaagg aggggcaaat 660
tggtcaatta tgccaaaaat taccaaagat aattttttcg ataaccctcc aatatccact 720
tcaaatttat ctctaatcat ggtaccaaat aataatagta ttacaggggc tggattaatt 780
gataattcat gtttgattgg tactgaaaat ttcatggaga ataacattaa tggcatttca 840
tattctaatt atccaaattt gaatactatt caagggttta cacacttgga ccatgttctt 900
ggaagtcgtg aagaagaaga cgacaacgac aacaacgaca acacttattg gaacactata 960
ctaaaatctt gtacttcatt tgttgatggt tcatcagtat tttaa 1005
<210> 12
<211> 334
<212> PRT
<213> SEQ ID NO.12
<400> 12
Met Gly Arg Ser Pro Cys Cys Glu Lys Val Gly Leu Lys Lys Gly Pro
1 5 10 15
Trp Thr Pro Glu Glu Asp Gln Lys Leu Met Asp Tyr Ile Glu Lys Asn
20 25 30
Gly Cys Gly Ser Trp Arg Ala Leu Pro Thr Lys Ala Gly Leu Lys Arg
35 40 45
Cys Gly Lys Ser Cys Arg Leu Arg Trp Ile Asn Tyr Leu Arg Pro Asp
50 55 60
Ile Lys Arg Gly Lys Phe Ser Leu Gln Glu Glu Gln Thr Ile Ile Gln
65 70 75 80
Leu His Ala Leu Leu Gly Asn Arg Trp Ser Ala Ile Ala Thr His Leu
85 90 95
Ala Asn Arg Thr Asp Asn Glu Ile Lys Asn Tyr Trp Asn Thr His Leu
100 105 110
Lys Lys Arg Leu Thr Lys Met Gly Ile Asp Pro Asn Thr His Lys Pro
115 120 125
Lys Ser Asn Ile Phe Gly Ser Ala Asn Leu Ser His Met Ala Gln Trp
130 135 140
Glu Lys Ala Arg Leu Glu Ala Glu Ala Arg Leu Val Arg Glu Ser Lys
145 150 155 160
Lys Gln His Gln Gln Ile Ile Ser Asn Asn Asn Asn Asn Ile Asn Asn
165 170 175
Tyr Asn Asn Ile His Phe Ser Pro Asn Asn Leu Thr Thr Thr Thr Thr
180 185 190
Thr Asn Val Leu Pro Pro Leu Gln Thr Lys Leu Pro Ser Pro Pro Cys
195 200 205
Leu Asp Val Leu Lys Ala Trp Gln Gly Gly Ala Asn Trp Ser Ile Met
210 215 220
Pro Lys Ile Thr Lys Asp Asn Phe Phe Asp Asn Pro Pro Ile Ser Thr
225 230 235 240
Ser Asn Leu Ser Leu Ile Met Val Pro Asn Asn Asn Ser Ile Thr Gly
245 250 255
Ala Gly Leu Ile Asp Asn Ser Cys Leu Ile Gly Thr Glu Asn Phe Met
260 265 270
Glu Asn Asn Ile Asn Gly Ile Ser Tyr Ser Asn Tyr Pro Asn Leu Asn
275 280 285
Thr Ile Gln Gly Phe Thr His Leu Asp His Val Leu Gly Ser Arg Glu
290 295 300
Glu Glu Asp Asp Asn Asp Asn Asn Asp Asn Thr Tyr Trp Asn Thr Ile
305 310 315 320
Leu Lys Ser Cys Thr Ser Phe Val Asp Gly Ser Ser Val Phe
325 330
<210> 13
<211> 2193
<212> DNA
<213> SEQ ID NO.13
<400> 13
atgtttaata accaccagca cttgctcgat atatcgtcct cagctcaacg aacacctgat 60
aacgagttgg atttcattcg tgatgaagag tttgatagca actctggtgc tgataacatg 120
gaagctccca attcaggtga tgacgatcaa gctgatccaa accaacctcc aaacaagaag 180
aagcgttatc atcgccacac tcagaatcag attcaggaaa tggagtcctt ttacaaggaa 240
tgcaatcatc cagatgacaa gcaaaggaag gaattgggaa gaagacttgg tttggagcca 300
ttacaagtga aattttggtt ccagaacaag cgtactcaga tgaaggctca acatgagcga 360
tgtgagaaca cacagttgag gaatgaaaat gagaagcttc gcgctgagaa cataaggtac 420
aaagaagctt tgagtaatgc agcatgccca aattgtggag ggccagcagc tataggagag 480
atgtcatttg atgagcatca gttgaggatt gaaaatgctc gtcttagaga tgagattgac 540
aggataactg gaatagctgg aaagtatgtt ggtaaatcag cccttggata ttctcatcaa 600
cttcctcttc ctcagcccga agctcctcgg gttctggatc ttgcttttgg gcctcaatcg 660
ggcctgcttg gagaaatgta cgctgctggt gaccttctaa gaactgctgt tacgggcctt 720
acagatgctg agaagcccgt ggtcattgag cttgctgtta ctgcaatgga ggaacttata 780
aggatggctc aaactgaaga gccattatgg ttgccaagct caggctctga gactttatgt 840
gagcaagaat atgctcgtat tttccctcga ggccttggac ctaagccagc tacactcaat 900
tctgaagcct cacgagaatc tgctgttgtg attatgaatc atatcaattt agttgagatt 960
ttgatggatg tgaaccaatg gactactgtt tttgctggtc tggtgtcaaa agcaatgact 1020
cttgaagtct tatcaactgg tgtcgcagga aatcacaatg gagcattgca agtgatgaca 1080
gcagaatttc aagttccatc tccacttgtt ccaactcggg agaactattt cttaagatac 1140
tgtaaacaac atggtgaagg gacttgggta gtggttgatg tttccctgga caacttgcgc 1200
actgtttcag ttccgcgttg cagaagaagg ccatctggtt gtttaatcca agaaatgcca 1260
aatggttact caagggttat atgggttgaa cacgttgagg tggatgaaaa tgctgtccat 1320
gacatctaca aacctcttgt caattctggg attgcatttg gagcaaaacg ctgggtagca 1380
actttagata gacaatgtga acgccttgca agtgtgttgg cgcttaacat cccaacagga 1440
gatgttggaa tcattactag tccagctggt cgaaagagta tgctaaaact tgctgagaga 1500
atggtgatga gcttttgtgc tggagttggt gcatcgacaa ctcacatatg gacaactttg 1560
tctggaagtg gtgcggatga tgttagagtc atgactagga agagtatcga tgatccaggg 1620
agacctcctg gtattgtgct gagtgctgca acatcttttt ggcttccagt ttctcctaag 1680
agagtgtttg attttctccg cgatgagaac tctagaaatg agtgggatat tctttcaaat 1740
ggtgggattg ttcaggaaat ggcacacatt gcaaatggtc gtgatccagg aaactgtgtt 1800
tctctactcc gtgtcaatac tggaacaaac tctaaccaga gtaacatgct gatactccaa 1860
gagagcacaa ctgatgtaac aggatcttac gtcatttacg ctcgagttga tattgctgca 1920
atgaacgtgg tgttaggtgg gggtgaccct gactatgttg ctctgttgcc atctggtttt 1980
gctattcttc cagacggacc gatgaattat catggtggag gtaattcaga aattgattct 2040
cctggtggat cgctactaac tgtagcattt cagatattgg ttgattcagt cccaactgca 2100
aagctttccc ttggctctgt tgcgactgtt aatagtctca tcaaatgcac cgttgaaaag 2160
atcaaaggtg ctgtaacttc cgcaaatgca tga 2193
<210> 14
<211> 730
<212> PRT
<213> SEQ ID NO.14
<400> 14
Met Phe Asn Asn His Gln His Leu Leu Asp Ile Ser Ser Ser Ala Gln
1 5 10 15
Arg Thr Pro Asp Asn Glu Leu Asp Phe Ile Arg Asp Glu Glu Phe Asp
20 25 30
Ser Asn Ser Gly Ala Asp Asn Met Glu Ala Pro Asn Ser Gly Asp Asp
35 40 45
Asp Gln Ala Asp Pro Asn Gln Pro Pro Asn Lys Lys Lys Arg Tyr His
50 55 60
Arg His Thr Gln Asn Gln Ile Gln Glu Met Glu Ser Phe Tyr Lys Glu
65 70 75 80
Cys Asn His Pro Asp Asp Lys Gln Arg Lys Glu Leu Gly Arg Arg Leu
85 90 95
Gly Leu Glu Pro Leu Gln Val Lys Phe Trp Phe Gln Asn Lys Arg Thr
100 105 110
Gln Met Lys Ala Gln His Glu Arg Cys Glu Asn Thr Gln Leu Arg Asn
115 120 125
Glu Asn Glu Lys Leu Arg Ala Glu Asn Ile Arg Tyr Lys Glu Ala Leu
130 135 140
Ser Asn Ala Ala Cys Pro Asn Cys Gly Gly Pro Ala Ala Ile Gly Glu
145 150 155 160
Met Ser Phe Asp Glu His Gln Leu Arg Ile Glu Asn Ala Arg Leu Arg
165 170 175
Asp Glu Ile Asp Arg Ile Thr Gly Ile Ala Gly Lys Tyr Val Gly Lys
180 185 190
Ser Ala Leu Gly Tyr Ser His Gln Leu Pro Leu Pro Gln Pro Glu Ala
195 200 205
Pro Arg Val Leu Asp Leu Ala Phe Gly Pro Gln Ser Gly Leu Leu Gly
210 215 220
Glu Met Tyr Ala Ala Gly Asp Leu Leu Arg Thr Ala Val Thr Gly Leu
225 230 235 240
Thr Asp Ala Glu Lys Pro Val Val Ile Glu Leu Ala Val Thr Ala Met
245 250 255
Glu Glu Leu Ile Arg Met Ala Gln Thr Glu Glu Pro Leu Trp Leu Pro
260 265 270
Ser Ser Gly Ser Glu Thr Leu Cys Glu Gln Glu Tyr Ala Arg Ile Phe
275 280 285
Pro Arg Gly Leu Gly Pro Lys Pro Ala Thr Leu Asn Ser Glu Ala Ser
290 295 300
Arg Glu Ser Ala Val Val Ile Met Asn His Ile Asn Leu Val Glu Ile
305 310 315 320
Leu Met Asp Val Asn Gln Trp Thr Thr Val Phe Ala Gly Leu Val Ser
325 330 335
Lys Ala Met Thr Leu Glu Val Leu Ser Thr Gly Val Ala Gly Asn His
340 345 350
Asn Gly Ala Leu Gln Val Met Thr Ala Glu Phe Gln Val Pro Ser Pro
355 360 365
Leu Val Pro Thr Arg Glu Asn Tyr Phe Leu Arg Tyr Cys Lys Gln His
370 375 380
Gly Glu Gly Thr Trp Val Val Val Asp Val Ser Leu Asp Asn Leu Arg
385 390 395 400
Thr Val Ser Val Pro Arg Cys Arg Arg Arg Pro Ser Gly Cys Leu Ile
405 410 415
Gln Glu Met Pro Asn Gly Tyr Ser Arg Val Ile Trp Val Glu His Val
420 425 430
Glu Val Asp Glu Asn Ala Val His Asp Ile Tyr Lys Pro Leu Val Asn
435 440 445
Ser Gly Ile Ala Phe Gly Ala Lys Arg Trp Val Ala Thr Leu Asp Arg
450 455 460
Gln Cys Glu Arg Leu Ala Ser Val Leu Ala Leu Asn Ile Pro Thr Gly
465 470 475 480
Asp Val Gly Ile Ile Thr Ser Pro Ala Gly Arg Lys Ser Met Leu Lys
485 490 495
Leu Ala Glu Arg Met Val Met Ser Phe Cys Ala Gly Val Gly Ala Ser
500 505 510
Thr Thr His Ile Trp Thr Thr Leu Ser Gly Ser Gly Ala Asp Asp Val
515 520 525
Arg Val Met Thr Arg Lys Ser Ile Asp Asp Pro Gly Arg Pro Pro Gly
530 535 540
Ile Val Leu Ser Ala Ala Thr Ser Phe Trp Leu Pro Val Ser Pro Lys
545 550 555 560
Arg Val Phe Asp Phe Leu Arg Asp Glu Asn Ser Arg Asn Glu Trp Asp
565 570 575
Ile Leu Ser Asn Gly Gly Ile Val Gln Glu Met Ala His Ile Ala Asn
580 585 590
Gly Arg Asp Pro Gly Asn Cys Val Ser Leu Leu Arg Val Asn Thr Gly
595 600 605
Thr Asn Ser Asn Gln Ser Asn Met Leu Ile Leu Gln Glu Ser Thr Thr
610 615 620
Asp Val Thr Gly Ser Tyr Val Ile Tyr Ala Arg Val Asp Ile Ala Ala
625 630 635 640
Met Asn Val Val Leu Gly Gly Gly Asp Pro Asp Tyr Val Ala Leu Leu
645 650 655
Pro Ser Gly Phe Ala Ile Leu Pro Asp Gly Pro Met Asn Tyr His Gly
660 665 670
Gly Gly Asn Ser Glu Ile Asp Ser Pro Gly Gly Ser Leu Leu Thr Val
675 680 685
Ala Phe Gln Ile Leu Val Asp Ser Val Pro Thr Ala Lys Leu Ser Leu
690 695 700
Gly Ser Val Ala Thr Val Asn Ser Leu Ile Lys Cys Thr Val Glu Lys
705 710 715 720
Ile Lys Gly Ala Val Thr Ser Ala Asn Ala
725 730
<210> 15
<211> 2193
<212> DNA
<213> SEQ ID NO.15
<400> 15
atgtttaata accaccagca cttgctcgat atatcgtcct cagctcaacg aacacctgat 60
aacgagttgg atttcattcg tgatgaagag tttgatagca actctggtgc tgataacatg 120
gaagctccca attcaggtga tgacgatcaa gctgatccaa accaacctcc aaacaagaag 180
aagcgttatc atcgccacac tcagaatcag attcaggaaa tggagtcctt ttacaaggaa 240
tgcaatcatc cagatgacaa gcaaaggaag gaattgggaa gaagacttgg tttggagcca 300
ttacaagtga aattttggtt ccagaacaag cgtactcaga tgaaggctca acatgagcga 360
tgtgagaaca cacagttgag gaatgaaaat gagaagcttc gcgctgagaa cataaggtac 420
aaagaagctt tgagtaatgc agcatgccca aattgtggag ggccagcagc tataggagag 480
atgtcatttg atgagcatca gttgaggatt gaaaatgctc gtcttagaga tgagattgac 540
aggataactg gaatagctgg aaagtatgtt ggtaaatcag cccttggata ttctcatcaa 600
cttcctcttc ctcagcccga agctcctcgg gttctggatc ttgcttttgg gcctcaatcg 660
ggcctgcttg gagaaatgta cgctgctggt gaccttctaa gaactgctgt tacgggcctt 720
acagatgctg agaagcccgt ggtcattgag cttgctgtta ctgcaatgga ggaacttata 780
aggatggctc aaactgaaga gccattatgg ttgccaagct caggctctga gactttatgt 840
gagcaagaat atgctcgtat tttccctcga ggccttggac ctaagccagc tacactcaat 900
tctgaagcct cacgagaatc tgctgttgtg attatgaatc atatcaattt agttgagatt 960
ttgatggatg tgaaccaatg gactactgtt tttgctggtc tggtgtcaaa agcaatgact 1020
cttgaagtct tatcaactgg tgtcgcagga aatcacaatg gagcattgca agtgatgaca 1080
gcagaatttc aagttccatc tccacttgtt ccaactcggg agaactattt cttaagatac 1140
tgtaaacaac atggtgaagg gacttgggta gtggttgatg tttccctgga caacttgcgc 1200
actgtttcag ttccgcgttg cagaagaagg ccatctggtt gtttaatcca agaaatgcca 1260
aatggttact caagggttat atgggttgaa cacgttgagg tggatgaaaa tgctgtccat 1320
gacatctaca aacctcttgt caattctggg attgcatttg gagcaaaacg ctgggtagca 1380
actttagata gacaatgtga acgccttgca agtgtgttgg cgcttaacat cccaacagga 1440
gatgttggaa tcattactag tccagctggt cgaaagagta tgctaaaact tgctgagaga 1500
atggtgatga gcttttgtgc tggagttggt gcatcgacaa ctcacatatg gacaactttg 1560
tctggaagtg gtgcggatga tgttagagtc atgactagga agagtatcga tgatccaggg 1620
agacctcctg gtattgtgct gagtgctgca acatcttttt ggcttccagt ttctcctaag 1680
agagtgtttg attttctccg cgatgagaac tctagaaatg agtgggatat tctttcaaat 1740
ggtgggattg ttcaggaaat ggcacacatt gcaaatggtc gtgatccagg aaactgtgtt 1800
tctctactcc gtgtcaatac tggaacaaac tctaaccaga gtaacatgct gatactccaa 1860
gagagcacaa ctgatgtaac aggatcttac gtcatttacg ctccagttga tattgctgca 1920
atgaacgtgg tgttaggtgg gggtgaccct gactatgttg ctctgttgcc atctggtttt 1980
gctattcttc cagacggacc gatgaattat catggtggag gtaattcaga aattgattct 2040
cctggtggat cgctactaac tgtagcattt cagagattgg tttattcagt cccaactgca 2100
aagctttccc ttggctctgt tgcgactgtt aatagtctca tcaaatgcac cgttgaaaag 2160
atcaaaggtg ctgtaacttc cgcaaatgca tga 2193
<210> 16
<211> 730
<212> PRT
<213> SEQ ID NO.16
<400> 16
Met Phe Asn Asn His Gln His Leu Leu Asp Ile Ser Ser Ser Ala Gln
1 5 10 15
Arg Thr Pro Asp Asn Glu Leu Asp Phe Ile Arg Asp Glu Glu Phe Asp
20 25 30
Ser Asn Ser Gly Ala Asp Asn Met Glu Ala Pro Asn Ser Gly Asp Asp
35 40 45
Asp Gln Ala Asp Pro Asn Gln Pro Pro Asn Lys Lys Lys Arg Tyr His
50 55 60
Arg His Thr Gln Asn Gln Ile Gln Glu Met Glu Ser Phe Tyr Lys Glu
65 70 75 80
Cys Asn His Pro Asp Asp Lys Gln Arg Lys Glu Leu Gly Arg Arg Leu
85 90 95
Gly Leu Glu Pro Leu Gln Val Lys Phe Trp Phe Gln Asn Lys Arg Thr
100 105 110
Gln Met Lys Ala Gln His Glu Arg Cys Glu Asn Thr Gln Leu Arg Asn
115 120 125
Glu Asn Glu Lys Leu Arg Ala Glu Asn Ile Arg Tyr Lys Glu Ala Leu
130 135 140
Ser Asn Ala Ala Cys Pro Asn Cys Gly Gly Pro Ala Ala Ile Gly Glu
145 150 155 160
Met Ser Phe Asp Glu His Gln Leu Arg Ile Glu Asn Ala Arg Leu Arg
165 170 175
Asp Glu Ile Asp Arg Ile Thr Gly Ile Ala Gly Lys Tyr Val Gly Lys
180 185 190
Ser Ala Leu Gly Tyr Ser His Gln Leu Pro Leu Pro Gln Pro Glu Ala
195 200 205
Pro Arg Val Leu Asp Leu Ala Phe Gly Pro Gln Ser Gly Leu Leu Gly
210 215 220
Glu Met Tyr Ala Ala Gly Asp Leu Leu Arg Thr Ala Val Thr Gly Leu
225 230 235 240
Thr Asp Ala Glu Lys Pro Val Val Ile Glu Leu Ala Val Thr Ala Met
245 250 255
Glu Glu Leu Ile Arg Met Ala Gln Thr Glu Glu Pro Leu Trp Leu Pro
260 265 270
Ser Ser Gly Ser Glu Thr Leu Cys Glu Gln Glu Tyr Ala Arg Ile Phe
275 280 285
Pro Arg Gly Leu Gly Pro Lys Pro Ala Thr Leu Asn Ser Glu Ala Ser
290 295 300
Arg Glu Ser Ala Val Val Ile Met Asn His Ile Asn Leu Val Glu Ile
305 310 315 320
Leu Met Asp Val Asn Gln Trp Thr Thr Val Phe Ala Gly Leu Val Ser
325 330 335
Lys Ala Met Thr Leu Glu Val Leu Ser Thr Gly Val Ala Gly Asn His
340 345 350
Asn Gly Ala Leu Gln Val Met Thr Ala Glu Phe Gln Val Pro Ser Pro
355 360 365
Leu Val Pro Thr Arg Glu Asn Tyr Phe Leu Arg Tyr Cys Lys Gln His
370 375 380
Gly Glu Gly Thr Trp Val Val Val Asp Val Ser Leu Asp Asn Leu Arg
385 390 395 400
Thr Val Ser Val Pro Arg Cys Arg Arg Arg Pro Ser Gly Cys Leu Ile
405 410 415
Gln Glu Met Pro Asn Gly Tyr Ser Arg Val Ile Trp Val Glu His Val
420 425 430
Glu Val Asp Glu Asn Ala Val His Asp Ile Tyr Lys Pro Leu Val Asn
435 440 445
Ser Gly Ile Ala Phe Gly Ala Lys Arg Trp Val Ala Thr Leu Asp Arg
450 455 460
Gln Cys Glu Arg Leu Ala Ser Val Leu Ala Leu Asn Ile Pro Thr Gly
465 470 475 480
Asp Val Gly Ile Ile Thr Ser Pro Ala Gly Arg Lys Ser Met Leu Lys
485 490 495
Leu Ala Glu Arg Met Val Met Ser Phe Cys Ala Gly Val Gly Ala Ser
500 505 510
Thr Thr His Ile Trp Thr Thr Leu Ser Gly Ser Gly Ala Asp Asp Val
515 520 525
Arg Val Met Thr Arg Lys Ser Ile Asp Asp Pro Gly Arg Pro Pro Gly
530 535 540
Ile Val Leu Ser Ala Ala Thr Ser Phe Trp Leu Pro Val Ser Pro Lys
545 550 555 560
Arg Val Phe Asp Phe Leu Arg Asp Glu Asn Ser Arg Asn Glu Trp Asp
565 570 575
Ile Leu Ser Asn Gly Gly Ile Val Gln Glu Met Ala His Ile Ala Asn
580 585 590
Gly Arg Asp Pro Gly Asn Cys Val Ser Leu Leu Arg Val Asn Thr Gly
595 600 605
Thr Asn Ser Asn Gln Ser Asn Met Leu Ile Leu Gln Glu Ser Thr Thr
610 615 620
Asp Val Thr Gly Ser Tyr Val Ile Tyr Ala Pro Val Asp Ile Ala Ala
625 630 635 640
Met Asn Val Val Leu Gly Gly Gly Asp Pro Asp Tyr Val Ala Leu Leu
645 650 655
Pro Ser Gly Phe Ala Ile Leu Pro Asp Gly Pro Met Asn Tyr His Gly
660 665 670
Gly Gly Asn Ser Glu Ile Asp Ser Pro Gly Gly Ser Leu Leu Thr Val
675 680 685
Ala Phe Gln Arg Leu Val Tyr Ser Val Pro Thr Ala Lys Leu Ser Leu
690 695 700
Gly Ser Val Ala Thr Val Asn Ser Leu Ile Lys Cys Thr Val Glu Lys
705 710 715 720
Ile Lys Gly Ala Val Thr Ser Ala Asn Ala
725 730
<210> 17
<211> 2719
<212> DNA
<213> SEQ ID NO.17
<400> 17
tacatacaga acttatgagg gaatttatgt ataagttata cagaataaaa gatggaataa 60
gttatgtgaa tattagtaat tatacatgta ttagaaagat aaattacaca tttatctcta 120
ttaatttatt tttaattttt ttattgaaaa ctttacataa ctattctatt ttctaaccta 180
ttttcatttc tttttttatg caaacaactt tttatttgtt ttcatttaaa atcagaaaat 240
atttattaat attatgaata ggaatgtata attttgatag cacatatgtt aattatatat 300
ttgctttata tattatatat tcacatattt aattatcatg tattaatttt tatttagaaa 360
gtgaatttat ataatattta taaatttggt atattaaaac taggggtgta caaactgaat 420
tattaagtca aatcgaaccg acgaaccaaa tcaaatcgaa aataatcgac ttgtggtttg 480
gcgattgaaa aaaaactcga ccactcttga tttggtttgg tattagaaaa aaaaaatcaa 540
accgaactca accccccccc ccccccccac aaacacaatt tattttattt atagataaaa 600
aatattatct acatataatc tactttggta atatgtattt ttttttgtta attcatgatt 660
ttcaatttta tatatttatg tcaaatttta gattttgtaa aattttgtat atatacacat 720
atatataatt tattttattt atagctaaaa aatattattt ataatgtatt tgctaatata 780
tttttagtta atttatagtt ttcaacgtgt gtatatatat atgtgtgtat atatatatat 840
atatatataa aatcaataac caaacgtccc ttactctcca tatatccagt ttcctacttt 900
aaattagttg ccatcaaaca gttctaagta tgaatttgta atctactcat gatcctaata 960
ctcaacctac tctcctcctt ttaatttata atgatcatac tcgaactaaa tttcataatc 1020
aacacactat tattaagttt atcaattact ctccttaata gtagtttcct tctataaatt 1080
agttgccatc aaataatttc aattatgaat ttataatcta ctcatgatcc taatactcaa 1140
tctactttcc tcctcctaat ttataatgat cctagtcaag ctcgatttca taatcaatac 1200
actattatta agttaatcaa tactttactt atatcttgtt taaatttgtt gccatcaaat 1260
acttcttgag aaagaatttg taatgatcct actcaaatta cttgccttct tttaatctac 1320
aatgatccta ctcaaattaa ataggagtat tagattgaag tcataattaa gttgtttcta 1380
ccatattccc tatattagaa aatttaaaca tcaaagatga gttgctgcca acttctcatt 1440
ttctctctca acaacagcat taacacctcc cattaatgct atttcttcct ctcaataacc 1500
tctgctataa taataactcc cttcatcttc tccacctgtt ccctataatc cctctttttt 1560
cccctcttct ccatagctcc tttaattctc ttcacaataa acttttttga acacacacac 1620
acaaaaaaaa aaatcagact ccaaaaagtt tgactagatc tggattaatt ttctacactt 1680
gttgtttttt taatccaaag gcttcacaga tagtaagttt actgttccat cattctctta 1740
cttgctttct tctttttttc cctctcttga attgtttgct cagatgaaca tccttcgtca 1800
aaaaattata ccacagcaaa ataaattatt gttttttcat gtgtcgagag acttgtatag 1860
aagtgaagtt tggactttac cagatattag ttgttgagag ctatatatat gttttaccca 1920
tttttgaaaa aggaaaagaa aaaagagaag accctgtgtc cgtgttcaca cattgagtca 1980
actttatgtt ttgttattat tccaaaattt tctcattgca cttttaacac agtcaaaaag 2040
aaataaattt tattttcctt tgtcacatca agacatagaa aaaaatagaa taaaaacaaa 2100
cacgtcccaa aataaaatga ggtaaaaaga attaaagttt attttacata gagacaagaa 2160
agtcagtaga aaatgtggtg ggggttggag gttggggggg tgggtttagg gtttgaggag 2220
atttttttgt gtatatatat atcatacagt gagaagtgaa attctaccat cacatcaaca 2280
atgccttctt gagaagctca gtttcttgag taagtccatt tgttctttaa acgttttttt 2340
ttcttttttt gtgttatttg tatactatct tcagctaatt ctcaacaaca aatacactga 2400
ttttgatttt taaaccatgg tggcatgacg aattaaggaa ccctagagca aaaaatgtaa 2460
atttataagt cgtcacatcc taaaaatgaa tcacttttaa ccatttagta catgtttgtg 2520
aatttttatt tttttttata aaaaaaaaat cctgaacata tgttgttgag cttttgcttt 2580
tgtccacgct tccgtatata taatctttag catgtgtatg gtttttgtaa atcataaaca 2640
tgggcgtttt tttttttttt tgcagatgaa ataatttatc agagaaatta ttttgaagaa 2700
gatcgagaag gtattcaag 2719
<210> 18
<211> 24
<212> DNA
<213> CR-MTR1-F
<400> 18
gattatgagc aggagaaatg gaaa 24
<210> 19
<211> 24
<212> DNA
<213> CR-MTR1-R
<400> 19
aaactttcca tttctcctgc tcat 24
<210> 20
<211> 24
<212> DNA
<213> CR-MTR2-F
<400> 20
gattgttcaa aatgaactac agga 24
<210> 21
<211> 24
<212> DNA
<213> CR-MTR2-R
<400> 21
aaactcctgt agttcatttt gaac 24
<210> 22
<211> 24
<212> DNA
<213> CR-MTR3-F
<400> 22
gattatgagt agaaggaatg gaaa 24
<210> 23
<211> 24
<212> DNA
<213> CR-MTR3-R
<400> 23
aaactttcca ttccttctac tcat 24
<210> 24
<211> 24
<212> DNA
<213> CR-LFS-F
<400> 24
gattccctcc gcggtgcgat accg 24
<210> 25
<211> 24
<212> DNA
<213> CR-LFS-R
<400> 25
aaaccggtat cgcaccgcgg aggg 24
<210> 26
<211> 24
<212> DNA
<213> CR-Slwox3b-F
<400> 26
gattgtttca aaaccataaa gcta 24
<210> 27
<211> 24
<212> DNA
<213> CR-Slwox3b-R
<400> 27
aaactagctt tatggttttg aaac 24
<210> 28
<211> 26
<212> DNA
<213> CR-MX1-F
<400> 28
gattgtgctt tgccaactaa agccgg 26
<210> 29
<211> 26
<212> DNA
<213> CR-MX1-R
<400> 29
aaacccggct ttagttggca aagcac 26
<210> 30
<211> 46
<212> DNA
<213> LFS-EcoRI-F
<400> 30
accggtacca agcttgaatt catggaagat gcaatgagaa gactta 46
<210> 31
<211> 45
<212> DNA
<213> LFS-EcoRI-R
<400> 31
cccttgctca ccatgaattc agcattttgc agcttggctg caaaa 45
<210> 32
<211> 40
<212> DNA
<213> pWo-SacI-F
<400> 32
aacgcgttgg gagctctaca tacagaactt atgagggaat 40
<210> 33
<211> 44
<212> DNA
<213> pWo-XbaI-R
<400> 33
cattaaagca ggactctaga cttgaatacc ttctcgatct tctt 44
<210> 34
<211> 45
<212> DNA
<213> Wo-XbaI-F
<400> 34
agaaggtatt caagtctaga atgtttaata accaccagca cttgc 45
<210> 35
<211> 46
<212> DNA
<213> Wo-XbaI-R
<400> 35
cattaaagca ggactctaga tcactgcatt tgcggaagtt acagca 46
Claims (10)
1.用于调控植物多细胞表皮毛分化的方法,其特征在于:所述方法为突变植物中的表皮毛发育相关基因;所述表皮毛发育相关基因包括:MTR1基因、MTR2基因、MTR3基因、LFS基因、Slwox3b基因、MX1基因、WoP635R基因和WoI692RD695Y;
所述MTR1基因的核苷酸序列如SEQ ID NO.1所示;氨基酸序列如SEQ ID NO.2所示;
所述MTR2基因的核苷酸序列如SEQ ID NO.3所示;氨基酸序列如SEQ ID NO.4所示;
所述MTR3基因的核苷酸序列如SEQ ID NO.5所示;氨基酸序列如SEQ ID NO.6所示;
所述LFS基因的核苷酸序列如SEQ ID NO.7所示;氨基酸序列如SEQ ID NO.8所示;
所述Slwox3b基因的核苷酸序列如SEQ ID NO.9所示;氨基酸序列如SEQ ID NO.10所示;
所述MX1基因的核苷酸序列如SEQ ID NO.11所示;氨基酸序列如SEQ ID NO.12所示;
所述WoP635R基因的核苷酸序列如SEQ ID NO.13所示;氨基酸序列如SEQ ID NO.14所示;
所述WoI692RD695Y基因的核苷酸序列如SEQ ID NO.15所示;氨基酸序列如SEQ ID NO.16所示。
2.根据权利要求1所述的用于调控植物多细胞表皮毛分化的方法,其特征在于:所述突变方法包括基因敲除方法,或者基因沉默表达方法,或基因超量表达方法。
3.根据权利要求1所述的用于调控植物多细胞表皮毛分化的方法,其特征在于:所述表皮毛发育相关基因具有与上述表皮毛发育相关基因MTR1基因、MTR2基因、MTR3基因、LFS基因、Slwox3b基因、MX1基因和WoP635R基因,至少70%相似度核苷酸序列或者氨基酸序列的基因。
4.根据权利要求1所述的用于调控植物多细胞表皮毛分化的方法,其特征在于:敲除或者沉默LFS基因;或者敲除或沉默MTR1、MTR2、MTR3基因中的一个或多个,促进植物指状多细胞表皮毛分化,同时抑制盾状分泌型表皮毛的形成。
5.根据权利要求1所述的用于调控植物多细胞表皮毛分化的方法,其特征在于:超表达LFS基因;或者敲除或沉默Slwox3b基因和MX1基因中的一个或两个,促进盾状分泌型表皮毛的形成。
6.根据权利要求1所述的用于调控植物多细胞表皮毛分化的方法,其特征在于:同时敲除或者沉默Slwox3b基因、MX1基因、MTR1基因、MTR2基因和MTR3基因,促进植物盾状分泌型表皮毛的形成。
7.根据权利要求1所述的用于调控植物多细胞表皮毛分化的方法,其特征在于:敲除或者沉默Slwox3b基因、MX1基因的同时通过特异表达或者超量表达Woolly基因功能获得型突变等位基因WoP635R和/或WoI692RD695Y,促进植物盾状分泌型表皮毛的形成。
8.一种表达载体,其特征在于:所述表达载体含有权利要求1所述方法中的表皮毛发育相关基因。
9.根据权利要求8所述的表达载体,其特征在于:包括重组转化所形成的突变体;所述突变体包括茄科、桑科和其他具有多细胞表皮毛的植物,包括但不限于烟草、番茄、大麻。
10.权利要求1所述方法在促进或抑制植物多细胞表皮毛分化中的应用。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111506682.XA CN114181948B (zh) | 2021-12-10 | 2021-12-10 | 用于调控植物多细胞表皮毛分化的方法 |
PCT/CN2022/090120 WO2023103267A1 (zh) | 2021-12-10 | 2022-04-29 | 用于调控植物多细胞表皮毛分化的方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111506682.XA CN114181948B (zh) | 2021-12-10 | 2021-12-10 | 用于调控植物多细胞表皮毛分化的方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114181948A true CN114181948A (zh) | 2022-03-15 |
CN114181948B CN114181948B (zh) | 2023-12-22 |
Family
ID=80543053
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111506682.XA Active CN114181948B (zh) | 2021-12-10 | 2021-12-10 | 用于调控植物多细胞表皮毛分化的方法 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114181948B (zh) |
WO (1) | WO2023103267A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023103267A1 (zh) * | 2021-12-10 | 2023-06-15 | 福建农林大学 | 用于调控植物多细胞表皮毛分化的方法 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103030686A (zh) * | 2011-09-29 | 2013-04-10 | 中国农业大学 | 与植物表皮毛发育相关的蛋白及其编码基因与应用 |
CN104046631A (zh) * | 2014-05-19 | 2014-09-17 | 华中农业大学 | 控制番茄表皮毛发生的关键基因SlCycB 2的克隆与鉴定 |
CN104561025A (zh) * | 2013-10-29 | 2015-04-29 | 华中农业大学 | 番茄SlML1基因及应用 |
CN106609267A (zh) * | 2015-10-21 | 2017-05-03 | 中国农业大学 | 黄瓜CsTTG1基因在调控植物表皮毛、果刺及刺瘤发育中的应用 |
CN110964731A (zh) * | 2019-12-25 | 2020-04-07 | 华中农业大学 | 番茄茸毛调控基因的克隆及应用 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101985623B (zh) * | 2010-07-20 | 2012-08-29 | 华中农业大学 | 一个控制番茄茸毛发生的关键基因Wo的克隆及应用 |
CN114181948B (zh) * | 2021-12-10 | 2023-12-22 | 福建农林大学 | 用于调控植物多细胞表皮毛分化的方法 |
CN113980969A (zh) * | 2021-12-10 | 2022-01-28 | 福建农林大学 | 用于操纵在植物表皮毛细胞中特异性表达的启动子 |
-
2021
- 2021-12-10 CN CN202111506682.XA patent/CN114181948B/zh active Active
-
2022
- 2022-04-29 WO PCT/CN2022/090120 patent/WO2023103267A1/zh unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103030686A (zh) * | 2011-09-29 | 2013-04-10 | 中国农业大学 | 与植物表皮毛发育相关的蛋白及其编码基因与应用 |
CN104561025A (zh) * | 2013-10-29 | 2015-04-29 | 华中农业大学 | 番茄SlML1基因及应用 |
CN104046631A (zh) * | 2014-05-19 | 2014-09-17 | 华中农业大学 | 控制番茄表皮毛发生的关键基因SlCycB 2的克隆与鉴定 |
CN106609267A (zh) * | 2015-10-21 | 2017-05-03 | 中国农业大学 | 黄瓜CsTTG1基因在调控植物表皮毛、果刺及刺瘤发育中的应用 |
CN110964731A (zh) * | 2019-12-25 | 2020-04-07 | 华中农业大学 | 番茄茸毛调控基因的克隆及应用 |
Non-Patent Citations (11)
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023103267A1 (zh) * | 2021-12-10 | 2023-06-15 | 福建农林大学 | 用于调控植物多细胞表皮毛分化的方法 |
Also Published As
Publication number | Publication date |
---|---|
WO2023103267A1 (zh) | 2023-06-15 |
CN114181948B (zh) | 2023-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018113702A1 (en) | Plant grain trait-related protein, gene, promoter and snps and haplotypes | |
CN114045303B (zh) | 一套用于水稻的人工基因编辑系统 | |
CN112011567B (zh) | 水稻pal1基因及其编码蛋白与应用 | |
CN110106183A (zh) | 一种抗除草剂基因及其应用 | |
CN112250742B (zh) | 蛋白质及其相关生物材料在调控植物机械强度中的用途 | |
CN114181948B (zh) | 用于调控植物多细胞表皮毛分化的方法 | |
CN108795943B (zh) | 一种植物特异表达启动子POssalt2及其应用 | |
CN112126652B (zh) | 水稻OsAUX3基因在调控水稻种子粒长中的应用 | |
CN107365772B (zh) | 一种植物花粉特异性启动子psp1及其应用 | |
CN115466747B (zh) | 糖基转移酶ZmKOB1基因及其在调控玉米雌穗结实性状或发育上的应用 | |
CN104911191A (zh) | 一种育性调控基因fg3及其应用 | |
CN110950944B (zh) | OsHCRF1功能蛋白及其编码基因在水稻育种中的应用 | |
CN111100867B (zh) | 水稻铁氧还蛋白编码基因OsFd1、该基因编码的蛋白质及其用途 | |
CN112680472B (zh) | ZmSPL基因在调控玉米冠根或气生根发育中的应用 | |
CN112553203A (zh) | 一种调控杨树生长发育的长链非编码RNA-lnc5及其应用 | |
CN109082437B (zh) | 一种提高大麦分蘖数量的方法 | |
US6232527B1 (en) | Maize Rad2/FEN-1 orthologues and uses thereof | |
US7034117B2 (en) | RAD51 polypeptides and uses thereof | |
CN109988775A (zh) | 一种调控柽柳耐盐性的关键基因TcSBP5及其应用 | |
CN112143744B (zh) | OsPLDδ3基因在控制水稻抗旱性中的应用 | |
CN114516908B (zh) | 水稻粒形调控蛋白hos59及其编码基因和应用 | |
CN114891802B (zh) | OsDUF6基因及其编码蛋白在水稻耐盐性育种中的应用 | |
CN113846109B (zh) | OsCRRP在水稻耐热胁迫中的应用 | |
US7057020B2 (en) | Isolated RuvB polypeptides | |
CN118531052A (zh) | 一种与水稻落粒性相关的蛋白OsCCT22及其编码基因与应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |