CN112813107B - 一种龙睛百褶裙泰狮金鱼的创制方法 - Google Patents
一种龙睛百褶裙泰狮金鱼的创制方法 Download PDFInfo
- Publication number
- CN112813107B CN112813107B CN202110089136.4A CN202110089136A CN112813107B CN 112813107 B CN112813107 B CN 112813107B CN 202110089136 A CN202110089136 A CN 202110089136A CN 112813107 B CN112813107 B CN 112813107B
- Authority
- CN
- China
- Prior art keywords
- asp
- gly
- cys
- ser
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 241000252229 Carassius auratus Species 0.000 title claims abstract description 85
- 238000000034 method Methods 0.000 title claims abstract description 45
- 241000282320 Panthera leo Species 0.000 title claims abstract description 20
- 241000251468 Actinopterygii Species 0.000 claims abstract description 71
- 240000001008 Dimocarpus longan Species 0.000 claims abstract description 19
- 108091033409 CRISPR Proteins 0.000 claims abstract description 18
- 230000004720 fertilization Effects 0.000 claims abstract description 14
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 9
- 238000000520 microinjection Methods 0.000 claims abstract description 7
- 108010007622 LDL Lipoproteins Proteins 0.000 claims abstract description 5
- 235000013601 eggs Nutrition 0.000 claims description 51
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 43
- 230000012447 hatching Effects 0.000 claims description 23
- 108090000623 proteins and genes Proteins 0.000 claims description 21
- 238000002635 electroconvulsive therapy Methods 0.000 claims description 19
- 102000002322 Egg Proteins Human genes 0.000 claims description 15
- 108010000912 Egg Proteins Proteins 0.000 claims description 15
- 210000004681 ovum Anatomy 0.000 claims description 15
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 13
- 238000002156 mixing Methods 0.000 claims description 12
- 108020004999 messenger RNA Proteins 0.000 claims description 11
- 238000012163 sequencing technique Methods 0.000 claims description 10
- 108020004414 DNA Proteins 0.000 claims description 9
- 108020005004 Guide RNA Proteins 0.000 claims description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 claims description 9
- 230000001276 controlling effect Effects 0.000 claims description 9
- 230000001105 regulatory effect Effects 0.000 claims description 9
- 230000035939 shock Effects 0.000 claims description 9
- 238000002347 injection Methods 0.000 claims description 8
- 239000007924 injection Substances 0.000 claims description 8
- 238000012408 PCR amplification Methods 0.000 claims description 7
- 238000000338 in vitro Methods 0.000 claims description 7
- 210000000582 semen Anatomy 0.000 claims description 7
- 239000007788 liquid Substances 0.000 claims description 6
- 241000252233 Cyprinus carpio Species 0.000 claims description 5
- 239000005457 ice water Substances 0.000 claims description 5
- 239000012528 membrane Substances 0.000 claims description 5
- 238000013518 transcription Methods 0.000 claims description 5
- 230000035897 transcription Effects 0.000 claims description 5
- 238000010367 cloning Methods 0.000 claims description 4
- 239000000243 solution Substances 0.000 claims description 4
- 230000007480 spreading Effects 0.000 claims description 4
- 238000003892 spreading Methods 0.000 claims description 4
- 230000009466 transformation Effects 0.000 claims description 4
- 241001465754 Metazoa Species 0.000 claims description 3
- 230000002779 inactivation Effects 0.000 claims description 3
- 230000002611 ovarian Effects 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- 230000000644 propagated effect Effects 0.000 claims description 2
- 241000238366 Cephalopoda Species 0.000 claims 2
- 108700024394 Exon Proteins 0.000 claims 1
- 108091092195 Intron Proteins 0.000 claims 1
- 238000009395 breeding Methods 0.000 abstract description 11
- 230000001488 breeding effect Effects 0.000 abstract description 11
- 238000005516 engineering process Methods 0.000 abstract description 6
- 238000013461 design Methods 0.000 abstract description 5
- 238000010362 genome editing Methods 0.000 abstract description 4
- 238000012546 transfer Methods 0.000 abstract description 4
- 238000011534 incubation Methods 0.000 abstract description 2
- 238000002474 experimental method Methods 0.000 abstract 1
- 108010047857 aspartylglycine Proteins 0.000 description 18
- 108010038633 aspartylglutamate Proteins 0.000 description 15
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 11
- 108010060199 cysteinylproline Proteins 0.000 description 9
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 238000001914 filtration Methods 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 7
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 7
- 108010048818 seryl-histidine Proteins 0.000 description 7
- 230000007306 turnover Effects 0.000 description 7
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 6
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 108010016616 cysteinylglycine Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 5
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 5
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 108010047562 NGR peptide Proteins 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 230000003203 everyday effect Effects 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 4
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 4
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 4
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 4
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 4
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 4
- 229920000742 Cotton Polymers 0.000 description 4
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 4
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 4
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 4
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 4
- JERJIYYCOGBAIJ-OBAATPRFSA-N Ile-Tyr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JERJIYYCOGBAIJ-OBAATPRFSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 4
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 4
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010027338 isoleucylcysteine Proteins 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 230000021121 meiosis Effects 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 3
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 3
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 3
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 3
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 3
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 3
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 3
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 241000195628 Chlorophyta Species 0.000 description 3
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 3
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 3
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 3
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 3
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 3
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 3
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 3
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 3
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 3
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 3
- 241000239221 Tachypleus gigas Species 0.000 description 3
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 3
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 3
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 3
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 210000003128 head Anatomy 0.000 description 3
- 230000009027 insemination Effects 0.000 description 3
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 3
- 230000000394 mitotic effect Effects 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000008399 tap water Substances 0.000 description 3
- 235000020679 tap water Nutrition 0.000 description 3
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- 241000395633 Andrias japonicus Species 0.000 description 2
- 241000238426 Anostraca Species 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 2
- 241000238582 Artemia Species 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 2
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- FMWHSNJMHUNLAG-FXQIFTODSA-N Asp-Cys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FMWHSNJMHUNLAG-FXQIFTODSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000256135 Chironomus thummi Species 0.000 description 2
- 241000195493 Cryptophyta Species 0.000 description 2
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 2
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 2
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 2
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 2
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 2
- UWXFFVQPAMBETM-ZLUOBGJFSA-N Cys-Asp-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UWXFFVQPAMBETM-ZLUOBGJFSA-N 0.000 description 2
- BPHKULHWEIUDOB-FXQIFTODSA-N Cys-Gln-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BPHKULHWEIUDOB-FXQIFTODSA-N 0.000 description 2
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 2
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 2
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 2
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 2
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 2
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 2
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 2
- LDTJBEOANMQRJE-CIUDSAMLSA-N His-Cys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LDTJBEOANMQRJE-CIUDSAMLSA-N 0.000 description 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 2
- WPUAVVXYEJAWIV-KKUMJFAQSA-N His-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WPUAVVXYEJAWIV-KKUMJFAQSA-N 0.000 description 2
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 2
- WSAILOWUJZEAGC-DCAQKATOSA-N His-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSAILOWUJZEAGC-DCAQKATOSA-N 0.000 description 2
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 2
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- 208000035199 Tetraploidy Diseases 0.000 description 2
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 2
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 2
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- XEEHBQOUZBQVAJ-BPUTZDHNSA-N Trp-Arg-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N XEEHBQOUZBQVAJ-BPUTZDHNSA-N 0.000 description 2
- NLYCSLWTDMPLSX-QEJZJMRPSA-N Trp-Gln-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NLYCSLWTDMPLSX-QEJZJMRPSA-N 0.000 description 2
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 2
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 230000001079 digestive effect Effects 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 238000006213 oxygenation reaction Methods 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 2
- 235000017557 sodium bicarbonate Nutrition 0.000 description 2
- 230000001954 sterilising effect Effects 0.000 description 2
- 238000004659 sterilization and disinfection Methods 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- CWRBRVZBMVJENN-UVBJJODRSA-N Ala-Trp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CWRBRVZBMVJENN-UVBJJODRSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 1
- VSPLYCLMFAUZRF-GUBZILKMSA-N Arg-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N VSPLYCLMFAUZRF-GUBZILKMSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- PZVMBNFTBWQWQL-DCAQKATOSA-N Arg-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PZVMBNFTBWQWQL-DCAQKATOSA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YWFLXGZHZXXINF-BPUTZDHNSA-N Asn-Pro-Trp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 YWFLXGZHZXXINF-BPUTZDHNSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 1
- IAMNNSSEBXDJMN-CIUDSAMLSA-N Asp-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N IAMNNSSEBXDJMN-CIUDSAMLSA-N 0.000 description 1
- WXASLRQUSYWVNE-FXQIFTODSA-N Asp-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WXASLRQUSYWVNE-FXQIFTODSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- JQWFNULBEWBJQM-FKEBYFGASA-N Asp-Phe-Val-Tyr Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC(O)=O)C1=CC=CC=C1 JQWFNULBEWBJQM-FKEBYFGASA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- DWOSGXZMLQNDBN-FXQIFTODSA-N Asp-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O DWOSGXZMLQNDBN-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241001609213 Carassius carassius Species 0.000 description 1
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 description 1
- 241000252210 Cyprinidae Species 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- IIGHQOPGMGKDMT-SRVKXCTJSA-N Cys-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N IIGHQOPGMGKDMT-SRVKXCTJSA-N 0.000 description 1
- XRTISHJEPHMBJG-SRVKXCTJSA-N Cys-Asp-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XRTISHJEPHMBJG-SRVKXCTJSA-N 0.000 description 1
- YRKJQKATZOTUEN-ACZMJKKPSA-N Cys-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N YRKJQKATZOTUEN-ACZMJKKPSA-N 0.000 description 1
- KEBJBKIASQVRJS-WDSKDSINSA-N Cys-Gln-Gly Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N KEBJBKIASQVRJS-WDSKDSINSA-N 0.000 description 1
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 1
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 1
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- XELISBQUZZAPQK-CIUDSAMLSA-N Cys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N XELISBQUZZAPQK-CIUDSAMLSA-N 0.000 description 1
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 1
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- HJGUQJJJXQGXGJ-FXQIFTODSA-N Cys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HJGUQJJJXQGXGJ-FXQIFTODSA-N 0.000 description 1
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 1
- XCDDSPYIMNXECQ-NAKRPEOUSA-N Cys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS XCDDSPYIMNXECQ-NAKRPEOUSA-N 0.000 description 1
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 1
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- UKHNKRGNFKSHCG-CUJWVEQBSA-N Cys-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N)O UKHNKRGNFKSHCG-CUJWVEQBSA-N 0.000 description 1
- IWVNIQXKTIQXCT-SRVKXCTJSA-N Cys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IWVNIQXKTIQXCT-SRVKXCTJSA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 206010017472 Fumbling Diseases 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 1
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 1
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- YJSCHRBERYWPQL-DCAQKATOSA-N Gln-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N YJSCHRBERYWPQL-DCAQKATOSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- QENSHQJGWGRPQS-QEJZJMRPSA-N Gln-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 QENSHQJGWGRPQS-QEJZJMRPSA-N 0.000 description 1
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- AASLOGQZZKZWKH-SRVKXCTJSA-N His-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AASLOGQZZKZWKH-SRVKXCTJSA-N 0.000 description 1
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 1
- WYWBYSPRCFADBM-GARJFASQSA-N His-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O WYWBYSPRCFADBM-GARJFASQSA-N 0.000 description 1
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 1
- NJZGEXYLSFGPHG-GUBZILKMSA-N His-Gln-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NJZGEXYLSFGPHG-GUBZILKMSA-N 0.000 description 1
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 1
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 1
- CZXKZMQKXQZDEX-YUMQZZPRSA-N His-Gly-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N CZXKZMQKXQZDEX-YUMQZZPRSA-N 0.000 description 1
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 1
- BRZQWIIFIKTJDH-VGDYDELISA-N His-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BRZQWIIFIKTJDH-VGDYDELISA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- FJCGVRRVBKYYOU-DCAQKATOSA-N His-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N FJCGVRRVBKYYOU-DCAQKATOSA-N 0.000 description 1
- FBCURAVMSXNOLP-JYJNAYRXSA-N His-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBCURAVMSXNOLP-JYJNAYRXSA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- AVQNTYBAFBKMDL-WDSOQIARSA-N His-Pro-Trp Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O AVQNTYBAFBKMDL-WDSOQIARSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- CSRRMQFXMBPSIL-SIXJUCDHSA-N His-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N CSRRMQFXMBPSIL-SIXJUCDHSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- DMAPKBANYNZHNR-ULQDDVLXSA-N His-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DMAPKBANYNZHNR-ULQDDVLXSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- UFRXVQGGPNSJRY-CYDGBPFRSA-N Ile-Met-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N UFRXVQGGPNSJRY-CYDGBPFRSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- YBHKCXNNNVDYEB-SPOWBLRKSA-N Ile-Trp-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N YBHKCXNNNVDYEB-SPOWBLRKSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- SVJRVFPSHPGWFF-DCAQKATOSA-N Lys-Cys-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVJRVFPSHPGWFF-DCAQKATOSA-N 0.000 description 1
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- OFNCSQNBSWGGNV-DCAQKATOSA-N Met-Cys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 OFNCSQNBSWGGNV-DCAQKATOSA-N 0.000 description 1
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- RUTZUJXAVNWLQP-BVSLBCMMSA-N Met-Tyr-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 RUTZUJXAVNWLQP-BVSLBCMMSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- BLJMJZOMZRCESA-GUBZILKMSA-N Pro-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BLJMJZOMZRCESA-GUBZILKMSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- MDAWMJUZHBQTBO-XGEHTFHBSA-N Pro-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1)O MDAWMJUZHBQTBO-XGEHTFHBSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- FZNNGIHSIPKFRE-QEJZJMRPSA-N Ser-Trp-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZNNGIHSIPKFRE-QEJZJMRPSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- 241000239222 Tachypleus Species 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- IJRXQJVGFBSKIV-ZFWWWQNUSA-N Trp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N IJRXQJVGFBSKIV-ZFWWWQNUSA-N 0.000 description 1
- BONYBFXWMXBAND-GQGQLFGLSA-N Trp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BONYBFXWMXBAND-GQGQLFGLSA-N 0.000 description 1
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 1
- KOVPHHXMHLFWPL-BPUTZDHNSA-N Trp-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CC(=O)N)C(=O)O KOVPHHXMHLFWPL-BPUTZDHNSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- XXJDYWYVZBHELV-TUSQITKMSA-N Trp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCCCN)C(=O)O)N XXJDYWYVZBHELV-TUSQITKMSA-N 0.000 description 1
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 1
- STJXERBCEWQLKS-IHPCNDPISA-N Trp-Tyr-Cys Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 STJXERBCEWQLKS-IHPCNDPISA-N 0.000 description 1
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- DTWMJYGOUWNWEC-IHPCNDPISA-N Tyr-Trp-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 DTWMJYGOUWNWEC-IHPCNDPISA-N 0.000 description 1
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010031045 aspartyl-glycyl-aspartyl-alanine Proteins 0.000 description 1
- 108010027234 aspartyl-glycyl-glutamyl-alanine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000037237 body shape Effects 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000000460 chlorine Substances 0.000 description 1
- 229910052801 chlorine Inorganic materials 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 235000013325 dietary fiber Nutrition 0.000 description 1
- 210000002249 digestive system Anatomy 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 208000010824 fish disease Diseases 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- CXKWCBBOMKCUKX-UHFFFAOYSA-M methylene blue Chemical compound [Cl-].C1=CC(N(C)C)=CC2=[S+]C3=CC(N(C)C)=CC=C3N=C21 CXKWCBBOMKCUKX-UHFFFAOYSA-M 0.000 description 1
- 229960000907 methylthioninium chloride Drugs 0.000 description 1
- 230000011278 mitosis Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000001706 oxygenating effect Effects 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000009394 selective breeding Methods 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/89—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation using microinjection
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K61/00—Culture of aquatic animals
- A01K61/10—Culture of aquatic animals of fish
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
- A01K67/0276—Knock-out vertebrates
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/775—Apolipopeptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/075—Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/40—Fish
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/02—Animal zootechnically ameliorated
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/80—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in fisheries management
- Y02A40/81—Aquaculture, e.g. of fish
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Environmental Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Animal Husbandry (AREA)
- Medicinal Chemistry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Physics & Mathematics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Gastroenterology & Hepatology (AREA)
- Toxicology (AREA)
- Plant Pathology (AREA)
- Marine Sciences & Fisheries (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明涉及鱼类分子设计育种技术,尤其涉及一种龙睛百褶裙泰狮金鱼的创制方法。该方法包括靶点设计、显微注射、受精孵化步骤。本发明基于CRISPR/Cas9的基因编辑技术,对低密度脂蛋白2基因lrp2a进行敲除后在F0代金鱼上即产生了龙睛表型,通过实验确立了百褶裙泰狮金鱼高效的雌核生殖条件和高温扩群条件,在F1代即获得龙睛百褶裙泰狮金鱼纯合子;并通过温度诱导性逆转,F2代大量扩群。本发明通过分子育种手段,快速实现了单一目的性状的转移,稳定快速地创制了龙睛百褶裙泰狮金鱼这一具有高观赏价值的品种,具有广阔的应用前景和市场价值。
Description
技术领域
本发明涉及鱼类分子设计育种技术,尤其涉及一种龙睛百褶裙泰狮金鱼的创制方法。
技术背景
金鱼起源于中国的四倍体鲫,是经过数百年人工选择形成的观赏鱼,堪称我国国粹。在金鱼发展史中,由于突变,金鱼的体色、体型和几乎所有外部器官都发生明显的改变,在人工选择作用下,这些突变被保留下来,后这些突变通过杂交重组,形成多种复合变异的金鱼;如今,金鱼已有超过300多个品种。在金鱼多个突变表型中,龙睛是重要的观赏性状。龙睛性状因其双目外凸,和神话传说中龙的眼睛相似,故而得名。龙是中华名族的图腾,以龙命名可见人们对它的推崇。龙睛金鱼从出现一直流行至今,具有龙睛性状的金鱼也在金鱼品种占据重要的地位。
传统金鱼养在鱼盆中,品种的选育都以俯视观赏为主。鱼缸普及让侧视观赏成为趋势。百褶裙泰狮是侧视金鱼中最重要的观赏品种之一,而龙睛性状在俯视金鱼中占据重要的地位,将龙睛性状从传统的观赏鱼中转移到泰狮中是很多金鱼养殖场和金鱼爱好者的梦想。但经过多年的尝试,传统通过不断杂交和提纯的方法均不能实现单一目的性状的转移,龙睛性状通过杂交转移到百褶裙泰狮金鱼后会伴随着百褶裙泰狮金鱼其他重要观赏性状的改变,如尾巴夹角变小、无褶子,且体型不再粗壮,从而丢失了百褶裙泰狮原有的观赏价值。
如何稳定获得具有龙睛性状且其他观赏价值仍得以保留的百褶裙泰狮金鱼,是尚未解决的问题。
发明内容
为了解决现有技术中存在的问题,本发明提供了一种龙睛百褶裙泰狮金鱼的创制方法。
本发明采用了以下技术方案:
一种龙睛百褶裙泰狮金鱼的创制方法,包括以下步骤:
S1.靶点设计:对百褶裙泰狮金鱼进行基因测序,获取调控龙睛性状的基因的DNA序列,并在其外显子上设计适合的敲除靶点,将该靶点序列体外转录成 gRNA;
所述调控龙睛性状的基因为lrp2a即低密度脂蛋白2基因,该基因的DNA 序列和氨基酸序列参见说明书序列表部分。
所述靶点有五个,五个靶点序列分别为:
GGGTCTTTCCGCTGTGGGAC、GGAGCTGAGGTGTGCTAACG、 GGATGACTGCGAGGACAATG、TCTTCTGGTGTTTGCATCCC和 GGGACGTCCCACTGTCCGGA;
S2.显微注射:将百褶裙泰狮金鱼的鱼卵平铺在无菌培养皿中,添加适量的卵巢液,取CRISPR/Cas9蛋白和gRNA的mRNA各2μL并混合均匀,用显微注射仪向鱼卵动物极注射混合后的mRNA,4μL混合后的mRNA共注射 200-3000粒鱼卵;
S3.受精孵化:取雄性百褶裙泰狮金鱼精子加入注射操作完成后的培养皿中,等待鱼卵受精后将受精卵转移至孵化巢孵化,所得百褶裙泰狮金鱼即为具有龙睛性状的百褶裙泰狮金鱼。
优选的,该方法还包括雌核生殖步骤,所述雌核生殖方法为:以S3中具有龙睛表型的百褶裙泰狮金鱼为亲本即F0代,取鲤鱼精子灭活,将灭活精液与F0 代雌性百褶裙泰狮金鱼卵子混合受精后进行休克处理,经处理后的受精卵孵化即为具有龙睛性状的F1代纯合子百褶裙泰狮金鱼。
优选的,该方法还包括高温扩群步骤,所述高温扩群方法为,在F1代纯合子出膜后早期,对部分雌鱼用高温27-28℃饲养,使雌鱼性逆转为雄鱼,性逆转雌鱼和正常饲养的F1代雌鱼进行繁殖,批量得到纯合子子代百褶裙泰狮金鱼。
优选的,所述步骤S1中,根据所得基因的DNA序列设计获取调控龙睛性状的基因的外显子和内含子,并用GG[G/A]??????????????????GG或者 CC??????????????????[T/C]CC通配符在百褶裙泰狮金鱼lrp2a基因的外显子序列上设计敲除靶点。
优选的,所述步骤S1中,获得靶点序列后,在体外转录前,先利用该靶点序列构建基因组本地数据库,使用blast软件检索并确定靶点的唯一性。
优选的,步骤S3中,受精卵孵化前,还包括检测敲除效率操作,随机挑选 10颗出膜前的受精卵,提取受精卵内DNA作为模板,同时扩增敲除靶点附近的 DNA序列测序并与正常序列进行对比,根据对比结果计算敲除效率,若敲除效率达到20%及以上则留养该批受精卵继续孵化,否则终止孵化并重新敲除,重新敲除需提高CRISPR/Cas9和gRNA的mRNA注射浓度,或者将所述靶点序列重新体外转录后再注射。
优选的,所述提取受精卵内DNA和扩增具体为:将挑选的受精卵放入到 PCR管中,加入70-100μL的50mM的NaOH溶液,于PCR仪中95℃反应20min 后作为DNA模板;在敲除靶点附近设计PCR扩增引物,PCR扩增、转化、克隆后挑选10个目的克隆测序并与正常序列进行对比以检验该敲除位点是否敲除,所述靶点附近为靶点上下游200-300bp区域。
优选的,所述雌核生殖所用的金鱼卵子均取自同一条F0代雌性百褶裙泰狮金鱼。
优选的,步骤S2中,受精卵孵化出鱼苗后,孵化1个月内,鱼苗的饲养水温不超过24℃。
优选的,所述休克处理为热休克,将受精卵在24℃恒温发育43-47min后,再置于41℃水中进行热休克2min处理。
优选的,所述休克处理为冷休克,将受精卵在24℃恒温发育3-7min后,再置于0℃冰水中进行冷休克2min处理。
本发明的有益效果在于:
1.本发明首次在金鱼上建立CRISPR/Cas9基因编辑方法,注射鱼卵后再授精,让RNA提早翻译成蛋白产生作用,提高了基因编辑效率。对弱粘性的鱼卵进行注射,还有效避免了鱼卵受精后产生粘性堵塞注射针,使得显微注射成功开展。此外,通过提高CRISPR/Cas9蛋白(400-600ng/μL)和gRNA(150-300ng/μL) 的mRNA的注射浓度,能有效加大敲除效率。该技术实现F0代就获得了大量具有龙睛性状的嵌合体金鱼。
2.本发明前期工作定位到龙睛性状相关的基因为低密度脂蛋白2基因lrp2a 并通过全基因组测序获得了该基因的核苷酸和氨基酸序列。配合金鱼高效繁殖技术为金鱼的基因敲除提供了高质量的鱼卵,为雌核生殖条件的摸索提供了必要的实验材料;本发明通过使用同一条鱼的多次产卵进行实验可以缩小背景差异,有利于摸索出雌核生殖的最佳的条件。
3.本发明同时通过人工雌核生殖的方法,在F1代得到纯合子鱼;通过高温的刺激的方法让雌核生殖的鱼性逆转成雄鱼,再和正常的F1雌鱼受精,F2代批量得到具有龙睛性状的百褶裙泰狮。一系列在金鱼上具有挑战性的方法的建立和结合让分子设计育种技术首次应用在金鱼上并成功实现了单一目的性状的转移,将龙睛性状成功的转移到百褶裙泰狮金鱼上而不改变其它观赏性状,从而成功创制了观赏价值极高的龙睛百褶裙泰狮金鱼新品种。该方法也为分子设计育种高效培育金鱼新品种提供了技术支撑。
附图说明
图1为正常眼睛的百褶裙泰狮金鱼;
图2为本发明创制的具有龙睛性状的百褶裙泰狮金鱼,可以看出,龙睛百褶裙泰狮金鱼的眼睛突出,而在头瘤、背鳍、尾巴和体型等方面的观赏性状与正常眼睛的百褶裙泰狮金鱼一致,并未丢失。
具体实施方式
为了便于理解,下面结合实施例对本发明的技术方案做出更为具体的说明:
实施例1:百褶裙泰狮金鱼的高效繁育
S1.苗种培育:在周转箱中饲养刚开口的鱼苗,水温23-25℃,每天投喂3 次活丰年虫,周转箱长60-100cm,宽40-60cm,高20-30cm,水位15-20cm为宜。在周转箱一侧安装海绵过滤器过滤,旁边放置一个增氧头充氧,气量适宜。每周清洗一次海绵过滤器。
鱼苗体长超过2cm后,俯视挑选四尾鱼苗,再侧视去除鳍条畸形卷折的鱼苗,将挑选出来的优质鱼苗放进新容器内饲养,密度约为每升水2条鱼;容器内配备7瓦小型水泵,滴流加水,每天进水量约为总水体的1/5,三层周转箱滴流盒过滤。每天人工投喂1次丰年虫,自动喂食器投喂5次蛋白含量48%的饲料,每次投喂量以10min吃完为宜。周转箱滴流盒的第一层放置过滤垫材比如魔毯、过滤棉,第二层和第三层放置细菌球,放苗当天添加液体消化活菌帮助鱼苗快速建立消化系统,添加量为每100L水添加100mL液体消化活菌。魔毯为一侧带长绒毛的过滤毯,可以反复清洁。
鱼苗体长超过4cm时,侧视挑选体长较长、体格粗壮、有背峰、尾巴上翘、尾尖圆润的鱼苗放入鱼缸,鱼缸长80cm以上,宽50cm以上,水深30-40cm为宜,密度为每升水1条鱼。继续沿用周转箱滴流盒过滤,调整水泵每小时流量为总水体的3~5倍,每日自动喂食器投喂5次蛋白含量46%的饲料,每次投喂量以5min吃完为宜,同时中午和傍晚人工投喂摇蚊幼虫,投喂量为15min吃完。
鱼苗体长超过8cm时,挑选体格健壮匀称,尾巴圆润上翘且有褶子的鱼放入鱼缸,鱼缸长80cm以上,宽50cm~60cm,水深40cm~50cm,每天自动喂食器投喂5次蛋白含量46%的饲料,每次10min吃完,同时中午投喂丰年虾补钙,傍晚投喂摇蚊幼虫增体,投喂量为15min吃完。用带透镜的三基色灯珠灯光照射鱼缸,三基色灯珠功率80w,开灯时间7:30~24:00,通过灯光照射,让绿藻附着在鱼缸底部并铺满成苔状。
S2.亲本培育:鱼体长12cm以上时,挑选体格健壮匀称,尾巴圆润上翘且有褶子的鱼作为亲本,每天投喂5次蛋白含量44%的饲料,每次5min吃完,同时中午投喂丰年虾补钙,投喂量为15min吃完。
选用过滤强大的底滤系统过滤水体,且夏季水温24-26℃时每周换水2-3次,每次换水1/3,冬季水温21-23℃时每周换水1-2次,每次换水1/4-1/3;加水时宜加去氯自来水,也可直接加自来水,但进水水流要慢,控制在半小时加满,冬季由于温差大,适当延长。换水的同时更换和清洗过滤系统内的过滤棉,优选晴天中午更换,温度高能够有效去除自来水中的氯和提高水温,减少温差对鱼的损伤。过滤棉用高压水枪清洗后阳光下晒干杀菌。
用曝藻灯或萌虫灯加装3透镜,每天定时开启12小时曝藻,使得绿藻逐渐长满底层并形成绿苔后,每两天用刷子清理绿苔中的鱼便等有机残渣,绿藻能够作为植物饵料给种鱼提供丰富的维生素和膳食纤维等营养,且能够吸收水体中的氮磷等物质。
S3.人工授精:在观察到雄鱼追尾雌鱼时,捞出雌鱼,人工挤卵,使用10mL 离心管在泄殖孔处收集鱼卵,将精液添加进离心管中,盖上管盖,上下轻微颠倒完成精子与鱼卵的充分混匀,将孵化床放入孵化箱中,孵化箱水深10-30cm,水温控制在22~24℃,将充分混匀的鱼卵倒入孵化床均匀铺开,充氧孵化,最好使用流水孵化。
使用10mL离心管收集鱼卵,有利于将所有鱼卵都收集到,不浪费,且不会粘到金鱼鳍条,避免鱼卵被鳍条带回鱼缸污染水质,鱼卵在离心管中可以在室温保存半天,方便实验使用。需要授精时,将精液直接挤到离心管中,或用移液枪枪头吸取精液加入到离心管中,盖上管盖,上下轻微颠倒混匀就完成了精子和鱼卵的充分混匀。此时将混匀精液的鱼卵倒入盆中,均匀铺开孵化,授精率高达99%。
S4.产后护理:金鱼产卵后3小时内,冬天将鱼缸换水1/4,夏天换水1/3,清洗并更换晒干后的洁净过滤棉,防止水体中精液和卵巢液过多滋生细菌败坏水质,避免鱼的产后死亡。
在养殖过程中,常备盐、小苏打、亚甲基蓝、白点净和加热棒等用来预防和治疗鱼病。如果发现鱼状态不好,换水1/4后,按照3g/L水标准添加盐杀菌,并适量加小苏打调整pH值到8,并让水温维持在25-27℃,有利于让鱼快速恢复。
实施例2:雌核生殖条件探究
建立两种雌核生殖方法,比较两种方法产生正常鱼的比例,选取纯合子获得率高的作为泰狮金鱼最佳雌核生殖条件。
1.减数分裂雌核生殖
取鲤鱼精子灭活,将灭活精子与百褶裙泰狮金鱼卵子混合受精后进行冷休克处理,冷休克处理方法为,将受精卵在24℃恒温发育3-7min后,再置于0℃冰水中进行冷休克2min以抑制第二极体的排出,从而两个配子合二为一形成四倍体。经冷休克处理后的受精卵孵化即为具有龙睛性状的减数分裂泰狮金鱼。
2.有丝分裂雌核生殖
取鲤鱼精子灭活,将灭活精子与百褶裙泰狮金鱼卵子混合受精后进行热休克处理,热休克处理方法为,将受精卵在24℃恒温发育43-47min后,再置于41℃水中进行热休克处理2min以破坏纺锤丝的形成,合二为一。经热休克处理后的受精卵孵化即为具有龙睛性状的有丝分裂泰狮金鱼。
上述使用的百褶裙泰狮金鱼卵子优选为产自同一条百褶裙泰狮金鱼的卵子,以缩小背景差异。
统计减数分裂和有丝分裂产生的泰狮金鱼的受精率、出膜率和正常率,结果如表1所示,由表1可以看出,在受精47min后进行热休克得到的正常鱼数量高于其他组合,达到3.34%,在受精后6min进行冷休克,得到的正常鱼数量高于其他组合,达45.52%,因此,优选减数分裂雌核生殖作为泰狮金鱼的最佳雌核生殖条件。
表1有丝分裂与减数分裂条件下F1代纯合子情况统计
实施例3:龙睛百褶裙泰狮金鱼的创制
S1.靶点设计:对百褶裙泰狮金鱼进行基因测序,获取调控龙睛性状的基因的DNA序列和氨基酸序列,并利用通配符GG[G/A]??????????????????GG或者 CC??????????????????[T/C]CC在所述调控龙睛性状的基因的DNA序列的外显子上设计适合的敲除靶点,获得靶点后构建基因组本地数据库,用blast软件检索并验证靶点的唯一性,确定唯一性后将该靶点序列体外转录成gRNA;通配符中“?”代表一个任意碱基,“[]”代表与[]内任意一个碱基匹配,“/”代表“或”。
所述调控龙睛性状的基因为lrp2a即低密度脂蛋白2基因,所述靶点有五个,五个靶点序列分别为:
GGGTCTTTCCGCTGTGGGAC、GGAGCTGAGGTGTGCTAACG、 GGATGACTGCGAGGACAATG、TCTTCTGGTGTTTGCATCCC和 GGGACGTCCCACTGTCCGGA;
S2.显微注射:将百褶裙泰狮金鱼的鱼卵平铺在无菌培养皿中,添加适量的卵巢液防止鱼卵干涩,取CRISPR/Cas9蛋白和gRNA的mRNA各2μL并混合均匀,用显微注射仪向鱼卵动物极注射混合后的mRNA,共注射200-3000粒鱼卵;
S3.受精孵化:取雄性百褶裙泰狮金鱼精子加入注射操作完成后的培养皿中,等待鱼卵受精后将受精卵转移至孵化巢孵化,取即将出膜前的受精卵10颗放入 PCR管中,加入70-100μL的50mM的NaOH溶液,于PCR仪中95℃反应20min 后作为DNA模板。在敲除靶点附近设计PCR扩增引物,PCR扩增、转化、克隆后挑选10个目的克隆测序并与正常序列进行对比以检验该敲除位点是否被敲除,根据对比结果计算敲除效率。
所述PCR扩增、转化、克隆后挑选10个目的克隆测序的具体方法为:使用 10个DNA模板进行PCR扩增靶点附近的DNA序列,并将PCR产物回收后连接到载体并转化到感受态细胞中,在Amp培养基中37℃过夜培养后挑选10个阳性克隆测序,以检验该敲除位点是否被敲除。
本步骤中“靶点附近”是指靶点上下游200-300bp区域。
若靶点被敲除的受精卵率达到20%及以上,则留养该批受精卵继续孵化,所得百褶裙泰狮金鱼即为具有龙睛性状的百褶裙泰狮金鱼。否则终止孵化,重新敲除,重新敲除时提高CRISPR/Cas9和gRNA的mRNA注射浓度,或者将所述靶点序列重新体外转录后再重复步骤S2及S3。
S4.鱼苗饲养及亲本挑选:受精卵孵化出鱼苗后,将鱼苗饲养大,在早期1 个月内,饲养水温不超过24℃,饲养约2-3月时,眼睛开始向外凸出,此时所得百褶裙泰狮金鱼即为具有敲除目的基因后出现性状的嵌合子百褶裙泰狮金鱼;挑选出品相好的且眼睛凸出的百褶裙泰狮作为亲本记为F0代。
S5.雌核生殖:取鲤鱼精子灭活,将灭活精子与F0代雌性百褶裙泰狮金鱼卵子混合受精后进行冷休克处理,冷休克处理方法为,将受精卵在24℃恒温发育3-7min后,再置于0℃冰水中2min,经处理后的受精卵孵化即为具有龙睛性状的F1代纯合子。
S6.高温扩群:在F1代出膜后早期,对一半的雌鱼用高温27-28℃饲养,使雌鱼性逆转为雄鱼,性逆转雌鱼和正常饲养的F1代雌鱼进行繁殖,批量得到纯合子子代百褶裙泰狮金鱼。
实施例4:龙睛百褶裙泰狮金鱼的创制
本实施例4的创制方法与实施例3相同,不同的是,步骤S5中,对受精卵进行热休克处理,S5的具体操作为:
S5.雌核生殖:取鲤鱼精子灭活,将灭活精子与F0代雌性百褶裙泰狮金鱼卵子混合受精后进行热休克处理,热休克处理方法为,将受精卵在24℃恒温发育43-47min后,再置于41℃冰水中2min,经处理后的受精卵孵化即为具有龙睛性状的F1代纯合子。
以上实施方式仅用以说明本发明的技术方案,而并非对本发明的限制;尽管参照前述实施方式对本发明进行了详细的说明,本领域的普通技术人员应当理解:凡在本发明创造的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明创造的保护范围之内。
序列表
<120> 一种龙睛百褶裙泰狮金鱼的创制方法
<141> 2021-01-22
<160> 2
<170> SIPOSequenceListing 1.0
<210> 1
<211> 90429
<212> DNA
<213> Carassius auratus
<400> 1
tcagttccct gtgtctgccg acttcaccag gctgctcaga ctgagagaac cagacaaaca 60
gcagcagcac atgagaatca taccaaatca aactcaatat atggaatttt cacaaagaaa 120
agttcaattt aatgaattgt ttcacataaa tatagcgacc ggatcattaa agaacttgaa 180
aaacacatta ttgtaagaca tatgatttgt taagataaaa agttgttccg gttcagtcag 240
ttaataatat catgttctta atattaagct gtttctgtga gtcttcctca tcccagtaag 300
ataatgcgac cgctcggatg catcaatcaa tcccagtggc ctgtaactac ttcaatctgg 360
gagattgtat acgcaataaa ggcccactct caaaatatta taactttcat ttctactttt 420
taaatcatcg aaacatttag actttattct cataatattt caactttatt ctcaaagtat 480
tgcgacttta ttctcatagt attatgattt tattctgata atatttattt tcgaagtatt 540
ttgactttat tctcgaaaca tttcgacttt attctataat atctgtgtac tcatatattc 600
attgatgcac taaaagcaca ataagactag ctgcttaaac tgaagtcata caatatgaat 660
tcatatagct gaactgtagg cagtttttgt gttgtacctg ggtagtttag gcattgacgg 720
gatgagagag ccggtgcgct tgtagttgag gaaaactccc acaactagag ctccagtgat 780
cagaataatt accactgcta acaggacggt cacagctaca cacacacaca cacacacaca 840
cagtatgtaa gacagataag tgcagtcgca ttagtattca tttgaaatta acttgcgtaa 900
caagacggat agagtaatta gagttgataa ataacagcac cacatgttac aaaccgtgag 960
taaatgtatt tacgatgaaa gagttgttta actaatgaac aaggaagggt ttgtttacct 1020
gttcctgcag gagcagctct agaccgcccc atttcacaga aactacccga gtaaccgtat 1080
ggacacctga ggagcataaa cactgaaccg ttacagacaa tctacagtac ggctttgaca 1140
acaattattc tttctacaaa ctgatactca ctgcatattt ttagattttc atgatgaatt 1200
tcatttgtgt tcacgcagtc atactgtatt tgtgactatt gcatttattg catttatatg 1260
ttgcagaagt tcaggagtca tatggaaatg tttttttaat tgcaatattt atcgaattat 1320
aaaaaaattt aattatgtat tttcatgcta aatatcagtc aattatcact aaataaagag 1380
gttttgcgtc tgacagatta tatttataat tgaaattgat ctatctatct atctatctat 1440
ctatctatct atctatctat ctatctatct atctatctat ccacacacag agtcaggtcc 1500
atatatattt ggacactgac acaattgtca cagttttggc tctgtacgcc accagaatag 1560
aattaaaatg aatcagtcaa gatgaaattg aagtgcagac ttgaagcttt aattctagcg 1620
gttgaacaag aatattgtca ccttaaggat ctttagtttg ctttctctcc tcgtgtttct 1680
gatctagttt ttgtcttccc ttaattatgt ctattggttc aggtgtgttt tattaatcat 1740
cctcatttgc ctcagtagtt aagttcaagt ctgttcagtt cgtctttgtc cggtcttgtt 1800
gatgtgtttc aagccttcct tgtgtggatt accttcatgt ggatttatta aaaactgttt 1860
aatgtaatgc aatgacttct acatttaatg tataagtgag tcaactgtct atttacccag 1920
acagcaacat agtgttggcc cagatccggc ccacatctgg cccacatgaa atccatgtgg 1980
gccagatgtg ggtcagacct gggccgaatc tgtttgctgt ctgggtactt atattccctt 2040
gaaaaaggaa ggcgatacat atgaaagagc tgtaattcct tgtatgtagt gcgttgtata 2100
tacagtctat aatttgcagc aagttttctt agaacacata taacaaaata tgattgaaaa 2160
ttcctcatga tacgagccct ttcaataact cttatcagcg gggcggctgt gatgctctac 2220
actagcgttc acactcactt gcacttcggc aggcctccct catcagtgta gcatgttcct 2280
ccattcatgc atctgcatgc aagtggcatg gacacctctg cctcaatggc tgcaagtgcg 2340
aggcaatgca acacagtcac aacgcaatca caacacaagt caaacactca ctttcatcac 2400
aatatcagag ctctgaagcc aaaacatcac atgacaataa tgaaagcatc atgcaggagt 2460
agatgcatgc atgtaatgaa atgtattgat gggatgccat ggagccgatc agctcatctc 2520
aaactaacag cagtaggtgt gtgttcatct cttaccggca tcacattcat tcttgttgaa 2580
ggtgagcagt gtggagccct gtggacaggc acaggtgtat ccacccggcc gcagcaaaca 2640
caggtgactg cagacaccct gacaggggtt cagcactgca agagaatacg aacagtcgtt 2700
tactattatt attacaggaa ttatcttcag ctaaaagaaa cgtggtgaaa aagaactgag 2760
gaactgtgtt tttacacagt tgctcttggg tataaataca aatgcaaaaa cactatatat 2820
atgttaatta gcacgtaact gaatgttcac aggcatttgc atccttttgt gcagatacaa 2880
gatgttatgt atctaagtta tctaatttaa atcataggat gtgtcataga acgtgcatta 2940
gtttatgggc ataaagtcac tcagctttaa cctcaggcag ctattccttt ttagatgctt 3000
cttgtttaat aacattgctg cgtcataatc ataatctaac agtgttcttc taatcatgtt 3060
tccatattca ggttttcatg tcttttttat ttaattttta acattaattt acataacagc 3120
ccagatattt tcattatctc catgacggct aactaaggaa aacacttcat taatatgacg 3180
agctgcacct caaaatacat gttagatgga aacattgtgt attatgatta attcgatgta 3240
ttttaatagt ttgacaaatg caataggtaa taatcatttg catgaatgtt caatttatgt 3300
tgtaataaat atctgagaca ttattgttgt atctaaatgt attagtcttt atagagtgaa 3360
agtaaaaaga gcaaatggga gaagcagagt gtcacaccct cagctcgttc cctcagcccc 3420
agtcaccatt gccagtcacc atccactcaa tctcccatgc accgctcacg tgaaccgtca 3480
cctgaatact gattacctgc acctgcacca gcaattacca cacctttaaa tactcacctc 3540
actcattcac tcaccggctg gaatcgagat tgcatggacg ctcttatgga tcttctacct 3600
gcttcttgtg gatcatatag tgactctgtg gagattcaga cacagcgatt aagggaagtg 3660
actctttgtt gtctggccta gccttgtgct tgaactcacc tatgtgatca gtgcttgaag 3720
attcttcgtc tgtgaccaca cttacttgct ctgttaaaag tcctatgtta ataaagcttc 3780
aagtaaacac ttacccatct cctcctctcc ttgtgtggat gtgacaggat tccagcccct 3840
aaaacagaac ttcatatctc ccatggatgt taacgctgct gctcagggag cggcttcgga 3900
cccgttcacc gaaatggtga ctgccctccg ccaagtatta cagtcagttt caccacccac 3960
cgaaactagt gcttccacca ccaacagcat gccccttgca cgtcccgcgt gctatgcggg 4020
tgaaccgagt ggctgcagtg ggtttattct gcagtgctca ctgttcgtca acgcgaatac 4080
cagtaaattc cccaatgagg ccaccaaagt cgcctttgtt atctctctcc tcaccggacg 4140
agcgctacaa tgggcggagg ctctttggaa ctcccagagt ccggttctat cctcattcga 4200
tgccttcgcc acccacctcc aggaagtgtt cggggtggct ctaactcctc tttcaaccca 4260
tgatgaactc ttaaatctcc gccagggagc cacggaaatt cacgactaca ctctccgttt 4320
ccggacatta gccgccacca gtggttggaa ccaaactgcc ctgctagctg cataccgtaa 4380
aggtttaaag ccccagatca gaaagcaaat ggtcatttac gacgataatg tcgatctcga 4440
gacattcatc agaaaaacta taaatgttgc tcagcacctc tcggcttgtg cctccccggc 4500
ttcatatacc atctctccat caaccagaac gccctcgcct caacgaaacc aggaggaacc 4560
catgatcacc gactcctacc gcctagatgc ctccgaacgg agacgccgca tccagcagcg 4620
gctgtgttta tactgtggcg aggccactca cctgatccac gcctgcccgg tccgtccgcc 4680
tcgcccaatg gtgagtacag tttcctttac cccatctgta tctcacatac ctcatatcaa 4740
agcccagata acaattaact cacgtaatct tcccatccat gtcctggtgg attccggagc 4800
cgcaagcaac ttcatctcat ctcacttcgt aaccaagcac cgtataccca tcacccagaa 4860
tgagaccacc taccgtatca ctaccatcca aggatcaccg ttagggggag gcaaaataac 4920
tagaaggacc catgagctgg aactcgttct gccccatgga caccgggaac aactggccct 4980
cctcgtactt cctcgcgcta ccgttgacgt cgtcctgggt aggccgtggc tggcccaaca 5040
taacccacga atcaactgga gcacagggga gatccaggcg tgggatccgg gatgccagag 5100
tcacattcac cgttcaccag tccagagttc acagtccgag ccaccgcacc ttcaattgca 5160
ctccacctcc ataggaagac ctgcccttta ctcctcctac tccgatgtat tcagcaagga 5220
acaagccacc cgattaccac cacaccggcc ctgggactgt tgtatcgacc tgctgccagg 5280
tgccaagcta ccacatggga aaatatatcc actctcccgt cctgagcagg tggcgatgga 5340
ggaatacatc caggaggctc tcgatcaggg gttcatcaga ccatccacat ctccagctgc 5400
atccagtttc ttcttcgtcc ccaaaaagga tggcggactt cgaccatgca tcgactatcg 5460
agtgctaaat gacgccaccg tcaagttcgc ctacccatta ccactcgttc cggcagctct 5520
ggaggaactg cgtgaagcca aggtgttcac caaactcgac ctccgcagtg cctacaatct 5580
gatccgtatc cgagagggag acgagtggaa gactgctttc atcacccctg ccgggcacta 5640
cgagtatcag gttatgccct atgggcttgc taacagtcca tccatcttcc agagtttcat 5700
gaacgaaata ttccgtgatt atctccacca attcgtcatt gtctacatag acgatatcct 5760
gatctactcc cggaacatag aagaacacca aacccatgtc cgccacgtct tacaacgact 5820
ccgagaccac cacctctacc taaaagctga gaagtgtgag tttcactgca ctaccgtctc 5880
cttcctcgga tacgtgatct ctgcggaggg agttcagatg gagtcagcca aagtagatgc 5940
tgtgaccaac tgggcggaac ccacaacggt gaaagaactc cagcgattcc ttggctttgc 6000
caatttctat cggcgcttta tcaagaacta cagcctgcac tctgctcctc taacatccct 6060
cctaaaagga ggtcagcgcc ggctgcgctg gacaccccaa gctcgagaag ccttcagcca 6120
tcttaaacac ctcttcacct cagcgcccat tcttcgacat cctgacccgt ccaagccttt 6180
cgtagtagag gtggatgcgg ccaatcacgg cattggagcc gtgctatccc aaaggtctgg 6240
tgaaccttgc tcactccatc cgtgtgcgta cttctctaag aaactctctc ctgccgaatg 6300
caattatgga atcggagacc gtgagttgtt ggccattaaa ctcgcccttg aggagtggcg 6360
gcactggcta gagggagctc agttcccatt cactgttgtc accgaccaca aaaatctcca 6420
atatctgcag aatgccaaaa gactcaatgc tcgtcaggct cggtggtcac tcttctttgc 6480
tcgctttaat ttccagatca cgtatcaacc cggtcacaag aacactaaag cagatgccct 6540
gtcccgtatg tactcaccag atccagacac caacaagcct gattcaattc taccaccctc 6600
cgttttcctc gcgcctatcc tctggcagat cgacgaggaa attcgagccg ccaccctcga 6660
ggagcctgca cctccggagg tacccacggg tctaatgtac gtgcccacca accagcgact 6720
ccccctactg gaaagtacgc acacgtcacc tggctcagga caccctggca gcaggcgaac 6780
cctctcgctc atccagcaga agtactggtg gcctaacatg gttcgtgacg ttacccggta 6840
catcctcgga tgctcggtct gcgccgttac caccacacct cgtcaacttc ccgtgggtaa 6900
attacaacca ctgcccattc ctcgccgacc ctggacccat ctaggggtgg atttcgccac 6960
tgaccttccc ccctcaaggg ggtacactac cattctagtt gttgttgatc gtttttctaa 7020
attctgcaaa ctaatcccat taaagggtct tcccacagcg ttcgagaccg cagaagccct 7080
cttcaccaac gtgttccgga attatggcac tccagaggac attgtctcgg acagaggacc 7140
tcagtttatc tccagggtct ggcgagcgtt cttccaactc ctcggaacat ccgtcagttt 7200
atcttctgga tatcatcctc agaccaacgg ccagaccgag cggaagattc aagagatctc 7260
acggtaccct ccgtacttac tgctcccaac accaggatac ctggagccag tatctgccgt 7320
gggctgagta tgcccaaaac tcccttcgcc aaaccactac aggcttaacc ccttttcaat 7380
gtgttttagg ctatcaacca ccgctgtttc cctggtcagg agaagtctct gaagtgcccg 7440
cggtagatca ctggttccag gagagcgaga gggtgtggga ctcagctcac gtccatctcc 7500
aacgggcagt tcggaggcat acagagaccg ctaaccgacg cagatcgcct aatcccgtct 7560
atctacctgg agacaaggtt tggctctcca ctcgggatat ccgcctccga ctgccctgca 7620
aaaagctgag tccccgctac atagggccgt tcaccatcca ttctcagata aatcccgtca 7680
cctaccgcct ggacctacca ccacactacc ggatctcacc cacgtttcac gtctcactgc 7740
taaaacgtca cactgatcca gtttctcctt cctccacaga accagaaccg cctccccctc 7800
cagaccaacc cgagatcctc ggagacaaca tctaccaggt ccaagagatc ctcgactccc 7860
ggcggcggga cggccacctg gaatacctca tagattggga ggggttcggt cccgaagaga 7920
ggtcgtgggt accacgcaat gacatcctgg atccggctct cctcgaagat ttccaccgac 7980
aacacccaga ccgcccggct cccagaggtc gtggtcgtcc ccgtcgtcgc tctaggatgc 8040
ctggagtcac ccgtgggggg gggggggtag tgtcacaccc tcagctcgtt ccctcagccc 8100
cagtcaccat tgccagtcac catccactca atctcccatg caccgctcac gtgaaccgtc 8160
acctgaatac tgattacctg cacctgcacc agcaattacc acacctttaa atactcacct 8220
cactcattca ctcaccggct ggaatcgaga ttgcatggac gctcttatgg atcttctacc 8280
tgcttcttgt ggatcatata gtgactctgt ggagattcag acacagcgat taagggaagt 8340
gactctttgt tgtctggcct agccttgtgc ttgaactcac ctatgtgatc agtgcttgaa 8400
gattcttcgt ctgtgcccac acttacttgc tctgttaaaa gtcctatgtt aataaagctt 8460
cacgtaaaca cttacccatc tcctcctctc cttgtgtgga tgtgacacag atgaacaaaa 8520
cagccaaaca atataaagtt aaagaggggt tcaactgagc tatgcacgta tgttttaaac 8580
actacagatc attacatgtt aatttctcag gtaatattaa ttatttgacc tatttcagat 8640
gtggcagtgg tgtcataaac atatcccgtg tctcacccga gtggttgtgt cggtgctgct 8700
ggtagatgcg tacttgagtc agccaggggt ttatggtcaa caccttcacc ttgtctcctt 8760
tcccgaactt atcttttttc cacacttctc ccttctcttt agtcgtccag tacacgtgtc 8820
cctcgaacac atccagactg taggggtgac cgatgtctgc aatgacaaca aacgtgagat 8880
ccacgatcgg ccagtcaaaa gctgttttca cagatgtaat gctctgccgt ctcacctcct 8940
gatatgatca tttgacggtc tgttccgtcc gctttcatcg actcaatgat gttttctttg 9000
gagtcgcacc agtagatgcg gttttcattc aggtagtcta aagctaaacc agtgggccag 9060
cccagatctt catctaacag cacctctctg tgctgcccat ccatccacgc cgtctcaatc 9120
ttgggcttcc tgccccagtc ggtccagaac atctgactat aacatacaca cacaaaataa 9180
acatcattaa tgacaaaaac tgcagttttg aagatttggg catccaacat tttatgatca 9240
atgaattttc acacatatat tacaattggt gtgtgattat tagacaatat ttgatttagt 9300
gcttaagata taaaaagtaa aaaaatatat atatatatta gatgtattta ttttattacc 9360
aggacatttc cacgcataaa agttacaatt ttaaaacgtt catgaccatg cttcccccag 9420
gaatgtccat taattatgtt taaatcccaa ataaaaaaat taaatataca catcatataa 9480
tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 9540
tatatatata tatatatata tatatatata gtgctgtcaa acgattaatc atgattaatc 9600
gcatttaaaa taaaagtttt tgtttacata atatatgcgt gtattgtgta tatgtattat 9660
gtaataaaat aaataaatga ataaaaaaaa taaaaataaa taatatatat atctcaatat 9720
tttattattt ctttagtcac aaatctctac acaatatgag cgatatctac ccaaaacact 9780
atttcataag atgtatttat ttaatttcca tgactttccc agacctaaaa atcacaaatg 9840
taaaaatgtc attactttat agtaaatata tagtttcgtt catattctca tacatattca 9900
tattcttacc ctagaccagg attgaccaca atagcggctg gctgatcaag atcactgtga 9960
atcaaccact ttctgtaacg tccatccagt tttgacactt cgattctatt agtcccagca 10020
tctgtccagt aaatgtggct gatggaaaag acaaaacgca tagcagatgg gatgtttatt 10080
tctcttgaac tgtcccagtt agagaatctg catcagagct caagtaaagt atagagaagc 10140
acgacacaca ccctccgatc cagtccacgg cgatgccatc aggactggag atgtatctga 10200
ggttcaggtc cacctctttc acaggattat ttccatggtc atcaaacgtg gtcatgtagg 10260
ccctcttaat cgcaccgaac tctgagcctc gtcccaacac tgtccagtac actatacctg 10320
acaaaacatt tacaaatatg tttcagtcag tgtgcattca taaccggatt acagaaacac 10380
aacaatacag ggtattggat ataggatttc aaaacattct tcaaaatatt caaaagaaat 10440
cctgtaatag tatataaaga atataaccct ggctttgtct tattgagctt taaattgtgt 10500
ttccctgtgc tgcatctgca tcaaaacatt aagcatatgt catattagat gggatttccc 10560
atggatattg tcttactcag accgaggccc tctgggtccc acaggtagtc cagtgcctga 10620
atgtgctccg cattatccac ataatcagag tactgcgctg acgagaggtt aaagcggcgg 10680
atccgcacgt tatcaggcaa taacaagacg ggaggatttc ctgtggggaa aacaggagtt 10740
gtcactgaaa ggtgaggagt gtgcatctat gacttctaaa ggagcgctca cgtttagcaa 10800
tgtttgtcaa tttcaggcta tagaaggtaa aaaagagaat aaattatttc atattgaaaa 10860
actttaccct cagcagcgca ttcgactccc ggctgctctc ccactgatcg aaacccgtcg 10920
gcgcagaaac actcataact gcccttagtg ttcctgcaca gctgtggaca cgtcccgtaa 10980
acctcacact cgttcacatc tgcagacggc cagaaaacag ctcacattat acatgcagga 11040
agcatttcaa cttctgaggg tgtgccaatg ttgtctctaa gccgctttca tgcagagaga 11100
aaaaagtcca ggccaaggac agaatgaaaa actaactggt cgttttctgt agcactgctc 11160
aaatttgttt tgtgatccag aaactttcca acatcagcac tgtcaatgaa ctgaattgaa 11220
cctggattta ttatgattaa gtgaaacaaa aaagaataat aaaaaaaaaa catacaaaac 11280
atttttttac ttaaaataat tgttaaaaat aaaataaaac acaaagaaaa agtctttttc 11340
cagctagttg ccaaggtaga acttttcaga atttagtatt tagcttaact tgatgtccta 11400
aaataactga aataaattta atgaaacgca ctacaaatga aaaagaaaat gtaaaaatgc 11460
aaaaataaac cttattcaaa atatgaatac aaactataat attatctcag tggtactaat 11520
aaaacagtga actgacttga gataattaat catgctactg tatctgctga gctgcttata 11580
gatgaatcaa atttgtttta taactgatga acttttgcaa cactagctcc gttcacatgc 11640
aaccaaataa tccgctggta attggactga tggctcaatc agactgaaat ggcttcatgt 11700
aaacatctta atcgatccca ctgagctcct aaactcttga ttggattgag tacggaaact 11760
tgaaagtaca tgtgcaatga cagaaatggg ggaatgacgc aacagatgtt gttgatgttg 11820
aagttcctgt ctggctataa gcctgtcttc taataaagaa gtaaataatg aggttttcta 11880
atgaatacag tattcagacg acaccacaaa ccattgcagc tgttcttgtc ctccgggttg 11940
ggtttatatc caggtctgca ggaacaaatg aagcctccgg ctgacagatc tgtgcagtta 12000
tgctcacaca gattctcact gcatgtgcgt ccatggccgt agtctaacag agacagatga 12060
agaaatgtta caaagatgtg caggctaaca tatctagacc tttttttgga gagtttctgg 12120
agctttctcc tcttttgttt atgcaagaaa taaaataaag aatatgaggg tgagtgaatg 12180
agcaaaactc acagcagccg agttcatcgc tctggtcctc acagtcgtcg tagtcgtcgc 12240
atgcgtactg cagagggatg cactgaccgt tcccacattt atactcgtct tcagtgcagg 12300
gcccatgtgt gggagtcaca cctgaggagt cattcacaca aaacagtttc aatttaggga 12360
atgaaaaggg tttcataggt caatcgtgta tttgattatc aatctatcgg cggtcactca 12420
caatgctccg gcctctcgtc gcttccatca ccgcagtcat ctatcgagtt gcagagctcg 12480
tgactgtaga tgcagcgggt gttatcacag cggaaacgaa acggaggatc acacgagatg 12540
tccactgagg agacaagaaa catgacggtt gtgcattctg ttccacatca tccttattgt 12600
gttatataac tgatatactc tacaaatgat attagaaaaa gaaagtttag acagatagtc 12660
gggctgtgcg atataccgca ttttggtcag gatcacgact tctgcttctg cagtatttaa 12720
ctaagtgtta tttatcttga aaacatttaa ttttagtatg ccacttgcat tggtaacaag 12780
cctccacaag ctattatggc tgtggttgcc agatttcaat agattaaatc ccccgatctg 12840
agcgtttctc ttaaataaat acatttttca attagagttc aataaagata gatagataga 12900
tagatagata gatagataga tagatagata gatagataga tagatagata gatagataga 12960
tagatagata gatacagtat tgttcaaaat aatagcagta caatgtgact aaccagaata 13020
atcaaggttt ttagtatatt ttttattgct acgtggcaaa caagttacca gtaggttcag 13080
tagattgtca gaaaacaaac aagacccagc attcatgata tgcacgctct taaggctgtg 13140
caattgggca attagttgaa aggggtgtgt tcaaaaaaat agcagtgtct acctttgact 13200
gtacaaactc aaaactattt tgtacaaaca tttttttttt ctgggattta gcaatcctgt 13260
gaatcactaa actaatattt agttgtatga ccacagtttt ttaaaactgc ttgacatctg 13320
tgtggcatgg agtcaaccaa cttgtggcac ctctcagctg ttattccact ccatgattct 13380
ttaacaacat tccacaattc attcacattt cttggttttg cttcagaaac agcatttttg 13440
atatcacccc acaagttctc aattggatta aggtctggag attgggctgg ccactccata 13500
acattaattt tgttggtttg gaaccaagac tttgcccgtt tactagtgtg ttttgggtca 13560
ttgttttgtt gaaacaacca tttcaagggc atgtcctctt cagcataggg caacatgacc 13620
tcttcaagta ttttaacata tgcaaactga tccatgatcc ctggtatgcg ataaataggc 13680
ccaacaccat agtaggagaa acatgcccat atcatgatgc ttgcacctcc atgattcact 13740
gtcttcactg tgtactgtgg cttgaattca gagtttgggg gtcgtctcac aaactgcctg 13800
tggcccttgg acccaaaaag aacaatttta ctctcatcag tccacaaaat gttcctccat 13860
ttctctttag gccagttgat gtgttctttg gcaaattgta acctcttctg cacatgcctt 13920
ttttttaaca gagggacttt gcgggggatt cttgaaaata gattagcttc acacagacgt 13980
cttctaactg tcacagtact tacaggtaac tccagactgt ctttgatcat cctggaggtg 14040
atcattggct gagcctttgc cattctggtt attcttctac ccattttgat ggttgtcttc 14100
cgttttcttc cacgtctctc tggttttgct ctccatttta gggcattgga gatcatttta 14160
gctgaacagc ctatcatttt ttgcacctct ttataggttt tcccctctct aatcaacttt 14220
ttaatcaaag tacgctgttc ttctgaacaa tgtcttgaac gacccatttt cctcagcttt 14280
caaatgcatg ttcaacaagt gttggcttca tccttaaata ggggccacct gattcacacc 14340
tgtttcttca caaaattgat gacctcagtg attgaatgcc acactgctat ttttttgaac 14400
acaccccttt caactaattc aacgaattgc ccaattgcac agccttaaga gcgtgcatat 14460
catgaatgct gggtctcatt tgttttctga gaatctactg aacctactgg taacttgttt 14520
gccacgtagc aataaaaaaa tatacgaaaa accttgatta ttctggttag acacattgta 14580
ctgctattat tttgaacaat actgtagata gatagataga tagatagata gatagataga 14640
tagatagata gatagataga tagatagata gatagataga tagattctca cagcagagat 14700
gcagttcttc atctgaattg tccccgcagt cattatctcc gtcacatatc cagtgctgct 14760
gcacacacac gtggttcttg cactcgaaca gaaagggtgg acagtaggtg ctgtttgggt 14820
agcgtgtggc tgaaatatac acacatacag ctcaaagtca actttctcaa tacatgatga 14880
agatgcattc aagcattcag tcggaaacac acccatgact ttggcatggc tagtgctgtt 14940
tcaactactg gctgctcaaa atgatcctga tcttattgtg aacatttcat ctgtgcatat 15000
atatataaat atacatatat atatatatat atatatatat atatatatat atatatatat 15060
atatatatat atatatacac atatatacat atatatatat atatatatat atatatatat 15120
atatatatat atatatataa aggtttgtct tcatagtctt aaatcaaaat gtcaaattcc 15180
ttgatattcc ttattaagtt tgcatgaaga tgggaacagt tgcacaagcc ttttggactc 15240
tagagggcgc tgcaatacaa ctcagctgtc ttctggttgc atttcagaaa aaaaaaaagg 15300
aaaaaaacca tggcagcaga ctcacgacag gagctttcat cagtgaagtc gagacagtca 15360
gcgttgccgt cacatttaaa gcgctcggag atgcagtggc cgctaccgca ctggaagtaa 15420
cccgggtgac aggtccgcag ctctgaaaca cattcatcca caatctctca atacgccacg 15480
gagaatgaat tatgcacact ccctctaagt cattcacaag gtatcctggc acacaaatgc 15540
attgagatat cacccactga ggaccgatcc gtgataaaaa gacagtatta tgatgttgaa 15600
catggtggca agtgtcacag ttctctccat ccctctttta ctagtctgtc ctttctgctg 15660
tctcttcttt tctctcactc accacagtct ctttcatcag agttgtcctc gcagtcgttg 15720
tcatgatcac aaacccagcg gtcaggaatg cagtgcaggt tgtcgcagcg gtactcgctc 15780
tctgtgcacg gacgaggggc tgcggaggca taggaagagt gcatcagccg cttgtctaca 15840
tttaaccaac aattaaaaaa aatactgcaa tcagaattca aaagttaatt tttgtgaaca 15900
gaagtggcca aaatacagct gacacagcac ctgaacagtg cactgaaaac ctgaactttt 15960
actaaaacac aagggcagtg attagtcaca agactagagg tttatgtgat gagatacagc 16020
ttgctcttca aggccatttt tgggaggtga gttcatagct ggactcatca ggggagtgac 16080
ccgcgctcaa cacccctaca gctcttctgt agggaggaga aaagtgctga aaaaatcagc 16140
gcttctttca acatgcagct ccgaggtcca gcgcagacca tttgtcaacc tttcacactt 16200
cacagcacct cgtctgccat cacgtcttct cctccacact ctctgtccgt ccaatttgtt 16260
cctctcaatc tcttcatctt tgtctttact gttactctaa atatacaggt gtctttctct 16320
cttattaagg gaacagttca tccagcaata atgcattcac cctcatgtca ttttaaaacc 16380
agagaaagtg aaatgttaaa gaataactga gccattctat ttcaatgaca aaaaagcaga 16440
ataaaactag cttttacgat cggacaatat ttggcagaga tacaattaaa aaatcttgaa 16500
cctgagggtg caaaaaaaaa aaaaaaaaat ctaaatatta agagcatctt ctttaaagtt 16560
gtccaaatta aattcttagc aatgcatatt actaatcaaa agttatgttt ttatatattt 16620
atattgaaag tagtaaattt acaaaatgtc ttcatggaac atgatcttta tttaatattc 16680
tagtgatttt tggcataaaa gaaaaatcga taattttgac ccatacaatg ttttgttgta 16740
tatatctaca tatactatac cagtgcgatt tatgactggt tttgtggttc agttttattc 16800
cacttcttca gaagccattc attagataaa gccaaactca ttagctttat ctaatgagta 16860
aatcaaaatt taaagaaata gatcacccaa aaattacaat tctcaaaacc tgtataagtt 16920
tctttcttct gtcgaacata aaataagata ttttgaagtg tgttagtaac cgaatagttg 16980
ctggacccca ctgacttcct cactatggaa aaaatactat gaaaaagtca atggagacca 17040
aaaactgttt ggttacccac attcttcata ttcatttgag ttcagcagaa gagagaaagt 17100
catacatgtt tggaacaact tgagggagag ttaataattt ttgggtgaac tatcccatta 17160
aatcaatatt cacagaatat ctcgtcctgt acgtatcgtt ggttcacaca tgtcacaaac 17220
attaaaagac aatcatattt gatttaagag ctacatgaga gttgaccatt tatcatattt 17280
atttggagtt tattatctcc agtcctttta attaacattt ttatttcatt gaaccacaca 17340
aaggtgagta agtaatgact aatacttaac tatctctctg gtttgttgtg atattgaact 17400
gactgaaggc aagaggtgat ggctgttact cactgcagtt gcgttcgtca gatccgtcgc 17460
cgcagtcgtt gtctccatcg cagcgccagc gcagggggat gcactggtgg ttatcacagc 17520
ggaaatcacc cgccggctca caggtcaact cctctgcaaa cacaacacag acacacacaa 17580
accaacgctt aacagagcgg aagtgaacgt atttatgaag gtaaatgcac gatttttaag 17640
gacctttata agaacaagta aagaaagttt aaagaacaaa tggaagggga cacggagaaa 17700
tgaatcaaaa tgagtttatg attaaaacaa aaacaaattt atgatgatga ggtcataaaa 17760
aataactagt tcatgtatcc tactgtcaga tatttgttct tgttttaagc ataaacacac 17820
ttcaatttgt tagatttcac agatttcata cttcatatca ttttgcatct caagtatgaa 17880
tcctgatcct caaaatgttg aggtatttgt atacaacatt cattttttgt agtgcaatgc 17940
aacatttaat gccatgactt taagacgtcg tgctctgaat taatttgttc aagggcgaac 18000
taagatggaa accctgcaaa aaagaatgca gagtttaaaa aaaatagatg agaggagaag 18060
aaaaaacaag aaaactgaga ttgagaagga aaagaggtga gatagaatga gacgagggtg 18120
aaaaaagaga caagaagagg tcaaatgaga ggagtgaaga ggacaacata agaaagcaga 18180
ataagtgcaa tgagaagaga tgatgagaaa aggagacaaa gtgaggtgaa aaggaaagga 18240
taggagagga ggagatgaac acacacacac acagacacac acacacacac tggtgcctga 18300
cacggacata actaacacac acatgaatca gtggtgtggc cccagacagg agaaggggtc 18360
agcgagggcc cgtccatcac acacagactg aactaacact cactaatgat tcaatcacac 18420
gacacaacac caaccacaca gctttaaaca taataatcac tacacacatg attgaaataa 18480
atcttgaata ttttgtttaa gtcagcatca atgaaacggc gtccagtttg ggattttctg 18540
agcagtggct tatcgaaaag aaaaataaat ccaatgggaa ttaatatatt tttttctaaa 18600
actccattcc aacatatata aaaaaaaaaa atgtttttta tgctaatata aaaaacaaaa 18660
caaacaaaaa aaagccggac atggccaaaa tggatttttg ccgcatattt tcaacaaatg 18720
gattattata ctgatttacc cttttttaat aaaatgtgtc tagcaatttg tatttttttt 18780
gtttattttt atttaatact ttaactatga tattacttta ctatttctaa taaatatcta 18840
catttttgtg ttgtaagcat tgccgtgtga ataaaaacag ttatgtaaag taaagtttta 18900
aataatgcta tgaataattc aatttttttt aaaaatttta atttagattt acatttacac 18960
atttactgta gcacaagctt ctatccaaag caattcatct cagagccagc aatattcaaa 19020
atatataatt tcagatttat tagaaagcta gattaaaata acttaaatat ctacaattaa 19080
tatttaatat cggtttctga ataaaaaagt catttttgta tagtcattta aatgttcata 19140
ttctaatact tttgctatta ctttactatt ttattaatac aaatatctaa atttttcagt 19200
tgtaaacaaa aatattgttt tcagtcatgt taagaaatct tcaataattt tcataataat 19260
tttatgttta attattttta tcattttaat attaatacct ttactatatt tcatttataa 19320
aaagttaact ttttatgcac gttaatatat gcattcgtag tcgtgtttga agtggtgtgt 19380
tttctgctgt ctcaccacag ttttgttcgt cgctgttgtc cctgcagtcg ttgtgtccgt 19440
tgcacacggc ccacagagga acacagcggt agtttgtcct gcagtcgaac tctgtgtgat 19500
tgtcacatct gtacgccgga cccactgcaa acagcatccc acagatgatt aacaacaacc 19560
aaaaatactc agaaaggaga tttaaggtca cttcatactg cagaacctct atataaacta 19620
atgaaataac aagacagatg cattgtaacg gtgcagtaat gtgtgggcag ctcactgcat 19680
tcttcgatgg gctcgtccga gttgtctcca cagtcatcgt ccacgtcaca cttccagctc 19740
tgcgggatgc agcggccgtt cctgcacttg aactgtccgg gtctgcaggt cctgctggag 19800
cagtgagcgg gatcctcgtc agatccgtcg ccgcagtcgt tctccccgtc gcactgccag 19860
gcctcagaga tgcagcgctt gttggcacac tgccactgat gagactcaca ctgatgcgtg 19920
gctgaagggc agaaggagag gaagatgcat caccaaatac atgccaacag tttcatcaaa 19980
agcagatacc tgaataagag ggttattaca gctgactaaa actaaaccca ttaaaataac 20040
aggtcacgta aaataaaatt aaattaaatt aaataaaatt aaattaaata aaattaaaat 20100
aaattaaaat aaaattaaaa tgttaactga tataaaataa aataaaaacg tttttatttc 20160
agctagttgc tgaggcatca ttttaatttg tatttactta atttgatttg ctaaaataac 20220
taaaaaataa atgaaaaaaa aaaaactata cagacagata tatataaaag acaaaagaac 20280
aacctaaaac ttaaactaaa ataaaataaa acctatacaa ctatataaaa taataaaaaa 20340
ctattttttt taaatcagtg tttgattttt aaatacatat taaaggttag aaatgaaata 20400
catataaata ttacatggaa aaactaaagc aaaacatttg aaattaaata agttaaggta 20460
ttacaataaa agctatacag aaatattaaa acgatgaaag agatcacata aaatcactaa 20520
tatttcaaat gaaaaccaaa aagctaattt aaaaaatgaa taaacaatat aatggtatac 20580
aaatgattta aaacattcaa tacatctcag tttgtaatat gaaatatttg atttaaaata 20640
gaaatattaa atagtactgt acattataaa attttaaacc tggaaaaaaa ttcaaatgga 20700
aatcagaaat gttgccttag tacaaaaatt ttttaataaa ttaaagctta cgcagacagg 20760
tttaaaataa agacaaagga acattttctt aaactaaaat gaaaattgaa agtacaaaaa 20820
cttaacactg tagtacataa ataacactgg catcatcact gaaagtctag ttttggcatg 20880
tttcccttta aataataaaa tgttttagca aaaatgcatg tactgcaaat actgcttttt 20940
attacaacat tttattttat tatgaggctc aaacctacaa cttttcagca tcatgcccag 21000
atttttaacc attaggccct gagactgaca tgatatatgc tagtttttcc cataccacac 21060
agcacagcgt cctcgtccga cccatcggga cagtcctgat tagagttaca caggaagtga 21120
gggctggtgc agtttccgtc attacactgg aactggccca gtctgcagtg ccggacggga 21180
cacgtgtagg gttcatccga accgtctctg cagtcccgct gaccgtcaca cttccaccag 21240
atggggatac agctttaggc acagacagat acaaggttta gtttcagtta ggatgcttgc 21300
tcataaggca gctgtctatg tagtgagtat acagcaggct ttgaaacaga gtgtgcatgg 21360
attgagacac taaccgttca ttgtctgcac atctgtactg tgtgctggag cacatgggca 21420
ggcagcgtgc ggcgccccct atctgcacgg tgaggaagtg atcaggacac tcgcaggtga 21480
aaccctgacc tccggcccgc aggagacaca ggtgagaaca gcctccgtta ttgaccgcac 21540
agggattcgt cactacagga gaaagatgaa gtgttagaga gtcacagaca aacaaacaac 21600
caaagtttca ggtctgttct gaatctgtcc tgaattatgg acgcataaat tcaaataata 21660
aaatgcaact tatctaaatc atatcataac tataaactta acagtaatgg atgactaaga 21720
atattttagt gatacattaa aaaaaaaaaa aaaaacatac aaaaaaacat tcaaaataag 21780
gtaaatttta aagtattttt aaaaaattct ctaaaatcta acattacatt ttcttcaatg 21840
ttttttttaa taaaacataa gaaaaaaatc aataaatgct aaaataaatt taaatgataa 21900
ataactaata attctaagga tattataaat taatatattt tttaacgttg caaattaaat 21960
tccacagttt gatttgaagg acactttatt taatttttct tgatattaac agagaataat 22020
cagcaatgat aaaaaataat aacctaattt taatttgaaa cattgggaaa atttcagatt 22080
tttaaggaaa aattcaatgt ttgggatgca ttctgtgtta gcaattataa taaatgttat 22140
ttaatatttt tataatctga ttattctctg ttatttggac aaaagtaaat aaaatgccct 22200
taaaatcaaa cactggagtt tacttaatgt ttaaaatgta ttcagtgttt gaaataaaaa 22260
taaaaataat aattatcata ttataaatat cgatttattt tataattacc cttgttattc 22320
aacacttctt aatgaataat ggggtaatct ttagtaaatt gttcttataa tcaaaaacaa 22380
aaactaataa taaataaatg caaaaatatt ctaaaataaa tgcaaaacat aagtaagtgt 22440
ataatcatag gaataacatt ataaattcgt aacttcatac aggtttagtt tttgatttca 22500
gatttcaatg ttaaaatcta agaaaattca tcttttctgc aaataatcca gaaaaatatt 22560
aagaaaatga tcaattcata tttatttcat ttcaactttt gaataaaact catacatttc 22620
attttggaca gctttattaa ctttgatgcg ataaatgtgc ggtattgaat cgagtcgttc 22680
ttgttgtgca gactatatcc tcttacaaag aaccaggatg tggctccaca acatgtaggt 22740
gtagctcctc tttctacaga aagccacaga taaagagttt aaactacctg tcccggccat 22800
tacagcgatg ataaagagtt tagtctttcc tgtgtctgag aaaggacaga ctgtgtaaat 22860
attggcagag acccttaagt gcctcagtgg acgtttaaag gaagtgatag cgatctgacg 22920
tggtataatg agcgccccgc ttgggttacg ctgtcctggg ctcacctatg ggctggcggt 22980
agggatggca cacgtggatg tcgaacggtc ggtgggtggt gttgacgagc gcttcccgtc 23040
cagagccgtt gtatttgttg cccttctcca ccgtgcgcgt gttccagtcc gtccagtaca 23100
cgctttcctc gaacaccgtg atcgcgaaag gatgcggcag gactccgtcg tacacagtgt 23160
gcctgtgatt cccatccaga tcagagaatc tgaggagaca gaaccatctt actaatgtct 23220
gacaggttca ttttgggcaa aagaagaatg cagcagaaca acatgcataa ttagtcaaga 23280
cacagcatct ataaaatggt aaaaacaatg ttggtgcaca agcaaatctt cattattttg 23340
ggatttgtca gcatataatg ataaatgtaa tttaattaat caaattgatc agtttaatgt 23400
agagaaggat aaatgtgacc atgtcataag ggagatttgt agatcatctg aaagtttaat 23460
aaaaacgctt tccattgatg tatggtttgt taggatagga caatatttgg tcgatatttg 23520
aaaatctgga atctgaaaaa ttgcttttac agttgtccaa atgaagctct tagcaatgca 23580
tattattaat caaaaattac gttttgatat atttacggta ggaagtttac taaatatctt 23640
catgtttttt ggctattgct acaaatacac cctagcgact taagatgtaa tgctaaccat 23700
ataaacaaca acaacaaaca aacaaacaaa caacaacaac aaaaaaaaaa ctgatagact 23760
gactgtaaga atattctgtc tatttaattt tattttaaac attgaacagt tttcagtgct 23820
tgatgtgtgg aatttactca gtgtttgcca tttaaaataa aatgaattta taatattctt 23880
aagattattc tctattattc atgtgaaaag taaaatcaaa gatgttgttc aatgttctaa 23940
ccctttttct aatatatttt tcttattatt catataaaag tttaaaaata aacaaatgaa 24000
caaacaaatg ctaaaaatgc aaataataat taaataaata tataatcata agaataactg 24060
taaaaaaaat agtataaata tgtgaccgtg gagcacaaaa ccacccttaa gtcgctattt 24120
ttagaaatag ccaaagaata ataattttat gggtcaaaat tgttgatttt gtggtccggg 24180
gtaacatatg tccttaaaat caaaatctta ctcaatgtag ttgaggtgag catcagacca 24240
gtaaagcttg tcattggtgt agtcaatagt gagaccgttg ggccactcga tcttagtagt 24300
gatgatggcc gacttgttat tcccatccat cccgacacgt ccaatgaagg ctttgtctcc 24360
ccagtcagtc cagtacacat atctgcacaa agatgacatc tgatattaaa aaatagaagc 24420
taaaacaaat tccagcaaca taaaggccac atattgtaca ccacagttta aaagtttgag 24480
gtcgctaaga ccttattcag tttacaggct ctcctgctga ccaaggctac aattatttga 24540
caaaaaatgc aaaacatttt attacaattt aaaatatttt ggatatactg taaaatgtat 24600
tttttttctg tgatgcaaag ctgaatattc agcatcattg ctccagtctt cagtgtaatc 24660
ataaatcagt atttactaca cagagccata gttcacagac aagctatgca aaatcgcgtt 24720
cataatcgta gacgatacaa tcacgcgaaa atgaaatcta aaaaatcggt ctgtttttac 24780
tcataactta cctataaggc cctaaatggt ttagctccag cgtacctaac tagccttcta 24840
ccacgttaca acccatcacg ctccctaagg tcacaaaacg ctggactttt ggtagttcct 24900
aggatagcaa agtccactaa aggaggtaga gctttctcac atttggctcc caaactctgg 24960
aatagtcttc ctgataatgt tcggggttca gacacactct ctctgtttaa atctagatta 25020
aaaacacatc tctttcgcca agcattcgaa taatgtatct tttaaattgt gagtttagtt 25080
gcatctgttc aaaggtgcat ttttattcat tagcttgggt taaactaatt ttactttgtt 25140
ggatcagcag ctatgctaat gatgtctgta ttttgtttct ttgtttcgcc acgggattta 25200
catcccgtgg taactaggat ttacacaagc tccagtctgg atccagaaca cctgagaaga 25260
gatgatgctg accctcagag gaccccagat gatgctaacc ctgaatcaac aaacagaact 25320
aacaattatt gctaaatgtg tgactgaatc atataataac ttaattaata atattgatag 25380
ttcatcgtct agctgactac gtcttgtatt attattattt tttttatttt ttctaaaatc 25440
ctgtcaaatg tgcacaaact actagctact actaaatatt gtagaaacat aattttctgt 25500
aaagttgctt tgtaacgatt tattttgtaa aaagcgctat acaaataaac ttgaattgaa 25560
ttgaattgaa taacgtcagt ctagtgtttg aaataaattc ttctgagaga tgatgtggct 25620
tcttacacag aataaggcac acaacatatt taatagcagt gaaaaaggct gtgttgttta 25680
gcattgcatt attagtttat tataataaaa attctaatca gacgaactta gataaggcat 25740
atattgtcgt aatcgtgtgt ttattagtca cgttgaatca atgaaattag ataaatcttc 25800
tgtatattca gcaaacctat agggaattcc tgggtttctt ctgcgaggtg tattttgagt 25860
ttcagcgtga gagcgccctc tggccttcag atggagattt actactgatc acagaaccgt 25920
gcttcactga aagtatgcat ggcaacttct gtctaatcgc gatttatcgc ctttgcgatt 25980
taatttcaat gaatcgtgct gcagccctag gttggacagt aaacgtggat cgtagcagca 26040
aagacactgt gtgatgaaca tgctacataa aaatctggta aactcaccca aactttggat 26100
gcagaacgat ggctcttgga ttctcgaagc agtatgtgtt attagcatcc acacagtgtt 26160
ctgctagctt cctcacaaac cggccgtcca gctccgacac cttcatgcag tcgagaaagc 26220
tgtcgaccca gtacagcttc ctcccgaccc agtccaccgc cagaccctcg ccatgcaaaa 26280
tcccattgac caccacctct cgtcccgtcc cgttaaagaa catcctctcc aacaccctcc 26340
ggctcacgtc gatccagtac aagcgtctgt ccacccgatc gaagtccagc gccaccacac 26400
tagtgaggcc ctgcaggatg agcgagtagg cctcgccgtc cgtcgacagg ttgcgcagat 26460
agtagcggtt gctgaagatc aggtaggggc tgatgttgct gttttgcctg cagctgcgcc 26520
cgtcgggttc gcggaggaag cctggagcac acttgcacac gtacgagccc atggtgttct 26580
cgcacacctg gctgcacacg ctgggcgtga ccgagcactc gtccacatcg tcgcacgttt 26640
tgttgtcgga catgaggcgg aaaccgggtc gacaggtgca gatgaagctg gtgggagtgt 26700
cggtgcagtt gtggtcgcag tggtgcatgg atgggtctgt gcactcattg atgcctgaga 26760
caataaaaca gacaaataat acgattttta tgtatatacc gtattttccg ctctacaagg 26820
tgcacttaaa agcctttaat tttctcaaaa aatgacagtg cgtcttataa tccggagcgc 26880
cttatatatg gatcaaggcg ctctgtcaaa atgttgacat tccctttagc acagctccat 26940
ctagtggaag cataatgcaa gcccagtcaa acgtttgact gctgtatctt ctattctatg 27000
cgctttataa tccggtgcgc cctatatatg aaaacagttc tgaaataggc cattcattga 27060
aggtgctcct tataatccgg tgcgccttat agtgcggaaa atacagtata cataattttc 27120
attttatatt taaagctgca gtaggtaact tttgtaaata tatatttttt tcatatttgt 27180
taaacctgtc attatgtcct gacagtagaa tagacagata atctttgaaa aaatcaagct 27240
cctctggctc ctcccagtgg tcctattgcc atttgcagaa agtcatgcgc tcccggtgag 27300
aaacaaccaa tcagagctgc ggtccgtaac tttgtttgtg ttcaaaatgt agaaaaatgt 27360
atataataag cgagtacacc atgaatccat tttccaaacc gtgtttttag cttgtcctga 27420
gtcactaggg tacacctata ataagtgttt atattcggac tattttagat tgcttcgggg 27480
gtaccgcggc ggagtaaccc agtgcctttg tgattctgca tagacataaa cagagagaag 27540
tagttccggc tacgatgttc ttccgcaaga tgcaagcagt tctgttaatt aaccgctaga 27600
gcgtcaaaag ttccctaccg cagctttaac ttatattaag tatatttaat ttatcacttt 27660
tttttttatt taggcgtttt tgtgagtttt ttttctagct ataataaccc taatttatta 27720
ctcgcattga aagaaaataa aaaaaaatca ttaatgcaat aatttcacga taaaatacat 27780
atgctcacac tctttagaca taatcttaca tgctttttac agatggcctg atgtcctcta 27840
ctagtcctca catatacatt tctataaatg acagtgaata cagtgtttga cttgcattgt 27900
gtcactgagc ttaaaagatg attcaattca accatgcgat gatgcaaatg actttacttt 27960
tattataaat tgtaattaat ttataattaa attacttgta aagagggtat tttgccatta 28020
tttgcatgca ttgcatattt acctataaat aataaatatt acacagatat acatgtgaag 28080
caaaattact ttcaatagta attgaaaaac acacttaagt acagcaccag agtacaaata 28140
cagaccacag catgtcaagt tgtcggatga tgcagtgctt accacagcct ttctcatcac 28200
tgttgtcgtt gcagtcgtcg ttccggtcac acacctcgct gagaggtatg cagtttccgt 28260
tgtcacaccg gaagtttccg ggagggcagg tgggtgctgg agtcctgcac aggtgatcca 28320
gctcgtcgga ttcatcccca cagtcgttgt ccccatcaca gacgaaatcc tgcgagacgc 28380
agcggccgtt ctgacaggtg aactgatgct gttggcacgg ctggtaggtg cagccctgtt 28440
catcactgct gtccccacaa tcattacgac ggtcacacct aacagcaggg aattatggga 28500
gaaattactc tttttcactg gagtcaatca ccaaaacaaa gttttgtata aacactgcaa 28560
attgatttag aattgtattt ataaagttat gaacaaaata tgataaaata tgataaatta 28620
ttaaattaaa tatattcaat acaatgcatt atacaatata cagaattaac aaatttatta 28680
ttattattat tacttactgt atattattat ataatttcac actatttcat aaataaaaat 28740
tattataata tatttaacta ttttaaaaat tacaactgaa attatacaat gcattttcta 28800
tatttataat tatagtaata ataataataa aaataatacc ttattaatgt tattaataac 28860
aataaaaata tttctcaatg ttaaagtctt tgcatattat tataattaaa aaaatttaaa 28920
aaataggttc tgaatatcag ttagcaaaca cacacaaaat aaataaataa atctcagcca 28980
taaaaaacaa tgctcctgca aatgtttagc tctctatata atagtaacct gtaggagttt 29040
cggatgcaga ggccgttgtt gcaggtgaat tcgttggcag tgcaggatct gcgagtgcag 29100
ttctggtgct catcgtatgc gtccgaacaa tctgcgtctc cgtcacacac ccagtcccgc 29160
gggatacact tcctctgagg aggcctgtta tttatacagg tgaactcctg aggaaaacag 29220
gtacggttcg ctgtgtgtgt gagacaaaac aagaacatcg aatttaaaaa tggtactttt 29280
tggcacaaca tttatgttca acccagcata ataaatttgt gcacaaatgt gcactgtgaa 29340
ttccattatg gttctgttgc aaaaccctgt aagccgccta cctggacagc actttaaggc 29400
atcacagact tccaacttcc aaatttgttc taatgaatat ttatgcaata attatgattt 29460
ttgtaaaatt ttaagtatat ttatattact tttcaattaa ggcagtattt catgctaaat 29520
tagagataat gcctttagcc agattagtgc catctggtgg gagtaaaata agaaaaacat 29580
ctgcaattcc atggtttagc cactagactg ctgtatactg tagctctata taaaaggcat 29640
attaattctc tagttgattt tagacgtaaa tatttttttt tttattaagt tattaagtat 29700
tagatatata tttagacttt gtcagacaac ttttagtaaa gtttttgatt tttggggggg 29760
tggggtgttg aaatgctgat ttgaagtgtc agtttttatt cagtcagtta ctattactac 29820
agtcaaactt gaacacagag aaagcatttt gttacaccaa acattaaaat tctataaaaa 29880
tgtaacaaaa tgtttcatca gagcttaatt cttctgggtt tgatctgact tcaacctctt 29940
aaccacatat actgaagtat tatgatgtac tgtaattgtt atgattattc attagttgat 30000
ataacgtatt atacgttttt tttctttttt atatgcatta cgtattcatt ataatgtctt 30060
aagaataacc ttataatgta ttataaatac aggcttcata gaaagtgtta ccaatacttt 30120
tttaaacaat gaaaagtaaa aaagtaaaaa caagcacaga ctaaataaaa atggcctact 30180
ttgccaatta tttgtgtaga ctatgcattt ttatgtagtc taacaacata ttaacatcaa 30240
ggtttagtga caacaaaacc caaaccttaa tacgaactaa aacttttcat caaattcttt 30300
cacattatct aaactatgct atccacattc tgttcacagc tctatactcc aagtgtgtca 30360
acagttatct catttgtgtc tgtttttatt tgtcctatgg acatttttta gacaaatttg 30420
atagctcaat tgagtactcc atgaaattcc tgatcagcta tggagagcaa cctctggaga 30480
gcatggagag agtctgagag cagtcagcta aacacatcca caaagcaaat gtcccacacc 30540
cagccaccaa cactacctca ctaggaagga cgagagaaga cagatgacac aagaagacta 30600
aacatagcca aaggatgaaa caatgaagtt gtgtaagcac tttcccacta gaccatgtgg 30660
ttatgaccaa agtccagtag acgtcagaat agagttataa ctgtgaaact taaactagta 30720
atatattttt gttagcagac atcaatctga aatcaaatta aatactggat gaaaaattta 30780
aggtaaatta gaaatgttgc attcacaaat aaattgaaat tcaagcacta aaattacttt 30840
ttagtattac aaaaaattaa tttaatataa aaaaactaaa gagtaatatc tgaagaaata 30900
atgaatgaat aaataattat gtaaataaat ggcaaagcaa gacaaaaaaa acaatgaaaa 30960
caatgaaaac agtacatgca aatataataa caacaacaac aataataata attatttaca 31020
ttacatatta tgtttttttt ttctattgta gtccaacggc tatgaagtta ttagtatgag 31080
aaaatcagat aaaaaaaaaa tgggtacatt tggaaatgtg atgagggatt aataaatcaa 31140
aaataaaata aaattataaa ccaaaattgt gattaatgca tattgttttt tggaaactcc 31200
tataaagtaa atctgtatca gtcataaaaa gccaaatcta tcacttccta tagtacagta 31260
tatgcacttt tgcagctgtt ctgtgcgtaa acttttgggt gtaaatctaa agggacaaat 31320
atttcagtaa catgctgtaa caaaagtaca ttgtgaaatg agacctgtga tttggacttg 31380
gaccaggcaa tactgaccac agctgtgtct ctcgtcctcg tcgctcatgt ccccgcagtc 31440
gttgtctcca tcgcagatcc aggacgaagc gatgcacctc ccgtcatcgc agcggaactg 31500
gtcggagttg caggtcctca ccacggtggc tgcagggatg aaacacacgg cagtgagaag 31560
agccaccttc ccgagggcca tgtctcactc ggatctcaga ggaaactggg gaacatgtgg 31620
cccacgtaat ctatttaacc attaaatccc accctgagca gccagaaacg tggctgagct 31680
gctgctgctg ggattccact gccagaggaa ttctgggaga gaataagcat tcctctctct 31740
agtgtcccgc ttgaacaaaa ggcaccgagg ggtgctgtga ttaacgggac aaaagagaga 31800
ggatggggca gtgtgagtgt gtgagaggag ggggcagaga ctgtcaaagg aagggccagt 31860
gtgtgtgtgt gtgtgtgtga gggcttttct gccgcacgtc actttggagg aatgtgacac 31920
acaatagatg tccgggctga ccgctggcat cacagacaca cacccaaaaa cacttcagat 31980
acccctcgac tgtcattaag aaaagcaata ttctcatgtt atgttattgc acttgtgttt 32040
tgttatgtta ttttactgct ctgttatgac tttcttgtgt tgtggtttat aaaacttttt 32100
tctataattc atgtctacag catatgtgta tatcagactt gtttagttaa aaaaaaaaaa 32160
aaaaaaaaaa aagatacatt acttattaaa atatattgat aagaaaacct gcttgaaaca 32220
ttttggcctg aaattccaaa ttcggtttga aaaaatgtca atttagtgtt tccaaataaa 32280
tggacataat gaaaatgaca atatacaaag taaagtagtg attttattta aaatgttacc 32340
ttattttctc ttccaactac tgccttcaac acactaccat tacaagattt ttttttttta 32400
aagaaatgta tataatggat tgcggttctt ctgaactttc tatttaataa agaatcttgt 32460
aaaaaacaat gtatcactgt ttctacaaaa atgagaaaca gcacaacggt tttcaacatt 32520
tataataata aaatatatcc tgagcagcaa atccgcatat tagagtaatg atgctgaaaa 32580
ttcagctttg aatcacagag ataaattaca ttttactata tattcaacta gaaaacagtt 32640
attttaaatt gtaataatat ttcacaataa tacagtttta cctatttttt tatcaaatta 32700
tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 32760
aaaccttatt gtaccttatt gatgtaagga tctaaaaaca tcatgaatca aactcatttg 32820
tttagtgatt tacatacata tacacataca cgctatacac aaatagaacc gctaaatgtc 32880
tttcttgacc aagaagctct caaacctttg ttcttttgtt agggcatttt atcacagaca 32940
tactggtctg aataactaca tctgaagtct tttcctgcca aatgaaccag tttcttatac 33000
acacagtctg cctcatcttt ccactttctg actgcatgca tcaccagtgg agcgtaaaat 33060
gggggggggg ggggtacaat gggaaaccat cattaactca ccacccaaaa acatgggcaa 33120
tgctttccct tgaaaaacac agcttcatcc tcatattaca gtgttagtat tacagtatat 33180
ttagaaataa ctttctgcca catatgcaga atgagcgcca aataacatat attggtatat 33240
aattattgtt aatatgctat gtgtggcact tactacaggt aacgggctca tccgaaccat 33300
cactgcagtc tgtaccgcca tcacagtacc agtgtccggg aatacagcgt ccgctcgagc 33360
agcgaaactc attctgggaa caagtagatg tggctgtgga caaaacaaca caagacaaac 33420
aagagatgaa ccagaaacaa actcagcata catgggtcac tcatttaact gatattacag 33480
gaagggcaag ttgagcaaca gctttggctc aaaaagcaac aggaaattac taaacagtta 33540
ctcatctaac tccacactaa aggagaaaca tcaaaacaat gaaaatatta ttattacaca 33600
ttttgtcatt cttacattta ttttatttgg ttatgttttt cctctgcctt ttatgttagt 33660
atgatttaat attatttaaa tacctttaca gtttttttaa tagtatattg atcaataatt 33720
attcaacatt attgatcaaa taatctaaat tattgaacaa taatttcata atctattcaa 33780
tcaataatta aatcatttaa atatatgctt ttgtcatttt tttaaatgat actttttatg 33840
tttaatttat tttaattcca gatttacagt agatctaggt ataatttata ttagtttaca 33900
gctagtcatt ttagtacatc aacctatttt agttaattgc caagataata tttctcattt 33960
taatttaatt taatttaatt taatttaatt taatttaatt taatttaatt taattttcag 34020
tgagcacact aaagaaggcc ttgagccgaa atgtttatgt tttgtttatg tttatgttaa 34080
actaaaatat aacttacttt tttcataact atattataat aatagttatt attattataa 34140
ttattatata atgataataa taataattat tattattatt attattttta ttattattat 34200
tgcatttaat acaataataa taaagttttc aattttacat gacatttgca tggcaaacta 34260
aaagacataa agcataacaa atatcaggcc atttataaac ctaactatga tgattttgcc 34320
aagtactgta agacatgttc caggttcaac atttttctgt ccaaaaatat aaactaaact 34380
caatctctac ccctaatcct atacccataa cttcatccta aaatcattgt gaaatgatag 34440
ctgattaacg agggtgtaga agcaccaacc ctgatcgtaa gcctaaagca gatatttcct 34500
gaaaagttat attaaagtta tatatagtat attataattg gaatgttgtt ccaggaacat 34560
gttgtaacta tgttcccaca gtacagagga agacagtggg aaaggataac tcaccacaat 34620
gtgttgggct ctcgtcactg ttgtcaccac aatcattatc tccatcacac aggtacgagc 34680
gtgggatgca gatgttggtg gtctggcatt tggtgtgctc tggctgacat gtcctctcag 34740
ctgtgtttta taacaaggat ccattatgga ttgcttaata tcaacttttt tgtattatat 34800
aaatgctcat tagctgtagg ctgtttatac agtagctgac aaacaaccta gcaacttttt 34860
ttaggcaaac atgcaaatgt acttaccgca gttctgctca tcagaggtcc cattgtcgta 34920
gcaattgttc attccgttgc agacgtattc ccgtgcaatg cagcgaccgt tattgcaggt 34980
gaactcagtg ttggggtcac aggggcggaa tatgcagccc gtctcgtcac tgttatcccc 35040
acagtcatta tagtgatcgc agcggtagtg atatggcaca caacggccat ttccacaggt 35100
gaacacagtg ggctcgcagg tgtgaaatgc tgccatgcat aaaaaatgca cacacagaca 35160
aaaagattaa tttgatattt gctgtaggag ctcaacgaac tagcaggaac tactggagct 35220
caccaccttc ctcttctacc tatctctcta tttcagttta cctgagagtg aaatgaatgc 35280
tgctgcttgt gggactgcat gcgagagaga gcgagagcaa gagagagaga gagagagaga 35340
gaaagagatg cattgtggaa gaggttagta tgtgtttatg gatggttatt aaatttgatg 35400
tagtctaagg ctctgttcta aatcctagta agctgccttg gtgtcaattg atttcagcag 35460
aatattctat tagcaggtta agctaacaat tcaataatct ttttgcatgc atacatgcat 35520
gtgtttgtat ttatatacat aataaatata aacaatacac acaaatagta tgtaaacaaa 35580
cttttaattt gaattgcatt aatcacgatt aatcgttttg aagccctaat acttacttac 35640
taaaataaac attcttttag taacattatt agaatgtttg ttcaaagtta tctgatcttt 35700
aacaatgatc taaaaatgct agcaaaaaat aaatatttat acataattta tggaactttt 35760
taaaaaaaag ttttatttgg atggtcactg ttttttaaat gttactacgc atttcagaac 35820
gattaaagaa cgttcaaaaa taaaattcct ataatgtttg caatattata aaatagaatg 35880
ttcccttaac attcggatac tgaaaaacaa aacaaaacaa aatcatttaa gcgttcttag 35940
aacatatttg tttgttaact aggtactgca gcttgacaaa taatgcatta tgccattatc 36000
ttctggaaag gatgaatgca gtactgcttt gggatttggc ccaaagacag tataatcagg 36060
ttgcacttta gattttggaa cagagctttt agctacacat gaagagaagt aagcaagaat 36120
acacaccaaa aacgaaatga ctcttggatc ctggagcaaa agatttgaac agattgtgca 36180
agcaataaaa aaaggaagaa gaaacatgtt agtgaaagat tgtgccgcgg tgcaggagag 36240
gaaaactcca tcggcaaaca gacactcaca agatattttt aacattaggc ttgcagttta 36300
gtcatgcaca agtatcaatg ttaagttcac ccttcaaagt gcataatatg tcagtttgga 36360
ctctaggtgc atgtttggtg aaccatatat gaagccgctg gcatatattc tgaatcaaca 36420
acctgtgttt atataaaact gccttctatt ctgatttagg atttcaagac ctctgaagaa 36480
catgtacagt aagcaacaag gcatctcacc gcagacccgc tcgagctcat cactgccgtc 36540
cccacagtca tcatcgttgt cacatttcca ctgggcacgg atgcagcggc cattcatgca 36600
agtgaactgg cccgactgac agcgtgtccc attgtcaggg atacagtgtt tgttgtcggc 36660
cagataccag cggccctcag agggacactg acactctgct ccctgaggac ctgaaggaaa 36720
tagtgttgta ttagttcttc atttattcca attgtgtttt ttaaaagggt gatttgatta 36780
tgctaaaata acattatttt gtaaatttgg tgtaatgcaa tctgtttata tatacatttt 36840
tgcattattg acacactgtt ttcttaacaa attttgttca gttgctttga cccaatgtat 36900
tttgtttaaa gcgctatata aataaaggcg acttgacttg actttatgca gtttaaggtt 36960
aacaaacaaa ttttccacac actgtatatt atttgttgtg cctctatgcc ctgcctttct 37020
gaaacgtgtc aatttttgac aaagctcatt gctctgaaaa gtgaggtgtg ctctgattgg 37080
ccaactatcc agtgtgttgt gattggccga atacctcaag cgtgtgactg aaatcttacg 37140
ccccttacca tatgtgacct ggaccacaaa accagtcata aaggctcaat tttatatata 37200
tcacaagaaa gctgaataaa taaactttcc aattgatgaa ggatttgtta ggatcagaca 37260
atatttgtca gagatataac tatttgaaaa tctggaatct gggggtgcaa aaaaatctaa 37320
atattgagaa aatcaccttt aaagttgtcc aaatgaattc ttagcaatgc atattactat 37380
aaaaaatatg ttttaatata tgtaaagtat aaatattttc atgaaacatg atctttactt 37440
aatattctaa tgatttttta tttttaaaga aaaatctata attttgaccc ttacaatgta 37500
ttgttagcta atggtacaaa tgtaccccag agacttaaga ctgcttttgt gctccagggt 37560
cacatattta aaaacacacc gttacacctt ctttctttac gtaacattgg cgtagacaag 37620
tcttatatac atacatagtc tcatattata catatatttt ataaagaata tctttttggg 37680
tttgagactt tagtctttgc aactttactg atcttatata tgtgcaaaca gcttgaaaca 37740
ctagaaagac aaaggaaaac ttgaaattgc accatatgac ccctttgacg tagtgctaat 37800
gtacagtatt tgcatgattc attgagttgc tgtctacagt gttccaaatt caagttacta 37860
atgagaaatc ctttcagtat acagcaatgc agcccaacag gtttgggaat agagctttgg 37920
ttttggagga tactggctgt aaaaacctta agtagtatag aaaaagcata ttgtttggaa 37980
atataaacca acctggagcg caaatatggc tgcatccacc attgaattgt tcgcagggac 38040
tgctgcactg ctgctgtttg ctgttggaga gcacgtggat gtccatgggc cgcgatggaa 38100
ggttctgtag catcactctc tggtcggagc cgtcgtgctt gtttgcacgg tagatgctgc 38160
gtgtgttcca gtccgtccaa tagatatgct gaccatacac tgtcattgca aacgggtaga 38220
tggcggtgct tacgatgacc tcacggttag tcccggtgag tgagcatctc tcaatcttct 38280
gtctgcatag agagaacagt aaaccttcac ttatcattat gtggtgcaaa tatttataat 38340
gtattaagta taaattgtat aaactgtgta catatggaaa ggatttacca ctgatgatac 38400
aaacacgagt tttgagcagt gtagagtatc gcttgtttat agtttctcag atcacaaatg 38460
cagacatggt ttaagtttac gtggcacaat gcaatgcgta aaaccccagt tttagtcatt 38520
ataattcgga attatgtccc cattggatgc aacaaatgcc tcgtttgtaa tggtttttat 38580
tggtcttgtc tcgtcgagcc aggaaacaca cagcatcaca gtgttactaa caatttccgt 38640
cacatgcttg aggcatttga ccaatcagaa agcactggat agctagccaa tcagagcaca 38700
ccgtgctttt cagaccaatg agctttgtaa aaaaatcaat gagtttcaga aggcacataa 38760
accattacac caaatacaca aaataatgtt cattttagca gcatcataag acccctttaa 38820
aatgataaat cactgttaaa tgaatcttaa gaaagagttt tgccgctaat ataaaattaa 38880
tcaacctttt gtcttgcctg tgatttttca tagaggcaga ccaatgttat acaatatttt 38940
ttatatatta catgatagtt caaattataa tgtaaatcta aaattataat ttgataatta 39000
taataggtca ttacagcctc agcctcttgt gtgtaagctg attacaaaaa aaatgtgtta 39060
aacttctgtt aaaatatcct ttggtaataa agaccctccc atttaataag ttataaaatg 39120
agcactgaaa cagacacatt ttctcattca cctcctgatc aaagatggat ctgcaaatct 39180
cagtttttct tctgtttgtt ctatttacca gaggtacaaa atcttttttt tttgtaatgt 39240
tactgggtac aaataagatg cctctctctt tttcattact acaagacact ctctcagttg 39300
ttataagtgc acaagtttgt ctttcctgac aagtcaatgt ctttctggat ttaacagctg 39360
cttcagggca atattaacat gttcattatg atggattgaa attaaacttc tacctgacac 39420
agaacttcat ttctcgacag aatttataat tgcatgcata gaatactcta catagattgt 39480
tttgtttatt atttaaggta tattattaaa ttaaatgaac tataaatatt atgaatataa 39540
ttaaaagatg agatcttaca agctggcatc tgcccagtac agcctttgtt cgtcatagtc 39600
cagtgtcaga ccatttggcc aaaccagact agtgttgacg atctctgtcc tgaagtttcc 39660
tcccagagtt gcacgctcta tctttgcact ggttccccag tcggtccagt acatatacct 39720
gaccacatat acagagaaat gcatttgaat tagatgataa tatggctcat tgtatattta 39780
aaccacagtc taactgaatg agagactcac cctctacatg gatccagcat gatggctctg 39840
ggtctgggca cctgtgctac aacagtcctc tgagacccat ccactgccat ggagctgatg 39900
ctctggttaa tatagtcact gtaatagatc ctcttgttga tccagtcata ggcaatgcca 39960
tcaggagctc ccagatctcc acagagaaga aaatgttcag aaaaatagtt ataacaactt 40020
cattaggagc aaaggttcgg aataacatga ggttgagtaa ttgatgacag aattgtcacc 40080
aaaaaacggt ccctttaaat ctaattttcc caaactttct aatttaatgt ggttattgac 40140
agtttttttt tgcacacaat ttactgtgac tctcagacag tagacatgat cctaaatacc 40200
tgaactgaac atttgatctg ctcaaatcaa aatgtaaaac aacattaaca attaagcaaa 40260
ttttttgcct aagaaacaaa taacttctct acgtgcatga tctttctttt ttatatttca 40320
catttaaaag aacaggtttg ctcacgtagt actacataag gcctaaataa cctcaagtct 40380
gcgtactaac cattgtatat gtcatgcaag atactttttc aagccaacat tcaaagattg 40440
cctcaatgaa ctttccattt ccagctaaac ctgcagtgtc agggtgcatt aggagtttaa 40500
ataagcggct cacctgaggc cacctctgtc gcaggagagg taggtgaagc caaagtaatg 40560
aagctaatct tgctctggcc cactcccgag ctctgggtga aatagatcct gccgtccaac 40620
cggtcgaaat caagggctac cgaagtccgg ggcacgttaa ccacggggaa gggcagcgaa 40680
tgatcctctg gatccagccg cagagaccgt acagtgcttt ctgtcgtata gatgaggtag 40740
tcatcccggg ataccacgca actgctgcta tccgcagcca gattcccaaa ggcacaggaa 40800
cacttcctcg tctgagatcc tgggatggcg aagcagaagt gagcgcagcc gccgttagac 40860
tccaagcagg ggttgttatt gagctcgtgg gccgaacttg gctgcattct ccggtcgaag 40920
atggtgacgt cccgcagcat gttgatgttg tctcggatca cgagtggctg gtcggtcgca 40980
cctggctgtt tgctcgcttg gaagactttc ttcaggtttc tgtccaccca gattacattc 41040
ccctcgaaca cggtgacgcc gtacggagtc ggatagcgac tcccgtaccg tacgacctcc 41100
gtctcacccc catcgggccg gactcgtgcg atcatgtcca gggaatcgtc cacccagtag 41160
atgtatccgt ctcggtggtc caaagccagg cctctggggg tgacgatgcc tgaggaaacc 41220
aacacggtcc ggttggaacc gtctagaaaa gctcgctcga ttttgggtgt ctgtccatag 41280
tcggcccaga agaggtatct gttcatgggg tccaccacaa tatgacgagg catgtctact 41340
tgggtcttta gaaggacacg acggaacgtg gtattgagtc gaatcacttc gatgtacgtc 41400
tcggtcaaga aggcattggt gaaatagagg tttcctgtgg aaagacaaag acaagagaca 41460
ccagagaaac cgtgagaaac gctaaggctt aatgttgctt tgagagatta aggaacaaac 41520
actttgtttg tcgatgtaca catgcactag atgggtgtct attatttttc aggctgttgt 41580
accattagtg aaagtgagtg catcacagct gcgttttaaa atggcatgtc aagttagtcc 41640
gttatgaatg gtgttatgcg caacaaaaca tgttgcacaa catatggccc atttttagcc 41700
atagttgagt agaatgtata gtatatactg tatataaaca aataaataac tgacataaat 41760
cacttcgttc taaataaatg acataaatca cttcgttctg attggtccat tgcaacattc 41820
tgtggtcata cacagtatat atatacatat ataaagaaat tggtaatttt atgcagtaat 41880
aatgcattaa gtcagtaaag acatgcataa ttacttaata aatactgtta ttttgaactt 41940
tctatccacc aaagaatcat aaaaattgta tgtttgtttg tgttttggaa taaatgtata 42000
gttctatgcc ataatggtgc attacattca tcataattgg ccagtacaga catttataat 42060
gattttaaaa tatttatttt tcatataaat acttttaaac gttctgttca tcaaagaata 42120
taaaatatat catggtttcc acaaatattt taagcagcac agctgttttc aacattgatt 42180
ataagagatg tttcttgagc atcagtttct gaaggatcat gtgacactga agactggagt 42240
aatgatgcgg aaaattcagc tttgcatcac aggaataaat tacatttaga tataaattca 42300
aatagaaaac attatgataa aatgaatgca gttccattga gctaaggaga cttttgaatg 42360
gtatatatat atatggctat atatattgag aataaaacac ctgtaatgtt agattaggtc 42420
aatgccaatt acaactttgg attgcttatt cacatattca ggtcaaaagt gaatgtttaa 42480
caagccatgg gtcactgagg cctaaaccat cagaacctct tttttgtcat atgttaatct 42540
ttaatcaatg ccacctgttt gtgcttatac tgatgtgagt taatgtaaat gtccctacgg 42600
taaagcgatc accattctga cctctaaagc tgctctagag cattcctcac ttaacacttc 42660
atgagtccag tgtatctgaa gtacttgcca ttagtcttcc aattccttta gaagcacttc 42720
atccttcgta aagtgtcaaa ttagtttcat tcagaatgtc acaactgtac ttattattac 42780
cagagtcaaa caagttacca aatctgattt cacacaagcc ctcacctgcg gcccagtcga 42840
cagcgatgcc ccgtatgcca ttacgtccga tgccagaggt gacaatgctg cggaatccag 42900
atccgtccgg tttgatcctg cggattccat tctgagacgc cacggtgctg ctgaagtcac 42960
accagtaaat gaatccagaa gacatgtgca cgtccacatg aagagcattg cggcctacag 43020
gacacacaca gaaacacaca cacacacaca cacaaagttt ttaaattgta ctatgcgtgt 43080
ataatcacac acattattca taaatgtgac cctggagcac aaaaccagtc ataagtggca 43140
caggtatata taagggtcaa aattatagac ttttctttta tgccaaaaat cattaggata 43200
ttaagtaaag atcatgttcc atgaagatat ttttgtaaat ttcctaccat aaatatacca 43260
aaacttaatt tttgattagt aatttgaatt tggacgactt caaaggtgat tttctcaata 43320
tttagatttt tttgcacccc cagattttca aatagttgga cctcggacaa atattgtcct 43380
attgtccaaa aaactgaccc ttatgactgg ttttgtggtc cagggtcatt gcaaaattat 43440
tgcgggcaat gtacaataaa gcaacattgt aaatagcagt gtttttttat ttgggtttaa 43500
agtttaatag tgtctcttac taaaatatta gtagcaaaaa tgacattata gttggtaaat 43560
tagtctgttc acttcaatga acttatatgt gtatatgtat aatctattat taaatgatgt 43620
gtgtttgtgt gtgtgtgtgt gtgtgtatat aacatttaac atataacata taactttttt 43680
ttaacaaaat ataaaaaatg taaaaatata acactttatg aaaactttgt attatttata 43740
atccaaaaga ataaatataa tagaaaaatt ataaaataat agaattttat ttttatattt 43800
ttgctcataa atagattcat ttaaaactca aataaatatc tttgtgtgcc atttctcatg 43860
ttaaaatagt ttaagtatca ttcaacctag tgtttacata taggcgatat gaatttatat 43920
ttttttacga tctattttat tactgtcttt taatttatca attattctaa cattttgaat 43980
ctttatcgca tctacatagc atttaaaaca aacatatttt aaaaaaatta tacaaatatg 44040
ataaagaatt tttattctct ttattacaaa ataaatatca tgtaaagaga ataaaataat 44100
ataaaatata aattgggaaa taatatagat ataaataaat acatttcaat ttaaataaat 44160
aaatataaaa ttaaaaatga cattttaaat attttataca caaatgtata ataattcata 44220
tgttttctta attaaatata ttataatgtt taatataaaa gcgtttttct ctgtaccacc 44280
cagtgctact gaacccacct ctgcctgcca caggcaccat agactcggag tgatccgctc 44340
cctccagact gaagcccttg atggcagtga gcatcgagat caccacgtag gactggtatg 44400
gagagcaggt tctgttatca gcactgagct tgaaccccgt cgcacaggcg cagctgaaca 44460
ggccaccggg acgaggcaga cacaactgct gacacgtccc catgttatta ctgcagccgt 44520
tagacgagcc tgccgaggct atggggagac aggcagaaag acaaacagca gataatgaga 44580
acgagagacg gatatttaag agtacaggat actgactcat atctatctat tctggtaatg 44640
tatacacatt cctggtgcat atcagtgtgg agcagtactg atatattggg actctgcttt 44700
gctagcactt ccagatctta ctggaagcac tcatacaata gacggcgttc acagccctgg 44760
agaatgaaca tgtgcactga acaccgaaca aagggacact gattcttctc cacataaagc 44820
tgagtgtccc cctgaggcag aggggaacta attggtaaac tgatatccta ttgaaagtca 44880
ttcatcgctc tctggtaatt gctacacggc catcaaggtt acgatgactt ccagaaaaag 44940
accaccctat gtcagccgca gaatgaatca gaaataaatg ctttaaaaaa tggcactgag 45000
agatgccttc acaggcagca attgatcgat agaaatgaag aacattaatg tacatgttac 45060
agtatttttc ttcagaattg agttaaatgc aaataaaaag tgtgattatt ttacccaaca 45120
tttgcatagt gttgttttat gcttattaga attcttatcc aaaaaaaaaa aaaaaaatca 45180
cgcaaaaagt actttaaatt tctaaatgtc tgtataaaag ctgtatattg ttattgttat 45240
acatgtgaac ctagaccaca aaaccagtcg taagttgcac aggtatatct gtagcagaag 45300
ccaaagctaa caatacattt tatgggtcaa aattatctat ttttcttctt ttgtggtcca 45360
gggtcacaca catacacaaa tatatgttgc tttttaaatt tgattttgat tttaaattaa 45420
ataaaaatga tttttcattt cattaatttg cagcttattt taataaaaaa caattaactg 45480
ttgttcgtta ttgtcatata ttattatttt tggcttgtta ttgcttgttt tctgtatgat 45540
ttgacaatat tgtatgcaaa aaatcaagcc aatatactac tttaaattta attgaatatt 45600
aattaaattg gactctttta cccaacattt gtgtctgacg tgtagttgtt tgtttattag 45660
acattagcaa gatgacgtct accttatgtc tattttttat atatttctta ttttttatta 45720
tctttttgaa ttcaatagca gcttatttta attttttttt tttaactatt ctaaaaaaca 45780
agcatatgtt gtttaggtac agcaggtttt gaagcatggt agctggtttg agcttgttta 45840
agttggtctt tagttggtca tgagtgggtt taaactggtt ataagctggt cctgagcagg 45900
agttacttag gaccagttca ggatcagcac atgaccatct taaacaagct accatgcttc 45960
aaaacatacc taaccagcat aagctgtttt ttttaacagg gtattgtata tttttaactt 46020
gtcatcactt gctttgctta ctagagcacc aaactccact gaaatcagtt atgcactttg 46080
tatgggagga gctattttat gaagtgcatc acagtgtgag agccattttg ctacatgtat 46140
ttattatact cacagtctct gtggtgcact ttgagtactc tgagaccggc cacgttatcc 46200
cgcattacga cccggttgag tcctgtggct ttgtcgacgc gttcaatcac ctcaaagtct 46260
cgatctgtga agtacaggaa ggagtcgtat acggccacac cccacggatg ggacagtcca 46320
ttcaccatcg tgatgcggtt tgatccatcg ggatcaccgc gctcgatctg aaacatggaa 46380
gaaacagttg tgttaccctg ttaaataagt tttcatttgg ttgagataca gtatgtaacg 46440
gctgtcaaat gaaatagcga ccaacatggg ttctaaggtg gatgcaaaaa taaataaata 46500
aatgtatatg cagaaatgca ctattatagg aaatattact cagcagggac acatcaattt 46560
aaccaaaagt gggtaaaagc atttataatg ttacaaaaga attcttatat ttataaaatt 46620
ccaaatctgc atattagaat gatttctgaa tgatcatgtg acattgaaga atggagaaag 46680
ttcatcttgg aatacattta aatagcaccc tgctatttta aattgtatta atatttcaca 46740
atattgctgt ttttactgta tttttggcca gatatactca gcttttgtga acatcggata 46800
tctttttcaa catgagtgtt tctgactgtc ataccacgcc agttcctgta acagcccagt 46860
agagcttgtt ctctctgatg tctatggtta tgaactccac atgttccaga ttgtgggtga 46920
agaggatggc tgcgttggag ccgtccatat ctgccgatgc caccttagcc gggatcccgc 46980
tctctgttcc ctgatcagtc cagtacatct ttctaaaaca aagacagaac atcatgaaac 47040
agaacagcat catgtatgca aacccacggg aaacactact ctttcgtctc tttgtttgaa 47100
tacaaataca gcaattccat tcttatatta agcatataat ctaaggactg atcatgatta 47160
ctgtatgtgt gctttaagtt gactgttgtt tagtgagctc aggcagttca gtcctggcag 47220
gtgatattaa atcagctgga gcatgttgaa tctggggatc agggttaatt ccagaccaga 47280
cttctgctta tctggactta cccacgtgcc ggatccaccg cgatgccagc aggagctcct 47340
gcaccagtag gtgagccgtt gttagtgatg agagtctttc tgtactgaac ctctcctttt 47400
agcttgagaa cctgaaacaa atcatattgt gtttgtcaca ataacagtac cagtggcatt 47460
gctcaattat tgtattattg aattatttaa atgcaactgc acaacaaatt acatttaaac 47520
atttttaatt aaacgtacaa tttgtaagac atttgcaata aaatatccca aaaccactag 47580
gctagtgtta tatattttgt ccagctgatt actaacaata tctctaatgt tttcaactac 47640
ttgtaaatcg tgagaaaatt cccattctaa acagtgacac ggggcagtgc agtcgcctgt 47700
caatgacgtt agttaccctt tgttagcgcc tttactgaca tagaaaccat gtgacaacag 47760
tgtcgtggac gaatgcggaa gtagctttgt gacaaatttc aacaaaaaag agatgccgat 47820
ttcgcttgtg ttttactcga caggtgagtt gttttgattc gtatctttaa agaaaatgta 47880
tgctattata acaatcaaca acatcagtat tatggggcat acacggcaaa cgcggggcat 47940
cacgtctcta gacccgcgcg aggacgcacc tgatccatag tctgatcggg tctttgcatt 48000
gactttgtat gtaatctact cgcccaaatc gttgaactgg cgttaaatcc atgatttatg 48060
tttccgtctg gccattgatg gccattagtg gttgggtaga agacaacaaa tcccatcatt 48120
ccacgctcat ccttatcgtc atcaaaccac gcgattgtta ttgttttggt agtgcgccct 48180
gtagtggcag gtcctacaac ctgtaccttt aagtaaatat cttttcaaat aatgatccag 48240
gtaagtaata taaaaataga gaacaaagaa atgaataaca agcagaaaaa aaatacaaat 48300
gaaatttgta gcagtattaa gtagtcaagt aaatattcaa taattgaagc cgaatgcacg 48360
tctaatattt tgacatattg tatttttgtt tcacctgttc acagtgttta cattcattat 48420
gtatttttgt gttgaaaaca atgatttttg ttttcataat ttttcataag tgttttttta 48480
gtattattta tatattatta ttagtgctgt caagtgatta attgtgatta atacacacgc 48540
atacaaaata aaagtttttg tttgcataat atatgtgtaa taatatattt tattttggat 48600
gcaattaatt gcgattaatt atatgacagc actaattatt atttaaatac taaatattta 48660
ttattataaa atttaccttt tgcatatttt cagtttgcac ttgaatttta gtttaacttt 48720
taatattgtg cttttgtcat ttttttcata ttttttctat tgtctttatt tcagcattag 48780
tttgagcaac ttatctttca actttatttc tcattttcaa tatatatata tttttttttt 48840
tacaattttt ttttacattt ttgttactga taacaacaat gcattagcac acagaactaa 48900
ctggtcctaa agtagttttt aacattgatg aaactggtgt gtttaaacat ttcttcataa 48960
atcttcagca catagaaata accctctgcc ctgacaagaa caattcatga ttcgggggaa 49020
agagatctaa agtcttgcca acctctatag actgtgaagc agggttggag tagtacaggt 49080
tctgagacat ccagtccaga gccagtccga ccggagaacc caggatggct gctggtgcaa 49140
actcagttct gttggtacca tctgacttta ctcgatggat ctccccctgt gcagataaat 49200
aatattaaat taataggcaa aatcataatt atagcattca aaaaaacctt aatgatacaa 49260
ctcatttgag ataggataag atttaccgga tgttccaccc aatagatcat ctgctcagca 49320
tcatcaaagt cgacatcata tccattcagc agcccagcga ccggcaccat ggcgtcattg 49380
ctcttatcat tcgggttgag cgggatgcca taaatgatgc tgtctctcac caccaccagg 49440
aacggatcct caactgcaaa agtcatcagt cataattcac gtggtattca taaaaatcct 49500
aataatgggc tttttagcag tgtgtcacat cttatttcaa ccaaacaatt acgacatgtg 49560
atataattaa atgattgtgg gtgaacttcc tgtaagtaaa ctaagatatg tctagaaaaa 49620
gaagtacact ttagcatatt ttgataagag cacctattaa gatattgccc attcaataca 49680
attaagcgca ctgacttttc acaatggata gcacagattc ccagaattag gtttcattgc 49740
attgcattta gtgcagatat gagctactgg tgaaaaaggt cttgcagagc aaacatagca 49800
gtgtcagcac aattgattgc gataaacagc ccattcaatc agcgtttgtc tggattaatt 49860
gttaaacttt tatggaagaa tgcacttgac agcactaaca aaggctgaaa cctgagcgag 49920
gttgagagtt tgccctaagg gaatgttaag ggcttcttaa gactaaaggc ctcttgcgcg 49980
ctttcatccg ttaccgtcag ctgagctaag gccttttata atagtaaatg tttgcctctg 50040
caaagaagct tactaagtaa atcgaaaatg ggttgctgtt gacaagtttt cattcccatg 50100
aaatctatat ccaattttaa cagaagcctg ctttattcat tcacagtgaa gtagattaaa 50160
tggacagagg ccagaggtaa agacatagac acatgtatta ttcattaaaa cagccacaag 50220
caggaataaa ctgttaaact cattgcacaa attcatcttg tccattaaag tccttttaga 50280
tacaatggtt agcatcaacc tctttccctc aacagaaacc caataggatt tttccattag 50340
atttatggct tctttagctt ttgattcttt caggtttaat ttattaagaa aatctttaca 50400
aacttttatg aattttaaca acaagcagaa gcaatcttgg taacatgact tcattgtcat 50460
cgccatgaag tttaatgaca gctctttcag tctctttttt aaagtcggaa agtctgagtt 50520
ggaattgttt gcaagagcat tataaacact gcgagggtgt aaaaggtgaa tagtttgatg 50580
tgatgctatt ttatgtccct aattacctct gaagtcccat ttagccactt tttagcaact 50640
gtcttcttat ttacgataca caaaagctta aaaaaaatca caagtagggt ctttctgatt 50700
tattttatgt tgtagaataa atctttgagc ttatttcaaa agcaaataaa cttaactcaa 50760
aagggaaaat atatattttt ttatttgccc tattggcagt atatatattt attcatttat 50820
tatttttttt acattaacaa aataaatttt gatagatttt ccagaaaaca agacttcatc 50880
tattgtgtca ttatgcatct catgcaaatg caccttgatt gcagacaacg aaaaattcta 50940
aaagttctcc agactcgttt tttaaaagtt caactgactt taatttgagt gcaatgattt 51000
tataaatggt atgttgtcat gggtgagatg cgtgtaactg tttctgtggc aaaggtccct 51060
tcaccatgag aaactctgct ctagatatca gtatcggcca caaatatttg gctgatgtct 51120
ctcatacctc tagcgcatgt gaactggtca gcggcgagag tccaaccaga cggacaggca 51180
caggagtaga acctgggacc gaccgctgac aggagacaga tgtgagtgca ggggctcgat 51240
cgggggtcac aggggttata acctgcagga caaaacacat caaaactttg aagtcagcat 51300
ggaatcagtg taacccctgt tttcctttta atacactttt ctgtgtacaa ttcactggtg 51360
catgttattc tacagtaaaa tatgttattc ttggactaac atagtgtaca aaactgtcaa 51420
atattatcct tagactgtaa tcattataaa cacaaactgg tttcatcacc taacaggttc 51480
gtttgaaatg gtatgtattg tgaaaagcac tatataaata aataaattta aaactagttt 51540
aaatttacaa taaatactat ttactgaact tcagagtgtg tttttgactc acctaaatta 51600
cttctactgt ttttaataat cctaaagtgt acagtcaatt gcactgacaa gcatttatgg 51660
ttaaccaaaa tagacttaaa tgtaatttat attgtaaatc tgtatgtcat acagtatatt 51720
tgtaatgacg aagtagcaat ttaatacatt taacatgtaa catgtaataa aattatgatt 51780
tattatgtta gtcagcacat caaaataagt gtacttcttc aaagcttgac aacaggttat 51840
taaaacataa agactgtgta ttttaaagca acttctaatt tgacattatt acagattgca 51900
tattttaaaa gtgtgttagg aaacagtcat aaaagtggct ttttatataa ttaacaatct 51960
gcaagtatat ttttcaagta cacaacaagt gcacattcag tgcaattaag catgtgacaa 52020
aaagctctta ttattcagct cttttcaaac cttttttctc accatagtga cttccagaaa 52080
tgcctttttt tatttgctgt atttaatcct aacgataaga ctcttctcct caggtagcag 52140
ctagctgatc tggctgtatt tgcggtttgt aatcagggtt gtacctgcag gctgtcgggc 52200
aggatgaaga gccaccagac ccatgggctg gggcagatta tacagcatca cggtctggtt 52260
ctgtccgtgc cacttgttgg cccggatcac acggttgacg tatctgtctg tccaatagac 52320
aaaatcctca aaaatggtaa ttgcgtgggg atgttgcagc accttttaat tgagaaacaa 52380
taggtgtgtt atttattcac acagtaaata tttattcaaa cagtaaattc ataacttttt 52440
gagacactta gtgccagttt taataaaaag gggaaaatag cgcgatagca cagttccaaa 52500
acagcgccga tggctgtgta agtttcttgt gggtgattta ctaacaacac acaaattaaa 52560
caacacagaa gcaacatctc atttacttat cgaccaatgc aatatttcaa gaactgcgca 52620
aattagcata gattttgcct aaacaataaa taaataaatc agctagagta cattgaaaag 52680
aaccggaggc caaaaaatga agggttttga taaaccaggt attgcgtgac tcctaaatga 52740
gtcttttatt tcctgatgca aattaaaaaa aaataacatg gattggcaaa cgatatgttc 52800
tttccaaata aattaatatg tttggatgct ctctgttctc tgtggtgcct cctcctagca 52860
atacattgca tgcatttcaa gtgcaaaatg gtttaagaca tgcatttttg ggcgtttaat 52920
aatggtgctg taatgatgac agactcgaca tgggatccat ttgcaagctt ttattagaac 52980
actcgtagtc gtacaggcga agggtcagga tcagcaaaca cagtatagta gagatatcag 53040
aatcgtagtc aataccaagc agtgggtcgg ggtaaccagc aagtatcaca gtccgtatta 53100
caatctgggt tcaaaggtag gcagcaaagg ttctaggatc gataatcatc aagggttggc 53160
aactgggata gcaagacagg gtataacgct cagaaatgtt agccgtggca gaaacaagac 53220
ttcgcaatga ggtgacgcca aggtctggct tatatggcag tctctgatat ggaacacctg 53280
gccggtgatc agtccaaggt acggggctgt tgggtaatgt agtgcgtatc agtgtaagtg 53340
agtgtttgga tgcctgtagg tgtcagtatt cgggcgatgg tgccctctgc tggcctctgg 53400
agggactcac ggggttcgta ctcgtgacag agccccctcc ctgcgaacgc ctcctggcgt 53460
gaggtggagg gcgacgtcgg ggtcttcctc gaggtcgagg ggccggtttc tctggatgtt 53520
ctctatggaa atccgtggtt aagttggggt caagaatgtc ctcggcattg acccaggaat 53580
gttcctctgg accgtacccc tcccagtcaa ccaagtattg gagaactcgt ccccgacgtc 53640
ttgaatcaag cagctctcgc acctgatagg tctcctcgct ctccacctga accggttggg 53700
gactctgctc acggaacctc ccctcaccct cctcctcagg ggcagcagcg ggcttgagca 53760
gtgacatgtg gaaagtggga gaaatacgat atgtgtcagg caaagcaagg cggaatgaca 53820
caggagtgat ttgttgtgtg attttgaatg gacccacgta cctaggactt agctttcggc 53880
atggcaatct catccgaagg tctctggtgg acagccagac ccattgaccc ggttggtagg 53940
tgtgtccagg gcgtcgacga cggtcggcct gttccttcag gcgccgtaca gcttgttgaa 54000
gatgtacatg ggcctgattc catgtgtcct cgcttcgttg aagccaatca gtaacagcgg 54060
gaacattggt gggttcacct gaccagggga acatgggtgg ttggtaccct agaacgcatt 54120
tgaagggggt gattccagtg gaaggcttga tcagggaatt ctgagcatat tcggcccata 54180
ggagaaaacg agaccagtcc tcttgatggg agctgcagta ggagcgaagg aaccgagtga 54240
gctcctggtt gaggcgttcc acttgaccat tggcttcggg gtgatagcca gacgtgagac 54300
ttacgttgac ccctagggct tgaaagaaat ccttccataa acgagaggtg aactgaggac 54360
ctctatccga aacgatgtcg tcaggtatgc cgtaaaacct gaacacccaa ttacacagga 54420
gttcggcggt ctgaagagca gtagggagtt tgggtagagg tatgaggcga caagccttgg 54480
agaatcgatc gatgatcgtg agtatgaccg tgttaccctg ggataggggt aagtctgtga 54540
tgaagtctat ggcaaggtgt gaccagggtc gttgagggat aggaagaggt tgaaggagtc 54600
cagctggcag ttgacgaggg gttttgtgca tgttacacgt ctgacagttc tgaatgaatt 54660
taatggtgtc ggggttaatg gtctcccacc agaagcggtt cttgaggaga tggagtgtgg 54720
ctgtgatgcc ggggtgaccg gaggcaggaa gacagtgaat atgactcagg acctgatcac 54780
ggtgaggggc aggtacgtag gttttatccg gtggacaggc ggctggtggt gcttcttgtg 54840
aattgtgttg ttcaataaga gtcatgatat cccactggat gggagcaacc accacatgga 54900
agggcaatat gcgttcgggg actgagatgg aagggtcgtc atcatggata cgtgacaaag 54960
catcggcttt agtattcttg gaacctgggc gataagtgac attgaactgg aaccgggaga 55020
agaacaagga ccatctggct tggcgtgggt tgagtctctt ggcggagcgt aggtattcca 55080
gatttttatg atcagtaaag acggtgaaag gatgaatagc cccttcgagc caatgtctcc 55140
actcttcaaa cgctgccttc atggctaaca gttctctatc acccacatca taattcctct 55200
ccgctgggtt gagcttgcgt gagtagaaag cacagggatg aagtctctcc tgagggcctg 55260
gtcgttgtga gagaacggcc ccaatgccag tattggaagc gtcaacttcg actaagaagg 55320
gaagagaggg gtcagggtga tgcaggatag gagcggtgac aaagcgttcc tttaattcct 55380
taaaggcttg acgggaggct tcggcccagg tgagacgatg gatacctctt ttgatcatgg 55440
aagtgagagg actgacgact gtactgaaat ttctgataaa gcgtctgtag aagttggcga 55500
atcccaggaa ccgttgtagc tccttgagag tctgaggctc tggccacctg agtacagcag 55560
agaccttact gtcgtccata gcaactcccc ccggactgat gacataaccc aggaaggtag 55620
tggatgtggt gtgaaactca catttctcgg ctttggcgta cagttgatgt tcgatgaggc 55680
gctgtagcac agttctgacg tgttgaatat gatcctgtag ggtgttggag tagatgagaa 55740
tatcgtcaat gtagaccaca acaaacttgt tcagcatgtc cctgaagaca tcattaatga 55800
acgcctggaa gatggacgga ctattgacca gtccaaacgg cataacccgg tattcatagt 55860
gaccgttggt tgtagaaaag gccgtcttcc actcgtcacc ctctctgatg cgaatcaggt 55920
tgtaggcaca gcgtaagtca agcttggtgt agtattgagc tgtgcgaagt tgttctaatg 55980
cagctggaac cagaggaagg ggataacggt acttcaccgt gatctcgttg agaccacggt 56040
aatcaatgca gggtcttaaa ctgccgtcct tctttttgac gaagaagaaa ccagcggatg 56100
caggggaagt ggaaggacgt atgaagcctt tctccaactc ctccttgatg tatttagtca 56160
tggcttcggt ttctgggagt gacagtggga acacacgacc cttgggtgga ttgtgacctg 56220
gtagaagatc aacagcacag tcatggggtc gatgaggtgg taaggtgttt gcttgggttt 56280
tgctgaaagc tgccttgagg tcgtgatatt ccgctggtag gttgggaacc tcgaatgact 56340
ccttgttaat cgccaacgag cgcactggaa gaggagaaac ggtagacagg cattttgttt 56400
gacatttagt actccacttt attatctgac cttccctcca tgatactagc ggatcgtgca 56460
gtctcagcca tggtagtccc agaattatgg gtgtattaga cgtgggcaga acgtagaact 56520
gaatgacctc ctggtgcagt aaaccggctg tgacggatag actttgtgta aggtgagtga 56580
tgataccagt tcccagtgga cgaccatcta tggtagctac agacagagag ttagcacagg 56640
gtattaatga tacgttgttt gttcttgcaa actcagctga catgaaattc ccagccgctc 56700
cagagtccag taaagcgaat gtggtcactt gaacgttgtt aatggtcagc ttcaggggta 56760
caaacacact gtttactgga tggagagagt gtaaggattc actcaccaat ttggatgaga 56820
cagatgaagg acgtgaagga cagttgactc gttgatgacc aggggatccg caatacagac 56880
atagacgatt tgccaagcgg cgattcttct cctcagatga gagatgatag gtattaactt 56940
gcatgggttc agagtcctcg acagactgag ctgctttaat gacagtactg acatgagagc 57000
gaggtttacg tgaacgctgc agattgtcaa ttgctatcgt caactcgatg aaatcgttca 57060
aacttctccc ctcatctcga catgctagtt cagtttgtaa ctcatgactt aaaccctttc 57120
gaaacagaac ttttaaggtg tcactcaccc aggtcgtttg tgctgccaat gtgcggaagt 57180
tcatagcata ctcagcagct gaagatcccc cctgagagag ctctagcaag cgttcgccag 57240
cactctttcc ctccttggga tgatcgaaaa cctcacggaa gcgtgtgagg aaatcattga 57300
aggaggagaa ggatgatcca tccgtgcgcc atacagcggt aatccagtca agagcccttc 57360
cggttaacaa cgagcaaaca aaggcaacct tactgttgtt agtggaatag agcatgggtt 57420
gttgttccac aaacagcgaa cattgtagta agaagccttt acatttgtta gggtcaccgt 57480
cgaatctctc aggaaaagcc agtcgtgggt taatctgaga gggcgtggga accgcatgtg 57540
caatggcggc cgcttgtggc gcgggtgtcg tgacaacagg actatggaga ttctgaatag 57600
ccttcaccaa ttcctccgtg acggcggtaa gatgatttaa ctgttgctga ttagcagcga 57660
tctgacttgc ctgagctgcc aggttggtgc agagatgttc atgggctgct ggatcttgtc 57720
ctggcgaagt cttctgtaat gatgacagac tcgacatggg atccatttgc aagcttttat 57780
tagaacactc gtagtcgtac aggcgaaggg tcaggatcag caaacacagt atagtagaga 57840
tatcagaatc gtagtcaata ccaagcagtg ggtcggggta accagcaagt atcacagtcc 57900
gtattacaat ctgggttcaa aggtaggcag caaaggttct aggatcgata atcatcaagg 57960
gttggcaact gggatagcaa gacagggtat aacgctcaga aatgttagcc gtggcagaaa 58020
caagacttcg caatgaggtg acgccaaggt ctggcttata tggcagtctc tgatatggaa 58080
cacctggccg gtgatcagtc caaggtacgg ggctgttggg taatgtagtg cgtatcagtg 58140
taagtgagtg tttggatgcc tgtaggtgtc agtattcggg cgatggtgcc ctctgctggc 58200
ctctggaggg actcacgggg ttcgtactcg tgacaggtgc aaatggtggt aatgcatctg 58260
aaagggaaac tcatagagat gcatattcaa taaggtcaag cacaaaaatg actgtgtaca 58320
tgcattttca gtgctaattc gtcactgcac gtctagtaaa acctgacagc actattcact 58380
attttaaagc aaaagatgat ttgcgctctg ccttgtgact aaaaatggca tctatgcgac 58440
tttttatctc acaattcaga catttttttc tctgaaatta gatttttgtg agataattga 58500
aactttacat ctagcagttc tgtctttttc ctgatcataa aatctcaatt tcaagacaca 58560
ctcgcaattc tgagaaataa agtcagaatt gtgcgatata aacttcctac tgtgttataa 58620
agtcagaata gcaagacatt gcaagactct ttgtatctca aaattgcgaa tttattataa 58680
actcacaatt gaatttttct caaaaaaata aaaaattcta aattgcgact ttatatcatg 58740
caattctgac tttataacaa tcaaccgcga gtttctatga tgcacttctg aaaaaagttg 58800
cacaattatt tactttatcc atttttgatt caatggtgaa aaagaaaact atttaaaaat 58860
gttcttgatt taagaaagtt tatactggag aacaagacag aaactctgag aaagagcacc 58920
gttttgcagt ctgcaatgct ataatatcac ccatattcat aaacataaat gagtgcataa 58980
cacaaaagat ctcaccaaat cgctggccaa cacttgtttt cggttctttc cgtcgtaatc 59040
acagtagtcg ataaagtcta agtaagcatc agcaaagtaa agcaggttgt tggggtagtc 59100
aatagttaag ccattgggcc aataaagttt gctgctgatg atcgtggtcc ttaatttccc 59160
atccatgctg gctctctcaa tacgagggtt tctgccccag tccgtccaaa acatcagatg 59220
tgagctgaaa tacataaaaa tggatttcgt tttcaaactc cggtcattca aagtcatttt 59280
caaatacatg ttatttggtc actgatctta cttgtttcta gggtccacta ccagacctct 59340
tggattggtc acattttcac taatcagcac cactctgtga gacccgtcca gtttggacac 59400
ctctatggtc tccaaaatat agtctgtcca gtataggttt ctgccaaccc agtcgacggc 59460
gagactctca gtaacggtca ctccactgtc aaagatcttc ccgggaagac aaatgatatt 59520
ttagtgttaa ttgttggtca tgttgacttt aaagggtctt tgcacactga gtccgaaatt 59580
ctagtatgac atttttgcac gttaaaagaa ctaatacgag ctcacgttgt gtcaatcacg 59640
tttacacgct gccgtccgtt cggcataaaa aaaaaattag gatctggttt tattttctgc 59700
gttttcgcat ctgtagcaag cattttaatg ggaaaggatg acgaatacga ataaacgaat 59760
atgaaaattt cggactcagt gtgacaggac cttaagggtg ctggaggggt gttcatgtca 59820
ctcaccaagg tgcggtcggt cccattcttg tgagcgctcc atattctgtc ctgggtggtg 59880
tcagaccagt aaatacgatc cgtaacagag tcaaaatcca ctgccacaat attacggcca 59940
tcacgcacta gagaccgaat aatgtttggc tgtgcgagga tgtcgtctga tacaatctga 60000
ttccgactgg ccactaagag caaggcctct cgagatcctg tcacattgag aaagagagaa 60060
tagaattatg tcatatatta ttacaatttt aaataacttt ttattttaat atattcttca 60120
atgcaattta ttcctgtgat caaactgtat tttcagcatc attcctccag tcttcagtgt 60180
cacatgatcc ttcagaaata attctaatat gctgtatata ataagtttat ataaattaat 60240
agcattcatt tatatttatt tatttttaaa aaaaattggg gggggggggg ggattcattg 60300
ttacttccgg tcagtttaat gtgtccttgc ttaatacaat tttaaatctc tttaaaaaat 60360
tacattgacc ccaaacttca aatattaaat tacgtatata tatatatata tatatatata 60420
tatatttttt tttttacata taacttattt tgaatagata ttcatccttt aaatcaaggc 60480
tttcacaatc atgagaaaca attcagtcaa taataaaata cttcaaatta gatttacaag 60540
tgtttaaaat atatatttac ttgtatagac ttgtattggt tttaccagaa actgtaatgg 60600
accgtgttca tcacagtttg ccttctatag gtggcttgtt aaaaccaata aatcataaag 60660
tctttgttgc agaaaccatt caaattttct gtgatggttt tattggtttt tttttcagta 60720
gggacataag cctatatcct ataagcctgc tgtgtgtcac gtgacaagcc cctcaccccc 60780
ttcactcgtg cctcttcaaa aatgatgccc ctgacactac aatacaaaat gatttgtaaa 60840
ataattgaaa catgatgaaa cattcatttt agttagaaat tgttctagat tacattcaga 60900
ttgcagcgat aaatgacatg gtgaaaaatg actaaaggaa taataggcta taaagtagat 60960
taagattaat tttaaagttg tgccctctaa aagcaacagt tatggtggtc tagaaacgcc 61020
cctggctttc acaataatga taataataat aattccttac atttataaag cgcttttcta 61080
gacactcaaa gcgctcacaa tgatgagaaa catgattaag tcaagtgtgt caaacttttg 61140
actggcagtc agtgttttgc gcttcatatt tggatcgcaa tatgtgatta taactcaaag 61200
tacaccgtta tctcaaaaca tatcaaataa ccaggactag aaatgattcc aaacacgtat 61260
cagatctcag ttataaaact atgcaagacc tatttggcca tgtcagatag aagtataaaa 61320
aaaagagtca ttcaggaaga agacgaggta tatcggatga cagcattttt caccaggtta 61380
tctggaggga atggactctt gtcagcggca gagctcataa acaacccttt gtttggtgat 61440
taccacctaa ttacccccat aagaaagttt gtgactgaca tatatgttat tcctcttatc 61500
tgtggtaatc cgagaggctg ctttagaccc cgctgtgaac gtccagttgg cttccggagt 61560
cccgttgtgg ctccactcca ctaatatctg tcttagcact gtcagagatc acaaacctga 61620
ggccttgcag gtgcgcccgt ctgcttctag agtgtatcct tcctgacaat ggcagcggaa 61680
ggagcctctc tcgttgaagc agtgctgact gcagatgccc ggaggacggc actcatcgat 61740
gtcctcgcac gtctttgagt cgttactcag ctggaagcct ataggacacg tgcactgggc 61800
tccaaacggg ccttggatgc agccgtgagt gcagcccgca ttgttatctg agcagctctc 61860
ctgatctatg aggaagaaga cacatttaga tcaatgacaa tggaaagaaa aagcatgcat 61920
gaagacgcaa ggtattctgg gaacatgctc agcacaacca ttaaacggga agagtgctag 61980
ggctgggcaa aatctgtata gaaatctgta tatttataca tttagcttaa aattgcctaa 62040
aatatgtttt ttattttatt cagattaacc atttttgtag ccaaacaacc agtttacagt 62100
aatttataca tgttttaaca ataacagtgg tattatagtt aactaaaaaa agtaaaaaaa 62160
aaaaaaagta aaaaaaaata ataaataaat aaataaataa atacatgaat aaagaagaga 62220
agcaagggga aaaaacttaa caatgtattt cagctggttg cccaggaaat atttcattta 62280
gtttatcttg atatactaaa ataaccaaaa gtacaattaa aataaattta ttaaaataat 62340
aaacctattt taaaaataat aaaattaaaa gtggaaaaat gattaaaaaa aaatttcatt 62400
aaaactataa cagtatttca ttgatgctac agtaagacta atgcaaaaac tacaatacag 62460
ttaaaattat agttaataat atcaaagaaa tcaagaaagt ttccaaacac taaattaata 62520
atgaaggaac aaacatattt gattattttc atgaatatgc accaaaaacc tatatgcaga 62580
aatgtaaata aagtaaaatg tatctacata tatttatatt tcaaacattt caatattcaa 62640
tatccatttt tgccaattaa ataaatgcag taataatatt aataattata tatatatata 62700
tatatatata tatatatata tatatatata tatatatata tatatattaa agcatttaat 62760
ataaaaagtc acatggtcag aacattattg atggaggaaa ccaattactt aattggttcg 62820
gctgttggaa aaggaatcaa acttttcaat tctaatgttg tttttgctgc ttaagtttca 62880
ataagagcaa tttttataat attgcttaat atttaaaata tttaaaataa tactgcaaat 62940
atattgtgaa tcttcaatat attgcccagc cctaatgtta actgttgaaa tctaagctat 63000
ttctgtgttt gtgagccatt tctatggact aacacatgta acatatatca aacacagtac 63060
tacacacact acatctctaa gtgtctataa tgctaaacac acacaatctt gccatattta 63120
gacactctga acgtaattgg acggttagtg agtacagcca gacagatact gggatagatt 63180
agtgcttcat gcagaggatt gtgggtgatt agggatttag cctacctggt aaaggctcat 63240
ctgtgagaag ggagagcaac atggaaggaa aatcacagat cattgcaatg ttaattttca 63300
gatcagctaa acacatgcat gatttgacac atgggattta taacacagac gacgtactga 63360
atactccatg tatttgaatc aaccgcccga ccgcaatatc aacagaaata ttaaagattc 63420
cagtgtaaat taatgttctg gattcaagac aagttatgca tttgtaagat gaatggacta 63480
atgtactggc atccatagta tggaaaaatt gctagagtag ttaataggga ccagaaactg 63540
tcaggatacc cacattcttc aaaatatctt ctttacgttc aacagagaaa tggtcaaaga 63600
aactcaaacc ggtttgatac aacttaaagg tgagcgaact ttcacttttg gatgaatacc 63660
tgccaatata tacagctcat taggatttag aacaaagcca agatgtttat tagggagcta 63720
gaataatctt taaaaaaatg aatcaatata ttatactgta tgctttatct tatataataa 63780
agtttgtaca ctttcattcc agtagcatta aaaaaattct ttattcagca agaaaacatg 63840
caattgatca aaggtgactg taaagtcaat aataatgtta ccaaagattt ctatttcaaa 63900
taaatgctgc tctattaatt aaaaaaatcc tacaaacaac aaaaatatgt tcgttttttt 63960
aacactgata ataatgttta ttgagcagca aatcagcata ttagaattat ttcttgatga 64020
gtaacgagta atgatgctga aaatcagctc tgcatcacag aaataaatta catttaaaaa 64080
tacatccaaa tagaaaacag ctattttaaa ttgtaataat atttcacaat tttactgtat 64140
tttttttatc aaataaatgc agcctttgtg agacttattt caaaaccctt aaataataat 64200
cctgccacaa aatgctgaac agtaccgtac atgtcttaaa tattgctgta aatctcatta 64260
aacatatgcc atgacttact gcagagaggg gattcatcgg cgccgttggg acagtctggc 64320
gtgttgtcgc acaccctgct gaggttcaca cacacactgt gtccagggca ctgccactgc 64380
cagctgggac agcggaacgg cggcgtggga cagtctcgct catcgctgcc atcccgacag 64440
tcgttatcac catcacatat ccagccacgg aaaacacagt tcccattatc acagcggaac 64500
tgcgcggtgg gacacgtgcg cggggggcac gcgtggtgct cgtctgagcc gtcttcacag 64560
tctgggtgcc cgtcacattc ccagtttgag ggaacacaac tgccgtctga ctggcactga 64620
aactcactct ggtggtgaca catgccaggc ggcctcgtgg ctgggaaggt caaaggtcgt 64680
gttagcttca tttaacatac atcataatac aagcttcccg aggtttagaa ttaaggttca 64740
atataatgtt ttttaagtta aaacaagaag atgtccagga catattccat taaaaaaaag 64800
tacaataatt aaattaaacc tttttttgat ttttttttat tacacttaag cacatttttc 64860
cctacatttg ataactttaa tgaaaaattt tttttgtatc cttagtactg cagtataaat 64920
aaatataatc aataaaataa tgtattcatg cagtatttgt tgtttcaatt ttttttaatt 64980
atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 65040
atatatatat atatataaat gtatcaaata aaaaataaat aaaaacataa ataaaaaaga 65100
cgaccaaaat agtcatattg caaaaacaac aataataata atactagtat ctttagtatt 65160
agatgtaggg tgaacagtaa cctggaaatg tttttgtgag attcactcat ttataaactc 65220
taaatgtaat gcaattaatg cattattgaa gattatcaga actgaaagaa ttcatttgtt 65280
cctacttacg acagccagat tcatccgagc gatcgttaca gtcaaagacc ccgtcacatc 65340
ggtagtagga attgacacac tgatggccgt ttgcacactt aaactcgtac gctgtgcagt 65400
tgtacactga tggtcagaag aacaagaatc agtcggatca agaaaaacaa gtcatcaact 65460
gtgttaatct tccattagag attttattat gccgtttatg ttgtgaaaga ttctatgtat 65520
tcgtcaatga ttgcacagtt attggtgggc atgtctggtt tatgttgaaa tagaggttta 65580
tgaagaattt aaatggtatt aacacatttt taataaataa atcattttat ataaacaata 65640
tggagtgttg atgagatgca ataatgaaga gtttggtctt gcatgtaatg aactcttacc 65700
acagccctgt tcatccgctc catctgcaca atctctgtct ccatcacaca cgtaattggg 65760
gtcaatgcag cggtggtccg gacactggaa ctggtcaggg tggcatgtgc ccatcgaatc 65820
tagaagaaca gacaccatta gaaaacttca tagagcaaaa gttgctcttt caaagctcaa 65880
gacttgggat tctcaggtac gtttcattca cagatgttac aaataaaaac agacacaaca 65940
ttttgacaag aatatacagt gcaatatgaa cagcacaaag ctcggataat ttcgttaaac 66000
caaaatttga tacaaatatt ttagttcttt tggagatctg acttttttat atatatgaat 66060
gcaggcatgt atttatgtat actgttttag gaagccttcg aaatgcatct aatccaatca 66120
gaatttagat ttaaagctat ctaatattta tatactgttt ttattagtgc tgtcaaacga 66180
ttaatcgcga ttaatcgcat ccagaataaa agtttttgat cacataatat atgtgtatgt 66240
tctgtttatt tattatgtat atataaatac acacatgcat gtgtatatgt aggaaaaata 66300
tgttatcttc atatatgaaa tatatataat ataaataata tgaatataaa tatatacata 66360
tgctgtatgt gtgtatcttt atacatacat acaaacatac ataataaata tatatagtac 66420
acaaacatat attatgtaaa caaacttttt tttggatgcg attaatagtg attaatcgtt 66480
tgaaagcact agtttttttt atataacttt atgtttttta tttaattctc agtgttgatt 66540
gccacttcaa tcaaatgcgc acaaggtgtt attttcattg atcatactga tttaaattaa 66600
agtatctttt atatactctt atagtttttg ttaatttttt atttagattt aatatatttt 66660
ttaatcaact taacttaaat gcaatagtgt ttggagtttt aaataaaaat atagttttta 66720
tttattcatt tgaatttatt tagtgttatt tttatttagt gtttttttag tacttcaagt 66780
tacaaaacaa gctgttgtaa aaaaatatat atatataatt attatatata aattatatat 66840
aattatatat ataattataa atctattttg aattatatat ataattatat tttttatttt 66900
agttacatat aataatatcc ctatgtctca ccgcagttgt tctcatcgga tccgtctccg 66960
cagtcattgt cagtgtcaca gagccatgtg cgagggatgc agtgatggtt ggcgcagctg 67020
aactggtcag cctggcaagt gccaggggtc tgggtgggac agtccctctc atcactgcca 67080
tcaaaacagt cattgtgtcc atcgcagtgc cagcttctgg gcacacagcg ctgattggca 67140
caggtgaagg caatgggcga gcatgtgtta tctgagccgg cagagagagg aaaaaatgta 67200
catgctgctt attttcatgt taagtggaaa aagactataa aatctcatag aaatacatgc 67260
attattttaa ctcaaaaaga aagcatggca ctagcaatgc catgatcatg ggttttgatt 67320
ctctgagcat acatgataca gtgtaaatgc atgatgcaaa tgtaatttgc ttgggttgta 67380
tttattttta tttagtgttt tggtgatttt agtgcatttt taatgctgtg ccaagctgaa 67440
agctggtaag tgtacccatg tagagagttt tgaaaaagtg acctaataat gaaaataatg 67500
tccttgtgat tgctcgttac taatgtacag tatctataag cacatgaata tttgcatgaa 67560
atataaagta aaagattact attagcgccg cagttggcct cgtcactgtt gtcgtggcag 67620
tcatcgacgc tgtcacactg gtagctctga ggcacacatt tcccattggt gcatgagaag 67680
gagttagagc cacactggag ggtcggcggc tcattggatg gatcatccac acacgtctgc 67740
tgattggctt ccagcttcat tccgtacgga cagccacaca ctcgctggga gtacggcgct 67800
gggaaacaga aatgactgca gtctccattt gggtttgttt gcctgttgca gaagttagat 67860
cctgttggga gtaaagtatt gcaaacatat ttgattcatt tgctgcattt gaataaatgg 67920
cctttatgaa ccctaaaacc tttgtggagg cattgcattc aatgcaaaaa tgagtaaagt 67980
aaatgaatca acagaaagtg agtctcaccg atctgcgagt cagcgttgaa ggatttgaca 68040
tgcatgatgt ggctgatccc tcgccgtatg atcatcatct cccctccgtc tgttttacgg 68100
accctcacga ttcctcctag cctccagtcg gtgaagtaca catagcctgg accagcaaac 68160
ttcattagtg taatcaaaca ggccatcttt gagaaatacc agtgcgtgaa ctctgacttc 68220
tgcactttta tctttcagca ccgaaactgc cgaactacac tgaggttaca gagaccttca 68280
tttcacagat accagacgtg actgtccatc atcaaagaag aggccaacga gactaacatc 68340
tttaagatga catagagatg atgatagaca tacattaata gaaaaattag accgctttct 68400
aaaagtcaga cacgtctcag tgaagacaga tatacagatt actgaaatta tattgaacag 68460
atattgaatt atccaacgct agaatgatag atagaacgat agaatgatag atagaacgac 68520
agaatgatag atagaacgat agaatgatag aacgatggat acatagatag aatgatagat 68580
agatagaaca atagatagaa caatagatag aacgatacaa tgatagatag aacgatagat 68640
agacagaaca atagatagaa caatagaaac acagatagat agatagatag atagatagat 68700
agatagatag atagatagat agatagatag aacgatagat agaacgatag atagaacgat 68760
agaatgatag aacgatagat ttaacgatag atagatagat agatagatag atagatagat 68820
agatagatag atagatagat agatagatag atagatagat agatagatag atagatagat 68880
agatagatag aatgatagat agaacaatag atagaacaat agatagacag aacgatagaa 68940
caatagatag aacgatagaa tgatagatag aacgatagaa tgatagaacg atagatttaa 69000
cgatagatag atagatagat agatagatag atagatagat agatagatag atagatagat 69060
agattgatag atagatagat agatagatag atagatagat agatagatag atagatagat 69120
agatagatag atagaatgat agatagaaca atagatagaa caatagatag acagaacgat 69180
agaacaatag atagaacgat agaatgatag atagaacgat agatagaata atagatagaa 69240
cgatagatag aatgatagat agacagaaca atagatagaa cgacagaaac acggatagat 69300
agatagatag atagatagat agatagatag atagatagat agatagatag atagatagat 69360
agatagatag atagatagaa cgatagatag acagaacgct agaacaatag atagatagat 69420
agatagatag atagatagat agaacgatag atagacagaa cgatagatag atagatagat 69480
agatagatag atagatagat agatagatag atagatagat agatagatag atagatagat 69540
agatagatag atagatagat agatagatag atagatagat agatagatag acagaacaat 69600
agatagatag acgcacctcc aaagacagta agtccaaacg gatgggagat ctgagtgatg 69660
cgatccaagg acaatctgtt ttgtccgtca aagttgctgt gctcgatctt atcaaagaag 69720
gcgtccaccc aaaacagccg catggcactg agaggcaggg acacgaccga gagagagttt 69780
tagaccaata ctgagaaaca ggattatgtc acatcatgat ttaagtgcaa aaaaaataat 69840
aatgatgaac cacggcttca ggaagccatc gaagctaaag caaagaggaa ggtttatgaa 69900
tgaaacagaa atggatcatt gtttatagta ttatgagtac aactgaatgc tgtattgctg 69960
tacttttttt cggggaactg taactcacct ccagtcaatg gccaagccat tcggccagcc 70020
aagagtggtg tttacaatcg actgagcatg tgacccgtca ccccaggccc gcataatctt 70080
cgccggacgg taccagtcag tccagaagat ataactgccg agcacaaaca cacacacact 70140
gctttaaaag aaatagttca cccaaaaata aacatttact caccctcata cctatttttt 70200
actacatttc ctcagacgaa cacaagcttt tttacatgca attacaatga atgagcacta 70260
gagcttttaa gttttaaaat taatctttaa tatacagtag aagttgttca tattacttgt 70320
gagtaatata cacatacata caataccttt caaaagtttg gtaagattta tttaaaagaa 70380
gtctcttatg ctcaccaaga ctgcatttat ttgatcacag atgcattaaa aacagaaata 70440
ctgtgaaata ttattaaatg gtttttattt taaacggtta tttatcaata ttatttttca 70500
ggattttttg atgtatttaa ttgacataga aatttttaca cattatttgt attaactttc 70560
acttttcatt aatttattgt atctttgctg aataaaacta ttattttctt tcaataaaat 70620
aaaccttttg aacagcactg tctatacaga atttatatat atacaaatgt tcacttgttc 70680
aatttctgaa gtcttcagaa gtcatatgat ggactacttt tataatatac ctttaacttt 70740
ggatttagcc ccagtcataa actgaaaaga ggggccaaaa gtcatacagg tttgggacag 70800
catgtggatg agggaacaat accaaaatca ccacaaatct tcaccccatg ccctcaccaa 70860
aagccaaatt aataggtggg gatgtatttg aacccatccg ctgggtttta tttataaact 70920
gtgaggtata gacagttatc tgagtgctct gtggggtttt cccccattaa gccaagttct 70980
tagtgtggac tacaataccg gccttcatgc agcactaatc tagtctcctg ggcccagagg 71040
cctcatgact ttgaagccat tggaaatgtc ccactagcag ggcataaaga gctcctttat 71100
ctgtgccagg ttcaaagcag gagcaatcta tgaatggctg ggcacctcct gcattatgaa 71160
aggtgtgact ggacacaagt ggcccgattg ttaatgcttt ttcacggtct tagggaaaca 71220
gagaggcgag gtgaaagcag ggctcccccg aactgtgttc ctggctcacg gtcgctcgct 71280
gaattgtgtc tgttgacatc tgaacgtgcc tgtgatttca aacccacggt cgcaaccctt 71340
ttttccatga aggatgaagt ttgagtgtgt ttattttgtg tatgcaagta ttagcctaaa 71400
tgtttctgtt ctaaaagctg catataaaga cttcatatta aggcatcata gcgaaaactt 71460
tttcaaaaga tacaatatca gcataactat agcatattta tttatcctga aacgcaagat 71520
aaatatttat taaaaatata attaattaaa ttacagttta catcaccaac taaaactata 71580
aaaatatata tatatatata tatatatata tatatatata tatatattag tggtgggccg 71640
ttattggcgt taacgtgctg tgttaacgtg tgactcttat cgggcgataa aaaaaaatat 71700
cgccgttaat ctattctcaa agttgggttg agagctgggt ctatattagg cgagctatga 71760
tgactttcac cttgatattt tagtgcggat gtatacctaa ccgaattgca ctgtagggag 71820
cgagaatgac gtgcaaacat ggatgcagct gaagctgccg ggtttgcttc agggaatttt 71880
tttttttttt aagaagcttc ccaatggaaa ccattttgga aacaagaaac agctcctggt 71940
ctaatgcacc acctggcttt agaaacccgt tctaaaatac ttacttttag tcattgtttt 72000
atttgtagca cacatattct gaatgccttt ggcagaattc aagtgagcta ttttaatcta 72060
gactaattcc aagattacag agggattaat ctagataaaa aaattaatct atgcccacct 72120
ctaatacata tatatatata tatatatata tatatatata tatatatata tatatatacc 72180
gtattttccg gactataagt cacacctttt ttcacagttt ggctggtcct gcgacttata 72240
atcaggtgcg acttatttat ctaaattaat ttgacatgaa ccgagagaaa tgaactaaga 72300
gaaaacatta ccatctccag ccgcgagagg gcgctctacg ctgccagaga tgctggtcag 72360
tgcccctgta gttctacact gagcggcata gagcgccctc tcgcggctgt agatggtaat 72420
tttttctctt ggttcttggt tctaaataaa tgcgacttat agtccagtgc gacttatata 72480
tgttttttta gtcatcatga cgtatttttg gactgatgcg acttatactc aggtgcgact 72540
tatagtccga aaaaatacag tgtgtgtgtg tatatgtata tatatatata tatatatata 72600
taaattaaaa attgccttta aagttgtcta agtgaagctc ttagcaattc attttaaaat 72660
aaatcaaaaa tttaattttg atataattac agtaggaaat gtacaaaata ttttattgaa 72720
catgatcttt acttaacatc ctaatgattt ttggcataaa agaaaaacca ataattttga 72780
cccatgcaat gtatttttgg ctattgctac aaataaaccc cagtgacttc agactggttt 72840
tgtagtccag ggttacatac acccacccac acacacacac acacacacac acacacacac 72900
acacacacac acacacacac acacacacac acatacatac atatacatat atatatatat 72960
atatatatat atatatatat atatatatat atatatagag cattgttata aacaagacta 73020
gccaataaac cactcaccca gctactggat ggaccactat ggacctgggg ttgttgaggt 73080
ttcgtacaat ggctcgtctg gtcttatctg cgaccttcat gacagatata ctcctgtagc 73140
gtggatctgt ccagtacagg ttcttcgata tccaatcata agcaagatct tctgcaccat 73200
ccacacgatt ggctgctagt acctccctgc ctgtggagat taaagcaaag aaacacaaag 73260
attagagcac ggtccacttc ttgaactttt caaaaattca gttttcacaa aaatgttatg 73320
cagcacatgg ttatcaatat tgataatgac aatacatgtt tcctgagcag tgaatcagca 73380
cattagaatg atttctgaag atcatgtgac actgaaacct ggagtaatga tgatgaaaat 73440
tcagctttga tatcacagga ataaatacca ttttaaaata tatttaaata gaaaacagtt 73500
actttaaatt catataaatt ttttctgtat caaataaatg catcattgct actttgggga 73560
acatagtgca aatgaatgtg ctatccctta tatgtggctc acaggtgttg aatgaaactg 73620
tctatattat tttgggttga gtcaggggga ggagggtatt agtgtcctgg atttctatta 73680
gaaatgtagt ccaacaccag caaacacatt atctcatgta gctggccttt gaaggcagct 73740
gataggcgag atcacatgca gcaaaatgtg ttgttggcac acagcaaata tcccgagggc 73800
tgagtggagt attcagtgag tatcagtttc tcactttatc tgctcagaag taaaggacat 73860
ttcatcaaac agagcacaga ttctgtgctt tttttttcac caacaaaatg aataagaaaa 73920
tctgaattga agaacactcc agtgaataca agcaattata ttctgggtca gaaaataaag 73980
gcagggtata tgattggatt tttgttatgc tggttaaaag tcttctcaca ctacgatagc 74040
aaacactaag tggtcttaat gtatttcgat gaatttatat cgtctgtgga aggtgtaggg 74100
ctgtaaaatg tttgtcctat catatgtgaa ccaggatcac aaaaccagtc ctaagtgtca 74160
atttatttta ttaagtttta aacttaatga ataagctttc cattgatgta tggtttgtta 74220
tgataggaca atatttgttt gagatgcaac catttaaaaa tctggaatct gagggtgcaa 74280
aaatttaaaa atgtaaatat tgagaaaatt gcttttaaag ttgtccaaat gaagttctta 74340
gcaatgttta ttacttataa aaaattaaat tttgacatat tgtctgtagg aaatttacta 74400
aataacttca tggaacatga tctttactta atatccttat gttttttgca taaaagaaaa 74460
atagatcatt ttgacccata caatgtattt ttggctattg ctacaaatgt accccagaga 74520
cttcagactg gttttgtgct ccagggtcac atatgcctct gtccaagggc tatgataggc 74580
tgtcctgcct gccagtcaat gtatgtattt gcatacctcc acgcatcctg tttatgcagt 74640
tatgatacat cacatgaggt aatctgacat catttctcat atctgatctg ccagtgtgcg 74700
ttcatgtgtt ttggaggagg cgtggtctta gagtgtgatt tgcagggagg gtgagaactt 74760
atgtttttaa ggctagcttg ctattgctaa cctctaagga atctcctacc ctacctttaa 74820
aactggctcg ctaaattcag gatacgaggc aaaaaaagcc ccttaactct ttcctcgcca 74880
ttgatgagtt aactcttcat ttagaagaca acgcttcccc accaatgaca agtttttaaa 74940
tcatcaaaat cgctggcggt ggctggcaac ttttttcaaa aatgccgggt gggacagagt 75000
taaaaaagca acttgtttct aacttttggg gttgacacaa attcccaagc cattctcacc 75060
ggttccatca attttctgtt tgtagatgat gtccttcacc gtatcagaga agaaaatgga 75120
attgtctgca gcactgaagt ccactccaac gaaataagag ggagttcctg tgatagggag 75180
tataatgtct tcctgcgtgg acaggttgaa ggggattcct ctgaccgcca gctgactgga 75240
gaacagcaga aactgcctca cagctgcaag acacatattg catgcatatt gttttcttta 75300
aaagtcaagc tgttgcatgt tgcataacca aagacacaag gtatgactga gatctgtctt 75360
acacatgcat ctcttgccat cagcatgcag atcgaaaccc atccggcagc ggcagcgata 75420
tccaagacct ccgttatcag ttctgtgact cagaacacag atgtgttcac agcctcctcg 75480
atttgtgcca caggggttac ttactgtccg atcaaacaaa cgcacatgtc aataaatcac 75540
aaaaagaaga gcaataattt aattatgtct gcaagatttt tgtttaaagt ggctacattt 75600
aaaacctctc ctaagcacag tgagtccacc ctgctgtcct ggatacaaca ggacagatta 75660
acttaaacta caaatacgtt gcaatgcaat cacattgcat ttctggctat catcatgtat 75720
ggtatcagtg actattaaca aaacttttcg ggccatgttt aacctgtatt tagcgcatac 75780
tacttgtgag cgcaacaaat cctagatgta cgtcataggg aacataaatt gactatatat 75840
aatattcaac attttgtata tacagcattt atttacttgc atatatttgt ttataaattt 75900
gcatatgtaa attttaacat gcaactcttt tgtaaatagt taaaaccagt caattttgca 75960
cacatttatt taattacgtc atttgcgatg tatatatata tatatatata tatatataaa 76020
cttctgattt tttaaaaatg aatttttgtt ttatattata acaagttgta atttaataat 76080
ctgaggtcaa actgactgtc atgtgctttt gtaaaatgta ataaaattgt gagttatttt 76140
cctagtattt tatatttcaa ctttagttta agtcttagct attttattgt gcttttgtag 76200
tttttattat tttgttttaa tatttctatt tagatttcag tttttgtcat tttagcgctt 76260
cagccaaacc tgttttattt cagtcagttg cgaaagccaa tattttttaa ttccatgcaa 76320
gttttttaag ttaagttcta acatctaatc taatattcat attttatttc agctttattt 76380
caatgaacaa aaaatatttt aacagtttta cataacaata acaggctcat ccgtttgatt 76440
tattgcgtat aatttttttc ttttttgata acttctataa ccaaaaccac aatgaaatgc 76500
tatatttcca gaaccaaaac tgcactgcaa tcttctttct gacatccaac agtttaatat 76560
acataacaca gctccttacc gaagggctgt ctgtatgcat ggaccacagt caggccatgt 76620
ggtctctgag aggtcctgta gagctcttgt gggctgttgt cgctgtattt attagctttc 76680
ataacggcca tcttggtcca gtcagtgtag tagaccgaat gctcaaacag agtaatgcca 76740
aacggatgag gaatcacaga cccaccatgg accacggttt tcctattgga gaattcaaat 76800
aagagttaaa aacacttccg gaaagtgtga aaaaatgata aaaatcagca tgcacttcca 76860
aataaaacat ctgcaacccc ttaaagatac ctgtgcaagc catcataagt ggtggtttca 76920
atataatcgt aacggctgtc cacccaatac actcttttag cttcaaggtc caaagtaatc 76980
ccagcaggcc agccgagttt ggtcctgatg atcccatagc gattggtccc atccatgaag 77040
gccctctcca aacccggttc cccattcaaa gtatcccagt cagagaagaa aaggtaccta 77100
aactcaaaca caacaacctc tttgtgacac aagactttgt tttaatcatc taaaatagaa 77160
aggaactaat cctaaatgcc gctatcccgt tataatcatg aattgcgatt tattaatctc 77220
acgttgcact aaaagagacg aagagcaact tgaacttcaa tggtatttgc acaaatatat 77280
tgttagggct cagtttggac agcatcgggt tacggataaa acccttgagg gataagaacg 77340
agattttgaa atgaatacac aaagaacatt atttaagcga tgcaactatt tattgtcact 77400
tctcaattta gcaggaggcc acctggcctg ggtttctgtg aagctcatct ttgcttaatt 77460
acttctgact aatccttaca gacagtgtac aacattccca ttctcagtat ctgaggctcc 77520
acatcggaga tcagctttac gattatgatt aggtggagga agcaagggta ggattaaagt 77580
ctcccccggg agctcacccc acggtcgggt caagcgcgag acctctcggg ttgcccaggt 77640
gctcggcgat cagcgtgacc cggttgctcc cgtccaggtc caccatgtcg attctgttga 77700
cgctggcctc caccacgtac agcttgttat tcacccagtc cacagccagg ttttcaggat 77760
agtccacgga tacgttcagg accacctgca aatgcgaccc gtccatatcc acggaaaaca 77820
cctgtttgga gcaaacaaga tgtggttgga gaagttgtcg ttgaggtttt gacatgctgc 77880
tgaaaaggcc ggcatatgta gagaatgttg gtttgctgtt gtcacttcta aaggttctta 77940
atgggttaaa actgactagt tcaaaccgga gtcattactg ttaactgaaa ctattaaaaa 78000
cgttttacta attaaaaagt taccttggca agttgaagaa ctggaattac tcactaaagt 78060
agtaagtaaa tatggtttat attgtaaaat gttgtaaata ataaatatta taaatttttg 78120
aaaaatgtaa gcatcttaat taaagtaatg taaaatatta catagaaact ttaacgattg 78180
gaaattagaa atgttgcctt ggcaaattgt agaactggaa ttactaacta aattagtaaa 78240
taaatattta tatagtaaat tataaattgt tataaatgat aaatattata tatgaacaga 78300
aatacatttt tttaactatt gaaaaattaa gcatgttaaa aagcatggaa ttgaaataaa 78360
atattaaaat ctaatattag atgaaaaatt taattaaatg aaaatttgaa atgaagaact 78420
gaaattatta actaaattac taaataaaat gaatatcttg taattttttt agaatagttt 78480
aaattaactt aaatactaaa tgaataaaaa ataaaacagt tttattgtac attaaaatga 78540
cgaaataaaa tgaaataaat agaaaaacgt aaactaaatg aaaattataa atgctgcttt 78600
gctaattaaa aaatgaattc actaactatt ataatttact aactaaatat taaagagaca 78660
tgaaaatgaa ataaattaaa atgacatgaa tgaaattgaa aaatgttgaa attactaaat 78720
aatatacaaa ataaatgaaa aaacaaacat tctaaataaa taaaataaga atgaaataac 78780
taaaaaaata aaaactaaac cagaacgaaa ataaaatttt actaaaccta aataggattt 78840
atttatttat ttattaaatg acaaaatgtt tacaatggct aaaagtaatt tctttcctac 78900
gcattacgaa accagtctat aaacactcaa actgatctaa ggtggtcttt tcagcagtaa 78960
tcctacaaga acataatcaa aacttgacat tagattctct tcttaaataa tcacgggtaa 79020
taagactgtg gtctggcttc tagtatctga cgtccctgag tttggctttc tccgctgccc 79080
cagaggtggt actctgaaac cactgcccta ttgattaaaa gatggcacca gtttctcagg 79140
ctggatccct tggcatcata tcactaacta aggcaccaat gctctttgca ggcttcgccc 79200
tttggaaacc agagctaaac gtctaaaaga gctgacccgt taacagaaac tcaagctgtg 79260
aaggtggtct ggtggtgtgg ctctcgcaga ccccctgcca cgagactcag aagcccgtct 79320
tcttgcatgt acccactttt ttttcctcca tggctgatct ccaaatcatg aagcagccaa 79380
gagtttgaaa caaagctctt tgagctgtcc aaagaatgcg taaaaaaggg acagatactg 79440
cacgctcact ccagatactc tcaggttgca tacaattacg cagaaatatg cgagcatttc 79500
ctctcatatc agacactttc tggagattag aaaacgtgga gtctagcacc gatccgagcc 79560
agttaccctt cgggagccag actggtcgcc cattattaac gtattagcag tatctacagg 79620
catcagctga agaatcccct ttctgctgat gaagagccaa cttccttcat agcttccaca 79680
aactgttttt tttctttaat cctcactaat acttagctct gaaattttat atatatatat 79740
atatatatat atatatatat atatatatat atatatatat atatatatat atatatataa 79800
attgcaccat tgcattttct acactggaaa ttgaaaaaaa tatatataaa taaatacagt 79860
atttacaata taaattgcac tacaatttaa caaatacaca cgctgtaaac tggtgttcag 79920
tgatggtcat taagacgttt ttagaaaaat tattttagag aaattaagaa ttgatcaaaa 79980
gtgtccggaa aatacattta taatattgca aaatatttat atttttaaat aaatgctgtt 80040
cttttgagct ttctactcat cgagaagtcc tgggaaatga acatgtatca cagtttcgag 80100
aaaaatatta cgcagtcgca actgttttca acattgaata tgtttcattt aatcagacat 80160
gtttcttgag ctgcaaatct gattgatttc tgaagatcat gtgacactga agactggagg 80220
aatgatgctg aaaactcagc tttgatcaca ggaataaatt acatttgaaa atatattaaa 80280
gtagaaacca gttatcttaa attgtaataa tatttcacaa tatttctgtt tttactgtat 80340
cattagcctt ggtcagacaa aaaaaaaaaa tcgtatcgac catcatatac tgtatagctg 80400
ttatatttaa tacaaagcaa tccctactga gtgactccta catactggaa attcacatta 80460
aaaaaaattg ttaacttgct gctaatatgc atttgaaagt tgaacatttt aacccattat 80520
ttatacactt taaatttgtt ttcatttcaa agtggatggc tagggttaac ataatataag 80580
ttgggcaaga atgcttcagg aagagtttag ggaggcttga aagatactgg cacactttat 80640
ttgagtccag cagactttat ttgggccttt agccaaacgc tgttgctggg gtgtatgaga 80700
ggtgagatca gagcagactg actggcacaa tgaagatgtg tgtgtgtgtg tgtgtgttga 80760
gggtcaggga cggtagtgca tgtccaagga ggaagagcag gtataatgca gagctttgtt 80820
atcaacaaca aatgttacat ggatagctga caaggttagg acttcagtaa tctgtgaaaa 80880
gtcacttatt aaatttggtg aaactgagaa cccatgacag gtgccagatc attaaacgaa 80940
tgtaagcggg aaacattaat gccacacact cagccatgaa atataataga ctgtggtcta 81000
agtccattag gactgggcaa taaatatatt tgatgattat tagattaact ataaaacata 81060
tatggcccgg tttcacagac ggggcttaga ctaagccagg attagatcat agttcaatta 81120
agacatttaa gcaatttttt acaaatgtgc cttagacaaa aaaacattac tggtgtgcat 81180
cttgacacaa aacaaagatc agtcaatgca agtttctttc agatgaaaca gctctgattt 81240
acttattagt ctaggactag gtttaagact tgtctgtgaa accgggggat aatgtattaa 81300
attaagcaac agcgtaaata tggtcgttaa ttaactacag gttaactgtt tttaacttgc 81360
attaaacatc tactaggata taatggctgt ttgatggaaa ttagtaaaaa atgcaaatac 81420
ataaagagtg tcatgatgac tgaaaaatat caagagactc cattcaagca tttgaatcat 81480
tttcagtata aaagaagtct tgttaaaaaa tacagtaaaa tcagtaataa tgtgaaatat 81540
tattgcagtt tcaattagcc gttttctttt tgaatatatt taaaaatgta atttatttct 81600
atgacacaaa agctgtattt tcagcatcat tactccagtc ttcagtgtca catgatcctt 81660
cagaaatcat tctaataagc tgatttgctg ctcaataaat atttcttatt attaccgacg 81720
ttgaaaacag tgctgcttcg cattttgggg aaactgtgat acattaaatt ttattttgaa 81780
atttgaaatt gaaatcttat gtaacatcat aaatgtcttt actttataaa tgccagtcct 81840
aatgctatat tcttagcaac tatgtaactt cagaaggact gatgttgtta gtctgtcatc 81900
cgtttgacct tattttggat tgtgtcggtc caaaagactc gttgcagctg gaaatggaag 81960
tccacgccaa cagcaactcc tctgttctgg gactggacca gggttcgagc gttgtgtccg 82020
tggacatcag cgatcagcag gtctctcccg ttggagaata taagcgaagg aaccccagct 82080
gaagattgca gaacaaagtg aattcatgtg gtgagcttca aaatgaagga tacagtttga 82140
caatctttgg agaactcgca taatacaatg tacttctgat aagacctaac taatgtgaga 82200
gagtttggga ttggctgtgt ggtgccagcg tttgttattc atttccagga cctgccaaat 82260
aaatggttgg gaggcagaca tttcaggagc ttaataacac agtggcattc aatgcaaact 82320
gaggcgcttg aagagagatg ggctgcaaaa agcctgatta atggacatat aagataacag 82380
ttaacaccag gaatattagt ctctaagaaa cttatgtgaa agcatgcagg aaatgtaaca 82440
gcttacaaaa ttatttaata aagatgtaat ttaatatgat ttcttaatgt gaaaagtgac 82500
acagagaaac acaataaaaa tactgtataa gcatgagtaa tataatataa agtatgtatt 82560
aatattattt attatgagaa aaattacccc gagtgaataa atattttttt caaaatacat 82620
attaaccttt gtaaaatata tttaaaataa tattaataca tttaatgtga tttaatataa 82680
gaaaaaagga tactgtgtaa ataaatgaat acaaatttaa aatgtgcaag attaattaca 82740
tttaattctt tattatgaga ataaaagtac actgagtaaa actgaaaaat agaattaata 82800
aaaatattta aaagaaacat ttattaatgc agtatgactt taataaaata tacttgtaaa 82860
tctgtgtaaa atatatttaa aataaacatt aacaaacatt tatgatttac cattaaacaa 82920
aatgaaccat taggactggg tgacaaatat gtatgtttaa aatttcacaa aataaaagct 82980
aatattttgc ttgcactata aaatccaaaa caaaaagttt tgattaactt tttttgtttt 83040
tattattttt tttttactag aaaatatatt gcattaaaaa aaactatgct acaattgctt 83100
ttacaagctg ctttctgtct ttttgcttca aattttcaat ttttatttga tgtgttactt 83160
gctatttgtt tgtaattaat tgtgaaattt acaagcaatt tctaaagtaa aaataaattc 83220
aatgcacatg acttggatta tgttatagct gtatatcact gccgtatgga tggtactttg 83280
acactgatct ctgactgata gcagaatact ttcagaaaca tgaattttct gtgaagctgc 83340
tttgaaacaa gaattgtgaa aagctttata cacatataaa cttgaatcag ttacaggcct 83400
tcatcaaaca cagagatagt atactgaagg gactcacaag agggattggc tcggcagtat 83460
ctgtgctgct ccagaaggta accgtcccga cagctgcagc gatggcttcc ggtgcggtcc 83520
tcacacagct ggtcacacat gccccacaga ctacagtcat cataatctgc agagacgtgc 83580
acagaataca ggagagttga tggcaagtca ccggtttgac tgaatatgag caactgtttg 83640
taatctccaa tcgaaggttt taccaatgca tgagcggctg tcgtttctgc tgaccatgta 83700
tcccagcgga caggaacagg ttcctccggc gggagaggag tgacagtgat gctcacagct 83760
gagagacgca cagcgccaga tacctgacac aaaacacagc agacgttagg aaaatgtgca 83820
aatcgttaaa gcaaatcgta aagttctaat taaagatttt tggagtacgt tttacgagaa 83880
aattattttt aattatttaa atttttccaa atttatttca aagtgttact tgcttttctc 83940
tttaattatt tgtgactgtg gagcacaaga tctgtcttaa gtcactgggg tatatttgta 84000
gaaatagcca acaatacatt gcatgagtca aaatgatcaa tttttctttt atcccaaaaa 84060
tctttattta gttaagatat tttgtaaatg tcctaccata aatatatcaa aacttaattt 84120
ttgattagta atatgcattg ctaagaacta catttgaaca actttaaagc aattttctca 84180
gtatttagat ttttttgcag cctcaaattg aagaacatgg ccaataattg ttctatccta 84240
acaaactata catcagtgga aagcttattt attattcatt atattatttt gaatttattc 84300
agcttttaga tgatgtataa atctcaataa aagaaaaaat ggcccctttt gactggtttt 84360
gtggtccaat gtcacatttg tgaaatttac tagcaatttg aaaactttcc acataatttt 84420
tttcagtgaa ttaaaaatca attgacagtg aaataaataa agctgaagta aaataaaata 84480
taaaaattac attaaaaaac atgactaaaa tgactaaaat taaaatagaa taaattaatg 84540
tttgaaatgt ttacgttact aaaaacgata caaaatattg tgttaattgt taattgcaat 84600
tgttaattca atcctaaatt attaaaaact atttaaatca ataaaatgaa aaagaaataa 84660
taaaaaaaaa taaaaaataa aactgaactt aaactaaatt gtaataaaaa taaatattaa 84720
tattaaaata aatatacaaa agataaaaac acattacaaa atattaagta ataaaatgac 84780
taaatctgaa ataaaataat gtttgaaatg tttacataac taaaactatt ccaaatcgtt 84840
tttgattgtt aaatgctatt gttaatttaa tcctaaatta tgaataataa tttaaataaa 84900
taaaatgaaa ataaataaaa ataaaactgc gctaaaaata aatagtaatg ttaaaagaac 84960
taatacaaat cacaaaggca cataacaaat taactaaagt taaaatgtaa actgaaaatc 85020
taaaaataaa agtgaaagct ttaaaaaggt gactagctaa aataaaataa gatttaagtt 85080
gaagtattaa aattactaaa aactcaaact gaaataaaat aaaagaaagc tacatcaaaa 85140
tataaaaaaa gatactaaca aaaatgccaa aggcacaaaa ttaaatcact aaataaaaac 85200
atcggtaatt cgatactata atagtatata aataacacta aaataacaca ggcagtgaat 85260
ttaaggtgga gactgactgc agttgcgtcc agcagtgatg ttggtttcat cttctccgtc 85320
cggacagtgg gacgtcccgt cacagagatg ctccagaggg atgcaaacac cagaagaagg 85380
acacggccat tccccgggat aacactcacg ctggacatta tctaaacaac cataaacaaa 85440
accaagatgc tttcatgtgg aaaaaaacgt gcaatatttg aaatatttag catgcagaag 85500
gaggttttag ggacgtaccg cagcctctct catccgcatt gtcctcgcag tcatcctctc 85560
catcacacag ccagacctga tgaatgcagc ggccgctggg acaggtgaag tacgttcctc 85620
tgcaggtggg gtaggctgta gagaatacgt tacagggaac attcacacca cttcatgcct 85680
tacattcatt atggctggat tgagaaactg aaacctactg cagttctgtt catcacttct 85740
gtctccacaa tcatcatcat gatcgcacac gtaactgcga ggaatacact ccccgttact 85800
gcactgaaac tgaccgctgc tgcaccgccg ggctgaaaac aaacaagaga tatgtttatc 85860
cacaacacaa tttcagaaag tttacataaa tctgaagatt actatacttg gtactgtatg 85920
tttccaccgc accaaagggt gcaaaaatga tgaatgtcaa aaagaattca gcataggata 85980
cgttataata aagcatagtc gagccagaaa tgagtttttg tcatcattta ttcaccctca 86040
tgttgttcca actctattat taatacattc aatataatat aaatataaaa caatgcaaaa 86100
gtcgtgtaag atgcaggtct ggagtaatta ataacagaat ttttgggtga actaaacctt 86160
ttaggctctc aaacataatt gtgtgcttgc ttttatttca aatatgtcaa atttggacac 86220
attgcaccat gattgaaata atcactgatg gcacatgcca ttggttttca tgtgttgtga 86280
cgaatcatga caaaaatgag actgtaaatg gcataatcag tcaacaacat tttaaatgtg 86340
ctttgttagt cacaagaaat tatgaaatat agtgtataaa tatagtatga aagtcacata 86400
tattacatgg atagtctatg ggattttgca agctatggtg tatttataaa aaacaacgac 86460
aacaacaaaa aagctttttt atctaatatt tacattttat tttatttgat ctgatttcaa 86520
taaacaatta ttttttatat atattaagct gttgataata accctgaaat atatatatat 86580
attactctac actcttaaaa ataatggtta caaaagggtt ttcacaatgt tgccatagaa 86640
taaatatttt tggttcccca aagaaagttt ttagtaaaca gttcttaaat taatcttttt 86700
tttttttttt ttgtataaag aatatttaaa aaatctatag aactttattt tacaccatta 86760
tacatttttt gtccaatgga aagttccgga tggaatcatg aaaaaaaaat aaaaaaaacg 86820
ttttttaaac gtgcaatcaa gaagacaaaa agtcacatga ctaaacattt tcggttcata 86880
atgaactgtt aaagttgcaa tagtaaaaag tagcgcagaa gtttcccagg acatccagta 86940
gacgactgtg tcaatgtgtg tgtcctgcca gagaaataca ctcactgcaa ttggcctcgt 87000
ctgagccgtc tcggcagttg agcacctggt cacagcgctg gcttctgtta taacacgccc 87060
cgttagcaca cctcagctcc gagcactctg ggtagactga gagaaagaga gcaggtccat 87120
cactcactcg ttcaattaaa tagaatcacc ctgtaatatg aaacagggtt ccactggggt 87180
acatcatcga agcgagtgat cttaacatgt tagcaggaaa acagcataaa gctaatgtag 87240
aaaatcaaac acaaacgtct gcagccaaaa gagcattatg tgcacagcag gtctgttcga 87300
ttaaagcagc tcagtgatga actgacactc gcactaatcc acagaggaat ctgcaatcaa 87360
ctctgacaaa caagaagcat attgtggcca tttctgtcca tcagcgagac tcttaatctt 87420
acaaattgat taaaatgcca aaggagcctt cgaggggaag gaaacacaca caacatacac 87480
acacacacac tatataggat ttggtgatat atcaaatatg aatcatttat tcacaattta 87540
ttgctatgca ctttatgata ttaaaatgtt gagtaactag tgcaaatatg agagaaatac 87600
atgaaattca tgaaatgcca tgcaacgtcg tgaccgttct gcatccttaa tccaatgcat 87660
ttccgcatga aataggatat ttcaggaacg aggacacgag ttgaaccact ggaaaccgga 87720
tggatggtaa aagcatacca gaatggagtc aggtggttaa aggagaaggc ttgcaatgtg 87780
tttaatgtga gtttataaag taaatgggcg ctatttagat ttcacttgga catctgattt 87840
aagctcaatt atggaaaatg gcaaaataac ttggttgatt tgactcaaac tagttcaaac 87900
taattcacag ttgatttaaa agtaatagtt ttttttgtag taattgttgc aaaaaaatta 87960
caattattaa acagcaatta catatgtatt tgttcattta ataatttttt tagcttttac 88020
taggtttcct ttagtattct tcagatacta ttaaaactat acaatatata tatatatatt 88080
tagctgtatt gtccagctct aatcccacca cacacatacc gacagcatac ttacggcagt 88140
ttctctcatc tgcgccgtct agacagtcag gcacacggtc gcacctgtat cctcccggga 88200
tgcatgcccc gttcgtgcag gtgaactgtg aactggagca cgtcctgcca gctgccatca 88260
aagatcaaag acacatgatt ataatatgag agtgctgggt ggtcctcatc ttcactaaag 88320
tggtatgctc tctctacaac atatgcacgc attgcctttt gtttttcgga ctgttgcctg 88380
cacatctgag atattgtgtt tctccgacat gatggaaatt gcatccccgt acattactga 88440
gagtcaatgg agctctctgc tctgacggac agtcgtaatt gaaatggaag cagaaagcga 88500
gggagagaag gatggatgga gataaggtgt actcacggca gtgctgcctt tcatcggagc 88560
catcctcgca atcctcttca tcatcacaga cccagtgctg aggaatgcac tcgccgtcac 88620
tcaagcactg gaactgagcg ctctcgcagg tgggctgggc tggaggagcg caatgagacg 88680
ccattgttaa ctaagaacag atctgagctg gctccacgat acagtgaaat aaacatgcca 88740
cattaccttt tggaaaacgt ttttctttca ctttatttta gggtccaatt ctcactatta 88800
acttaccata aactatgact tttgccacaa ttaactcctt atttgctgct tattaatagt 88860
ttataaggta gttgttaagt ttagggtaat tgggtaggat tatggatgtc atgcattata 88920
tgtactttat aagcactaat aaacagccaa tatgttaata atagacatgc taataagcaa 88980
ctacttaata gtgtgaattg gaccctatac taaagtgtta cccataattt tggaaaatta 89040
gttctttttt ttgcagatga actcttcggg tcctatttta accatctaag tgcatggtcg 89100
acgcaccccg gcgcatggtc taaacaggtt gttcctattc tcttaatgag gaatgggtgt 89160
ttttcaggcg taatgtgcaa taaaccaatc agagtctcat ctcccatccc ctttaaaagc 89220
cagttgtgat catagcattc cgggtctgct atttacatgg cggaatgtaa tttttcattt 89280
ttaataatgt ttgtgtggtg ctgtacgtca ttgtgtgtgt aataagcaaa gtgtatgtgc 89340
tttgtgcacc agccatacag gcgcatatta cttaatacgc tatttaaaca gcgaagaaaa 89400
aacattgcgc cagactttag accaggtttg tgttggtctg tggcacagtc tatgttcagt 89460
ccctcaaaat agcaatgcgc caacaatgca cttgaacaca cctctttttt agaccagcac 89520
gcccctgagc acacaaatgg gtgcagaaca ggtgagaact gtgtcagact gaaactagca 89580
aaaacacttg cacccagttt atgatagggc tctttacatg tagtacgcta gacttcgtat 89640
ttttgtcttc tttccagtaa aacatctaaa cattcttaaa tttactttca tattgtgtct 89700
tgttttaaaa actaattaag tgactttttg gttaaaaaca acaaaaatat ttcagtgagc 89760
tgtgtaataa aacttagggg aaaacaagat tattttttac accacagatt tttaagcaaa 89820
aacccattta atttggattg gatggcggat ctgtgatagt catcttacta gatttgtagg 89880
ttttcacccc tcagaactct tttggtcttt gccagactag attttgagcc atgacttcaa 89940
gagatttgct gtccatagaa ggctagtgag atttacacta gaacttgatt tcatttctta 90000
ttttcttatg ctctgtatga atgtgtgaaa ccagccctaa aactggtaag aaactagttt 90060
tttgaatccc taaatggtgt ataaaacaag acttttaagt taaaaaagta atgttagcaa 90120
tgctaactat gttattttta aagtcaacac ttactatgct aaaaccctat ttgaaatacc 90180
tgagagaaga catggctagt ttctgctaat gttgttctgg gttatggatt acaaactgaa 90240
cagcagtatt tcacacttgc cactctttgt cattgtgagc ttgtgactgt ggatgaagag 90300
ttacttacgg cagccaattt cgtctgagtt atccgaacag tcggtcgttc catcgcacct 90360
caggcttgct ggaatacaac ggcctgtccc acagcggaaa gaccccgctt cacaacctga 90420
acctacaca 90429
<210> 2
<211> 4451
<212> PRT
<213> Carassius auratus
<400> 2
Cys Val Gly Ser Gly Cys Glu Ala Gly Ser Phe Arg Cys Gly Thr Gly
1 5 10 15
Arg Cys Ile Pro Ala Ser Leu Arg Cys Asp Gly Thr Thr Asp Cys Ser
20 25 30
Asp Asn Ser Asp Glu Ile Gly Cys Pro Gln Pro Thr Cys Glu Ser Ala
35 40 45
Gln Phe Gln Cys Leu Ser Asp Gly Glu Cys Ile Pro Gln His Trp Val
50 55 60
Cys Asp Asp Glu Glu Asp Cys Glu Asp Gly Ser Asp Glu Arg Gln His
65 70 75 80
Cys Pro Gly Arg Thr Cys Ser Ser Ser Gln Phe Thr Cys Thr Asn Gly
85 90 95
Ala Cys Ile Pro Gly Gly Tyr Arg Cys Asp Arg Val Pro Asp Cys Leu
100 105 110
Asp Gly Ala Asp Glu Arg Asn Cys Leu Tyr Pro Glu Cys Ser Glu Leu
115 120 125
Arg Cys Ala Asn Gly Ala Cys Tyr Asn Arg Ser Gln Arg Cys Asp Gln
130 135 140
Val Leu Asn Cys Arg Asp Gly Ser Asp Glu Ala Asn Cys Thr Arg Arg
145 150 155 160
Cys Ser Ser Gly Gln Phe Gln Cys Ser Asn Gly Glu Cys Ile Pro Arg
165 170 175
Ser Tyr Val Cys Asp His Asp Asp Asp Cys Gly Asp Arg Ser Asp Glu
180 185 190
Gln Asn Cys Thr Tyr Pro Thr Cys Arg Gly Thr Tyr Phe Thr Cys Pro
195 200 205
Ser Gly Arg Cys Ile His Gln Val Trp Leu Cys Asp Gly Glu Asp Asp
210 215 220
Cys Glu Asp Asn Ala Asp Glu Arg Gly Cys Asp Asn Val Gln Arg Glu
225 230 235 240
Cys Tyr Pro Gly Glu Trp Pro Cys Pro Ser Ser Gly Val Cys Ile Pro
245 250 255
Leu Glu His Leu Cys Asp Gly Thr Ser His Cys Pro Asp Gly Glu Asp
260 265 270
Glu Thr Asn Ile Thr Ala Gly Arg Asn Cys Ser Ile Trp Arg Cys Ala
275 280 285
Ser Leu Ser Cys Glu His His Cys His Ser Ser Pro Ala Gly Gly Thr
290 295 300
Cys Ser Cys Pro Leu Gly Tyr Met Val Ser Arg Asn Asp Ser Arg Ser
305 310 315 320
Cys Ile Asp Tyr Asp Asp Cys Ser Leu Trp Gly Met Cys Asp Gln Leu
325 330 335
Cys Glu Asp Arg Thr Gly Ser His Arg Cys Ser Cys Arg Asp Gly Tyr
340 345 350
Leu Leu Glu Gln His Arg Tyr Cys Arg Ala Asn Pro Ser Ser Gly Val
355 360 365
Pro Ser Leu Ile Phe Ser Asn Gly Arg Asp Leu Leu Ile Ala Asp Val
370 375 380
His Gly His Asn Ala Arg Thr Leu Val Gln Ser Gln Asn Arg Gly Val
385 390 395 400
Ala Val Gly Val Asp Phe His Phe Gln Leu Gln Arg Val Phe Trp Thr
405 410 415
Asp Thr Ile Gln Asn Lys Val Phe Ser Val Asp Met Asp Gly Ser His
420 425 430
Leu Gln Val Val Leu Asn Val Ser Val Asp Tyr Pro Glu Asn Leu Ala
435 440 445
Val Asp Trp Val Asn Asn Lys Leu Tyr Val Val Glu Ala Ser Val Asn
450 455 460
Arg Ile Asp Met Val Asp Leu Asp Gly Ser Asn Arg Val Thr Leu Ile
465 470 475 480
Ala Glu His Leu Gly Asn Pro Arg Gly Leu Ala Leu Asp Pro Thr Val
485 490 495
Gly Tyr Leu Phe Phe Ser Asp Trp Asp Thr Leu Asn Gly Glu Pro Gly
500 505 510
Leu Glu Arg Ala Phe Met Asp Gly Thr Asn Arg Tyr Gly Ile Ile Arg
515 520 525
Thr Lys Leu Gly Trp Pro Ala Gly Ile Thr Leu Asp Leu Glu Ala Lys
530 535 540
Arg Val Tyr Trp Val Asp Ser Arg Tyr Asp Tyr Ile Glu Thr Thr Thr
545 550 555 560
Tyr Asp Gly Leu His Arg Lys Thr Val Val His Gly Gly Ser Val Ile
565 570 575
Pro His Pro Phe Gly Ile Thr Leu Phe Glu His Ser Val Tyr Tyr Thr
580 585 590
Asp Trp Thr Lys Met Ala Val Met Lys Ala Asn Lys Tyr Ser Asp Asn
595 600 605
Ser Pro Gln Glu Leu Tyr Arg Thr Ser Gln Arg Pro His Gly Leu Thr
610 615 620
Val Val His Ala Tyr Arg Gln Pro Phe Val Ser Asn Pro Cys Gly Thr
625 630 635 640
Asn Arg Gly Gly Cys Glu His Ile Cys Val Leu Ser His Arg Thr Asp
645 650 655
Asn Gly Gly Leu Gly Tyr Arg Cys Arg Cys Arg Met Gly Phe Asp Leu
660 665 670
His Ala Asp Gly Lys Arg Cys Met Ser Val Arg Gln Phe Leu Leu Phe
675 680 685
Ser Ser Gln Leu Ala Val Arg Gly Ile Pro Phe Asn Leu Ser Thr Gln
690 695 700
Glu Asp Ile Ile Leu Pro Ile Thr Gly Thr Pro Ser Tyr Phe Val Gly
705 710 715 720
Val Asp Phe Ser Ala Ala Asp Asn Ser Ile Phe Phe Ser Asp Thr Val
725 730 735
Lys Asp Ile Ile Tyr Lys Gln Lys Ile Asp Gly Thr Gly Arg Glu Val
740 745 750
Leu Ala Ala Asn Arg Val Asp Gly Ala Glu Asp Leu Ala Tyr Asp Trp
755 760 765
Ile Ser Lys Asn Leu Tyr Trp Thr Asp Pro Arg Tyr Arg Ser Ile Ser
770 775 780
Val Met Lys Val Ala Asp Lys Thr Arg Arg Ala Ile Val Arg Asn Leu
785 790 795 800
Asn Asn Pro Arg Ser Ile Val Val His Pro Val Ala Gly Tyr Ile Phe
805 810 815
Trp Thr Asp Trp Tyr Arg Pro Ala Lys Ile Met Arg Ala Trp Gly Asp
820 825 830
Gly Ser His Ala Gln Ser Ile Val Asn Thr Thr Leu Gly Trp Pro Asn
835 840 845
Gly Leu Ala Ile Asp Trp Ser Ala Met Arg Leu Phe Trp Val Asp Ala
850 855 860
Phe Phe Asp Lys Ile Glu His Ser Asn Phe Asp Gly Gln Asn Arg Leu
865 870 875 880
Ser Leu Asp Arg Ile Thr Gln Ile Ser His Pro Phe Gly Leu Thr Val
885 890 895
Phe Gly Gly Tyr Val Tyr Phe Thr Asp Trp Arg Leu Gly Gly Ile Val
900 905 910
Arg Val Arg Lys Thr Asp Gly Gly Glu Met Met Ile Ile Arg Arg Gly
915 920 925
Ile Ser His Ile Met His Val Lys Ser Phe Asn Ala Asp Ser Gln Ile
930 935 940
Gly Ser Asn Phe Cys Asn Arg Gln Thr Asn Pro Asn Gly Asp Cys Ser
945 950 955 960
His Phe Cys Phe Pro Ala Pro Tyr Ser Gln Arg Val Cys Gly Cys Pro
965 970 975
Tyr Gly Met Lys Leu Glu Ala Asn Gln Gln Thr Cys Val Asp Asp Pro
980 985 990
Ser Asn Glu Pro Pro Thr Leu Gln Cys Gly Ser Asn Ser Phe Ser Cys
995 1000 1005
Thr Asn Gly Lys Cys Val Pro Gln Ser Tyr Gln Cys Asp Ser Val Asp
1010 1015 1020
Asp Cys His Asp Asn Ser Asp Glu Ala Asn Cys Gly Ala Asn Asn Asn
1025 1030 1035 1040
Thr Cys Ser Pro Ile Ala Phe Thr Cys Ala Asn Gln Arg Cys Val Pro
1045 1050 1055
Arg Ser Trp His Cys Asp Gly His Asn Asp Cys Phe Asp Gly Ser Asp
1060 1065 1070
Glu Arg Asp Cys Pro Thr Gln Thr Pro Gly Thr Cys Gln Ala Asp Gln
1075 1080 1085
Phe Ser Cys Ala Asn His His Cys Ile Pro Arg Thr Trp Leu Cys Asp
1090 1095 1100
Thr Asp Asn Asp Cys Gly Asp Gly Ser Asp Glu Asn Asn Cys Asp Ser
1105 1110 1115 1120
Met Gly Thr Cys His Pro Asp Gln Phe Gln Cys Pro Asp His Arg Cys
1125 1130 1135
Ile Asp Pro Asn Tyr Val Cys Asp Gly Asp Arg Asp Cys Ala Asp Gly
1140 1145 1150
Ala Asp Glu Gln Gly Cys Val Tyr Asn Cys Thr Ala Tyr Glu Phe Lys
1155 1160 1165
Cys Ala Asn Gly His Gln Cys Val Asn Ser Tyr Tyr Arg Cys Asp Gly
1170 1175 1180
Val Phe Asp Cys Asn Asp Arg Ser Asp Glu Ser Gly Cys Pro Thr Arg
1185 1190 1195 1200
Pro Pro Gly Met Cys His His Gln Ser Glu Phe Gln Cys Gln Ser Asp
1205 1210 1215
Gly Ser Cys Val Pro Ser Asn Trp Glu Cys Asp Gly His Pro Asp Cys
1220 1225 1230
Glu Asp Gly Ser Asp Glu His His Ala Cys Pro Pro Arg Thr Cys Pro
1235 1240 1245
Thr Ala Gln Phe Arg Cys Asp Asn Gly Asn Cys Val Phe Arg Gly Trp
1250 1255 1260
Ile Cys Asp Gly Asp Asn Asp Cys Arg Asp Gly Ser Asp Glu Arg Asp
1265 1270 1275 1280
Cys Pro Thr Pro Pro Phe Arg Cys Pro Ser Trp Gln Trp Gln Cys Pro
1285 1290 1295
Gly His Ser Val Cys Val Asn Leu Ser Arg Val Cys Asp Asn Thr Pro
1300 1305 1310
Asp Cys Pro Asn Gly Ala Asp Glu Ser Pro Leu Cys Asn Gln Glu Ser
1315 1320 1325
Cys Ser Asp Asn Asn Ala Gly Cys Thr His Gly Cys Ile Gln Gly Pro
1330 1335 1340
Phe Gly Ala Gln Cys Thr Cys Pro Ile Gly Phe Gln Leu Ser Asn Asp
1345 1350 1355 1360
Ser Lys Thr Cys Glu Asp Ile Asp Glu Cys Arg Pro Pro Gly Ile Cys
1365 1370 1375
Ser Gln His Cys Phe Asn Glu Arg Gly Ser Phe Arg Cys His Cys Gln
1380 1385 1390
Glu Gly Tyr Thr Leu Glu Ala Asp Gly Arg Thr Cys Lys Ala Ser Gly
1395 1400 1405
Ser Arg Glu Ala Leu Leu Leu Val Ala Ser Arg Asn Gln Ile Val Ser
1410 1415 1420
Asp Asp Ile Leu Ala Gln Pro Asn Ile Ile Arg Ser Leu Val Arg Asp
1425 1430 1435 1440
Gly Arg Asn Ile Val Ala Val Asp Phe Asp Ser Val Thr Asp Arg Ile
1445 1450 1455
Tyr Trp Ser Asp Thr Thr Gln Asp Arg Ile Trp Ser Ala His Lys Asn
1460 1465 1470
Gly Thr Asp Arg Thr Leu Ile Phe Asp Ser Gly Val Thr Val Thr Glu
1475 1480 1485
Ser Leu Ala Val Asp Trp Val Gly Arg Asn Leu Tyr Trp Thr Asp Tyr
1490 1495 1500
Ile Leu Glu Thr Ile Glu Val Ser Lys Leu Asp Gly Ser His Arg Val
1505 1510 1515 1520
Val Leu Ile Ser Glu Asn Val Thr Asn Pro Arg Gly Leu Val Val Asp
1525 1530 1535
Pro Arg Asn Asn Ser His Leu Met Phe Trp Thr Asp Trp Gly Arg Asn
1540 1545 1550
Pro Arg Ile Glu Arg Ala Ser Met Asp Gly Lys Leu Arg Thr Thr Ile
1555 1560 1565
Ile Ser Ser Lys Leu Tyr Trp Pro Asn Gly Leu Thr Ile Asp Tyr Pro
1570 1575 1580
Asn Asn Leu Leu Tyr Phe Ala Asp Ala Tyr Leu Asp Phe Ile Asp Tyr
1585 1590 1595 1600
Cys Asp Tyr Asp Gly Lys Asn Arg Lys Gln Val Leu Ala Ser Asp Leu
1605 1610 1615
Val Leu Gln His Pro His Ala Ile Thr Ile Phe Glu Asp Phe Val Tyr
1620 1625 1630
Trp Thr Asp Arg Tyr Val Asn Arg Val Ile Arg Ala Asn Lys Trp His
1635 1640 1645
Gly Gln Asn Gln Thr Val Met Leu Tyr Asn Leu Pro Gln Pro Met Gly
1650 1655 1660
Leu Val Ala Leu His Pro Ala Arg Gln Pro Ala Gly Tyr Asn Pro Cys
1665 1670 1675 1680
Asp Pro Arg Ser Ser Pro Cys Thr His Ile Cys Leu Leu Ser Ala Val
1685 1690 1695
Gly Pro Arg Phe Tyr Ser Cys Ala Cys Pro Ser Gly Trp Thr Leu Ala
1700 1705 1710
Ala Asp Gln Phe Thr Cys Ala Arg Val Glu Asp Pro Phe Leu Val Val
1715 1720 1725
Val Arg Asp Ser Ile Ile Tyr Gly Ile Pro Leu Asn Pro Asn Asp Lys
1730 1735 1740
Ser Asn Asp Ala Met Val Pro Val Ala Gly Leu Leu Asn Gly Tyr Asp
1745 1750 1755 1760
Val Asp Phe Asp Asp Ala Glu Gln Met Ile Tyr Trp Val Glu His Pro
1765 1770 1775
Gly Glu Ile His Arg Val Lys Ser Asp Gly Thr Asn Arg Thr Glu Phe
1780 1785 1790
Ala Pro Ala Ala Ile Leu Gly Ser Pro Val Gly Leu Ala Leu Asp Trp
1795 1800 1805
Met Ser Gln Asn Leu Tyr Tyr Ser Asn Pro Ala Ser Gln Ser Ile Glu
1810 1815 1820
Val Leu Lys Leu Lys Gly Glu Val Gln Tyr Arg Lys Thr Leu Ile Thr
1825 1830 1835 1840
Asn Asn Gly Ser Pro Thr Gly Ala Gly Ala Pro Ala Gly Ile Ala Val
1845 1850 1855
Asp Pro Ala Arg Gly Lys Met Tyr Trp Thr Asp Gln Gly Thr Glu Ser
1860 1865 1870
Gly Ile Pro Ala Lys Val Ala Ser Ala Asp Met Asp Gly Ser Asn Ala
1875 1880 1885
Ala Ile Leu Phe Thr His Asn Leu Glu His Val Glu Phe Ile Thr Ile
1890 1895 1900
Asp Ile Arg Glu Asn Lys Leu Tyr Trp Ala Val Thr Gly Thr Gly Val
1905 1910 1915 1920
Ile Glu Arg Gly Asp Pro Asp Gly Ser Asn Arg Ile Thr Met Val Asn
1925 1930 1935
Gly Leu Ser His Pro Trp Gly Val Ala Val Tyr Asp Ser Phe Leu Tyr
1940 1945 1950
Phe Thr Asp Arg Asp Phe Glu Val Ile Glu Arg Val Asp Lys Ala Thr
1955 1960 1965
Gly Leu Asn Arg Val Val Met Arg Asp Asn Val Ala Gly Leu Arg Val
1970 1975 1980
Leu Lys Val His His Arg Asp Ser Ser Ala Gly Ser Ser Asn Gly Cys
1985 1990 1995 2000
Ser Asn Asn Met Gly Thr Cys Gln Gln Leu Cys Leu Pro Arg Pro Gly
2005 2010 2015
Gly Leu Phe Ser Cys Ala Cys Ala Thr Gly Phe Lys Leu Ser Ala Asp
2020 2025 2030
Asn Arg Thr Cys Ser Pro Tyr Gln Ser Tyr Val Val Ile Ser Met Leu
2035 2040 2045
Thr Ala Ile Lys Gly Phe Ser Leu Glu Gly Ala Asp His Ser Glu Ser
2050 2055 2060
Met Val Pro Val Ala Gly Arg Gly Arg Asn Ala Leu His Val Asp Val
2065 2070 2075 2080
His Met Ser Ser Gly Phe Ile Tyr Trp Cys Asp Phe Ser Ser Thr Val
2085 2090 2095
Ala Ser Gln Asn Gly Ile Arg Arg Ile Lys Pro Asp Gly Ser Gly Phe
2100 2105 2110
Arg Ser Ile Val Thr Ser Gly Ile Gly Arg Asn Gly Ile Arg Gly Ile
2115 2120 2125
Ala Val Asp Trp Ala Ala Gly Asn Leu Tyr Phe Thr Asn Ala Phe Leu
2130 2135 2140
Thr Glu Thr Tyr Ile Glu Val Ile Arg Leu Asn Thr Thr Phe Arg Arg
2145 2150 2155 2160
Val Leu Leu Lys Thr Gln Val Asp Met Pro Arg His Ile Val Val Asp
2165 2170 2175
Pro Met Asn Arg Tyr Leu Phe Trp Ala Asp Tyr Gly Gln Thr Pro Lys
2180 2185 2190
Ile Glu Arg Ala Phe Leu Asp Gly Ser Asn Arg Thr Val Leu Val Ser
2195 2200 2205
Ser Gly Ile Val Thr Pro Arg Gly Leu Ala Leu Asp His Arg Asp Gly
2210 2215 2220
Tyr Ile Tyr Trp Val Asp Asp Ser Leu Asp Met Ile Ala Arg Val Arg
2225 2230 2235 2240
Pro Asp Gly Gly Glu Thr Glu Val Val Arg Tyr Gly Ser Arg Tyr Pro
2245 2250 2255
Thr Pro Tyr Gly Val Thr Val Phe Glu Gly Asn Val Ile Trp Val Asp
2260 2265 2270
Arg Asn Leu Lys Lys Val Phe Gln Ala Ser Lys Gln Pro Gly Ala Thr
2275 2280 2285
Asp Gln Pro Leu Val Ile Arg Asp Asn Ile Asn Met Leu Arg Asp Val
2290 2295 2300
Thr Ile Phe Asp Arg Arg Met Gln Pro Ser Ser Ala His Glu Leu Asn
2305 2310 2315 2320
Asn Asn Pro Cys Leu Glu Ser Asn Gly Gly Cys Ala His Phe Cys Phe
2325 2330 2335
Ala Ile Pro Gly Ser Gln Thr Arg Lys Cys Ser Cys Ala Phe Gly Asn
2340 2345 2350
Leu Ala Ala Asp Ser Ser Ser Cys Val Val Ser Arg Asp Asp Tyr Leu
2355 2360 2365
Ile Tyr Thr Thr Glu Ser Thr Val Arg Ser Leu Arg Leu Asp Pro Glu
2370 2375 2380
Asp His Ser Leu Pro Phe Pro Val Val Asn Val Pro Arg Thr Ser Val
2385 2390 2395 2400
Ala Leu Asp Phe Asp Arg Leu Asp Gly Arg Ile Tyr Phe Thr Gln Ser
2405 2410 2415
Ser Gly Val Gly Gln Ser Lys Ile Ser Phe Ile Thr Leu Ala Ser Pro
2420 2425 2430
Thr Ser Pro Ala Thr Glu Val Ala Ser Asp Leu Gly Ala Pro Asp Gly
2435 2440 2445
Ile Ala Tyr Asp Trp Ile Asn Lys Arg Ile Tyr Tyr Ser Asp Tyr Ile
2450 2455 2460
Asn Gln Ser Ile Ser Ser Met Ala Val Asp Gly Ser Gln Arg Thr Val
2465 2470 2475 2480
Val Ala Gln Val Pro Arg Pro Arg Ala Ile Met Leu Asp Pro Cys Arg
2485 2490 2495
Gly Tyr Met Tyr Trp Thr Asp Trp Gly Thr Ser Ala Lys Ile Glu Arg
2500 2505 2510
Ala Thr Leu Gly Gly Asn Phe Arg Thr Glu Ile Val Asn Thr Ser Leu
2515 2520 2525
Val Trp Pro Asn Gly Leu Thr Leu Asp Tyr Asp Glu Gln Arg Leu Tyr
2530 2535 2540
Trp Ala Asp Ala Ser Leu Gln Lys Ile Glu Arg Cys Ser Leu Thr Gly
2545 2550 2555 2560
Thr Asn Arg Glu Val Ile Val Ser Thr Ala Ile Tyr Pro Phe Ala Met
2565 2570 2575
Thr Val Tyr Gly Gln His Ile Tyr Trp Thr Asp Trp Asn Thr Arg Ser
2580 2585 2590
Ile Tyr Arg Ala Asn Lys His Asp Gly Ser Asp Gln Arg Val Met Leu
2595 2600 2605
Gln Asn Leu Pro Ser Arg Pro Met Asp Ile His Val Leu Ser Asn Ser
2610 2615 2620
Lys Gln Gln Gln Cys Ser Ser Pro Cys Glu Gln Phe Asn Gly Gly Cys
2625 2630 2635 2640
Ser His Ile Cys Ala Pro Gly Pro Gln Gly Ala Glu Cys Gln Cys Pro
2645 2650 2655
Ser Glu Gly Arg Trp Tyr Leu Ala Asp Asn Lys His Cys Ile Pro Asp
2660 2665 2670
Asn Gly Thr Arg Cys Gln Ser Gly Gln Phe Thr Cys Met Asn Gly Arg
2675 2680 2685
Cys Ile Arg Ala Gln Trp Lys Cys Asp Asn Asp Asp Asp Cys Gly Asp
2690 2695 2700
Gly Ser Asp Glu Leu Glu Arg Val Cys Ala Phe His Thr Cys Glu Pro
2705 2710 2715 2720
Thr Val Phe Thr Cys Gly Asn Gly Arg Cys Val Pro Tyr His Tyr Arg
2725 2730 2735
Cys Asp His Tyr Asn Asp Cys Gly Asp Asn Ser Asp Glu Thr Gly Cys
2740 2745 2750
Ile Phe Arg Pro Cys Asp Pro Asn Thr Glu Phe Thr Cys Asn Asn Gly
2755 2760 2765
Arg Cys Ile Ala Arg Glu Tyr Val Cys Asn Gly Met Asn Asn Cys Tyr
2770 2775 2780
Asp Asn Gly Thr Ser Asp Glu Gln Asn Cys Ala Glu Arg Thr Cys Gln
2785 2790 2795 2800
Pro Glu His Thr Lys Cys Gln Thr Thr Asn Ile Cys Ile Pro Arg Ser
2805 2810 2815
Tyr Leu Cys Asp Gly Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Ser
2820 2825 2830
Pro Thr His Cys Ala Thr Ser Thr Cys Ser Gln Asn Glu Phe Arg Cys
2835 2840 2845
Ser Ser Gly Arg Cys Ile Pro Gly His Trp Tyr Cys Asp Gly Gly Thr
2850 2855 2860
Asp Cys Ser Asp Gly Ser Asp Glu Pro Val Thr Cys Thr Thr Val Val
2865 2870 2875 2880
Arg Thr Cys Asn Ser Asp Gln Phe Arg Cys Asp Asp Gly Arg Cys Ile
2885 2890 2895
Ala Ser Ser Trp Ile Cys Asp Gly Asp Asn Asp Cys Gly Asp Met Ser
2900 2905 2910
Asp Glu Asp Glu Arg His Ser Cys Ala Asn Arg Thr Cys Phe Pro Gln
2915 2920 2925
Glu Phe Thr Cys Ile Asn Asn Arg Pro Pro Gln Arg Lys Cys Ile Pro
2930 2935 2940
Arg Asp Trp Val Cys Asp Gly Asp Ala Asp Cys Ser Asp Ala Tyr Asp
2945 2950 2955 2960
Glu His Gln Asn Cys Thr Arg Arg Ser Cys Thr Ala Asn Glu Phe Thr
2965 2970 2975
Cys Asn Asn Gly Leu Cys Ile Arg Asn Ser Tyr Arg Cys Asp Arg Arg
2980 2985 2990
Asn Asp Cys Gly Asp Ser Ser Asp Glu Gln Gly Cys Thr Tyr Gln Pro
2995 3000 3005
Cys Gln Gln His Gln Phe Thr Cys Gln Asn Gly Arg Cys Val Ser Gln
3010 3015 3020
Asp Phe Val Cys Asp Gly Asp Asn Asp Cys Gly Asp Glu Ser Asp Glu
3025 3030 3035 3040
Leu Asp His Leu Cys Arg Thr Pro Ala Pro Thr Cys Pro Pro Gly Asn
3045 3050 3055
Phe Arg Cys Asp Asn Gly Asn Cys Ile Pro Leu Ser Glu Val Cys Asp
3060 3065 3070
Arg Asn Asp Asp Cys Asn Asp Asn Ser Asp Glu Lys Gly Cys Gly Ile
3075 3080 3085
Asn Glu Cys Thr Asp Pro Ser Met His His Cys Asp His Asn Cys Thr
3090 3095 3100
Asp Thr Pro Thr Ser Phe Ile Cys Thr Cys Arg Pro Gly Phe Arg Leu
3105 3110 3115 3120
Met Ser Asp Asn Lys Thr Cys Asp Asp Val Asp Glu Cys Ser Val Thr
3125 3130 3135
Pro Ser Val Cys Ser Gln Val Cys Glu Asn Thr Met Gly Ser Tyr Val
3140 3145 3150
Cys Lys Cys Ala Pro Gly Phe Leu Arg Glu Pro Asp Gly Arg Ser Cys
3155 3160 3165
Arg Gln Asn Ser Asn Ile Ser Pro Tyr Leu Ile Phe Ser Asn Arg Tyr
3170 3175 3180
Tyr Leu Arg Asn Leu Ser Thr Asp Gly Glu Ala Tyr Ser Leu Ile Leu
3185 3190 3195 3200
Gln Gly Leu Thr Ser Val Val Ala Leu Asp Phe Asp Arg Val Asp Arg
3205 3210 3215
Arg Leu Tyr Trp Ile Asp Val Ser Arg Arg Val Leu Glu Arg Met Phe
3220 3225 3230
Phe Asn Gly Thr Gly Arg Glu Val Val Val Asn Gly Ile Leu His Gly
3235 3240 3245
Glu Gly Leu Ala Val Asp Trp Val Gly Arg Lys Leu Tyr Trp Val Asp
3250 3255 3260
Ser Phe Leu Asp Cys Met Lys Val Ser Glu Leu Asp Gly Arg Phe Val
3265 3270 3275 3280
Arg Lys Leu Ala Glu His Cys Val Asp Ala Asn Asn Thr Tyr Cys Phe
3285 3290 3295
Glu Asn Pro Arg Ala Ile Val Leu His Pro Lys Phe Gly Tyr Val Tyr
3300 3305 3310
Trp Thr Asp Trp Gly Asp Lys Ala Phe Ile Gly Arg Val Gly Met Asp
3315 3320 3325
Gly Asn Asn Lys Ser Ala Ile Ile Thr Thr Lys Ile Glu Trp Pro Asn
3330 3335 3340
Gly Leu Thr Ile Asp Tyr Thr Asn Asp Lys Leu Tyr Trp Ser Asp Ala
3345 3350 3355 3360
His Leu Asn Tyr Ile Glu Phe Ser Asp Leu Asp Gly Asn His Arg His
3365 3370 3375
Thr Val Tyr Asp Gly Val Leu Pro His Pro Phe Ala Ile Thr Val Phe
3380 3385 3390
Glu Glu Ser Val Tyr Trp Thr Asp Trp Asn Thr Arg Thr Val Glu Lys
3395 3400 3405
Gly Asn Lys Tyr Asn Gly Ser Gly Arg Glu Ala Leu Val Asn Thr Thr
3410 3415 3420
His Arg Pro Phe Asp Ile His Val Cys His Pro Tyr Arg Gln Pro Ile
3425 3430 3435 3440
Val Thr Asn Pro Cys Ala Val Asn Asn Gly Gly Cys Ser His Leu Cys
3445 3450 3455
Leu Leu Arg Ala Gly Gly Gln Gly Phe Thr Cys Glu Cys Pro Asp His
3460 3465 3470
Phe Leu Thr Val Gln Ile Gly Gly Ala Ala Arg Cys Leu Pro Met Cys
3475 3480 3485
Ser Ser Thr Gln Tyr Arg Cys Ala Asp Asn Glu Arg Cys Ile Pro Ile
3490 3495 3500
Trp Trp Lys Cys Asp Gly Gln Arg Asp Cys Arg Asp Gly Ser Asp Glu
3505 3510 3515 3520
Pro Tyr Thr Cys Pro Val Arg His Cys Arg Leu Gly Gln Phe Gln Cys
3525 3530 3535
Asn Asp Gly Asn Cys Thr Ser Pro His Phe Leu Cys Asn Ser Asn Gln
3540 3545 3550
Asp Cys Pro Asp Gly Ser Asp Glu Asp Ala Val Leu Cys Ala Thr His
3555 3560 3565
Gln Cys Glu Ser His Gln Trp Gln Cys Ala Asn Lys Arg Cys Ile Ser
3570 3575 3580
Glu Ala Trp Gln Cys Asp Gly Glu Asn Asp Cys Gly Asp Gly Ser Asp
3585 3590 3595 3600
Glu Asp Pro Ala His Cys Ser Ser Arg Thr Cys Arg Pro Gly Gln Phe
3605 3610 3615
Lys Cys Arg Asn Gly Arg Cys Ile Pro Gln Ser Trp Lys Cys Asp Val
3620 3625 3630
Asp Asp Asp Cys Gly Asp Asn Ser Asp Glu Pro Ile Glu Glu Cys Met
3635 3640 3645
Gly Pro Ala Tyr Arg Cys Asp Asn His Thr Glu Phe Asp Cys Arg Thr
3650 3655 3660
Asn Tyr Arg Cys Val Pro Leu Trp Ala Val Cys Asn Gly His Asn Asp
3665 3670 3675 3680
Cys Arg Asp Asn Ser Asp Glu Gln Asn Cys Glu Glu Leu Thr Cys Glu
3685 3690 3695
Pro Ala Gly Asp Phe Arg Cys Asp Asn His Gln Cys Ile Pro Leu Arg
3700 3705 3710
Trp Arg Cys Asp Gly Asp Asn Asp Cys Gly Asp Gly Ser Asp Glu Arg
3715 3720 3725
Asn Cys Thr Pro Arg Pro Cys Thr Glu Ser Glu Tyr Arg Cys Asp Asn
3730 3735 3740
Leu His Cys Ile Pro Asp Arg Trp Val Cys Asp His Asp Asn Asp Cys
3745 3750 3755 3760
Glu Asp Asn Ser Asp Glu Arg Asp Cys Glu Leu Arg Thr Cys His Pro
3765 3770 3775
Gly Tyr Phe Gln Cys Gly Ser Gly His Cys Ile Ser Glu Arg Phe Lys
3780 3785 3790
Cys Asp Gly Asn Ala Asp Cys Leu Asp Phe Thr Asp Glu Ser Ser Cys
3795 3800 3805
Pro Thr Arg Tyr Pro Asn Ser Thr Tyr Cys Pro Pro Phe Leu Phe Glu
3810 3815 3820
Cys Lys Asn His Val Cys Val Gln Gln His Trp Ile Cys Asp Gly Asp
3825 3830 3835 3840
Asn Asp Cys Gly Asp Asn Ser Asp Glu Glu Leu His Leu Cys Leu Asp
3845 3850 3855
Ile Ser Cys Asp Pro Pro Phe Arg Phe Arg Cys Asp Asn Thr Arg Cys
3860 3865 3870
Ile Tyr Ser His Glu Leu Cys Asn Ser Ile Asp Asp Cys Gly Asp Gly
3875 3880 3885
Ser Asp Glu Arg Pro Glu His Cys Val Thr Pro Thr His Gly Pro Cys
3890 3895 3900
Thr Glu Asp Glu Tyr Lys Cys Gly Asn Gly Gln Cys Ile Pro Leu Gln
3905 3910 3915 3920
Tyr Ala Cys Asp Asp Tyr Asp Asp Cys Glu Asp Gln Ser Asp Glu Leu
3925 3930 3935
Gly Cys Tyr Tyr Gly His Gly Arg Thr Cys Ser Glu Asn Leu Cys Glu
3940 3945 3950
His Asn Cys Thr Asp Leu Ser Ala Gly Gly Phe Ile Cys Ser Cys Arg
3955 3960 3965
Pro Gly Tyr Lys Pro Asn Pro Glu Asp Lys Asn Ser Cys Asn Asp Val
3970 3975 3980
Asn Glu Cys Glu Val Tyr Gly Thr Cys Pro Gln Leu Cys Arg Asn Thr
3985 3990 3995 4000
Lys Gly Ser Tyr Glu Cys Phe Cys Ala Asp Gly Phe Arg Ser Val Gly
4005 4010 4015
Glu Gln Pro Gly Val Glu Cys Ala Ala Glu Gly Asn Pro Pro Val Leu
4020 4025 4030
Leu Leu Pro Asp Asn Val Arg Ile Arg Arg Phe Asn Leu Ser Ser Ala
4035 4040 4045
Gln Tyr Ser Asp Tyr Val Asp Asn Ala Glu His Ile Gln Ala Leu Asp
4050 4055 4060
Tyr Leu Trp Asp Pro Glu Gly Leu Gly Leu Ser Ile Val Tyr Trp Thr
4065 4070 4075 4080
Val Leu Gly Arg Gly Ser Glu Phe Gly Ala Ile Lys Arg Ala Tyr Met
4085 4090 4095
Thr Thr Phe Asp Asp His Gly Asn Asn Pro Val Lys Glu Val Asp Leu
4100 4105 4110
Asn Leu Arg Tyr Ile Ser Ser Pro Asp Gly Ile Ala Val Asp Trp Ile
4115 4120 4125
Gly Gly His Ile Tyr Trp Thr Asp Ala Gly Thr Asn Arg Ile Glu Val
4130 4135 4140
Ser Lys Leu Asp Gly Arg Tyr Arg Lys Trp Leu Ile His Ser Asp Leu
4145 4150 4155 4160
Asp Gln Pro Ala Ala Ile Val Val Asn Pro Gly Leu Gly Gln Met Phe
4165 4170 4175
Trp Thr Asp Trp Gly Arg Lys Pro Lys Ile Glu Thr Ala Trp Met Asp
4180 4185 4190
Gly Gln His Arg Glu Val Leu Leu Asp Glu Asp Leu Gly Trp Pro Thr
4195 4200 4205
Gly Leu Ala Leu Asp Tyr Leu Asn Glu Asn Arg Ile Tyr Trp Cys Asp
4210 4215 4220
Ser Lys Glu Asn Ile Ile Glu Ser Met Lys Ala Asp Gly Thr Asp Arg
4225 4230 4235 4240
Gln Met Ile Ile Ser Gly Asp Ile Gly His Pro Tyr Ser Leu Asp Val
4245 4250 4255
Phe Glu Gly His Val Tyr Trp Thr Thr Lys Glu Lys Gly Glu Val Trp
4260 4265 4270
Lys Lys Asp Lys Phe Gly Lys Gly Asp Lys Val Lys Val Leu Thr Ile
4275 4280 4285
Asn Pro Trp Leu Thr Gln Val Arg Ile Tyr Gln Gln His Arg His Asn
4290 4295 4300
His Ser Val Leu Asn Pro Cys Gln Gly Val Cys Ser His Leu Cys Leu
4305 4310 4315 4320
Leu Arg Pro Gly Gly Tyr Thr Cys Ala Cys Pro Gln Gly Ser Thr Leu
4325 4330 4335
Leu Thr Phe Asn Lys Asn Glu Cys Asp Ala Ala Ile Glu Ala Glu Val
4340 4345 4350
Ser Met Pro Leu Ala Cys Arg Cys Met Asn Gly Gly Thr Cys Tyr Thr
4355 4360 4365
Asp Glu Gly Gly Leu Pro Lys Cys Lys Cys Pro Tyr Gly Tyr Ser Gly
4370 4375 4380
Ser Phe Cys Glu Met Gly Arg Ser Arg Ala Ala Pro Ala Gly Thr Ala
4385 4390 4395 4400
Val Thr Val Leu Leu Ala Val Val Ile Ile Leu Ile Thr Gly Ala Leu
4405 4410 4415
Val Val Gly Val Phe Leu Asn Tyr Lys Arg Thr Gly Ser Leu Ile Pro
4420 4425 4430
Ser Met Pro Lys Leu Pro Ser Leu Ser Ser Leu Val Lys Ser Ala Asp
4435 4440 4445
Thr Gly Asn
4450
Claims (10)
1.一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,包括以下步骤:
S1.靶点设计:对百褶裙泰狮金鱼进行基因测序,获取调控龙睛性状的基因的DNA序列,并在DNA序列中的外显子上设计适合的敲除靶点,将该靶点的序列体外转录成gRNA;
所述调控龙睛性状的基因为lrp2a即低密度脂蛋白2基因,所述靶点有五个,五个靶点的序列分别为:
GGGTCTTTCCGCTGTGGGAC、GGAGCTGAGGTGTGCTAACG、GGATGACTGCGAGGACAATG、TCTTCTGGTGTTTGCATCCC和GGGACGTCCCACTGTCCGGA;
S2.显微注射:将百褶裙泰狮金鱼的鱼卵平铺在无菌培养皿中,添加适量的卵巢液,取CRISPR/Cas9蛋白和gRNA的mRNA各2μL并混合均匀,向鱼卵动物极注射混合后的mRNA,4μLmRNA共注射200-3000粒鱼卵;
S3.受精孵化:取雄性百褶裙泰狮金鱼精子加入注射操作完成后的培养皿中,等待鱼卵受精后将受精卵孵化,所得百褶裙泰狮金鱼即为具有龙睛性状的百褶裙泰狮金鱼。
2.如权利要求1所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,该方法还包括雌核生殖步骤,所述雌核生殖方法为:以S3中具有龙睛表型的百褶裙泰狮金鱼为亲本即F0代,取鲤鱼精子灭活,将灭活精液与F0代雌性百褶裙泰狮金鱼卵子混合受精后进行休克处理,经处理后的受精卵孵化即为具有龙睛性状的F1代纯合子百褶裙泰狮金鱼。
3.如权利要求2所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,该方法还包括高温扩群步骤,所述高温扩群方法为,在F1代纯合子出膜后早期,对部分雌鱼使用高温27-28℃饲养,使雌鱼性逆转为雄鱼,性逆转雌鱼和正常饲养的F1代雌鱼进行繁殖,批量得到纯合子子代百褶裙泰狮金鱼。
4.如权利要求1所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,所述步骤S1中,根据所得基因的DNA序列设计获取调控龙睛性状的基因的外显子和内含子,并用通配符GG[G/A]??????????????????GG或者CC??????????????????[T/C]CC在百褶裙泰狮金鱼lrp2a基因的外显子序列上设计敲除靶点;所述通配符中“?”代表一个任意碱基,“[]”代表与[]内任意一个碱基匹配,“/”代表“或”。
5.如权利要求1所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,所述步骤S1中,获得靶点序列后,在体外转录前,先构建金鱼基因组本地数据库,使用blast软件检索并确定靶点的唯一性。
6.如权利要求1所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,步骤S3中,受精卵孵化过程中,还包括检测敲除效率操作,随机挑选10颗出膜前的受精卵,提取受精卵内DNA作为模板,同时扩增敲除靶点附近的DNA序列测序并与正常序列进行对比,根据对比结果计算敲除效率,若敲除效率达到20%及以上则留养该批受精卵继续孵化,否则终止孵化。
7.如权利要求6所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,所述提取受精卵内DNA和扩增具体为:将挑选的受精卵放入到PCR管中,加入70-100μL的50mM的NaOH溶液,于PCR仪中95℃反应20min后作为DNA模板;在敲除靶点附近设计PCR扩增引物,PCR扩增、转化、克隆后挑选10个目的克隆测序并与正常序列进行对比以检验该敲除位点是否敲除。
8.如权利要求2所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,所述雌性百褶裙泰狮金鱼卵子取自同一条F0代雌性百褶裙泰狮金鱼。
9.如权利要求2所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,所述休克处理为热休克,将受精卵在24℃恒温发育43-47min后,再置于41℃水中进行热休克2min处理。
10.如权利要求2所述的一种龙睛百褶裙泰狮金鱼的创制方法,其特征在于,所述休克处理为冷休克,将受精卵在24℃恒温发育3-7min后,再置于0℃冰水中进行冷休克2min处理。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110089136.4A CN112813107B (zh) | 2021-01-22 | 2021-01-22 | 一种龙睛百褶裙泰狮金鱼的创制方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110089136.4A CN112813107B (zh) | 2021-01-22 | 2021-01-22 | 一种龙睛百褶裙泰狮金鱼的创制方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112813107A CN112813107A (zh) | 2021-05-18 |
CN112813107B true CN112813107B (zh) | 2023-10-13 |
Family
ID=75858863
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110089136.4A Active CN112813107B (zh) | 2021-01-22 | 2021-01-22 | 一种龙睛百褶裙泰狮金鱼的创制方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112813107B (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109220904A (zh) * | 2018-09-27 | 2019-01-18 | 天津农学院 | 樱花泰狮金鱼的繁殖和选育方法 |
CN112680447A (zh) * | 2021-01-22 | 2021-04-20 | 中国科学院水生生物研究所 | 一种金鱼基因编辑技术及利用其创制金鱼新品种的方法 |
CN112715436A (zh) * | 2021-01-22 | 2021-04-30 | 中国科学院水生生物研究所 | 一种百褶裙泰狮金鱼的批量繁育方法 |
CN112790125A (zh) * | 2021-01-22 | 2021-05-14 | 中国科学院水生生物研究所 | 一种玉兔百褶裙泰狮金鱼的创制方法 |
-
2021
- 2021-01-22 CN CN202110089136.4A patent/CN112813107B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109220904A (zh) * | 2018-09-27 | 2019-01-18 | 天津农学院 | 樱花泰狮金鱼的繁殖和选育方法 |
CN112680447A (zh) * | 2021-01-22 | 2021-04-20 | 中国科学院水生生物研究所 | 一种金鱼基因编辑技术及利用其创制金鱼新品种的方法 |
CN112715436A (zh) * | 2021-01-22 | 2021-04-30 | 中国科学院水生生物研究所 | 一种百褶裙泰狮金鱼的批量繁育方法 |
CN112790125A (zh) * | 2021-01-22 | 2021-05-14 | 中国科学院水生生物研究所 | 一种玉兔百褶裙泰狮金鱼的创制方法 |
Non-Patent Citations (7)
Title |
---|
Kon, T. et al..The genetic basis of morphological diversity in domesticated goldfish..《Curr Biol》.2020,第30卷(第12期),全文. * |
Ryu JH et al..Advantages, Factors, Obstacles, Potential Solutions, and Recent Advances of Fish Germ Cell Transplantation for Aquaculture-A Practical Review.《Animals (Basel)》.2022,第12卷(第4期),全文. * |
Yu, P. et al..Upregulation of the PPAR signaling pathway and accumulation of lipids are related to the morphological and structural transformation of the dragon-eye goldfish eye..《Sci China Life Sci》.2021,第64卷(第7期),全文. * |
徐康等.鱼类遗传育种中生物学方法的应用及研究进展.《中国科学:生命科学》.2014,第44卷(第12期),全文. * |
李志等.黏性卵鱼类受精卵一种快速高效的显微注射方法.《水生生物学报》.2016,第40卷(第1期),全文. * |
杨恒宇.生物序列数据挖掘技术研究.《合肥工业大学学报(自然科学版)》.2012,第35卷(第9期),全文. * |
江山.金鱼雌核发育及其性别分化相关基因的表达.《中国优秀硕士学位论文全文数据库 农业科技辑》.2013,全文. * |
Also Published As
Publication number | Publication date |
---|---|
CN112813107A (zh) | 2021-05-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102243727B1 (ko) | 유전자 표적화 및 형질 스태킹을 위한 조작된 트랜스진 통합 플랫폼 (etip) | |
KR102147007B1 (ko) | Fad3 성능 유전자좌 및 표적화 파단을 유도할 수 있는 상응하는 표적 부위 특이적 결합 단백질 | |
KR101999410B1 (ko) | 염색체 랜딩 패드 및 관련된 용도 | |
KR20200045517A (ko) | 병태 및 질환 치료용 안티센스 올리고머 | |
AU2017376780A1 (en) | Compositions and methods for modulating growth of a genetically modified gut bacterial cell | |
KR101033818B1 (ko) | 돼지의 알파에스 1 카제인 유전자, 그 프로모터 및 그의 용도 | |
KR102281540B1 (ko) | 식물 조절 요소 및 그의 용도 | |
KR20210113675A (ko) | 토마토 브라운 루고스 프루트 바이러스에 내성이 있는 토마토 식물 | |
AU669484B2 (en) | Transgenic protein production | |
CN101712955B (zh) | 利用基因重组蚕生产生理活性蛋白质的方法 | |
CN113151310B (zh) | 非洲猪瘟基因缺失弱毒株的构建及其作为疫苗的应用 | |
KR20170032317A (ko) | 담배 프로테아제 유전자 | |
CN112715436B (zh) | 一种百褶裙泰狮金鱼的批量繁育方法 | |
KR102424119B1 (ko) | 저하된 상위 및 하위 운동 뉴런 기능 및 감각 지각을 나타내는 비-인간 동물 | |
CN112813107B (zh) | 一种龙睛百褶裙泰狮金鱼的创制方法 | |
CN112790125B (zh) | 一种玉兔百褶裙泰狮金鱼的创制方法 | |
CN109862909A (zh) | 病毒疫苗 | |
CN112969367B (zh) | 作为c3肾小球病模型的补体因子h基因敲除大鼠 | |
CN112243955B (zh) | 新型pls3基因敲除大鼠动物模型的构建方法和应用 | |
RU2820183C2 (ru) | Устойчивость к растрескиванию стручков у растений рода brassica | |
CN112365920B (zh) | 一种鉴定蜜蜂分化关键基因的方法及鉴定得到的基因和应用 | |
CN115322993B (zh) | 一种用于猪基因组定点整合外源基因的安全位点及用其构建猪育种群方法 | |
RU2775653C2 (ru) | Композиции и способы для изменения цветения и архитектуры растений для улучшения потенциальной урожайности | |
RU2817119C2 (ru) | Растения томата, устойчивые к вирусу бурой морщинистости плодов томата | |
CN108135151A (zh) | 前列腺癌的啮齿动物模型 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |