CN114437232A - Cell surface macromolecule quantitative display system and preparation method and application thereof - Google Patents
Cell surface macromolecule quantitative display system and preparation method and application thereof Download PDFInfo
- Publication number
- CN114437232A CN114437232A CN202011222236.1A CN202011222236A CN114437232A CN 114437232 A CN114437232 A CN 114437232A CN 202011222236 A CN202011222236 A CN 202011222236A CN 114437232 A CN114437232 A CN 114437232A
- Authority
- CN
- China
- Prior art keywords
- gly
- leu
- glu
- ala
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 210000004027 cell Anatomy 0.000 title claims abstract description 307
- 229920002521 macromolecule Polymers 0.000 title claims abstract description 36
- 238000002360 preparation method Methods 0.000 title claims abstract description 11
- 102000005962 receptors Human genes 0.000 claims abstract description 97
- 108020003175 receptors Proteins 0.000 claims abstract description 97
- 230000014509 gene expression Effects 0.000 claims abstract description 62
- 108010031480 Artificial Receptors Proteins 0.000 claims abstract description 55
- 239000003446 ligand Substances 0.000 claims abstract description 48
- 108091005703 transmembrane proteins Proteins 0.000 claims abstract description 32
- 102000035160 transmembrane proteins Human genes 0.000 claims abstract description 32
- 230000007704 transition Effects 0.000 claims abstract description 12
- 238000001514 detection method Methods 0.000 claims abstract description 10
- 238000000034 method Methods 0.000 claims description 34
- 239000000427 antigen Substances 0.000 claims description 33
- 102000036639 antigens Human genes 0.000 claims description 32
- 108091007433 antigens Proteins 0.000 claims description 32
- 230000027455 binding Effects 0.000 claims description 27
- 210000000170 cell membrane Anatomy 0.000 claims description 26
- 125000006853 reporter group Chemical group 0.000 claims description 20
- 108010033276 Peptide Fragments Proteins 0.000 claims description 17
- 102000007079 Peptide Fragments Human genes 0.000 claims description 17
- 102000003890 RNA-binding protein FUS Human genes 0.000 claims description 14
- 108090000292 RNA-binding protein FUS Proteins 0.000 claims description 14
- 230000003834 intracellular effect Effects 0.000 claims description 14
- 108091033319 polynucleotide Proteins 0.000 claims description 14
- 102000040430 polynucleotide Human genes 0.000 claims description 14
- 239000002157 polynucleotide Substances 0.000 claims description 14
- 238000012258 culturing Methods 0.000 claims description 11
- 239000002773 nucleotide Substances 0.000 claims description 8
- 125000003729 nucleotide group Chemical group 0.000 claims description 8
- 101000830411 Homo sapiens Probable ATP-dependent RNA helicase DDX4 Proteins 0.000 claims description 7
- 102000039446 nucleic acids Human genes 0.000 claims description 6
- 108020004707 nucleic acids Proteins 0.000 claims description 6
- 150000007523 nucleic acids Chemical class 0.000 claims description 6
- 101001099181 Homo sapiens TATA-binding protein-associated factor 2N Proteins 0.000 claims description 5
- 102100024770 Probable ATP-dependent RNA helicase DDX4 Human genes 0.000 claims description 4
- 238000003501 co-culture Methods 0.000 claims description 4
- 125000003275 alpha amino acid group Chemical group 0.000 claims 5
- 210000004899 c-terminal region Anatomy 0.000 claims 2
- 102100025171 Transcription initiation factor TFIID subunit 12 Human genes 0.000 claims 1
- 230000003284 homeostatic effect Effects 0.000 claims 1
- 238000005516 engineering process Methods 0.000 abstract description 11
- 108090000765 processed proteins & peptides Proteins 0.000 abstract description 10
- 238000002823 phage display Methods 0.000 abstract description 9
- 238000012795 verification Methods 0.000 abstract description 4
- 108090000623 proteins and genes Proteins 0.000 description 66
- 102000004169 proteins and genes Human genes 0.000 description 55
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 53
- 230000004913 activation Effects 0.000 description 46
- 239000005090 green fluorescent protein Substances 0.000 description 43
- 108010061238 threonyl-glycine Proteins 0.000 description 40
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 34
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 34
- 108010038633 aspartylglutamate Proteins 0.000 description 33
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 32
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 30
- 108010064235 lysylglycine Proteins 0.000 description 30
- 108010010147 glycylglutamine Proteins 0.000 description 28
- 150000001413 amino acids Chemical class 0.000 description 26
- 108010049041 glutamylalanine Proteins 0.000 description 25
- 230000000875 corresponding effect Effects 0.000 description 23
- 108010079364 N-glycylalanine Proteins 0.000 description 22
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 22
- 108010078144 glutaminyl-glycine Proteins 0.000 description 22
- 108010034529 leucyl-lysine Proteins 0.000 description 22
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 21
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 21
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 21
- 108010068265 aspartyltyrosine Proteins 0.000 description 21
- 108010004073 cysteinylcysteine Proteins 0.000 description 21
- 108010025306 histidylleucine Proteins 0.000 description 21
- 108010017391 lysylvaline Proteins 0.000 description 21
- 108010076441 Ala-His-His Proteins 0.000 description 20
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 20
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 20
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 20
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 20
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 20
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 20
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 20
- 108010047857 aspartylglycine Proteins 0.000 description 20
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 20
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 20
- 108010073832 phenylalanyl-leucyl-leucyl-arginyl-asparagine Proteins 0.000 description 20
- 108010071207 serylmethionine Proteins 0.000 description 20
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 19
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 19
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 19
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 19
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 19
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 19
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 19
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 19
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 19
- 108010054155 lysyllysine Proteins 0.000 description 19
- 108010090894 prolylleucine Proteins 0.000 description 19
- 108010026333 seryl-proline Proteins 0.000 description 19
- WBNIBLBGDVPFOO-LSBAASHUSA-N (2s)-4-amino-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 WBNIBLBGDVPFOO-LSBAASHUSA-N 0.000 description 18
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 18
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 18
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 18
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 18
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 18
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 18
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 18
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 18
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 18
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 18
- BXLDDWZOTGGNOJ-SZMVWBNQSA-N Arg-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N BXLDDWZOTGGNOJ-SZMVWBNQSA-N 0.000 description 18
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 18
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 18
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 18
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 18
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 18
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 18
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 18
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 18
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 18
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 18
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 18
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 18
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 18
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 18
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 18
- OFIHURVSQXAZIR-SZMVWBNQSA-N Glu-Lys-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OFIHURVSQXAZIR-SZMVWBNQSA-N 0.000 description 18
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 18
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 18
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 18
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 18
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 18
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 18
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 18
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 18
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 18
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 18
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 18
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 18
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 18
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 18
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 18
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 18
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 18
- OPJRECCCQSDDCZ-TUSQITKMSA-N Lys-Trp-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OPJRECCCQSDDCZ-TUSQITKMSA-N 0.000 description 18
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 18
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 18
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 18
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 18
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 18
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 18
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 18
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 18
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 18
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 18
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 18
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 18
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 18
- HJWVPKJHHLZCNH-DVXDUOKCSA-N Trp-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)C)C(O)=O)=CNC2=C1 HJWVPKJHHLZCNH-DVXDUOKCSA-N 0.000 description 18
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 18
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 18
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 18
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 18
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 18
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 18
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 18
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 18
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 18
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 18
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 18
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 18
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 18
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 18
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 18
- 210000000612 antigen-presenting cell Anatomy 0.000 description 18
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 18
- 108010040030 histidinoalanine Proteins 0.000 description 18
- 108010034507 methionyltryptophan Proteins 0.000 description 18
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 17
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 17
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 17
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 17
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 17
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 17
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 17
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 17
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 17
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 17
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 17
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 17
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 17
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 17
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 17
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 17
- FBGDDUKYOBNZJL-WDSOQIARSA-N Trp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FBGDDUKYOBNZJL-WDSOQIARSA-N 0.000 description 17
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 17
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 17
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 17
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 16
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 16
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 16
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 16
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 16
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 16
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 16
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 16
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 16
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 16
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 16
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 16
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 16
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 16
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 16
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 16
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 16
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 16
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 16
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 16
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 16
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 16
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 16
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 16
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 16
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 16
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 16
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 16
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 16
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 16
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 16
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 16
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 16
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 16
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 16
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 16
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 16
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 16
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 16
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 16
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 16
- 108010092114 histidylphenylalanine Proteins 0.000 description 16
- 108010009298 lysylglutamic acid Proteins 0.000 description 16
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 15
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 15
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 15
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 15
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 15
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 15
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 15
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 14
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 14
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 14
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 14
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 14
- 108010016616 cysteinylglycine Proteins 0.000 description 14
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 14
- 230000003993 interaction Effects 0.000 description 14
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 13
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 13
- 238000005191 phase separation Methods 0.000 description 13
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 12
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 12
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 12
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 12
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 12
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 12
- 239000013604 expression vector Substances 0.000 description 12
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 12
- 239000012071 phase Substances 0.000 description 12
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 11
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 11
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 11
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 11
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 11
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 10
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 10
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 10
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 10
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 10
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 10
- 230000004927 fusion Effects 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 9
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 9
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 9
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 9
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 9
- 101000666131 Homo sapiens Protein-glutamine gamma-glutamyltransferase 4 Proteins 0.000 description 9
- 102100038103 Protein-glutamine gamma-glutamyltransferase 4 Human genes 0.000 description 9
- 241000315672 SARS coronavirus Species 0.000 description 9
- 230000002776 aggregation Effects 0.000 description 9
- 238000004220 aggregation Methods 0.000 description 9
- 238000010276 construction Methods 0.000 description 9
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 8
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 8
- 101000692455 Homo sapiens Platelet-derived growth factor receptor beta Proteins 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 210000004962 mammalian cell Anatomy 0.000 description 8
- 230000002829 reductive effect Effects 0.000 description 8
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 7
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 7
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 7
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 7
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 7
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 7
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 7
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 7
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 7
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 7
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 7
- 102100026547 Platelet-derived growth factor receptor beta Human genes 0.000 description 7
- 108700008625 Reporter Genes Proteins 0.000 description 7
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 7
- 210000003719 b-lymphocyte Anatomy 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 7
- 238000007898 magnetic cell sorting Methods 0.000 description 7
- 238000002826 magnetic-activated cell sorting Methods 0.000 description 7
- 210000003370 receptor cell Anatomy 0.000 description 7
- 238000012216 screening Methods 0.000 description 7
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 6
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 6
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 6
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 6
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 6
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 6
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 6
- 102100028089 RING finger protein 112 Human genes 0.000 description 6
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 6
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 6
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000001218 confocal laser scanning microscopy Methods 0.000 description 6
- 210000001280 germinal center Anatomy 0.000 description 6
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 6
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 6
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 5
- RCQRKPUXJAGEEC-ZLUOBGJFSA-N Ala-Cys-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RCQRKPUXJAGEEC-ZLUOBGJFSA-N 0.000 description 5
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 5
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 5
- SMYXEYRYCLIPIL-ZLUOBGJFSA-N Cys-Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O SMYXEYRYCLIPIL-ZLUOBGJFSA-N 0.000 description 5
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 5
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 5
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 5
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 5
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 5
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 5
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 5
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 5
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 5
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 5
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 5
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 5
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 5
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 239000011324 bead Substances 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 210000004379 membrane Anatomy 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 4
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 4
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 4
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 4
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 4
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 4
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 4
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 4
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 4
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 4
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 4
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 4
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 4
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 4
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 4
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 4
- 108010052285 Membrane Proteins Proteins 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 4
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 4
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 4
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 4
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 4
- 101000629313 Severe acute respiratory syndrome coronavirus Spike glycoprotein Proteins 0.000 description 4
- 108010003723 Single-Domain Antibodies Proteins 0.000 description 4
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 4
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 4
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108091005948 blue fluorescent proteins Proteins 0.000 description 4
- 230000002596 correlated effect Effects 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 239000007791 liquid phase Substances 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004001 molecular interaction Effects 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 230000004962 physiological condition Effects 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 230000006916 protein interaction Effects 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 4
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 3
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 3
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 3
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 3
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 3
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 3
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 3
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 3
- 108091008875 B cell receptors Proteins 0.000 description 3
- 241001678559 COVID-19 virus Species 0.000 description 3
- 241000711573 Coronaviridae Species 0.000 description 3
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 3
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 3
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 3
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 3
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 3
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 3
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 3
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 3
- 102000018697 Membrane Proteins Human genes 0.000 description 3
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 101710160107 Outer membrane protein A Proteins 0.000 description 3
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 3
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 3
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 3
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 3
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 3
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 3
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 3
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 3
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 3
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 3
- KZTLJLFVOIMRAQ-IHPCNDPISA-N Trp-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZTLJLFVOIMRAQ-IHPCNDPISA-N 0.000 description 3
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 3
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 3
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 3
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 3
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000000890 antigenic effect Effects 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 230000003472 neutralizing effect Effects 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 229950010131 puromycin Drugs 0.000 description 3
- 238000011002 quantification Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- IAOXXKYIZHCAQJ-ACZMJKKPSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2,4-diamino-4-oxobutanoyl]amino]propanoyl]amino]acetyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O IAOXXKYIZHCAQJ-ACZMJKKPSA-N 0.000 description 2
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 2
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 2
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 2
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 2
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 2
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 2
- RNAQPBOOJRDICC-BPUTZDHNSA-N Asp-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N RNAQPBOOJRDICC-BPUTZDHNSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 102000000844 Cell Surface Receptors Human genes 0.000 description 2
- 108010001857 Cell Surface Receptors Proteins 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 108010001515 Galectin 4 Proteins 0.000 description 2
- 102100039556 Galectin-4 Human genes 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 2
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 2
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 2
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 2
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 2
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 2
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 2
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 2
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 2
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 2
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 2
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 2
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 2
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 2
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 2
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- FJCGVRRVBKYYOU-DCAQKATOSA-N His-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N FJCGVRRVBKYYOU-DCAQKATOSA-N 0.000 description 2
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 description 2
- 101000854014 Homo sapiens Heterogeneous nuclear ribonucleoprotein A1 Proteins 0.000 description 2
- 101000938536 Homo sapiens RNA-binding protein EWS Proteins 0.000 description 2
- 101001061518 Homo sapiens RNA-binding protein FUS Proteins 0.000 description 2
- 101000891092 Homo sapiens TAR DNA-binding protein 43 Proteins 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 2
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 2
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 2
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 2
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 2
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 2
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 2
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 2
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 2
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- 241000282330 Procyon lotor Species 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 102100024952 Protein CBFA2T1 Human genes 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 2
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 2
- 102100038917 TATA-binding protein-associated factor 2N Human genes 0.000 description 2
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 2
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 2
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 2
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 2
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 2
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 2
- NFVQCNMGJILYMI-SZMVWBNQSA-N Trp-Met-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NFVQCNMGJILYMI-SZMVWBNQSA-N 0.000 description 2
- IYHRKILQAQWODS-VJBMBRPKSA-N Trp-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IYHRKILQAQWODS-VJBMBRPKSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 2
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 2
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 230000009824 affinity maturation Effects 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 230000009830 antibody antigen interaction Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 108010025198 decaglycine Proteins 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 238000010494 dissociation reaction Methods 0.000 description 2
- 230000005593 dissociations Effects 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010025934 hnRNP A2 Proteins 0.000 description 2
- 102000048657 human ACE2 Human genes 0.000 description 2
- 102000055979 human DDX4 Human genes 0.000 description 2
- 102000053502 human EWSR1 Human genes 0.000 description 2
- 102000045839 human FUS Human genes 0.000 description 2
- 102000056912 human TAF15 Human genes 0.000 description 2
- 102000055128 human TARDBP Human genes 0.000 description 2
- 102000046185 human hnRNPA1 Human genes 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000011325 microbead Substances 0.000 description 2
- 208000025113 myeloid leukemia Diseases 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 230000004845 protein aggregation Effects 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 102200061086 rs886041877 Human genes 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 108091006106 transcriptional activators Proteins 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- SEFVRKXJJPMVHQ-YUMQZZPRSA-N (2s)-2-[[2-[[(2s)-2-[(2-aminoacetyl)amino]-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]butanedioic acid Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O SEFVRKXJJPMVHQ-YUMQZZPRSA-N 0.000 description 1
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 1
- HGHOBRRUMWJWCU-FXQIFTODSA-N (4s)-4-[[(2s)-2-aminopropanoyl]amino]-5-[[(2s)-3-carboxy-1-(carboxymethylamino)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O HGHOBRRUMWJWCU-FXQIFTODSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- YMIYZAOBQDRCPP-UHFFFAOYSA-N Ala-Thr-Cys-Cys Chemical compound CC(N)C(=O)NC(C(O)C)C(=O)NC(CS)C(=O)NC(CS)C(O)=O YMIYZAOBQDRCPP-UHFFFAOYSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- PZVMBNFTBWQWQL-DCAQKATOSA-N Arg-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PZVMBNFTBWQWQL-DCAQKATOSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- ASCGFDYEKSRNPL-CIUDSAMLSA-N Asn-Glu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O ASCGFDYEKSRNPL-CIUDSAMLSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- AAIUGNSRQDGCDC-ZLUOBGJFSA-N Asp-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O AAIUGNSRQDGCDC-ZLUOBGJFSA-N 0.000 description 1
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241000282836 Camelus dromedarius Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 102100035360 Cerebellar degeneration-related antigen 1 Human genes 0.000 description 1
- 108091007741 Chimeric antigen receptor T cells Proteins 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 235000007516 Chrysanthemum Nutrition 0.000 description 1
- 244000189548 Chrysanthemum x morifolium Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108010061994 Coronavirus Spike Glycoprotein Proteins 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 1
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 1
- RWGDABDXVXRLLH-ACZMJKKPSA-N Cys-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N RWGDABDXVXRLLH-ACZMJKKPSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 1
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 1
- NGWIXHCFVSSVHX-IHPCNDPISA-N Cys-Tyr-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NGWIXHCFVSSVHX-IHPCNDPISA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 210000000712 G cell Anatomy 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 1
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- YUXIEONARHPUTK-JBACZVJFSA-N Glu-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YUXIEONARHPUTK-JBACZVJFSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- JKSMZVCGQWVTBW-STQMWFEESA-N Gly-Trp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O JKSMZVCGQWVTBW-STQMWFEESA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 244000060234 Gmelina philippensis Species 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 102100031547 HLA class II histocompatibility antigen, DO alpha chain Human genes 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- CVEFOCIRMVGWDS-XIRDDKMYSA-N His-Cys-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 CVEFOCIRMVGWDS-XIRDDKMYSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- HBGKOLSGLYMWSW-DCAQKATOSA-N His-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CS)C(=O)O HBGKOLSGLYMWSW-DCAQKATOSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- 101100005713 Homo sapiens CD4 gene Proteins 0.000 description 1
- 101000866278 Homo sapiens HLA class II histocompatibility antigen, DO alpha chain Proteins 0.000 description 1
- 101100519721 Homo sapiens PDGFRB gene Proteins 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 1
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- XFANQCRHTMOEAP-WDSOQIARSA-N Lys-Pro-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XFANQCRHTMOEAP-WDSOQIARSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 101100294417 Mus musculus Notch1 gene Proteins 0.000 description 1
- 241000282341 Mustela putorius furo Species 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- JOCBASBOOFNAJA-UHFFFAOYSA-N N-tris(hydroxymethyl)methyl-2-aminoethanesulfonic acid Chemical compound OCC(CO)(CO)NCCS(O)(=O)=O JOCBASBOOFNAJA-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108010070047 Notch Receptors Proteins 0.000 description 1
- 102000005650 Notch Receptors Human genes 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 241000282316 Paguma Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- YZJKNDCEPDDIDA-BZSNNMDCSA-N Phe-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 YZJKNDCEPDDIDA-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- 241000283966 Pholidota <mammal> Species 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 102000004229 RNA-binding protein EWS Human genes 0.000 description 1
- 108090000740 RNA-binding protein EWS Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108091005634 SARS-CoV-2 receptor-binding domains Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241000555745 Sciuridae Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 241000008910 Severe acute respiratory syndrome-related coronavirus Species 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 241000713880 Spleen focus-forming virus Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 102100040347 TAR DNA-binding protein 43 Human genes 0.000 description 1
- 101150014554 TARDBP gene Proteins 0.000 description 1
- 239000007994 TES buffer Substances 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- XIHGJKFSIDTDKV-LYARXQMPSA-N Thr-Phe-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIHGJKFSIDTDKV-LYARXQMPSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 1
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- CFMGQWYCEJDTDG-XIRDDKMYSA-N Trp-Lys-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 CFMGQWYCEJDTDG-XIRDDKMYSA-N 0.000 description 1
- VOCHZIJXPRBVSI-XIRDDKMYSA-N Trp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VOCHZIJXPRBVSI-XIRDDKMYSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- YXSSXUIBUJGHJY-SFJXLCSZSA-N Trp-Thr-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=CC=C1 YXSSXUIBUJGHJY-SFJXLCSZSA-N 0.000 description 1
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 241000282458 Ursus sp. Species 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 210000005006 adaptive immune system Anatomy 0.000 description 1
- 230000004520 agglutination Effects 0.000 description 1
- 108010084217 alanyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000011091 antibody purification Methods 0.000 description 1
- 230000030741 antigen processing and presentation Effects 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229930189065 blasticidin Natural products 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 108700010039 chimeric receptor Proteins 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000007783 downstream signaling Effects 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 description 1
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 238000012255 expression quantity analysis Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000001215 fluorescent labelling Methods 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 230000003325 follicular Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010065713 glycyl-glycyl-tyrosyl-arginine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010067502 glycylglycyl-methionyl-glycylglycine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 102000051957 human ERBB2 Human genes 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000013383 initial experiment Methods 0.000 description 1
- 210000004020 intracellular membrane Anatomy 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- -1 mCherry Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 210000000110 microvilli Anatomy 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 231100001274 therapeutic index Toxicity 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/71—Receptors; Cell surface antigens; Cell surface determinants for growth factors; for growth regulators
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Abstract
The invention relates to the technical field of biology, in particular to a cell surface macromolecule quantitative display system and a preparation method and application thereof. The phase transition peptide segment is fused on the transmembrane protein of the ligand quantitative presenting cell, the receptor expression cell is a synNotch synthetic receptor expression cell, and the extracellular region of the synNotch synthetic receptor is fused with a region which can be specifically recognized and combined with the ligand on the surface of the ligand presenting cell. When the ligand quantitative presenting cells and the receptor expressing cells are cultured together, the receptor of the receptor expressing cells can be specifically and quantitatively activated. The cell surface macromolecule quantitative display system can be applied to ligand-receptor (including antigen-antibody) affinity detection and/or batch acquisition of high-affinity antibodies, so that batch high-affinity antibodies can be quickly obtained, and subsequent tedious verification work of technologies such as phage display and the like is avoided.
Description
Technical Field
The invention relates to the technical field of biology, in particular to a cell surface macromolecule quantitative display system and a preparation method and application thereof.
Background
How mammalian cells sense and differentiate the magnitude of cell surface receptors and their corresponding ligand affinities when the cells are in contact with each other, and convert them into specific quantitative biological events, remains a scientific puzzle. For example, the mechanism of how lymphoid B cells expressing high affinity antibodies (i.e., B cell receptors) compete with low affinity B cells for antigen molecules displayed on the surface of antigen presenting cells remains unclear.
Antibody affinity maturation is a hallmark of the immune response of the adaptive immune system, during which high affinity antibodies to a particular antigen will be produced. Whereas antibody affinity maturation of Germinal Centers (GCs) mainly involves high frequency mutation (SHM) of antibody variable regions and affinity-based B cell selection. Among them, high frequency mutation induces a large number of diverse B Cell Receptors (BCRs) with different affinities, and in the germinal center, B cells with a higher affinity BCR can better compete to bind follicular helper T cells (Tfh) located in the bright zone of the germinal center, thereby allowing a small number of high affinity B cells to finally dominate. This affinity-based selection depends on the amount of antigen that is extracted from the surface of the Antigen Presenting Cell (APC) by the B cell and re-presented at the plasma membrane of the autologous cell. Activation of B cell downstream signaling pathway responses by specific antigens is largely determined by the affinity between the antigen and the BCR. It has been reported that high affinity antigen-antibody interactions may lead to stronger mechanical forces, and that the particular micro-cluster structure of BCR formation in germinal centers may also be associated with this mechanism. However, the molecular mechanism of how the high affinity BCR captures more antigenic molecules is still unclear. The synNotch synthesis system reported as an antibody expression system does not discriminate well between antibody-antigen affinity differences.
Disclosure of Invention
In view of the above-mentioned shortcomings of the prior art, the present invention is directed to a quantitative display system for cell surface macromolecules, and a method for preparing the same and use thereof, which are used to solve the problems of the prior art.
In order to achieve the above objects and other related objects, the present invention provides a recombinant transmembrane protein fused with a phase transition peptide fragment.
The invention also provides an isolated polynucleotide encoding the recombinant transmembrane protein.
The invention also provides a nucleic acid construct, which comprises the polynucleotide.
The invention also provides a cell on which the transmembrane protein is fused.
The invention also provides a cell surface macromolecule quantitative display system, which comprises the ligand quantitative presenting cell and the receptor expressing cell, wherein the receptor of the receptor expressing cell can be specifically and quantitatively activated when the ligand quantitative presenting cell and the receptor expressing cell are cultured together.
The invention also provides a preparation method of the cell surface macromolecule quantitative display system, which comprises the following steps:
1) cloning the target receptor on the cell membrane of the receptor expression cell;
2) cloning the target ligand to the ligand quantitative presenting cell.
The invention also provides application of the cell surface macromolecule quantitative display system in ligand-receptor affinity detection and/or batch acquisition of high-affinity antibodies.
The invention also provides a method for detecting ligand-receptor affinity, which comprises the steps of co-culturing the ligand quantitative presenting cells and the receptor expression cells obtained in the preparation method, and then detecting the binding condition of the target ligand and the target receptor.
The invention also provides a method for obtaining high-affinity antibodies in batches, which is characterized by comprising the following steps:
1) constructing receptor expression cells carrying a synNotch antibody library and a reporter group;
2) co-culturing the receptor expressing cells of step 1) with said ligand quantitative presenting cells to activate a cell surface macromolecule quantitative display system;
3) collecting cells in a co-culture system, and primarily enriching receptor expression cells with positive report groups;
4) culturing the receptor expression cells primarily enriched in the step 3) for 5-10 days, and enriching again by using the new ligand quantitative presenting cells to obtain the high-affinity antibody carried by the receptor expression cells.
As mentioned above, the cell surface macromolecule quantitative display system, the preparation method and the application thereof have the following beneficial effects:
1) the cell surface macromolecule quantitative display technology based on protein phase transition can convert the affinity of a ligand-receptor, such as an antigen-antibody, into the strength of the expression of a reporter gene in cells by combining with a synNotch system;
2) the cell surface macromolecule display technology is proved to be related to the phase transition of the intracellular part of the membrane protein, and a plurality of tool molecules capable of supporting quantitative display are found, and the molecules can generate protein liquid-liquid phase transition under specific physiological conditions;
3) the cell surface macromolecule display technology can be used for cell surface ligand-receptor affinity identification, such as quantitative verification of interaction of coronavirus neutralizing antibody or receptor protein ACE2 and coronavirus spike protein;
4) by combining with a phage display technology or other antibody library construction technologies, the high-affinity antibodies in batches can be quickly obtained, and the subsequent tedious verification work of the phage display technology can be avoided.
Drawings
FIG. 1 shows an example of the cell surface macromolecule quantitative display system and the detection of ligand-receptor binding affinity of the present invention, wherein the following figures are illustrated: schematic system diagram of Ag-mem and Ag-m-LC cell interacting with anti-GFP synNotch receptor cell respectively; schematic representation of expression vectors for Ag-mem, Ag-m-LC and synNotch receptors; GFP-mem and GFP-mem-FUSLCReporting the activation ratio of the system after the cell interacts with the anti-GFP/mCherry receptor cell; reporting the activation ratio of the system after interaction of the mCherry-mem cell and the anti-GFP/mCherry receptor cell; GFP-mem and GFP-m-FUSLCReporting the activation ratio of the system after the cell interacts with two high/low anti-GFP receptor cells according to different ratios; F. the information of the anti-GFP nano antibody and the expression quantity on the cell membrane after corresponding syncnotch receptors are fused; GFP-mem and GFP-m-FUSLCThe corresponding BFP of the cells reports the proportion of systemic activation.
FIG. 2 shows the principle of quantitative presentation of a system for quantitative display of macromolecules on the surface of cells, in which the figures are illustrated as follows: GFP-mem and GFP-m-FUSLCConfocal fluorescence imaging of GFP in cells; GFP-mem and GFP-m-FUSLCPerforming GFP (green fluorescent protein) localization statistics on the cells; C. GFP-mem and GFP-m-FUS with different expression amountsLCFlow quantification of cell surface GFP protein; D. GFP-mem and GFP-m-FUS with different expression amountsLCThe ratio of reporter system activation to cell; E. interaction schematic diagram of red fluorescence labeling synNotch receptor expression cell (modified in HEK293T cell line of human embryonic kidney cell) and ligand quantitative presenting cell (modified in K562 cell line of human myeloid leukemia cell); F. the interface where the ligand quantitative presenting cells interact with synNotch receptor cells; qDis system mode of action diagram; HER2-mem and HER2-m-FUSLCSchematic representation of the system for interaction with anti-HER2 synNotch receptor cells; I. cell surface HER2 extracellular domain expression level; J. the expression level of cell surface anti-HER2-scfv synNotch receptor; HER2-mem and HER2-m-FUSLCThe corresponding reporting system activation ratio.
FIG. 3 shows the principle of phase transition for quantification of a system for quantitatively displaying macromolecules on a cell surface, wherein the diagrams are illustrated as follows: FUSLCDistribution of Q, G, Y amino termini in proteins, and FUSLCThe protein membrane surface expression quantity of different mutant cells and the corresponding activation ratio of a report system; GFP-m-FUSLC-478 cell corresponding reporter activation ratio; GFP-m-TAF15LCAnd GFP-m-DDX4LCThe ratio of reporter system activation to cell; the corresponding reporter system activation ratio of the GFP-foldon-m cells and the GFP-Fc-m cells;
FIG. 4 shows the principle of quantification and mutation analysis for a quantitative display system for macromolecules on the cell surface, wherein the figures are illustrated as follows: A. different GFP-m-FUSLCMutants form a statistical map of the rate of protein aggregation in the opto-droplet system, with the corresponding statistical cell numbers noted in the legend; GFP-m-FUSLC Q>gPG confocal fluorescence imaging of G cells, flow assay and reporting of system activation ratio results;C.GFP-m-FUSLC G>GFP confocal fluorescence imaging of cells A, flow detection and reporting of system activation ratio results; GFP-m-FUSLCGFP confocal fluorescence imaging of Y27S cells, flow assay and reporting of the system activation ratio results; GFP-m-FUSLCGFP confocal fluorescence imaging of Y7A cells, flow detection and reporting of the system activation ratio results; GFP-m-FUSLCGFP confocal fluorescence imaging of 27R cells, flow detection and reporting of system activation ratio results;
FIG. 5 shows the ratio of reporter activation for different GFP-m-LC cells.
FIG. 6 shows the interaction between the surface spinous process protein S-RBD of various coronaviruses and ACE2 or its corresponding antibody, wherein the illustration of each figure is as follows: A. synNotch cells expressing hACE2 and various antibodies and S-RBD-m-FUS fusing coronavirus surface spinous process proteins of various speciesLCThe reporter ratio for protein interaction activation; B. S-RBD-m-FUS on the surface of chimeric receptor expressing cells of ACE2 of different species and three coronavirusesLCReporter ratio of protein interaction activation. C. Three-dimensional structure diagram of interaction of human ACE2 protein (green) and SARS-CoV-2S-RBD protein (blue), wherein the peptide fragment directly acting on ACE2 is marked orange.
FIG. 7 shows the batch isolation of high affinity antibodies for a cell surface macromolecular quantitative display system, wherein the figures are illustrated as follows: schematic diagram of hcd8 reporting system; GFP-mem and GFP-m-FUSLCThe hCD8 report system activation proportion is obtained after the cells interact with high, medium and low affinity anti-GFP receptor cells respectively; C. the high affinity antibody expression cell proportion after each round of qDis-MACS enrichment; qDis-MACS flow diagram; E. after three rounds of qDis-MACS screening, the antibody is enriched in proportion, and the red color is the final selected expression antibody; bli method determines the corresponding affinity of the antibody.
FIG. 8 shows the binding dissociation curve for the antibody measured by the BLI method.
Detailed Description
The invention firstly provides a recombinant transmembrane protein, and a phase-change peptide segment is fused on the recombinant transmembrane protein.
The phase-transition peptide fragment refers to a peptide fragment capable of undergoing phase transition under physiological conditions. The phase transition peptide fragment can be any one or more peptide fragments in LLPSDB database (http:// bio-comp. ucas. ac. cn/LLPSDB), PhaSePro database (https:// PhaSePro. elte. hu) or PhaSepDB database (http:// db. phasep. pro), or peptide fragments mutated based on peptide fragments in the database. The peptide segment mutated based on the peptide segment in the database has a similarity of more than 80%, more than 90%, more than 95% or more than 99% with the phase-change peptide segment in the database. The low complexity sequence (also called LC tag) is one of the phase-change peptide fragments.
The low-complexity sequence refers to a protein sequence which is enriched with a specific amino acid component and/or repeated appearance of a specific amino acid sequence. Such as repeated expression of specific amino acid residues, repeated arrangement of single amino acid chains or short amino acid chains. Low Complexity Sequences (LCs) exist in regions of intrinsic disorder, and proteins or peptide fragments containing low complexity sequences can aggregate under physiological conditions and drive liquid-liquid phase separation to occur. The liquid-liquid phase separation occurs as a general property of proteins and nucleic acids in a particular situation, and the phase separation allows the protein to be concentrated in a localized area with a significant increase in concentration.
The low-complexity sequence is from one or more of TAF15 protein, PrL structural domain of FUS protein and DDX4 protein.
Specifically, the low-complexity sequence source comprises amino acids 1-214 (shown in SEQ ID NO. 1) of the human FUS protein, amino acids 1-478 (shown in SEQ ID NO. 2) of the FUS protein, amino acids 1-208 (shown in SEQ ID NO. 3) of the human TAF15 protein, and amino acids 1-236 (shown in SEQ ID NO. 4) of the human DDX4 protein.
The low-complexity sequence can also be applied to the ligand quantitative presenting cells of the invention by modifying some protein sequences of natural LC tags (such as amino acids 415-218 of human TDP-43 protein, amino acids 46-266 of human EWSR1 protein, amino acids 341-190-186 of human HNRNPA1 protein, amino acids 320-186 of human HNRNPA2 protein, and twenty, thirty, forty, fifty polymers of repeated arrangement of two amino acids proline and arginine). For example by adjusting the amount of protein expression to approach the threshold required for phase separation in a 2D system, or by altering the rate at which proteins form multivalent binding, etc.
The low complexity sequences enable quantitative detection of ligand-receptor binding by reducing cell surface antigen expression and increasing overall affinity between ligand-receptors.
In one embodiment, the phase change peptide segment is fused to an intracellular domain of a transmembrane protein.
In a preferred embodiment, the low complexity sequence is fused to a ligand-PDGFR (platelet derived growth factor receptor) transmembrane protein, i.e., the recombinant transmembrane protein is a ligand-PDGFR recombinant transmembrane protein. Under physiological conditions, antigen-PDGFR (ligand-PDGFR) transmembrane domain fusion protein (Ag-mem) is abundantly expressed in the plasma membrane of antigen-presenting cells (corresponding to the ligand quantitative-presenting cells of the present application). The recombinant transmembrane protein fused with the LC label can be called Ag-m-LC or Ag-mem-LC for short. The Ag-m-LC recombinant transmembrane protein is obtained by combining any target ligand with a PDGFR transmembrane domain by a molecular biology method well known in the field, then fusing an intracellular peptide segment of a PDGFR receptor, and finally fusing different LC tags at the C end. In one embodiment, the ligand of interest is selected from an antigen. Such as GFP protein, mCherry protein, HER2 extracellular domain, SARS-CoV S protein, SARS-CoV-2S protein, TGP protein.
In one embodiment, the amino acid sequence of the ligand-PDGFR recombinant transmembrane protein fused to the low-complexity sequence at the C-terminus not including the ligand moiety (i.e., the amino acid sequence including only the transmembrane region, intracellular region and LC tag of PDGFR) is as set forth in SEQ ID No. 5:
(the PDGFR transmembrane region is underlined; the PDGFR intracellular short peptide is underlined; and FUS is the remainder)LCPeptide fragment)
The invention also provides an isolated polynucleotide encoding the recombinant transmembrane protein.
The polynucleotide of the present invention may be in the form of DNA or RNA. The form of DNA includes cDNA, genomic DNA or artificially synthesized DNA. The DNA may be single-stranded or double-stranded. The DNA may be the coding strand or the non-coding strand.
In one embodiment, the polynucleotide has the nucleotide sequence shown as SEQ ID No. 6.
The invention also relates to variants of the above polynucleotides which encode fragments, analogues and derivatives of the enzymes or proteins having the same amino acid sequence as the present invention. The variant of the polynucleotide may be a naturally occurring allelic variant or a non-naturally occurring variant. These nucleotide variants include substitution variants, deletion variants and insertion variants. As is known in the art, an allelic variant is a substitution of a polynucleotide, which may be a substitution, deletion, or insertion of one or more nucleotides, without substantially altering the function of the protein encoded thereby.
The invention also provides a nucleic acid construct, which comprises the polynucleotide.
In certain embodiments of the invention, the nucleic acid construct is constructed by inserting the isolated polynucleotide into a multiple cloning site of an expression vector. The expression vector of the present invention is generally referred to various commercially available expression vectors well known in the art, and may be, for example, a bacterial plasmid, a bacteriophage, a yeast plasmid, a plant cell virus, a mammalian cell virus such as an adenovirus, a retrovirus, or other vectors.
The invention provides a cell expressing the recombinant transmembrane protein.
In one embodiment, the cell is a quantitative ligand-presenting cell; preferably, the ligand-quantitatively-presenting cells include antigen-quantitatively-presenting cells. The ligand quantitative presenting cells are selected from K562 human chronic myelogenous leukemia cell lines. In one embodiment, the quantitative ligand presenting cell is a stable transformable cell line stably expressing Ag-m-LC.
In another embodiment, the cell is a host cell for producing the transmembrane protein.
The invention also provides a cell surface macromolecule quantitative display system: the system comprises the ligand quantitative presenting cell and the receptor expressing cell, and the receptor of the receptor expressing cell can be specifically activated when the ligand quantitative presenting cell and the receptor expressing cell are cultured together.
The macromolecule refers to a macromolecule existing in organisms such as protein, nucleic acid, polysaccharide and the like or artificially synthesized.
Specifically, the extracellular domain of the receptor-expressing cells is fused to a region capable of specifically recognizing and binding to the ligand presented by the ligand-presenting cells, and upon contact with each other, specifically activates synNotch synthesis receptors.
The receptor expressing cells are also known as antibody expressing cells (ABC). In one embodiment, the receptor of the receptor-expressing cell is selected from synNotch synthetic receptors. The synNotch synthetic receptor is fused with a receptor of interest.
synNotch synthetic receptors are located on cell membranes, extracellular domains of the synNotch synthetic receptors are fused with purposeful receptors, such as nano antibodies, single chain antibodies (scFv), antibody analogs or cell membrane surface protein receptors and extracellular protein peptide fragments, and the purposeful receptors in the extracellular domains can be specifically combined after being contacted with antigens on the surface of ligand quantitative presenting cells; the intracellular domain inside the cytoplasmic membrane of synNotch synthetic receptors is fused with a transcriptional activator, which is released and initiates the expression of the intracellular blue fluorescent protein when the receptor of interest and the ligand of interest are bound (fig. 1A, 1B). Activation of synNotch synthetic receptors in receptor-expressing cells will be affected slightly when the affinity between ligand-receptor is altered. The synNotch synthetic receptor of the present invention is obtained by fusing only the intended receptor to an existing synNotch receptor, and methods for fusing either an existing synNotch receptor or an intended receptor are well known to those skilled in the art.
In one embodiment, the receptor expressing cell is a stable transgenic cell line expressing both synNotch synthetic receptor and reporter.
In one embodiment, the receptor expressing cells carry a fluorescent reporter group. The fluorescent reporter group is used to determine whether the receptor-expressing cell is activated.
In general, the fluorescent reporter may be selected from fluorescent protein-encoding genes such as GFP, mCherry, BFP, and the like. In one embodiment, the fluorescent reporter group is selected from BFP.
In another embodiment, the reporter is a non-fluorescent reporter. The non-fluorescent reporter group is only required to be capable of promoting protein expression and positioning expression on a cytoplasmic membrane when the antigen is combined with the antibody, and sorting out cells expressing the protein by utilizing a commercially available magnetic bead sorting technology. For example, the non-fluorescent reporter is selected from the group consisting of a CD8 reporter or a CD4 reporter. The CD8 and CD4 reporter genes are derived from mammals. Preferably hCD8 and hCD4 reporter genes, namely human CD8 and human CD4 reporter genes. The non-fluorescent reporter group may also be selected from drug resistance genes. For example, common drug resistance genes such as puromycin resistance, blasticidin resistance, and the like.
In general, any cell line that expresses a receptor can be used as the receptor-expressing cell. In one embodiment, the receptor expressing cells are prepared by expressing synNotch synthetic receptors in K562 myeloid leukemia cells.
The cell surface macromolecule quantitative display system is a mammalian cell or non-mammalian cell surface macromolecule quantitative display system. Preferably, the quantitative display system is a mammalian cell surface macromolecule quantitative display system.
The cell surface macromolecule quantitative display system can detect the molecular interaction of the ligand and the receptor with the affinity ranging from micromolar to nanomolar, but the activation of the receptor and the intermolecular affinity are not well correlated in the affinity range.
The liquid-liquid phase separation of membrane proteins inside the cytoplasmic membrane rapidly increases the concentration of receptor proteins in local regions, thereby increasing the affinity (avidity) between ligand-receptors in the extracellular region, i.e., the cumulative strength of multiple ligand-receptor molecular interactions, which in turn amplifies the differences in individual ligand-receptor affinities (affinity). The inventors have found that fusion of a variety of low complexity sequences to ligand (antigen) presenting transmembrane proteins can result in the production of large aggregates of antigen molecules on the membrane. This aggregated ligand (antigen) presentation pattern, when combined with the synNotch synthetic receptor system, enables the ligand-receptor system to be converted into an affinity-proportional quantitative mammalian cell membrane display system.
The invention also provides a preparation method of the cell surface macromolecule quantitative display system, which comprises the following steps:
1) cloning the target receptor on the cell membrane of the receptor expression cell;
2) cloning a ligand of interest onto said ligand quantitative presenting cells;
the method of cloning the receptor of interest into receptor expressing cells and the ligand of interest into the ligand-presenting cells is a molecular biological method well known to those skilled in the art.
The receptor expressing cells carry a fluorescent reporter group.
The ligand and the receptor can be antigen and antibody.
The invention also provides application of the cell surface macromolecule quantitative display system in ligand-receptor affinity detection and/or batch acquisition of high-affinity antibodies.
The invention also provides a method for detecting ligand-receptor affinity, which comprises the steps of culturing the ligand quantitative presenting cells cloned with the target ligand and the receptor expression cells together, and detecting the binding condition of the target ligand and the target receptor.
In particular, binding is detected by a reporter group. The greater the number of cells expressing the reporter group, the greater the affinity between the ligand of interest and the receptor of interest, and vice versa.
The invention also provides a method for obtaining high affinity antibodies in batches, comprising the following steps:
1) constructing receptor expression cells carrying a synNotch antibody library and a reporter group;
2) co-culturing the receptor expressing cells of step 1) with the ligand quantitative presenting cells to activate a cell surface macromolecule quantitative display system;
3) collecting cells in a co-culture system, and primarily enriching cells with positive reporter groups;
4) and (3) culturing the cells primarily enriched in the step 3) for 5-10 days, and enriching again by using the new ligand quantitative presenting cells, wherein the receptor expression cells obtained by enrichment carry a large amount of high-affinity antibodies.
The synNotch antibody library includes a plurality of synNotch synthetic receptors fused to a receptor of interest. In general, the synNotch antibody library can be obtained by the existing antibody library construction method. For example by phage display technology.
Specifically, the reporter group in the step 1) is selected from a fluorescent reporter group or a non-fluorescent reporter group. Preferably, it is selected from non-fluorescent reporter groups. More preferably, it is selected from the hCD8 reporter genes.
In one embodiment, in step 2), the following method is used to activate the cell surface macromolecule quantitative display system: separately taking receptor expression cells and antigen expression cells, resuspending the cells, and placing the cells in CO2And performing shake culture in an incubator for 36-60 hours.
In one embodiment, step 3) is performed by subjecting the magnetic beads capable of specifically binding to the protein expressed by the reporter group to a magnetic field for preliminary enrichment. Magnetic beads capable of specifically binding to a protein expressed by a reporter group are, for example, anti-hCD8 magnetic beads.
In one embodiment, after the cells initially enriched in step 3) are cultured for about 1 week and the expression level of hCD8 is significantly decreased, the initially enriched cells are mixed with new ligand-presenting cells and enriched again. The re-enrichment can be performed in one or more rounds. For example: two, three, four or more rounds of re-enrichment may be performed.
The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention.
Before the present embodiments are further described, it is to be understood that the scope of the invention is not to be limited to the specific embodiments described below; it is also to be understood that the terminology used in the examples is for the purpose of describing particular embodiments, and is not intended to limit the scope of the present invention; in the description and claims of the present application, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise.
When numerical ranges are given in the examples, it is understood that both endpoints of each of the numerical ranges and any value therebetween can be selected unless the invention otherwise indicated. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. In addition to the specific methods, devices, and materials used in the examples, any methods, devices, and materials similar or equivalent to those described in the examples may be used in the practice of the invention in addition to the specific methods, devices, and materials used in the examples, in keeping with the knowledge of one skilled in the art and with the description of the invention. Unless otherwise specified, the experimental procedures used in the present invention are well known to those skilled in the art.
The plasmids required for the quantitative display system were as follows:
the synNotch synthetic receptor is formed by fusing a plurality of GFP nano antibodies (including G1-G9), mCherry nano antibodies, HER2 single-chain antibodies, SARS-CoV S protein antibodies, SARS-CoV-2S protein antibodies, ACE2 chimeras (shown as SEQ ID NO. 7-SEQ ID NO. 26) of different species and ACE2 chimeras of different species (raccoon, dog, ferret, raccoon, bear, cat, paguma, horse, Chinese chrysanthemum head bat, cow, sheep, pig, camel, mouse, rat, squirrel, hedgehog, pangolin and monkey) with a commercial mouse Notch1 partial domain and Gal4-VP64 transcriptional activator respectively, and carrying a myc tag (purchased from addge, cat number: 79127) at the front end of the receptor.
The blue fluorescence reporter system is a fusion of a copy of the Gal4 DNA binding domain sequence to a portion of the CMV promoter and cloned in front of the BFP gene, as described in L.Morshut et al, Engineering custom Cell Sensing and Response Behaviors Using Synthetic Notch receptors, Cell 164,780-791(2016).
The hCD8 reporter system (nucleotide sequence shown in SEQ ID NO. 27) was constructed by fusing a copy of the Gal4 DNA binding domain sequence to the partial CMV promoter and cloning it in front of the hCD8 gene.
The Ag-mem expression vector is prepared by combining a plurality of proteins including GFP protein, mCherry protein, HER2 extracellular domain (nucleotide sequence shown in SEQ ID NO. 28), SARS-CoV S-RBD protein (nucleotide sequence shown in SEQ ID NO. 29), SARS-CoV-2S-RBD protein (nucleotide sequence shown in SEQ ID NO. 30), TGP protein (derived from D.W.Close et al, Thermal green protein, an extreme state, non-amplifying fluorescing fluorescent protein secreted by structure-regulated surface engineering, proteins 83,1225-1237(2015)) with PDGFR domain (sequence shown in SEQ ID NO. 5), and fusing myc tag at N-terminal.
The Ag-m-LC expression vector is formed by combining a plurality of proteins including GFP (including G1-G9), mChery protein, HER2 extracellular domain, SARS-CoV S-RBD protein, SARS-CoV-2S-RBD protein and TGP protein with PDGFR transmembrane domain (namely a sequence shown in SEQ ID NO. 5), fusing an intracellular peptide segment (aa562-599) of PDGFR receptor, and finally fusing different LC tags at the C end.
The nano antibody of GFP and the mCherry nano antibody in the expression vector are derived from the following sources: fridy et al, A robust pipeline for rapid production of very small antibodies Nat. methods 11,1253-1260 (2014).
HER2 single chain antibody is derived from: liu et al, Affinity-Tuned ErbB2 or EGFR Chimeric Antigen Receptor T Cells inhibition an incorporated Therapeutic Index against turbines in Rice cancer Res 75, 3596-.
The SARS-CoV S protein antibody CR3022 is derived from: j. ter Meulen et al, Human monoclonal antibody combination against SARS coronavirus, synergy and coverage of escape mutants, PLoS Med 3, e237 (2006).
SARS-CoV S protein antibody S230 is derived from: A.C. walls et al, connected receiver function minor semiconductors Activation of Coronavir fusion.cell 176, 1026-.
SARS-CoV-2S protein antibody m396 is derived from: Z.Zhu et al, content cross-reactive conjugation of SARS coronavirs isolates by human monoclonal antibodies. Proc Natl Acad Sci U S A104, 12123-.
The SARS-CoV-2S protein antibody CV30 is derived from: seydoux et al, Analysis of a SARS-CoV-2-fed induced visual developments of Point neural fibres with Limited textual information Immunity 53,98-105e105 (2020).
The LC tag comprises amino acids 1-214 of the human FUS protein, amino acids 1-478 of the FUS protein, amino acids 1-208 of the human TAF15 protein, amino acids 1-236 of the human DDX4 protein, amino acids 218-415 of the human TDP-43 protein, amino acids 46-266 of the human EWSR1 protein, amino acids 190-341 of the human HNRNPA1 protein, amino acids 186-320 of the human HNRNPA2 protein, and twenty, thirty, forty and fifty polymers with repeated arrangement of two amino acids of proline and arginine.
Example 1 construction and activation of a cell line quantitatively displaying the desired stably transformed Strain
Construction: GFP-mem fusion proteins were expressed on the surface of ligand-presenting cells (Ag-mem APC) and GFP-M-LC fusion proteins were expressed on the surface of ligand-quantitative-presenting cells (Ag-M-LC APC) and fused GFP and mCherry nanobody (anti-GFP: KD ═ 0.31. mu.M, anti-mCherry: KD ═ 0.18nM, antibody derived from Fridy, P.C.et. A robust pipeline for rapid production of a vertical negative antibody synthesized receptor, 1253. 1260, doi:10.1038/nmeth.3170(2014), respectively, on the surface of K562 cells, called receptor expressing cells or receptor expressing cells (ABC) (FIGS. 1A, 1B.) the method is as follows: obtaining a virus transfected transiently with the corresponding T293 in the art using the corresponding liposome vectors, and simultaneously infecting in the desired cells, and a myc label or a fluorescent gene is used for sorting to obtain a cell line for stably expressing the target gene. synNotch synthetic receptor expression cells stably express synNotch fusion expression vectors and reporter system expression vectors in K562 cells. The cell lines constructed were stable populations of transformed cells, not corresponding monoclonal cells.
Activating: the synNotch synthetic receptor expressing cells and the antigen expressing cells were counted separately using a hemocytometer, and the two cells were mixed at a ratio of 1:1, wherein 0.025million cells were added to each 96-well plate and suspended in 200. mu.l of medium, 0.1million cells were added to each 24-well plate and suspended in 1ml of medium, and 1million cell was added to each 10ml of medium during shaking culture in a culture flask. Mixing well, at 37 deg.C and 5% CO2After culturing for 48h in the incubator, the cells were detected by a fluorescence reporter system using a flow cytometer.
The results showed that the BFP reporter of anti-GFP synNotch synthetic receptor expressing cells was successfully activated within 48 hours of co-culture, whereas the reporter failed to be activated when mixed in equal proportion to anti-mCherry synNotch synthetic receptor expressing cells (fig. 1C, 1D). Vice versa, ligand-presenting cells expressing mCherry-mem fusion proteins successfully activated anti-mCherry synNotch antibody expressing cells, but not anti-GFP synNotch antibody expressing cells (fig. 1C, 1D). The experimental results show that the mammalian surface antibody display system effectively reports the existence of the combination between the molecules, and the synNotch synthesis receptor is activated with a certain threshold.
Example 2 quantitative detection of ligand-receptor binding force by a mammalian cell surface antibody display System
To examine the effect of intermolecular affinity exerted during synNotch synthetic receptor activation, GFP-APC and two anti-GFP synNotch ABC cell lines were run at 2: 1:1 for 48 hours. Although the affinity of the two anti-GFP nanobodies differed by two orders of magnitude ("Hi": KD ═ 3.5 nM; "Lo": KD ═ 600nM), a similar proportion of BFP positive cells was observed in both anti-GFP synNotch cells (fig. 1E). And in addition, the ratio of 1: 3 or 1: 9, the ABC content is reduced by reducing the number of APC cells in the system, so that the ABC of the high-affinity antibody can obtain competitive advantage under extreme conditions.
The results show that 1: 9-time anti-GFPHiThe proportion of BFP positive cells in SynNotch ABC was slightly increased (fig. 1E). Thus, molecular interactions with affinities in the micromolar to nanomolar range can activate synNotch synthetic receptors when stimulated by Ag-mem cells, but the activation of this receptor is not well correlated with intermolecular affinities.
Example 3 intracellular Low complexity sequence (LC) tags to achieve ligand-receptor affinity discrimination
Fusion of the intracellular domain of ligand-presenting cells with LC tags (FIG. 1D, called Ag-m-LC to distinguish from conventional Ag-mem) can significantly reduce anti-GFPLoActivation of blue fluorescent protein in synNotch cells (FIG. 1E), with a significant increase in anti-GFPHiAnd anti-GFPLoDifferences in fluorescence activation in synNotch cells (fig. 1E). For anti-GFPHisynNotch synthetic receptor expressing cells showed higher activation capacity of Ag-m-LC expressing APC compared to commonly used Ag-mem APC (FIG. 1E).
To further explore the affinity differentiation of Ag-m-LC, a panel of anti-GFP nanobodies with different affinities was cloned into synNotch synthetic receptor expressing cells. These anti-GFP nanobodies recognize two non-overlapping epitopes (interface of antibody-antigen interaction) on the GFP protein with affinities (KD) ranging from micromolar to sub-nanomolar (fig. 1F). Through expression quantity analysis of synNotch synthetic receptors located on the surface of antibody-expressing cells, the receptors fused with various anti-GFP nanobodies were found to have differences in quantity (FIG. 1F). When a panel of anti-GFP synNotch ABC cells were separately co-cultured with GFP-mem APC, the level of synNotch synthetic receptor activation was similar in cells of different affinities (FIG. 1G). Also in this case, the level of synNotch synthetic receptor activation is determined primarily by its number corresponding to the plasma membrane surface of the cell (FIG. 1F). When synNotch synthetic receptors are expressed in consistent amounts, affinity has less effect on the level of activation of the fluorescent reporter system in the recipient cells. However, when co-cultured with GFP-m-LC APC, activation of anti-GFP synNotch synthetic receptor was highly correlated with avidity, regardless of identity of the antibody-binding epitope or high or low expression level of ABC cell surface receptor (FIG. 1G). Thus, the addition of an LC tag to the intracellular region of an antigen allows for affinity-dependent activation of synNotch synthetic receptors, a system referred to as the quantitative display (qDis) system.
Example 4 reduction of cell surface antigen expression and increase of Total affinity are key points for achieving quantitative display
To explain how intracellular LC tags allow affinity-dependent activation of synNotch synthetic receptor-displaying cells, the following experiments were performed.
First, subcellular localization of the two different forms of membrane-bound GFP antigen described above was observed. Although the overall GFP fluorescence signal in the cells was at a similar level, GFP-mem was predominantly expressed in the cytoplasmic membrane, including the microvilli structure at the plasma membrane surface, while most of the GFP-m-LC fluorescence signal came from the intracellular membrane (FIGS. 2A, 2B). The expression level of GFP localized on the plasma membrane surface of the cell is detected by using a fluorescently-labeled anti-GFP antibody in combination with a flow cell surface protein staining technique. The experimental results showed that the GFP-m-LC plasma membrane surface displayed significantly reduced content of GFP protein (fig. 2C), confirming that the fusion of the LC tag reduced the amount of protein expressed on the plasma membrane.
Secondly, based on the above experimental results, the effect of the decrease in the expression level of the antigen on the activation of synNotch synthetic receptors was investigated by decreasing the expression level of the antigen on the surface in GFP-mem APC cells without LC tags. The SFFV promoter on the GFP-mem expression vector was replaced with CMV promoter without kozak sequence or with SV40 weak promoter, while flow cytometry experiments demonstrated that GFP-mem cells after promoter replacement had lower overall GFP expression levels and a significant reduction of plasma membrane surface GFP expression (fig. 2C). At this time, the GFP-mem cells with a low expression level showed the same level of surface GFP as the GFP-m-LC cells (FIG. 2C). When the corresponding APC was co-cultured with synNotch ABC cells of different affinities in a panel, the activation level of the fluorescent reporter system was greatly reduced in all synNotch synthetic receptor expressing cells by GFP-mem at low expression levels, but this reduction was not clearly correlated with affinity (FIG. 2D).
Third, the interaction between APC-ABC was visualized. To visualize the anti-GFP synNotch synthetic receptors in real time, the mCherry fluorescent tag was added to the extracellular domain of anti-GFP synNotch and stably expressed in HEK293T cells. When anti-GFP-mCherry-synNotch HEK293T cells were co-cultured with GFP-mem APC (K562 cells), an interface of GFP-anti-GFP molecular interaction was observed (FIG. 2E, 2F). Surprisingly, despite the low expression level of GFP-m-LC on the cell surface, significant aggregation was observed at the APC-ABC interaction interface when co-cultured with anti-GFP-mCherry-synNotch (FIG. 2F).
The three observations above reveal the role of intracellular LC tags in the quantitative display system (fig. 2G). Its low complexity allows a significant reduction in the expression of antigen on the cell surface of Ag-m-LC APC, thereby reducing the synNotch synthetic receptor-activated cellular response. In this case, phase separation driven aggregation between ligand-receptor will significantly increase receptor activation, and this aggregation will be influenced by intermolecular forces, i.e., affinity, such that activation of signaling in synNotch synthetic receptor expressing cells is proportional to the ligand-receptor affinity.
Based on the above conclusion, the activation condition of blue fluorescent protein after the mixed culture of ABC cells expressing anti-Her2 scFv synNotch synthetic receptors and APC cells with different surface antigen display modes is detected. Extracellular domain of human HER2 and PDGFR or PDGFR-FUSLCFusions, abbreviated as Her2-mem or Her2-m-LC (FIG. 2H). The results show that high affinity anti-Her2 synNotch synthetic receptor ABC cells show a higher proportion of fluorescent reporter activation when co-cultured with Her2-mem APC (FIG. 2K). However, when the antigen is Her2-The affinity distinction is more pronounced in the presence of the m-LC form (FIG. 2K). For anti-Her2 synNotch synthetic receptor expressing cells with nanomolar affinity, the expression level of each fusion receptor is consistent (fig. 2J), while the expression level of Her2-m-LC fusion antigen on the cytoplasmic membrane is lower, but the activation ratio of the fluorescence reporter gene in the whole cells can be obviously improved (fig. 2I). Therefore, the reduction of the expression amount of the Ag-m-LC cell surface antigen protein and the increase of the total affinity highlight the capability of quantitative display.
Example 5 affinity differentiation requires proper aggregation of low complexity sequences within cells
The protein phase separation/aggregation can significantly increase its concentration in localized areas. In Ag-m-LC cells, multivalent interactions are formed between LC tags inside the cytoplasmic membrane, thereby promoting aggregation of antigenic monomers and increasing avidity (avidity). A number of proteins have been shown to phase separate, and FUS proteins are typical examples of such proteins, and therefore in the initial experiments FUS was selectedLCPeptide fragments were studied. Various FUS mutants were designed to examine the effect of phase separation on affinity discrimination in the qDis system. Construction of FUS Using methods well known to those skilled in the artLC Q>G and G>A mutants, which have an influence on the hardness and flowability, respectively, of the protein agglutination complex. In an opto-Droplet system, FUSLC Q>The G mutant tag produced protein aggregation more rapidly (fig. 4A), while, when expressed as a fusion in the plasma membrane of the cell, the expression of antigen on the membrane was reduced (fig. 3A,4B), and thus synNotch synthesis receptor-expressing cells could not be activated (fig. 3A, 4B). FUSLC G>The a mutation tag can activate synNotch synthetic receptor expressing cells, but cannot distinguish the difference between different levels of affinity (fig. 3A, 4C). In addition, two mutants were designed to reduce the formation of aggregation, FUSLCThe Y27S mutant significantly reduced phase transition capacity and was not able to efficiently support quantitative display (fig. 3A,4D), whereas FUSLCThe 7A mutant slightly reduced phase transition capacity (fig. 4A), but still supported quantitative display (fig. 3A, 4E).
Long FUS peptide fragments (aa 1-478) and FUSLCThe 27R mutants could all be induced to phase separate at lower protein levels, but the interaction between sub-nanomolar GFP-anti-GFP failed to activate synNotch synthetic receptors (fig. 3A,3B, 4F). Observations indicate that the threshold concentration required for phase separation of proteins in three-dimensional (cytoplasmic) and two-dimensional (cell membrane) systems varies. These FUS mutants indicate that affinity discrimination requires the LC tag to have the appropriate ability to phase separate.
Example 6 multiple low complexity sequences can support quantitative display
Continued exploration into whether there are other proteins or fragments that form phase separation can also achieve affinity discrimination in the qDis system. A group of low complexity tags is designed, which comprises PrL structural domains of a plurality of FUS protein family members, LC regions of DDX4 genes and poly PR synthetic peptide fragments. Antigens fused to the LC domain of hnRNPA1 or hnRNPA2 lose the ability to activate synNotch synthetic receptors when the various PrL domains of the FUS protein family are overexpressed (fig. 5), and when antigens are fused to the LC tag from TDP43 or EWSR1, activation of synNotch synthetic receptors can be supported, but this activation is not clearly related to affinity (fig. 5). While fusing the LC tag from TAF15 (TAF 15)LC) Can also be mixed with FUSLCAs such, synNotch synthetic receptor activation was made proportional to affinity (fig. 3C). Likewise, the LC tag from DDX4 also supported quantitative display (fig. 3C). But the synthetic polyPR peptide did not support quantitative display (fig. 5). Notably, DDX4 has a different amino acid composition and overall sequence pattern compared to the PrL domain of the FUS protein family, suggesting that the phase separation process plays an important role in affinity discrimination in quantitative display systems. Although the current experimental results show that many LC tags cannot support quantitative display, modification of the protein sequences of the LC tags, such as adjusting the protein expression level to be close to the threshold required for phase separation in a 2D system, or changing the rate of multivalent binding of the protein, can make the LC tags reusable in a quantitative display system. Meanwhile, other polymer labels including a trimer Foldon label and a dimer antibody Fc (antibody crystallizable fragment) label are tested, and the experimental results show that the polymer labels are allCannot support affinity-dependent activation of synNotch synthetic receptors (FIG. 3D). In summary, more LC tags can be applied in a quantitative presentation system, whereas multimerization tags cannot be applied in a system. At the same time, the different sequence patterns of these LC tags further indicate that in this system, the receptor molecules are aggregated by phase separation to differentiate the affinities.
Example 7 quantitative display of ligand-receptor binding
Quantitative display systems can also be used as a quantitative method to reveal ligand-receptor interactions. To demonstrate this, the interaction between the spike protein RBD domain (S-RBD) of various SARS coronavirus types and ACE2 or its corresponding antibody was taken as an example.
First, the affinity between S-RBD and human ACE2 or several antibodies was examined. S-RBD was cloned into Ag-m-LC construct and hACE 2/antibody (scFv) was cloned into the cytoplasmic membrane extracellular domain of synNotch synthetic receptor. The binding of hACE2 to SARS-CoV-2S-RBD was much higher than that of hACE2 to SARS-CoV or SARSr-CoV S-RBD (FIG. 6A), which is consistent with the results of protein interaction in biochemical experiments reported previously. Three neutralizing antibodies against SARS-CoV were able to bind to SARS-CoV S-RBD with high affinity (FIG. 6A), while only CR3022 showed cross-binding between SARS-CoV and SARS-CoV-2 (FIG. 6A). Vice versa, the SARS-CoV-2 neutralizing antibody (CV30) binds to SARS-CoV-2S-RBD only with high affinity (FIG. 6A).
Second, the interaction between S-RBD and ACE2 of different species was examined. Based on the spatial structure of the S-RBD/ACE2 interaction, the key peptide fragment of hACE2 interacting with S-RBD was replaced by the corresponding protein sequence of other species (FIG. 6C). After co-culturing S-RBD-m-LC cells with ACE2 chimera-synNotch cells, the binding force between S-RBD and ACE2 protein of different species was quantitatively shown by a fluorescence reporter system (FIG. 6B). SARS-CoV-2S-RBD binds hACE2 with high affinity compared to SARS-CoV S-RBD and interacts with chimeric ACE2 from cats, dogs, etc. with low affinity (FIG. 6B).
Example 8 quantitative display System for Rapid batch isolation of high affinity antibodies
The quantitative display system can directly separate the high-affinity antibody without subsequent large-scale screening and verification experiments. To develop the quantitative display system into a screening system for high affinity antibodies, the fluorescent reporter in synNotch synthetic receptor expressing cells was replaced with the hCD8 reporter (fig. 7A). In this system, hCD8 gene initiates expression and localizes expression to the cytoplasmic membrane when synNotch synthetic receptor expressing cells are specifically activated, at which time hCD8 positive cells can be enriched by magnetic cell sorting (MACS). As with blue fluorescent protein activation, the hCD8 protein expression on the surface was also proportional after binding of receptors of different affinities to the corresponding Ag-m-LC cells (fig. 7B). ABC cells of two high (nM) (red fluorescent protein labeled) and low (μ M) affinities were separately plated at 1: 1000,1: the qDis-MACS system was tested for discriminatory ability by mixing at 100 or 1:10 ratios. In each round, two ABCs were mixed in the above ratio, co-cultured with an equal amount of Ag-m-LC APC cells, and synNotch synthetic receptor activated ABC cells were enriched with magnetic beads carrying anti-hCD8 antibody. By examining the ratio of two high/low affinity ABC cells after each round of qDis-MACS enrichment, it was found that high affinity ABC cells could be enriched 3 to 14-fold (fig. 7C). Meanwhile, after two rounds of screening, the rare high-affinity antibody (0.1% of total ABC) in the original cell population can be significantly enriched to about 10%.
To test whether the quantitative display system could be applied to screen for high affinity antibodies, a phage display and mammalian quantitative display system were combined to screen for high affinity antibodies against the thermostable gfp (tgp) protein (fig. 7D). Although phage display libraries can reach 1011Orders of magnitude, but usually requires that hundreds of clones be expressed individually in subsequent validation steps and further tested for affinity by ELISA or similar methods after two or three rounds of enrichment. Antibody libraries from prior art phage display were cloned into a quantitative display system and potential high affinity antibodies were selected by three rounds of qDis-MCAS. Based on the fold enrichment of different antibodies in each round of display, 19 clones were selected from qDisTo verify binding affinity and to compare with randomly selected clones directly from phage display libraries (FIG. 7E). The results showed that 18 of the 19 antibodies from the quantitative display system were able to detect binding to TGP by BLI, while only 4 of the 188 antibodies randomly selected were able to detect binding to TGP. Of these 18 antibodies, 16 bound TGP in the nanomolar range, and two additional antibodies bound TGP with picomolar affinity (fig. 7F, 8). Therefore, the quantitative display system can be used as a method for separating high-affinity antibodies from a medium-sized antibody library, and the high-affinity antibodies can be separated in batches by combining a conventional phage display technology with the quantitative display system.
The specific experimental method is as follows:
1. construction of synNotch antibody library
The puromycin resistance gene was added to synNotch vector and used as a new vector backbone. Primers containing synNotch vector backbone linker sequences were designed to specifically amplify the antibodies in the library of interest. And recombining the recovered product with the vector skeleton after enzyme digestion by using a Gibson assembly method, and constructing a new recombinant library in escherichia coli by using a bacterial electrotransfer method.
2. Construction of stably transfected cell lines for qDIS-MACS screening
Retroviruses corresponding to expression vectors were obtained by transient transfection of liposomes in 293T. Overexpression of Ag-mem-FUS in K562 cell lineLCAnd obtaining the cell line stably expressing the fusion antigen by flow sorting. The hCD8 reporter system was overexpressed in the K562 cell line and a stable transgenic cell line was selected using the Blastidin resistance gene. The synNotch antibody library is over-expressed in a K562-hCD8 cell line, and after a cell population expressing the antibody library is screened for 7 days by using puromycin resistance genes to obtain a stable cell population, the next experiment is carried out.
3. Activating qDIS system
The number of cells is always ensured to be more than 100 times of the number of the antibody library during cell passage. The cell population was counted on day 8, and synNotch synthetic receptor-expressing cells and antigen-expressing cells were resuspended in 100ml of medium and shake-cultured at 37 ℃ in a 5% CO2 incubator at 50rpm/min for 48 h.
MACS enriched hCD8 reporter positive cells
The co-cultured cells were counted and collected, centrifuged at 300g for 5min and resuspended in 10ml MACS buffer, centrifuged at 300g for 5min and the cells were plated at 80. mu.l/107The proportion of cells was resuspended. According to 20. mu.l/107Adding anti-hCD8 microbeads according to the proportion of the cells, uniformly mixing, and incubating at 4 ℃ for 15 min. After the incubation was complete, 5ml of MACS buffer was added and centrifuged at 300g for 5min to remove excess unbound microbeads. After centrifugation, the cells were resuspended in 500. mu.l buffer and added to an MS sorting column prepared in advance to be placed in a magnetic field. After the cells completely passed through the sorting column, unbound hCD8 negative cells were removed by adding 1ml of MACS buffer and repeating the procedure twice more. And finally, removing the sorting column from the magnetic field, adding 1ml of MACS buffer solution, quickly beating out the cells, and finally collecting the cells which are positive to the hCD8 reporter gene. And taking a small amount of obtained positive cells, staining the positive cells, and detecting the expression level of hCD8 in the enriched cells by using flow cytometry.
5. Multi-round qDIS-MACS enrichment
And (3) centrifuging the cells obtained by enrichment, re-suspending, taking a part of the cells, extracting genomic DNA, and remaining most of the cells for continuous culture. After the expression level of the hCD8 in the round of cells is remarkably reduced, the 10 million-enriched cells and new Ag-m-FUS are continuously takenLCThe cells were mixed and the experiment was terminated after three rounds of enrichment.
6. Cell genome DNA enrichment and library construction
The extracted genomic DNA was dissolved in TE solution at a ratio of 20. mu.l/million cell, and the amount of cells was 100 times the volume of the antibody library as the initial cell amount for the library construction. The coding sequence of the antibody gene is amplified by using Q5 high fidelity polymerase, and in one round of amplification, 5 mul of DNA is added into each 50 mul of reaction system to be used as a template, and 20 cycles of amplification are carried out. In the two-round amplification, 4 reactions were performed for each sample, and 3. mu.l of one-round product was added to 50. mu.l of the reaction system as a template for 20 cycles of amplification. And (3) performing gel electrophoresis on the 2 nd round PCR product, recovering a target band, adding 50ng of the target band serving as a template for three rounds of amplification, and amplifying for 8 cycles. Finally, the third round of amplification product is purified and recovered, and high-throughput sequencing is carried out.
7. Analysis of antibody enrichment Using bioinformatics
And respectively extracting the sequences of the CDR1, the CDR2 and the CDR3 in each sequence from the original sequencing file according to the fixed sequences at the two sides of the CDR, and counting the corresponding occurrence times of the sequences. And comparing the sequence with the statistical frequency of 1 with other sequences, and classifying the sequences as the same sequence when the CDR sequences only differ by 1 base. And combining the sequencing results of multiple screens into the same file, carrying out homogenization treatment on the results according to the total data measured in each experiment, and filtering sequences which appear only once in the plasmid library and sequences which appear less than 10 times in the previous three enrichments. Finally, selecting 15 obviously enriched antibodies through the multiple difference of the single antibodies after the first round and the third round of screening, and selecting 4 obviously enriched antibodies through the multiple difference of the single antibodies in the initial cells after the third round of screening.
8. Antibody purification
Primers were designed, expression vectors for 19 antibodies were constructed, and verified by sequencing. The constructed plasmid was transformed into E.coli MC1061 strain and 0.02% (w/v) arabinose induced protein expression overnight. Bacterial cultures were harvested, resuspended in TES buffer (20% (w/v) Sucrose,0.5mM EDTA,200mM Tris pH 8.0), spun at 4 ℃ for 30min, added two volumes of chilled ice water and spun for 1 h. Centrifuge at 4 degrees for 30min at 20,000 g. And (3) performing protein purification on the centrifuged supernatant by using Ni-NTA, and finally adding an eluent containing 300mM imidazole to elute the target protein. Protein concentration was measured using Bradford, while purified protein band size was verified using SDS-PAGE.
9. Antibody affinity validation
Affinity between the purified antibody and TGP antigen was analyzed using an oct Red 96 protein interactor. Biotin labeling was fused to TGP, and the protein was diluted to 2. mu.g/ml with buffer (1xPBS, 0.02% Tween-20) and immobilized on an SA sensor with a control signal of about 0.6. The antibody was diluted to 0.2mg/ml (12.6. mu.M), further diluted to 100nM and finally diluted in a 2-fold gradient to 50nM,25nM,12.5nM,6.25nM,3.7nM, etc. During the experiment, antigen and antibody were allowed to bind for 240s, dissociate for 240s, and the corresponding binding and dissociation constants were calculated on the analytical software using a 1:1 binding mode. After completion of one set of experiments, the sensors were regenerated using a regeneration solution (10mM glycine, PH 2.0) to remove bound antibody, corresponding to the procedure of 5s regeneration, 5s equilibration, and the above procedure was repeated three times. And (4) repeatedly using the regenerated sensor to measure other antibodies, and when the signal of the sensor is obviously weakened, reusing a new sensor to measure.
The invention fuses a series of proteins which can be separated in cells with antigens, and realizes a quantitative display system based on affinity in mammalian cells. Among other things, the low complexity domain of the protein can help to differentiate the affinity of protein interactions outside the plasma membrane of the cell. Receptor aggregation and associated coagulation of downstream effector proteins contribute to signal transduction. The synNotch synthetic receptors and antigen presentation systems of the present invention indicate that aggregation of antigenic molecules inside the plasma membrane of cells may affect extracellular binding. Whether lymphocyte antigen receptors such as BCR and TCR also distinguish antigens with different affinities by a similar strategy is worthy of continued investigation in the future.
From the experimental results of the present invention, the qDis system provides a new method for the existing surface display technology, and high affinity antibodies can be found in batches. The size of the mammalian display library is limited by plasmid transfection efficiency and cell number, and these limitations can be overcome by combining multiple display systems. Antibodies selected from the initial or synthetic display libraries are generally of lower affinity, in which case the qDis system can help find high affinity antibodies in the library that are less abundant. Furthermore, by combining in vitro mutagenesis and qDis systems, we provided the possibility to mimic the antibody affinity process in germinal centers in vitro. In addition to searching for antibodies, affinity-based displays can also be used to reveal ligand-receptor binding, providing a quantitative approach to finding and/or validating potential invading receptors of viruses and potential ligand-receptor pairs in cellular interactions.
The above examples are intended to illustrate the disclosed embodiments of the invention and are not to be construed as limiting the invention. In addition, various modifications of the invention set forth herein, as well as variations of the methods of the invention, will be apparent to persons skilled in the art without departing from the scope and spirit of the invention. While the invention has been specifically described in connection with various specific preferred embodiments thereof, it should be understood that the invention should not be unduly limited to such specific embodiments. Indeed, various modifications of the above-described embodiments which are obvious to those skilled in the art to which the invention pertains are intended to be covered by the scope of the present invention.
Sequence listing
<110> center of outstanding innovation in molecular cell science of Chinese academy of sciences
<120> cell surface macromolecule quantitative display system and preparation method and application thereof
<160> 30
<170> SIPOSequenceListing 1.0
<210> 1
<211> 214
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 1
Met Ala Ser Asn Asp Tyr Thr Gln Gln Ala Thr Gln Ser Tyr Gly Ala
1 5 10 15
Tyr Pro Thr Gln Pro Gly Gln Gly Tyr Ser Gln Gln Ser Ser Gln Pro
20 25 30
Tyr Gly Gln Gln Ser Tyr Ser Gly Tyr Ser Gln Ser Thr Asp Thr Ser
35 40 45
Gly Tyr Gly Gln Ser Ser Tyr Ser Ser Tyr Gly Gln Ser Gln Asn Thr
50 55 60
Gly Tyr Gly Thr Gln Ser Thr Pro Gln Gly Tyr Gly Ser Thr Gly Gly
65 70 75 80
Tyr Gly Ser Ser Gln Ser Ser Gln Ser Ser Tyr Gly Gln Gln Ser Ser
85 90 95
Tyr Pro Gly Tyr Gly Gln Gln Pro Ala Pro Ser Ser Thr Ser Gly Ser
100 105 110
Tyr Gly Ser Ser Ser Gln Ser Ser Ser Tyr Gly Gln Pro Gln Ser Gly
115 120 125
Ser Tyr Ser Gln Gln Pro Ser Tyr Gly Gly Gln Gln Gln Ser Tyr Gly
130 135 140
Gln Gln Gln Ser Tyr Asn Pro Pro Gln Gly Tyr Gly Gln Gln Asn Gln
145 150 155 160
Tyr Asn Ser Ser Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Asn
165 170 175
Tyr Gly Gln Asp Gln Ser Ser Met Ser Ser Gly Gly Gly Ser Gly Gly
180 185 190
Gly Tyr Gly Asn Gln Asp Gln Ser Gly Gly Gly Gly Ser Gly Gly Tyr
195 200 205
Gly Gln Gln Asp Arg Gly
210
<210> 2
<211> 478
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 2
Met Ala Ser Asn Asp Tyr Thr Gln Gln Ala Thr Gln Ser Tyr Gly Ala
1 5 10 15
Tyr Pro Thr Gln Pro Gly Gln Gly Tyr Ser Gln Gln Ser Ser Gln Pro
20 25 30
Tyr Gly Gln Gln Ser Tyr Ser Gly Tyr Ser Gln Ser Thr Asp Thr Ser
35 40 45
Gly Tyr Gly Gln Ser Ser Tyr Ser Ser Tyr Gly Gln Ser Gln Asn Thr
50 55 60
Gly Tyr Gly Thr Gln Ser Thr Pro Gln Gly Tyr Gly Ser Thr Gly Gly
65 70 75 80
Tyr Gly Ser Ser Gln Ser Ser Gln Ser Ser Tyr Gly Gln Gln Ser Ser
85 90 95
Tyr Pro Gly Tyr Gly Gln Gln Pro Ala Pro Ser Ser Thr Ser Gly Ser
100 105 110
Tyr Gly Ser Ser Ser Gln Ser Ser Ser Tyr Gly Gln Pro Gln Ser Gly
115 120 125
Ser Tyr Ser Gln Gln Pro Ser Tyr Gly Gly Gln Gln Gln Ser Tyr Gly
130 135 140
Gln Gln Gln Ser Tyr Asn Pro Pro Gln Gly Tyr Gly Gln Gln Asn Gln
145 150 155 160
Tyr Asn Ser Ser Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Asn
165 170 175
Tyr Gly Gln Asp Gln Ser Ser Met Ser Ser Gly Gly Gly Ser Gly Gly
180 185 190
Gly Tyr Gly Asn Gln Asp Gln Ser Gly Gly Gly Gly Ser Gly Gly Tyr
195 200 205
Gly Gln Gln Asp Arg Gly Gly Arg Gly Arg Gly Gly Ser Gly Gly Gly
210 215 220
Gly Gly Gly Gly Gly Gly Gly Tyr Asn Arg Ser Ser Gly Gly Tyr Glu
225 230 235 240
Pro Arg Gly Arg Gly Gly Gly Arg Gly Gly Arg Gly Gly Met Gly Gly
245 250 255
Ser Asp Arg Gly Gly Phe Asn Lys Phe Gly Gly Pro Arg Asp Gln Gly
260 265 270
Ser Arg His Asp Ser Glu Gln Asp Asn Ser Asp Asn Asn Thr Ile Phe
275 280 285
Val Gln Gly Leu Gly Glu Asn Val Thr Ile Glu Ser Val Ala Asp Tyr
290 295 300
Phe Lys Gln Ile Gly Ile Ile Lys Thr Asn Lys Lys Thr Gly Gln Pro
305 310 315 320
Met Ile Asn Leu Tyr Thr Asp Arg Glu Thr Gly Lys Leu Lys Gly Glu
325 330 335
Ala Thr Val Ser Phe Asp Asp Pro Pro Ser Ala Lys Ala Ala Ile Asp
340 345 350
Trp Phe Asp Gly Lys Glu Phe Ser Gly Asn Pro Ile Lys Val Ser Phe
355 360 365
Ala Thr Arg Arg Ala Asp Phe Asn Arg Gly Gly Gly Asn Gly Arg Gly
370 375 380
Gly Arg Gly Arg Gly Gly Pro Met Gly Arg Gly Gly Tyr Gly Gly Gly
385 390 395 400
Gly Ser Gly Gly Gly Gly Arg Gly Gly Phe Pro Ser Gly Gly Gly Gly
405 410 415
Gly Gly Gly Gln Gln Arg Ala Gly Asp Trp Lys Cys Pro Asn Pro Thr
420 425 430
Cys Glu Asn Met Asn Phe Ser Trp Arg Asn Glu Cys Asn Gln Cys Lys
435 440 445
Ala Pro Lys Pro Asp Gly Pro Gly Gly Gly Pro Gly Gly Ser His Met
450 455 460
Gly Gly Asn Tyr Gly Asp Asp Arg Arg Gly Gly Arg Gly Gly
465 470 475
<210> 3
<211> 208
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 3
Met Ser Asp Ser Gly Ser Tyr Gly Gln Ser Gly Gly Glu Gln Gln Ser
1 5 10 15
Tyr Ser Thr Tyr Gly Asn Pro Gly Ser Gln Gly Tyr Gly Gln Ala Ser
20 25 30
Gln Ser Tyr Ser Gly Tyr Gly Gln Thr Thr Asp Ser Ser Tyr Gly Gln
35 40 45
Asn Tyr Ser Gly Tyr Ser Ser Tyr Gly Gln Ser Gln Ser Gly Tyr Ser
50 55 60
Gln Ser Tyr Gly Gly Tyr Glu Asn Gln Lys Gln Ser Ser Tyr Ser Gln
65 70 75 80
Gln Pro Tyr Asn Asn Gln Gly Gln Gln Gln Asn Met Glu Ser Ser Gly
85 90 95
Ser Gln Gly Gly Arg Ala Pro Ser Tyr Asp Gln Pro Asp Tyr Gly Gln
100 105 110
Gln Asp Ser Tyr Asp Gln Gln Ser Gly Tyr Asp Gln His Gln Gly Ser
115 120 125
Tyr Asp Glu Gln Ser Asn Tyr Asp Gln Gln His Asp Ser Tyr Ser Gln
130 135 140
Asn Gln Gln Ser Tyr His Ser Gln Arg Glu Asn Tyr Ser His His Thr
145 150 155 160
Gln Asp Asp Arg Arg Asp Val Ser Arg Tyr Gly Glu Asp Asn Arg Gly
165 170 175
Tyr Gly Gly Ser Gln Gly Gly Gly Arg Gly Arg Gly Gly Tyr Asp Lys
180 185 190
Asp Gly Arg Gly Pro Met Thr Gly Ser Ser Gly Gly Asp Arg Gly Gly
195 200 205
<210> 4
<211> 236
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 4
Met Gly Asp Glu Asp Trp Glu Ala Glu Ile Asn Pro His Met Ser Ser
1 5 10 15
Tyr Val Pro Ile Phe Glu Lys Asp Arg Tyr Ser Gly Glu Asn Gly Asp
20 25 30
Asn Phe Asn Arg Thr Pro Ala Ser Ser Ser Glu Met Asp Asp Gly Pro
35 40 45
Ser Arg Arg Asp His Phe Met Lys Ser Gly Phe Ala Ser Gly Arg Asn
50 55 60
Phe Gly Asn Arg Asp Ala Gly Glu Cys Asn Lys Arg Asp Asn Thr Ser
65 70 75 80
Thr Met Gly Gly Phe Gly Val Gly Lys Ser Phe Gly Asn Arg Gly Phe
85 90 95
Ser Asn Ser Arg Phe Glu Asp Gly Asp Ser Ser Gly Phe Trp Arg Glu
100 105 110
Ser Ser Asn Asp Cys Glu Asp Asn Pro Thr Arg Asn Arg Gly Phe Ser
115 120 125
Lys Arg Gly Gly Tyr Arg Asp Gly Asn Asn Ser Glu Ala Ser Gly Pro
130 135 140
Tyr Arg Arg Gly Gly Arg Gly Ser Phe Arg Gly Cys Arg Gly Gly Phe
145 150 155 160
Gly Leu Gly Ser Pro Asn Asn Asp Leu Asp Pro Asp Glu Cys Met Gln
165 170 175
Arg Thr Gly Gly Leu Phe Gly Ser Arg Arg Pro Val Leu Ser Gly Thr
180 185 190
Gly Asn Gly Asp Thr Ser Gln Ser Arg Ser Gly Ser Gly Ser Glu Arg
195 200 205
Gly Gly Tyr Lys Gly Leu Asn Glu Glu Val Ile Thr Gly Ser Gly Lys
210 215 220
Asn Ser Trp Lys Ser Glu Ala Glu Gly Gly Glu Ser
225 230 235
<210> 5
<211> 302
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 5
Asn Ala Val Gly Gln Asp Thr Gln Glu Val Ile Val Val Pro His Ser
1 5 10 15
Leu Pro Phe Lys Val Val Val Ile Ser Ala Ile Leu Ala Leu Val Val
20 25 30
Leu Thr Ile Ile Ser Leu Ile Ile Leu Ile Met Leu Trp Gln Lys Lys
35 40 45
Pro Arg Tyr Glu Ile Arg Trp Lys Val Ile Glu Ser Val Ser Ser Asp
50 55 60
Gly His Glu Tyr Ile Tyr Val Asp Pro Met Gln Leu Pro Tyr Asp Ser
65 70 75 80
Thr Trp Glu Leu Pro Arg Asp Gln Met Ala Ser Asn Asp Tyr Thr Gln
85 90 95
Gln Ala Thr Gln Ser Tyr Gly Ala Tyr Pro Thr Gln Pro Gly Gln Gly
100 105 110
Tyr Ser Gln Gln Ser Ser Gln Pro Tyr Gly Gln Gln Ser Tyr Ser Gly
115 120 125
Tyr Ser Gln Ser Thr Asp Thr Ser Gly Tyr Gly Gln Ser Ser Tyr Ser
130 135 140
Ser Tyr Gly Gln Ser Gln Asn Thr Gly Tyr Gly Thr Gln Ser Thr Pro
145 150 155 160
Gln Gly Tyr Gly Ser Thr Gly Gly Tyr Gly Ser Ser Gln Ser Ser Gln
165 170 175
Ser Ser Tyr Gly Gln Gln Ser Ser Tyr Pro Gly Tyr Gly Gln Gln Pro
180 185 190
Ala Pro Ser Ser Thr Ser Gly Ser Tyr Gly Ser Ser Ser Gln Ser Ser
195 200 205
Ser Tyr Gly Gln Pro Gln Ser Gly Ser Tyr Ser Gln Gln Pro Ser Tyr
210 215 220
Gly Gly Gln Gln Gln Ser Tyr Gly Gln Gln Gln Ser Tyr Asn Pro Pro
225 230 235 240
Gln Gly Tyr Gly Gln Gln Asn Gln Tyr Asn Ser Ser Ser Gly Gly Gly
245 250 255
Gly Gly Gly Gly Gly Gly Gly Asn Tyr Gly Gln Asp Gln Ser Ser Met
260 265 270
Ser Ser Gly Gly Gly Ser Gly Gly Gly Tyr Gly Asn Gln Asp Gln Ser
275 280 285
Gly Gly Gly Gly Ser Gly Gly Tyr Gly Gln Gln Asp Arg Gly
290 295 300
<210> 6
<211> 906
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
aacgccgtgg gccaggatac gcaggaggtg atcgtcgtgc cccacagctt gcccttcaag 60
gtcgtggtta ttagtgctat cctggcgttg gtggtgctga ctattatctc ccttatcata 120
cttatcatgc tgtggcaaaa aaaaccccga tacgagatcc gatggaaggt gattgagtct 180
gtgagctctg acggccatga gtacatctac gtggacccca tgcagctgcc ctatgactcc 240
acgtgggagc tgccgcggga ccagatggcc tcaaacgatt atacccaaca agcaacccaa 300
agctatgggg cctaccccac ccagcccggg cagggctatt cccagcagag cagtcagccc 360
tacggacagc agagttacag tggttatagc cagtccacgg acacttcagg ctatggccag 420
agcagctatt cttcttatgg ccagagccag aacacaggct atggaactca gtcaactccc 480
cagggatatg gctcgactgg cggctatggc agtagccaga gctcccaatc gtcttacggg 540
cagcagtcct cctatcctgg ctatggccag cagccagctc ccagcagcac ctcgggaagt 600
tacggtagca gttctcagag cagcagctat gggcagcccc agagtgggag ctacagccag 660
cagcctagct atggtggaca gcagcaaagc tatggacagc agcaaagcta taatccccct 720
cagggctatg gacagcagaa ccagtacaac agcagcagtg gtggtggagg tggaggtgga 780
ggtggaggta actatggcca agatcaatcc tccatgagta gtggtggtgg cagtggtggc 840
ggttatggca atcaagacca gagtggtgga ggtggcagcg gtggctatgg acagcaggac 900
cgtgga 906
<210> 7
<211> 595
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 7
Ser Thr Glu Asp Leu Val Asn Thr Phe Leu Glu Lys Phe Asn Tyr Glu
1 5 10 15
Ala Glu Glu Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr Asn
20 25 30
Thr Asn Ile Thr Asp Glu Asn Leu Gln Lys Met Asn Asn Ala Gly Ala
35 40 45
Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Leu Ala Lys Thr Tyr
50 55 60
Pro Leu Glu Glu Ile Gln Asp Ser Thr Val Lys Arg Gln Leu Arg Ala
65 70 75 80
Leu Gln His Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys Arg
85 90 95
Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly Lys
100 105 110
Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro Gly
115 120 125
Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu Trp
130 135 140
Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro Leu
145 150 155 160
Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn His
165 170 175
Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn Gly
180 185 190
Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val Glu
195 200 205
His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala Tyr
210 215 220
Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro Ile
225 230 235 240
Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe Trp
245 250 255
Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn Ile
260 265 270
Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg Ile
275 280 285
Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn Met
290 295 300
Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Ser Asp Ser
305 310 315 320
Trp Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Arg Gly Asp
325 330 335
Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu Thr
340 345 350
Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala Ala
355 360 365
Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu Ala
370 375 380
Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu Lys
385 390 395 400
Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr Glu
405 410 415
Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu Pro
420 425 430
Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly Glu
435 440 445
Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg Glu
450 455 460
Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys Asp
465 470 475 480
Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg Tyr
485 490 495
Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys Gln
500 505 510
Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn Ser
515 520 525
Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys Ser
530 535 540
Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn Met
545 550 555 560
Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp Leu
565 570 575
Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp Ser
580 585 590
Pro Tyr Ala
595
<210> 8
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 8
Ser Thr Glu Asp Leu Val Lys Thr Phe Leu Glu Lys Phe Asn Tyr Glu
1 5 10 15
Ala Glu Glu Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr Asn
20 25 30
Ile Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Asn Ala Gly Ala
35 40 45
Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Leu Ala Lys Thr Tyr
50 55 60
Pro Leu Glu Glu Ile Gln Asp Ser Thr Val Lys Arg Gln Leu Arg Ala
65 70 75 80
Leu Gln His Ser Gly Ser Ser Val Leu Ser Ala Asp Lys Asn Gln Arg
85 90 95
Leu Asn Thr Ile Leu Asn Ser Met Ser Thr Val Tyr Ser Thr Gly Lys
100 105 110
Ala Cys Asn Pro Ser Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro Gly
115 120 125
Leu Asp Asp Ile Met Glu Asn Ser Lys Asp Tyr Asn Glu Arg Leu Trp
130 135 140
Ala Trp Glu Gly Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro Leu
145 150 155 160
Tyr Glu Glu Tyr Val Ala Leu Lys Asn Glu Met Ala Arg Ala Asn Asn
165 170 175
Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Glu Glu Trp
180 185 190
Glu Asn Gly Tyr Asn Tyr Ser Arg Asn Gln Leu Ile Asp Asp Val Glu
195 200 205
Leu Thr Phe Thr Gln Ile Met Pro Leu Tyr Gln His Leu His Ala Tyr
210 215 220
Val Arg Thr Lys Leu Met Asp Thr Tyr Pro Ser Tyr Ile Ser Pro Thr
225 230 235 240
Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe Trp
245 250 255
Thr Asn Leu Tyr Pro Leu Thr Val Pro Phe Gly Gln Lys Pro Asn Ile
260 265 270
Asp Val Thr Asn Ala Met Val Asn Gln Ser Trp Asp Ala Arg Lys Ile
275 280 285
Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn Met
290 295 300
Thr Gln Glu Phe Trp Gly Asn Ser Met Leu Thr Glu Pro Ser Asp Ser
305 310 315 320
Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly Asp
325 330 335
Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu Thr
340 345 350
Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala Ala
355 360 365
Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu Ala
370 375 380
Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Asn His Leu Lys
385 390 395 400
Asn Ile Gly Leu Leu Pro Pro Ser Phe Phe Glu Asp Ser Glu Thr Glu
405 410 415
Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu Pro
420 425 430
Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly Glu
435 440 445
Ile Pro Lys Asp Gln Trp Met Lys Thr Trp Trp Glu Met Lys Arg Asn
450 455 460
Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys Asp
465 470 475 480
Pro Ala Ser Leu Phe His Val Ala Asn Asp Tyr Ser Phe Ile Arg Tyr
485 490 495
Tyr Thr Arg Thr Ile Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys Gln
500 505 510
Ile Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn Ser
515 520 525
Ser Glu Ala Gly Gln Lys Leu Leu Glu Met Leu Lys Leu Gly Lys Ser
530 535 540
Lys Pro Trp Thr Tyr Ala Leu Glu Ile Val Val Gly Ala Lys Asn Met
545 550 555 560
Asp Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp Leu
565 570 575
Lys Glu Gln Asn Arg Asn Ser Phe Val Gly Trp Asn Thr Asp Trp Ser
580 585 590
Pro Tyr Ala Asp
595
<210> 9
<211> 595
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 9
Ser Thr Thr Glu Asp Leu Ala Lys Thr Phe Leu Glu Lys Phe Asn Tyr
1 5 10 15
Glu Ala Glu Glu Leu Ser Tyr Gln Asn Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Ile Gln Lys Met Asn Ile Ala Gly
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Glu Ser Gln His Ala Lys Thr
50 55 60
Tyr Pro Leu Glu Glu Ile Gln Asp Pro Ile Ile Lys Arg Gln Leu Arg
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Ala Asp Lys Arg Glu
85 90 95
Arg Leu Asn Thr Ile Leu Asn Ala Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Ala Cys Asn Pro Asn Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asp Asp Ile Met Glu Asn Ser Lys Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Gly Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Ala Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
Asn Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Glu Glu
180 185 190
Trp Ala Asp Gly Tyr Ser Tyr Ser Arg Asn Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Thr Gln Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asp Ala Tyr Pro Ser Arg Ile Ser Pro
225 230 235 240
Thr Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Pro Leu Met Val Pro Phe Arg Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asn Gln Ser Trp Asp Ala Arg Arg
275 280 285
Ile Phe Glu Glu Ala Glu Thr Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Glu Gly Phe Trp Gln Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Asn Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Arg
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Glu Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Asn His Leu
385 390 395 400
Lys Asn Ile Gly Leu Leu Pro Pro Asp Phe Ser Glu Asp Ser Glu Thr
405 410 415
Asp Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Glu Gln Trp Met Gln Lys Trp Trp Glu Met Lys Arg
450 455 460
Asp Ile Val Gly Val Val Glu Pro Leu Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ala Leu Phe His Val Ala Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Ile Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ile Ala Lys His Glu Gly Pro Leu Tyr Lys Cys Asp Ile Ser Asn
515 520 525
Ser Ser Glu Ala Gly Gln Lys Leu His Glu Met Leu Ser Leu Gly Arg
530 535 540
Ser Lys Pro Trp Thr Phe Ala Leu Glu Arg Val Val Gly Ala Lys Thr
545 550 555 560
Met Asp Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Glu Gln Asn Arg Asn Ser Phe Val Gly Trp Asn Thr Asp Trp
580 585 590
Ser Pro Tyr
595
<210> 10
<211> 595
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 10
Ser Thr Thr Glu Asp Leu Ala Asn Thr Phe Leu Glu Asn Phe Asn Asn
1 5 10 15
Glu Thr Glu Glu Leu Ser Tyr Gln Asn Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Ile Gln Lys Met Asn Asp Ala Ala
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Asp Glu Gln Ser Lys Gln Ala Lys Thr
50 55 60
Tyr Pro Leu Glu Glu Ile Gln Asp Pro Thr Asn Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln His Ser Gly Ser Ser Val Leu Ser Ala Asp Lys Arg Glu
85 90 95
Arg Leu Asn Thr Ile Leu Asn Ala Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Thr Cys Asn Pro Asn Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asp Asp Ile Met Glu Asn Ser Lys Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Gly Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Thr Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
Asn Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Glu Glu
180 185 190
Trp Ala Asp Gly Tyr Asn Tyr Ser Arg Ser Gln Leu Ile Asp Asp Val
195 200 205
Glu His Thr Phe Lys Gln Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asp Thr Tyr Pro Ser His Met Ser Pro
225 230 235 240
Thr Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Pro Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asn Gln Ser Trp Asp Ala Arg Arg
275 280 285
Ile Phe Glu Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Asn Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Ser His Leu
385 390 395 400
Lys Asn Ile Gly Leu Leu Pro Pro Gly Phe Ser Glu Asp Asn Glu Thr
405 410 415
Asp Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Glu Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Leu Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ala Leu Phe His Val Ala Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Ile Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ile Ala Lys His Glu Gly Pro Leu Tyr Lys Cys Asp Ile Ser Asn
515 520 525
Ser Arg Glu Ala Gly Gln Lys Leu Leu Glu Met Leu Arg Leu Gly Arg
530 535 540
Ser Lys Pro Trp Thr Leu Ala Leu Glu Thr Val Val Gly Ala Lys Thr
545 550 555 560
Met Asp Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Gln Glu Arg Asn Arg Asn Ser Phe Val Gly Trp Asn Thr Asp Trp
580 585 590
Ser Pro Tyr
595
<210> 11
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 11
Ser Thr Thr Glu Asp Leu Ala Glu Thr Phe Leu Glu Lys Phe Asn Tyr
1 5 10 15
Glu Ala Glu Asp Leu Tyr Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asn Glu Asn Ile Gln Lys Met Asn Asp Ala Gly
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys His Ala Lys Thr
50 55 60
Tyr Pro Leu Glu Glu Ile His Asn Ser Thr Val Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln His Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Glu Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Gln Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 12
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 12
Ser Thr Thr Glu Glu Leu Ala Lys Thr Phe Leu Glu Lys Phe Asn His
1 5 10 15
Glu Ala Glu Glu Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Glu Ala Gly
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Leu Ala Lys Thr
50 55 60
Tyr Pro Leu Ala Glu Ile His Asn Thr Thr Val Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Ser Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 13
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 13
Ser Thr Thr Glu Glu Leu Ala Lys Thr Phe Leu Glu Thr Phe Asn Tyr
1 5 10 15
Glu Ala Gln Glu Leu Ser Tyr Gln Ser Ser Val Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Ala Lys Asn Met Asn Glu Ala Gly
35 40 45
Ala Lys Trp Ser Ala Tyr Tyr Glu Glu Gln Ser Lys Leu Ala Gln Thr
50 55 60
Tyr Pro Leu Ala Glu Ile Gln Asp Ala Lys Ile Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 14
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 14
Ser Thr Thr Glu Asp Leu Ala Lys Thr Phe Leu Glu Lys Phe Asn Ser
1 5 10 15
Glu Ala Glu Glu Leu Ser His Gln Ser Ser Leu Ala Ser Trp Ser Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Glu Ala Gly
35 40 45
Ala Arg Trp Ser Ala Phe Tyr Glu Glu Gln Cys Lys Leu Ala Lys Thr
50 55 60
Tyr Pro Leu Glu Glu Ile Gln Asn Leu Thr Val Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 15
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 15
Ser Thr Thr Glu Asp Glu Ala Lys Met Phe Leu Asp Lys Phe Asn Thr
1 5 10 15
Lys Ala Glu Asp Leu Ser His Gln Ser Ser Leu Ala Ser Trp Asp Tyr
20 25 30
Asn Thr Asn Ile Asn Asp Glu Asn Val Gln Lys Met Asp Glu Ala Gly
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Leu Ala Lys Asn
50 55 60
Tyr Ser Leu Glu Gln Ile Gln Asn Val Thr Val Lys Leu Gln Leu Gln
65 70 75 80
Ile Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Glu Gly Phe Trp Asn Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Glu Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 16
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 16
Ser Thr Thr Glu Glu Gln Ala Lys Thr Phe Leu Glu Lys Phe Asn His
1 5 10 15
Glu Ala Glu Asp Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Glu Ala Arg
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Arg Met Ala Lys Thr
50 55 60
Tyr Ser Leu Glu Glu Ile Gln Asn Leu Thr Leu Lys Arg Gln Leu Lys
65 70 75 80
Ala Leu Gln His Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Asp Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 17
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 17
Ser Thr Thr Glu Glu Gln Ala Lys Thr Phe Leu Glu Lys Phe Asn His
1 5 10 15
Glu Ala Glu Asp Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Glu Ala Arg
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Arg Met Ala Arg Thr
50 55 60
Tyr Ser Leu Glu Glu Ile Gln Asn Leu Thr Leu Lys Arg Gln Leu Lys
65 70 75 80
Ala Leu Gln His Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Asn Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 18
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 18
Ser Thr Thr Glu Glu Leu Ala Lys Thr Phe Leu Glu Lys Phe Asn Leu
1 5 10 15
Glu Ala Glu Asp Leu Ala Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Ile Gln Lys Met Asn Asp Ala Arg
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Arg Ile Ala Lys Thr
50 55 60
Tyr Pro Leu Asp Glu Ile Gln Thr Leu Ile Leu Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Asn Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 19
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 19
Ser Thr Thr Glu Glu Leu Ala Lys Thr Phe Leu Glu Glu Phe Asn His
1 5 10 15
Glu Ala Glu Asp Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Asp Ala Arg
35 40 45
Ala Lys Trp Ser Thr Phe Tyr Glu Glu Lys Ser Lys Thr Ala Lys Thr
50 55 60
Tyr Pro Leu Glu Glu Ile Gln Asn Val Thr Leu Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Asp Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 20
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 20
Ser Leu Thr Glu Glu Asn Ala Lys Thr Phe Leu Asn Asn Phe Asn Gln
1 5 10 15
Glu Ala Glu Asp Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Glu Glu Asn Ala Gln Lys Met Ser Glu Ala Ala
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Thr Ala Gln Ser
50 55 60
Phe Ser Leu Gln Glu Ile Gln Thr Pro Ile Ile Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Ala Asn Ser Met Leu Thr Glu Pro Ala Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly His Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asn Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 21
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 21
Ser Leu Ile Glu Glu Lys Ala Glu Ser Phe Leu Asn Lys Phe Asn Gln
1 5 10 15
Glu Ala Glu Asp Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Glu Glu Asn Ala Gln Lys Met Asn Glu Ala Ala
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Ile Ala Gln Asn
50 55 60
Phe Ser Leu Gln Glu Ile Gln Asn Ala Thr Ile Lys Arg Gln Leu Lys
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Pro Gly Phe Trp Thr Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Asp Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly His Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asn Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 22
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 22
Ser Thr Ile Glu Glu Leu Ala Lys Thr Phe Leu Asp Lys Phe Asn Gln
1 5 10 15
Glu Ala Glu Asp Leu Asp Tyr Gln Arg Ser Leu Ala Ala Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Glu Glu Asn Thr Gln Lys Met Asn Glu Ala Glu
35 40 45
Ala Lys Trp Ser Ala Phe Tyr Glu Glu Gln Ser Lys Leu Ala Thr Ala
50 55 60
Tyr Pro Leu Gln Glu Ile Gln Asn Phe Thr Leu Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Thr Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gln Lys Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asn Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 23
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 23
Phe Asn Leu Glu Glu Gln Ala Lys Thr Phe Leu Asp Glu Phe Asn Leu
1 5 10 15
Lys Ala Glu Asp Leu Tyr Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Ser Glu Ala Gly
35 40 45
Gly Ile Leu Ser Ala Phe Tyr Glu Glu Gln Ser Asn Leu Ala Lys Ala
50 55 60
Tyr Pro Leu Gln Asp Ile Gln Asn Leu Thr Val Lys Arg Gln Leu Arg
65 70 75 80
Ile Leu Gln Gln Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Lys Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Gln Lys Val Val Cys His Pro Thr Ala Trp Asp Met Gly Lys Asn
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp His Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 24
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 24
Thr Asn Ile Glu Glu Glu Ala Lys Lys Phe Leu Asp Asp Phe Asn Arg
1 5 10 15
Gln Ala Glu Asn Val Ser Tyr Glu Ser Ala Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Ile Asn Ile Thr Glu Glu Asn Ile Gln Lys Met Asn Asp Ala Gly
35 40 45
Ala Lys Trp Ser Glu Phe Tyr Glu Glu Gln Ser Lys Thr Ala Arg Asn
50 55 60
Tyr Pro Leu Gln Asp Ile Gln Asn Pro Thr Val Arg Arg Gln Leu Gln
65 70 75 80
Ile Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Glu Gly Phe Trp Asn Asn Ser Met Leu Thr Glu Pro Gln Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Asn Gly
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 25
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 25
Ser Thr Ser Asp Glu Glu Ala Lys Thr Phe Leu Glu Lys Phe Asn Ser
1 5 10 15
Glu Ala Glu Glu Leu Ser Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Asp Glu Asn Val Gln Lys Met Asn Val Ala Gly
35 40 45
Ala Lys Trp Ser Thr Phe Tyr Glu Glu Gln Ser Lys Ile Ala Lys Asn
50 55 60
Tyr Gln Leu Gln Asn Ile Gln Asn Asp Thr Ile Lys Arg Gln Leu Gln
65 70 75 80
Ala Leu Gln Leu Ser Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Thr Phe Trp Glu Asn Ser Met Leu Thr Glu Pro Gly Asp
305 310 315 320
Gly Arg Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys His
325 330 335
Asp Phe Arg Ile Lys Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 26
<211> 596
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 26
Ser Thr Ile Glu Glu Gln Ala Arg Thr Phe Leu Asp Lys Phe Asn His
1 5 10 15
Glu Ala Glu Asp Leu Phe Tyr Gln Ser Ser Leu Ala Ser Trp Asn Tyr
20 25 30
Asn Thr Asn Ile Thr Glu Glu Asn Val Gln Asn Met Asn Asn Ala Gly
35 40 45
Asp Lys Trp Ser Ala Phe Leu Lys Glu Gln Ser Thr Leu Ala Gln Met
50 55 60
Tyr Pro Pro Gln Glu Ile Gln Asn Leu Thr Ile Lys Leu Gln Leu Gln
65 70 75 80
Ala Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp Lys Ser Lys
85 90 95
Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser Thr Gly
100 105 110
Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu Glu Pro
115 120 125
Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu Arg Leu
130 135 140
Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu Arg Pro
145 150 155 160
Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg Ala Asn
165 170 175
His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu Val Asn
180 185 190
Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu Asp Val
195 200 205
Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu His Ala
210 215 220
Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile Ser Pro
225 230 235 240
Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly Arg Phe
245 250 255
Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys Pro Asn
260 265 270
Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala Gln Arg
275 280 285
Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu Pro Asn
290 295 300
Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Asp Pro Gly Asn
305 310 315 320
Val Gln Lys Val Val Cys His Pro Thr Ala Trp Asp Leu Gly Lys Gly
325 330 335
Asp Phe Arg Ile Leu Met Cys Thr Lys Val Thr Met Asp Asp Phe Leu
340 345 350
Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala Tyr Ala
355 360 365
Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe His Glu
370 375 380
Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys His Leu
385 390 395 400
Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn Glu Thr
405 410 415
Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly Thr Leu
420 425 430
Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe Lys Gly
435 440 445
Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met Lys Arg
450 455 460
Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr Tyr Cys
465 470 475 480
Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe Ile Arg
485 490 495
Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala Leu Cys
500 505 510
Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile Ser Asn
515 520 525
Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu Gly Lys
530 535 540
Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala Lys Asn
545 550 555 560
Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe Thr Trp
565 570 575
Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr Asp Trp
580 585 590
Ser Pro Tyr Ala
595
<210> 27
<211> 905
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 27
Cys Gly Gly Ala Gly Cys Ala Cys Thr Gly Thr Cys Cys Thr Cys Cys
1 5 10 15
Gly Ala Ala Cys Gly Thr Cys Gly Gly Ala Gly Cys Ala Cys Thr Gly
20 25 30
Thr Cys Cys Thr Cys Cys Gly Ala Ala Cys Gly Thr Cys Gly Gly Ala
35 40 45
Gly Cys Ala Cys Thr Gly Thr Cys Cys Thr Cys Cys Gly Ala Ala Cys
50 55 60
Gly Thr Cys Gly Gly Ala Gly Cys Ala Cys Thr Gly Thr Cys Cys Thr
65 70 75 80
Cys Cys Gly Ala Ala Cys Gly Gly Ala Gly Cys Ala Thr Gly Thr Cys
85 90 95
Cys Thr Cys Cys Gly Ala Ala Cys Gly Thr Cys Gly Gly Ala Gly Cys
100 105 110
Ala Cys Thr Gly Thr Cys Cys Thr Cys Cys Gly Ala Ala Cys Gly Ala
115 120 125
Cys Thr Ala Gly Thr Thr Ala Gly Gly Cys Gly Thr Gly Thr Ala Cys
130 135 140
Gly Gly Thr Gly Gly Gly Ala Gly Gly Cys Cys Thr Ala Thr Ala Thr
145 150 155 160
Ala Ala Gly Cys Ala Gly Ala Gly Cys Thr Cys Gly Thr Thr Thr Ala
165 170 175
Gly Thr Gly Ala Ala Cys Cys Gly Thr Cys Ala Gly Ala Thr Cys Gly
180 185 190
Cys Cys Thr Gly Gly Ala Gly Ala Cys Gly Cys Cys Ala Thr Cys Cys
195 200 205
Ala Cys Gly Cys Thr Gly Thr Thr Thr Thr Gly Ala Cys Cys Thr Cys
210 215 220
Cys Ala Thr Ala Gly Ala Ala Gly Ala Cys Ala Cys Cys Gly Gly Gly
225 230 235 240
Ala Cys Cys Gly Ala Thr Cys Cys Ala Gly Cys Cys Thr Cys Thr Cys
245 250 255
Gly Ala Cys Ala Thr Thr Cys Gly Thr Thr Gly Gly Ala Thr Cys Cys
260 265 270
Gly Cys Cys Ala Cys Cys Ala Thr Gly Gly Cys Cys Thr Thr Ala Cys
275 280 285
Cys Gly Gly Thr Gly Ala Cys Cys Gly Cys Cys Thr Thr Gly Cys Thr
290 295 300
Cys Cys Thr Gly Cys Cys Gly Cys Thr Gly Gly Cys Cys Thr Thr Gly
305 310 315 320
Cys Thr Gly Cys Thr Cys Cys Ala Cys Gly Cys Cys Gly Cys Cys Ala
325 330 335
Gly Gly Cys Cys Gly Ala Gly Cys Cys Ala Gly Thr Thr Cys Cys Gly
340 345 350
Gly Gly Thr Gly Thr Cys Gly Cys Cys Gly Cys Thr Gly Gly Ala Thr
355 360 365
Cys Gly Gly Ala Cys Cys Thr Gly Gly Ala Ala Cys Cys Thr Gly Gly
370 375 380
Gly Cys Gly Ala Gly Ala Cys Ala Gly Thr Gly Gly Ala Gly Cys Thr
385 390 395 400
Gly Ala Ala Gly Thr Gly Cys Cys Ala Gly Gly Thr Gly Cys Thr Gly
405 410 415
Cys Thr Gly Thr Cys Cys Ala Ala Cys Cys Cys Gly Ala Cys Gly Thr
420 425 430
Cys Gly Gly Gly Cys Thr Gly Cys Thr Cys Gly Thr Gly Gly Cys Thr
435 440 445
Cys Thr Thr Cys Cys Ala Gly Cys Cys Gly Cys Gly Cys Gly Gly Cys
450 455 460
Gly Cys Cys Gly Cys Cys Gly Cys Cys Ala Gly Thr Cys Cys Cys Ala
465 470 475 480
Cys Cys Thr Thr Cys Cys Thr Cys Cys Thr Ala Thr Ala Cys Cys Thr
485 490 495
Cys Thr Cys Cys Cys Ala Ala Ala Ala Cys Ala Ala Gly Cys Cys Cys
500 505 510
Ala Ala Gly Gly Cys Gly Gly Cys Cys Gly Ala Gly Gly Gly Gly Cys
515 520 525
Thr Gly Gly Ala Cys Ala Cys Cys Cys Ala Gly Cys Gly Gly Thr Thr
530 535 540
Cys Thr Cys Gly Gly Gly Cys Ala Ala Gly Ala Gly Gly Thr Thr Gly
545 550 555 560
Gly Gly Gly Gly Ala Cys Ala Cys Cys Thr Thr Cys Gly Thr Cys Cys
565 570 575
Thr Cys Ala Cys Cys Cys Thr Gly Ala Gly Cys Gly Ala Cys Thr Thr
580 585 590
Cys Cys Gly Cys Cys Gly Ala Gly Ala Gly Ala Ala Cys Gly Ala Gly
595 600 605
Gly Gly Cys Thr Ala Cys Thr Ala Thr Thr Thr Cys Thr Gly Cys Thr
610 615 620
Cys Gly Gly Cys Cys Cys Thr Gly Ala Gly Cys Ala Ala Cys Thr Cys
625 630 635 640
Cys Ala Thr Cys Ala Thr Gly Thr Ala Cys Thr Thr Cys Ala Gly Cys
645 650 655
Cys Ala Cys Thr Thr Cys Gly Thr Gly Cys Cys Gly Gly Thr Cys Thr
660 665 670
Thr Cys Cys Thr Gly Cys Cys Ala Gly Cys Gly Ala Ala Gly Cys Cys
675 680 685
Cys Ala Cys Cys Ala Cys Gly Ala Cys Gly Cys Cys Ala Gly Cys Gly
690 695 700
Cys Cys Gly Cys Gly Ala Cys Cys Ala Cys Cys Ala Ala Cys Ala Cys
705 710 715 720
Cys Gly Gly Cys Gly Cys Cys Cys Ala Cys Cys Ala Thr Cys Gly Cys
725 730 735
Gly Thr Cys Gly Cys Ala Gly Cys Cys Cys Cys Thr Gly Thr Cys Cys
740 745 750
Cys Thr Gly Cys Gly Cys Cys Cys Ala Gly Ala Gly Gly Cys Gly Thr
755 760 765
Gly Cys Cys Gly Gly Cys Cys Ala Gly Cys Gly Gly Cys Gly Gly Gly
770 775 780
Gly Gly Gly Cys Gly Cys Ala Gly Thr Gly Cys Ala Cys Ala Cys Gly
785 790 795 800
Ala Gly Gly Gly Gly Gly Cys Thr Gly Gly Ala Cys Thr Thr Cys Gly
805 810 815
Cys Cys Thr Gly Thr Gly Ala Thr Ala Thr Cys Thr Ala Cys Ala Thr
820 825 830
Cys Thr Gly Gly Gly Cys Gly Cys Cys Cys Thr Thr Gly Gly Cys Cys
835 840 845
Gly Gly Gly Ala Cys Thr Thr Gly Thr Gly Gly Gly Gly Thr Cys Cys
850 855 860
Thr Thr Cys Thr Cys Cys Thr Gly Thr Cys Ala Cys Thr Gly Gly Thr
865 870 875 880
Thr Ala Thr Cys Ala Cys Cys Cys Thr Thr Thr Ala Cys Thr Gly Cys
885 890 895
Ala Ala Cys Cys Ala Cys Thr Gly Ala
900 905
<210> 28
<211> 731
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 28
Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu
1 5 10 15
Pro Pro Gly Ala Ala Ser Thr Gln Val Cys Thr Gly Thr Asp Met Lys
20 25 30
Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His
35 40 45
Leu Tyr Gln Gly Cys Gln Val Val Gln Gly Asn Leu Glu Leu Thr Tyr
50 55 60
Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gln Asp Ile Gln Glu Val
65 70 75 80
Gln Gly Tyr Val Leu Ile Ala His Asn Gln Val Arg Gln Val Pro Leu
85 90 95
Gln Arg Leu Arg Ile Val Arg Gly Thr Gln Leu Phe Glu Asp Asn Tyr
100 105 110
Ala Leu Ala Val Leu Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro
115 120 125
Val Thr Gly Ala Ser Pro Gly Gly Leu Arg Glu Leu Gln Leu Arg Ser
130 135 140
Leu Thr Glu Ile Leu Lys Gly Gly Val Leu Ile Gln Arg Asn Pro Gln
145 150 155 160
Leu Cys Tyr Gln Asp Thr Ile Leu Trp Lys Asp Ile Phe His Lys Asn
165 170 175
Asn Gln Leu Ala Leu Thr Leu Ile Asp Thr Asn Arg Ser Arg Ala Cys
180 185 190
His Pro Cys Ser Pro Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser
195 200 205
Ser Glu Asp Cys Gln Ser Leu Thr Arg Thr Val Cys Ala Gly Gly Cys
210 215 220
Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His Glu Gln Cys
225 230 235 240
Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu
245 250 255
His Phe Asn His Ser Gly Ile Cys Glu Leu His Cys Pro Ala Leu Val
260 265 270
Thr Tyr Asn Thr Asp Thr Phe Glu Ser Met Pro Asn Pro Glu Gly Arg
275 280 285
Tyr Thr Phe Gly Ala Ser Cys Val Thr Ala Cys Pro Tyr Asn Tyr Leu
290 295 300
Ser Thr Asp Val Gly Ser Cys Thr Leu Val Cys Pro Leu His Asn Gln
305 310 315 320
Glu Val Thr Ala Glu Asp Gly Thr Gln Arg Cys Glu Lys Cys Ser Lys
325 330 335
Pro Cys Ala Arg Val Cys Tyr Gly Leu Gly Met Glu His Leu Arg Glu
340 345 350
Val Arg Ala Val Thr Ser Ala Asn Ile Gln Glu Phe Ala Gly Cys Lys
355 360 365
Lys Ile Phe Gly Ser Leu Ala Phe Leu Pro Glu Ser Phe Asp Gly Asp
370 375 380
Pro Ala Ser Asn Thr Ala Pro Leu Gln Pro Glu Gln Leu Gln Val Phe
385 390 395 400
Glu Thr Leu Glu Glu Ile Thr Gly Tyr Leu Tyr Ile Ser Ala Trp Pro
405 410 415
Asp Ser Leu Pro Asp Leu Ser Val Phe Gln Asn Leu Gln Val Ile Arg
420 425 430
Gly Arg Ile Leu His Asn Gly Ala Tyr Ser Leu Thr Leu Gln Gly Leu
435 440 445
Gly Ile Ser Trp Leu Gly Leu Arg Ser Leu Arg Glu Leu Gly Ser Gly
450 455 460
Leu Ala Leu Ile His His Asn Thr His Leu Cys Phe Val His Thr Val
465 470 475 480
Pro Trp Asp Gln Leu Phe Arg Asn Pro His Gln Ala Leu Leu His Thr
485 490 495
Ala Asn Arg Pro Glu Asp Glu Cys Val Gly Glu Gly Leu Ala Cys His
500 505 510
Gln Leu Cys Ala Arg Gly His Cys Trp Gly Pro Gly Pro Thr Gln Cys
515 520 525
Val Asn Cys Ser Gln Phe Leu Arg Gly Gln Glu Cys Val Glu Glu Cys
530 535 540
Arg Val Leu Gln Gly Leu Pro Arg Glu Tyr Val Asn Ala Arg His Cys
545 550 555 560
Leu Pro Cys His Pro Glu Cys Gln Pro Gln Asn Gly Ser Val Thr Cys
565 570 575
Phe Gly Pro Glu Ala Asp Gln Cys Val Ala Cys Ala His Tyr Lys Asp
580 585 590
Pro Pro Phe Cys Val Ala Arg Cys Pro Ser Gly Val Lys Pro Asp Leu
595 600 605
Ser Tyr Met Pro Ile Trp Lys Phe Pro Asp Glu Glu Gly Ala Cys Gln
610 615 620
Pro Cys Pro Ile Asn Cys Thr His Ser Cys Val Asp Leu Asp Asp Lys
625 630 635 640
Gly Cys Pro Ala Glu Gln Arg Ala Ser Pro Leu Thr Ser Ile Ile Ser
645 650 655
Ala Val Val Gly Ile Leu Leu Val Val Val Leu Gly Val Val Phe Gly
660 665 670
Ile Leu Ile Lys Arg Arg Gln Gln Lys Ile Arg Lys Tyr Thr Met Arg
675 680 685
Arg Leu Leu Gln Glu Thr Glu Leu Val Glu Pro Leu Thr Pro Ser Gly
690 695 700
Ala Met Pro Asn Gln Ala Gln Met Arg Ile Leu Lys Glu Thr Glu Leu
705 710 715 720
Arg Lys Val Lys Val Leu Gly Ser Gly Ala Phe
725 730
<210> 29
<211> 193
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 29
Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys
1 5 10 15
Phe Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val
20 25 30
Ala Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys
35 40 45
Cys Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn
50 55 60
Val Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile
65 70 75 80
Ala Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro
85 90 95
Asp Asp Phe Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp
100 105 110
Ala Thr Ser Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His
115 120 125
Gly Lys Leu Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser
130 135 140
Pro Asp Gly Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro
145 150 155 160
Leu Asn Asp Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro
165 170 175
Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr
180 185 190
Val
<210> 30
<211> 273
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 30
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
1 5 10 15
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
20 25 30
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
35 40 45
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
50 55 60
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
65 70 75 80
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
85 90 95
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
100 105 110
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
115 120 125
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
130 135 140
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
145 150 155 160
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
165 170 175
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
180 185 190
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
195 200 205
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
210 215 220
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
225 230 235 240
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
245 250 255
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
260 265 270
Ser
Claims (20)
1. A recombinant transmembrane protein, which is fused with a phase-change peptide fragment.
2. The recombinant transmembrane protein according to claim 1, wherein the phase transition peptide fragment is a low complexity sequence.
3. The recombinant transmembrane protein according to claim 2, wherein the low-complexity sequence is derived from a TAF15 protein, a FUS protein or a DDX4 protein; preferably, the low complexity sequence is selected from one or more of the following:
1) an amino acid sequence shown as SEQ ID NO. 1;
2) an amino acid sequence shown as SEQ ID NO. 2;
3) an amino acid sequence shown as SEQ ID NO. 3;
4) the amino acid sequence shown as SEQ ID NO. 4.
4. The recombinant transmembrane protein according to claim 1, wherein the phase transition peptide fragment is fused to an intracellular domain of the transmembrane protein.
5. The recombinant transmembrane protein according to claim 1, wherein the recombinant transmembrane protein is a ligand-PDGFR recombinant transmembrane protein.
6. The recombinant transmembrane protein according to claim 5, wherein the C-terminal of the ligand-PDGFR recombinant transmembrane protein is fused to a low-complexity sequence; preferably, the amino acid sequence of the ligand-PDGFR recombinant transmembrane protein fused with the C-terminal low-complexity sequence is shown as SEQ ID NO. 5.
7. An isolated polynucleotide encoding a recombinant transmembrane protein according to any one of claims 1 to 6.
8. The polynucleotide of claim 7, wherein the polynucleotide has the nucleotide sequence shown in SEQ ID No. 6.
9. A nucleic acid construct comprising the polynucleotide of claim 7 or 8.
10. A cell expressing a recombinant transmembrane protein according to any one of claims 1 to 6.
11. The cell of claim 10, wherein the cell is a ligand-presenting cell; preferably, the ligand quantitative presenting cell is an antigen quantitative presenting cell.
12. A system for quantitatively displaying macromolecules on the surface of cells, the system comprising the cells of claim 11 and receptor-expressing cells, wherein the receptors of the receptor-expressing cells are quantitatively activated specifically when the ligand-quantitatively-presenting cells and the receptor-expressing cells are co-cultured.
13. The system for quantitatively displaying macromolecules on the surface of a cell according to claim 12, wherein the receptor-expressing cells are synNotch synthetic receptor-expressing cells, and preferably, extracellular regions of synNotch synthetic receptors on the synNotch synthetic receptor-expressing cells are fused with regions capable of specifically recognizing and binding to ligands on the surface of the ligand quantitative-presenting cells.
14. The system for quantitatively displaying macromolecules on the surface of a cell as claimed in claim 12, wherein the receptor-expressing cell is a homeostatic cell line which expresses synNotch synthetic receptor and reporter group simultaneously.
15. The quantitative cell surface macromolecule display system of claim 14, wherein the reporter is selected from the group consisting of a fluorescent reporter and a non-fluorescent reporter.
16. A preparation method of a cell surface macromolecule quantitative display system is characterized by comprising the following steps:
1) constructing a receptor expression cell expressing a target receptor on a cell membrane;
2) constructing a ligand quantitative presenting cell expressing a ligand of interest, said ligand quantitative presenting cell expressing a recombinant transmembrane protein according to any one of claims 1 to 6.
17. The method of claim 16, further comprising one or more of the following features:
1) the receptor expressing cells carry a reporter group;
2) the receptor expression cells are synNotch synthetic receptor expression cells; preferably, the extracellular region of synNotch synthetic receptor on the synNotch synthetic receptor-expressing cell is fused to the receptor of interest.
18. Use of the cell surface macromolecule quantitative display system of any one of claims 12-15 for ligand-receptor affinity detection and/or bulk acquisition of high affinity antibodies.
19. A method for detecting ligand-receptor affinity, comprising co-culturing the receptor-expressing cells of step 1) and the ligand-quantitatively presenting cells of step 2) in the preparation method of claim 16, and then detecting the binding between the target ligand and the target receptor.
20. A method for obtaining high affinity antibodies in bulk, comprising the steps of:
1) constructing a synNotch antibody library and a receptor expression cell library of a reporter group;
2) co-culturing the pool of receptor-expressing cells of step 1) with a quantitative ligand-presenting cell expressing the recombinant transmembrane protein of any one of claims 1-6 to activate a cell surface macromolecular quantitative display system;
3) collecting cells in a co-culture system, and primarily enriching receptor expression cells with positive report groups;
4) culturing the receptor expression cells primarily enriched in the step 3) for 5-10 days, and enriching the cells again by using the ligand quantitative presenting cells to obtain synNotch antibodies carried by the receptor expression cells, namely the high-affinity antibodies.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011222236.1A CN114437232B (en) | 2020-11-05 | 2020-11-05 | Cell surface macromolecule quantitative display system and preparation method and application thereof |
PCT/CN2021/128581 WO2022095916A1 (en) | 2020-11-05 | 2021-11-04 | Cell surface macromolecular quantitative display system, preparation method therefor and use thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011222236.1A CN114437232B (en) | 2020-11-05 | 2020-11-05 | Cell surface macromolecule quantitative display system and preparation method and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114437232A true CN114437232A (en) | 2022-05-06 |
CN114437232B CN114437232B (en) | 2023-11-24 |
Family
ID=81360839
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011222236.1A Active CN114437232B (en) | 2020-11-05 | 2020-11-05 | Cell surface macromolecule quantitative display system and preparation method and application thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114437232B (en) |
WO (1) | WO2022095916A1 (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004020582A2 (en) * | 2002-08-15 | 2004-03-11 | Functional Genetics, Inc. | Mammalian genes involved in rapamycin resistance and tumorgenesis: rapr6 genes |
CN101123983A (en) * | 2004-10-27 | 2008-02-13 | 米迪缪尼股份有限公司 | Modulation of antibody specificity by tailoring the affinity to cognate antigens |
CN108350430A (en) * | 2015-09-11 | 2018-07-31 | 艾吉纳斯公司 | Modified host cell and its application method |
US20180313827A1 (en) * | 2015-10-19 | 2018-11-01 | Oxford University Innovation Ltd | Biomolecule separation and modification |
CN109180805A (en) * | 2018-08-21 | 2019-01-11 | 南方医科大学 | Novel SynNotch synthesis of receptor and its encoding gene |
CN109576292A (en) * | 2018-12-21 | 2019-04-05 | 深圳市爱思迪生物科技有限公司 | A kind of construction method of antibody library and its application |
CN109752557A (en) * | 2017-11-06 | 2019-05-14 | 清华大学 | Detect reagent set and the application of biological disperser and its regulatory factor |
CN109917120A (en) * | 2017-12-12 | 2019-06-21 | 清华大学 | Detect the reagent set of posttranslational modification protein and its ligand-ligand interaction |
WO2020038246A1 (en) * | 2018-08-18 | 2020-02-27 | Shanghaitech University | Liquid-liquid phase separation driven protein-based underwater adhesive coatings |
CN111018997A (en) * | 2019-11-28 | 2020-04-17 | 中国科学院武汉病毒研究所 | FRET-based fusion protein, fluorescent nanoparticle and application thereof |
CN111748580A (en) * | 2019-03-28 | 2020-10-09 | 百奥泰生物制药股份有限公司 | Method for detecting activity of immune checkpoint antibody |
CN111856024A (en) * | 2019-04-28 | 2020-10-30 | 清华大学 | Method for detecting interaction between biological membrane proteins and kit used in method |
-
2020
- 2020-11-05 CN CN202011222236.1A patent/CN114437232B/en active Active
-
2021
- 2021-11-04 WO PCT/CN2021/128581 patent/WO2022095916A1/en active Application Filing
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060099676A1 (en) * | 2002-08-15 | 2006-05-11 | Limin Li | Mammalian genes involved in rapamycin resistance and tumorgenesis: rapr6 genes |
WO2004020582A2 (en) * | 2002-08-15 | 2004-03-11 | Functional Genetics, Inc. | Mammalian genes involved in rapamycin resistance and tumorgenesis: rapr6 genes |
CN101123983A (en) * | 2004-10-27 | 2008-02-13 | 米迪缪尼股份有限公司 | Modulation of antibody specificity by tailoring the affinity to cognate antigens |
CN108350430A (en) * | 2015-09-11 | 2018-07-31 | 艾吉纳斯公司 | Modified host cell and its application method |
US20180313827A1 (en) * | 2015-10-19 | 2018-11-01 | Oxford University Innovation Ltd | Biomolecule separation and modification |
CN109752557A (en) * | 2017-11-06 | 2019-05-14 | 清华大学 | Detect reagent set and the application of biological disperser and its regulatory factor |
CN109917120A (en) * | 2017-12-12 | 2019-06-21 | 清华大学 | Detect the reagent set of posttranslational modification protein and its ligand-ligand interaction |
WO2020038246A1 (en) * | 2018-08-18 | 2020-02-27 | Shanghaitech University | Liquid-liquid phase separation driven protein-based underwater adhesive coatings |
CN109180805A (en) * | 2018-08-21 | 2019-01-11 | 南方医科大学 | Novel SynNotch synthesis of receptor and its encoding gene |
CN109576292A (en) * | 2018-12-21 | 2019-04-05 | 深圳市爱思迪生物科技有限公司 | A kind of construction method of antibody library and its application |
CN111748580A (en) * | 2019-03-28 | 2020-10-09 | 百奥泰生物制药股份有限公司 | Method for detecting activity of immune checkpoint antibody |
CN111856024A (en) * | 2019-04-28 | 2020-10-30 | 清华大学 | Method for detecting interaction between biological membrane proteins and kit used in method |
CN111018997A (en) * | 2019-11-28 | 2020-04-17 | 中国科学院武汉病毒研究所 | FRET-based fusion protein, fluorescent nanoparticle and application thereof |
Non-Patent Citations (3)
Title |
---|
MURTHY ET AL.: "Molecular interactions underlying liquid-liquid phase separation of the FUS low-complexity domain", 《NAT STRUCT MOL BIOL》, vol. 26, no. 7, pages 637 - 648, XP036826068, DOI: 10.1038/s41594-019-0250-x * |
RAYKO ET AL.: "Draft genome of Bugula neritina, a colonial animal packing powerful symbionts and potential medicines", 《SCI DATA》, vol. 7, no. 1, pages 356 * |
高芳銮: "水稻瘤矮病毒P8蛋白的结构分析及其表达", 《中国生物工程杂志》, vol. 29, no. 8, pages 51 - 56 * |
Also Published As
Publication number | Publication date |
---|---|
WO2022095916A1 (en) | 2022-05-12 |
CN114437232B (en) | 2023-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Dodevski et al. | Evolution of three human GPCRs for higher expression and stability | |
CN103620405B (en) | Monoclonal antibody produces comprehensively | |
Kruziki et al. | A 45-amino-acid scaffold mined from the PDB for high-affinity ligand engineering | |
US20060008844A1 (en) | c-Met kinase binding proteins | |
JP2020043860A (en) | Methods for high throughput receptor:ligand identification | |
JP2006166922A (en) | Yeast cell surface display of protein and use thereof | |
JP2017537642A (en) | Chimeric antigen receptor and method of use thereof | |
WO2010065407A2 (en) | High affinity recombinant sea lamprey antibodies selected by a yeast surface display platform | |
CN109554433A (en) | A kind of drug rapid screening method based on CD47/SIRP α-block function and its biological effect | |
JP2021508246A (en) | CAR-T cell assay for specificity testing of novel antigen binding moiety | |
CN115561462A (en) | Membrane protein interaction screening platform based on cell adhesion | |
CN114437232B (en) | Cell surface macromolecule quantitative display system and preparation method and application thereof | |
CA2977436C (en) | Antibody detection method and system | |
CN113699156A (en) | PD-1 aptamer, and screening method and application thereof | |
US20220281931A1 (en) | Chemically inducible polypeptide polymerization | |
JPH11506939A (en) | Secretory leader trap cloning | |
JP2001520366A (en) | Methods for identifying optimal binding ligands for receptors | |
CN106928355B (en) | CD105 nano antibody Nb184 | |
CN106928358B (en) | CD105 nano antibody Nb168 | |
WO2021051004A1 (en) | Methods of antibody panning against target proteins | |
CN112166324A (en) | Mammalian MHC peptide display as a tool for epitope selection for vaccine design | |
CN106928359B (en) | CD105 nano antibody Nb59 | |
CN106928360B (en) | CD105 nano antibody Nb68 | |
CN112940122B (en) | Mouse single-chain antibody of targeting DNAM-1, recombinant plasmid for expressing mouse single-chain antibody, recombinant cell and application | |
CN116003606B (en) | TIGIT nano antibody and preparation method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |