CN115109790A - Recombinant a-L-iduronate prase and preparation method thereof - Google Patents
Recombinant a-L-iduronate prase and preparation method thereof Download PDFInfo
- Publication number
- CN115109790A CN115109790A CN202110306525.8A CN202110306525A CN115109790A CN 115109790 A CN115109790 A CN 115109790A CN 202110306525 A CN202110306525 A CN 202110306525A CN 115109790 A CN115109790 A CN 115109790A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- pro
- arg
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000002360 preparation method Methods 0.000 title claims description 8
- 108010076504 Protein Sorting Signals Proteins 0.000 claims abstract description 48
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 33
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 31
- 230000014509 gene expression Effects 0.000 claims abstract description 29
- 239000002773 nucleotide Substances 0.000 claims abstract description 29
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 29
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 claims description 32
- 238000000034 method Methods 0.000 claims description 23
- 239000000243 solution Substances 0.000 claims description 21
- 239000011780 sodium chloride Substances 0.000 claims description 16
- 239000006228 supernatant Substances 0.000 claims description 10
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 claims description 9
- 238000011068 loading method Methods 0.000 claims description 8
- 210000004962 mammalian cell Anatomy 0.000 claims description 8
- 230000008569 process Effects 0.000 claims description 8
- 238000010828 elution Methods 0.000 claims description 7
- 230000004048 modification Effects 0.000 claims description 7
- 238000012986 modification Methods 0.000 claims description 7
- 238000001042 affinity chromatography Methods 0.000 claims description 6
- 230000001413 cellular effect Effects 0.000 claims description 6
- 239000012149 elution buffer Substances 0.000 claims description 6
- 238000004191 hydrophobic interaction chromatography Methods 0.000 claims description 6
- 238000004255 ion exchange chromatography Methods 0.000 claims description 6
- 108010003381 Iduronidase Proteins 0.000 claims description 5
- 238000004587 chromatography analysis Methods 0.000 claims description 5
- 230000013595 glycosylation Effects 0.000 claims description 5
- 238000006206 glycosylation reaction Methods 0.000 claims description 5
- 230000002209 hydrophobic effect Effects 0.000 claims description 5
- 239000012528 membrane Substances 0.000 claims description 5
- 229920001184 polypeptide Polymers 0.000 claims description 5
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 5
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 5
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 claims description 4
- 150000001768 cations Chemical class 0.000 claims description 4
- 239000013604 expression vector Substances 0.000 claims description 4
- 230000001404 mediated effect Effects 0.000 claims description 4
- 238000000108 ultra-filtration Methods 0.000 claims description 3
- 108020004705 Codon Proteins 0.000 claims description 2
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 claims description 2
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 claims description 2
- 239000011543 agarose gel Substances 0.000 claims description 2
- 238000011067 equilibration Methods 0.000 claims description 2
- 239000013613 expression plasmid Substances 0.000 claims description 2
- 239000008363 phosphate buffer Substances 0.000 claims description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 claims description 2
- 229920000053 polysorbate 80 Polymers 0.000 claims description 2
- 150000003839 salts Chemical class 0.000 claims description 2
- 239000011534 wash buffer Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 102000004627 Iduronidase Human genes 0.000 claims 1
- 210000004027 cell Anatomy 0.000 abstract description 40
- 230000004988 N-glycosylation Effects 0.000 abstract description 10
- 230000026731 phosphorylation Effects 0.000 abstract description 10
- 238000006366 phosphorylation reaction Methods 0.000 abstract description 10
- 230000004481 post-translational protein modification Effects 0.000 abstract description 7
- 238000003776 cleavage reaction Methods 0.000 abstract description 4
- 230000007017 scission Effects 0.000 abstract description 4
- 210000000170 cell membrane Anatomy 0.000 abstract description 3
- 230000002255 enzymatic effect Effects 0.000 abstract description 2
- 229960002486 laronidase Drugs 0.000 description 58
- 229940022705 aldurazyme Drugs 0.000 description 32
- 102000004190 Enzymes Human genes 0.000 description 24
- 108090000790 Enzymes Proteins 0.000 description 24
- 229940088598 enzyme Drugs 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 13
- 230000000694 effects Effects 0.000 description 12
- HSHNITRMYYLLCV-UHFFFAOYSA-N 4-methylumbelliferone Chemical compound C1=C(O)C=CC2=C1OC(=O)C=C2C HSHNITRMYYLLCV-UHFFFAOYSA-N 0.000 description 10
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 10
- 108010000761 leucylarginine Proteins 0.000 description 10
- 108010057821 leucylproline Proteins 0.000 description 9
- 239000000758 substrate Substances 0.000 description 9
- 150000001413 amino acids Chemical group 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 239000000872 buffer Substances 0.000 description 8
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 239000013592 cell lysate Substances 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 6
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 5
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 5
- 102100035028 Alpha-L-iduronidase Human genes 0.000 description 5
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 5
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 5
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 5
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 5
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 5
- 208000002678 Mucopolysaccharidoses Diseases 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 5
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 5
- 108010064997 VPY tripeptide Proteins 0.000 description 5
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 5
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 206010028093 mucopolysaccharidosis Diseases 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 239000011535 reaction buffer Substances 0.000 description 5
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 4
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 4
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 4
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 4
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 4
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 4
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 4
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 210000003712 lysosome Anatomy 0.000 description 4
- 230000001868 lysosomic effect Effects 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 3
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 3
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 3
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 3
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 3
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 3
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 3
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 3
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 3
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 3
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 3
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 3
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 3
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 3
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 3
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 3
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 3
- 102100021277 Beta-secretase 2 Human genes 0.000 description 3
- 101710150190 Beta-secretase 2 Proteins 0.000 description 3
- 102100025024 Cation-dependent mannose-6-phosphate receptor Human genes 0.000 description 3
- 101710145225 Cation-independent mannose-6-phosphate receptor Proteins 0.000 description 3
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 3
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 3
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 3
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 3
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 3
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 3
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 3
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 3
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 3
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 3
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 3
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 3
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 3
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 3
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 3
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 3
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 3
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 3
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 3
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 3
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 3
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 3
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 3
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 3
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 3
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 3
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 3
- ZZVUXQCQPXSUFH-JBACZVJFSA-N Phe-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ZZVUXQCQPXSUFH-JBACZVJFSA-N 0.000 description 3
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 3
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 3
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 3
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 3
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 3
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 3
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- 102000018120 Recombinases Human genes 0.000 description 3
- 108010091086 Recombinases Proteins 0.000 description 3
- 229920002684 Sepharose Polymers 0.000 description 3
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 3
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 3
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 3
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 3
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 3
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 3
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 3
- NCGUQWSJUKYCIT-SZZJOZGLSA-N Thr-His-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NCGUQWSJUKYCIT-SZZJOZGLSA-N 0.000 description 3
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 3
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 3
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 3
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 3
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 3
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 3
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 3
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 3
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 3
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 3
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- 108010081404 acein-2 Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 238000001917 fluorescence detection Methods 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 238000011002 quantification Methods 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 2
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 2
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- PZVMBNFTBWQWQL-DCAQKATOSA-N Arg-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PZVMBNFTBWQWQL-DCAQKATOSA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 2
- 102100021257 Beta-secretase 1 Human genes 0.000 description 2
- 101710150192 Beta-secretase 1 Proteins 0.000 description 2
- 101100163949 Caenorhabditis elegans asp-3 gene Proteins 0.000 description 2
- ANRWXLYGJRSQEQ-CIUDSAMLSA-N Cys-His-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ANRWXLYGJRSQEQ-CIUDSAMLSA-N 0.000 description 2
- XXDATQFUGMAJRV-XIRDDKMYSA-N Cys-Leu-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XXDATQFUGMAJRV-XIRDDKMYSA-N 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 2
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 2
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 2
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 2
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 229920002683 Glycosaminoglycan Polymers 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 2
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 2
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 2
- KAFZDWMZKGQDEE-SRVKXCTJSA-N His-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KAFZDWMZKGQDEE-SRVKXCTJSA-N 0.000 description 2
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 2
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 2
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 2
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 2
- 101000581940 Homo sapiens Napsin-A Proteins 0.000 description 2
- 101150022680 IDUA gene Proteins 0.000 description 2
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 2
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 2
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 2
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 2
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 2
- 206010056886 Mucopolysaccharidosis I Diseases 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 102100027343 Napsin-A Human genes 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 108010019160 Pancreatin Proteins 0.000 description 2
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 2
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 2
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 2
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 2
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 2
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 2
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 2
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 2
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 2
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 2
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 2
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- AEMOLEFTQBMNLQ-VCSGLWQLSA-M alpha-L-iduronate Chemical compound O[C@@H]1O[C@@H](C([O-])=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-VCSGLWQLSA-M 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 238000002514 liquid chromatography mass spectrum Methods 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 229940055695 pancreatin Drugs 0.000 description 2
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 238000004007 reversed phase HPLC Methods 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 239000012089 stop solution Substances 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 1
- NOWKCMXCCJGMRR-UHFFFAOYSA-N Aziridine Chemical compound C1CN1 NOWKCMXCCJGMRR-UHFFFAOYSA-N 0.000 description 1
- 239000012617 Butyl Sepharose™ 4 Fast Flow Substances 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 208000006069 Corneal Opacity Diseases 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 239000004593 Epoxy Substances 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- OAOOXBSVCJEIFY-QAETUUGQSA-N Gln-Leu-Leu-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O OAOOXBSVCJEIFY-QAETUUGQSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010015899 Glycopeptides Proteins 0.000 description 1
- 102000002068 Glycopeptides Human genes 0.000 description 1
- AASLOGQZZKZWKH-SRVKXCTJSA-N His-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AASLOGQZZKZWKH-SRVKXCTJSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- 101001019502 Homo sapiens Alpha-L-iduronidase Proteins 0.000 description 1
- 208000015178 Hurler syndrome Diseases 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- 208000028781 Mucopolysaccharidosis type 1 Diseases 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 208000035977 Rare disease Diseases 0.000 description 1
- 102000040739 Secretory proteins Human genes 0.000 description 1
- 108091058545 Secretory proteins Proteins 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- WTRQBSSQBKRNKV-MNSWYVGCSA-N Trp-Thr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=C(O)C=C1 WTRQBSSQBKRNKV-MNSWYVGCSA-N 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- 102100039662 Xaa-Pro dipeptidase Human genes 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000010322 bone marrow transplantation Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 238000011281 clinical therapy Methods 0.000 description 1
- 231100000269 corneal opacity Toxicity 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 231100000895 deafness Toxicity 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 230000007071 enzymatic hydrolysis Effects 0.000 description 1
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 1
- 238000002641 enzyme replacement therapy Methods 0.000 description 1
- 239000006167 equilibration buffer Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 230000009395 genetic defect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 206010019847 hepatosplenomegaly Diseases 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 125000000311 mannosyl group Chemical group C1([C@@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 208000030159 metabolic disease Diseases 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 238000003566 phosphorylation assay Methods 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010066823 proline dipeptidase Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000002636 symptomatic treatment Methods 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 229940070710 valerate Drugs 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/16—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- A61K38/43—Enzymes; Proenzymes; Derivatives thereof
- A61K38/46—Hydrolases (3)
- A61K38/47—Hydrolases (3) acting on glycosyl compounds (3.2), e.g. cellulases, lactases
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01076—L-Iduronidase (3.2.1.76)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/50—Fusion polypeptide containing protease site
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Pharmacology & Pharmacy (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Diabetes (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Obesity (AREA)
- Hematology (AREA)
- Gastroenterology & Hepatology (AREA)
- Immunology (AREA)
- Epidemiology (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The present invention discloses an isolated nucleotide sequence comprising a signal peptide sequence and a nucleotide sequence encoding a-L-iduronidase, wherein the signal peptide sequence is not the native signal peptide sequence of a-L-iduronidase. By replacing a natural signal peptide with an artificial signal peptide, the expression level of a-L-iduronidase can be increased, and a protein with a single cleavage point and correct post-translational modification (N-glycosylation phosphorylation) can be generated, and can cross cell membranes and exert enzymatic activity in cells.
Description
Technical Field
The invention relates to the field of molecular biology, in particular to a recombinant a-L-iduronate prolidase and a preparation method thereof.
Background
Mucopolysaccharidosis (MPS), a rare disease, is primarily due to a deficiency in the hydrolytic enzymes required in lysosomes to degrade mucopolysaccharides, resulting in a large mucopolysaccharidosis in the tissue. Mucopolysaccharidosis type I (MPS I) is a metabolic disorder syndrome caused by deficiency of Alpha-L-Iduronidase (Alpha-L-Iduronidase), the most severe subtype of the disease, also known as Hurler syndrome, usually dies at about 10 years old, and clinically, it is manifested by various manifestations such as low intelligence, ugly face, hepatosplenomegaly, skeletal disorders, cardiovascular disorders, corneal opacity and deafness. At present, besides symptomatic treatment, bone marrow transplantation and gene therapy are also available, but the treatment process is complex, the risk is high, and the treatment effect is not exact. The clinical recombinase replacement therapy is the only drug therapy with definite curative effect at present, is small in risk by intravenous injection, needs lifelong treatment and is high in cost.
Aldurazyme is a recombinant Alpha-L-Iduronidase co-developed by Genzyme and BioMarin for enzyme replacement therapy of MPS-I, and is currently the only commercially available specific drug for the treatment of MPS I.
The a-L-iduronidase is present in lysosomes in mammalian cell cytoplasm and is undetectable in normal human serum. The amino acid sequence of the protein has an atypical signal peptide sequence (1-27: MRPLRPRAALLALLASLL AAPPVAPAE) [ Unit https:// www.uniprot.org/Uniprot/P35475], however, analysis by signal peptide prediction software [ SignalP ] shows that the signal peptide is cleaved at least three positions [ FIG. 10], and the signal peptide is cleaved at positions other than amino acids 27-28. Therefore, the secretion process guided by the signal peptide of a-L-iduronidase is not a classical protein secretion process from the endoplasmic reticulum to the outside of the cell after translation, and may be a process that mainly guides the translated protein from the endoplasmic reticulum to lysosomes. Since the 90 s of the 20 th century, attempts have been made to extract natural enzymes from various tissues, and it has been found that amino termini of enzymes derived from various tissues are greatly different [ elements et al, 1989 ]. This finding also indirectly confirms that the so-called 'secretory signal peptide' of a-L-iduronidase is not a signal peptide of a typical secretory protein. Nevertheless, Neufeld discovered in 1968 that a-L-iduronidase was still secreted from cells and taken up by cells with genetic defects in the periphery, alleviating the cytoplasmic mucopolysaccharide storage phenotype [ Fratantoni et al, 1968], which was a foundational finding that has subsequently been used to facilitate the genetic engineering of a-L-iduronidase and for clinical therapy.
Further studies have shown that a-L-iduronidase is subject to a number of post-translational modifications, in addition to the usual N-glycosylation, also phosphorylation of two mannose sugar groups (bisphophorylated oligomanosidic glyco-can). The phosphorylation of Mannose residues at amino acids 347 and 390, which are essential for substrate recognition [ Maita et al, 2013], 311 and 390, are key modifications for binding to the 6-phospho-Mannose receptor (M6-phosphate receptor, M6PR) and achieving cellular internalization, crossing the cell membrane by M6PR, entering the cytosol from outside the cell, and then entering the lysosome by M6 PR. Thus, it is considered that post-translational modification is indispensable for the a-L-iduronidase to exert normal activity. Since the process by which these specific post-translational modifications (phosphorylation of N-glycosylation) are formed is still unclear, it is hypothesized that they may be formed during secretion.
Based on the complex post-translational modification of a-L-iduronidase (N-glycosylation phosphorylation), the first generation recombinase (Aldurazyme) carefully utilized its own native signal peptide and was produced by DNA recombination technology in CHO cells under medium conditions of 10% FBS and harvested from the culture supernatant. But the yield was low, about 20-40. mu.g/day/10E 7 cells. Low expression levels present many challenges for subsequent purification; especially if the native signal peptide of a-L-iduronidase is used, it may result in inconsistent amino termini, greatly limiting the purity and homogeneity of the recombinase.
Therefore, those skilled in the art have made efforts to develop a recombinant a-L-iduronate-prase capable of increasing the expression level of a-L-iduronate-prase and a method for producing the same.
Disclosure of Invention
In view of the above-mentioned drawbacks of the prior art, the present invention provides a recombinant a-L-iduronidase capable of increasing the expression level of the a-L-iduronidase, and a method for preparing the same.
To achieve the above objects, one aspect of the present invention provides an isolated nucleotide sequence comprising a signal peptide sequence and a nucleotide sequence encoding a-L-iduronidase, wherein the signal peptide sequence is not the native signal peptide sequence of a-L-iduronidase.
Furthermore, the signal peptide is selected from the sequences shown in SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. 3 or SEQ ID No. 4.
Furthermore, the nucleotide sequence of the coding a-L-iduronidase is shown in SEQ ID No. 9.
Optionally, the isolated nucleotide sequence is selected from the group consisting of:
1) SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7 or SEQ ID No. 8;
2) a nucleotide sequence having at least more than 80% homology with SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7 or SEQ ID No. 8;
3) the codon for the polypeptide is degenerate with the coding part of the nucleotide sequence in 1) or 2) and codes for a nucleotide sequence of a-L-iduronidase.
In a second aspect the present invention provides a recombinant a-L-iduronidase comprising an a-L-iduronidase and a heterologous signal peptide capable of being cleaved at a single site.
Alternatively, the amino acid sequence is selected from the following sequences:
1) SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12 or SEQ ID No. 13;
2) an amino acid sequence having at least 95% homology with SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12 or SEQ ID No. 13.
Further, the recombinant a-L-iduronidase has glycosylation modifications at positions 311, 390 and 426.
Further, glycosylation is modified to GlcNAc (2) Man (7) P (2).
Further, cellular internalization of recombinant a-L-iduronidase is mediated by M6 PR.
In a third aspect of the invention, there is provided a process for the preparation of a-L-iduronidase for eukaryotic expression using an isolated nucleotide sequence as described above.
Alternatively, the isolated nucleotide sequence may be preceded by a KOZARK sequence, constructed into a eukaryotic expression vector, and then transferred into a eukaryotic expression system for expression.
Optionally, the KOZARK sequence is GCCGCCACCATGC.
Alternatively, eukaryotic expression is expression in mammalian cells.
Alternatively, the mammalian cell is CHOK1SV GS-KO.
Further, the preparation method comprises the following steps: constructing a recombinant eukaryotic expression vector containing the isolated nucleotide sequence; transferring the recombinant eukaryotic expression plasmid into an eukaryotic expression system for eukaryotic expression; collecting supernatant, and purifying to obtain the alpha-L-iduronidase.
Further, purification includes one or more of affinity chromatography, hydrophobic interaction chromatography, and ion exchange chromatography.
Alternatively, the purification is a three-step purification including affinity chromatography, hydrophobic interaction chromatography, and ion exchange chromatography.
Alternatively, affinity chromatography is a linear gradient elution using agarose gel to obtain an eluate containing the protein of interest. Wherein the elution buffer is buffer B, 20mM PB,2M NaCl, pH5.3.
Optionally, hydrophobic interaction chromatography employs a hydrophobic chromatography column for removing hetero-proteins, polymers; adding 4M NaCl to a final concentration of 2M before sample loading, adjusting the pH to 5-6 by using 1M NaOH, and isocratically eluting the target protein by using an elution buffer solution. Wherein the elution buffer is: 20mM PB 0.15M NaCl pH 5.5.
Alternatively, ion exchange chromatography uses a salt tolerant cation column, and after sample loading and equilibration, the target protein is eluted with an elution buffer in a linear gradient of 0-70%. Wherein the washing buffer is 20mM PB 1M NaCl pH5.5.
Optionally, the purified solution is concentrated by ultrafiltration using a 30kDa membrane to a phosphate buffer containing 0.0001% Tween 80, pH 5.5.
The heterologous signal peptide in the invention is used, not only the expression quantity of the recombinant alpha-L-iduronate prase is improved, but also the protein with single cutting point and correct post-translational modification (N-glycosylation phosphorylation) can be generated, and the heterologous signal peptide can cross cell membranes and exert the enzymatic activity in cells.
The conception, the specific steps, and the technical effects produced by the present invention will be further described in conjunction with the accompanying drawings to fully understand the objects, the features, and the effects of the present invention.
Drawings
FIG. 1 is an electrophoretogram of cell supernatant protein expressed by mammalian cells of a-L-iduronidase of 5 different signal peptides according to example 2 of the present invention;
FIG. 2 is a primary LC-MS spectrum of N-terminal sequence analysis of a commercial Aldurazyme;
FIG. 3 is a primary LC-MS spectrum of N-terminal sequence analysis of ASP-1-Laronidase prepared in example 3 of the present invention;
FIG. 4(A) is a secondary LC-MS/MS spectrum of a commercial N-glycosylation phosphorylation assay of Aldurazyme; (B) is the theoretical molecular weight of the polypeptide fragment ion matched (mass deviation less than 20ppm) in this secondary LC-MS/MS profile;
FIG. 5(A) is a secondary LC-MS/MS spectrum of N-glycosylation phosphorylation analysis of ASP-1-Laronidase prepared in example 3 of the present invention; (B) is the theoretical molecular weight of the polypeptide fragment ion matched (mass deviation less than 20ppm) in this secondary LC-MS/MS profile;
FIG. 6 is a schematic diagram of the degradation of Alpha-L tetrahydroxy epoxy-4-methylumbelliferyl valerate, a substrate by Alpha-L-iduronate to produce the product 4-methylumbelliferone (4-methylumbelliferone) with a fluorescent signal;
FIG. 7 is a graph showing the results of an experiment of the enzyme activities of ASP-1-Laronidase and commercial Aldura zyme in example 4 of the present invention;
FIG. 8 is a Western Blotting identification of the efficiency of ASP-1-Laronidase with commercial Aldura zyme in HEK293 in the presence and absence of M6P; HEK293 cells were cultured for 3 days in medium containing 0. mu.g/ml ASP-1-Laronidase and Aldura zyme (lane 1), 10. mu.g/ml ASP-1-Laronidase (lane 2), 10. mu.g/ml Aldura zyme (lane 4), 10. mu.g/ml ASP-1-Laronidase +10mM/L M6P (lane 3), 10. mu.g/ml Aldura zyme +10mM/L M6P (lane 5), then harvested for lysis and Western Blotting assay using 50ng total protein. St: (ASP-1-Laronidase);
FIG. 9 is a graph comparing the cell internalization experiments of ASP-1-Laronidase according to example 6 of the present invention with commercial Aldura zyme;
FIG. 10 is a diagram for analyzing cleavage positions of an atypical signal peptide sequence in the amino acid sequence of a-L-iduronidase using signal peptide prediction software [ SignalP ].
Detailed Description
The technical contents of the preferred embodiments of the present invention will be more clearly and easily understood by referring to the drawings attached to the specification. The present invention may be embodied in many different forms of embodiments and the scope of the invention is not limited to the embodiments set forth herein.
Experimental procedures without specific conditions noted in the following examples, generally followed by conventional conditions, such as molecular cloning in Sambrook et al: conditions described in a Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,1989), or according to the manufacturer's recommendations. The reagents used are commercially available or publicly available reagents unless otherwise specified.
In the present invention, various vectors known in the art, such as commercially available vectors, including plasmids and the like, can be used.
According to a specific embodiment of the invention, an artificial signal peptide sequence is added before a coding nucleotide sequence of the a-L-iduronidase to form a recombinant nucleotide sequence, and the recombinant nucleotide sequence is used for eukaryotic expression to obtain the recombinant a-L-iduronidase. In one embodiment of the invention, the artificial signal peptide (SEQ ID No:1, SEQ ID No:2, SEQ ID No:3 or SEQ ID No:4) in the sequence 4 can be used for well improving the expression level of the a-L-iduronidase, and the a-L-iduronidase with single cleavage point and correct post-translational modification (N-glycosylation phosphorylation) can be obtained.
EXAMPLE 1 Synthesis of 5 plasmids a-L-iduronate Prolase for expression of 5 different Signal peptides
Laronidase plasmids with 4 different signal peptides (ASP-1, SEQ ID No: 1; ASP-2, SEQ ID No: 2; ASP-3, SEQ ID No: 3; ASP-4, SEQ ID No:4) and Laronidase plasmids containing Natural Signal Peptide (NSP) are synthesized by Jinwei Zhi company, wherein the nucleotide sequences of the enzymes with artificial signal peptides are shown as SEQ ID No:5 (with ASP-1), SEQ ID No:6 (with ASP-2), SEQ ID No:7 (with ASP-3) and SEQ ID No:8 (with ASP-4); the nucleotide sequence of the enzyme with the natural signal peptide is shown as SEQ ID No. 14, and the amino acid sequence is shown as SEQ ID No. 15. The vector used was pZD, the nucleotide sequence of which is shown in SEQ ID No. 16.
EXAMPLE 25 mammalian cell expression of a-L-iduronidase with different Signal peptides and culture supernatant quantification
2.1 mammalian cells express 5 different signal peptides of a-L-iduronate prase
Extracting plasmids from ASP-1-Laronidase-pZD, ASP-2-Laronidase-pZD, ASP-3-Laronidase-pZD, ASP-4-Laronidase-pZD and NSP-Laronidase-pZD, and mixing the plasmids with PEI 1: 5, and transfected into HEK293 cells. After 5 days, the supernatant was collected, cell debris removed, and the supernatant was adjusted to pH5.5 with 1M phosphoric acid. 25ul of each expression supernatant was taken, 5ul of protein loading buffer was added, and the sample was boiled at 95 ℃ for 5 minutes. Run 12% SDS-protein electrophoresis.
Protein electrophoresis results are shown in figure 1, and it can be seen visually from the electrophoresis results of cell supernatants expressing Laronidase by 5 different signal peptides that the expression level of Laronidase by using four heterologous signal peptides is improved compared with the expression level of natural signal peptide NSP.
2.2 quantification of supernatant
The RP-HPLC method can be used for content determination of Laronidase samples.
RP-HPLC quantitative method on Agilent's HPLC high performance liquid system, the network version of Waters' Empower3 liquid workstation control system, data acquisition and analysis used. HPLC chromatography column for detection of ACE 5C4-300,250X4.6mm I.D. using ACE. The run method time for each sample was 30 minutes. The flow rate was 0.8ml/min, the column temperature was 75 ℃ and the UV absorption was 215 nm. Mobile phase a was an aqueous solution containing 0.1% trifluoroacetic acid and 10% acetonitrile, and mobile phase B was an acetonitrile solution containing 0.1% trifluoroacetic acid. In an Agilent HPLC system, first equilibrate with 20% buffer B for 3 minutes. At 3.1 min, the buffer B ratio was increased to 31% (i.e., the initial buffer B ratio in the elution zone was X%), and the elution was carried out isocratically for 6 min. The elution of the target protein was eluted with a 31% -43% buffer B linear gradient over 15 min (i.e., the linear change of buffer B was from X% to [ X +12 ]%) over 15 min. The column was washed with 90% B for 4 minutes before re-equilibrating the column.
The results are shown in Table 2, and the Laronidase and NSP-Laronidase of 4 heterologous signal peptides are compared, and the expression levels of Asp-1-Laronidase, Asp-2-Laronidase, Asp-3-Laronidase and Asp-4-Laronidase are obviously improved. Wherein the expression levels of Asp-1-Laronidase and Asp-4-Laronidase respectively reach 3.38 times and 3.59 times of NSP-Laronidase.
Table 25 transient expression of Laronidase for different Signal peptides in HEK 293-UPLC quantification
Name (R) | Concentration ug/ml |
ASP-1-Laronidase | 12.5 |
ASP-2-Laronidase | 9.6 |
ASP-3-Laronidase | 6.9 |
ASP-4-Laronidase | 13.3 |
NSP-Laronidase | 3.7 |
Example 3 Mass Spectrometry N-terminal analysis and N-glycosylation phosphorylation analysis of ASP-1-Laronidase protein
3.1 ASP-1-Laronidase was synthesized by Kinzhi corporation using a plasmid provided in Lonza as a vector, to which a suitable KOZAK sequence (GCCGCCACCATGC, SEQ ID NO:17) was added before the ASP-1-Laronidase sequence.
3.2 monoclonal construction and selection were carried out using cell line CHOK1SV GS-KO (https:// go2.Lonza. com/Biologics _ CHOK1SV. html) from Lonza, according to the construction and selection program provided in line with Lonza. The obtained stable monoclonal cell line was then expressed according to the culture method provided by Lonza. After 9 days of expression, cell supernatant was harvested and purified to obtain high-purity ASP-1-Laronidase protease.
Wherein, the purification is three-step purification including affinity chromatography, hydrophobic interaction chromatography and ion exchange chromatography. The method specifically comprises the following steps: adjusting pH of the cell culture supernatant to 5.3 with 1M phosphoric acid, and clarifying by membrane deep filtration; the supernatant was filtered and loaded onto a Blue Sepharose 6Fast Flow (Cytiva 17-0948-01) column equilibrated with 20mM PB 0.15M NaCl pH5.3, and after loading, washed with an equilibration buffer until the baseline was stabilized to remove most of non-specifically bound impurities (e.g., host proteins, host DNA, etc.), and finally subjected to linear gradient elution with 0-80% buffer B (20mM PB,2M NaCl, pH5.3), analyzed by SDS-PAGE, and the elution peaks containing the target protein were combined.
The Blue Sepharose 6Fast Flow pool is purified by a hydrophobic column (e.g., Cytiva Phenyl Sepharose Fast Flow (High Sub) Butyl Sepharose 4Fast Flow, Butyl-650M available from Tosoh, Phenyl-5PW, Phenyl Bestarose FF (HS) available from Bogelon) to remove hetero-proteins, multimers, etc. Adding 4M NaCl to a final concentration of 2M before loading the collection solution, adjusting pH to 5.5 with 1M NaOH, loading the collection solution onto a hydrophobic chromatography column balanced with 20mM PB 2M NaCl pH5.5, eluting with an equilibrium buffer solution until the balance of a base line is achieved, isocratically eluting the target protein with 20mM PB 0.15M NaCl pH5.5, and collecting the eluate containing the target protein.
Further purifying ASP-1-Laronidase with salt-tolerant cation column (such as Sulfate-650F (Tosoh), POROS XS (Thermo Scientific), Eshmuno-CPS (Merck)). Loading the collected liquid from the hydrophobic chromatography onto a salt-tolerant cation column balanced by 20mM PB 0.15M NaCl pH5.5, eluting with the balanced liquid until the baseline is stable, and finally eluting the target protein with 20mM PB 1M NaCl pH5.5 according to a linear gradient of 0-70%. Collecting the eluent. The final sample was concentrated by ultrafiltration using a 30kDa membrane in a buffer containing 0.0001% Tween 80pH 5.5 phosphate.
3.3 ASP-1-Laronidase and commercial Aldura zyme were sent to great health Corp and protein N-terminal analysis was performed using LC-MS/MS. The results are shown in Table 3, FIG. 2, FIG. 3, FIG. 4, FIG. 5.
TABLE 3 commercial Aldurazyme and ASP-1-Laronidase enzymatic hydrolysis to N-terminal polypeptide LC-MS/MS Using Trypsin
As is clear from Table 3 and FIGS. 2 to 5, the LC-MS/MS results show that the cleavage site of the signal peptide of the ASP-1 signal peptide used after intracellular processing of the protein is consistent with that of commercial Aldulanzyme.
Wherein, when the IDUA gene expressing Laronidase is selected in this example, the selected gene is shown in SEQ ID No. 9, the protein expressed by the gene is different from commercial Aldurazyme in the 8 th position (without signal peptide) of the amino acid sequence, Laronidase in this example is 8Q, and commercial Aldurazyme is 8H. This is based on the polymorphic consideration of the IDUA gene, which accounts for about 85% of the population.
3.4 the high purity protein was sent to Zhengda health company for N-glycosylation phosphorylation analysis, and the results are shown in Table 4.
TABLE 4 sample quantitation of glycopeptide LC-MS/MS Using chymotrypsin hydrolysis
As can be seen from Table 4, the glycosylation modifications of commercial Aldura zyme and ASP-1-Laronidase were substantially the same.
Wherein, the 311 th, 390 th and 426 th sites of the modification site are the 311 th, 390 th and 426 th sites of Laronidase containing no signal peptide.
Example 4 identification of enzyme Activity of ASP-1-Laronidase with commercial Aldura zyme
4.1 sodium acetate solution at pH3.5 was prepared as the reaction buffer.
4.2 Glycine solution with pH10.7 was prepared as stop solution.
4.3 stock solutions of 4mg/mL substrate (Alpha-L tetrahydroxyepoxypentanoic acid-4-methylumbelliferyl ester) were prepared in DMSO and stored at-20 ℃.
4.4 the substrate stock was diluted to 800. mu.M with reaction buffer for use.
4.5 ASP-1-Laronidase was diluted to 2 μ M with commercial Aldura zyme in 0.9% sodium chloride solution, gradiently 10-fold diluted to 0.0002 μ M, and a blank (i.e., 0.9% sodium chloride solution without enzyme) was added.
4.6 taking 25 mul of substrate, 25 mul of enzyme and sodium chloride solution diluted to different concentrations in 4.5, uniformly mixing in a 96-hole fluorescence detection plate, preserving the temperature for 20 minutes at 37 ℃, and adding 200 mul of stop solution to stop the reaction.
4.7 placing the 96-hole fluorescence detection plate into an enzyme-labeling instrument for detection, and setting the parameters of the enzyme-labeling instrument as follows: excitation 360/40nm, emission 460/40 nm.
ASP-1-Laronidase and commercial Aldura zyme can decompose the substrate Alpha-L tetrahydroxy epoxy valeric acid-4-methylumbelliferyl ester to produce 4-methylumbelliferone (4-methylumbelliferone) with a fluorescent signal, as shown in FIG. 6.
ASP-1-Laronidase and commercial Aldura zyme with the same molar concentration and volume are used for reacting with a substrate with the same system, volume and concentration, the generated product is subjected to fluorescence value detection, and the magnitude of a fluorescence signal value directly shows the enzyme activity. The results are shown in FIG. 7, which shows that ASP-1-Laronidase and commercial Aldura zyme have equivalent enzyme activities at the same molar concentration and volume.
Example 5 internalization of ASP-1-Laronidase with commercial Aldura zyme was mediated by M6PR
5.1 Day 0
5.1.1 digestion of HEK293 cells in T25 flasks with 500ul of pancreatin for 1-2 min;
5.1.2 Add 5ml DMEM (10% FBS +25ug/ml G418), blow open the cells, mix;
5.1.3 cell count, then 0.2X 10 cells 6 5 bottles of/ml, 5ml/T25 bottles;
5.1.43-4 h later, 4 flasks of cells from 5.1.3 were taken and 10ug/ml ASP-1-Laronidase (without M6P), Aldura zyme (without M6P), ASP-1-Laronidase (with 10mM/L M6P), and Aldura zyme (with 10mM/L M6P) were added.
5.2 Day 3
5.2.1 trypsinizing the treated HEK293 cells cultured for 3 days, adding 500ul of pancreatin per T25 bottle for 1-2 min;
5.2.2 Add 3ml DMEM (10% FBS +25ug/ml G418), blow open the cells, mix well, and count;
5.2.3 based on counts, calculate 3 x 10 6 Volume of individual cells. Centrifuging the cells in the corresponding volume, removing the culture medium, and adding PBS for washing for 2 times;
5.2.4 lysis by ultrasonication and measurement of total protein concentration by BCA kit;
and 5.2.5, taking 50ug of the ultrasonically-crushed protein to carry out Western blotting detection. The protein samples are separated by SDS-PAGE, transferred to a PVDF membrane (Roche) by a semidry method, closed, sequentially added with a primary antibody (sheet Anti-Laronidase antibody) and a secondary antibody (peroxidase Anti-sheet IgG) for warm bath, and finally added with a developing solution for developing and photographing, wherein the results are shown in figure 8.
As shown in FIG. 8, after 3 days of culture of HEK293 cells in the presence of M6P in medium containing ASP-1-Laronidase or commercial Aldurazyme, no ASP-1-Laronidase or commercial Aldurazyme was detected in the lysates of HEK293 cells. Indicating that cellular internalization of both is mediated by M6 PR.
Example 6 cellular internalization efficiency characterization of ASP-1-Laronidase and commercial Aldura zyme in HEK293 cells
6.1 HEK293 cells were plated at 0.2X 10 6 The density per ml was seeded in 6-well plates with 2ml per well volume.
6.2 ASP-1-Laronidase and commercial Aldura zyme were added to the above HEK293 medium in amounts of 40. mu.g/ml, 20. mu.g/ml, 10. mu.g/ml, 0.055. mu.g/ml, 0.0275. mu.g/ml, respectively, and incubated on a shaker.
6.3 after 2.5h incubation, approximately 2ml of each cell was transferred to a new centrifuge tube, centrifuged at 200g for 5 min, resuspended in PBS and washed 3 times. To this end, the internalization process of ASP-1-Laronidase with commercial Aldura zyme was terminated. The washed cells were lysed by adding 300. mu.l of a cell lysate (Thermo/#78501), and the resulting cell lysate was regarded as a crude enzyme solution of ASP-1-Laronidase or Aldura zyme.
6.4 quantitate the concentration of total protein in the above crude enzyme solution using the BCA kit.
6.5A solution of sodium acetate at pH3.5 was prepared as the reaction buffer.
6.6 the substrate (Alpha-L tetrahydroxyepoxyvalerate-4-methylumbelliferyl) stock solution in 4.3 was diluted to 360. mu.M with reaction buffer for further use.
6.7 preparation of cell lysates of untreated (i.e.no enzyme added to the culture broth) HEK293 cells. Taking 15ml of the mixture with the density of 0.2 multiplied by 10 6 Perml of HEK293 cells into a new centrifuge tube, 200g centrifugation for 5 minutes, heavy suspension of cells with PBS, repeated washing 3 times, adding 7ml of fineThe cell lysate is lysed.
6.8 the product (4-methylumbelliferone) was diluted to 1.6. mu.g/ml, 0.8. mu.g/ml, 0.4. mu.g/ml, 0.2. mu.g/ml, 0.1. mu.g/ml, 0.05. mu.g/ml, 0.025. mu.g/ml with the mixture (50% of the untreated HEK293 cell lysate + 50% reaction buffer described in 6.7) and a blank was set.
6.9 mu.l of the 6.3 medium crude enzyme solution and 50. mu.l of 360. mu.M substrate were added to a 96-well fluorescent assay plate, while 100. mu.L of the product dilution described in 6.8 was added to the same 96-well fluorescent assay plate.
6.10 putting the 96-hole fluorescence detection plate into a microplate reader to detect the generation amount of the fluorescence product, and performing dynamic detection for 1.5 hours once per minute. (the parameters of the microplate reader are set as follows: 37 ℃, excitation 360/40nm and emission 460/40 nm.)
6.11 Final 3-16 minute readings were selected for enzyme activity unit calculation. The generation amount of the fluorescent product at the early stage of the reaction is in direct proportion to the time, the principle of minimum error is considered, and the reading value of 3-16 minutes is selected for calculating the final result.
Briefly, the same known concentration of ASP-1-Laronidase and commercial Aldura zyme were added to the same amount of HEK293 for incubation, and after 2.5 hours, a portion of the protein was internalized into HEK293 cells. At this point the cells were washed 3 times by centrifugation to remove enzymes that were not internalized into the cells. The cells were lysed separately with the same volume of cell lysate to obtain cell lysates (i.e., crude enzyme solutions of ASP-1-Laronidase or Aldura zyme). Due to the same setting of other conditions, the enzyme activity of the crude enzyme solution and the content of the enzyme internalized into the cells show positive correlation. Therefore, the condition of cell internalization can be reflected by comparing the enzyme activity of the crude enzyme solution. Background controls without enzyme were subtracted during the calculation.
As shown in FIG. 9, the cellular internalization rate of ASP-1-Laronidase was not significantly different from that of commercial Aldura zyme.
The foregoing detailed description of the preferred embodiments of the invention has been presented. It should be understood that numerous modifications and variations could be devised by those skilled in the art in light of the present teachings without departing from the inventive concepts. Therefore, the technical solutions available to those skilled in the art through logic analysis, reasoning and limited experiments based on the prior art according to the concept of the present invention should be within the scope of protection defined by the claims.
SEQUENCE LISTING
<110> Beijing Datder pharmaceutical science and technology Co., Ltd
<120> a recombinant a-L-iduronate prase and preparation method thereof
<160> 17
<170> PatentIn version 3.5
<210> 1
<211> 48
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
atgcctctgc tgctgctgct gcctctgctg tgggctggag ccctcgct 48
<210> 2
<211> 75
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
atggctgctg ctgccatccc tgccctgctg ctgtgcctgc ccctgctgtt cctgctgttt 60
<210> 3
<211> 60
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 3
atgtacagga tgcaactcct gtcttgcatt gcactaagtc ttgcacttgt cacgaattcc 60
<210> 4
<211> 60
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 4
atggagacag acacactcct gctatgggta ctgctgctct gggttccagg ttccactggt 60
<210> 5
<211> 1932
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
atgcctctgc tgctgctgct gcctctgctg tgggctggag ccctcgctgc tgaagctcct 60
cacctggtcc aggtcgatgc cgcaagagca ctgtggcctc tgcgcaggtt ctggagaagc 120
accggattct gtccacctct gcctcattca caggccgacc agtacgtcct gtcctgggat 180
cagcagctga acctggccta cgttggtgct gtgccacaca gaggcatcaa gcaagttcgc 240
acacattggc tgctggagct ggttacaaca agaggctcaa caggaagagg actgtcctat 300
aacttcacac acctggacgg ctatctcgac ctcctccggg agaatcagct gctcccaggg 360
tttgagctga tgggatcagc aagcggacac tttactgact tcgaggacaa gcaacaagtg 420
tttgagtgga aggatctcgt cagctctctg gcaagaaggt atatcggacg gtatggcctg 480
gcacacgtga gcaagtggaa tttcgaaaca tggaacgagc cagaccatca cgatttcgac 540
aacgttagca tgaccatgca gggatttctc aactactacg atgcctgctc agagggcctg 600
agggcagctt ctccagccct gcgcctgggt ggtcctggcg actctttcca cactccaccc 660
agaagccctc tgtcatgggg gctcctcaga cattgtcacg acggcaccaa tttcttcact 720
ggcgaggctg gtgttcgcct cgattacatt agtctccatc gcaaaggcgc aaggtctagt 780
atatcaatcc tggagcagga gaaagtggtg gcccagcaga tcaggcagct gttccctaag 840
ttcgccgata ctcccatata caatgacgaa gccgatccac tggtcggttg gtccctgcct 900
cagccctgga gagccgatgt gacttacgct gcaatggtcg tgaaagtgat cgcccagcac 960
cagaacctgc tgctggcaaa cactacttct gcattccctt acgcactcct gagcaacgat 1020
aacgctttcc tcagctacca ccctcaccct ttcgcccaac ggaccctgac agctcgcttt 1080
caggtgaata acactaggcc acctcatgtg cagctgctga gaaagcccgt gctcacagca 1140
atgggtctgc tggctctcct ggacgaagag caactgtggg ctgaggtgtc ccaggctggc 1200
accgttctgg atagtaatca cacagtcggc gtgctcgcct ccgcacatcg gcctcaaggc 1260
cctgctgatg cctggagagc cgctgttctc atctatgcta gtgacgacac acgggcacat 1320
ccaaacaggt ccgtggccgt cactctgcgc ctgagaggag tgccaccagg accagggctc 1380
gtctatgtta caagatacct ggataacgga ctgtgtagcc cagacggaga atggcgcagg 1440
ctgggtcgcc cagtctttcc tacagcagag cagttcagga gaatgagagc agctgaggac 1500
cctgtcgcag cagctcccag accactgcct gccggaggtc ggctgacact caggccagcc 1560
ctcagactgc cttctctcct cctcgtgcac gtgtgtgctc ggcctgagaa acctccaggt 1620
caagtcacta ggctgagggc actccctctg acccagggtc agctggtgct ggtttggtca 1680
gatgagcacg ttgggagcaa gtgtctgtgg acttacgaga ttcaattctc acaggacggg 1740
aaggcttata caccagttag ccgcaaacca agcactttca acctctttgt gtttagtcct 1800
gacacaggtg cagtctccgg ctcttacagg gttcgggctc tggactattg ggccagacct 1860
ggacctttct cagacccagt gccctacctg gaagtgccag tccctagagg cccaccttct 1920
cctggaaatc cc 1932
<210> 6
<211> 1959
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
atggctgctg ctgccatccc tgccctgctg ctgtgcctgc ccctgctgtt cctgctgttt 60
ggctggagca gagccgctga agctcctcac ctggtccagg tcgatgccgc aagagcactg 120
tggcctctgc gcaggttctg gagaagcacc ggattctgtc cacctctgcc tcattcacag 180
gccgaccagt acgtcctgtc ctgggatcag cagctgaacc tggcctacgt tggtgctgtg 240
ccacacagag gcatcaagca agttcgcaca cattggctgc tggagctggt tacaacaaga 300
ggctcaacag gaagaggact gtcctataac ttcacacacc tggacggcta tctcgacctc 360
ctccgggaga atcagctgct cccagggttt gagctgatgg gatcagcaag cggacacttt 420
actgacttcg aggacaagca acaagtgttt gagtggaagg atctcgtcag ctctctggca 480
agaaggtata tcggacggta tggcctggca cacgtgagca agtggaattt cgaaacatgg 540
aacgagccag accatcacga tttcgacaac gttagcatga ccatgcaggg atttctcaac 600
tactacgatg cctgctcaga gggcctgagg gcagcttctc cagccctgcg cctgggtggt 660
cctggcgact ctttccacac tccacccaga agccctctgt catgggggct cctcagacat 720
tgtcacgacg gcaccaattt cttcactggc gaggctggtg ttcgcctcga ttacattagt 780
ctccatcgca aaggcgcaag gtctagtata tcaatcctgg agcaggagaa agtggtggcc 840
cagcagatca ggcagctgtt ccctaagttc gccgatactc ccatatacaa tgacgaagcc 900
gatccactgg tcggttggtc cctgcctcag ccctggagag ccgatgtgac ttacgctgca 960
atggtcgtga aagtgatcgc ccagcaccag aacctgctgc tggcaaacac tacttctgca 1020
ttcccttacg cactcctgag caacgataac gctttcctca gctaccaccc tcaccctttc 1080
gcccaacgga ccctgacagc tcgctttcag gtgaataaca ctaggccacc tcatgtgcag 1140
ctgctgagaa agcccgtgct cacagcaatg ggtctgctgg ctctcctgga cgaagagcaa 1200
ctgtgggctg aggtgtccca ggctggcacc gttctggata gtaatcacac agtcggcgtg 1260
ctcgcctccg cacatcggcc tcaaggccct gctgatgcct ggagagccgc tgttctcatc 1320
tatgctagtg acgacacacg ggcacatcca aacaggtccg tggccgtcac tctgcgcctg 1380
agaggagtgc caccaggacc agggctcgtc tatgttacaa gatacctgga taacggactg 1440
tgtagcccag acggagaatg gcgcaggctg ggtcgcccag tctttcctac agcagagcag 1500
ttcaggagaa tgagagcagc tgaggaccct gtcgcagcag ctcccagacc actgcctgcc 1560
ggaggtcggc tgacactcag gccagccctc agactgcctt ctctcctcct cgtgcacgtg 1620
tgtgctcggc ctgagaaacc tccaggtcaa gtcactaggc tgagggcact ccctctgacc 1680
cagggtcagc tggtgctggt ttggtcagat gagcacgttg ggagcaagtg tctgtggact 1740
tacgagattc aattctcaca ggacgggaag gcttatacac cagttagccg caaaccaagc 1800
actttcaacc tctttgtgtt tagtcctgac acaggtgcag tctccggctc ttacagggtt 1860
cgggctctgg actattgggc cagacctgga cctttctcag acccagtgcc ctacctggaa 1920
gtgccagtcc ctagaggccc accttctcct ggaaatccc 1959
<210> 7
<211> 1944
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
atgtacagga tgcaactcct gtcttgcatt gcactaagtc ttgcacttgt cacgaattcc 60
gctgaagctc ctcacctggt ccaggtcgat gccgcaagag cactgtggcc tctgcgcagg 120
ttctggagaa gcaccggatt ctgtccacct ctgcctcatt cacaggccga ccagtacgtc 180
ctgtcctggg atcagcagct gaacctggcc tacgttggtg ctgtgccaca cagaggcatc 240
aagcaagttc gcacacattg gctgctggag ctggttacaa caagaggctc aacaggaaga 300
ggactgtcct ataacttcac acacctggac ggctatctcg acctcctccg ggagaatcag 360
ctgctcccag ggtttgagct gatgggatca gcaagcggac actttactga cttcgaggac 420
aagcaacaag tgtttgagtg gaaggatctc gtcagctctc tggcaagaag gtatatcgga 480
cggtatggcc tggcacacgt gagcaagtgg aatttcgaaa catggaacga gccagaccat 540
cacgatttcg acaacgttag catgaccatg cagggatttc tcaactacta cgatgcctgc 600
tcagagggcc tgagggcagc ttctccagcc ctgcgcctgg gtggtcctgg cgactctttc 660
cacactccac ccagaagccc tctgtcatgg gggctcctca gacattgtca cgacggcacc 720
aatttcttca ctggcgaggc tggtgttcgc ctcgattaca ttagtctcca tcgcaaaggc 780
gcaaggtcta gtatatcaat cctggagcag gagaaagtgg tggcccagca gatcaggcag 840
ctgttcccta agttcgccga tactcccata tacaatgacg aagccgatcc actggtcggt 900
tggtccctgc ctcagccctg gagagccgat gtgacttacg ctgcaatggt cgtgaaagtg 960
atcgcccagc accagaacct gctgctggca aacactactt ctgcattccc ttacgcactc 1020
ctgagcaacg ataacgcttt cctcagctac caccctcacc ctttcgccca acggaccctg 1080
acagctcgct ttcaggtgaa taacactagg ccacctcatg tgcagctgct gagaaagccc 1140
gtgctcacag caatgggtct gctggctctc ctggacgaag agcaactgtg ggctgaggtg 1200
tcccaggctg gcaccgttct ggatagtaat cacacagtcg gcgtgctcgc ctccgcacat 1260
cggcctcaag gccctgctga tgcctggaga gccgctgttc tcatctatgc tagtgacgac 1320
acacgggcac atccaaacag gtccgtggcc gtcactctgc gcctgagagg agtgccacca 1380
ggaccagggc tcgtctatgt tacaagatac ctggataacg gactgtgtag cccagacgga 1440
gaatggcgca ggctgggtcg cccagtcttt cctacagcag agcagttcag gagaatgaga 1500
gcagctgagg accctgtcgc agcagctccc agaccactgc ctgccggagg tcggctgaca 1560
ctcaggccag ccctcagact gccttctctc ctcctcgtgc acgtgtgtgc tcggcctgag 1620
aaacctccag gtcaagtcac taggctgagg gcactccctc tgacccaggg tcagctggtg 1680
ctggtttggt cagatgagca cgttgggagc aagtgtctgt ggacttacga gattcaattc 1740
tcacaggacg ggaaggctta tacaccagtt agccgcaaac caagcacttt caacctcttt 1800
gtgtttagtc ctgacacagg tgcagtctcc ggctcttaca gggttcgggc tctggactat 1860
tgggccagac ctggaccttt ctcagaccca gtgccctacc tggaagtgcc agtccctaga 1920
ggcccacctt ctcctggaaa tccc 1944
<210> 8
<211> 1944
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
atggagacag acacactcct gctatgggta ctgctgctct gggttccagg ttccactggt 60
gctgaagctc ctcacctggt ccaggtcgat gccgcaagag cactgtggcc tctgcgcagg 120
ttctggagaa gcaccggatt ctgtccacct ctgcctcatt cacaggccga ccagtacgtc 180
ctgtcctggg atcagcagct gaacctggcc tacgttggtg ctgtgccaca cagaggcatc 240
aagcaagttc gcacacattg gctgctggag ctggttacaa caagaggctc aacaggaaga 300
ggactgtcct ataacttcac acacctggac ggctatctcg acctcctccg ggagaatcag 360
ctgctcccag ggtttgagct gatgggatca gcaagcggac actttactga cttcgaggac 420
aagcaacaag tgtttgagtg gaaggatctc gtcagctctc tggcaagaag gtatatcgga 480
cggtatggcc tggcacacgt gagcaagtgg aatttcgaaa catggaacga gccagaccat 540
cacgatttcg acaacgttag catgaccatg cagggatttc tcaactacta cgatgcctgc 600
tcagagggcc tgagggcagc ttctccagcc ctgcgcctgg gtggtcctgg cgactctttc 660
cacactccac ccagaagccc tctgtcatgg gggctcctca gacattgtca cgacggcacc 720
aatttcttca ctggcgaggc tggtgttcgc ctcgattaca ttagtctcca tcgcaaaggc 780
gcaaggtcta gtatatcaat cctggagcag gagaaagtgg tggcccagca gatcaggcag 840
ctgttcccta agttcgccga tactcccata tacaatgacg aagccgatcc actggtcggt 900
tggtccctgc ctcagccctg gagagccgat gtgacttacg ctgcaatggt cgtgaaagtg 960
atcgcccagc accagaacct gctgctggca aacactactt ctgcattccc ttacgcactc 1020
ctgagcaacg ataacgcttt cctcagctac caccctcacc ctttcgccca acggaccctg 1080
acagctcgct ttcaggtgaa taacactagg ccacctcatg tgcagctgct gagaaagccc 1140
gtgctcacag caatgggtct gctggctctc ctggacgaag agcaactgtg ggctgaggtg 1200
tcccaggctg gcaccgttct ggatagtaat cacacagtcg gcgtgctcgc ctccgcacat 1260
cggcctcaag gccctgctga tgcctggaga gccgctgttc tcatctatgc tagtgacgac 1320
acacgggcac atccaaacag gtccgtggcc gtcactctgc gcctgagagg agtgccacca 1380
ggaccagggc tcgtctatgt tacaagatac ctggataacg gactgtgtag cccagacgga 1440
gaatggcgca ggctgggtcg cccagtcttt cctacagcag agcagttcag gagaatgaga 1500
gcagctgagg accctgtcgc agcagctccc agaccactgc ctgccggagg tcggctgaca 1560
ctcaggccag ccctcagact gccttctctc ctcctcgtgc acgtgtgtgc tcggcctgag 1620
aaacctccag gtcaagtcac taggctgagg gcactccctc tgacccaggg tcagctggtg 1680
ctggtttggt cagatgagca cgttgggagc aagtgtctgt ggacttacga gattcaattc 1740
tcacaggacg ggaaggctta tacaccagtt agccgcaaac caagcacttt caacctcttt 1800
gtgtttagtc ctgacacagg tgcagtctcc ggctcttaca gggttcgggc tctggactat 1860
tgggccagac ctggaccttt ctcagaccca gtgccctacc tggaagtgcc agtccctaga 1920
ggcccacctt ctcctggaaa tccc 1944
<210> 9
<211> 1884
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
gctgaagctc ctcacctggt ccaggtcgat gccgcaagag cactgtggcc tctgcgcagg 60
ttctggagaa gcaccggatt ctgtccacct ctgcctcatt cacaggccga ccagtacgtc 120
ctgtcctggg atcagcagct gaacctggcc tacgttggtg ctgtgccaca cagaggcatc 180
aagcaagttc gcacacattg gctgctggag ctggttacaa caagaggctc aacaggaaga 240
ggactgtcct ataacttcac acacctggac ggctatctcg acctcctccg ggagaatcag 300
ctgctcccag ggtttgagct gatgggatca gcaagcggac actttactga cttcgaggac 360
aagcaacaag tgtttgagtg gaaggatctc gtcagctctc tggcaagaag gtatatcgga 420
cggtatggcc tggcacacgt gagcaagtgg aatttcgaaa catggaacga gccagaccat 480
cacgatttcg acaacgttag catgaccatg cagggatttc tcaactacta cgatgcctgc 540
tcagagggcc tgagggcagc ttctccagcc ctgcgcctgg gtggtcctgg cgactctttc 600
cacactccac ccagaagccc tctgtcatgg gggctcctca gacattgtca cgacggcacc 660
aatttcttca ctggcgaggc tggtgttcgc ctcgattaca ttagtctcca tcgcaaaggc 720
gcaaggtcta gtatatcaat cctggagcag gagaaagtgg tggcccagca gatcaggcag 780
ctgttcccta agttcgccga tactcccata tacaatgacg aagccgatcc actggtcggt 840
tggtccctgc ctcagccctg gagagccgat gtgacttacg ctgcaatggt cgtgaaagtg 900
atcgcccagc accagaacct gctgctggca aacactactt ctgcattccc ttacgcactc 960
ctgagcaacg ataacgcttt cctcagctac caccctcacc ctttcgccca acggaccctg 1020
acagctcgct ttcaggtgaa taacactagg ccacctcatg tgcagctgct gagaaagccc 1080
gtgctcacag caatgggtct gctggctctc ctggacgaag agcaactgtg ggctgaggtg 1140
tcccaggctg gcaccgttct ggatagtaat cacacagtcg gcgtgctcgc ctccgcacat 1200
cggcctcaag gccctgctga tgcctggaga gccgctgttc tcatctatgc tagtgacgac 1260
acacgggcac atccaaacag gtccgtggcc gtcactctgc gcctgagagg agtgccacca 1320
ggaccagggc tcgtctatgt tacaagatac ctggataacg gactgtgtag cccagacgga 1380
gaatggcgca ggctgggtcg cccagtcttt cctacagcag agcagttcag gagaatgaga 1440
gcagctgagg accctgtcgc agcagctccc agaccactgc ctgccggagg tcggctgaca 1500
ctcaggccag ccctcagact gccttctctc ctcctcgtgc acgtgtgtgc tcggcctgag 1560
aaacctccag gtcaagtcac taggctgagg gcactccctc tgacccaggg tcagctggtg 1620
ctggtttggt cagatgagca cgttgggagc aagtgtctgt ggacttacga gattcaattc 1680
tcacaggacg ggaaggctta tacaccagtt agccgcaaac caagcacttt caacctcttt 1740
gtgtttagtc ctgacacagg tgcagtctcc ggctcttaca gggttcgggc tctggactat 1800
tgggccagac ctggaccttt ctcagaccca gtgccctacc tggaagtgcc agtccctaga 1860
ggcccacctt ctcctggaaa tccc 1884
<210> 10
<211> 644
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 10
Met Pro Leu Leu Leu Leu Leu Pro Leu Leu Trp Ala Gly Ala Leu Ala
1 5 10 15
Ala Glu Ala Pro His Leu Val Gln Val Asp Ala Ala Arg Ala Leu Trp
20 25 30
Pro Leu Arg Arg Phe Trp Arg Ser Thr Gly Phe Cys Pro Pro Leu Pro
35 40 45
His Ser Gln Ala Asp Gln Tyr Val Leu Ser Trp Asp Gln Gln Leu Asn
50 55 60
Leu Ala Tyr Val Gly Ala Val Pro His Arg Gly Ile Lys Gln Val Arg
65 70 75 80
Thr His Trp Leu Leu Glu Leu Val Thr Thr Arg Gly Ser Thr Gly Arg
85 90 95
Gly Leu Ser Tyr Asn Phe Thr His Leu Asp Gly Tyr Leu Asp Leu Leu
100 105 110
Arg Glu Asn Gln Leu Leu Pro Gly Phe Glu Leu Met Gly Ser Ala Ser
115 120 125
Gly His Phe Thr Asp Phe Glu Asp Lys Gln Gln Val Phe Glu Trp Lys
130 135 140
Asp Leu Val Ser Ser Leu Ala Arg Arg Tyr Ile Gly Arg Tyr Gly Leu
145 150 155 160
Ala His Val Ser Lys Trp Asn Phe Glu Thr Trp Asn Glu Pro Asp His
165 170 175
His Asp Phe Asp Asn Val Ser Met Thr Met Gln Gly Phe Leu Asn Tyr
180 185 190
Tyr Asp Ala Cys Ser Glu Gly Leu Arg Ala Ala Ser Pro Ala Leu Arg
195 200 205
Leu Gly Gly Pro Gly Asp Ser Phe His Thr Pro Pro Arg Ser Pro Leu
210 215 220
Ser Trp Gly Leu Leu Arg His Cys His Asp Gly Thr Asn Phe Phe Thr
225 230 235 240
Gly Glu Ala Gly Val Arg Leu Asp Tyr Ile Ser Leu His Arg Lys Gly
245 250 255
Ala Arg Ser Ser Ile Ser Ile Leu Glu Gln Glu Lys Val Val Ala Gln
260 265 270
Gln Ile Arg Gln Leu Phe Pro Lys Phe Ala Asp Thr Pro Ile Tyr Asn
275 280 285
Asp Glu Ala Asp Pro Leu Val Gly Trp Ser Leu Pro Gln Pro Trp Arg
290 295 300
Ala Asp Val Thr Tyr Ala Ala Met Val Val Lys Val Ile Ala Gln His
305 310 315 320
Gln Asn Leu Leu Leu Ala Asn Thr Thr Ser Ala Phe Pro Tyr Ala Leu
325 330 335
Leu Ser Asn Asp Asn Ala Phe Leu Ser Tyr His Pro His Pro Phe Ala
340 345 350
Gln Arg Thr Leu Thr Ala Arg Phe Gln Val Asn Asn Thr Arg Pro Pro
355 360 365
His Val Gln Leu Leu Arg Lys Pro Val Leu Thr Ala Met Gly Leu Leu
370 375 380
Ala Leu Leu Asp Glu Glu Gln Leu Trp Ala Glu Val Ser Gln Ala Gly
385 390 395 400
Thr Val Leu Asp Ser Asn His Thr Val Gly Val Leu Ala Ser Ala His
405 410 415
Arg Pro Gln Gly Pro Ala Asp Ala Trp Arg Ala Ala Val Leu Ile Tyr
420 425 430
Ala Ser Asp Asp Thr Arg Ala His Pro Asn Arg Ser Val Ala Val Thr
435 440 445
Leu Arg Leu Arg Gly Val Pro Pro Gly Pro Gly Leu Val Tyr Val Thr
450 455 460
Arg Tyr Leu Asp Asn Gly Leu Cys Ser Pro Asp Gly Glu Trp Arg Arg
465 470 475 480
Leu Gly Arg Pro Val Phe Pro Thr Ala Glu Gln Phe Arg Arg Met Arg
485 490 495
Ala Ala Glu Asp Pro Val Ala Ala Ala Pro Arg Pro Leu Pro Ala Gly
500 505 510
Gly Arg Leu Thr Leu Arg Pro Ala Leu Arg Leu Pro Ser Leu Leu Leu
515 520 525
Val His Val Cys Ala Arg Pro Glu Lys Pro Pro Gly Gln Val Thr Arg
530 535 540
Leu Arg Ala Leu Pro Leu Thr Gln Gly Gln Leu Val Leu Val Trp Ser
545 550 555 560
Asp Glu His Val Gly Ser Lys Cys Leu Trp Thr Tyr Glu Ile Gln Phe
565 570 575
Ser Gln Asp Gly Lys Ala Tyr Thr Pro Val Ser Arg Lys Pro Ser Thr
580 585 590
Phe Asn Leu Phe Val Phe Ser Pro Asp Thr Gly Ala Val Ser Gly Ser
595 600 605
Tyr Arg Val Arg Ala Leu Asp Tyr Trp Ala Arg Pro Gly Pro Phe Ser
610 615 620
Asp Pro Val Pro Tyr Leu Glu Val Pro Val Pro Arg Gly Pro Pro Ser
625 630 635 640
Pro Gly Asn Pro
<210> 11
<211> 653
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 11
Met Ala Ala Ala Ala Ile Pro Ala Leu Leu Leu Cys Leu Pro Leu Leu
1 5 10 15
Phe Leu Leu Phe Gly Trp Ser Arg Ala Ala Glu Ala Pro His Leu Val
20 25 30
Gln Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg
35 40 45
Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr
50 55 60
Val Leu Ser Trp Asp Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val
65 70 75 80
Pro His Arg Gly Ile Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu
85 90 95
Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr
100 105 110
His Leu Asp Gly Tyr Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro
115 120 125
Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu
130 135 140
Asp Lys Gln Gln Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala
145 150 155 160
Arg Arg Tyr Ile Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn
165 170 175
Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser
180 185 190
Met Thr Met Gln Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly
195 200 205
Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser
210 215 220
Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His
225 230 235 240
Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu
245 250 255
Asp Tyr Ile Ser Leu His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile
260 265 270
Leu Glu Gln Glu Lys Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro
275 280 285
Lys Phe Ala Asp Thr Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val
290 295 300
Gly Trp Ser Leu Pro Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala
305 310 315 320
Met Val Val Lys Val Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn
325 330 335
Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe
340 345 350
Leu Ser Tyr His Pro His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg
355 360 365
Phe Gln Val Asn Asn Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys
370 375 380
Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln
385 390 395 400
Leu Trp Ala Glu Val Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His
405 410 415
Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp
420 425 430
Ala Trp Arg Ala Ala Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala
435 440 445
His Pro Asn Arg Ser Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro
450 455 460
Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu
465 470 475 480
Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro
485 490 495
Thr Ala Glu Gln Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala
500 505 510
Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro
515 520 525
Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro
530 535 540
Glu Lys Pro Pro Gly Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr
545 550 555 560
Gln Gly Gln Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys
565 570 575
Cys Leu Trp Thr Tyr Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr
580 585 590
Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser
595 600 605
Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp
610 615 620
Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu
625 630 635 640
Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro
645 650
<210> 12
<211> 648
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 12
Met Tyr Arg Met Gln Leu Leu Ser Cys Ile Ala Leu Ser Leu Ala Leu
1 5 10 15
Val Thr Asn Ser Ala Glu Ala Pro His Leu Val Gln Val Asp Ala Ala
20 25 30
Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg Ser Thr Gly Phe Cys
35 40 45
Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr Val Leu Ser Trp Asp
50 55 60
Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val Pro His Arg Gly Ile
65 70 75 80
Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu Val Thr Thr Arg Gly
85 90 95
Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr His Leu Asp Gly Tyr
100 105 110
Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro Gly Phe Glu Leu Met
115 120 125
Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu Asp Lys Gln Gln Val
130 135 140
Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala Arg Arg Tyr Ile Gly
145 150 155 160
Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn Phe Glu Thr Trp Asn
165 170 175
Glu Pro Asp His His Asp Phe Asp Asn Val Ser Met Thr Met Gln Gly
180 185 190
Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly Leu Arg Ala Ala Ser
195 200 205
Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser Phe His Thr Pro Pro
210 215 220
Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His Cys His Asp Gly Thr
225 230 235 240
Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu Asp Tyr Ile Ser Leu
245 250 255
His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile Leu Glu Gln Glu Lys
260 265 270
Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro Lys Phe Ala Asp Thr
275 280 285
Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val Gly Trp Ser Leu Pro
290 295 300
Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala Met Val Val Lys Val
305 310 315 320
Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn Thr Thr Ser Ala Phe
325 330 335
Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe Leu Ser Tyr His Pro
340 345 350
His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg Phe Gln Val Asn Asn
355 360 365
Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys Pro Val Leu Thr Ala
370 375 380
Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln Leu Trp Ala Glu Val
385 390 395 400
Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His Thr Val Gly Val Leu
405 410 415
Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp Ala Trp Arg Ala Ala
420 425 430
Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala His Pro Asn Arg Ser
435 440 445
Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro Pro Gly Pro Gly Leu
450 455 460
Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu Cys Ser Pro Asp Gly
465 470 475 480
Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro Thr Ala Glu Gln Phe
485 490 495
Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala Ala Ala Pro Arg Pro
500 505 510
Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro Ala Leu Arg Leu Pro
515 520 525
Ser Leu Leu Leu Val His Val Cys Ala Arg Pro Glu Lys Pro Pro Gly
530 535 540
Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr Gln Gly Gln Leu Val
545 550 555 560
Leu Val Trp Ser Asp Glu His Val Gly Ser Lys Cys Leu Trp Thr Tyr
565 570 575
Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr Thr Pro Val Ser Arg
580 585 590
Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser Pro Asp Thr Gly Ala
595 600 605
Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp Tyr Trp Ala Arg Pro
610 615 620
Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu Val Pro Val Pro Arg
625 630 635 640
Gly Pro Pro Ser Pro Gly Asn Pro
645
<210> 13
<211> 648
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 13
Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15
Gly Ser Thr Gly Ala Glu Ala Pro His Leu Val Gln Val Asp Ala Ala
20 25 30
Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg Ser Thr Gly Phe Cys
35 40 45
Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr Val Leu Ser Trp Asp
50 55 60
Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val Pro His Arg Gly Ile
65 70 75 80
Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu Val Thr Thr Arg Gly
85 90 95
Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr His Leu Asp Gly Tyr
100 105 110
Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro Gly Phe Glu Leu Met
115 120 125
Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu Asp Lys Gln Gln Val
130 135 140
Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala Arg Arg Tyr Ile Gly
145 150 155 160
Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn Phe Glu Thr Trp Asn
165 170 175
Glu Pro Asp His His Asp Phe Asp Asn Val Ser Met Thr Met Gln Gly
180 185 190
Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly Leu Arg Ala Ala Ser
195 200 205
Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser Phe His Thr Pro Pro
210 215 220
Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His Cys His Asp Gly Thr
225 230 235 240
Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu Asp Tyr Ile Ser Leu
245 250 255
His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile Leu Glu Gln Glu Lys
260 265 270
Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro Lys Phe Ala Asp Thr
275 280 285
Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val Gly Trp Ser Leu Pro
290 295 300
Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala Met Val Val Lys Val
305 310 315 320
Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn Thr Thr Ser Ala Phe
325 330 335
Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe Leu Ser Tyr His Pro
340 345 350
His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg Phe Gln Val Asn Asn
355 360 365
Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys Pro Val Leu Thr Ala
370 375 380
Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln Leu Trp Ala Glu Val
385 390 395 400
Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His Thr Val Gly Val Leu
405 410 415
Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp Ala Trp Arg Ala Ala
420 425 430
Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala His Pro Asn Arg Ser
435 440 445
Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro Pro Gly Pro Gly Leu
450 455 460
Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu Cys Ser Pro Asp Gly
465 470 475 480
Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro Thr Ala Glu Gln Phe
485 490 495
Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala Ala Ala Pro Arg Pro
500 505 510
Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro Ala Leu Arg Leu Pro
515 520 525
Ser Leu Leu Leu Val His Val Cys Ala Arg Pro Glu Lys Pro Pro Gly
530 535 540
Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr Gln Gly Gln Leu Val
545 550 555 560
Leu Val Trp Ser Asp Glu His Val Gly Ser Lys Cys Leu Trp Thr Tyr
565 570 575
Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr Thr Pro Val Ser Arg
580 585 590
Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser Pro Asp Thr Gly Ala
595 600 605
Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp Tyr Trp Ala Arg Pro
610 615 620
Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu Val Pro Val Pro Arg
625 630 635 640
Gly Pro Pro Ser Pro Gly Asn Pro
645
<210> 14
<211> 1959
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 14
atgcgtcccc tgcgcccccg cgccgcgctg ctggcgctcc tggcctcgct cctggccgcg 60
cccccggtgg ccccggctga agctcctcac ctggtccagg tcgatgccgc aagagcactg 120
tggcctctgc gcaggttctg gagaagcacc ggattctgtc cacctctgcc tcattcacag 180
gccgaccagt acgtcctgtc ctgggatcag cagctgaacc tggcctacgt tggtgctgtg 240
ccacacagag gcatcaagca agttcgcaca cattggctgc tggagctggt tacaacaaga 300
ggctcaacag gaagaggact gtcctataac ttcacacacc tggacggcta tctcgacctc 360
ctccgggaga atcagctgct cccagggttt gagctgatgg gatcagcaag cggacacttt 420
actgacttcg aggacaagca acaagtgttt gagtggaagg atctcgtcag ctctctggca 480
agaaggtata tcggacggta tggcctggca cacgtgagca agtggaattt cgaaacatgg 540
aacgagccag accatcacga tttcgacaac gttagcatga ccatgcaggg atttctcaac 600
tactacgatg cctgctcaga gggcctgagg gcagcttctc cagccctgcg cctgggtggt 660
cctggcgact ctttccacac tccacccaga agccctctgt catgggggct cctcagacat 720
tgtcacgacg gcaccaattt cttcactggc gaggctggtg ttcgcctcga ttacattagt 780
ctccatcgca aaggcgcaag gtctagtata tcaatcctgg agcaggagaa agtggtggcc 840
cagcagatca ggcagctgtt ccctaagttc gccgatactc ccatatacaa tgacgaagcc 900
gatccactgg tcggttggtc cctgcctcag ccctggagag ccgatgtgac ttacgctgca 960
atggtcgtga aagtgatcgc ccagcaccag aacctgctgc tggcaaacac tacttctgca 1020
ttcccttacg cactcctgag caacgataac gctttcctca gctaccaccc tcaccctttc 1080
gcccaacgga ccctgacagc tcgctttcag gtgaataaca ctaggccacc tcatgtgcag 1140
ctgctgagaa agcccgtgct cacagcaatg ggtctgctgg ctctcctgga cgaagagcaa 1200
ctgtgggctg aggtgtccca ggctggcacc gttctggata gtaatcacac agtcggcgtg 1260
ctcgcctccg cacatcggcc tcaaggccct gctgatgcct ggagagccgc tgttctcatc 1320
tatgctagtg acgacacacg ggcacatcca aacaggtccg tggccgtcac tctgcgcctg 1380
agaggagtgc caccaggacc agggctcgtc tatgttacaa gatacctgga taacggactg 1440
tgtagcccag acggagaatg gcgcaggctg ggtcgcccag tctttcctac agcagagcag 1500
ttcaggagaa tgagagcagc tgaggaccct gtcgcagcag ctcccagacc actgcctgcc 1560
ggaggtcggc tgacactcag gccagccctc agactgcctt ctctcctcct cgtgcacgtg 1620
tgtgctcggc ctgagaaacc tccaggtcaa gtcactaggc tgagggcact ccctctgacc 1680
cagggtcagc tggtgctggt ttggtcagat gagcacgttg ggagcaagtg tctgtggact 1740
tacgagattc aattctcaca ggacgggaag gcttatacac cagttagccg caaaccaagc 1800
actttcaacc tctttgtgtt tagtcctgac acaggtgcag tctccggctc ttacagggtt 1860
cgggctctgg actattgggc cagacctgga cctttctcag acccagtgcc ctacctggaa 1920
gtgccagtcc ctagaggccc accttctcct ggaaatccc 1959
<210> 15
<211> 653
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 15
Met Arg Pro Leu Arg Pro Arg Ala Ala Leu Leu Ala Leu Leu Ala Ser
1 5 10 15
Leu Leu Ala Ala Pro Pro Val Ala Pro Ala Glu Ala Pro His Leu Val
20 25 30
Gln Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg
35 40 45
Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gln Ala Asp Gln Tyr
50 55 60
Val Leu Ser Trp Asp Gln Gln Leu Asn Leu Ala Tyr Val Gly Ala Val
65 70 75 80
Pro His Arg Gly Ile Lys Gln Val Arg Thr His Trp Leu Leu Glu Leu
85 90 95
Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr
100 105 110
His Leu Asp Gly Tyr Leu Asp Leu Leu Arg Glu Asn Gln Leu Leu Pro
115 120 125
Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu
130 135 140
Asp Lys Gln Gln Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala
145 150 155 160
Arg Arg Tyr Ile Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn
165 170 175
Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser
180 185 190
Met Thr Met Gln Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly
195 200 205
Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser
210 215 220
Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His
225 230 235 240
Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu
245 250 255
Asp Tyr Ile Ser Leu His Arg Lys Gly Ala Arg Ser Ser Ile Ser Ile
260 265 270
Leu Glu Gln Glu Lys Val Val Ala Gln Gln Ile Arg Gln Leu Phe Pro
275 280 285
Lys Phe Ala Asp Thr Pro Ile Tyr Asn Asp Glu Ala Asp Pro Leu Val
290 295 300
Gly Trp Ser Leu Pro Gln Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala
305 310 315 320
Met Val Val Lys Val Ile Ala Gln His Gln Asn Leu Leu Leu Ala Asn
325 330 335
Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe
340 345 350
Leu Ser Tyr His Pro His Pro Phe Ala Gln Arg Thr Leu Thr Ala Arg
355 360 365
Phe Gln Val Asn Asn Thr Arg Pro Pro His Val Gln Leu Leu Arg Lys
370 375 380
Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gln
385 390 395 400
Leu Trp Ala Glu Val Ser Gln Ala Gly Thr Val Leu Asp Ser Asn His
405 410 415
Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gln Gly Pro Ala Asp
420 425 430
Ala Trp Arg Ala Ala Val Leu Ile Tyr Ala Ser Asp Asp Thr Arg Ala
435 440 445
His Pro Asn Arg Ser Val Ala Val Thr Leu Arg Leu Arg Gly Val Pro
450 455 460
Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu
465 470 475 480
Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro
485 490 495
Thr Ala Glu Gln Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala
500 505 510
Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro
515 520 525
Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro
530 535 540
Glu Lys Pro Pro Gly Gln Val Thr Arg Leu Arg Ala Leu Pro Leu Thr
545 550 555 560
Gln Gly Gln Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys
565 570 575
Cys Leu Trp Thr Tyr Glu Ile Gln Phe Ser Gln Asp Gly Lys Ala Tyr
580 585 590
Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser
595 600 605
Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp
610 615 620
Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu
625 630 635 640
Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro
645 650
<210> 16
<211> 4923
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 16
aggcgtctaa ccagtcacag tcgcaaggta ggctgagcac cgtggcgggc ggcagcgggt 60
ggcggtcggg gttgtttctg gcggaggtgc tgctgatgat gtaattaaag taggcggtct 120
tgagacggcg gatggtcgag gtgaggtgtg gcaggcttga gatccagctg ttggggtgag 180
tactccctct caaaagcggg cattacttct gcgctaagat tgtcagtttc caaaaacgag 240
gaggatttga tattcacctg gcccgatctg gccatacact tgagtgacaa tgacatccac 300
tttgcctttc tctccacagg tgtccactcc caggtccaag tttaaacgga tctctagcga 360
attcgcggcc gccaccatgc cgctgctgct actgctgccc ctgctgtggg caggggccct 420
cgctgtggag gttgactcag ctcccggtac agacctggca aagattctgt cagatatgcg 480
ctcccagtat gaggttatgg cagagcagaa tcggaaagac gcagaggcat ggtttacttc 540
cagaacagaa gagctgaata gggaagtggc agggcataca gagcagctcc agatgagtag 600
gtcagaagtc acagacctga gaagaactct gcaagggctg gaaatcgaac tccagagcca 660
actgagcatg aaggcagctc tggaggacac actcgccgag actgaggcta ggtttggtgc 720
ccagctcgct cacattcagg ctctgataag tggtattgaa gcacagctcg gagatgtgag 780
agctgattct gaacgccaga atcaggaata ccagcggctg atggacatca agtctaggct 840
cgaacaagag atcgcaacct accggagcct cctggaaggg caagaagatc actataacaa 900
tctgtccgcc agtaaggtcc tgcaccatca ccatcaccat tgaaagcttg gatcccccga 960
cctcgacctc tggctaataa aggaaattta ttttcattgc aatagtgtgt tggaattttt 1020
tgtgtctctc actcggaagg acatatggga gggcaaatca tttggtcgag atccccggga 1080
tctctagcta gaggatcgat ccccgccccg gacgaactaa acctgactac gacatctctg 1140
ccccttcttc gcggggcagt gcatgtaatc ccttcagttg gttggtacaa cttgccaact 1200
gaaccctaaa cgggtagcat atgcttcccg ggtagtagta tatactatcc agactaaccc 1260
taattcaata gcatatgtta cccaacggga agcatatgct atcgaattag ggttagtaaa 1320
agggtcctaa ggaacagcga tgtaggtggg cgggccaaga taggggcgcg attgctgcga 1380
tctggaggac aaattacaca cacttgcgcc tgagcgccaa gcacagggtt gttggtcctc 1440
atattcacga ggtcgctgag agcacggtgg gctaatgttg ccaagggtag catatactac 1500
ccaaatatct ggatagcata tgctatccta atctatatct gggtagcata ggctatccta 1560
atctatatct gggtagcata tgctatccta atctatatct gggtagtata tgctatccta 1620
atttatatct gggtagcata ggctatccta atctatatct gggtagcata tgctatccta 1680
atctatatct gggtagtata tgctatccta atctgtatcc gggtagcata tgctatccta 1740
atagagatta gggtagtata tgctatccta atttatatct gggtagcata tactacccaa 1800
atatctggat agcatatgct atcctaatct atatctgggt agcatatgct atcctaatct 1860
atatctgggt agcataggct atcctaatct atatctgggt agcatatgct atcctaatct 1920
atatctgggt agtatatgct atcctaattt atatctgggt agcataggct atcctaatct 1980
atatctgggt agcatatgct atcctaatct atatctgggt agtatatgct atcctaatct 2040
gtatccgggt agcatatgct atcctcatga taagctgtca aacatgagaa ttaattcttg 2100
aagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt 2160
ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt 2220
tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca 2280
ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt 2340
ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga 2400
tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa 2460
gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct 2520
gctatgtggc gcggtattat cccgtgttga cgccgggcaa gagcaactcg gtcgccgcat 2580
acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga 2640
tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc 2700
caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat 2760
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa 2820
cgacgagcgt gacaccacga tgcctgcagc aatggcaaca acgttgcgca aactattaac 2880
tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa 2940
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc 3000
tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc 3060
ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag 3120
acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta 3180
ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa 3240
gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc 3300
gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat 3360
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga 3420
gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt 3480
tcttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata 3540
cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac 3600
cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg 3660
ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg 3720
tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag 3780
cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 3840
ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 3900
aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 3960
ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 4020
tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 4080
gtcagtgagc gaggaagcgt acatttatat tggctcatgt ccaatatgcc cgggatgttg 4140
acattgatta ttgactagtt attaatagta atcaattacg gggtcattag ttcatagccc 4200
atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa 4260
cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac 4320
tttccattga cgtcaatggg tggagtattt acggtaaact gcccacttgg cagtacatca 4380
agtgtatcat atgccaagtc cgccccctat tgacgtcaat gacggtaaat ggcccgcctg 4440
gcattatgcc cagtacatga ccttacggga ctttcctact tggcagtaca tctacgtatt 4500
agtcatcgct attaccaagg tgatgcggtt ttggcagtac accaatgggc gtggatagcg 4560
gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg 4620
gcaccaaaat caacgggact ttccaaaatg tcgtaataac cccgccccgt tgacgcaaat 4680
gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctcgtttag tgaaccgtca 4740
gatcctcact ctcttccgca tcgctgtctg cgagggccag ctgttgggct cgcggttgag 4800
gacaaactct tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 4860
gtactccgcc accgagggac ctgagcgagt ccgcatcgac cggatcggaa aacctctcga 4920
gaa 4923
<210> 17
<211> 13
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 17
Claims (20)
1. An isolated nucleotide sequence comprising a signal peptide sequence and a nucleotide sequence encoding a-L-iduronidase, wherein the signal peptide sequence is not the native signal peptide sequence of a-L-iduronidase.
2. The nucleotide sequence of claim 1, wherein the signal peptide is selected from the group consisting of SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. 3 and SEQ ID No. 4.
3. The nucleotide sequence of claim 1, wherein the nucleotide sequence encoding a-L-iduronidase is depicted in SEQ ID No. 9.
4. An isolated nucleotide sequence selected from the group consisting of:
1) SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7 or SEQ ID No. 8;
2) a nucleotide sequence having at least more than 80% homology with SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7 or SEQ ID No. 8;
3) the codon for the polypeptide is degenerate with the coding part of the nucleotide sequence in 1) or 2) and codes for a nucleotide sequence of a-L-iduronidase.
5. A recombinant a-L-iduronidase comprising an a-L-iduronidase and a heterologous signal peptide, said heterologous signal peptide capable of being cleaved at a single site.
6. A recombinant a-L-iduronidase having an amino acid sequence selected from the group consisting of:
1) SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12 or SEQ ID No. 13;
2) an amino acid sequence having at least 95% homology with SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12 or SEQ ID No. 13.
7. The recombinant a-L-iduronidase according to claim 5 having glycosylation modifications at positions 311, 390, and 426.
8. The recombinant a-L-iduronidase according to claim 6, wherein said glycosylation modification is GlcNAc (2) Man (7) P (2).
9. The recombinant a-L-iduronidase according to claim 5, wherein cellular internalization of said recombinant a-L-iduronidase is mediated by M6 PR.
10. A process for the preparation of a recombinant a-L-iduronidase comprising eukaryotic expression of the isolated nucleotide sequence of any of claims 1-4 or eukaryotic expression of the recombinant a-L-iduronidase of any of claims 5-9.
11. The method of claim 10, wherein the isolated nucleotide sequence is preceded by a KOZARK sequence, and the resulting construct is constructed into a eukaryotic expression vector, which is then transferred into a eukaryotic expression system for expression.
12. The method of claim 11, wherein the KOZARK sequence is GCCGCCACCATGC.
13. The method of claim 10, wherein the eukaryotic expression is in a mammalian cell.
14. The method of claim 13, wherein the mammalian cell is CHOK1SV GS-KO.
15. The method of claim 10, comprising: constructing a recombinant eukaryotic expression vector containing the isolated nucleotide sequence; transferring the recombinant eukaryotic expression plasmid into an eukaryotic expression system for eukaryotic expression; collecting the supernatant, and purifying to obtain the alpha-L-iduronidase.
16. The method of claim 10, wherein the purifying comprises one or more of affinity chromatography, hydrophobic interaction chromatography, and ion exchange chromatography.
17. The method of claim 16, wherein the affinity chromatography is a linear gradient elution using agarose gel to obtain an eluate containing the protein of interest.
18. The method of claim 16, wherein the hydrophobic interaction chromatography employs a hydrophobic chromatography column for removing hetero-proteins, multimers; before sample loading, 4M NaCl is added to reach the final concentration of 2M, the pH value is adjusted to 5-6 by using 1M NaOH, and the target protein is isocratically eluted by using an elution buffer solution.
19. The method of claim 16, wherein the ion exchange chromatography uses a salt tolerant cation column, and after sample loading and equilibration, the target protein is eluted with an elution buffer in a linear gradient of 0-70%. Wherein the washing buffer is 20mM PB 1M NaCl pH5.5.
20. The method of claim 16, wherein the purified solution is concentrated by ultrafiltration using a 30kDa membrane to phosphate buffer containing 0.0001% Tween 80 at pH 5.5.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110306525.8A CN115109790A (en) | 2021-03-23 | 2021-03-23 | Recombinant a-L-iduronate prase and preparation method thereof |
PCT/CN2022/073330 WO2022199243A1 (en) | 2021-03-23 | 2022-01-22 | Recombinant alpha-l-iduronidase and preparation method therefor |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110306525.8A CN115109790A (en) | 2021-03-23 | 2021-03-23 | Recombinant a-L-iduronate prase and preparation method thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115109790A true CN115109790A (en) | 2022-09-27 |
Family
ID=83323277
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110306525.8A Pending CN115109790A (en) | 2021-03-23 | 2021-03-23 | Recombinant a-L-iduronate prase and preparation method thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115109790A (en) |
WO (1) | WO2022199243A1 (en) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR9910323A (en) * | 1998-05-13 | 2001-01-30 | Harbor Ucla | Recombinant alpha-l-iduronidase, methods for its production and purification and methods for treating diseases caused by its deficiency |
US6426208B1 (en) * | 1999-11-12 | 2002-07-30 | Harbor-Ucla Research And Education Institute | Recombinant α-L-iduronidase, methods for producing and purifying the same and methods for treating diseases caused by deficiencies thereof |
US20220339296A1 (en) * | 2016-12-06 | 2022-10-27 | Bluebird Bio, Inc. | Gene therapy for mucopolysaccharidosis, type i |
-
2021
- 2021-03-23 CN CN202110306525.8A patent/CN115109790A/en active Pending
-
2022
- 2022-01-22 WO PCT/CN2022/073330 patent/WO2022199243A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2022199243A1 (en) | 2022-09-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102421895B (en) | Chrysosporium lucknowense protein production systems | |
KR101504969B1 (en) | A process for concentration of a polypeptide | |
CN112501101B (en) | High-yield strain of natural herbicide thaxtomins as well as preparation method and application thereof | |
CN108977454A (en) | A kind of Escherichia coli synthesis squalene plasmid pTsqs and its preparation and application | |
CN101410408A (en) | A process for concentration of a polypeptide | |
KR101856260B1 (en) | Process for production and purification of recombinant lysosomal alpha-mannosidase | |
CN108949599A (en) | A kind of production alpha, beta-lonone genetic engineering bacterium and its construction method and application | |
CN111187785A (en) | Cloning expression and application of serpentis grass tryptophan decarboxylase gene OpTDC2 | |
CN115109790A (en) | Recombinant a-L-iduronate prase and preparation method thereof | |
CN112980819A (en) | Construction method and application of retinitis pigmentosa animal model | |
CN106520820B (en) | A method of plasma pro-brain natriuretic peptide levels epitope is prepared using bacillus brevis | |
CN110628806A (en) | High-yield beta-ionone genetic engineering bacterium and construction method and application thereof | |
CN109929798B (en) | Method for inducing epithelial-mesenchymal transition of cells and method for screening iron death-inducing substance | |
CN101948866A (en) | Construction of bacillus subtilis integrated vector capable of being cloned by using homologous recombination and application thereof | |
CN114196712B (en) | Method for producing L-ornithine by immobilized enzyme method | |
CN114350698B (en) | Human recombinant arginase I production strain and construction method thereof | |
CN114214353B (en) | Method for producing human recombinant arginase I by fermentation | |
CN114350721B (en) | Method for producing L-ornithine by microbial enzyme method | |
KR20130087878A (en) | Expression vector for human erythropoietin and process for production of erythropoietin using thereof | |
AU2013202948B2 (en) | A process for concentration of a polypeptide | |
CN114085861B (en) | Arginine decarboxylase producing strain and construction method thereof | |
KR101411456B1 (en) | Soluble overexpression and purification method of biological active recombinant human BMP7 in Escherichia coli | |
CN108949795B (en) | Plasmid pCDSP for increasing squalene content in escherichia coli and preparation and use methods thereof | |
CN115029378B (en) | Method for creating flower-spot ornamental poplar by PtrDJ1C gene | |
KR101728600B1 (en) | - -xylosidase expression system for producing xylobiose |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |