CN113528482B - High-activity heparin skeleton synthase PmHS2 mutant and application thereof - Google Patents
High-activity heparin skeleton synthase PmHS2 mutant and application thereof Download PDFInfo
- Publication number
- CN113528482B CN113528482B CN202110881483.0A CN202110881483A CN113528482B CN 113528482 B CN113528482 B CN 113528482B CN 202110881483 A CN202110881483 A CN 202110881483A CN 113528482 B CN113528482 B CN 113528482B
- Authority
- CN
- China
- Prior art keywords
- leu
- ile
- lys
- asn
- asp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 229920000669 heparin Polymers 0.000 title claims abstract description 115
- 229960002897 heparin Drugs 0.000 title claims abstract description 115
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical group OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 title claims abstract description 111
- 230000000694 effects Effects 0.000 title claims abstract description 50
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 31
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims abstract description 14
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims abstract description 11
- 239000004473 Threonine Substances 0.000 claims abstract description 11
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims abstract description 8
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims abstract description 8
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims abstract description 5
- 229960000310 isoleucine Drugs 0.000 claims abstract description 5
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims abstract description 5
- 150000001413 amino acids Chemical group 0.000 claims description 30
- 238000003786 synthesis reaction Methods 0.000 claims description 20
- 239000000758 substrate Substances 0.000 claims description 18
- 230000015572 biosynthetic process Effects 0.000 claims description 17
- 238000006243 chemical reaction Methods 0.000 claims description 15
- 239000002773 nucleotide Substances 0.000 claims description 9
- 125000003729 nucleotide group Chemical group 0.000 claims description 9
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 claims description 7
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 claims description 7
- 239000013598 vector Substances 0.000 claims description 7
- 239000013600 plasmid vector Substances 0.000 claims description 5
- HDYANYHVCAPMJV-UHFFFAOYSA-N Uridine diphospho-D-glucuronic acid Natural products O1C(N2C(NC(=O)C=C2)=O)C(O)C(O)C1COP(O)(=O)OP(O)(=O)OC1OC(C(O)=O)C(O)C(O)C1O HDYANYHVCAPMJV-UHFFFAOYSA-N 0.000 claims description 4
- WMWKTCPGFOEPBD-YGIWDPDDSA-N azane;(2s,3s,4s,5r,6r)-6-[[[(2r,3s,4r,5r)-5-(2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound N.C([C@@H]1[C@H]([C@H]([C@@H](O1)N1C(NC(=O)C=C1)=O)O)O)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O WMWKTCPGFOEPBD-YGIWDPDDSA-N 0.000 claims description 4
- 230000001131 transforming effect Effects 0.000 claims description 3
- 238000006555 catalytic reaction Methods 0.000 claims description 2
- 102000004190 Enzymes Human genes 0.000 abstract description 18
- 108090000790 Enzymes Proteins 0.000 abstract description 18
- 102000004169 proteins and genes Human genes 0.000 abstract description 13
- 230000035772 mutation Effects 0.000 abstract 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 24
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 18
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 18
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 14
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 12
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 12
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 12
- 102000004357 Transferases Human genes 0.000 description 12
- 108090000992 Transferases Proteins 0.000 description 12
- AEMOLEFTQBMNLQ-UHFFFAOYSA-N beta-D-galactopyranuronic acid Natural products OC1OC(C(O)=O)C(O)C(O)C1O AEMOLEFTQBMNLQ-UHFFFAOYSA-N 0.000 description 12
- 108010049041 glutamylalanine Proteins 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 238000000034 method Methods 0.000 description 12
- 108010090473 UDP-N-acetylglucosamine-peptide beta-N-acetylglucosaminyltransferase Proteins 0.000 description 10
- 238000001514 detection method Methods 0.000 description 10
- 229920002683 Glycosaminoglycan Polymers 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 9
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 6
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 6
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 6
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 6
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 6
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 6
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 6
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 6
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 6
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 6
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 6
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 6
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 6
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 6
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 6
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 6
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 6
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 6
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 6
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 6
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 6
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 6
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 6
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 6
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 6
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 6
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 6
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 6
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 6
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 6
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 6
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 6
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 6
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 6
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 6
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 6
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 6
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 6
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 6
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 6
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 6
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 6
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 6
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 6
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 6
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 6
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 6
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 6
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 6
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 6
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 6
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 6
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 6
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 6
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 6
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 6
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 6
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 6
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 6
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 6
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 6
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 6
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 6
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 6
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 6
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 6
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 6
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 6
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 6
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 6
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 6
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 6
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 6
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 6
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 6
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 6
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 6
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 6
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 6
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 6
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 6
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 6
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 6
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 6
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 6
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 6
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 6
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 6
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 6
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 6
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 6
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 6
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 6
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 6
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 6
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 6
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 6
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 6
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 6
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 6
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 6
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 6
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 6
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 6
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 6
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 6
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 6
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 6
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 6
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 6
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 6
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 6
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 6
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 6
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 6
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 6
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 6
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 6
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 6
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 6
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 6
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 6
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 6
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 6
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 6
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 6
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 6
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 6
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 6
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 6
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 6
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 6
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 6
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 6
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 6
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 6
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 6
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 6
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 6
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 6
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 6
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 6
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 6
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 6
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 6
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 6
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 6
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 6
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 6
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 6
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 6
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 6
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 6
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 6
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 6
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 6
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 6
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 6
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 6
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 6
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 6
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 6
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 6
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 6
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 6
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 6
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 6
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 6
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010079547 glutamylmethionine Proteins 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- 108010000998 wheylin-2 peptide Proteins 0.000 description 6
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 5
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 5
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 5
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 5
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 5
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 4
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 4
- -1 heparin oligosaccharide Chemical class 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- AEMOLEFTQBMNLQ-AQKNRBDQSA-N D-glucopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-AQKNRBDQSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 229950006780 n-acetylglucosamine Drugs 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 229920001542 oligosaccharide Polymers 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 235000001014 amino acid Nutrition 0.000 description 2
- 229940024606 amino acid Drugs 0.000 description 2
- 238000009835 boiling Methods 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 150000002016 disaccharides Chemical group 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 229940097043 glucuronic acid Drugs 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000007791 liquid phase Substances 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000011664 nicotinic acid Substances 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000012827 research and development Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 1
- IAJILQKETJEXLJ-KLVWXMOXSA-N (2s,3r,4r,5r)-2,3,4,5-tetrahydroxy-6-oxohexanoic acid Chemical compound O=C[C@H](O)[C@H](O)[C@@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-KLVWXMOXSA-N 0.000 description 1
- RVDBHQMAUIHAHC-OFKSJOQMSA-N (2s,3s,4s,5r,6r)-3,4,5,6-tetrahydroxy-6-(4-nitrophenyl)oxane-2-carboxylic acid Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](C(O)=O)O[C@]1(O)C1=CC=C([N+]([O-])=O)C=C1 RVDBHQMAUIHAHC-OFKSJOQMSA-N 0.000 description 1
- MSWZFWKMSRAUBD-UHFFFAOYSA-N 2-Amino-2-Deoxy-Hexose Chemical compound NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 description 1
- SQDAZGGFXASXDW-UHFFFAOYSA-N 5-bromo-2-(trifluoromethoxy)pyridine Chemical compound FC(F)(F)OC1=CC=C(Br)C=N1 SQDAZGGFXASXDW-UHFFFAOYSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- 206010002383 Angina Pectoris Diseases 0.000 description 1
- 102000004411 Antithrombin III Human genes 0.000 description 1
- 108090000935 Antithrombin III Proteins 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 229920001287 Chondroitin sulfate Polymers 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 229920000045 Dermatan sulfate Polymers 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 229920002971 Heparan sulfate Polymers 0.000 description 1
- 229920000288 Keratan sulfate Polymers 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- DUKURNFHYQXCJG-UHFFFAOYSA-N Lewis A pentasaccharide Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)OC1CO DUKURNFHYQXCJG-UHFFFAOYSA-N 0.000 description 1
- 206010029164 Nephrotic syndrome Diseases 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 229940122055 Serine protease inhibitor Drugs 0.000 description 1
- 101710102218 Serine protease inhibitor Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- 229920001284 acidic polysaccharide Polymers 0.000 description 1
- 150000004805 acidic polysaccharides Chemical class 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002785 anti-thrombosis Effects 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 229960005348 antithrombin iii Drugs 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000012295 chemical reaction liquid Substances 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229940059329 chondroitin sulfate Drugs 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- AVJBPWGFOQAPRH-FWMKGIEWSA-L dermatan sulfate Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@H](OS([O-])(=O)=O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](C([O-])=O)O1 AVJBPWGFOQAPRH-FWMKGIEWSA-L 0.000 description 1
- 229940051593 dermatan sulfate Drugs 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 239000006167 equilibration buffer Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 150000002402 hexoses Chemical class 0.000 description 1
- 229920002674 hyaluronan Polymers 0.000 description 1
- 229960003160 hyaluronic acid Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- KXCLCNHUUKTANI-RBIYJLQWSA-N keratan Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@H](COS(O)(=O)=O)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](O[C@@H](O[C@H]3[C@H]([C@@H](COS(O)(=O)=O)O[C@@H](O)[C@@H]3O)O)[C@H](NC(C)=O)[C@H]2O)COS(O)(=O)=O)O[C@H](COS(O)(=O)=O)[C@@H]1O KXCLCNHUUKTANI-RBIYJLQWSA-N 0.000 description 1
- 230000037356 lipid metabolism Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000003001 serine protease inhibitor Substances 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 238000000825 ultraviolet detection Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/18—Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/01—Hexosyltransferases (2.4.1)
- C12Y204/01017—Glucuronosyltransferase (2.4.1.17)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Abstract
The invention relates to a high-activity heparin skeleton synthase PmHS2 mutant and application thereof. The mutant is any one of position mutation of 130 th serine, 135 th serine, 197 th phenylalanine, 326 th threonine, 344 th threonine and 476 th isoleucine of the heparin skeleton synthase PmHS 2. The heparin skeleton synthase PmHS2 mutant provided by the invention has good stability, and can still keep more than 40% of enzyme activity after being placed at 37 ℃ for 72 hours, which is more than 2 times of the existing heparin skeleton synthase PmHS 2. In particular to a heparin skeleton synthase mutant PmHS2 (F197Y), a heparin skeleton synthase mutant PmHS2 (S135A) and a protein expressed by the heparin skeleton synthase mutant PmHS2 (T344S), which can still keep more than 80 percent of enzyme activity after being placed for 72 hours at 37 ℃ and is 5 to 7 times of the existing heparin skeleton synthase PmHS 2.
Description
Technical Field
The invention relates to a high-activity heparin skeleton synthase PmHS2 mutant and application thereof, belonging to the technical field of biology.
Background
Glycosaminoglycans (GAGs) are a complex biological macromolecule, a linear unbranched acidic polysaccharide formed by a regular arrangement of repeating disaccharide units consisting of hexuronic acid or hexose and hexosamine. Glycosaminoglycans are widely distributed and varied, and are classified into sulfate-free glycosaminoglycans and sulfate-modified glycosaminoglycans according to the degree of sulfation, with only one type of hyaluronic acid, whereas sulfate-modified glycosaminoglycans include dermatan sulfate/chondroitin sulfate, heparan sulfate/heparin, keratan sulfate, and the like.
Heparin (HP) is an important glycosaminoglycan, and is composed of D-beta-glucuronic acid (or L-alpha-iduronic acid) and N-acetylglucosamine to form a repeating disaccharide unit. It is named as it was first discovered from the liver that heparin has the activity of enhancing the natural serine protease inhibitor antithrombin iii, catalyzes the inhibition of all serine proteases of the intrinsic coagulation pathway, and thus has an anticoagulant antithrombotic effect, and can also mediate a variety of important physiological processes including cell adhesion, lipid metabolism and growth factor regulation by binding other proteins. Can also be used for treating angina pectoris, nephrotic syndrome, serious burn, rheumatoid arthritis, etc., and the demand is internationally in the forefront of biotechnology pharmaceutical field throughout the year.
Heparin sources are mainly classified into animal extraction, chemical synthesis and chemoenzymatic synthesis. Animal heparin is mainly extracted from small intestine mucous membranes of pigs and cattle, but the heparin product has different structure, configuration and molecular weight due to the difference of extraction methods and different numbers of basic units of heparin from animal sources, which may cause heparin pollution. In addition, heparin has a complex structure, and when the sugar chain is prolonged by using a chemical synthesis method, the steps of protecting, deprotecting, activating, coupling and the like on an unstable group are considered, and the conditions of regioselectivity, stereoselectivity and the like are also considered, so that the difficulty is high and the yield is low. The chemical enzymatic synthesis of HP is a method combining a chemical method and an enzymatic method, and the enzymatic reaction condition required in the synthesis process is mild and the specificity is strong. Compared with chemical synthesis, the method has simple steps and high efficiency. The enzymes commonly used for synthesizing the heparin skeleton sugar chain by the chemical enzyme method at present comprise heparin skeleton synthase KfiA and heparin skeleton synthase PmHS2, wherein the heparin skeleton synthase PmHS2 is a bifunctional enzyme with N-acetylglucosamine transferase activity and glucuronic acid transferase activity, and the enzyme has wide application in the chemical enzyme method synthesis, and the improvement of the activity of the enzyme is helpful for the efficient synthesis of the heparin skeleton sugar chain.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a high-activity heparin skeleton synthase PmHS2 mutant and application thereof.
Description of the terminology:
GlcA-pNP: chinese is known as 4-nitrophenyl- β -D-glucuronic acid, which acts as an initial acceptor substrate for heparin oligosaccharide synthesis;
UDP-GlcNAc: chinese is called uridine diphosphate-N-acetamido glucose, which serves as a donor to provide acetylglucosamine for heparin oligosaccharide synthesis;
the technical scheme of the invention is as follows:
the mutant of the high-activity heparin skeleton synthase PmHS2 is any one of 130 th serine, 135 th serine, 197 th phenylalanine, 326 th threonine, 344 th threonine and 476 th isoleucine of the heparin skeleton synthase PmHS 2.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (S130T), the amino acid sequence of the mutant is shown as SEQ ID NO.2, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 1; the 130 th serine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into threonine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (S135A), the amino acid sequence of the mutant is shown as SEQ ID NO.4, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 3; the 135 th serine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into alanine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (F197Y), the amino acid sequence of the mutant is shown as SEQ ID NO.6, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 5; the 197 th phenylalanine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into lysine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (T326S), the amino acid sequence of the mutant is shown as SEQ ID NO.8, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 7; the 326 th threonine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into serine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (T344S), the amino acid sequence of the mutant is shown as SEQ ID NO.10, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 9; the 344 th threonine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into serine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (I476F), the amino acid sequence of which is shown as SEQ ID NO.12, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 11; the 476 th isoleucine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into phenylalanine.
A recombinant vector is obtained by inserting the coding gene of the heparin skeleton synthase PmHS2 mutant into a plasmid vector.
According to a preferred embodiment of the invention, the plasmid vector is pET-28a (+).
In the invention, the recombinant vector containing the PmHS2 mutant coding gene is synthesized by Nanjing Jinsrui company.
A recombinant strain is obtained by transforming the recombinant vector into a host cell.
Preferably, according to the invention, the host cell is E.coli; further preferably, the host cell is E.coli BL21 (DE 3).
The application of the heparin skeleton synthase mutant in the synthesis of heparin skeleton.
According to the invention, preferably, the application is to catalyze the reaction to generate a heparin skeleton with a structure of GlcNAc-GlcA-pNP by taking UDP-GlcNAc as a donor substrate and GlcA-pNP as an acceptor substrate; or the UDP-GlcA is used as a donor substrate, the GlcNAc-GlcA-pNP is used as an acceptor substrate, and the heparin skeleton with the structure of GlcA-GlcNAc-GlcA-pNP is generated by catalytic reaction.
Experimental procedures not described in detail in the present invention may be performed according to conventional experimental procedures in the art.
Advantageous effects
1. The invention is based on heparin skeleton synthase PmHS2, and selects 130 th, 135 th, 197 th, 326 th, 344 th and 476 th amino acids to carry out site-directed mutagenesis, mutates 130 th serine into threonine (S130T), mutates 135 th serine into alanine (S135A), mutates 197 th phenylalanine into lysine (F197Y), mutates 326 th threonine into serine (T326S), mutates 344 th threonine into serine (T344S), and mutates 476 th isoleucine into phenylalanine (I476F). Compared with the heparin skeleton synthase PmHS2, the heparin skeleton synthase PmHS2 mutant has very high GlcNAc transferase activity and GlcA transferase activity which are respectively 2-3.5 times and 1.5-3.5 times that of the heparin skeleton synthase PmHS2, and can effectively synthesize the GlcNAc-GlcA-pNP heparin skeleton by utilizing substrates UDP-GlcNAc and GlcA-pNP under the optimal condition; or the substrates UDP-GlcA and GlcNAc-GlcA-pNP are utilized to effectively synthesize the GlcA-GlcNAc-GlcA-pNP heparin skeleton, so that the GlcNAc transfer synthesis efficiency and GlcA transfer synthesis efficiency in heparin skeleton synthesis are improved, the application development of heparin bionic synthesis is greatly promoted, and a brand new page is opened for the research and development of glycosaminoglycan.
2. The heparin skeleton synthase PmHS2 mutant provided by the invention has good stability, and can still keep more than 40% of enzyme activity after being placed at 37 ℃ for 72 hours, which is more than 2 times of the existing heparin skeleton synthase PmHS 2. In particular to a heparin skeleton synthase mutant PmHS2 (F197Y), a heparin skeleton synthase mutant PmHS2 (S135A) and a protein expressed by the heparin skeleton synthase mutant PmHS2 (T344S), which can still keep more than 80 percent of enzyme activity after being placed for 72 hours at 37 ℃ and is 5 to 7 times of the existing heparin skeleton synthase PmHS 2.
Drawings
FIG. 1 is a diagram showing the SDS-PAGE detection result of a heparin skeleton synthase PmHS2 mutant for soluble expression of a target protein in recombinant escherichia coli;
FIG. 2 is a histogram of relative conversion of heparin skeleton synthase PmHS2 and its mutants;
FIG. 3 is a graph showing the stability of the heparin skeleton synthase PmHS2 mutant.
In the figure, the ordinate indicates the relative yield of the reaction product, and the abscissa indicates the time for which the protein was left at 37℃with the initial enzyme activity being 100%.
Detailed Description
The technical scheme of the present invention is further described below with reference to examples and drawings of the specification, but the scope of the present invention is not limited thereto. The technical means employed in the present invention are methods well known to those skilled in the art unless specifically stated.
The inventors obtained the amino acid sequence and nucleotide sequence information of heparin skeleton synthase PmHS2 (GenBank: AY 292200.1) from bioinformatics databases and then analyzed the highly conserved regions of the amino acid sequence using Jalview software. Simultaneously, a Swiss-Model tool is used for carrying out protein simulation modeling on the heparin skeleton synthase PmHS2, and a HotSpot Wizard 2.0 is used for predicting the active center of the enzyme. It was found that the 130, 135, 165, 197, 267, 296, 326, 344, 476 and 548 sites of the amino acid sequence of the heparin skeleton synthase PmHS2 are located near the active center and the highly conserved region, and that it is highly possible to increase the catalytic activity of PmHS2 by site-directed mutagenesis. Therefore, in combination with the dominant amino acids at these six positions for homology analysis, ten mutants of PmHS2 (S130T), pmHS2 (S135A), pmHS2 (D165Y), pmHS2 (F197Y), pmHS2 (F267L), pmHS2 (H296N), pmHS2 (T326S), pmHS2 (T344S), pmHS2 (I476F), and PmHS2 (S548A) were designed.
Wherein the coding gene of the mutant PmHS2 (S130T) is shown as SEQ ID NO.1, and the amino acid sequence is shown as SEQ ID NO. 2; the coding gene of the mutant PmHS2 (S135A) is shown as SEQ ID NO.3, and the amino acid sequence is shown as SEQ ID NO. 4; the coding gene of the mutant PmHS2 (F197Y) is shown as SEQ ID NO.5, and the amino acid sequence is shown as SEQ ID NO. 6; the coding gene of the mutant PmHS2 (T326S) is shown as SEQ ID NO.7, and the amino acid sequence is shown as SEQ ID NO. 8; the coding gene of the mutant PmHS2 (T344S) is shown as SEQ ID NO.9, and the amino acid sequence is shown as SEQ ID NO. 10; the coding gene of the mutant PmHS2 (I476F) is shown as SEQ ID NO.11, and the amino acid sequence is shown as SEQ ID NO. 12. The mutant PmHS2 (D165Y) is characterized in that aspartic acid at 165 th position is mutated into tyrosine; the mutant PmHS2 (F267L) is characterized in that 267 th phenylalanine is mutated into leucine; the mutant PmHS2 (H296N) is characterized in that histidine 296 is mutated into asparagine; the mutant PmHS2 (S548A) was mutated from serine at position 548 to alanine. The unprotected mutants herein will not show their sequence, but their sequence acquisition method is the same.
The substrate carbohydrate reagents used in the present invention are available from sigma and from the manufacturer unless otherwise indicated. All plasmids for PmHS2 mutants were commissioned by the company nanjing gold sry. The adopted HPLC detection method uses an amino column of YMC, the liquid phase system is the liquid phase of Shimadzu, and the ultraviolet detection system is SPD-20A. Detecting ultraviolet absorption of each component at 310nm and 254nm after the heparin skeleton synthase catalytic product is separated by a chromatographic column, wherein the HPLC mobile phase condition is shown in table 1:
TABLE 1 HPLC analysis method for heparin oligosaccharides detection
Example 1 preparation of heparin skeleton synthase PmHS2 mutant
(1) Construction of expression strains
Taking mutant PmHS2 (F197Y) as an example, transforming a recombinant vector pET28a-His-PmHS2 (F197Y) -Stop codon (synthesized by Nanjing Style company) containing a heparin skeleton synthase PmHS2 (F197Y) coding gene (SEQ ID NO. 5) into competent cells of escherichia coli BL21 (DE 3), constructing a recombinant strain, culturing the recombinant strain on LB plates of kanamycin (100 mug/mL) for 12 hours, screening transformants (carrying out negative control experiments), and activating to obtain PmHS2 (F197Y) positive transformants; the amino acid sequence of the heparin skeleton synthase is shown as SEQ ID NO. 6.
The remaining nine positive transformants of mutant PmHS2 (S130T), pmHS2 (S135A), pmHS2 (D165Y), pmHS2 (F267L), pmHS2 (H296N), pmHS2 (T326S), pmHS2 (T344S), pmHS2 (I476F), and PmHS2 (S548A) were obtained in the same manner as described above.
(2) Expression and purification of proteins
Taking mutant PmHS2 (F197Y) as an example, single colonies of mutant PmHS2 (F197Y) positive transformants were subjected to activation culture in 50mL of sterile LB liquid medium for 12h (37 ℃ C., 225 r/min). Then 1% volume of the activated bacteria solution was added to LB liquid medium containing kanamycin (100. Mu.g/mL), and shake-cultured at 37℃and 225r/min for about 3 hours to OD 600 IPTG (final concentration 0.2 mM) was added at 0.6-0.8, and induction was performed at 22℃and 225r/min for 16-18h. Then, the cells were collected by centrifugation at 8000rpm for 10min and resuspended in equilibration buffer, sonicated on ice (15 s working, 45s intermittent, 33% amplitude, 1500KJ energy, 4 ℃) for 20min, and the cells after disruption were centrifuged at 12000rpm for 40min (4 ℃) to leave the supernatant which was filtered through a 0.22 μm filter. Purification by histidine tag with nickel column: firstly, using a balance buffer solution to balance 5-10 column volumes of a nickel column, filtering a supernatant fluid to flow through the nickel column to finish a sample loading step, using the balance buffer solution to balance 3-5 column volumes after sample loading, then using a washing buffer solution to wash out impurity proteins, and finally using an elution buffer solutionEluting to obtain the target protein. The purified protein was stored in 20% glycerol and each tube was stored in a-80℃refrigerator.
Wherein, the components of the balance buffer are: 0.3M NaCl,50mM NaH 2 PO 4 ,pH=8;
The wash buffer had the following composition: 10mM imidazole, 0.3M NaCl,50mM NaH 2 PO 4 ,pH=8;
The elution buffer had the following composition: 250mM imidazole, 0.3M NaCl,50mM NaH 2 PO 4 ,pH=8。
The remaining nine mutant PmHS2 (S130T), pmHS2 (S135A), pmHS2 (D165Y), pmHS2 (F267L), pmHS2 (H296N), pmHS2 (T326S), pmHS2 (T344S), pmHS2 (I476F), pmHS2 (S548A) recombinant proteins were purified according to the same method as described above.
The expression condition of the purified recombinant protein is identified by polyacrylamide gel electrophoresis (SDS-PAGE) (shown in figure 1), and after the positive transformant is expressed in a recombinant mode, the protein is purified, the band is single, and the molecular weight of the target protein is consistent with the theoretical value (PmHS 2 is 75.39 kDa).
Example 2 Activity detection of heparin skeleton synthase PmHS2 mutant
The reaction was carried out using commercial GlcA-pNP (final concentration: 0.2 mM) as an acceptor substrate, UDP-GlcNAc (final concentration: 0.3 mM) as a donor substrate, and the heparin-skeleton-synthase PmHS2 mutant prepared in example 1 as a protease, with the reaction system shown in Table 2; the reaction was placed in a 37℃water bath for 30min, heated in boiling water for 5min to inactivate the protease and terminate the reaction, the reaction solution was filtered with a 0.22 μm filter membrane and subjected to HPLC detection as described in Table 1, the pNP group of the monosaccharide acceptor having a specific absorption at a detection wavelength of ultraviolet 310nm, and the mobile phase flow rate was 0.5mL/min.
TABLE 2 enzyme Activity verification reaction System for heparin skeleton synthase
The detection results are shown in fig. 2, wherein the conversion rate of PmHS2 is set to be 1, and the conversion rate of the mutant is reduced to be a multiple thereof.
As can be seen from FIG. 2, the heparin backbone synthase PmHS2 mutant has GlcNAc transferase activity and GlcA transferase activity, and can transfer GlcNAc groups to the non-reducing end of GlcA-pNP to produce heparin disaccharide GlcNAc-GlcA-pNP; the GlcA group can also be transferred to the non-reducing end of GlcNAc-GlcA-pNP to form the heparin-trisaccharide GlcA-GlcNAc-GlcA-pNP.
In addition, the GlcNAc transferase activity and GlcA transferase activity of each mutant were elevated to different extents.
Wherein, the activity of the GlcNAc transferase of the PmHS2 (S130T) is improved by 2.51 times of the activity of the original heparin skeleton synthase PmHS2, and the activity of the GlcA transferase is improved by 2.97 times of the activity of the original heparin skeleton synthase PmHS 2.
The GlcNAc transferase activity of PmHS2 (S135A) was increased by 3.31 times as much as that of the original heparin skeleton synthase PmHS2, and the GlcA transferase activity was increased by 1.63 times as much as that of the original heparin skeleton synthase PmHS 2.
The GlcNAc transferase activity of PmHS2 (F197Y) was increased by 2.81 times as much as that of the original heparin skeleton synthase PmHS2, and the GlcA transferase activity was increased by 2.76 times as much as that of the original heparin skeleton synthase PmHS 2.
The GlcNAc transferase activity of PmHS2 (T326S) was increased by 3.73 times as much as that of the original heparin skeleton synthase PmHS2, and the GlcA transferase activity was increased by 1.67 times as much as that of the original heparin skeleton synthase PmHS 2.
The GlcNAc transferase activity of PmHS2 (T344S) was increased by 2.08 times as much as that of the original heparin skeleton synthase PmHS2, and the GlcA transferase activity was increased by 3.35 times as much as that of the original heparin skeleton synthase PmHS 2.
The GlcNAc transferase activity of PmHS2 (I476F) was increased by 4.33 times as much as that of the original heparin skeleton synthase PmHS2, and the GlcA transferase activity was increased by 2.29 times as much as that of the original heparin skeleton synthase PmHS 2.
The changes in the activities of PmHS2 (D165Y), pmHS2 (F267L), pmHS2 (H297N), and PmHS2 (S548A) were not significant.
As shown by the data, compared with the heparin skeleton synthase PmHS2, the heparin skeleton synthase PmHS2 mutant provided by the invention has very high GlcNAc transferase activity and GlcA transferase activity which are respectively 2-3.5 times and 1.5-3.5 times that of the heparin skeleton synthase PmHS2, and can effectively synthesize a GlcNAc-GlcA-pNP heparin skeleton by utilizing substrates UDP-GlcNAc and GlcA-pNP under the optimal condition; or the substrates UDP-GlcA and GlcNAc-GlcA-pNP are utilized to effectively synthesize the GlcA-GlcNAc-GlcA-pNP heparin skeleton, so that the GlcNAc transfer synthesis efficiency and GlcA transfer synthesis efficiency in heparin skeleton synthesis are improved, the application development of heparin bionic synthesis is greatly promoted, and a brand new page is opened for the research and development of glycosaminoglycan.
EXAMPLE 3 stability study of heparin skeleton synthase PmHS2 mutant
The conventional PmHS2 mutant proteins obtained in example 1 and PmHS2 mutant proteins obtained in example 1 were left at 37℃for 0h, 24h, 48h and 72h, respectively, and then the conventional activities were measured in accordance with the reaction system shown in example 2 using commercially available GlcA-pNP (final concentration: 0.2 mM) as a acceptor substrate and UDP-GlcNAc (final concentration: 0.3 mM) as a donor substrate. The reaction is put into a water bath kettle at 37 ℃ for reaction for 30min, boiling water is heated for 5min to inactivate protease, thus stopping the reaction, the reaction liquid is filtered by a filter membrane of 0.22 mu m, HPLC detection is carried out according to the method shown in table 1, the pNP group of the monosaccharide receptor has specific absorption under the detection wavelength of ultraviolet 310nm, the flow rate of a mobile phase is 0.5mL/min, and the detection result is shown in figure 3.
As shown in FIG. 3, the heparin skeleton synthase PmHS2 mutant provided by the invention has good stability, can still maintain more than 55% of enzyme activity after being placed at 37 ℃ for 24 hours, and can still maintain more than 40% of enzyme activity after being placed at 37 ℃ for 72 hours. The enzyme activity of the heparin skeleton synthase PmHS2 is reduced to 40% after the heparin skeleton synthase PmHS2 is placed at 37 ℃ for 24 hours, and the enzyme activity is reduced to about 10% after the heparin skeleton synthase PmHS2 is placed at 37 ℃ for 72 hours. Wherein, the purified proteins of PmHS2 (F197Y), pmHS2 (S135A) and PmHS2 (T344S) can still keep more than 80 percent of enzyme activity after being placed for 72 hours at 37 ℃ and is 5 to 7 times of that of heparin skeleton synthase PmHS 2.
SEQUENCE LISTING
<110> university of Shandong
<120> high-activity heparin skeleton synthase PmHS2 mutant and application thereof
<160> 12
<170> PatentIn version 3.5
<210> 1
<211> 1956
<212> DNA
<213> artificial sequence
<400> 1
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttacc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 2
<211> 651
<212> PRT
<213> artificial sequence
<400> 2
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Thr Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 3
<211> 1956
<212> DNA
<213> artificial sequence
<400> 3
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccgcgcacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 4
<211> 651
<212> PRT
<213> artificial sequence
<400> 4
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ala His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 5
<211> 1956
<212> DNA
<213> artificial sequence
<400> 5
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctatta tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 6
<211> 651
<212> PRT
<213> artificial sequence
<400> 6
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Tyr Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 7
<211> 1956
<212> DNA
<213> artificial sequence
<400> 7
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcagcga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 8
<211> 651
<212> PRT
<213> artificial sequence
<400> 8
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Ser Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 9
<211> 1956
<212> DNA
<213> artificial sequence
<400> 9
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca gccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 10
<211> 651
<212> PRT
<213> artificial sequence
<400> 10
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Ser Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 11
<211> 1956
<212> DNA
<213> artificial sequence
<400> 11
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actattttac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 12
<211> 651
<212> PRT
<213> artificial sequence
<400> 12
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Phe Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
Claims (8)
1. The high-activity heparin skeleton synthase PmHS2 mutant T326S is characterized in that the amino acid sequence is shown as SEQ ID NO.8, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 7; the 326 th threonine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into serine; genBank of the amino acid sequence of the heparin skeleton synthase PmHS2 is AY292200.1.
2. A high-activity heparin skeleton synthase PmHS2 mutant I476F is characterized in that the amino acid sequence is shown as SEQ ID NO.12, and the nucleotide sequence of the encoding gene is shown as SEQ ID NO. 11; the 476 th isoleucine of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into phenylalanine; genBank of the amino acid sequence of the heparin skeleton synthase PmHS2 is AY292200.1.
3. A recombinant vector comprising a plasmid vector and a gene encoding the heparin skeleton synthase PmHS2 mutant according to claim 1 or claim 2 inserted into the plasmid vector.
4. The recombinant vector of claim 3, wherein the plasmid vector is pET-28a (+).
5. A recombinant strain obtained by transforming the recombinant vector of claim 3 into a host cell.
6. The recombinant strain of claim 5, wherein the host cell is e.
7. Use of the heparin scaffold synthase mutant according to claim 1 or claim 2, for the synthesis of heparin scaffolds.
8. The use according to claim 7 wherein the use is of UDP-GlcNAc as the donor substrate and GlcA-pNP as the acceptor substrate to catalyze a reaction to produce a heparin backbone having the structure GlcNAc-GlcA-pNP; or the UDP-GlcA is used as a donor substrate, the GlcNAc-GlcA-pNP is used as an acceptor substrate, and the heparin skeleton with the structure of GlcA-GlcNAc-GlcA-pNP is generated by catalytic reaction.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110881483.0A CN113528482B (en) | 2021-08-02 | 2021-08-02 | High-activity heparin skeleton synthase PmHS2 mutant and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110881483.0A CN113528482B (en) | 2021-08-02 | 2021-08-02 | High-activity heparin skeleton synthase PmHS2 mutant and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113528482A CN113528482A (en) | 2021-10-22 |
CN113528482B true CN113528482B (en) | 2024-02-06 |
Family
ID=78121877
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110881483.0A Active CN113528482B (en) | 2021-08-02 | 2021-08-02 | High-activity heparin skeleton synthase PmHS2 mutant and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113528482B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2002256501B2 (en) * | 2001-05-08 | 2008-05-29 | The Board Of Regents Of The University Of Oklahoma | Heparin/heparosan synthase and methods of making and using same |
CN101855354A (en) * | 2007-09-12 | 2010-10-06 | 拜尔作物科学股份公司 | Plants which synthesize increased amounts of glucosaminglycans |
WO2014132468A1 (en) * | 2013-03-01 | 2014-09-04 | 独立行政法人理化学研究所 | Sugar chain compound and method for producing sugar chain compound |
CN107189992A (en) * | 2017-06-29 | 2017-09-22 | 江南大学 | A kind of heparosan synthase and its application |
-
2021
- 2021-08-02 CN CN202110881483.0A patent/CN113528482B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2002256501B2 (en) * | 2001-05-08 | 2008-05-29 | The Board Of Regents Of The University Of Oklahoma | Heparin/heparosan synthase and methods of making and using same |
CN101855354A (en) * | 2007-09-12 | 2010-10-06 | 拜尔作物科学股份公司 | Plants which synthesize increased amounts of glucosaminglycans |
WO2014132468A1 (en) * | 2013-03-01 | 2014-09-04 | 独立行政法人理化学研究所 | Sugar chain compound and method for producing sugar chain compound |
CN107189992A (en) * | 2017-06-29 | 2017-09-22 | 江南大学 | A kind of heparosan synthase and its application |
Also Published As
Publication number | Publication date |
---|---|
CN113528482A (en) | 2021-10-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110734899B (en) | Sucrose phosphorylase mutant with improved enzyme activity and construction method and application thereof | |
US11203772B2 (en) | Chemoenzymatic synthesis of structurally homogeneous ultra-low molecular weight heparins | |
CN108018269B (en) | Levansucrase mutant with improved thermal stability | |
EP1816206B1 (en) | Method of producing uridine 5 -diphospho-n-acetylgalactosamine | |
US5714376A (en) | Heparinase gene from flavobacterium heparinum | |
CN112029752A (en) | Ulva lactuca polysaccharide lyase as well as coding gene and application thereof | |
WO1997020061A1 (en) | Synthesis of hyaluronic acid | |
CN111088183B (en) | Marine vibrio and application thereof in preparation of iota-carrageenase with thermal stability | |
CN113528482B (en) | High-activity heparin skeleton synthase PmHS2 mutant and application thereof | |
TWI719140B (en) | New polyphosphate-dependent glucokinase and method for preparing glucose 6-phosphate using the same | |
US20040197868A1 (en) | Heparin/heparosan synthase from P. multocida, soluble and single action catalysts thereof and methods of making and using same | |
EP2094840B1 (en) | Novel n-acetylglucosamine-2-epimerase and method for producing cmp-neuraminic acid using the same | |
WO2023035584A1 (en) | Construction and application of yeast engineering bacteria for fermentative production of heparin | |
US20170204381A1 (en) | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds | |
CN110055234B (en) | Chondroitin sulfate synthase and coding gene and application thereof | |
Habuchi et al. | Biosynthesis of heparan sulfate and heparin how are the multifunctional glycosaminoglycans built up? | |
CN113564139B (en) | Pasteurella heparin framework synthase BtHS1, mutant and application thereof | |
CN110643586B (en) | Heparin skeleton synthase and coding gene and application thereof | |
US10273517B2 (en) | Methods of producing testosteronan polymers using testosteronan synthase | |
CN114958790B (en) | Heparin skeleton synthase, mutant and application thereof | |
CN110157700A (en) | A kind of circumscribed-type widow algin catenase and its application | |
CN117778354B (en) | Laminarin degrading enzyme OUC-ScLam39 mutant and encoding gene and application thereof | |
WO2022209658A1 (en) | α-1,2-GLUCOSIDASE, PRODUCTION METHOD THEREFOR, AND USE THEREOF | |
CN116676357A (en) | Preparation method of N-sulfonated heparin precursor oligosaccharide | |
CN114703159A (en) | Glucosyltransferase mutant and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |