CN113528482A - High-activity heparin framework synthase PmHS2 mutant and application thereof - Google Patents
High-activity heparin framework synthase PmHS2 mutant and application thereof Download PDFInfo
- Publication number
- CN113528482A CN113528482A CN202110881483.0A CN202110881483A CN113528482A CN 113528482 A CN113528482 A CN 113528482A CN 202110881483 A CN202110881483 A CN 202110881483A CN 113528482 A CN113528482 A CN 113528482A
- Authority
- CN
- China
- Prior art keywords
- leu
- ile
- lys
- asn
- pmhs2
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 229920000669 heparin Polymers 0.000 title claims abstract description 119
- 229960002897 heparin Drugs 0.000 title claims abstract description 119
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 title claims abstract description 114
- 230000000694 effects Effects 0.000 title claims abstract description 64
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 35
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims abstract description 19
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims abstract description 15
- 239000004473 Threonine Substances 0.000 claims abstract description 15
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims abstract description 10
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims abstract description 10
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims abstract description 6
- 229960000310 isoleucine Drugs 0.000 claims abstract description 6
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims abstract description 6
- 230000035772 mutation Effects 0.000 claims abstract description 4
- 150000001413 amino acids Chemical group 0.000 claims description 37
- 239000000758 substrate Substances 0.000 claims description 18
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 15
- 238000000034 method Methods 0.000 claims description 14
- 239000002773 nucleotide Substances 0.000 claims description 13
- 125000003729 nucleotide group Chemical group 0.000 claims description 13
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 claims description 8
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 claims description 8
- 239000013598 vector Substances 0.000 claims description 6
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 4
- HDYANYHVCAPMJV-UHFFFAOYSA-N Uridine diphospho-D-glucuronic acid Natural products O1C(N2C(NC(=O)C=C2)=O)C(O)C(O)C1COP(O)(=O)OP(O)(=O)OC1OC(C(O)=O)C(O)C(O)C1O HDYANYHVCAPMJV-UHFFFAOYSA-N 0.000 claims description 4
- 235000004279 alanine Nutrition 0.000 claims description 4
- WMWKTCPGFOEPBD-YGIWDPDDSA-N azane;(2s,3s,4s,5r,6r)-6-[[[(2r,3s,4r,5r)-5-(2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound N.C([C@@H]1[C@H]([C@H]([C@@H](O1)N1C(NC(=O)C=C1)=O)O)O)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O WMWKTCPGFOEPBD-YGIWDPDDSA-N 0.000 claims description 4
- 238000006555 catalytic reaction Methods 0.000 claims description 4
- 239000013600 plasmid vector Substances 0.000 claims description 4
- 241000588724 Escherichia coli Species 0.000 claims description 3
- 239000004472 Lysine Substances 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims description 3
- 230000001131 transforming effect Effects 0.000 claims description 2
- 102000004190 Enzymes Human genes 0.000 abstract description 21
- 108090000790 Enzymes Proteins 0.000 abstract description 21
- 102000004169 proteins and genes Human genes 0.000 abstract description 13
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 24
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 18
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 18
- 238000003786 synthesis reaction Methods 0.000 description 18
- 230000015572 biosynthetic process Effects 0.000 description 15
- 238000006243 chemical reaction Methods 0.000 description 14
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 12
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 12
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 12
- AEMOLEFTQBMNLQ-UHFFFAOYSA-N beta-D-galactopyranuronic acid Natural products OC1OC(C(O)=O)C(O)C(O)C1O AEMOLEFTQBMNLQ-UHFFFAOYSA-N 0.000 description 12
- 108010049041 glutamylalanine Proteins 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 229920002683 Glycosaminoglycan Polymers 0.000 description 10
- 102000004357 Transferases Human genes 0.000 description 10
- 108090000992 Transferases Proteins 0.000 description 10
- 108010090473 UDP-N-acetylglucosamine-peptide beta-N-acetylglucosaminyltransferase Proteins 0.000 description 10
- 235000018102 proteins Nutrition 0.000 description 9
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 8
- 238000001514 detection method Methods 0.000 description 7
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 6
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 6
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 6
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 6
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 6
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 6
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 6
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 6
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 6
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 6
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 6
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 6
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 6
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 6
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 6
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 6
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 6
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 6
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 6
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 6
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 6
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 6
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 6
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 6
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 6
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 6
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 6
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 6
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 6
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 6
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 6
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 6
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 6
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 6
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 6
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 6
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 6
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 6
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 6
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 6
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 6
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 6
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 6
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 6
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 6
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 6
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 6
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 6
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 6
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 6
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 6
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 6
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 6
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 6
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 6
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 6
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 6
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 6
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 6
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 6
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 6
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 6
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 6
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 6
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 6
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 6
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 6
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 6
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 6
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 6
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 6
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 6
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 6
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 6
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 6
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 6
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 6
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 6
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 6
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 6
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 6
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 6
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 6
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 6
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 6
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 6
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 6
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 6
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 6
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 6
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 6
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 6
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 6
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 6
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 6
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 6
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 6
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 6
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 6
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 6
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 6
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 6
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 6
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 6
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 6
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 6
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 6
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 6
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 6
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 6
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 6
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 6
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 6
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 6
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 6
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 6
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 6
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 6
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 6
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 6
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 6
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 6
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 6
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 6
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 6
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 6
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 6
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 6
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 6
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 6
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 6
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 6
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 6
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 6
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 6
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 6
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 6
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 6
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 6
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 6
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 6
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 6
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 6
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 6
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 6
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 6
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 6
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 6
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 6
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 6
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 6
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 6
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 6
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 6
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 6
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 6
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 6
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 6
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 6
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 6
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 6
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 6
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 6
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 6
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 6
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 6
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 6
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 6
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010079547 glutamylmethionine Proteins 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- 108010000998 wheylin-2 peptide Proteins 0.000 description 6
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 5
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 5
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 5
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 5
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 5
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 5
- -1 heparin oligosaccharide Chemical class 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- AEMOLEFTQBMNLQ-AQKNRBDQSA-N D-glucopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-AQKNRBDQSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 3
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 3
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 229920001542 oligosaccharide Polymers 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 3
- 229910000162 sodium phosphate Inorganic materials 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 2
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 235000001014 amino acid Nutrition 0.000 description 2
- 229940024606 amino acid Drugs 0.000 description 2
- 230000003592 biomimetic effect Effects 0.000 description 2
- 238000009835 boiling Methods 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 150000002016 disaccharides Chemical group 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000007791 liquid phase Substances 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 229950006780 n-acetylglucosamine Drugs 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 238000012827 research and development Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 1
- IAJILQKETJEXLJ-KLVWXMOXSA-N (2s,3r,4r,5r)-2,3,4,5-tetrahydroxy-6-oxohexanoic acid Chemical group O=C[C@H](O)[C@H](O)[C@@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-KLVWXMOXSA-N 0.000 description 1
- RVDBHQMAUIHAHC-OFKSJOQMSA-N (2s,3s,4s,5r,6r)-3,4,5,6-tetrahydroxy-6-(4-nitrophenyl)oxane-2-carboxylic acid Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](C(O)=O)O[C@]1(O)C1=CC=C([N+]([O-])=O)C=C1 RVDBHQMAUIHAHC-OFKSJOQMSA-N 0.000 description 1
- MSWZFWKMSRAUBD-UHFFFAOYSA-N 2-Amino-2-Deoxy-Hexose Chemical group NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 description 1
- SQDAZGGFXASXDW-UHFFFAOYSA-N 5-bromo-2-(trifluoromethoxy)pyridine Chemical compound FC(F)(F)OC1=CC=C(Br)C=N1 SQDAZGGFXASXDW-UHFFFAOYSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- 206010002383 Angina Pectoris Diseases 0.000 description 1
- 102000004411 Antithrombin III Human genes 0.000 description 1
- 108090000935 Antithrombin III Proteins 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 229920001287 Chondroitin sulfate Polymers 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 229920000045 Dermatan sulfate Polymers 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 229920002971 Heparan sulfate Polymers 0.000 description 1
- 229920000288 Keratan sulfate Polymers 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- DUKURNFHYQXCJG-UHFFFAOYSA-N Lewis A pentasaccharide Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)OC1CO DUKURNFHYQXCJG-UHFFFAOYSA-N 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102100023315 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Human genes 0.000 description 1
- 108010056664 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyltransferase Proteins 0.000 description 1
- 206010029164 Nephrotic syndrome Diseases 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 229940122055 Serine protease inhibitor Drugs 0.000 description 1
- 101710102218 Serine protease inhibitor Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- 229920001284 acidic polysaccharide Polymers 0.000 description 1
- 150000004805 acidic polysaccharides Chemical class 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000001994 activation Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002429 anti-coagulating effect Effects 0.000 description 1
- 230000002785 anti-thrombosis Effects 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 229960005348 antithrombin iii Drugs 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229940059329 chondroitin sulfate Drugs 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- AVJBPWGFOQAPRH-FWMKGIEWSA-L dermatan sulfate Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@H](OS([O-])(=O)=O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](C([O-])=O)O1 AVJBPWGFOQAPRH-FWMKGIEWSA-L 0.000 description 1
- 229940051593 dermatan sulfate Drugs 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 239000006167 equilibration buffer Substances 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 150000002402 hexoses Chemical group 0.000 description 1
- 229920002674 hyaluronan Polymers 0.000 description 1
- 229960003160 hyaluronic acid Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 210000004347 intestinal mucosa Anatomy 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- KXCLCNHUUKTANI-RBIYJLQWSA-N keratan Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@H](COS(O)(=O)=O)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](O[C@@H](O[C@H]3[C@H]([C@@H](COS(O)(=O)=O)O[C@@H](O)[C@@H]3O)O)[C@H](NC(C)=O)[C@H]2O)COS(O)(=O)=O)O[C@H](COS(O)(=O)=O)[C@@H]1O KXCLCNHUUKTANI-RBIYJLQWSA-N 0.000 description 1
- 230000037356 lipid metabolism Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010056119 protease So Proteins 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 239000003001 serine protease inhibitor Substances 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 238000000825 ultraviolet detection Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/18—Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/01—Hexosyltransferases (2.4.1)
- C12Y204/01017—Glucuronosyltransferase (2.4.1.17)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention relates to a high-activity heparin framework synthase PmHS2 mutant and application thereof. The mutant is any one site mutation of 130 th serine, 135 th serine, 197 th phenylalanine, 326 th threonine, 344 th threonine and 476 th isoleucine of heparin skeleton synthase PmHS 2. The heparin framework synthase PmHS2 mutant provided by the invention has good stability, can still keep more than 40% of enzyme activity after being placed at 37 ℃ for 72 hours, and is more than 2 times of the activity of the existing heparin framework synthase PmHS 2. In particular to the protein expressed by the heparin framework synthase mutant PmHS2(F197Y), the heparin framework synthase mutant PmHS2(S135A) and the heparin framework synthase mutant PmHS2(T344S), which can still keep more than 80% of enzyme activity after being placed at 37 ℃ for 72 hours and is 5-7 times of the prior heparin framework synthase PmHS 2.
Description
Technical Field
The invention relates to a high-activity heparin framework synthase PmHS2 mutant and application thereof, belonging to the technical field of biology.
Background
Glycosaminoglycan (GAG) is a complex biomacromolecule consisting of repeating disaccharide units of hexuronic acid or hexose and hexosamine regularly arranged to form a linear unbranched acidic polysaccharide. Glycosaminoglycans are widely distributed and various, and classified into glycosaminoglycans without sulfuric acid modification and glycosaminoglycans with sulfuric acid modification according to the degree of sulfation, the glycosaminoglycans without sulfuric acid modification include only hyaluronic acid, and the glycosaminoglycans with sulfuric acid modification include dermatan sulfate/chondroitin sulfate, heparan sulfate/heparin, keratan sulfate, and the like.
Heparin (HP), an important glycosaminoglycan, is composed of repeating disaccharide units formed from D- β -glucuronic acid (or L- α -iduronic acid) and N-acetylglucosamine. It was first discovered from the liver and is named for its activity in enhancing the natural serine protease inhibitor antithrombin iii, catalyzing the inhibition of all serine proteases of the intrinsic coagulation pathway, thus having anticoagulant and antithrombotic effects, and also mediating important physiological processes including cell adhesion, lipid metabolism and growth factor regulation by binding to other proteins. The traditional Chinese medicine composition can be clinically used for treating angina, nephrotic syndrome, severe burns, rheumatoid arthritis and the like, and the demand of the traditional Chinese medicine composition is in the front of the biotechnology medicine field all year round internationally.
The sources of heparin are mainly divided into animal extraction, chemical synthesis and chemical enzymatic synthesis. The animal extracted heparin is mainly extracted from small intestinal mucosa of pigs and cows, but due to the difference of extraction methods and the structure and the number of basic units of the animal derived heparin are different, the structure, the configuration and the molecular weight of heparin products are different, and heparin pollution is possibly caused. In addition, heparin has a complex structure, and when a chemical synthesis method is used for extending a sugar chain, steps such as protection, deprotection, activation and coupling of an unstable group are required to be considered, and conditions such as regioselectivity and stereoselectivity are required to be considered, so that the difficulty is high and the yield is low. The chemical enzyme method synthesis of HP is a method combining a chemical method and an enzyme method, and the enzyme reaction condition required in the synthesis process is mild and the specificity is strong. Compared with chemical synthesis, the method has simple steps and high efficiency. The enzymes commonly used for synthesizing heparin skeleton sugar chains by the chemical enzyme method at present comprise heparin skeleton synthase KfiA and heparin skeleton synthase PmHS2, wherein the heparin skeleton synthase PmHS2 is a bifunctional enzyme with N-acetylglucosaminyl transferase activity and glucuronidase activity, the enzyme is widely applied to the synthesis by the chemical enzyme method, and the improvement of the enzyme activity is beneficial to the high-efficiency synthesis of the heparin skeleton sugar chains.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a high-activity heparin skeleton synthase PmHS2 mutant and application thereof.
Description of terms:
GlcA-pNP: the Chinese is called 4-nitrophenyl-beta-D-glucuronic acid, and the function of the Chinese is used as an initial receptor substrate for heparin oligosaccharide synthesis;
UDP-GlcNAc: the Chinese is fully called uridine diphosphate-N-acetylglucosamine, and the function of the Chinese is as a donor to provide acetylglucosamine for the synthesis of heparin oligosaccharide;
the technical scheme of the invention is as follows:
the mutant is any one site mutation of 130 th serine, 135 th serine, 197 th phenylalanine, 326 th threonine, 344 th threonine and 476 th isoleucine of heparin skeleton synthase PmHS 2.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (S130T), the amino acid sequence of the mutant is shown as SEQ ID NO.2, and the nucleotide sequence of the coding gene is shown as SEQ ID NO. 1; the 130 th serine of the amino acid sequence of heparin skeleton synthase PmHS2 is mutated into threonine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (S135A), the amino acid sequence of the mutant is shown as SEQ ID NO.4, and the nucleotide sequence of the coding gene is shown as SEQ ID NO. 3; the 135 th serine of the amino acid sequence of heparin skeleton synthase PmHS2 is mutated into alanine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (F197Y), the amino acid sequence of the mutant is shown as SEQ ID NO.6, and the nucleotide sequence of the coding gene is shown as SEQ ID NO. 5; the phenylalanine at position 197 of an amino acid sequence of heparin skeleton synthase PmHS2 is mutated into lysine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (T326S), the amino acid sequence of the mutant is shown as SEQ ID NO.8, and the nucleotide sequence of the coding gene is shown as SEQ ID NO. 7; the threonine at the 326 th site of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into serine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (T344S), the amino acid sequence of the mutant is shown as SEQ ID NO.10, and the nucleotide sequence of the coding gene is shown as SEQ ID NO. 9; the method is characterized in that threonine at the 344 th site of an amino acid sequence of heparin skeleton synthase PmHS2 is mutated into serine.
According to the invention, the mutant is a heparin skeleton synthase PmHS2 mutant (I476F), the amino acid sequence of the mutant is shown as SEQ ID NO.12, and the nucleotide sequence of the coding gene is shown as SEQ ID NO. 11; isoleucine at the 476 th site of an amino acid sequence of heparin skeleton synthase PmHS2 is mutated into phenylalanine.
A recombinant vector is obtained by inserting the coding gene of the heparin skeleton synthase PmHS2 mutant into a plasmid vector.
Preferably, according to the invention, the plasmid vector is pET-28a (+).
In the present invention, the recombinant vector containing the gene encoding the PmHS2 mutant was synthesized by Nanjing Kinsley.
A recombinant strain is obtained by transforming the above recombinant vector into a host cell.
Preferably according to the invention, the host cell is E.coli; further preferably, the host cell is escherichia coli BL21(DE 3).
The application of the heparin framework synthase mutant in synthesizing the heparin framework.
According to the invention, UDP-GlcNAc is used as a donor substrate and GlcA-pNP is used as an acceptor substrate, and the heparin framework with the structure of GlcNAc-GlcA-pNP is generated through catalytic reaction; or carrying out catalytic reaction by using UDP-GlcA as a donor substrate and GlcNAc-GlcA-pNP as an acceptor substrate to generate a heparin framework with the structure of GlcA-GlcNAc-GlcA-pNP.
The experimental procedures not described in detail in the present invention can be performed according to the routine experimental procedures in the technical field.
Advantageous effects
1. The invention selects 130 th, 135 th, 197 th, 326 th, 344 th and 476 th amino acids as the basis of heparin skeleton synthase PmHS2 to carry out site-directed mutagenesis, the 130 th serine is mutated into threonine (S130T), the 135 th serine is mutated into alanine (S135A), the 197 th phenylalanine is mutated into lysine (F197Y), the 326 th threonine is mutated into serine (T326S), the 344 th threonine is mutated into serine (T344S), and the 476 th isoleucine is mutated into phenylalanine (I476F). Compared with heparin framework synthase PmHS2, the heparin framework synthase PmHS2 mutant has very high GlcNAc transferase activity and GlcA transferase activity which are respectively 2-3.5 times and 1.5-3.5 times of the heparin framework synthase PmHS2, and can effectively synthesize a GlcNAc-GlcA-pNP heparin framework by using substrates UDP-GlcNAc and GlcA-pNP under the optimal conditions; or the substrate UDP-GlcA and GlcNAc-GlcA-pNP are utilized to effectively synthesize the GlcA-GlcNAc-GlcA-pNP heparin framework, the GlcNAc transfer synthesis efficiency and the GlcA transfer synthesis efficiency in the synthesis of the heparin framework are improved, the application development of the heparin biomimetic synthesis is greatly promoted, and a brand new page is opened for the research and development of glycosaminoglycan.
2. The heparin framework synthase PmHS2 mutant provided by the invention has good stability, can still keep more than 40% of enzyme activity after being placed at 37 ℃ for 72 hours, and is more than 2 times of the activity of the existing heparin framework synthase PmHS 2. In particular to the protein expressed by the heparin framework synthase mutant PmHS2(F197Y), the heparin framework synthase mutant PmHS2(S135A) and the heparin framework synthase mutant PmHS2(T344S), which can still keep more than 80% of enzyme activity after being placed at 37 ℃ for 72 hours and is 5-7 times of the prior heparin framework synthase PmHS 2.
Drawings
FIG. 1 is a SDS-PAGE result of soluble expression of target protein in recombinant Escherichia coli by a heparin skeleton synthase PmHS2 mutant;
FIG. 2 is a bar graph of the relative conversion rates of heparin skeleton synthase PmHS2 and its mutants;
FIG. 3 is a stability profile of heparin scaffold synthase PmHS2 mutant.
In the figure, the ordinate is the relative yield of the reaction product, based on 100% of the initial enzyme activity, and the abscissa is the time the protein has been left at 37 ℃.
Detailed Description
The technical solution of the present invention is further described below with reference to the following embodiments and the accompanying drawings of the specification, but the scope of the present invention is not limited thereto. Unless otherwise specified, all technical means used in the present invention are well known to those skilled in the art.
The inventors obtained the amino acid sequence and nucleotide sequence information of heparin scaffold synthase PmHS2(GenBank: AY292200.1) from bioinformatics database, and then analyzed the highly conserved region of amino acid sequence using Jalview software. At the same time, protein simulation modeling was performed on heparin scaffold synthase PmHS2 using the Swiss-Model tool, and the active center of the enzyme was predicted using the HotSpot Wizard 2.0. It is found that the 130 th, 135 th, 165 th, 197 th, 267 th, 296 th, 326 th, 344 th, 476 th and 548 th of the amino acid sequence of the heparin skeleton synthase PmHS2 are positioned near an active center and a highly conserved region, and the catalytic activity of PmHS2 is probably improved by carrying out site-directed mutagenesis on the amino acid sequence. Thus, combining the dominant amino acids at these six positions of the homology analysis, ten mutants of PmHS2(S130T), PmHS2(S135A), PmHS2(D165Y), PmHS2(F197Y), PmHS2(F267L), PmHS2(H296N), PmHS2(T326S), PmHS2(T344S), PmHS2(I476F), PmHS2(S548A) were designed.
Wherein, the coding gene of the mutant PmHS2(S130T) is shown as SEQ ID NO.1, and the amino acid sequence is shown as SEQ ID NO. 2; the encoding gene of the mutant PmHS2(S135A) is shown as SEQ ID NO.3, and the amino acid sequence is shown as SEQ ID NO. 4; the encoding gene of the mutant PmHS2(F197Y) is shown as SEQ ID NO.5, and the amino acid sequence is shown as SEQ ID NO. 6; the encoding gene of the mutant PmHS2(T326S) is shown as SEQ ID NO.7, and the amino acid sequence is shown as SEQ ID NO. 8; the encoding gene of the mutant PmHS2(T344S) is shown as SEQ ID NO.9, and the amino acid sequence is shown as SEQ ID NO. 10; the encoding gene of the mutant PmHS2(I476F) is shown as SEQ ID NO.11, and the amino acid sequence is shown as SEQ ID NO. 12. The mutant PmHS2(D165Y) mutates aspartic acid at position 165 into tyrosine; the mutant PmHS2(F267L) is formed by mutating phenylalanine at position 267 to leucine; mutation of histidine at 296 of mutant PmHS2(H296N) into asparagine; the 548 th serine of the mutant PmHS2(S548A) is mutated into alanine. The unprotected mutants herein will not show their sequence, but their sequence is obtained in the same way.
Unless otherwise indicated, the substrate saccharide reagents employed in the present invention were purchased from sigma and from Producer. All plasmids for the PmHS2 mutant were constructed by Nanjing Kinshire. The HPLC detection method adopts YMC amino column, liquid phase system is Shimadzu liquid phase, and ultraviolet detection system is SPD-20A. Detecting ultraviolet absorption of each component at 310nm and 254nm after the heparin skeleton synthase catalytic product is separated by a chromatographic column, wherein HPLC mobile phase conditions are shown in Table 1:
TABLE 1 HPLC analysis method used for detection of heparin oligosaccharides
Example 1 preparation of heparin scaffold synthase PmHS2 mutant
(1) Construction of expression Strain
Taking a mutant PmHS2(F197Y) as an example, a recombinant vector pET28a-His-PmHS2(F197Y) -Stop codon (synthesized by Nanjing Kingsler) containing a gene (SEQ ID NO.5) encoding heparin skeleton synthase PmHS2(F197Y) is transformed into escherichia coli BL21(DE3) competent cells to construct a recombinant strain, the recombinant strain is cultured on an LB plate of kanamycin (100 mu g/mL) for 12 hours, and transformants are screened (a negative control experiment is carried out) to activate to obtain a PmHS2(F197Y) positive transformant; the amino acid sequence of the heparin skeleton synthase is shown in SEQ ID NO. 6.
Positive transformants for the remaining nine mutants, PmHS2(S130T), PmHS2(S135A), PmHS2(D165Y), PmHS2(F267L), PmHS2(H296N), PmHS2(T326S), PmHS2(T344S), PmHS2(I476F), and PmHS2(S548A), were obtained in the same manner as described above.
(2) Expression and purification of proteins
Using the mutant PmHS2(F197Y) as an example, a single colony of a transformant positive to the mutant PmHS2(F197Y) was activated and cultured in 50mL of sterile LB liquid medium for 12 hours (37 ℃, 225 r/min). Then, 1% volume of the activated bacterial suspension was added to LB liquid medium containing kanamycin (100. mu.g/mL), and the mixture was shake-cultured at 37 ℃ and 225r/min for about 3 hours to OD6000.6-0.8, adding IPTG (final concentration of 0.2mM), and inducing at 22 deg.C and 225r/min for 16-18 h. Then, the cells were collected by centrifugation at 8000rpm for 10min and resuspended in equilibration buffer, sonicated on ice (working 15s, pause 45s, amplitude 33%, energy 1500KJ, 4 ℃) for 20min, and the disrupted cells were centrifuged at 12000rpm for 40min (4 ℃), the supernatant was retained, and the supernatant was filtered through a 0.22 μm filter. Purification by histidine tag with nickel column: firstly, the nickel column uses an equilibrium buffer solution to balance 5-10 column volumes, filtered supernatant flows through the nickel column to finish the step of loading, the equilibrium buffer solution is used for balancing 3-5 column volumes after loading, then washing buffer solution is used for washing off impure protein, and finally elution buffer solution is used for eluting to obtain the target protein. The purified protein was stored in 20% glycerol and each tube was stored in a freezer at-80 ℃.
Wherein, the components of the equilibrium buffer solution are as follows: 0.3M NaCl, 50mM NaH2PO4,pH=8;
The composition of the wash buffer was: 10mM imidazole, 0.3M NaCl, 50mM NaH2PO4,pH=8;
The composition of the elution buffer was: 250mM imidazole, 0.3M NaCl, 50mM NaH2PO4,pH=8。
The remaining nine mutants were purified according to the same method as described above to obtain recombinant proteins of PmHS2(S130T), PmHS2(S135A), PmHS2(D165Y), PmHS2(F267L), PmHS2(H296N), PmHS2(T326S), PmHS2(T344S), PmHS2(I476F), and PmHS2 (S548A).
The expression of the purified recombinant protein was identified by polyacrylamide gel electrophoresis (SDS-PAGE) (as shown in FIG. 1), and after the recombinant expression of the positive transformant, the protein was purified, the band was single, and the molecular weight of the target protein was consistent with the theoretical value (PmHS2 is 75.39 KDa).
Example 2 Activity assay of heparin scaffold synthase PmHS2 mutant
Using commercial GlcA-pNP (final concentration of 0.2mM) as an acceptor substrate and UDP-GlcNAc (final concentration of 0.3mM) as a donor substrate, a reaction was carried out using a heparin skeleton synthase PmHS2 and the heparin skeleton synthase PmHS2 mutant prepared in example 1 as proteases in the reaction system shown in Table 2; the reaction is placed in a water bath kettle at 37 ℃ for reaction for 30min, then boiling water is heated for 5min to inactivate protease so as to stop the reaction, the reaction solution is filtered by a filter membrane of 0.22 mu m and then HPLC detection is carried out according to the method shown in Table 1, the pNP group of the monosaccharide receptor has specific absorption under the detection wavelength of ultraviolet 310nm, and the flow rate of a mobile phase is 0.5 mL/min.
TABLE 2 enzyme activity verification reaction System for heparin skeleton synthase
The results of the measurement are shown in FIG. 2, in which the conversion of PmHS2 was set to 1 and the conversion of the mutant was reduced to a multiple thereof.
As can be seen from FIG. 2, the heparin scaffold synthase PmHS2 mutant has GlcNAc transferase activity and GlcA transferase activity, and can transfer a GlcNAc group to the non-reducing end of GlcA-pNP to produce heparin disaccharide GlcNAc-GlcA-pNP; the GlcA group can also be transferred to the non-reducing end of GlcNAc-GlcA-pNP to produce heparin trisaccharide GlcA-GlcNAc-GlcA-pNP.
In addition, the GlcNAc transferase activity and the GlcA transferase activity of each mutant were improved to various degrees.
The GlcNAc transferase activity of PmHS2(S130T) is increased by 2.51 times of the activity of the PmHS2 of the pro-heparin skeleton synthase, and the GlcA transferase activity is increased by 2.97 times of the activity of the PmHS2 of the pro-heparin skeleton synthase.
The GlcNAc transferase activity of PmHS2(S135A) was increased to 3.31 times that of the pro-heparin skeleton synthase PmHS2, and the GlcA transferase activity was increased to 1.63 times that of the pro-heparin skeleton synthase PmHS 2.
The GlcNAc transferase activity of PmHS2(F197Y) was increased to 2.81 times of the activity of PmHS2, and the GlcA transferase activity was increased to 2.76 times of the activity of PmHS 2.
The GlcNAc transferase activity of PmHS2(T326S) is increased to 3.73 times of the activity of PmHS2, and the GlcA transferase activity is increased to 1.67 times of the activity of PmHS 2.
The GlcNAc transferase activity of PmHS2(T344S) is improved by 2.08 times of the activity of the PmHS2 of the protoparin skeleton synthase, and the GlcA transferase activity is improved by 3.35 times of the activity of the PmHS2 of the protoparin skeleton synthase.
The GlcNAc transferase activity of PmHS2(I476F) is increased to 4.33 times of the activity of PmHS2, and the GlcA transferase activity is increased to 2.29 times of the activity of PmHS 2.
The activity of PmHS2(D165Y), PmHS2(F267L), PmHS2(H297N) and PmHS2(S548A) is not obviously changed.
From the data, compared with the heparin framework synthase PmHS2, the heparin framework synthase PmHS2 mutant provided by the invention has very high GlcNAc transferase activity and GlcA transferase activity which are respectively 2-3.5 times and 1.5-3.5 times of the heparin framework synthase PmHS2, and can effectively synthesize a GlcNAc-GlcA-pNP heparin framework by using UDP-GlcNAc and GlcA-pNP as substrates under the optimal condition; or the substrate UDP-GlcA and GlcNAc-GlcA-pNP are utilized to effectively synthesize the GlcA-GlcNAc-GlcA-pNP heparin framework, the GlcNAc transfer synthesis efficiency and the GlcA transfer synthesis efficiency in the synthesis of the heparin framework are improved, the application development of the heparin biomimetic synthesis is greatly promoted, and a brand new page is opened for the research and development of glycosaminoglycan.
Example 3 stability study of heparin scaffold synthase PmHS2 mutant
The current activities of the available PmHS2 and the PmHS2 mutant protein obtained in example 1 were determined by subjecting them to a reaction system shown in example 2, in which commercial GlcA-pNP (final concentration of 0.2mM) was used as an acceptor substrate and UDP-GlcNAc (final concentration of 0.3mM) was used as a donor substrate after 0h, 24h, 48h, and 72h, respectively, at 37 ℃. After the above reaction was placed in a 37 ℃ water bath for 30min, boiling water was heated for 5min to inactivate protease and terminate the reaction, the reaction solution was filtered through a 0.22 μm filter membrane and subjected to HPLC detection as described in Table 1, the pNP group of the monosaccharide acceptor was specifically absorbed at a detection wavelength of ultraviolet 310nm, the mobile phase flow rate was 0.5mL/min, and the detection results are shown in FIG. 3.
As can be seen from FIG. 3, the PmHS2 mutant of heparin framework synthase provided by the invention has good stability, and can still maintain more than 55% of enzyme activity after being placed at 37 ℃ for 24h, and can still maintain more than 40% of enzyme activity after being placed at 37 ℃ for 72 h. The enzyme activity of the heparin skeleton synthase PmHS2 is reduced to 40 percent after being placed at 37 ℃ for 24 hours, and the enzyme activity is reduced to about 10 percent after being placed at 37 ℃ for 72 hours. Wherein the purified proteins of PmHS2(F197Y), PmHS2(S135A) and PmHS2(T344S) can still keep more than 80% of enzyme activity after being placed at 37 ℃ for 72 hours, and are 5-7 times of the enzyme activity of the PmHS2 of heparin skeleton synthase.
SEQUENCE LISTING
<110> Shandong university
<120> high-activity heparin framework synthase PmHS2 mutant and application thereof
<160> 12
<170> PatentIn version 3.5
<210> 1
<211> 1956
<212> DNA
<213> Artificial sequence
<400> 1
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttacc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 2
<211> 651
<212> PRT
<213> Artificial sequence
<400> 2
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Thr Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 3
<211> 1956
<212> DNA
<213> Artificial sequence
<400> 3
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccgcgcacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 4
<211> 651
<212> PRT
<213> Artificial sequence
<400> 4
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ala His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 5
<211> 1956
<212> DNA
<213> Artificial sequence
<400> 5
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctatta tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 6
<211> 651
<212> PRT
<213> Artificial sequence
<400> 6
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Tyr Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 7
<211> 1956
<212> DNA
<213> Artificial sequence
<400> 7
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcagcga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 8
<211> 651
<212> PRT
<213> Artificial sequence
<400> 8
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Ser Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 9
<211> 1956
<212> DNA
<213> Artificial sequence
<400> 9
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca gccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actatatcac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 10
<211> 651
<212> PRT
<213> Artificial sequence
<400> 10
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Ser Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Ile Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
<210> 11
<211> 1956
<212> DNA
<213> Artificial sequence
<400> 11
atgaagggta agaaagagat gacccagatt caaatcgcga agaacccgcc gcaacacgag 60
aaagaaaacg agctgaacac ctttcagaac aaaatcgata gcctgaagac caccctgaac 120
aaagacatca ttagccagca aaccctgctg gcgaaacaag acagcaagca cccgctgagc 180
gcgagcctgg aaaacgagaa caaactgctg ctgaagcagc tgcaactggt gctgcaagaa 240
tttgagaaga tttacaccta taaccaggcg ctggaagcga aactggagaa ggataaacag 300
accaccagca tcaccgacct gtataacgaa gttgcgaaga gcgatctggg cctggtgaaa 360
gaaaccaaca gcgcgaaccc gctggttagc atcattatga ccagccacaa caccgcgcaa 420
ttcattgaag cgagcatcaa cagcctgctg ctgcagacct acaagaacat cgaaatcatt 480
atcgtggacg atgacagcag cgacaacacc tttgagattg cgagccgtat cgcgaacacc 540
accagcaaag tgcgtgtttt ccgtctgaac agcaacctgg gtacctattt tgcgaaaaac 600
accggtatcc tgaagagcaa aggcgacatt atcttctttc aggatagcga tgacgtttgc 660
caccacgaac gtattgagcg ttgcgtgaac atcctgctgg cgaacaaaga aaccatcgcg 720
gttcgttgcg cgtacagccg tctggcgccg gaaacccaac acattatcaa ggtgaacaac 780
atggactatc gtctgggttt cattaccctg ggcatgcacc gtaaagtttt tcaggagatc 840
ggcttcttta actgcaccac caagggtagc gatgacgagt tcttccaccg tattgcgaaa 900
tactatggca aggagaaaat caagaacctg ctgctgccgc tgtactataa caccatgcgt 960
gaaaacagcc tgttcaccga catggtggag tggatcgata accacaacat tatccagaag 1020
atgagcgaca cccgtcaaca ctacgcgacc ctgttccagg cgatgcacaa cgaaaccgcg 1080
agccacgatt ttaaaaacct gttccaattt ccgcgtattt acgacgcgct gccggttccg 1140
caggagatga gcaagctgag caacccgaaa atcccggtgt atattaacat ctgcagcatt 1200
ccgagccgta tcgcgcaact gcgtcgtatt atcggtattc tgaagaacca gtgcgaccac 1260
ttccacatct acctggatgg ctatgttgaa attccggact ttatcaagaa cctgggtaac 1320
aaagcgaccg tggttcactg caaagacaag gataacagca ttcgtgacaa cggcaagttc 1380
attctgctgg aggaactgat cgagaagaac caggatggtt actattttac ctgcgatgac 1440
gatattatct acccgagcga ctatattaac accatgatca agaaactgaa cgagtacgac 1500
gataaagcgg ttattggtct gcacggcatc ctgttcccga gccgtatgac caagtatttt 1560
agcgcggatc gtctggtgta cagcttctat aaaccgctgg agaaagacaa ggcggtgaac 1620
gttctgggta ccggcaccgt tagctttcgt gtgagcctgt tcaaccaatt tagcctgagc 1680
gatttcaccc acagcggtat ggcggacatt tactttagcc tgctgtgcaa gaaaaacaac 1740
atcctgcaga tttgcatcag ccgtccggcg aactggctga ccgaagacaa ccgtgatagc 1800
gaaaccctgt accaccaata tcgtgacaac gatgaacagc aaacccagct gattatggag 1860
aacggtccgt ggggctacag cagcatctat ccgctggtta aaaaccatcc gaagttcacc 1920
gacctgattc cgtgcctgcc gttctatttc ctgtaa 1956
<210> 12
<211> 651
<212> PRT
<213> Artificial sequence
<400> 12
Met Lys Gly Lys Lys Glu Met Thr Gln Ile Gln Ile Ala Lys Asn Pro
1 5 10 15
Pro Gln His Glu Lys Glu Asn Glu Leu Asn Thr Phe Gln Asn Lys Ile
20 25 30
Asp Ser Leu Lys Thr Thr Leu Asn Lys Asp Ile Ile Ser Gln Gln Thr
35 40 45
Leu Leu Ala Lys Gln Asp Ser Lys His Pro Leu Ser Ala Ser Leu Glu
50 55 60
Asn Glu Asn Lys Leu Leu Leu Lys Gln Leu Gln Leu Val Leu Gln Glu
65 70 75 80
Phe Glu Lys Ile Tyr Thr Tyr Asn Gln Ala Leu Glu Ala Lys Leu Glu
85 90 95
Lys Asp Lys Gln Thr Thr Ser Ile Thr Asp Leu Tyr Asn Glu Val Ala
100 105 110
Lys Ser Asp Leu Gly Leu Val Lys Glu Thr Asn Ser Ala Asn Pro Leu
115 120 125
Val Ser Ile Ile Met Thr Ser His Asn Thr Ala Gln Phe Ile Glu Ala
130 135 140
Ser Ile Asn Ser Leu Leu Leu Gln Thr Tyr Lys Asn Ile Glu Ile Ile
145 150 155 160
Ile Val Asp Asp Asp Ser Ser Asp Asn Thr Phe Glu Ile Ala Ser Arg
165 170 175
Ile Ala Asn Thr Thr Ser Lys Val Arg Val Phe Arg Leu Asn Ser Asn
180 185 190
Leu Gly Thr Tyr Phe Ala Lys Asn Thr Gly Ile Leu Lys Ser Lys Gly
195 200 205
Asp Ile Ile Phe Phe Gln Asp Ser Asp Asp Val Cys His His Glu Arg
210 215 220
Ile Glu Arg Cys Val Asn Ile Leu Leu Ala Asn Lys Glu Thr Ile Ala
225 230 235 240
Val Arg Cys Ala Tyr Ser Arg Leu Ala Pro Glu Thr Gln His Ile Ile
245 250 255
Lys Val Asn Asn Met Asp Tyr Arg Leu Gly Phe Ile Thr Leu Gly Met
260 265 270
His Arg Lys Val Phe Gln Glu Ile Gly Phe Phe Asn Cys Thr Thr Lys
275 280 285
Gly Ser Asp Asp Glu Phe Phe His Arg Ile Ala Lys Tyr Tyr Gly Lys
290 295 300
Glu Lys Ile Lys Asn Leu Leu Leu Pro Leu Tyr Tyr Asn Thr Met Arg
305 310 315 320
Glu Asn Ser Leu Phe Thr Asp Met Val Glu Trp Ile Asp Asn His Asn
325 330 335
Ile Ile Gln Lys Met Ser Asp Thr Arg Gln His Tyr Ala Thr Leu Phe
340 345 350
Gln Ala Met His Asn Glu Thr Ala Ser His Asp Phe Lys Asn Leu Phe
355 360 365
Gln Phe Pro Arg Ile Tyr Asp Ala Leu Pro Val Pro Gln Glu Met Ser
370 375 380
Lys Leu Ser Asn Pro Lys Ile Pro Val Tyr Ile Asn Ile Cys Ser Ile
385 390 395 400
Pro Ser Arg Ile Ala Gln Leu Arg Arg Ile Ile Gly Ile Leu Lys Asn
405 410 415
Gln Cys Asp His Phe His Ile Tyr Leu Asp Gly Tyr Val Glu Ile Pro
420 425 430
Asp Phe Ile Lys Asn Leu Gly Asn Lys Ala Thr Val Val His Cys Lys
435 440 445
Asp Lys Asp Asn Ser Ile Arg Asp Asn Gly Lys Phe Ile Leu Leu Glu
450 455 460
Glu Leu Ile Glu Lys Asn Gln Asp Gly Tyr Tyr Phe Thr Cys Asp Asp
465 470 475 480
Asp Ile Ile Tyr Pro Ser Asp Tyr Ile Asn Thr Met Ile Lys Lys Leu
485 490 495
Asn Glu Tyr Asp Asp Lys Ala Val Ile Gly Leu His Gly Ile Leu Phe
500 505 510
Pro Ser Arg Met Thr Lys Tyr Phe Ser Ala Asp Arg Leu Val Tyr Ser
515 520 525
Phe Tyr Lys Pro Leu Glu Lys Asp Lys Ala Val Asn Val Leu Gly Thr
530 535 540
Gly Thr Val Ser Phe Arg Val Ser Leu Phe Asn Gln Phe Ser Leu Ser
545 550 555 560
Asp Phe Thr His Ser Gly Met Ala Asp Ile Tyr Phe Ser Leu Leu Cys
565 570 575
Lys Lys Asn Asn Ile Leu Gln Ile Cys Ile Ser Arg Pro Ala Asn Trp
580 585 590
Leu Thr Glu Asp Asn Arg Asp Ser Glu Thr Leu Tyr His Gln Tyr Arg
595 600 605
Asp Asn Asp Glu Gln Gln Thr Gln Leu Ile Met Glu Asn Gly Pro Trp
610 615 620
Gly Tyr Ser Ser Ile Tyr Pro Leu Val Lys Asn His Pro Lys Phe Thr
625 630 635 640
Asp Leu Ile Pro Cys Leu Pro Phe Tyr Phe Leu
645 650
Claims (10)
1. The mutant is characterized in that the mutant is any one site mutation of 130 th serine, 135 th serine, 197 th phenylalanine, 326 th threonine, 344 th threonine and 476 th isoleucine of heparin framework synthase PmHS 2.
2. The high-activity heparin framework synthase PmHS2 mutant according to claim 1, wherein the mutant is a heparin framework synthase PmHS2 mutant (S130T), the amino acid sequence of which is shown as SEQ ID No.2, and the nucleotide sequence of the coding gene is shown as SEQ ID No. 1; the 130 th serine of the amino acid sequence of heparin skeleton synthase PmHS2 is mutated into threonine.
3. The high-activity heparin framework synthase PmHS2 mutant according to claim 1, wherein the mutant is a heparin framework synthase PmHS2 mutant (S135A), the amino acid sequence of which is shown as SEQ ID No.4, and the nucleotide sequence of the coding gene is shown as SEQ ID No. 3; the 135 th serine of the amino acid sequence of heparin skeleton synthase PmHS2 is mutated into alanine.
4. The high-activity heparin framework synthase PmHS2 mutant according to claim 1, wherein the mutant is a heparin framework synthase PmHS2 mutant (F197Y), the amino acid sequence of which is shown as SEQ ID No.6, and the nucleotide sequence of the coding gene is shown as SEQ ID No. 5; the phenylalanine at position 197 of an amino acid sequence of heparin skeleton synthase PmHS2 is mutated into lysine.
5. The high-activity heparin framework synthase PmHS2 mutant according to claim 1, wherein the mutant is a heparin framework synthase PmHS2 mutant (T326S), the amino acid sequence of which is shown as SEQ ID No.8, and the nucleotide sequence of the coding gene is shown as SEQ ID No. 7; the threonine at the 326 th site of the amino acid sequence of the heparin skeleton synthase PmHS2 is mutated into serine.
6. The high-activity heparin framework synthase PmHS2 mutant according to claim 1, wherein the mutant is a heparin framework synthase PmHS2 mutant (T344S), the amino acid sequence of which is shown as SEQ ID No.10, and the nucleotide sequence of the coding gene is shown as SEQ ID No. 9; the method is characterized in that threonine at the 344 th site of an amino acid sequence of heparin skeleton synthase PmHS2 is mutated into serine.
7. The high-activity heparin framework synthase PmHS2 mutant according to claim 1, wherein the mutant is a heparin framework synthase PmHS2 mutant (I476F), the amino acid sequence of which is shown as SEQ ID No.12, and the nucleotide sequence of the coding gene is shown as SEQ ID No. 11; isoleucine at the 476 th site of an amino acid sequence of heparin skeleton synthase PmHS2 is mutated into phenylalanine.
8. A recombinant vector, which is characterized in that a gene encoding the heparin skeleton synthase PmHS2 mutant according to any one of claims 2 to 7 is inserted into a plasmid vector;
preferably, the plasmid vector is pET-28a (+).
9. A recombinant strain obtained by transforming the recombinant vector according to claim 8 into a host cell;
preferably, the host cell is E.coli.
10. Use of the heparin scaffold synthase mutant according to any one of claims 1 to 7 for synthesizing a heparin scaffold;
preferably, the application takes UDP-GlcNAc as a donor substrate and GlcA-pNP as an acceptor substrate, and the heparin framework with the structure of GlcNAc-GlcA-pNP is generated by catalytic reaction; or carrying out catalytic reaction by using UDP-GlcA as a donor substrate and GlcNAc-GlcA-pNP as an acceptor substrate to generate a heparin framework with the structure of GlcA-GlcNAc-GlcA-pNP.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110881483.0A CN113528482B (en) | 2021-08-02 | 2021-08-02 | High-activity heparin skeleton synthase PmHS2 mutant and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110881483.0A CN113528482B (en) | 2021-08-02 | 2021-08-02 | High-activity heparin skeleton synthase PmHS2 mutant and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113528482A true CN113528482A (en) | 2021-10-22 |
CN113528482B CN113528482B (en) | 2024-02-06 |
Family
ID=78121877
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110881483.0A Active CN113528482B (en) | 2021-08-02 | 2021-08-02 | High-activity heparin skeleton synthase PmHS2 mutant and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113528482B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2002256501B2 (en) * | 2001-05-08 | 2008-05-29 | The Board Of Regents Of The University Of Oklahoma | Heparin/heparosan synthase and methods of making and using same |
CN101855354A (en) * | 2007-09-12 | 2010-10-06 | 拜尔作物科学股份公司 | Plants which synthesize increased amounts of glucosaminglycans |
WO2014132468A1 (en) * | 2013-03-01 | 2014-09-04 | 独立行政法人理化学研究所 | Sugar chain compound and method for producing sugar chain compound |
CN107189992A (en) * | 2017-06-29 | 2017-09-22 | 江南大学 | A kind of heparosan synthase and its application |
-
2021
- 2021-08-02 CN CN202110881483.0A patent/CN113528482B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2002256501B2 (en) * | 2001-05-08 | 2008-05-29 | The Board Of Regents Of The University Of Oklahoma | Heparin/heparosan synthase and methods of making and using same |
CN101855354A (en) * | 2007-09-12 | 2010-10-06 | 拜尔作物科学股份公司 | Plants which synthesize increased amounts of glucosaminglycans |
WO2014132468A1 (en) * | 2013-03-01 | 2014-09-04 | 独立行政法人理化学研究所 | Sugar chain compound and method for producing sugar chain compound |
CN107189992A (en) * | 2017-06-29 | 2017-09-22 | 江南大学 | A kind of heparosan synthase and its application |
Also Published As
Publication number | Publication date |
---|---|
CN113528482B (en) | 2024-02-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110734899B (en) | Sucrose phosphorylase mutant with improved enzyme activity and construction method and application thereof | |
US8771995B2 (en) | Enzymatic synthesis of sulfated polysaccharides | |
EP2655421B1 (en) | Chemoenzymatic synthesis of structurally homogeneous ultra-low molecular weight heparins | |
US11572549B2 (en) | Engineered aryl sulfate-dependent enzymes | |
WO2004017910A2 (en) | Total synthesis of heparin | |
CN108531470B (en) | Fucoidin lyase TFLFM and preparation method and application thereof | |
CN113528482B (en) | High-activity heparin skeleton synthase PmHS2 mutant and application thereof | |
US20170204381A1 (en) | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds | |
US8852891B2 (en) | N-acetylglucosamine-2-epimerase and method for producing CMP-neuraminic acid using the same | |
CN110055234B (en) | Chondroitin sulfate synthase and coding gene and application thereof | |
WO2023035584A1 (en) | Construction and application of yeast engineering bacteria for fermentative production of heparin | |
US20040197868A1 (en) | Heparin/heparosan synthase from P. multocida, soluble and single action catalysts thereof and methods of making and using same | |
CN113564139B (en) | Pasteurella heparin framework synthase BtHS1, mutant and application thereof | |
CN114616340A (en) | Process for the synthesis of anticoagulant polysaccharides | |
CN110643586B (en) | Heparin skeleton synthase and coding gene and application thereof | |
CN114958790B (en) | Heparin skeleton synthase, mutant and application thereof | |
US20170247732A1 (en) | Methods of Producing Testosteronan Polymers Using Testosteronan Synthase | |
CN114703159A (en) | Glucosyltransferase mutant and application thereof | |
CN117987286A (en) | Method for synthesizing neohesperidin and neohesperidin dihydrochalcone by utilizing glycerol in yarrowia lipolytica | |
JP2005065565A (en) | Modified enzyme | |
Bethea | Structural and functional analysis of heparan sulfate sulfotransferases | |
WO2004089305A2 (en) | Heparin/heparosan synthase from p.multocida, soluble and single action catalysts thereof and methods of making and using same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240425 Address after: Tianchen Avenue, Ji'nan hi tech Development Zone of Shandong Province, No. 678 250101 Patentee after: BLOOMAGE BIOTECH Co.,Ltd. Country or region after: China Address before: 250012 No. 44 West Wenhua Road, Lixia District, Shandong, Ji'nan Patentee before: SHANDONG University Country or region before: China |
|
TR01 | Transfer of patent right |