CN115466756A - Transaminase, immobilized transaminase and application of transaminase to preparation of sitagliptin - Google Patents
Transaminase, immobilized transaminase and application of transaminase to preparation of sitagliptin Download PDFInfo
- Publication number
- CN115466756A CN115466756A CN202110655349.9A CN202110655349A CN115466756A CN 115466756 A CN115466756 A CN 115466756A CN 202110655349 A CN202110655349 A CN 202110655349A CN 115466756 A CN115466756 A CN 115466756A
- Authority
- CN
- China
- Prior art keywords
- seq
- leu
- val
- gly
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000340 Transaminases Proteins 0.000 title claims abstract description 143
- 102000003929 Transaminases Human genes 0.000 title claims abstract description 139
- MFFMDFFZMYYVKS-SECBINFHSA-N sitagliptin Chemical compound C([C@H](CC(=O)N1CC=2N(C(=NN=2)C(F)(F)F)CC1)N)C1=CC(F)=C(F)C=C1F MFFMDFFZMYYVKS-SECBINFHSA-N 0.000 title claims abstract description 43
- 229960004034 sitagliptin Drugs 0.000 title claims abstract description 41
- 238000002360 preparation method Methods 0.000 title claims abstract description 24
- 238000006243 chemical reaction Methods 0.000 claims abstract description 59
- 239000000758 substrate Substances 0.000 claims abstract description 36
- 229920005989 resin Polymers 0.000 claims abstract description 30
- 239000011347 resin Substances 0.000 claims abstract description 30
- FZRKAZHKEDOPNN-UHFFFAOYSA-N Nitric oxide anion Chemical compound O=[N-] FZRKAZHKEDOPNN-UHFFFAOYSA-N 0.000 claims abstract description 26
- 229910052739 hydrogen Inorganic materials 0.000 claims abstract description 8
- 238000004519 manufacturing process Methods 0.000 claims abstract description 5
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 11
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 45
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 claims description 38
- 239000002773 nucleotide Substances 0.000 claims description 32
- 125000003729 nucleotide group Chemical group 0.000 claims description 32
- 239000000243 solution Substances 0.000 claims description 22
- 235000007682 pyridoxal 5'-phosphate Nutrition 0.000 claims description 19
- 239000011589 pyridoxal 5'-phosphate Substances 0.000 claims description 19
- 229960001327 pyridoxal phosphate Drugs 0.000 claims description 19
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 19
- 102000004190 Enzymes Human genes 0.000 claims description 13
- 108090000790 Enzymes Proteins 0.000 claims description 13
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical group CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 claims description 13
- 238000000034 method Methods 0.000 claims description 13
- 239000007810 chemical reaction solvent Substances 0.000 claims description 12
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 12
- 239000007864 aqueous solution Substances 0.000 claims description 10
- 229960004115 sitagliptin phosphate Drugs 0.000 claims description 10
- RTZRUVMEWWPNRR-UHFFFAOYSA-N tert-butyl n-(3-iodo-1h-pyrrolo[2,3-b]pyridin-5-yl)carbamate Chemical compound CC(C)(C)OC(=O)NC1=CN=C2NC=C(I)C2=C1 RTZRUVMEWWPNRR-UHFFFAOYSA-N 0.000 claims description 10
- 238000003756 stirring Methods 0.000 claims description 9
- QAEDTLFWHIEVPK-UHFFFAOYSA-N 1-[3-(trifluoromethyl)-6,8-dihydro-5h-[1,2,4]triazolo[4,3-a]pyrazin-7-yl]-4-(2,4,5-trifluorophenyl)butane-1,3-dione Chemical group C1=C(F)C(F)=CC(F)=C1CC(=O)CC(=O)N1CC2=NN=C(C(F)(F)F)N2CC1 QAEDTLFWHIEVPK-UHFFFAOYSA-N 0.000 claims description 8
- 230000001580 bacterial effect Effects 0.000 claims description 6
- 239000013604 expression vector Substances 0.000 claims description 6
- 238000001914 filtration Methods 0.000 claims description 6
- 108091033319 polynucleotide Proteins 0.000 claims description 6
- 239000002157 polynucleotide Substances 0.000 claims description 6
- 102000040430 polynucleotide Human genes 0.000 claims description 6
- 238000003259 recombinant expression Methods 0.000 claims description 6
- 239000003822 epoxy resin Substances 0.000 claims description 5
- 229920000647 polyepoxide Polymers 0.000 claims description 5
- 239000010802 sludge Substances 0.000 claims description 5
- 241000672609 Escherichia coli BL21 Species 0.000 claims description 4
- 238000005406 washing Methods 0.000 claims description 4
- 241000588724 Escherichia coli Species 0.000 claims description 2
- 239000013612 plasmid Substances 0.000 claims description 2
- 230000035484 reaction time Effects 0.000 claims description 2
- 230000035772 mutation Effects 0.000 abstract description 5
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 238000009776 industrial production Methods 0.000 abstract description 3
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 50
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 50
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 50
- 108010047857 aspartylglycine Proteins 0.000 description 50
- 108010068265 aspartyltyrosine Proteins 0.000 description 50
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 46
- 150000001413 amino acids Chemical group 0.000 description 40
- 108010093096 Immobilized Enzymes Proteins 0.000 description 29
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 25
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 25
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 25
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 25
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 25
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 25
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 25
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 25
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 25
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 25
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 25
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 25
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 25
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 25
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 25
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 25
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 25
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 25
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 25
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 25
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 25
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 25
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 25
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 25
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 25
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 25
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 25
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 25
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 25
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 25
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 25
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 25
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 25
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 25
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 25
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 25
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 25
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 25
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 25
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 25
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 25
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 25
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 25
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 25
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 25
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 25
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 25
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 25
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 25
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 25
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 25
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 25
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 25
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 25
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 25
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 25
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 25
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 25
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 25
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 25
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 25
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 25
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 25
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 25
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 25
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 25
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 25
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 25
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 25
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 25
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 25
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 25
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 25
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 25
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 25
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 25
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 25
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 25
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 25
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 25
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 25
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 25
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 25
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 25
- 108010092114 histidylphenylalanine Proteins 0.000 description 25
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 25
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 25
- 108010003700 lysyl aspartic acid Proteins 0.000 description 25
- 108010051242 phenylalanylserine Proteins 0.000 description 25
- 108010004914 prolylarginine Proteins 0.000 description 25
- 108010029020 prolylglycine Proteins 0.000 description 25
- 108010026333 seryl-proline Proteins 0.000 description 25
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 25
- 108010073969 valyllysine Proteins 0.000 description 25
- 108010009962 valyltyrosine Proteins 0.000 description 25
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 22
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 22
- 108010044940 alanylglutamine Proteins 0.000 description 22
- 108010054812 diprotin A Proteins 0.000 description 22
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 21
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 21
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 21
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 17
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 17
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 17
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 17
- 108010005233 alanylglutamic acid Proteins 0.000 description 17
- 108010057821 leucylproline Proteins 0.000 description 16
- 230000014759 maintenance of location Effects 0.000 description 10
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 8
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 8
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 8
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 8
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 108090000623 proteins and genes Proteins 0.000 description 8
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 238000000926 separation method Methods 0.000 description 6
- 238000004587 chromatography analysis Methods 0.000 description 5
- JMMWKPVZQRWMSS-UHFFFAOYSA-N isopropyl acetate Chemical class CC(C)OC(C)=O JMMWKPVZQRWMSS-UHFFFAOYSA-N 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 238000005070 sampling Methods 0.000 description 5
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 4
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 4
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Natural products C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 4
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 4
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- MENOBBYDZHOWLE-UHFFFAOYSA-N morpholine-2,3-dione Chemical compound O=C1NCCOC1=O MENOBBYDZHOWLE-UHFFFAOYSA-N 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 239000002904 solvent Substances 0.000 description 4
- 238000000967 suction filtration Methods 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 3
- 229920002449 FKM Polymers 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 238000006555 catalytic reaction Methods 0.000 description 3
- 206010012601 diabetes mellitus Diseases 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 239000002994 raw material Substances 0.000 description 3
- 230000001603 reducing effect Effects 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- TUAXCHGULMWHIO-SECBINFHSA-N (3r)-3-[(2-methylpropan-2-yl)oxycarbonylamino]-4-(2,4,5-trifluorophenyl)butanoic acid Chemical compound CC(C)(C)OC(=O)N[C@@H](CC(O)=O)CC1=CC(F)=C(F)C=C1F TUAXCHGULMWHIO-SECBINFHSA-N 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 238000005576 amination reaction Methods 0.000 description 2
- 235000011114 ammonium hydroxide Nutrition 0.000 description 2
- 238000009903 catalytic hydrogenation reaction Methods 0.000 description 2
- 239000012295 chemical reaction liquid Substances 0.000 description 2
- 239000012141 concentrate Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000002242 deionisation method Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 239000000706 filtrate Substances 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 229940011051 isopropyl acetate Drugs 0.000 description 2
- GWYFCOCPABKNJV-UHFFFAOYSA-N isovaleric acid Chemical compound CC(C)CC(O)=O GWYFCOCPABKNJV-UHFFFAOYSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 208000030159 metabolic disease Diseases 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 2
- SPOMEWBVWWDQBC-UHFFFAOYSA-K tripotassium;dihydrogen phosphate;hydrogen phosphate Chemical compound [K+].[K+].[K+].OP(O)([O-])=O.OP([O-])([O-])=O SPOMEWBVWWDQBC-UHFFFAOYSA-K 0.000 description 2
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- DKRDJODCERTCFW-UHFFFAOYSA-N 1,2-dimorpholin-4-ylethane-1,2-dione Chemical class C1COCCN1C(=O)C(=O)N1CCOCC1 DKRDJODCERTCFW-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical class NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- 241000186063 Arthrobacter Species 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- PBCJIPOGFJYBJE-UHFFFAOYSA-N acetonitrile;hydrate Chemical compound O.CC#N PBCJIPOGFJYBJE-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- VZTDIZULWFCMLS-UHFFFAOYSA-N ammonium formate Chemical compound [NH4+].[O-]C=O VZTDIZULWFCMLS-UHFFFAOYSA-N 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 201000011529 cardiovascular cancer Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004296 chiral HPLC Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- HPNMFZURTQLUMO-UHFFFAOYSA-N diethylamine Chemical compound CCNCC HPNMFZURTQLUMO-UHFFFAOYSA-N 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- WJPYOCIWVYDFDT-UHFFFAOYSA-N ethyl 3-oxo-4-(2,4,5-trifluorophenyl)butanoate Chemical compound CCOC(=O)CC(=O)CC1=CC(F)=C(F)C=C1F WJPYOCIWVYDFDT-UHFFFAOYSA-N 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 239000012065 filter cake Substances 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- JEGUKCSWCFPDGT-UHFFFAOYSA-N h2o hydrate Chemical compound O.O JEGUKCSWCFPDGT-UHFFFAOYSA-N 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 230000003914 insulin secretion Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 229940035736 metformin and pioglitazone Drugs 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000003541 multi-stage reaction Methods 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000008055 phosphate buffer solution Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000008213 purified water Substances 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/182—Heterocyclic compounds containing nitrogen atoms as the only ring heteroatoms in the condensed system
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N11/00—Carrier-bound or immobilised enzymes; Carrier-bound or immobilised microbial cells; Preparation thereof
- C12N11/02—Enzymes or microbial cells immobilised on or in an organic carrier
- C12N11/08—Enzymes or microbial cells immobilised on or in an organic carrier the carrier being a synthetic polymer
- C12N11/089—Enzymes or microbial cells immobilised on or in an organic carrier the carrier being a synthetic polymer obtained otherwise than by reactions only involving carbon-to-carbon unsaturated bonds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1096—Transferases (2.) transferring nitrogenous groups (2.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/14—Nitrogen or oxygen as hetero atom and at least one other diverse hetero ring atom in the same ring
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
- C12R2001/19—Escherichia coli
Abstract
The invention provides an application of immobilized transaminase in preparation of sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone; the immobilized transaminase comprises resin and transaminase, and the amino acid sequence of the transaminase is shown as SEQ ID NO 5, SEQ ID NO 45, SEQ ID NO 47, SEQ ID NO 49 or variants thereof; the variant is that the R mutation at the 241 th position of the amino acid sequence shown as SEQ ID NO. 5 is A, H, N, S, L, T, C or M. The invention also provides immobilized transaminase, a transaminase, and a preparation method and application thereof. After the transaminase is prepared into the immobilized transaminase, the immobilized transaminase is used for catalyzing a ketoamide substrate to produce sitagliptin or an intermediate thereof, and has the advantages of high conversion rate, good stereoselectivity, good stability, improvement of reusability and simpler operation, so that the production cost is reduced, and the immobilized transaminase is beneficial to industrial production.
Description
Technical Field
The invention belongs to the technical field of biology, and particularly relates to transaminase and immobilized transaminase, application of the immobilized transaminase in preparation of sitagliptin or an intermediate thereof, and a preparation method of the sitagliptin or the intermediate thereof.
Background
Diabetes is a metabolic disease characterized by hyperglycemia accompanied by metabolic disorders of proteins, sugars, and fats, which occur due to altered insulin secretion, resulting in insulin deficiency and impaired action, or decreased insulin activity, or both. The degree of harm of diabetes and its complications to human health is the third place after cardiovascular diseases and tumors, and becomes an important disease harming human health. Of the four types of diabetes, type II diabetes accounts for over 90%, most of which are seen in middle aged and elderly people over 30 years old, and the cause of the disease is mainly due to the insensitivity of the body to insulin.
Sitagliptin phosphate (Sitagliptin phosphate) is the first dipeptidyl enzyme-IV (DPP-4) inhibitor approved by FDA for marketing in 2006 for the treatment of type II diabetes. The composition has obvious blood sugar reducing effect when being used singly or used together with metformin and pioglitazone, and has the advantages of safe taking, good tolerance and few adverse reactions.
U.S. Pat. No. 3,3534,3534 discloses that the transaminase obtained by modifying the transaminase from Arthrobacter by Codexes catalyzes 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazine-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one to obtain sitagliptin ((2R) -4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazine-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-amine), and the sitagliptin is further phosphorylated to obtain sitagliptin phosphate. The engineered transaminase was capable of converting the substrate 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one (abbreviated as sitagliptin herein) to a detectable level at 210nm by HPLC-UV under conditions of 2g/L of ketoamide substrate, 0.5M isopropylamine, 22 ℃,5%DMSO, 100. Mu.M pyridoxal phosphate (PLP) and 20mg/mL of the transaminase polypeptide. Wherein the catalytic conversion rate of the mutant (SEQ ID NO: 110) with the best effect can reach 90-95%.
US9617573 continuing with the modification of SEQ ID NO:110 in US8293507, the resulting mutant was capable of converting 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one to sitagliptin at an activity at least 1.2 times that of SEQ ID NO:110 under conditions of 50g/L of ketoamide substrate, 1.5M isopropylamine (isopropylamine), 55 ℃,50 ℃ DMSO (v/v), 1mM pyridoxalphosphate. Wherein the catalytic conversion rate of the mutant (SEQ ID NO: 130) with the best effect can reach 90-95%.
However, these transaminase mutants are unstable in 100% organic solvents, and thus, the liquid enzyme is immobilized to obtain an immobilized enzyme, so as to improve the stability of the transaminase in organic solvents. As disclosed in US9587229 for immobilization of SEQ ID NO:110 in US8293507 on SEPABEADS EXE120 resin, the results show that the SEPABEADS EXE120 immobilized enzyme of SEQ ID NO:110 is capable of catalyzing 100g/L of a substrate of 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one to sitagliptin in a water saturated IPAc (isopropyl acetate) solvent with an ee value above 99.9%. However, US9587229 does not study the reusability (number of application batches) of the immobilized enzyme, and the reaction solvent is a water-saturated IPAc (isopropyl acetate) solvent, when the solvent is used for production, IPAc needs to be saturated with water in advance, so the operation is more complicated, and if the solvent is used for multiple batches of application reactions, the water content in the reaction system is not easy to control, the water activity of the immobilized enzyme is affected, and the immobilized enzyme is easy to inactivate and denature.
WO2019011236A1 reports that 1-morpholine-4- (2,4,5-trifluorophenyl) -1,3-butanedione (also referred to as morpholine dione in the invention) can be prepared into (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone through transaminase catalysis, and then sitagliptin is prepared through multi-step reaction.
Disclosure of Invention
The invention aims to overcome the defects that in the prior art, when transaminase is prepared into immobilized enzyme, the stability is still poor when the transaminase is used in a reaction system, and further when the transaminase is used for catalyzing a ketoamide substrate to produce sitagliptin or an intermediate thereof, the conversion rate is not high, the transaminase cannot be recycled, and the like, and provides the transaminase, the immobilized transaminase and the application of the transaminase in preparing the sitagliptin or the intermediate thereof. After the transaminase is prepared into the immobilized transaminase, the immobilized transaminase is used for catalyzing a ketoamide substrate to produce sitagliptin or an intermediate thereof, and has the advantages of high conversion rate, good stereoselectivity, good stability, improved reusability and simpler operation, thereby reducing the production cost and being beneficial to industrial production.
The present inventors have conducted extensive studies on transaminases in the prior art, and found that when certain specific sites are mutated, the obtained transaminase is immobilized, and then used for catalyzing a ketoamide substrate, the transaminase has higher stability and higher conversion rate, and the transaminase is used for producing sitagliptin or an intermediate thereof at a lower cost.
In order to solve the technical problems, the invention provides a use of immobilized transaminase in the preparation of sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone;
wherein the immobilized transaminase comprises a resin and a transaminase, and the amino acid sequence of the transaminase is shown in SEQ ID NO 5, SEQ ID NO 45, SEQ ID NO 47, SEQ ID NO 49 or variants thereof; the variant is that the R mutation at the 241 th position of the amino acid sequence shown as SEQ ID NO. 5 is A, H, N, S, L, T, C or M.
Preferably, the amino acid sequence of the variant is shown as SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43. More preferably, the nucleotide sequence of the transaminase is preferably as shown in SEQ ID NO 6, SEQ ID NO 12, SEQ ID NO 16, SEQ ID NO 22, SEQ ID NO 26, SEQ ID NO 38, SEQ ID NO 40, SEQ ID NO 42, SEQ ID NO 44, SEQ ID NO 46, SEQ ID NO 48 or SEQ ID NO 50;
preferably, the transaminase is attached to the resin by covalent bonding.
Preferably, the resin is preferably an epoxy resin, preferablyAnd (4) HFA. In a preferred embodiment, the resin is purchased from Viton, suzhou, chromatography, separation and purification, incAn HFA resin.
Preferably, the immobilized transaminase has a concentration of the resin-borne transaminase of 150 to 300mg/g (i.e., a mass of 150 to 300mg per gram of the resin-borne transaminase), for example 250mg/g.
Preferably, said use comprises the step of catalyzing a ketoamide substrate with said immobilized transaminase in a reaction solvent in the presence of an amino donor to obtain said sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone.
More preferably, the reaction solvent is an aqueous isopropanol solution, preferably, the volume content of water in the aqueous isopropanol solution (the volume ratio of water to the whole solution) is 2-20%, and more preferably, the prepared reaction system further comprises a co-factor of transaminase, such as pyridoxal phosphate, and the concentration of the co-factor in the whole reaction system is preferably 0.5-5 mg/mL.
More preferably, the ketoamide substrate is 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one and/or 1-morpholine-4- (2,4,5-trifluorophenyl) -1,3-butanedione.
In the invention, the 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one (also referred to as sitaxdione in the invention) has the following specific structural formula:
in the invention, the specific structural formula of the 1-morpholine-4- (2,4,5-trifluorophenyl) -1,3-butanedione (also referred to as morpholine dione in the invention) is as follows:
more preferably, the amino donor is isopropylamine.
More preferably, the molar ratio of the amino donor to the ketoamide substrate is 1:1-5:1.
More preferably, the concentration of the ketoamide substrate in the whole reaction system is 20-200 g/L.
More preferably, the mass ratio of the immobilized transaminase to the ketoamide substrate is 1:1 to 6:1, e.g., 3, 2.5.
More preferably, the temperature of the reaction is 30 to 60 ℃, preferably 45 ℃.
In order to solve the above technical problems, the second aspect of the present invention provides an immobilized transaminase comprising a resin and a transaminase, the transaminase having an amino acid sequence shown in SEQ ID NO 5, SEQ ID NO 45, SEQ ID NO 47, SEQ ID NO 49 or variants thereof; the variant is that the R mutation at position 241 of the amino acid sequence shown as SEQ ID NO. 5 is A, H, N, S, L, T, C or M.
Preferably, the amino acid sequence of the variant is shown as SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43. More preferably, the nucleotide sequence of the transaminase is preferably as shown in SEQ ID NO 6, 12, 16, 22, 26, 38, 40, 42, 44, 46, 48 or 50.
Preferably, the transaminase is attached to the resin by covalent bonding.
Preferably, the resin is preferably an epoxy resin, preferablyHFA. In a preferred embodiment, the resin is purchased from Viton, suzhou, chromatography, separation and purification, incHFA resin.
Preferably, the immobilized transaminase has a concentration of the resin-borne transaminase of 150 to 300mg/g (i.e., a mass of 150 to 300mg per gram of the resin-borne transaminase), for example 250mg/g. In order to solve the above technical problems, the third aspect of the present invention provides a method for preparing an immobilized transaminase, which comprises:
1) Contacting the solution of transaminase enzyme with a resin to form an immobilized transaminase enzyme, the transaminase enzyme having an amino acid sequence set forth in SEQ ID NO 5, SEQ ID NO 45, SEQ ID NO 47, SEQ ID NO 49, or variants thereof; the variant is that the R at the 241 th position of the amino acid sequence shown as SEQ ID NO. 5 is mutated into A, H, N, S, L, T, C or M;
2) Filtering and washing the immobilized transaminase.
Preferably, the amino acid sequence of the variant is shown as SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43; the preferred nucleotide sequence of the transaminase is shown in SEQ ID NO 6, SEQ ID NO 12, SEQ ID NO 16, SEQ ID NO 22, SEQ ID NO 26, SEQ ID NO 38, SEQ ID NO 40, SEQ ID NO 42, SEQ ID NO 44, SEQ ID NO 46, SEQ ID NO 48 or SEQ ID NO 50.
Preferably, the transaminase is attached to the resin by covalent bonding.
Preferably, the resin is preferably an epoxy resin, preferablyHFA. In a preferred embodiment, the resin is purchased from Viton, suzhou, chromatography, separation and purification, incHFA resin.
Preferably, the transaminase is present in the form of transaminase bacterial sludge, and the mass ratio of the transaminase bacterial sludge to the resin is 3:1-1:2, such as 5:4.
Preferably, the temperature of the reaction is 10 to 40 ℃, preferably 20 to 25 ℃.
Preferably, the reaction time is 10 to 30 hours, preferably 20 to 25 hours.
Preferably, the reaction is carried out with stirring, preferably at a rate of 50 to 300rpm, for example 150 or 200rpm.
In a preferred embodiment, the immobilized transaminase is prepared by the following method: adding 5g transaminase (e.g. in the form of bacterial sludge) into dipotassium hydrogen phosphate-potassium dihydrogen phosphate buffer (e.g. 50mL in volume, 100mM in concentration), homogenizing under high pressure, crushing cells, centrifuging, collecting supernatant enzyme solution, adding 10g K 2 HPO 4 ·3H 2 O、1g KH 2 PO 4 And 0.01g of PLP, dissolved with stirring, 4g of which are addedHFA resin (purchased from Suzhou Vietnam chromatography separation and purification Co., ltd.) is fixed, the temperature is 20-25 ℃, the shaking table is used for fixing for 20-25 h at 200rpm, and the immobilized enzyme is obtained through suction filtration, deionization washing and suction filtration.
In order to solve the above technical problems, the fourth aspect of the present invention provides a method for preparing sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone, comprising the step of catalyzing a ketoamide substrate with the immobilized transaminase according to the second aspect of the present invention in a reaction solvent in the presence of an amino donor to obtain sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone.
Preferably, the ketoamide substrate is 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one and/or 1-morpholine-4- (2,4,5-trifluorophenyl) -1,3-butanedione.
Preferably, the reaction solvent is an aqueous isopropanol solution.
Preferably, the amino donor is isopropylamine.
Preferably, the molar ratio of the amino donor to the ketoamide substrate is 1:1-5:1.
Preferably, when the reaction solvent is an aqueous isopropanol solution, the volume content of water (the volume ratio of water to the whole solution) is 2-20%.
Preferably, the concentration of the ketoamide substrate in the whole reaction system is 20 g/L-200 g/L.
Preferably, the mass ratio of immobilized transaminase to substrate is 1:1 to 6:1, e.g. 3.
Preferably, the prepared reaction system also comprises a co-factor of transaminase, such as pyridoxal phosphate, and the concentration of the co-factor in the whole reaction system is preferably 0.5-5 mg/mL.
Preferably, the temperature of the reaction is 30-60 ℃, preferably 45 ℃.
In order to solve the above technical problems, the fifth aspect of the present invention provides a transaminase whose amino acid sequence is represented by SEQ ID NO. 5, SEQ ID NO. 45, SEQ ID NO. 47, SEQ ID NO. 49 or variants thereof; the variant is a mutation from R to A, H, N, S, L, T, C or M at position 241 of the amino acid sequence shown in SEQ ID NO: 5.
Preferably, the amino acid sequence of the variant is shown as SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43;
more preferably, the nucleotide sequence of the transaminase is shown as SEQ ID NO 6, SEQ ID NO 12, SEQ ID NO 16, SEQ ID NO 22, SEQ ID NO 26, SEQ ID NO 38, SEQ ID NO 40, SEQ ID NO 42, SEQ ID NO 44, SEQ ID NO 46, SEQ ID NO 48 or SEQ ID NO 50.
In order to solve the above technical problems, a sixth aspect of the present invention provides a polynucleotide encoding the transaminase according to the fifth aspect of the present invention.
In order to solve the above technical problems, the seventh aspect of the present invention provides a recombinant expression vector comprising the polynucleotide according to the sixth aspect of the present invention.
Preferably, the backbone of the recombinant expression vector is plasmid pET21a.
In order to solve the above technical problems, an eighth aspect of the present invention provides a transformant obtained by introducing the polynucleotide according to the sixth aspect of the present invention or the recombinant expression vector according to the seventh aspect of the present invention into a host.
Preferably, the host is escherichia coli; escherichia coli BL21 is preferred.
In order to solve the above technical problems, a ninth aspect of the present invention provides a use of the transaminase according to the fifth aspect of the present invention in preparing sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone.
Preferably, the use comprises the step of catalyzing a ketoamide substrate with a transaminase in a reaction solvent in the presence of an amino donor to obtain the sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone.
More preferably, the reaction solvent is an aqueous isopropanol solution, preferably, the volume content of water in the aqueous isopropanol solution (the volume ratio of water to the whole solution) is 2-20%, and more preferably, the prepared reaction system further comprises a co-factor of transaminase, such as pyridoxal phosphate, and the concentration of the co-factor in the whole reaction system is preferably 0.5-5 mg/mL.
More preferably, the ketoamide substrate is 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one and/or 1-morpholine-4- (2,4,5-trifluorophenyl) -1,3-butanedione.
More preferably, the amino donor is isopropylamine.
More preferably, the molar ratio of the amino donor to the ketoamide substrate is 1:1 to 5:1.
More preferably, the concentration of the ketoamide substrate in the whole reaction system is 20-200 g/L.
More preferably, the mass ratio of said transaminase to said ketoamide substrate is 1:1 to 6:1, e.g., 3.
More preferably, the transaminase is present as an immobilized transaminase.
More preferably, the prepared reaction system also comprises a co-factor of transaminase, such as pyridoxal phosphate, and the concentration of the co-factor in the whole reaction system is preferably 0.5-5 mg/mL.
More preferably, the temperature of the reaction is 30 to 60 ℃, preferably 45 ℃.
In the present invention, the amount of isopropanol in the aqueous isopropanol solution is sufficient to dissolve the substrate completely. The volume content of water in the isopropanol aqueous solution (the volume ratio of water to the whole solution) can be 2-20%, if the water is too little, the immobilized enzyme can be deactivated, and if the water is too much, the substrate can not be completely dissolved.
In one aspect, the invention further provides a preparation method of sitagliptin phosphate, which comprises the following steps:
(1) Preparing sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone according to the preparation method according to the fourth aspect of the invention;
(2) And (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone prepared in the step (1) is reacted to obtain sitagliptin phosphate.
Preferably, the sitagliptin phosphate is sitagliptin phosphate monohydrate.
In a certain aspect, the invention also provides an enzyme preparation comprising an immobilized transaminase according to the second aspect of the invention or a transaminase according to the fifth aspect of the invention;
preferably, the enzyme preparation further comprises a co-factor for transaminase, such as pyridoxal phosphate.
In the present invention, the enzyme preparation may be generally an enzyme preparation obtained from a transformant host cell containing transaminase obtained from a culture or a culture solution thereof, or a product obtained by processing the same; wherein the product is an extract obtained from a transformant host cell, an isolated product obtained by isolating or purifying transaminase in the extract, or an immobilized product obtained by immobilizing a transformant cell and an extract thereof or an isolated product of the extract.
On the basis of the common knowledge in the field, the above preferred conditions can be combined randomly to obtain the preferred embodiments of the invention.
The reagents and starting materials used in the present invention are commercially available.
The positive progress effects of the invention are as follows: after the transaminase is prepared into the immobilized transaminase, the immobilized transaminase is used for catalyzing a ketoamide substrate to produce a sitagliptin intermediate, and the immobilized transaminase has the advantages of high conversion rate, good stereoselectivity, good stability, improved reusability and simpler operation, thereby reducing the production cost and being beneficial to industrial production.
Detailed Description
The invention is further illustrated by the following examples, which are not intended to limit the scope of the invention. The experimental methods without specifying specific conditions in the following examples were selected according to the conventional methods and conditions, or according to the commercial instructions.
The experimental methods in the invention are conventional methods unless otherwise specified, and the gene cloning operation can be specifically referred to in molecular cloning experimental guidelines compiled by J. SammBruk et al.
The abbreviations of the amino acids in the present invention are those conventional in the art unless otherwise specified, and the amino acids corresponding to the specific abbreviations are shown in Table 1.
TABLE 1
The codons corresponding to the amino acids are also conventional in the art, and the corresponding relationship between specific amino acids and codons is shown in table 2.
TABLE 2
Pet21a was purchased from Novagen, inc.; ndeI enzyme, hindIII enzyme were purchased from Thermo Fisher, BL21 competent cells from Changshu Biotech, inc., beijing ancient cooking.
Conversion HPLC method: a chromatographic column: c18 column 4.6X 250mm,5 μm; a detector: UV268nm; column temperature: 40 ℃; flow rate: 0.8mL/min; sample introduction amount: 20 mu L of the solution; mobile phase A: water, acetonitrile, formic acid and ammonia water = 950: 50: 0.5, the pH value is 3.60-3.80, if the pH value is not reached, the pH value is adjusted to 3.70 by 10% ammonia water or 10% formic acid; ammonium formate can also be used to adjust the pH to 3.70; and (3) mobile phase B: water acetonitrile = 20: 80; gradient elution: 100% A (0.01 min), 60% A +40% B (20 min), 60% A +40% B (40 min), 100% A (50 min), 100% A (60 min).
Morpholine dione control retention time: 27.820min; (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone control retention time: 14.856min;
sitagliptin control retention time: 34.715min;
sitagliptin control (purchased from beijing ying luck science ltd) retention time: 17.705min.
Wherein, the morpholine diketone substrate raw material and the reference product (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone are synthesized by the company, and the synthesis method refers to WO2019011236A1; the 3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone racemate is synthesized by a laboratory, and is prepared by amination and catalytic hydrogenation of morpholine diketone. Determination of configuration of control (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone: (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone may be further reacted to prepare (3R) -N-t-butoxycarbonyl-3-amino-4- (2,4,5-trifluorophenyl) -butyric acid (see WO2019011236A 1), which may be determined to be R configuration by reference to a standard of (3R) -N-t-butoxycarbonyl-3-amino-4- (2,4,5-trifluorophenyl) -butyric acid, available from Anhui Kangyao pharmaceutical Limited liability company.
HPLC method for configuration determination:
a chromatographic column: daicel Chiralpak AD-H (4.6 mm 250mm,5 μm); mobile phase: n-hexane: isopropanol = 90; detection wavelength: 210nm; flow rate: 1.0ml/min; sample introduction volume: 10 mul; column temperature: 25 ℃; operating time: and (4) 40min.
The sitagliptin substrate raw material is synthesized by the laboratory, the synthetic method refers to CN100430397C, the sitagliptin racemate is synthesized by the laboratory, and the sitagliptin substrate raw material is prepared by amination and catalytic hydrogenation of the sitagliptin.
The ee value of the product is detected by a chiral HPLC method as follows:
and (3) chromatographic column: a Daicel ChiralpakAD-H column of 4.6 mm. Times.250mm, 5 μm; mobile phase: n-hexane: isopropanol: diethylamine = 40; a detector: UV268nm; column temperature: 25 ℃; flow rate: 0.8mL/min; sample introduction amount: 10 μ L.
3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone racemate control retention time: 10.290min and 28.087min;
(R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone control retention time: 28.093min.
Sitagliptin racemate control retention time: 14.172min and 17.702min;
sitagliptin control retention time: 17.665min.
EXAMPLE 1 preparation of transaminase mutant enzyme solution
According to the reported coding genes of Enz.1 (SEQ ID NO:1, namely SEQ ID NO:110 in US 8293507) and Enz.2 (SEQ ID NO:3, namely SEQ ID NO:130 in US 9617573), the whole gene is synthesized, the restriction sites NdeI and HindIII are used, and the vector pET21a is obtained. The gene synthesis company is Suzhou Jin Weizhi Biotechnology GmbH (Suzhou Industrial park Star lake street 218 Bionanometer technology park C3). The synthesized transaminase gene is transformed into host escherichia coli BL21 competent cells to obtain an engineering strain containing transaminase Enz.1 and Enz.2 genes.
Similarly, the genes of transaminases Enz.3 to Enz.25 engineered in Table 3, the cleavage sites NdeI and HindIII, and the vector pET21a were used. The synthesized transaminase gene is transformed into host escherichia coli BL21 competent cells to obtain engineering bacteria Enz.3-Enz.25 containing the transaminase gene in the table 3.
TABLE 3
After the engineering bacteria containing transaminase genes are activated by plating and streaking, single colony is selected and inoculated into 5ml LB liquid culture medium containing 50 mug/ml kanamycin, and shake culture is carried out for 12h at 37 ℃. Was inoculated into 150ml of fresh LB liquid medium containing 50. Mu.g/ml of kanamycin at an inoculum size of 2% (v/v), shaken at 37 ℃ until the OD600 reached about 0.8, IPTG was added to a final concentration of 0.5mM, and induced at 18 ℃ for 16 hours. After the culture is finished, the culture solution is centrifuged at 10,000rpm for 10min, the supernatant is discarded, and the thalli are collected and stored in an ultra-low temperature refrigerator at minus 80 ℃ for standby.
EXAMPLE 2 preparation of immobilized transaminase mutants
Mixing 5g transaminase mutant bacterial mud with 50mL 100mM dipotassium hydrogen phosphate-potassium dihydrogen phosphate buffer solution, homogenizing under high pressure, breaking cells, centrifuging, collecting supernatant enzyme solution, adding 10g K 2 HPO 4 ·3H 2 O、1g KH 2 PO 4 And 0.01g of PLP, dissolved with stirring,adding 4g ofHFA resin (purchased from Suzhou Vietnam chromatography separation and purification Co., ltd.) is fixed, the temperature is 20-25 ℃, the shaking table is used for fixing for 20-25 h at 200rpm, and the immobilized enzyme is obtained through suction filtration, deionization washing and suction filtration.
Example 3 immobilized enzymes of respective transaminase mutants for catalyzing morpholino-diketones
In a triangular flask, 10mL of isopropanol, 0.5g of morpholinodione, 0.8mL of a PLP aqueous solution of 16mg/mL, 0.25mL of isopropylamine and 1g of the immobilized enzyme obtained in example 2 were charged, subjected to shake reaction at 45 ℃ and 200rpm, and sampled for 4 hours and 24 hours to examine the conversion. The results of the measurements are shown in Table 4 below.
Among them, the conversion rate was measured by HPLC as described above in the section of the HPLC method for conversion rate. The retention time of each substrate and product was found to be consistent with the respective control.
TABLE 4
The results show that the conversion rates of Enz.3, enz.6, enz.8, enz.11, enz.13, enz.19, enz.20, enz.21, enz.22, enz.23, enz.24 and Enz.25 are all superior to those of Enz.2 and Enz.1, wherein the conversion rates of Enz.6, enz.11 and Enz.24 in 24 hours can all reach more than 80%.
To a flask, 25mL of isopropanol, 2.5g of morpholinodione, 2mL of a PLP aqueous solution of 16mg/mL, 1.24mL of isopropylamine and 7.5g of the immobilized enzyme prepared in example 2 were added, and the mixture was subjected to a shake reaction at 45 ℃ and 200rpm for 24 hours, followed by sampling and detecting the conversion rate. Filtering the reaction liquid to obtain immobilized enzyme, continuously adding 25mL of isopropanol, 2.5g of morpholine diketone, 2mL of 16mg/mL PLP aqueous solution and 1.24mL of isopropylamine, reacting at 45 ℃ with a shaking table at 200rpm, and sampling for 24 hours to detect the conversion rate. The immobilized enzyme is repeatedly applied according to the method, and the result shows that the immobilized enzyme is applied for 10 batches of reactions, the conversion rate is still kept above 90%, the ee value is more than 99.9%, and the immobilized enzyme is relatively stable. While the conversion rate of Enz.1 and Enz.2 which are applied for 3 times under the same condition is below 50 percent, so that the method is not suitable for catalyzing morpholine diketone. The specific application results are shown in Table 5 below.
TABLE 5
Example 5 immobilized enzymes of respective transaminase mutants for catalyzing sitobinone
In a triangular flask, 10mL of isopropyl alcohol, 0.5g of sitobianone, 0.8mL of 16mg/mL of an aqueous solution of PLP, 0.23mL of isopropylamine and 1g of the immobilized enzyme obtained in example 2 were charged, and subjected to a shaking reaction at 45 ℃ and 200rpm, and sampling was performed for 4 hours and 24 hours to determine the conversion by HPLC, and the results are shown in Table 6 below.
Among them, the conversion rate was measured by HPLC as described above in the section of the conversion rate HPLC method. The retention time of each substrate and product was determined to be consistent with the respective control.
TABLE 6
The results show that the conversion rates of Enz.3, enz.6, enz.11, enz.13, enz.19, enz.20, enz.21 and Enz.22 are all better than those of Enz.1 and Enz.2, wherein the effect of Enz.11 is optimal.
The protein concentration of the better immobilized enzymes is measured, and the detection method comprises the following steps: the change of the absorbance at 595nm of the enzyme solution before and after immobilization was measured by the Bradford method, and the protein concentration was calculated, and the measurement results are shown in Table 7 below:
TABLE 7
To a flask, 25mL of isopropanol, 2.5g of sitaxadione, 2mL of 16mg/mL of an aqueous PLP solution, 1.135mL of isopropylamine and 6.25g of the immobilized enzyme prepared in example 2 were added, and the mixture was subjected to a shaking reaction at 45 ℃ and 200rpm for 24 hours, followed by sampling and measuring the conversion rate. Filtering the reaction liquid to obtain immobilized enzyme, continuously adding 25mL of isopropanol, 2.5g of sitodione, 2mL of 16mg/mL of PLP aqueous solution and 1.135mL of isopropylamine, reacting at 45 ℃ with a shaking table at 200rpm, and sampling for 24 hours to detect the conversion rate. The immobilized enzyme is repeatedly used according to the method, and the result shows that the immobilized enzyme Enz.11-The HFA is mechanically applied for 10 batches of reactions, the conversion rate is still kept above 95 percent, and the ee value is>99.9 percent, and the immobilized enzyme is relatively stable. The specific application results are shown in Table 8 below.
TABLE 8
* : conversion compared to enz.1; #: conversion comparison with Enz.2
The above results show that the transaminase mutations of the present invention show superior effects on the conversion rate of the immobilized enzyme catalyzed sitagliptin than the conversion rates of the prior art Enz.1 and Enz.2 when used for immobilized enzyme catalysis, and the difference is statistically significant (p values are both less than 0.001, wherein the p value is calculated by T-Test two-tailed experiment).
Example 7 preparation of sitagliptin phosphate monohydrate
After the reaction of example 6 was completed, the immobilized enzyme and the filtrate were obtained by filtration, and the filtrate was concentrated and dried at 60 ℃ to obtain a concentrate. Dissolving in 100mL of dichloromethane, adding 100mL of purified water, stirring, adjusting pH to 2-3 with 30% concentrated hydrochloric acid, and standing for layering. Adding 100mL of dichloromethane into the water phase, stirring, adding 30% sodium hydroxide solution to adjust the pH value to 11, and standing for layering. The aqueous phase was added again with 100mL of methylene chloride, extracted with stirring, and allowed to stand for separation. The organic phases of the two alkali extractions are combined. Concentrating at 60 ℃, adding 120mL of isopropanol into the concentrate, stirring for dissolving, adding 10.6g of 85% phosphoric acid, heating to 75 ℃, stirring for dissolving, slowly cooling to separate out the sitagliptin phosphate crystals, keeping the temperature at 5 ℃ for 2h, filtering, and drying the filter cake at 60 ℃ to obtain the sitagliptin phosphate monohydrate.
SEQUENCE LISTING
<110> chess Ke Lai Biotechnology (Shanghai) GmbH
<120> transaminase, immobilized transaminase and use for the preparation of sitagliptin
<130> P20014045C
<160> 50
<170> PatentIn version 3.5
<210> 1
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.1 amino acids
<400> 1
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Met Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Pro Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 2
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.1 nucleotides
<400> 2
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatct ctatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctatggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggtccgg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 3
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.2 amino acids
<400> 3
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Met Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Pro Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 4
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.2 nucleotides
<400> 4
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctatggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtccgg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 5
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.3 amino acids
<400> 5
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Arg Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 6
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.3 nucleotides
<400> 6
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
cgtgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 7
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.4 amino acids
<400> 7
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Glu Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 8
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.4 nucleotides
<400> 8
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gaagctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 9
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.5 amino acids
<400> 9
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Trp Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 10
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.5 nucleotides
<400> 10
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tgggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 11
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.6 amino acids
<400> 11
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 12
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.6 nucleotides
<400> 12
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 13
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.7 amino acids
<400> 13
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Lys Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 14
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.7 nucleotides
<400> 14
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
aaagctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 15
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.8 amino acid
<400> 15
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
His Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 16
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.8 nucleotides
<400> 16
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
cacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 17
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.9 amino acids
<400> 17
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Tyr Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 18
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.9 nucleotides
<400> 18
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 19
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.10 amino acids
<400> 19
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Gln Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 20
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.10 nucleotides
<400> 20
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
caggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 21
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.11 amino acids
<400> 21
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Asn Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 22
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.11 nucleotides
<400> 22
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
aacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 23
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.12 amino acids
<400> 23
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Pro Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 24
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.12 nucleotides
<400> 24
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ccggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 25
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.13 amino acids
<400> 25
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ser Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 26
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.13 nucleotides
<400> 26
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 27
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.14 amino acids
<400> 27
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Asp Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 28
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.14 nucleotides
<400> 28
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 29
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.15 amino acids
<400> 29
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Gly Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 30
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.15 nucleotides
<400> 30
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ggtgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 31
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.16 amino acids
<400> 31
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ile Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 32
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.16 nucleotides
<400> 32
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
atcgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 33
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.17 amino acids
<400> 33
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Phe Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 34
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.17 nucleotides
<400> 34
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ttcgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 35
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.18 amino acids
<400> 35
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Val Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 36
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.18 nucleotides
<400> 36
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gttgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 37
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.19 amino acids
<400> 37
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Leu Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 38
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.19 nucleotides
<400> 38
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ctggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 39
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.20 amino acids
<400> 39
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Thr Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 40
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.20 nucleotides
<400> 40
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
accgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 41
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.21 amino acids
<400> 41
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Cys Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 42
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.21 nucleotides
<400> 42
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tgcgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 43
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.22 amino acids
<400> 43
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Met Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 44
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.22 nucleotides
<400> 44
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
atggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 45
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.23 amino acids
<400> 45
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Phe Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Cys Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 46
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.23 nucleotides
<400> 46
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatct ctatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctttcgtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggttgcg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 47
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.24 amino acids
<400> 47
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Asn Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 48
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.24 nucleotides
<400> 48
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatct ctatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
aacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 49
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.25 amino acids
<400> 49
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Arg Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 50
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.25 nucleotides
<400> 50
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
cgtgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
Claims (11)
1. Use of an immobilized transaminase in the preparation of sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone;
wherein the immobilized transaminase comprises a resin and a transaminase, and the amino acid sequence of the transaminase is shown in SEQ ID NO 5, SEQ ID NO 45, SEQ ID NO 47, SEQ ID NO 49 or variants thereof; the variant is that the R at the 241 th site of the amino acid sequence shown as SEQ ID NO. 5 is mutated into A, H, N, S, L, T, C or M;
preferably:
the amino acid sequence of the variant is shown as SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43; the nucleotide sequence of the transaminase is preferably shown as SEQ ID NO 6, SEQ ID NO 12, SEQ ID NO 16, SEQ ID NO 22, SEQ ID NO 26, SEQ ID NO 38, SEQ ID NO 40, SEQ ID NO 42, SEQ ID NO 44, SEQ ID NO 46, SEQ ID NO 48 or SEQ ID NO 50; and/or, said transaminase is attached to said resin by covalent bonding; and/or the resin is epoxy resin, preferablyAnd/or the reaction solvent used in the preparation is an isopropanol aqueous solution, and preferably the volume content of water in the isopropanol aqueous solution is 2-20%; and/or, the prepared reaction system also comprises a co-factor of transaminase, such as pyridoxal phosphate, the concentration of the co-factor is preferably 0.5-5 mg/mL; and/or the immobilized transaminase has a concentration of the transaminase immobilized on the resin of 150 to 300mg/g, e.g., 250mg/g.
2. An immobilized transaminase comprising a resin and a transaminase, the transaminase having an amino acid sequence set forth in SEQ ID NO 5, SEQ ID NO 45, SEQ ID NO 47, SEQ ID NO 49, or variants thereof; the variant is that the R at the 241 th site of the amino acid sequence shown as SEQ ID NO. 5 is mutated into A, H, N, S, L, T, C or M;
preferably:
the amino acid sequence of the variant is shown in SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43; the nucleotide sequence of the transaminase is preferably shown as SEQ ID NO 6, SEQ ID NO 12, SEQ ID NO 16, SEQ ID NO 22, SEQ ID NO 26, SEQ ID NO 38, SEQ ID NO 40, SEQ ID NO 42, SEQ ID NO 44, SEQ ID NO 46, SEQ ID NO 48 or SEQ ID NO 50; and/or, said transaminase is attached to said resin by covalent bonding; and/or the resin is epoxy resin, preferablyAnd/or the immobilized transaminase has a concentration of the transaminase immobilized on the resin of 150 to 300mg/g, e.g., 250mg/g.
3. A process for preparing an immobilized transaminase of claim 2, comprising:
1) Contacting a solution of the transaminase enzyme with the resin to react to form an immobilized transaminase enzyme;
2) Filtering and washing the immobilized transaminase;
preferably:
the transaminase exists in the form of transaminase bacterial sludge, and the mass ratio of the transaminase bacterial sludge to the resin is 3:1-1:2, such as 5:4; and/or the reaction temperature is 10-40 ℃, preferably 20-25 ℃; and/or the reaction time is 10-30 h, preferably 20-25 h; and/or the reaction is carried out under stirring, preferably at a rate of 50 to 300rpm, for example 150 or 200rpm.
4. A process for the preparation of sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone comprising the step of catalyzing a ketoamide substrate with an immobilized transaminase in a reaction solvent in the presence of an amino donor to yield sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone;
wherein the immobilized transaminase is as described in claim 2;
preferably, the ketoamide substrate is 4-oxo-4- [3- (trifluoromethyl) -5,6-dihydro- [1,2,4] triazolo [4,3-a ] pyrazin-7- (8H) -yl ] -1- (2,4,5-trifluorophenyl) butan-2-one and/or 1-morpholine-4- (2,4,5-trifluorophenyl) -1,3-butanedione;
and/or the reaction solvent is an isopropanol aqueous solution;
and/or the amino donor is isopropylamine;
and/or the molar ratio of the amino donor to the ketoamide substrate is 1:1-5:1;
and/or, when the reaction solvent is isopropanol aqueous solution, the volume content of water is 2-20%;
and/or the concentration of the ketoamide substrate is 20-200 g/L;
and/or the mass ratio of the immobilized transaminase to the ketoamide substrate is 1:1 to 6:1, e.g., 3;
and/or, the prepared reaction system also comprises a co-factor of transaminase, such as pyridoxal phosphate, the concentration of the co-factor is preferably 0.5-5 mg/mL;
and/or the temperature of the reaction is 30-60 ℃, preferably 45 ℃.
5. A transaminase, characterized in that the amino acid sequence of the transaminase is shown in SEQ ID No. 5, SEQ ID No. 45, SEQ ID No. 47, SEQ ID No. 49 or variants thereof; the variant is characterized in that the 241 th position of the amino acid sequence shown as SEQ ID NO. 5 is mutated from R to A, H, N, S, L, T, C or M;
preferably, the amino acid sequence of the variant is shown as SEQ ID NO. 11, SEQ ID NO. 15, SEQ ID NO. 21, SEQ ID NO. 25, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41 or SEQ ID NO. 43;
more preferably, the nucleotide sequence of the transaminase is shown as SEQ ID NO 6, SEQ ID NO 12, SEQ ID NO 16, SEQ ID NO 22, SEQ ID NO 26, SEQ ID NO 38, SEQ ID NO 40, SEQ ID NO 42, SEQ ID NO 44, SEQ ID NO 46, SEQ ID NO 48 or SEQ ID NO 50.
6. A polynucleotide encoding the transaminase enzyme of claim 5.
7. A recombinant expression vector comprising the polynucleotide of claim 6;
preferably, the backbone of the recombinant expression vector is plasmid pET21a.
8. A transformant obtained by introducing the polynucleotide of claim 6 or the recombinant expression vector of claim 7 into a host;
preferably, the host is escherichia coli; escherichia coli BL21 is preferred.
9. Use of a transaminase of claim 5 in the preparation of sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone.
10. A preparation method of sitagliptin phosphate, which is characterized by comprising the following steps:
(1) Preparing sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone according to the process of claim 4;
(2) Reacting sitagliptin and/or (R) -3-amino-1-morpholine-4- (2,4,5-trifluorophenyl) -1-butanone prepared in the step (1) to obtain sitagliptin phosphate;
preferably, the sitagliptin phosphate is sitagliptin phosphate monohydrate.
11. An enzyme preparation comprising an immobilized transaminase of claim 2 or a transaminase of claim 5;
preferably, the enzyme preparation further comprises a co-factor for transaminase, such as pyridoxal phosphate.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110655349.9A CN115466756A (en) | 2021-06-11 | 2021-06-11 | Transaminase, immobilized transaminase and application of transaminase to preparation of sitagliptin |
PCT/CN2022/092053 WO2022257686A1 (en) | 2021-06-11 | 2022-05-10 | Transaminase, immobilized transaminase and use for preparing sitagliptin |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110655349.9A CN115466756A (en) | 2021-06-11 | 2021-06-11 | Transaminase, immobilized transaminase and application of transaminase to preparation of sitagliptin |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115466756A true CN115466756A (en) | 2022-12-13 |
Family
ID=84364776
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110655349.9A Pending CN115466756A (en) | 2021-06-11 | 2021-06-11 | Transaminase, immobilized transaminase and application of transaminase to preparation of sitagliptin |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115466756A (en) |
WO (1) | WO2022257686A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117467733A (en) * | 2023-12-27 | 2024-01-30 | 北京元延医药科技股份有限公司 | High chiral purity sitagliptin and method for preparing same by using immobilized transaminase |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116024083B (en) * | 2023-02-22 | 2023-06-13 | 凯莱英生命科学技术(天津)有限公司 | Device and method for preparing chiral amine compound by continuous flow reaction |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2838744C (en) * | 2011-06-24 | 2020-10-13 | Merck Sharp & Dohme Corp. | Immobilized transaminases and process for making and using immobilized transaminase |
DK2961844T3 (en) * | 2013-02-28 | 2019-01-02 | Codexis Inc | MODIFIED TRANSAMINASE POLYPEPTIDES FOR INDUSTRIAL BIOCATALYSIS |
CN105219745B (en) * | 2014-06-12 | 2018-12-04 | 上海弈柯莱生物医药科技有限公司 | A kind of immobilization transaminase and its application in synthesis sitagliptin intermediate |
ES2733477T3 (en) * | 2017-07-04 | 2019-11-29 | Fis Fabbrica Italiana Sintetici Spa | Effective procedure for the preparation of sitagliptin through a very effective preparation of the intermediate 2,4,5-trifluorophenylacetic acid |
WO2019207443A1 (en) * | 2018-04-24 | 2019-10-31 | Unichem Laboratories Ltd | An enzymatic process for the preparation of (r)-sitagliptin |
CN108586346B (en) * | 2018-05-10 | 2019-10-01 | 北京富盛嘉华医药科技有限公司 | A kind of method that biocatalysis synthesizes sitagliptin and its intermediate |
EP3604507A1 (en) * | 2018-07-30 | 2020-02-05 | University College Cork-National University of Ireland, Cork | An omega-transaminase enzyme |
US20220380817A1 (en) * | 2019-10-25 | 2022-12-01 | Asymchem Laboratories (Fuxin) Co., Ltd. | Transaminase mutant and use thereof |
CN113061594B (en) * | 2019-12-31 | 2023-11-24 | 弈柯莱生物科技(集团)股份有限公司 | Transaminase mutants, immobilized transaminases and use for preparing sitagliptin |
-
2021
- 2021-06-11 CN CN202110655349.9A patent/CN115466756A/en active Pending
-
2022
- 2022-05-10 WO PCT/CN2022/092053 patent/WO2022257686A1/en unknown
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117467733A (en) * | 2023-12-27 | 2024-01-30 | 北京元延医药科技股份有限公司 | High chiral purity sitagliptin and method for preparing same by using immobilized transaminase |
CN117467733B (en) * | 2023-12-27 | 2024-03-12 | 北京元延医药科技股份有限公司 | High chiral purity sitagliptin and method for preparing same by using immobilized transaminase |
Also Published As
Publication number | Publication date |
---|---|
WO2022257686A1 (en) | 2022-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113061594B (en) | Transaminase mutants, immobilized transaminases and use for preparing sitagliptin | |
CN111041019B (en) | Aspartic enzyme mutant and application thereof | |
CN115466756A (en) | Transaminase, immobilized transaminase and application of transaminase to preparation of sitagliptin | |
CN111411094B (en) | (R) -omega-transaminase mutant and application thereof | |
CN102277338A (en) | Diketoreductase mutant and application thereof | |
CN109735559B (en) | Biological preparation method of gamma-aminobutyric acid | |
CN110777123B (en) | Mutant L-amino acid ligase and process for preparing L-glutamic acid-L-tryptophan dipeptide by enzyme catalysis method | |
CN109355266B (en) | Imine reductase mutant and application thereof in synthesis of optically active 1-substituted-tetrahydroisoquinoline derivative | |
CN110885846A (en) | Microorganism for synthesizing baicalein and scutellarein, preparation method and application thereof | |
KR100294996B1 (en) | Epimerase | |
CN111534494B (en) | (R) -omega-transaminase mutant and application thereof in preparation of sitagliptin intermediate | |
CN115975969A (en) | Transaminase and use thereof for the preparation of sitagliptin or an intermediate thereof | |
CN109593739B (en) | Recombinant ketoacid reductase mutant, gene, engineering bacterium and application thereof | |
KR100702728B1 (en) | Cyclic depsipeptide synthases, genes thereof and mass production system of cyclic depsipeptide | |
CN111455003A (en) | Method for preparing D-psicose from microalgae | |
CN107099523B (en) | Cefradine synthase mutant and its encoding gene | |
CN112048485B (en) | Engineered transaminase polypeptide for preparing sitagliptin | |
CN112176007B (en) | Preparation method of amino alcohol chiral intermediate | |
CN114438146A (en) | Method for producing histidine by microbial fermentation and application | |
CN111979206B (en) | Immobilized fusion enzyme and method for preparing glutathione by using same | |
CN110951706A (en) | Recombinant R-omega-transaminase, mutant and application in asymmetric synthesis of sitagliptin | |
KR101379585B1 (en) | Novel Glutamate Decarboxylase Mutants For Salt-Free Reaction | |
KR102238654B1 (en) | Mutant strain of Streptomyces sp. having enhanced biosynthetic ability to produce Ohmyungsamycin A, and method for preparing Ohmyungsamycin A using the same | |
CN117467733B (en) | High chiral purity sitagliptin and method for preparing same by using immobilized transaminase | |
CN111808893B (en) | Novel biological preparation method of amino alcohol drug intermediate |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant after: Yikelai Biotechnology (Group) Co.,Ltd. Address before: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant before: Ecolab Biotechnology (Shanghai) Co.,Ltd. |
|
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |