CN116790569A - Pyruvic acid decarboxylase mutant and application thereof in preparation of alpha-hydroxyketone compound - Google Patents
Pyruvic acid decarboxylase mutant and application thereof in preparation of alpha-hydroxyketone compound Download PDFInfo
- Publication number
- CN116790569A CN116790569A CN202210361627.4A CN202210361627A CN116790569A CN 116790569 A CN116790569 A CN 116790569A CN 202210361627 A CN202210361627 A CN 202210361627A CN 116790569 A CN116790569 A CN 116790569A
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- gly
- val
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 title claims description 26
- 229940107700 pyruvic acid Drugs 0.000 title claims description 13
- 238000002360 preparation method Methods 0.000 title claims description 9
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 claims abstract description 165
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 8
- 150000001413 amino acids Chemical class 0.000 claims description 80
- HUMNYLRZRPPJDN-UHFFFAOYSA-N benzaldehyde Chemical compound O=CC1=CC=CC=C1 HUMNYLRZRPPJDN-UHFFFAOYSA-N 0.000 claims description 64
- 230000035772 mutation Effects 0.000 claims description 32
- QNGNSVIICDLXHT-UHFFFAOYSA-N para-ethylbenzaldehyde Natural products CCC1=CC=C(C=O)C=C1 QNGNSVIICDLXHT-UHFFFAOYSA-N 0.000 claims description 32
- 150000001875 compounds Chemical class 0.000 claims description 27
- 238000000034 method Methods 0.000 claims description 24
- 238000003259 recombinant expression Methods 0.000 claims description 23
- 102000039446 nucleic acids Human genes 0.000 claims description 18
- 108020004707 nucleic acids Proteins 0.000 claims description 18
- 150000007523 nucleic acids Chemical class 0.000 claims description 18
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 claims description 18
- 241000894006 Bacteria Species 0.000 claims description 15
- 239000002773 nucleotide Substances 0.000 claims description 15
- 125000003729 nucleotide group Chemical group 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 14
- 125000003118 aryl group Chemical group 0.000 claims description 10
- 239000013598 vector Substances 0.000 claims description 10
- 230000000694 effects Effects 0.000 claims description 9
- 229940054269 sodium pyruvate Drugs 0.000 claims description 9
- 125000001072 heteroaryl group Chemical group 0.000 claims description 8
- 125000004435 hydrogen atom Chemical group [H]* 0.000 claims description 8
- 239000013612 plasmid Substances 0.000 claims description 8
- 241000206602 Eukaryota Species 0.000 claims description 4
- 229920001184 polypeptide Polymers 0.000 claims description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 4
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 4
- 239000003054 catalyst Substances 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 3
- 229910021645 metal ion Inorganic materials 0.000 claims description 3
- HHDDCCUIIUWNGJ-UHFFFAOYSA-N 3-hydroxypyruvic acid Chemical compound OCC(=O)C(O)=O HHDDCCUIIUWNGJ-UHFFFAOYSA-N 0.000 claims description 2
- 239000013603 viral vector Substances 0.000 claims description 2
- 238000006243 chemical reaction Methods 0.000 abstract description 115
- 239000000758 substrate Substances 0.000 abstract description 48
- 230000003287 optical effect Effects 0.000 abstract description 11
- 230000003197 catalytic effect Effects 0.000 abstract description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 80
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 80
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 48
- 239000000243 solution Substances 0.000 description 42
- 235000001014 amino acid Nutrition 0.000 description 41
- 229940024606 amino acid Drugs 0.000 description 40
- 230000000052 comparative effect Effects 0.000 description 35
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 32
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 32
- 108010005233 alanylglutamic acid Proteins 0.000 description 32
- 108010049041 glutamylalanine Proteins 0.000 description 32
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 32
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 31
- 108010089804 glycyl-threonine Proteins 0.000 description 29
- 238000004128 high performance liquid chromatography Methods 0.000 description 29
- 108020004414 DNA Proteins 0.000 description 24
- 102000053602 DNA Human genes 0.000 description 24
- 102000004190 Enzymes Human genes 0.000 description 24
- 108090000790 Enzymes Proteins 0.000 description 24
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 17
- 108090000623 proteins and genes Proteins 0.000 description 17
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 16
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 16
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 16
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 16
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 16
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 16
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 16
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 16
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 16
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 16
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 16
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 16
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 16
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 16
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 16
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 16
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 16
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 16
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 16
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 16
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 16
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 16
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 16
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 16
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 16
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 16
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 16
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 16
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 16
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 16
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 16
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 16
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 16
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 16
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 16
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 16
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 16
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 16
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 16
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 16
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 16
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 16
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 16
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 16
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 16
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 16
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 16
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 16
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 16
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 16
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 16
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 16
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 16
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 16
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 16
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 16
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 16
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 16
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 16
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 16
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 16
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 16
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 16
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 16
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 16
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 16
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 16
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 16
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 16
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 16
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 16
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 16
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 16
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 16
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 16
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 16
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 16
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 16
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 16
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 16
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 16
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 16
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 16
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 16
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 16
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 16
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 16
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 16
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 16
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 16
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 16
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 16
- ZGVYWHODYWRPLK-GUBZILKMSA-N Met-Pro-Cys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O ZGVYWHODYWRPLK-GUBZILKMSA-N 0.000 description 16
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 16
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 16
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 16
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 16
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 16
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 16
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 16
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 16
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 16
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 16
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 16
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 16
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 16
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 16
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 16
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 16
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 16
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 16
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 16
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 16
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 16
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 16
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 16
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 16
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 16
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 16
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 16
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 16
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 16
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 16
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 16
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 16
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 16
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 16
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 16
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 16
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 16
- CGDZGRLRXPNCOC-SRVKXCTJSA-N Tyr-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CGDZGRLRXPNCOC-SRVKXCTJSA-N 0.000 description 16
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 16
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 16
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 16
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 16
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 16
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 16
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 16
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 16
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 16
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 16
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 16
- 108010093581 aspartyl-proline Proteins 0.000 description 16
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 16
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 16
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 16
- 108010025306 histidylleucine Proteins 0.000 description 16
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 16
- 108010000761 leucylarginine Proteins 0.000 description 16
- 108010057821 leucylproline Proteins 0.000 description 16
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 16
- 108010064235 lysylglycine Proteins 0.000 description 16
- 108010054155 lysyllysine Proteins 0.000 description 16
- 108010056582 methionylglutamic acid Proteins 0.000 description 16
- 108010024607 phenylalanylalanine Proteins 0.000 description 16
- 108010053725 prolylvaline Proteins 0.000 description 16
- 108010071207 serylmethionine Proteins 0.000 description 16
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 108010038745 tryptophylglycine Proteins 0.000 description 16
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 15
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 15
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 15
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 15
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 15
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 15
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 15
- 108010044940 alanylglutamine Proteins 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 15
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 14
- 229960002363 thiamine pyrophosphate Drugs 0.000 description 14
- 235000008170 thiamine pyrophosphate Nutrition 0.000 description 14
- 239000011678 thiamine pyrophosphate Substances 0.000 description 14
- YXVCLPJQTZXJLH-UHFFFAOYSA-N thiamine(1+) diphosphate chloride Chemical compound [Cl-].CC1=C(CCOP(O)(=O)OP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N YXVCLPJQTZXJLH-UHFFFAOYSA-N 0.000 description 14
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 13
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 12
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 12
- 239000012634 fragment Substances 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- 239000007788 liquid Substances 0.000 description 11
- 108090000489 Carboxy-Lyases Proteins 0.000 description 10
- 102000004031 Carboxy-Lyases Human genes 0.000 description 10
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 10
- 210000004027 cell Anatomy 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 238000003760 magnetic stirring Methods 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 102000003960 Ligases Human genes 0.000 description 6
- 108090000364 Ligases Proteins 0.000 description 6
- 238000006555 catalytic reaction Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 239000001963 growth medium Substances 0.000 description 6
- 238000010438 heat treatment Methods 0.000 description 6
- 239000011777 magnesium Substances 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 238000003756 stirring Methods 0.000 description 6
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 5
- 125000004432 carbon atom Chemical group C* 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 238000002156 mixing Methods 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 102220627580 Alsin_L38M_mutation Human genes 0.000 description 4
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- 239000007853 buffer solution Substances 0.000 description 4
- 238000005520 cutting process Methods 0.000 description 4
- 238000001976 enzyme digestion Methods 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- 239000012880 LB liquid culture medium Substances 0.000 description 3
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 210000001082 somatic cell Anatomy 0.000 description 3
- 230000002194 synthesizing effect Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- URLKBWYHVLBVBO-UHFFFAOYSA-N Para-Xylene Chemical group CC1=CC=C(C)C=C1 URLKBWYHVLBVBO-UHFFFAOYSA-N 0.000 description 2
- 239000001888 Peptone Substances 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 241000588902 Zymomonas mobilis Species 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 125000000217 alkyl group Chemical group 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000007036 catalytic synthesis reaction Methods 0.000 description 2
- 239000012295 chemical reaction liquid Substances 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- ZUOUZKKEUPVFJK-UHFFFAOYSA-N diphenyl Chemical compound C1=CC=CC=C1C1=CC=CC=C1 ZUOUZKKEUPVFJK-UHFFFAOYSA-N 0.000 description 2
- 239000012154 double-distilled water Substances 0.000 description 2
- 238000007852 inverse PCR Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005065 mining Methods 0.000 description 2
- 239000012452 mother liquor Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000013386 optimize process Methods 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 229920002477 rna polymer Polymers 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000013112 stability test Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 125000001637 1-naphthyl group Chemical group [H]C1=C([H])C([H])=C2C(*)=C([H])C([H])=C([H])C2=C1[H] 0.000 description 1
- 125000001622 2-naphthyl group Chemical group [H]C1=C([H])C([H])=C2C([H])=C(*)C([H])=C([H])C2=C1[H] 0.000 description 1
- DAQHMCWYXJEOCG-UHFFFAOYSA-N 2-oxopropanoic acid;sodium Chemical compound [Na].CC(=O)C(O)=O DAQHMCWYXJEOCG-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 241000179039 Paenibacillus Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000003929 Transaminases Human genes 0.000 description 1
- 108090000340 Transaminases Proteins 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 239000001166 ammonium sulphate Substances 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 239000004305 biphenyl Substances 0.000 description 1
- 235000010290 biphenyl Nutrition 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 125000005843 halogen group Chemical group 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 125000005842 heteroatom Chemical group 0.000 description 1
- 150000002391 heterocyclic compounds Chemical class 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 230000002572 peristaltic effect Effects 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 235000018102 proteins Nutrition 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 229930195727 α-lactose Natural products 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/01—Carboxy-lyases (4.1.1)
- C12Y401/01001—Pyruvate decarboxylase (4.1.1.1)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The application discloses a pyruvate decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compounds, wherein the pyruvate decarboxylase mutant is prepared by the following steps of: 1, wherein the pyruvate decarboxylase mutant can be applied to preparing alpha-hydroxyketone compounds, the substrate conversion rate can reach 99.7%, and the e.e value of a target product can reach 99.8%, compared with a target product with the amino acid sequence shown as SEQ ID NO:1, and the pyruvate decarboxylase mutant has the advantages of high catalytic activity, better optical selectivity, stronger substrate tolerance and ideal thermal stability.
Description
Technical Field
The application relates to the fields of enzyme engineering and biopharmaceuticals, in particular to a pyruvic acid decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compounds.
Background
Pyruvate decarboxylase (pyruvate decarboxylase, PDC), which is an intracellular enzyme widely occurring in animals, plants and microorganisms, is a thiamine pyrophosphate (TPP) dependent non-oxidase. The PDC can be applied to the synthesis of alpha-hydroxyketone compounds.
The alpha-hydroxy ketone compound is a compound with important application value, can be used as a natural product and a medical intermediate with biological activity, can be used as a photoinitiator in ultraviolet light curing coating, and can be derived and converted into heterocyclic compounds. The method for synthesizing the alpha-hydroxyketone compound by adopting PDC catalysis has the advantages of mild reaction conditions, low raw materials and high optical purity of the product, and has great economic value and environmental protection significance. However, the catalytic synthesis of the alpha-hydroxyketone compound by using the wild PDC has the defects of low catalytic activity, poor enzyme stability and insufficient optical selectivity, thereby limiting the industrial application of the PDC in the technical field of the synthesis of the alpha-hydroxyketone compound.
Therefore, how to modify the wild PDC to improve the catalytic activity and the stability of the PDC is of great significance to the development of the alpha-hydroxy ketone compounds prepared by the PDC enzymatic method.
Disclosure of Invention
The application provides a pyruvate decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compounds, so as to solve the problems of low catalytic activity, poor enzyme stability and insufficient optical selectivity in the prior art of catalyzing and synthesizing the alpha-hydroxyketone compounds by using wild pyruvate decarboxylase.
In a first aspect, the application provides a pyruvate decarboxylase mutant having an amino acid sequence identical to SEQ ID NO:1, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% similarity, and the amino acid sequence of the pyruvate decarboxylase mutant is represented by SEQ ID NO:1, wherein the amino acid sequence is obtained by one or more point mutations in the amino acid sequence shown in 1; the pyruvate decarboxylase mutant has pyruvate decarboxylase activity.
Further, the amino acid at which the point mutation occurs comprises SEQ ID NO:1, at least one of amino acid T at position 7, amino acid T at position 8, amino acid L at position 38, amino acid N at position 169, amino acid a at position 246, amino acid G at position 294, amino acid W at position 392, amino acid V at position 450, amino acid Q at position 452, amino acid I at position 472, amino acid M at position 475, amino acid I at position 476, amino acid V at position 549, amino acid W at position 551, amino acid K at position 553, or amino acid V at position 555.
Further, the means for generating the point mutation comprises at least one of T7R, T8W, L38M, N Y, A246M, G294K, W392A, V450E, Q452G, I472M, M475K, I476L, V549Y, W551D, K553Y or V555P.
Further, the point mutation occurs in any one of the following ways:
(1)W392I;
(2)I472M;
(3)W551D;
(4)G294K;
(5)I476L;
(6)V555P;
(7)M475K;
(8) W551D and M475K;
(9) V549Y and W392A;
(10) T8W and a246M;
(11) N169Y and V450E;
(12) M475K, L M and W551D;
(13) T7R, L M and Q452G;
(14) T7R, K Y and M475K; or (b)
(15) T7R, L M, W551D and M475K.
Further, the amino acid sequence of the pyruvate decarboxylase mutant is selected from the group consisting of SEQ ID NO: 3. SEQ ID NO: 5. SEQ ID NO: 7. SEQ ID NO: 9. SEQ ID NO: 11. SEQ ID NO: 13. SEQ ID NO: 15. SEQ ID NO: 17. SEQ ID NO: 19. SEQ ID NO: 21. SEQ ID NO: 23. SEQ ID NO: 25. SEQ ID NO: 27. SEQ ID NO:29 or SEQ ID NO:31, and a polypeptide comprising the amino acid sequence shown in any one of seq id no.
In a second aspect, the present application provides a nucleic acid molecule comprising a nucleotide sequence encoding a pyruvate decarboxylase mutant according to any of the first aspects.
Further, the nucleotide sequence is selected from the group consisting of SEQ ID NOs: 4. SEQ ID NO: 6. SEQ ID NO: 8. SEQ ID NO: 10. SEQ ID NO: 12. SEQ ID NO: 14. SEQ ID NO: 16. SEQ ID NO: 18. SEQ ID NO: 20. SEQ ID NO: 22. SEQ ID NO: 24. SEQ ID NO: 26. SEQ ID NO: 28. SEQ ID NO:30 or SEQ ID NO:32, or a nucleotide sequence as set forth in any one of seq id no.
In a third aspect, the present application provides a recombinant expression vector comprising a vector, and a nucleic acid molecule as in any one of the second aspects; the vector is selected from a plasmid, cosmid, phage or viral vector.
In a fourth aspect, the present application provides a recombinant expression transformant comprising a host, and the nucleic acid molecule according to any one of the second aspects, or the recombinant expression vector according to the third aspect, introduced into the host; the host is selected from eukaryotes or prokaryotes.
In a fifth aspect, the present application provides a method for producing a pyruvate decarboxylase mutant by culturing the recombinant expression transformant described in the fourth aspect, and obtaining the pyruvate decarboxylase mutant from the culture.
In a sixth aspect, the present application provides a method for preparing a chiral α -hydroxyketone compound, wherein the first compound and the second compound are contacted and reacted to form the α -hydroxyketone compound, using the pyruvate decarboxylase mutant as described in any one of the first aspect or the pyruvate decarboxylase mutant prepared by the preparation method as described in the fifth aspect as a catalyst; wherein the first compound has a structure represented by the following general formula (I):
In the general formula (I), R 1 Selected from hydrogen atoms or hydroxyl groups, and X is selected from hydrogen atoms or monovalent metal ions.
The second compound has a structure represented by the following general formula (II):
in the general formula (II), R 2 Selected from aryl or heteroaryl groups.
Further, the first compound is selected from at least one of pyruvic acid, hydroxy pyruvic acid or sodium pyruvic acid, and the second compound is selected from benzaldehyde.
The application provides a pyruvic acid decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compounds, and the pyruvic acid decarboxylase mutant has the following technical effects:
in the application, the pyruvate decarboxylase mutant is shown in SEQ ID NO:1, wherein the pyruvate decarboxylase mutant has pyruvate decarboxylase activity, and the pyruvate decarboxylase mutant can be applied to the preparation of alpha-hydroxyketone compounds, and the catalytic synthesis of the compounds with the structure shown in the formula (IV) is taken as an example, the conversion rate of substrate benzaldehyde can reach 99.7%, and the e.e value of a target product can reach 99.8%, compared with the compounds with the structure shown in SEQ ID NO:1, and the pyruvate decarboxylase mutant has the advantages of high catalytic activity and better optical selectivity. Furthermore, the substrate tolerance test and the thermal stability test show that the tolerance performance and the thermal stability performance of the pyruvate decarboxylase mutant to the substrate are obviously superior to those of the wild PDC.
Drawings
The technical solution of the present application and its advantageous effects will be made apparent by the following detailed description of the specific embodiments of the present application with reference to the accompanying drawings.
FIG. 1 is an HPLC chart of a reaction solution obtained by reacting a reaction system including a pyruvate decarboxylase mutant M3 for 2 hours in Experimental example 1;
FIG. 2 is an HPLC chart of a reaction solution obtained by reacting a reaction system including a pyruvate decarboxylase mutant M15 for 2 hours in Experimental example 1;
FIG. 3 is a HPLC chart of a reaction solution obtained by reacting a reaction system including a pyruvate decarboxylase mutant M3 of Experimental example 2 for 30 min;
FIG. 4 is a HPLC chart of a reaction solution obtained by reacting a reaction system including a pyruvate decarboxylase mutant M3 for 24 hours in Experimental example 2;
FIG. 5 is a HPLC chart of a reaction solution obtained by reacting a reaction system including a pyruvate decarboxylase mutant M15 of Experimental example 2 for 30 min;
FIG. 6 is a HPLC chart of a reaction solution obtained by reacting a reaction system including a pyruvate decarboxylase mutant M15 for 24 hours in Experimental example 2;
FIG. 7 is an HPLC chart of a reaction solution obtained by feeding benzaldehyde for 1h to a pre-reaction system comprising a pyruvate decarboxylase mutant M3 in Experimental example 3;
FIG. 8 is an HPLC chart of a reaction solution obtained by feeding benzaldehyde for 24 hours to a pre-reaction system comprising a pyruvate decarboxylase mutant M3 in experimental example 3;
FIG. 9 is an HPLC chart of a reaction solution obtained by feeding benzaldehyde for 1h to a pre-reaction system comprising pyruvic acid decarboxylase mutant M15 in experimental example 3;
FIG. 10 is an HPLC chart of a reaction solution obtained by feeding benzaldehyde for 24 hours to a pre-reaction system containing pyruvic acid decarboxylase mutant M15 in experimental example 3.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to fall within the scope of the application.
The embodiment of the application provides a pyruvate decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compounds, wherein the amino acid sequence of the pyruvate decarboxylase mutant is identical to that of SEQ ID NO:1, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% similarity, and the amino acid sequence of the pyruvate decarboxylase mutant is defined by SEQ ID NO:1, and the pyruvate decarboxylase mutant has the activity of pyruvate decarboxylase.
As used herein, "similarity" refers to the relatedness between two amino acid sequences or between two nucleotide sequences, for example: amino acid sequence of pyruvate decarboxylase mutant and SEQ ID NO:1, and a correlation between amino acid sequences shown in 1. In embodiments of the application, at least 80% similarity is understood to be 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence similarity, the number of corresponding similarities being an integer; it is further understood that 80.1%, 81.2%, 82.3%, 83.4%, 84.5%, 85.6%, 86.7%, 87.8%, 88.9%, 89.8%, 90.3%, 91.7%, 92.2%, 93.5%, 94.8%, 95.9%, 96.6%, 97.5%, 98.4% or 99.9%, but less than 100% sequence similarity, the number of corresponding similarity being in decimal.
As used herein, "point mutation" refers to a mutation in SEQ ID NO:1, and substitution, deletion or insertion of an amino acid at a specific site in the amino acid sequence shown in fig. 1.
Further, the amino acids with point mutations include SEQ ID NO:1, at least one of amino acid T at position 7, amino acid T at position 8, amino acid L at position 38, amino acid N at position 169, amino acid a at position 246, amino acid G at position 294, amino acid W at position 392, amino acid V at position 450, amino acid Q at position 452, amino acid I at position 472, amino acid M at position 475, amino acid I at position 476, amino acid V at position 549, amino acid W at position 551, amino acid K at position 553, or amino acid V at position 555.
In some embodiments of the application, the manner in which the point mutation occurs comprises at least one of T7R, T W, L38M, N169Y, A246M, G294K, W392A, V450E, Q452G, I472M, M475K, I476L, V549Y, W551D, K553Y or V555P.
As used herein, "amino acid" is represented by a single letter or three letter code, having the following meaning: a: ala (alanine); r: arg (arginine); n: asn (asparagine); d: asp (aspartic acid); c: cys (cysteine); q: gln (glutamine); e: glu (glutamic acid); g: gly (glycine); h: his (histidine); i: ile (isoleucine); l: leu (leucine); k: lys (lysine); m: met (methionine); f: phe (phenylalanine); p: pro (proline); s: ser (serine); t: thr (threonine); w: trp (tryptophan); y: tyr (tyrosine); v: val (valine).
For the point mutation pattern of amino acid substitution, the nomenclature method is: original amino acid, site of original amino acid, substituted amino acid, for example: W551D represents the amino acid sequence set forth in SEQ ID NO:1, substituting aspartic acid for original tryptophan at 551 th site in the amino acid sequence shown in the formula 1; M475K is represented in SEQ ID NO:1, and substituting lysine for original lysine at 475 th site in the amino acid sequence shown in figure 1.
In some embodiments of the application, the amino acid sequence of the pyruvate decarboxylase mutant is selected from the group consisting of SEQ ID NO: 3. SEQ ID NO: 5. SEQ ID NO: 7. SEQ ID NO: 9. SEQ ID NO: 11. SEQ ID NO: 13. SEQ ID NO: 15. SEQ ID NO: 17. SEQ ID NO: 19. SEQ ID NO: 21. SEQ ID NO: 23. SEQ ID NO: 25. SEQ ID NO: 27. SEQ ID NO:29 or SEQ ID NO:31, and a polypeptide comprising the amino acid sequence shown in any one of seq id no.
The embodiment of the application also provides a nucleic acid molecule, which comprises a nucleotide sequence for encoding the pyruvate decarboxylase mutant according to any one of the embodiments of the application.
As used herein, "nucleic acid molecule" refers to a biological macromolecular compound polymerized from a plurality of nucleotides, which may be any one of a deoxyribonucleic acid (DNA), ribonucleic acid (RNA) or oligonucleotide fragment produced by Polymerase Chain Reaction (PCR) or by in vitro translation, and a fragment produced by any one or more of ligation, cleavage, endonuclease action or exonuclease action, and may be single-stranded or double-stranded. In embodiments of the application, nucleic acid molecules include, but are not limited to, polynucleotides encoding pyruvate decarboxylase mutants.
In some embodiments of the application, the nucleotide sequence encoding the pyruvate decarboxylase mutant is selected from the group consisting of SEQ ID NO: 4. SEQ ID NO: 6. SEQ ID NO: 8. SEQ ID NO: 10. SEQ ID NO: 12. SEQ ID NO: 14. SEQ ID NO: 16. SEQ ID NO: 18. SEQ ID NO: 20. SEQ ID NO: 22. SEQ ID NO: 24. SEQ ID NO: 26. SEQ ID NO: 28. SEQ ID NO:30 or SEQ ID NO:32, or a nucleotide sequence as set forth in any one of seq id no.
For ease of understanding, table 1 below shows specific information on pyruvate decarboxylase mutants involved in the examples of the present application,
the pyruvate decarboxylase mutant has a relative amino acid sequence of SEQ ID NO:1, the pyruvate decarboxylase mutants provided in table 1 are by way of example only:
table 1 list of specific information on pyruvate decarboxylase mutants involved in the examples of the present application
The embodiment of the application also provides a recombinant expression vector, which comprises the vector and the nucleic acid molecule according to any one of the embodiments of the application.
As used herein, a "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid, and may be, for example, a plasmid, virus, cosmid, phage, or the like. In one embodiment of the present application, pET24a into which no foreign gene is inserted may be used as a vector.
As used herein, a "recombinant expression vector" refers to a DNA construct comprising a nucleic acid molecule operably linked to suitable control sequences capable of effecting the expression of the nucleic acid molecule in a suitable host. In the examples of the present application, the recombinant expression vector refers to a DNA construct formed by inserting a nucleic acid molecule encoding a pyruvate decarboxylase mutant into a vector using molecular biological techniques.
The embodiment of the application also provides a recombinant expression transformant, which comprises a host and the nucleic acid molecule or the recombinant expression vector according to any one of the embodiment of the application introduced into the host.
As used herein, a "recombinant expression transformant" refers to a host that has been subjected to exogenous genetic material (e.g., plasmid DNA) to alter genetic characteristics. In an embodiment of the application, the engineering strain that receives the exogenous genetic material (nucleic acid molecule encoding a pyruvate decarboxylase mutant or recombinant expression vector) belongs to a recombinant expression transformant. In one embodiment of the application, the T1 recombinant engineering bacteria to the T15 recombinant engineering bacteria belong to recombinant expression transformants.
As used herein, "host" refers to a class of organisms used to express exogenous genes to produce proteins, and the host may be, for example, eukaryotes, prokaryotes, viruses, etc., wherein eukaryotes as hosts include, but are not limited to, mammalian cells, yeast, fungi, insect cells, and plant cells, and prokaryotes as hosts include, but are not limited to, bacillus, clostridium, lactobacillus, streptomyces, staphylococcus, escherichia, pseudomonas, and paenibacillus. In one embodiment of the application, the host for expressing the pyruvate decarboxylase mutant is E.coli BL21 (DE 3).
The embodiment of the application also provides a preparation method of the pyruvate decarboxylase mutant, which specifically comprises the following steps: by culturing the recombinant expression transformant according to any one of the embodiments of the present application, and obtaining a pyruvate decarboxylase mutant from the culture.
The embodiment of the application also provides a preparation method of the alpha-hydroxy ketone compound, which comprises the following steps: the pyruvate decarboxylase mutant or the pyruvate decarboxylase mutant prepared by any one of the preparation methods disclosed by the embodiment of the application is used as a catalyst, and the first compound and the second compound are in contact reaction to generate the alpha-hydroxyketone compound, wherein the first compound has a structure shown as the following general formula (I):
In the general formula (I), R 1 Selected from hydrogen atoms or hydroxyl groups, and X is selected from hydrogen atoms or monovalent metal ions.
The second compound has a structure represented by the following general formula (II):
in the general formula (II), R 2 Selected from aryl or heteroaryl groups.
As used herein, "aryl" includes both unsubstituted aryl groups and aryl groups wherein one or more hydrogen atoms are optionally substituted with other groups, such as halogen atoms or alkyl groups, allowing multiple degrees of substitution to occur; "unsubstituted aryl" refers to an aromatic group containing only carbon atoms in the aromatic ring, including but not limited to phenyl, 1-naphthyl, 2-naphthyl, or biphenyl.
As used herein, "heteroaryl" means that one or more carbon atoms in the aryl group are independently replaced by one or more heteroatoms (e.g., N, O, P and/or S), e.g., heteroaryl groups have 3 to 20 carbon atoms, and as heteroaryl groups have 5 to 15 carbon atoms, and as heteroaryl groups have 5 to 9 carbon atoms, heteroaryl groups may be unsubstituted or have one or more hydrogen atoms thereon optionally replaced by other groups, e.g., alkyl groups, halogens, and the like, allowing multiple degrees of substitution.
It is understood that the pyruvate decarboxylase mutant used for preparing the alpha-hydroxyketone compound may be a culture (including a culture medium) of a recombinant engineering bacterium carrying a gene encoding the pyruvate decarboxylase mutant, or may be a somatic cell, a somatic cell extract, a somatic cell fragment or a purified pyruvate decarboxylase mutant obtained by separating and purifying the culture.
In some embodiments of the application, in a reaction system in which the pyruvate decarboxylase mutant catalyzes a contact reaction of a first compound and a second compound to produce an α -hydroxyketone compound, the pyruvate decarboxylase mutant: a first compound: the mass ratio of the second compound is 1: (0.7-10): (4.5-12). If the addition amount of the pyruvate decarboxylase mutant is too small, the catalytic reaction effect on the first compound and the second compound is limited, so that the phenomenon of excessive substrates occurs; if the addition amount of the pyruvate decarboxylase mutant is too large, the pyruvate decarboxylase mutant is wasted, the production cost is increased, and the difficulty is increased for the separation and purification of the subsequent products.
Further, the reaction system also comprises thiamine pyrophosphate and Mg 2+ Wherein thiamine pyrophosphate is used as a coenzyme for the decarboxylation reaction of the first compound, mg 2+ For increasing the reactivity of the enzyme, wherein the catalytic reaction is represented by the following formula (III):
in some embodiments of the application, the pH of the reaction system is 5.6 to 7.0 and the reaction temperature is 26 ℃ to 35 ℃. The catalytic reaction may be carried out under shaking or stirring conditions, and the reaction time is, for example, based on a substrate remaining amount of less than 5%; after the catalytic reaction is completed, the α -hydroxy ketone compound may be extracted according to a separation and purification method common in the art, including but not limited to at least one of filtration, centrifugation, precipitation, or drying.
The technical solutions in the embodiments of the present application will be clearly and completely described below. It will be apparent that the described embodiments are only some, but not all, embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to fall within the scope of the application. The experimental procedure, which does not address the specific conditions in the examples below, is generally followed by routine conditions such as Sambrook et al, molecular cloning: conditions described in the laboratory Manual (New York: cold Spring Harbor Laboratory Press, 1989) or as recommended by the manufacturer.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. In addition, any methods and materials similar or equivalent to those described herein can be used in the present application. The preferred methods and materials described herein are illustrative only and should not be construed as limiting the application.
Unless otherwise indicated, the starting materials and reagents used in the following examples are commercially available or may be prepared by methods known in the art.
1. Description of the Medium in the examples of the application
(1) LB medium
Each 100mL of LB liquid medium comprises: 1.0g of peptone, 0.5g of yeast powder, and 1.0g of NaCl;
for LB solid culture medium, 20g/L agar is added on the basis of the formula of LB liquid culture medium;
for LB medium containing kanamycin resistance, the total concentration of kanamycin was 50. Mu.g/mL.
(2) Self-induction culture medium
120g of yeast powder, 32.25g of peptone and 0.75g of magnesium sulfate (MgSO were weighed out separately 4 ) 16.5g of ammonium sulphate ((NH 4) 2 SO 4 ) 32.5g of monopotassium phosphate (KH) 2 PO 4 ) 35.5g of disodium hydrogen phosphate (Na 2 HPO 4 ) 2.5g of glucose and 10g of alpha-lactose, and then all the weighed components are added into a pulverizer to be sufficiently ground into powder, so as to obtain a powdery self-induction culture medium. 50g of the powdered self-induction medium was dissolved in 1L of deionized water, and after thorough mixing, the pH was adjusted to 7.0, and then sterilized at 121℃for 30min.
2. The description of plasmids and competent cells in the examples of the application is given in Table 2 below:
TABLE 2 description of plasmids and competent cells in the examples of the application
3. Gene fragments and reagents involved in the examples of the present application were described:
the gene fragment related in the embodiment of the application comprises a primer, a sequence shown as SEQ ID NO:2, and the like are synthesized by the division of biological engineering (Shanghai).
Restriction enzymes (e.g., bamH I, nde I and Dpn I), T4 ligase, KOD high-fidelity enzyme kit, gel recovery kit, 10 XT 4 ligase Buffer (Buffer), double distilled water (ddH) 2 Molecular reagents such as O) were purchased from Takara Bio Inc.
The technical scheme and advantageous effects of the present application are further described below with reference to examples.
Example 1: construction of recombinant expression vector pET24a-ZM
Using gene mining techniques, mining a pyruvate decarboxylase derived from zymomonas mobilis (Zymomonas mobilis) from the NCBI database, the pyruvate decarboxylase having an amino acid sequence as set forth in SEQ ID NO:1, NCBI accession number WP_014849477.1. Codon optimization is carried out according to E.coli codon preference, and the coding SEQ ID NO is synthesized by a total gene synthesis method: 1, which has the amino acid sequence shown in SEQ ID NO: 2.
The construction method of the recombinant expression vector pET24a-ZM comprises the following steps of:
s1.1, SEQ ID NO:2, respectively adding enzyme cutting sites BamH I and Nde I at two ends of the nucleotide sequence shown in the formula 2, artificially synthesizing a gene fragment, performing double enzyme cutting on the synthesized gene fragment by adopting BamH I and Nde I restriction enzymes, and recovering the target gene fragment by gel after 1% agarose gel electrophoresis detection of complete enzyme cutting, wherein the operation of recovering the target gene fragment after double enzyme cutting is implemented according to the operation instruction of a gel recovery kit;
s1.2, performing double enzyme digestion on the pET24a plasmid by adopting BamH I and Nde I restriction enzymes, and recovering a carrier skeleton by using 1% agarose gel electrophoresis after detecting that enzyme digestion is complete, wherein the operation of recovering the carrier skeleton after double enzyme digestion is implemented according to the operation instruction of a gel recovery kit;
s1.3, mixing the target gene fragment obtained in the step S1.1 with the vector skeleton obtained in the step S1.2, connecting overnight AT 16 ℃ under the action of T4 ligase, then converting the connection product into DH5a competent cells, picking up monoclonal son for sequencing verification, extracting recombinant plasmid with correct sequencing, obtaining recombinant expression vector containing aminotransferase encoding gene, and naming pET24a-AT, wherein the recombination system is 20 mu L, specifically: 2. Mu.L of 10 XT 4 ligase Buffer (Buffer), 5. Mu.L of the target gene fragment, 5. Mu.L of the vector backbone, 2. Mu.L of T4 ligase and 6. Mu.L of double distilled water (ddH) 2 O)。
EXAMPLE 2 construction of directed mutant library of recombinant engineering bacteria
The pET24a-AT constructed in example 1 is used as a DNA template by adopting a site-directed mutagenesis strategy, a point mutation primer is designed by utilizing Oligo7 software according to the amino acid site to be mutated, and mutation is introduced into the 5' end of the upstream and downstream mutation primer in a manner of inserting, replacing or deleting a base, wherein the mutation site is shown in Table 1. It should be noted that, according to the manner of introducing mutation sites, the skilled person can obtain the nucleotide sequence of the mutation primer in combination with the basic principle of primer design. To construct W392I, the nucleotide sequence of the upstream mutation primer is shown as SEQ ID NO:33, the nucleotide sequence of the downstream mutation primer is shown in SEQ ID NO:34, fifteen sets of mutant primer pairs were obtained by a total design to introduce fifteen mutant forms, respectively, as shown in table 1.
The recombinant expression vector pET24a-AT constructed in example 1 was selected as a template, fifteen sets of mutation primer pairs were used as PCR primers, respectively, and inverse PCR was performed using a KOD high-fidelity enzyme kit, thereby obtaining fifteen mutation sequences. Wherein, the reaction procedure of inverse PCR is: pre-denaturation at 95℃for 3min; denaturation at 98℃for 30s, annealing at 55℃for 30s, extension at 68℃for 3min,28 cycles; extending at 72℃for 5min.
Fifteen mutant sequences are treated by Dpn I restriction endonucleases respectively, enzyme digestion products are connected by T4 ligase and then are converted into escherichia coli BL21 (DE 3) competent, then LB resistance plates containing kanamycin are coated, the plates are placed in 37 ℃ for inversion culture for 18 hours, single colony is selected and transferred into LB liquid culture medium containing kanamycin, the culture solution is selected for sample feeding and sequencing, and clones with correct sequencing are preserved for standby, so that recombinant engineering bacteria taking escherichia coli as hosts are obtained, and recombinant engineering bacteria T1 to T15 respectively used for expressing pyruvate decarboxylase mutants M1 to M15 are obtained.
Example 3 inducible expression and post-treatment of recombinant engineering bacteria
The recombinant engineering bacteria obtained in the example 2 are inoculated into LB liquid medium containing 50 mug/mL kanamycin, and cultured at 37 ℃ and 180r/min until the OD600 is 0.6-0.8, thus obtaining seed bacterial liquid. Seed bacterial liquid is inoculated to fresh self-induction culture medium containing kanamycin with the final concentration of 50 mug/mL at the volume concentration of 1%, and the culture medium is obtained after the seed bacterial liquid is placed at 30 ℃ for 18 hours. Centrifuging the culture solution at 25deg.C and 8000r/min for 10min, discarding supernatant to collect precipitate, washing the precipitate with PB buffer solution with pH of 7.0 for several times, and collecting wet thallus for use.
The wet cells thus obtained were resuspended in ultrapure water to obtain a bacterial liquid having a cell concentration (mg/L) of 20%. The bacterial liquid is treated by adopting an ultrasonic crushing method or a high-pressure homogenizing crushing method, and the crushing conditions can be selected according to actual needs. The working parameters of the example ultrasonic disruption method are: crushing for 1s; suspending for 2s; crushing for 10min under the power of 180W. The working parameters of the example high pressure homogeneous crushing method are: the crushing was carried out twice at 50HZ and 800 bar.
After the bacterial liquid is crushed, the bacterial liquid is centrifuged for 10min to 15min at the temperature of 4 ℃ and 12000r/min to remove cell fragments and macromolecular impurities, and the supernatant is collected and stored at the temperature of-20 ℃ and 4 ℃ for standby, and is the enzyme liquid containing the pyruvate decarboxylase mutant.
Comparative example 1
The comparative example provides a pyruvate decarboxylase, the amino acid sequence of which is shown in SEQ ID NO:1, encodes a polypeptide as set forth in SEQ ID NO:1 is shown in the specification, and the nucleotide sequence of the amino acid sequence shown in SEQ ID NO: 2.
The recombinant expression vector pET24a-ZM is transformed into escherichia coli BL21 (DE 3) to be competent, then a LB resistance plate containing kanamycin is coated, the plate is placed in 37 ℃ for inversion culture for 18 hours, single colony is selected and transferred into LB liquid culture medium containing kanamycin, the culture solution is selected for sample feeding and sequencing, and the clone with correct sequencing is preserved for standby, so that recombinant engineering bacterium T0 is obtained.
Recombinant engineering bacteria T0 are subjected to induced expression and post-treatment according to the method of the example 3, and the obtained enzyme solution is stored at-20 ℃ and 4 ℃ for later use.
Experimental example 1 preliminary comparison of catalytic reactivity and optical Selectivity of pyruvate decarboxylase mutants M1 to M15
A100 mL flask was taken, and 40mL of PB buffer (pH 6.5) having a concentration of 0.1mol/L, 10mL of dimethyl sulfoxide (DMSO), and 50 were sequentially added theretomu.L of thiamine pyrophosphate (TPP) at a concentration of 50Mg/mL and 50 mu.L of Mg at a concentration of 2mol/L 2+ Buffer solution, gently stir well, then add substrate: 500. Mu.L of benzaldehyde and 0.6g of sodium pyruvate, and after the substrate was sufficiently dissolved, the pH of the system was adjusted to 6.5 using 10% NaOH to obtain a mother liquor.
A total of sixteen sets of living systems (corresponding to the living system comprising pyruvate decarboxylase mutant M1 to the living system comprising pyruvate decarboxylase mutant M15, and the living system comprising pyruvate decarboxylase of comparative example 1, respectively) were prepared by mixing 1mL of the mother liquor with 5. Mu.L of the enzyme solution comprising the single pyruvate decarboxylase mutant (prepared in example 3). Placing each group of living detection systems in a shaking table at 30 ℃ and 180r/min for reaction for 2 hours, and detecting and analyzing a reaction liquid by using a high performance liquid chromatography (High Performance Liquid Chromatography, HPLC) method after the reaction is finished, wherein the structural formula of a target product in the reaction liquid is shown as the following formula (IV):
Wherein, the instrument model of HPLC is Shimadzu LC-16 detector, and the working condition of HPLC is as follows:
(1) Preparing a sample injection liquid: mixing 500. Mu.L of the reaction solution with 500. Mu.L of paraxylene, and centrifuging to collect the supernatant; taking 10 mu L of supernatant, adding 990 mu L of mobile phase into the supernatant, shaking and mixing uniformly, and then sampling with the sample feeding amount of 10 mu L each time;
(2) Chromatographic column: macroxyloid OD-H column, 250 x 4.6mm,5 μm.
(3) Preparation of mobile phase: anhydrous n-hexane and anhydrous isopropanol were mixed according to n-hexane: the volume ratio of the isopropanol is 9:1 are mixed and prepared.
(4) Flow rate: 1mL/min.
(5) Analysis time: 20min.
(6) Column temperature: 30 ℃.
The substrate conversion (%) was calculated from the decrease amount of the substrate compound, and the calculation formula of the substrate conversion (%) was as follows:
in formula (III), A0 is the substrate peak area and A1 is the product peak area.
As an example, fig. 1 shows an HPLC profile of a reaction solution obtained by reacting a reaction system containing the pyruvate decarboxylase mutant M3 for 2 hours, and fig. 2 shows an HPLC profile of a reaction product obtained by reacting a reaction system containing the pyruvate decarboxylase mutant M15 for 2 hours.
The conversion of the substrates by the respective pyruvate decarboxylase mutants (M1 to M15) and the pyruvate decarboxylase of comparative example 1, and the optical purity of the corresponding target products produced are detailed in Table 3 below:
TABLE 3 preliminary Activity measurement data for pyruvate decarboxylase mutants M1 to M15 and pyruvate decarboxylase of comparative example 1
As can be seen from table 3, the pyruvate decarboxylase mutants M1 to M15 are mutants obtained by making one or more site mutations based on the pyruvate decarboxylase of comparative example 1, the conversion rate of the substrates by M1 to M15 is significantly improved as compared with the pyruvate decarboxylase of comparative example 1, and the optical purity of the target product generated by the corresponding reaction system is also significantly improved. Taking the pyruvate decarboxylase mutant M15 as an example, the conversion rate of the M15 to the substrate can reach 24.32%, which is 12 times of the conversion rate of the pyruvate decarboxylase of the comparative example 1 to the substrate; in addition, the optical purity of the target product produced by the reaction system containing M15 can reach 99.8%, which is 1.1 times the optical purity of the target product produced by the reaction system containing pyruvate decarboxylase in comparative example 1.
In addition, for the pyruvate decarboxylase mutant obtained by adopting a single site mutation mode, the substrate conversion rate of the pyruvate decarboxylase mutant obtained by adopting a W551D point mutation mode is the highest, and then M475K is adopted; for the pyruvate decarboxylase mutant obtained by the multi-site combined mutation mode, the comprehensive catalytic performance of the pyruvate decarboxylase mutant obtained by the T7R, L38M, W551D and M475K combined mutation mode is optimal.
Experimental example 2 counterscreen comparison of the conversion of substrates by pyruvate decarboxylase mutants M1 to M15
Providing a clean three-neck flask with a magnetic stirring rotor, sequentially adding 55g of pure water and 6.4g of sodium pyruvate into the three-neck flask, placing the three-neck flask containing the pure water and the sodium pyruvate into a constant-temperature magnetic stirring water bath kettle at 30 ℃, starting stirring, and sequentially adding 6g of benzaldehyde and 100 mu L of Mg with the concentration of 2mol/L into the three-neck flask 2+ Buffer and 100. Mu.L of thiamine pyrophosphate (TPP) at a concentration of 50mg/mL to obtain a mixed system, pH of the mixed system was adjusted to 5.95.+ -. 0.2 with 10% (mass/volume, w/v) NaOH, and then 7.0mL of an enzyme solution containing a single pyruvate decarboxylase mutant (produced in example 3) or a pyruvate decarboxylase enzyme solution of comparative example 1 was continuously added to a three-necked flask to obtain a reaction system, and pH of the reaction system was adjusted to 6.2.+ -. 0.2 with 50% (mass/volume, w/v) acetic acid. The reaction system was allowed to react at a constant temperature of 30℃for 24 hours.
After the reaction was completed, the target product in the reaction solution (same as in experimental example 1) was detected and analyzed by an HPLC method, and the detection method by HPLC was performed with reference to experimental example 1. As an example, fig. 3 and 4 show HPLC patterns of reaction solutions obtained by reacting a reaction system containing the pyruvate decarboxylase mutant M3 for 30min and 24h, respectively, and fig. 5 and 6 show HPLC patterns of reaction solutions obtained by reacting a reaction system containing the pyruvate decarboxylase mutant M15 for 30min and 24h, respectively.
The conversion data of the individual pyruvate decarboxylase mutants (M1 to M15) and the pyruvate decarboxylase of comparative example 1 on the substrate (benzaldehyde) are detailed in Table 4 below:
TABLE 4 regreen experimental data for pyruvate decarboxylase mutants M1 to M15 and pyruvate decarboxylase of comparative example 1
As can be seen from Table 4, in the re-screening experiments, the conversion of substrates by the pyruvate decarboxylase mutants M1 to M15 was significantly better than that of the pyruvate decarboxylase of comparative example 1. Taking the pyruvate decarboxylase mutant M15 as an example, the conversion rate of the pyruvate decarboxylase mutant M15 to the substrate can reach 95.2%, which is 38 times of the conversion rate of the pyruvate decarboxylase of the comparative example 1 to the substrate. In addition, for the pyruvate decarboxylase mutant obtained by adopting a single site mutation mode, the substrate conversion rate of the pyruvate decarboxylase mutant obtained by adopting a W551D point mutation mode is the highest, and then M475K is adopted; for the pyruvate decarboxylase mutant obtained by the multi-site combined mutation mode, the substrate conversion rate of the pyruvate decarboxylase mutant obtained by the T7R, L38M, W551D and M475K combined mutation mode is highest.
Experimental example 3 comparison of the conversion ratio of the recombinant engineering bacteria T1 to T15 to the substrate under the optimized Process conditions
Providing a clean three-neck flask with a magnetic stirring rotor, adding 55g of pure water and 6.4g of sodium pyruvate into the three-neck flask, placing the three-neck flask containing the pure water and the sodium pyruvate into a constant-temperature magnetic stirring water bath kettle at 30 ℃, starting stirring, and sequentially adding 100 mu L of Mg with the concentration of 2mol/L into the three-neck flask 2+ Buffer solution and 100. Mu.L of thiamine pyrophosphate (TPP) at a concentration of 50mg/mL to obtain a mixed system, adjusting the pH of the mixed system to 5.95.+ -. 0.2 with 10% (mass/volume, w/v) NaOH, then continuing to add 7.5mL of enzyme solution containing a single pyruvate decarboxylase mutant (prepared in example 3) or the pyruvate decarboxylase enzyme solution of comparative example 1 to the three-necked flask to obtain a pre-reaction system, and then slowly dropping 6g of benzaldehyde (dropping speed of 0.4 mL/h) into the pre-reaction system using a peristaltic pump to perform a catalytic reaction, wherein 50% (mass/volume, w/v) acetic acid was used to control the pH of the whole reaction system to 6.2.+ -. 0.2, and the reaction was carried out at a constant temperature of 30 ℃ for 24 hours.
After the reaction was completed, the target product in the reaction solution (same as in experimental example 1) was detected and analyzed by an HPLC method, and the detection method by HPLC was performed with reference to experimental example 1. As an example, fig. 7 and 8 show HPLC profiles of reaction solutions obtained by feeding benzaldehyde 1h and feeding benzaldehyde 24h to a pre-reaction system comprising the pyruvate decarboxylase mutant M3, respectively, and fig. 9 and 10 show HPLC profiles of reaction solutions obtained by feeding benzaldehyde 1h and feeding benzaldehyde 24h to a pre-reaction system comprising the pyruvate decarboxylase mutant M15, respectively.
The conversion data of the individual pyruvate decarboxylase mutants (M1 to M15) and the pyruvate decarboxylase of comparative example 1 on the substrate (benzaldehyde) are detailed in Table 5 below:
TABLE 5 optimization of pyruvate decarboxylase mutants M1 to M15 and pyruvate decarboxylase of comparative example 1 experimental data on process conditions
As can be seen from Table 5, in the experiments with optimized process conditions, the conversion of substrates by the pyruvate decarboxylase mutants M1 to M15 was significantly better than that of the pyruvate decarboxylase of comparative example 1. Taking pyruvic acid decarboxylase mutant M15 as an example, the conversion rate of the pyruvic acid decarboxylase mutant M15 to the substrate can reach 99.7%, which is 11 times of the conversion rate of the pyruvic acid decarboxylase of comparative example 1 to the substrate.
Experimental example 4 comparison of tolerance of pyruvate decarboxylase mutants M1 to M15 to substrate concentration
A clean three-necked flask equipped with a magnetic stirring rotor was provided, and 60g of pure water and 100. Mu.L of Mg having a concentration of 2mol/L were sequentially added to the three-necked flask 2+ Buffer solution and 100. Mu.L of thiamine pyrophosphate (TPP) at a concentration of 50mg/mL to obtain a mixed system, adjusting pH of the mixed system to 5.95.+ -. 0.2 with 10% (mass/volume, w/v) NaOH, then continuing to add 7.0mL of an enzyme solution containing a single pyruvate decarboxylase mutant (prepared in example 3) or a pyruvate decarboxylase enzyme solution of comparative example 1 to a three-necked flask, and adding a predetermined amount of benzaldehyde to obtain a pre-reaction system Four groups of pre-reaction systems are respectively arranged for each enzyme solution, and the four groups of pre-reaction systems are only different in that: the amounts of benzaldehyde added were different, the amount of benzaldehyde added in the first group of pre-reaction systems was 1g (corresponding to 1% of the concentration of benzaldehyde in the reaction system), the amount of benzaldehyde added in the second group of pre-reaction systems was 2g (corresponding to 2% of the concentration of benzaldehyde in the reaction system), the amount of benzaldehyde added in the third group of pre-reaction systems was 4g (corresponding to 3% of the concentration of benzaldehyde in the reaction system), and the amount of benzaldehyde added in the fourth group of pre-reaction systems was 6g (corresponding to 6% of the concentration of benzaldehyde in the reaction system). The three-neck flask containing the pre-reaction system is placed in a constant temperature magnetic stirring water bath kettle at 30 ℃, stirring is started for 1h, then 6.4g of sodium pyruvate is respectively added into each pre-reaction system to correspondingly obtain a reaction system, and 50% (mass/volume, w/v) acetic acid is adopted to adjust the pH value of the reaction system to 6.2+/-0.2. The reaction system was allowed to react at a constant temperature of 30℃for 2 hours.
After the reaction was completed, the target product in the reaction solution (same as in experimental example 1) was detected and analyzed by an HPLC method, and the detection method by HPLC was performed with reference to experimental example 1. The conversion data of the individual pyruvate decarboxylase mutants (M1 to M15) and the pyruvate decarboxylase of comparative example 1 for substrate (benzaldehyde) at different substrate (benzaldehyde) concentrations are shown in Table 6 below:
TABLE 6 tolerance test data for pyruvate decarboxylase mutants M1 to M15 and pyruvate decarboxylase of comparative example 1
As can be seen from table 6, the conversion rate of the pyruvate decarboxylase mutants M1 to M15 and the pyruvate decarboxylase of comparative example 1 to the substrate (benzaldehyde) all tended to decrease with increasing substrate concentration in the reaction system, but the conversion rate of the pyruvate decarboxylase mutants M1 to M15 to the substrate was significantly higher than that of the pyruvate decarboxylase of comparative example 1 at the same substrate concentration, indicating that: the substrate tolerance of the pyruvate decarboxylase mutants M1 to M15 was better than that of the pyruvate decarboxylase of comparative example 1.
When the substrate concentration in the reaction system was increased from 1% to 6%, the conversion rate of the corresponding pyruvate decarboxylase mutant M15 to the substrate was decreased by 69%, while the conversion rate of the pyruvate decarboxylase of comparative example 1 was decreased by 80%, indicating that the tolerance of the pyruvate decarboxylase mutant M15 to the substrate was significantly better than that of the pyruvate decarboxylase of comparative example 1, and when the substrate concentration in the reaction system was 6%, the conversion rate of the pyruvate decarboxylase mutant M15 to the substrate was 60 times that of the pyruvate decarboxylase of comparative example 1.
Experimental example 5 comparison of the thermostability of pyruvate decarboxylase mutants M1 to M15
Providing a clean three-neck flask with a magnetic stirring rotor, sequentially adding 55g of pure water and 6.4g of sodium pyruvate into the three-neck flask, placing the three-neck flask containing the pure water and the sodium pyruvate into a constant-temperature magnetic stirring water bath kettle at 30 ℃, starting stirring, and sequentially adding 6g of benzaldehyde and 100 mu L of Mg with the concentration of 2mol/L into the three-neck flask 2+ Buffer and 100. Mu.L of thiamine pyrophosphate (TPP) at a concentration of 50mg/mL to obtain a mixed system, pH of the mixed system was adjusted to 5.95.+ -. 0.2 with 10% (mass/volume, w/v) NaOH, and then 7.0mL of an enzyme solution containing a single pyruvate decarboxylase mutant (prepared in example 3) or a pyruvate decarboxylase enzyme solution of comparative example 1 was continuously added to a three-necked flask to obtain reaction systems, each enzyme solution was provided with two sets of reaction systems, respectively, differing in that the two sets of reaction systems were merely: one group of added enzyme solutions is not subjected to heat treatment, and the other group of added enzyme solutions is the enzyme solution after heat treatment at a constant temperature of 50 ℃ for 60 min. The pH of each reaction system was adjusted to 6.2.+ -. 0.2 with 50% (mass/volume, w/v) acetic acid, and the reaction system was allowed to react at a constant temperature of 30℃for 2 hours.
After the reaction was completed, the target product in the reaction solution (same as in experimental example 1) was detected and analyzed by an HPLC method, and the detection method by HPLC was performed with reference to experimental example 1. The conversion of the substrate (benzaldehyde) and the conversion of the substrate (benzaldehyde) after heat treatment of the respective pyruvate decarboxylase mutants (M1 to M15) and the pyruvate decarboxylase of comparative example 1 were described in Table 7 below:
TABLE 7 thermal stability test data for pyruvate decarboxylase mutants M1 to M15 and pyruvate decarboxylase of comparative example 1
As can be seen from Table 7, the thermostable performance of the pyruvate decarboxylase mutants M1 to M15 was significantly better than that of the pyruvate decarboxylase of comparative example 1. Although the substrate conversion rate after heat treatment was reduced as compared with that after heat treatment, the conversion rate of the substrate was reduced by 6.3% for the pyruvate decarboxylase mutants M1 to M15, taking the pyruvate decarboxylase mutant M15 as an example, whereas the pyruvate decarboxylase of comparative example 1 was substantially lost in catalytic activity for the substrate after heat treatment.
From experimental examples 1 to 5, the comprehensive properties of the pyruvate decarboxylase mutants M1 to M15 are obviously better than those of the pyruvate decarboxylase of comparative example 1, and the pyruvate decarboxylase mutants M1 to M15 have the advantages of higher catalytic activity, better optical selectivity and better stability than the pyruvate decarboxylase of comparative example 1.
The application provides a pyruvic acid decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compounds. The principles and embodiments of the present application have been described herein with reference to specific examples, the description of the above examples is only for aiding in understanding the technical solution of the present application and its core ideas; those of ordinary skill in the art will appreciate that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit of the technical solutions according to the embodiments of the present application.
Sequence listing
<110> Taizhou enzyme Biotechnology Co., ltd
<120> pyruvate decarboxylase mutant and application thereof in preparing alpha-hydroxyketone compound
<130> SUP220088CN
<160> 34
<170> SIPOSequenceListing 1.0
<210> 1
<211> 568
<212> PRT
<213> artificial sequence
<400> 1
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 2
<211> 1707
<212> DNA
<213> artificial sequence
<400> 2
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 3
<211> 568
<212> PRT
<213> artificial sequence
<400> 3
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Ile Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 4
<211> 1707
<212> DNA
<213> artificial sequence
<400> 4
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tctatcttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 5
<211> 568
<212> PRT
<213> artificial sequence
<400> 5
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Met Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 6
<211> 1707
<212> DNA
<213> artificial sequence
<400> 6
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatggaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 7
<211> 568
<212> PRT
<213> artificial sequence
<400> 7
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 8
<211> 1707
<212> DNA
<213> artificial sequence
<400> 8
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 9
<211> 568
<212> PRT
<213> artificial sequence
<400> 9
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Lys Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 10
<211> 1707
<212> DNA
<213> artificial sequence
<400> 10
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccacta agtggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 11
<211> 568
<212> PRT
<213> artificial sequence
<400> 11
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Leu His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 12
<211> 1707
<212> DNA
<213> artificial sequence
<400> 12
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgctgca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 13
<211> 568
<212> PRT
<213> artificial sequence
<400> 13
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Pro Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 14
<211> 1707
<212> DNA
<213> artificial sequence
<400> 14
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gccctgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 15
<211> 568
<212> PRT
<213> artificial sequence
<400> 15
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 16
<211> 1707
<212> DNA
<213> artificial sequence
<400> 16
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 17
<211> 568
<212> PRT
<213> artificial sequence
<400> 17
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 18
<211> 1707
<212> DNA
<213> artificial sequence
<400> 18
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 19
<211> 568
<212> PRT
<213> artificial sequence
<400> 19
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Ala Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Tyr Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 20
<211> 1707
<212> DNA
<213> artificial sequence
<400> 20
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tctgcattca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attgtataaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 21
<211> 568
<212> PRT
<213> artificial sequence
<400> 21
Met Ser Tyr Thr Val Gly Thr Trp Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Met Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 22
<211> 1707
<212> DNA
<213> artificial sequence
<400> 22
atgagttata ctgtcggtac ctggttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctatgaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 23
<211> 568
<212> PRT
<213> artificial sequence
<400> 23
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Tyr Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Glu Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 24
<211> 1707
<212> DNA
<213> artificial sequence
<400> 24
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgctatact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagaa gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 25
<211> 568
<212> PRT
<213> artificial sequence
<400> 25
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Met Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 26
<211> 1707
<212> DNA
<213> artificial sequence
<400> 26
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tatgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 27
<211> 568
<212> PRT
<213> artificial sequence
<400> 27
Met Ser Tyr Thr Val Gly Arg Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Met Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gly Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 28
<211> 1707
<212> DNA
<213> artificial sequence
<400> 28
atgagttata ctgtcggtcg atatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tatgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctggtatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 29
<211> 568
<212> PRT
<213> artificial sequence
<400> 29
Met Ser Tyr Thr Val Gly Arg Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Tyr Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 30
<211> 1707
<212> DNA
<213> artificial sequence
<400> 30
atgagttata ctgtcggtcg atatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggttatc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 31
<211> 568
<212> PRT
<213> artificial sequence
<400> 31
Met Ser Tyr Thr Val Gly Arg Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Met Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 32
<211> 1707
<212> DNA
<213> artificial sequence
<400> 32
atgagttata ctgtcggtcg atatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tatgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 33
<211> 30
<212> DNA
<213> artificial sequence
<400> 33
attttcaatg ctcagcgcat gaagctcccg 30
<210> 34
<211> 30
<212> DNA
<213> artificial sequence
<400> 34
agagtcaccg gtttcagcaa taaccgtcgt 30
Claims (12)
1. A pyruvate decarboxylase mutant, characterized in that the amino acid sequence of the pyruvate decarboxylase mutant is identical to SEQ ID NO:1, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% similarity, and the amino acid sequence of the pyruvate decarboxylase mutant is represented by SEQ ID NO:1, wherein the amino acid sequence is obtained by one or more point mutations in the amino acid sequence shown in 1; the pyruvate decarboxylase mutant has pyruvate decarboxylase activity.
2. The pyruvate decarboxylase mutant according to claim 1, characterized by the fact that the amino acid at which the point mutation takes place comprises the amino acid sequence of SEQ ID NO:1, at least one of amino acid T at position 7, amino acid T at position 8, amino acid L at position 38, amino acid N at position 169, amino acid a at position 246, amino acid G at position 294, amino acid W at position 392, amino acid V at position 450, amino acid Q at position 452, amino acid I at position 472, amino acid M at position 475, amino acid I at position 476, amino acid V at position 549, amino acid W at position 551, amino acid K at position 553, or amino acid V at position 555.
3. The pyruvate decarboxylase mutant of claim 2 wherein the point mutation is effected by at least one of T7R, T8W, L M, N169Y, A246M, G294K, W A, V450E, Q452G, I472M, M475K, I476L, V549Y, W551D, K553Y or V555P.
4. A pyruvate decarboxylase mutant according to claim 3 characterized by the fact that the point mutation occurs in any of the following ways:
(1)W392I;
(2)I472M;
(3)W551D;
(4)G294K;
(5)I476L;
(6)V555P;
(7)M475K;
(8) W551D and M475K;
(9) V549Y and W392A;
(10) T8W and a246M;
(11) N169Y and V450E;
(12) M475K, L M and W551D;
(13) T7R, L M and Q452G;
(14) T7R, K Y and M475K; or (b)
(15) T7R, L M, W551D and M475K.
5. A pyruvate decarboxylase mutant according to claim 3 characterized by its amino acid sequence selected from the group consisting of SEQ ID NOs: 3. SEQ ID NO: 5. SEQ ID NO: 7. SEQ ID NO: 9. SEQ ID NO: 11. SEQ ID NO: 13. SEQ ID NO: 15. SEQ ID NO: 17. SEQ ID NO: 19. SEQ ID NO: 21. SEQ ID NO: 23. SEQ ID NO: 25. SEQ ID NO: 27. SEQ ID NO:29 or SEQ ID NO:31, and a polypeptide comprising the amino acid sequence shown in any one of seq id no.
6. A nucleic acid molecule comprising a nucleotide sequence encoding a pyruvate decarboxylase mutant according to any one of claims 1 to 5.
7. The nucleic acid molecule of claim 6, wherein said nucleotide sequence is selected from the group consisting of SEQ ID NOs: 4. SEQ ID NO: 6. SEQ ID NO: 8. SEQ ID NO: 10. SEQ ID NO: 12. SEQ ID NO: 14. SEQ ID NO: 16. SEQ ID NO: 18. SEQ ID NO: 20. SEQ ID NO: 22. SEQ ID NO: 24. SEQ ID NO: 26. SEQ ID NO: 28. SEQ ID NO:30 or SEQ ID NO:32, or a nucleotide sequence as set forth in any one of seq id no.
8. A recombinant expression vector comprising a vector and the nucleic acid molecule of claim 6 or 7; the vector is selected from a plasmid, cosmid, phage or viral vector.
9. A recombinant expression transformant comprising a host, and the nucleic acid molecule according to claim 6 or 7, or the recombinant expression vector according to claim 8, introduced into the host; the host is selected from eukaryotes or prokaryotes.
10. A method for preparing a pyruvate decarboxylase mutant, characterized by culturing the recombinant expression transformant of claim 9, and obtaining said pyruvate decarboxylase mutant from the culture.
11. A method for preparing chiral alpha-hydroxy ketone compound, characterized in that the pyruvate decarboxylase mutant as defined in any one of claims 1 to 5 or the pyruvate decarboxylase mutant prepared by the preparation method as defined in claim 10 is used as a catalyst, and the first compound and the second compound are contacted and reacted to generate alpha-hydroxy ketone compound;
wherein the first compound has a structure represented by the following general formula (I):
in the general formula (I), R 1 Selected from hydrogen atoms or hydroxyl groups, and X is selected from hydrogen atoms or monovalent metal ions.
The second compound has a structure represented by the following general formula (II):
in the general formula (II), R 2 Selected from aryl or heteroaryl groups.
12. The method according to claim 11, wherein the first compound is at least one selected from pyruvic acid, hydroxypyruvic acid and sodium pyruvate, and the second compound is benzaldehyde.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210361627.4A CN116790569B (en) | 2022-04-07 | 2022-04-07 | Pyruvic acid decarboxylase mutant and application thereof in preparation of alpha-hydroxyketone compound |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210361627.4A CN116790569B (en) | 2022-04-07 | 2022-04-07 | Pyruvic acid decarboxylase mutant and application thereof in preparation of alpha-hydroxyketone compound |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116790569A true CN116790569A (en) | 2023-09-22 |
CN116790569B CN116790569B (en) | 2024-04-26 |
Family
ID=88038428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210361627.4A Active CN116790569B (en) | 2022-04-07 | 2022-04-07 | Pyruvic acid decarboxylase mutant and application thereof in preparation of alpha-hydroxyketone compound |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116790569B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102994390A (en) * | 2011-09-19 | 2013-03-27 | 浙江齐成碳能科技有限公司 | Genetic engineering energy microbe constructed by utilizing point mutation |
US20130177953A1 (en) * | 2010-04-13 | 2013-07-11 | Indian Institute Of Technology, Bombay | Recombinant process for the production of r-aromatic alpha hydroxy ketones |
US20160376580A1 (en) * | 2014-03-11 | 2016-12-29 | Ajinomoto Co., Inc. | Method of Producing 1,5-Pentadiamine Using Lysine Decarboxylase Mutant Having Improved Thermal Stability |
CN106520716A (en) * | 2016-10-28 | 2017-03-22 | 杭州酶易生物技术有限公司 | Thermophilic ketone reductase mutant and application thereof |
CN107312766A (en) * | 2017-08-07 | 2017-11-03 | 上海凌凯医药科技有限公司 | The PDC-W392A that a kind of enzyme activity is improved |
-
2022
- 2022-04-07 CN CN202210361627.4A patent/CN116790569B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130177953A1 (en) * | 2010-04-13 | 2013-07-11 | Indian Institute Of Technology, Bombay | Recombinant process for the production of r-aromatic alpha hydroxy ketones |
CN102994390A (en) * | 2011-09-19 | 2013-03-27 | 浙江齐成碳能科技有限公司 | Genetic engineering energy microbe constructed by utilizing point mutation |
US20160376580A1 (en) * | 2014-03-11 | 2016-12-29 | Ajinomoto Co., Inc. | Method of Producing 1,5-Pentadiamine Using Lysine Decarboxylase Mutant Having Improved Thermal Stability |
CN106520716A (en) * | 2016-10-28 | 2017-03-22 | 杭州酶易生物技术有限公司 | Thermophilic ketone reductase mutant and application thereof |
CN107312766A (en) * | 2017-08-07 | 2017-11-03 | 上海凌凯医药科技有限公司 | The PDC-W392A that a kind of enzyme activity is improved |
Non-Patent Citations (3)
Title |
---|
ELINA BALODITE ET AL.: "Translocation of Zymomonas mobilis pyruvate decarboxylase to periplasmic compartment for production of acetaldehyde outside the cytosol", 《MICROBIOLOGYOPEN.》, vol. 8, no. 809, pages 1 - 6 * |
HEIKE BRUHN ET AL.: "The replacement of Trp392 by alanine influences the decarboxylase/carboligase activity and stability of pyruvate decarboxylase from Zymomonas mobilis", 《EUR. J. BIOCHERN.》, vol. 234, pages 650 - 655, XP000607025, DOI: 10.1111/j.1432-1033.1995.650_b.x * |
朱碧云等: "丙酮酸脱羧酶及其应用研究", 《生命科学》, vol. 22, no. 11, pages 1184 - 1191 * |
Also Published As
Publication number | Publication date |
---|---|
CN116790569B (en) | 2024-04-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109593750B (en) | Nitrile hydratase mutant, genetic engineering bacterium containing same and application thereof | |
CN111269900B (en) | Preparation and application of L-amino acid deaminase mutant | |
CN109777763B (en) | Genetically engineered bacterium for producing L-theanine and construction and application thereof | |
EP3680340A1 (en) | Method for enzymatic preparation of r-3-aminobutyric acid | |
CN112143764B (en) | Method for preparing intermediate compound of brivaracetam by using biological enzyme catalysis | |
CN111321129A (en) | Engineered ketoreductase polypeptides and uses thereof | |
CN109468291B (en) | Carbonyl reductase EbSDR8 mutant and construction method and application thereof | |
CN111808829B (en) | Gamma-glutamyl methylamine synthetase mutant and application thereof | |
CN113088501A (en) | Glutamic acid dehydrogenase mutant for producing L-glufosinate-ammonium and L-glufosinate-ammonium production method | |
CN113832120B (en) | Formaldehyde conversion mutant protein and application thereof | |
CN111454918B (en) | Enol reductase mutant and application thereof in preparation of (R) -citronellal | |
CN116790569B (en) | Pyruvic acid decarboxylase mutant and application thereof in preparation of alpha-hydroxyketone compound | |
CN118291419A (en) | Mutant of thermostable aminotransferase and application thereof | |
CN112908417A (en) | Gene mining method combining functional sequence and structure simulation, NADH (nicotinamide adenine dinucleotide) preference type glufosinate dehydrogenase mutant and application | |
CN114277022B (en) | Nitrile hydratase mutant with high activity and high thermal stability | |
CN105593368B (en) | Recombinant microorganism having increased ability to produce 2,3-butanediol and method for producing 2,3-butanediol using same | |
CN115975964A (en) | High-activity ketopantolactone reductase mutant and coding gene and application thereof | |
CN110938608A (en) | Aldehyde ketone reductase mutant, encoding gene and application of aldehyde ketone reductase mutant in synthesis of (S) -TCPE | |
CN114752574B (en) | Enzyme catalysis system, catalase, preparation method and application | |
CN115011569B (en) | Bose NemR-PS mutant and application thereof in preparation of (S) -citronellol | |
CN109370997B (en) | Phenylalanine aminomutase mutant | |
CN114410599B (en) | Carbonyl reductase mutant and application thereof in preparation of rosuvastatin chiral intermediate | |
CN109337891B (en) | Phenylalanine aminomutase mutant with improved thermal stability | |
CN116496999A (en) | Monoamine oxidase mutant and application thereof in preparation of balciclovir | |
CN110004119B (en) | -ketoester reductase mutant and application thereof in catalyzing and synthesizing (R) -alpha-lipoic acid precursor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |