CN115125222A - 利用10-去乙酰巴卡亭III10β-O-乙酰转移酶突变体催化合成紫杉醇及其类似物 - Google Patents
利用10-去乙酰巴卡亭III10β-O-乙酰转移酶突变体催化合成紫杉醇及其类似物 Download PDFInfo
- Publication number
- CN115125222A CN115125222A CN202110316454.XA CN202110316454A CN115125222A CN 115125222 A CN115125222 A CN 115125222A CN 202110316454 A CN202110316454 A CN 202110316454A CN 115125222 A CN115125222 A CN 115125222A
- Authority
- CN
- China
- Prior art keywords
- leu
- val
- ser
- gly
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- YWLXLRUDGLRYDR-ZHPRIASZSA-N 5beta,20-epoxy-1,7beta,10beta,13alpha-tetrahydroxy-9-oxotax-11-ene-2alpha,4alpha-diyl 4-acetate 2-benzoate Chemical compound O([C@H]1[C@H]2[C@@](C([C@H](O)C3=C(C)[C@@H](O)C[C@]1(O)C3(C)C)=O)(C)[C@@H](O)C[C@H]1OC[C@]12OC(=O)C)C(=O)C1=CC=CC=C1 YWLXLRUDGLRYDR-ZHPRIASZSA-N 0.000 title claims abstract description 70
- 229930012538 Paclitaxel Natural products 0.000 title claims abstract description 45
- 229960001592 paclitaxel Drugs 0.000 title claims abstract description 45
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 title claims abstract description 43
- 230000015572 biosynthetic process Effects 0.000 title abstract description 6
- 238000003786 synthesis reaction Methods 0.000 title abstract description 5
- 239000003054 catalyst Substances 0.000 title description 5
- TYLVGQKNNUHXIP-MHHARFCSSA-N 10-deacetyltaxol Chemical compound O([C@H]1[C@H]2[C@@](C([C@H](O)C3=C(C)[C@@H](OC(=O)[C@H](O)[C@@H](NC(=O)C=4C=CC=CC=4)C=4C=CC=CC=4)C[C@]1(O)C3(C)C)=O)(C)[C@@H](O)C[C@H]1OC[C@]12OC(=O)C)C(=O)C1=CC=CC=C1 TYLVGQKNNUHXIP-MHHARFCSSA-N 0.000 claims abstract description 80
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 claims abstract description 49
- 102000008300 Mutant Proteins Human genes 0.000 claims abstract description 43
- 108010021466 Mutant Proteins Proteins 0.000 claims abstract description 43
- 125000002252 acyl group Chemical group 0.000 claims abstract description 24
- 239000000758 substrate Substances 0.000 claims abstract description 21
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims abstract description 17
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 14
- 125000003147 glycosyl group Chemical group 0.000 claims abstract description 14
- 239000002773 nucleotide Substances 0.000 claims abstract description 10
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 10
- 229940123237 Taxane Drugs 0.000 claims abstract description 8
- 230000035772 mutation Effects 0.000 claims description 58
- 108090000623 proteins and genes Proteins 0.000 claims description 35
- 102000004169 proteins and genes Human genes 0.000 claims description 31
- 102000004190 Enzymes Human genes 0.000 claims description 27
- 108090000790 Enzymes Proteins 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 24
- 239000013612 plasmid Substances 0.000 claims description 15
- 102000004157 Hydrolases Human genes 0.000 claims description 12
- 108090000604 Hydrolases Proteins 0.000 claims description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 9
- 150000001413 amino acids Chemical class 0.000 claims description 8
- 230000008878 coupling Effects 0.000 claims description 7
- 238000010168 coupling process Methods 0.000 claims description 7
- 238000005859 coupling reaction Methods 0.000 claims description 7
- 230000002255 enzymatic effect Effects 0.000 claims description 5
- 101000885693 Taxus cuspidata 10-deacetylbaccatin III 10-O-acetyltransferase Proteins 0.000 claims description 4
- 240000000599 Lentinula edodes Species 0.000 claims description 3
- 230000010933 acylation Effects 0.000 claims description 2
- 238000005917 acylation reaction Methods 0.000 claims description 2
- CRFNGMNYKDXRTN-CITAKDKDSA-N butyryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CRFNGMNYKDXRTN-CITAKDKDSA-N 0.000 claims description 2
- 102000037865 fusion proteins Human genes 0.000 claims description 2
- 108020001507 fusion proteins Proteins 0.000 claims description 2
- QAQREVBBADEHPA-IEXPHMLFSA-N propionyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 QAQREVBBADEHPA-IEXPHMLFSA-N 0.000 claims description 2
- 102200102592 rs104893740 Human genes 0.000 claims description 2
- 102200006532 rs112445441 Human genes 0.000 claims description 2
- 102220014333 rs112445441 Human genes 0.000 claims description 2
- 102220197780 rs121434596 Human genes 0.000 claims description 2
- 102220083576 rs143494325 Human genes 0.000 claims description 2
- 102220083998 rs551990619 Human genes 0.000 claims description 2
- GKNWJYPYFJMILD-UHFFFAOYSA-N 1-[4-(2h-triazolo[4,5-c]pyridin-4-ylperoxy)-2h-triazolo[4,5-c]pyridin-6-yl]decan-1-one Chemical compound N=1C(C(=O)CCCCCCCCC)=CC=2NN=NC=2C=1OOC1=NC=CC2=C1N=NN2 GKNWJYPYFJMILD-UHFFFAOYSA-N 0.000 claims 1
- 230000003301 hydrolyzing effect Effects 0.000 claims 1
- 229930182986 10-Deacetyltaxol Natural products 0.000 abstract description 37
- 229940100228 acetyl coenzyme a Drugs 0.000 abstract description 11
- -1 10-deacetyl taxane Chemical group 0.000 abstract description 5
- 244000162450 Taxus cuspidata Species 0.000 description 58
- 235000009065 Taxus cuspidata Nutrition 0.000 description 58
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 46
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 46
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 46
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 46
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 46
- 108010013835 arginine glutamate Proteins 0.000 description 42
- 235000018102 proteins Nutrition 0.000 description 26
- 108020004414 DNA Proteins 0.000 description 24
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 24
- 241000880493 Leptailurus serval Species 0.000 description 24
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 24
- 230000003197 catalytic effect Effects 0.000 description 24
- OVMSOCFBDVBLFW-VHLOTGQHSA-N 5beta,20-epoxy-1,7beta,13alpha-trihydroxy-9-oxotax-11-ene-2alpha,4alpha,10beta-triyl 4,10-diacetate 2-benzoate Chemical compound O([C@@H]1[C@@]2(C[C@H](O)C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)O)C(=O)C1=CC=CC=C1 OVMSOCFBDVBLFW-VHLOTGQHSA-N 0.000 description 23
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 23
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 23
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 23
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 23
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 23
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 23
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 23
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 23
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 23
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 23
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 23
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 23
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 23
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 23
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 23
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 23
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 23
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 23
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 23
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 23
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 23
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 23
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 23
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 23
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 23
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 23
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 23
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 23
- 108010090461 DFG peptide Proteins 0.000 description 23
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 23
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 23
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 23
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 23
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 23
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 23
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 23
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 23
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 23
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 23
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 23
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 23
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 23
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 23
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 23
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 23
- FBCURAVMSXNOLP-JYJNAYRXSA-N His-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBCURAVMSXNOLP-JYJNAYRXSA-N 0.000 description 23
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 23
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 23
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 23
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 23
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 23
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 23
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 23
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 23
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 23
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 23
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 23
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 23
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 23
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 23
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 23
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 23
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 23
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 23
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 23
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 23
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 23
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 23
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 23
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 23
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 23
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 23
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 23
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 23
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 23
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 23
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 23
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 23
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 23
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 23
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 23
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 23
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 23
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 23
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 23
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 23
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 23
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 23
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 23
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 23
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 23
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 23
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 23
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 23
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 23
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 23
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 23
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 23
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 23
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 23
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 23
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 23
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 23
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 23
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 23
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 23
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 23
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 23
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 23
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 23
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 23
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 23
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 23
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 23
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 23
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 23
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 23
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 23
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 23
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 23
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 23
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 23
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 23
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 23
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 23
- 108010047857 aspartylglycine Proteins 0.000 description 23
- 108010016616 cysteinylglycine Proteins 0.000 description 23
- 108010078144 glutaminyl-glycine Proteins 0.000 description 23
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 23
- 108010049041 glutamylalanine Proteins 0.000 description 23
- 108010079547 glutamylmethionine Proteins 0.000 description 23
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 23
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 23
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 23
- 108010050848 glycylleucine Proteins 0.000 description 23
- 108010037850 glycylvaline Proteins 0.000 description 23
- 108010040030 histidinoalanine Proteins 0.000 description 23
- 108010036413 histidylglycine Proteins 0.000 description 23
- 108010092114 histidylphenylalanine Proteins 0.000 description 23
- 108010018006 histidylserine Proteins 0.000 description 23
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 23
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 23
- 108010000761 leucylarginine Proteins 0.000 description 23
- 108010064235 lysylglycine Proteins 0.000 description 23
- 108010054155 lysyllysine Proteins 0.000 description 23
- 108010090894 prolylleucine Proteins 0.000 description 23
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 23
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 23
- 108010073969 valyllysine Proteins 0.000 description 23
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 22
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 20
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 19
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 238000006640 acetylation reaction Methods 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 239000006225 natural substrate Substances 0.000 description 13
- 229930014667 baccatin III Natural products 0.000 description 12
- 230000021736 acetylation Effects 0.000 description 11
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 9
- 241000588724 Escherichia coli Species 0.000 description 9
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 9
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 8
- 239000000370 acceptor Substances 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 239000000872 buffer Substances 0.000 description 7
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- 241001116500 Taxus Species 0.000 description 6
- 235000001014 amino acid Nutrition 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- BHZOKUMUHVTPBX-UHFFFAOYSA-M sodium acetic acid acetate Chemical compound [Na+].CC(O)=O.CC([O-])=O BHZOKUMUHVTPBX-UHFFFAOYSA-M 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 241001149649 Taxus wallichiana var. chinensis Species 0.000 description 5
- 238000006555 catalytic reaction Methods 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 108030002602 10-deacetylbaccatin III 10-O-acetyltransferases Proteins 0.000 description 4
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 4
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 4
- 239000007853 buffer solution Substances 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 3
- 238000007036 catalytic synthesis reaction Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 3
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 238000005580 one pot reaction Methods 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 102200130585 rs778210210 Human genes 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 108091008324 binding proteins Proteins 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000005515 coenzyme Substances 0.000 description 2
- 239000000287 crude extract Substances 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 238000000034 method Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 230000009871 nonspecific binding Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000007974 sodium acetate buffer Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 125000002456 taxol group Chemical group 0.000 description 2
- 238000004809 thin layer chromatography Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 125000003241 10-deacetylbaccatin III group Chemical group 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 101001074429 Bacillus subtilis (strain 168) Polyketide biosynthesis acyltransferase homolog PksD Proteins 0.000 description 1
- 101000936617 Bacillus velezensis (strain DSM 23117 / BGSC 10A6 / FZB42) Polyketide biosynthesis acyltransferase homolog BaeD Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- 235000001715 Lentinula edodes Nutrition 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- 241000015728 Taxus canadensis Species 0.000 description 1
- 241000013871 Taxus globosa Species 0.000 description 1
- 241001674343 Taxus x media Species 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 1
- 101710159648 Uncharacterized protein Proteins 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical group [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 238000006701 autoxidation reaction Methods 0.000 description 1
- 125000003834 baccatin III group Chemical group 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000012468 concentrated sample Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 239000012084 conversion product Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 239000006167 equilibration buffer Substances 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 239000000413 hydrolysate Substances 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 229960001330 hydroxycarbamide Drugs 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 239000012452 mother liquor Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 230000020175 protein destabilization Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000012807 shake-flask culturing Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- KFFHSFCOKCGBBW-VCPDXWRASA-N taxa-4(20),11-diene-2alpha,5alpha,10beta,14beta-tetrayl tetraacetate Chemical compound CC(=O)O[C@H]1C[C@]2(C)CC[C@H](OC(C)=O)C(=C)[C@H]2[C@H](OC(C)=O)[C@@H]2[C@@H](OC(=O)C)CC(C)=C1C2(C)C KFFHSFCOKCGBBW-VCPDXWRASA-N 0.000 description 1
- DKPFODGZWDEEBT-QFIAKTPHSA-N taxane Chemical class C([C@]1(C)CCC[C@@H](C)[C@H]1C1)C[C@H]2[C@H](C)CC[C@@H]1C2(C)C DKPFODGZWDEEBT-QFIAKTPHSA-N 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/01167—10-Deacetylbaccatin III 10-O-acetyltransferase (2.3.1.167)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明提供了10‑去乙酰巴卡亭III 10β‑O‑乙酰转移酶(DBAT)一系列突变体蛋白,这些蛋白均能专一性地将乙酰辅酶A等的酰基转移到10‑去乙酰紫杉烷的C10位羟基上,生成紫杉醇或其类似物。本发明涉及DBAT突变体的氨基酸序列以及编码这些氨基酸序列的核苷酸序列,以及DBAT突变体蛋白在利用10‑去乙酰紫杉醇等10‑去乙酰紫杉烷底物合成紫杉醇或其类似物上的应用;特别是将其与能够专一性水解7‑木糖‑10‑去乙酰紫杉醇的糖基水解酶相偶联,以7‑木糖‑10‑去乙酰紫杉醇为底物,以乙酰辅酶A为酰基供体直接合成紫杉醇。
Description
本申请是申请日为2016年5月24日、发明名称为“10-去乙酰巴卡亭III 10β-O-乙酰转移酶突变体及其在催化合成紫杉醇及其类似物中的应用”的申请号为201610346558.4的专利申请的分案申请。
技术领域
本发明涉及10-去乙酰巴卡亭III 10β-O-乙酰转移酶(DBAT)的一系列突变体蛋白,将来自但不限于乙酰辅酶A的酰基转移到10-去乙酰紫杉烷上,生成紫杉醇或其类似物;突变体蛋白的酰基受体包括但不限于10-去乙酰紫杉醇和10-去乙酰巴卡亭III。本发明涉及这些突变体蛋白的氨基酸序列以及编码这些氨基酸序列的核苷酸序列及其应用。
技术背景
紫杉醇(paclitaxel,)是一个具有确切抗肿瘤疗效、主要来自于红豆杉但天然含量极低的“重磅炸弹”式药物,而10-去乙酰巴卡亭III 10β-O-乙酰转移酶(DBAT)则是紫杉醇生物合成途径的一个重要酶,催化该途径的中间体10-去乙酰巴卡亭III(10-DAB)C10位上羟基的乙酰化反应,形成巴卡亭III,后者再经过若干步反应最终形成具有复杂结构的二萜类化合物——紫杉醇。
1996年,Zocher等首次报道以乙酰辅酶A为酰基供体、用来自欧洲红豆杉(Taxusbaccata) 的根部(蛋白)粗提液催化10-DAB的C10位羟基乙酰化形成巴卡亭III,该粗提液显示区域选择性,即只对C10羟基具有乙酰化作用,而对于10-DAB的C1、C7和C13位的游离羟基不起作用[Zocher,R,et al.Biosynthesis of Taxol:enzymatic acetylation of 10-deacetylbaccatin-III to baccatin-III in crude extracts from roots of Taxusbaccata.Biochem Biophys Res Commun.,1996, 229(1):16-20]。之后,Pennington等报道分别从东北红豆杉(Taxus cuspidata)的叶和悬浮培养细胞中得到了部分纯化的DBAT,均能在乙酰辅酶A存在的条件下催化10-DAB形成巴卡亭III;但如果以10-去乙酰紫杉醇(DT)为底物,则不能得到肯定的结果,表现在产物紫杉醇生成的不确定性、或因其产量不足而无显著性统计学意义,并认为最有可能的解释是:紫杉醇的时隐时现源于粗酶液中污染一种尚未被表征的乙酰辅酶A:10-去乙酰紫杉醇-O-乙酰转移酶 [Pennington,JJ,etal.Acetyl CoA:10-deacetylbaccatin-III-10-O-acetyltransferase activity inleaves and cell suspension cultures of Taxus cuspidata.Phytochemistry,1998,49(8):2261-2266]。上述两篇文献仅涉及到DBAT粗酶液,其中夹杂着其他未被表征的蛋白质(或酶),且其中的DBAT都未被表征;反应产物的鉴定也仅限于薄层色谱(TLC)、高效液相色谱(HPLC)和同位素扫描,没有进行严格的波谱学证明,因此证据尚不够充分。1999年Menhard等报道从中国红豆杉(Taxus chinensis)的悬浮细胞中纯化到DBAT,该酶为单体蛋白,表观分子量为71±1.5kDa,最适pH 和最适温度分别为9.0和35℃,pI为5.6[Menhard B,Zenk MH.Purification and characterization of acetyl coenzyme A:10-hydroxytaxane O-acetyltransferase from cell suspension cultures of Taxuschinensis.Phytochemistry,1999,50:763-774]。该细胞系在优化条件下可产生高达150mg/L云南紫杉烷C(taxuyunnanine C,该化合物不含四元氧环),以该化合物为原料水解制备出一系列去乙酰化的化合物(10-deacetyltaxuyunnanine C,10,14-deacetyltaxuyunnanine C, 5,10,14-deacetyltaxuyunnanine C,2,10,14-deacetyltaxuyunnanine C, 2,5,10,14-deacetyltaxuyunnanine C)。催化实验证明该酶均能将这些化合物的C10位羟基乙酰化,但不能在其他位置上乙酰化,说明该酶具有区域选择性。还发现该酶也能将10-DAB的C10位羟基乙酰化,但对10表-10-DAB(10-epi-10-DAB)则不起作用,表现为立体选择性[Menhard B, Zenk MH.Purification andcharacterization of acetyl coenzyme A:10-hydroxytaxane O-acetyltransferasefrom cell suspension cultures of Taxus chinensis.Phytochemistry,1999, 50:763-774]。
2000年Croteau实验室首次报道从东北红豆杉(Taxus cuspidata)中克隆到DBATcDNA [Walker K,Croteau R.Molecular cloning of a 10-deacetylbaccatin III-10-O-acetyltransferase cDNA from Taxus and functional expression in Escherichiacoli.Proc Natl Acad Sci USA.,2000, 97(2):583-587;Croteau et al.Transacylasesof the paclitaxel biosynthetic pathway.US7,153,676B1, Date of patent:Dec.26,2006],并在大肠杆菌中实现异源表达。该重组酶的最适pH为7.4,可以将乙酰辅酶A上的乙酰基转移到10-DAB上,得到产物巴卡亭III。该酶同样具有区域选择性,对于10-DAB的1β-、7β-、13α-位羟基则不能进行乙酰化。之后,其他实验室也相继从欧洲红豆杉等植物中克隆到该酶的编码基因[Fang J,Ewald D.Expression cloned cDNA for 10-deacetylbaccatinIII-10-O-acetyltransferase in Escherichia coli:a comparative study of threefusion systems.Protein Expr Purif.,2004,35(1):17-24;Guo,BH,et al.Molecularcloning and heterologous expression of a 10-deacetylbaccatin III-10-O-acetyltransferase cDNA from Taxus x media.Mol Biol Rep.,2007,34(2):89-95;程抒劼,等.南方红豆杉10-去乙酰巴卡亭III-10-乙酰转移酶基因的克隆与生物信息学分析.生物技术通报,2011,(1):107-112]。Walker研究团队发现,在以10-DAB为酰基受体时,DBAT对于酰基供体具有一定的宽泛性(promiscuity),但酰基辅酶A的碳链长度与催化效率呈负相关,其中以乙酰辅酶A作为酰基供体时的催化效率最高;当dbat基因被导入大肠杆菌后,产生的重组酶可利用大肠杆菌内源性乙酰辅酶A实现底物10-DAB到产物巴卡亭III的转化[Loncaric C,et al.Profiling a Taxol pathway10β-acetyltransferase:Assessmentof the specificity and the production of baccatin III by in vivo acetylationin E.coli.Chem Biol.,2006,13:1-9;Loncaric C,et al.Expression of an acetyl-CoAsynthase and a CoA-transferase in Escherichia coli to produce modifiedtaxanes in vivo.Biotechnol J.,2006,2(2):266-274];该研究团队还发现DBAT具有一定的区域选择宽泛性,也能对4-DAB 的C4位羟基乙酰化[Ondari ME,Walker KD.The taxolpathway 10-O-acetyltransferase shows regioselective promiscuity with theoxetane hydroxyl of 4-deacetyltaxanes.J Am Chem Soc.,2008, 130(50):17187-17194]。
由于10-去乙酰紫杉醇(DT)仅需一步C10位羟基乙酰化即成为紫杉醇,因此,研究和开发这类非天然底物C10位羟基的酶促乙酰化反应具有重要的理论和实际意义。但DBAT究竟能不能催化非天然底物DT的C10位羟基乙酰化、或者即使能够完成此催化反应但催化效率如何,这是一个亟待解决的问题。为此,本发明以DT为酰基受体,以乙酰辅酶A为酰基供体,应用重组的DBAT结合LC-MS等分析技术进行了催化研究,发现DBAT确实能够催化非天然底物DT的C10位羟基乙酰化反应而生成紫杉醇,但催化效率极低。之后,利用蛋白质工程对DBAT进行改造,已获得13个对于DT等非天然酰基受体底物的催化活性比野生型 DBAT有显著提高的突变体蛋白(DBATm系列)(产物为紫杉醇),其中的一些突变体蛋白对于天然酰基受体底物10-DAB的催化活性也有显著提高(产物为巴卡亭III)。将这些突变体蛋白与来自香菇的一种糖基水解酶LXYL-P1-2[Cheng HL,et al.Cloning and characterization ofthe glycoside hydrolases that remove xylosyl group from 7-β-xylosyl-10-deacetyltaxol and its analogues. Mol Cell Proteomics,2013,12(8):2236-2248]相偶联,以乙酰辅酶A为酰基供体,通过“一锅法”反应,还可以将天然含量较为丰富的7-木糖-10-去乙酰紫杉醇(XDT)直接转变为紫杉醇。
发明内容
针对DBAT究竟能不能催化非天然底物DT的C10位羟基乙酰化和如何提高此乙酰化效率,本发明解决的技术问题是提供了一类DBAT系列突变体蛋白、编码该突变体蛋白的核苷酸序列、含有该核苷酸序列的重组质粒、含有该核苷酸序列或重组质粒的重组细胞,以及以上所述的突变体蛋白、其核苷酸序列、其重组质粒或重组细胞在催化合成紫杉醇或其类似物方面的应用。
为解决本发明的技术问题,提供了如下技术方案:
本发明技术方案的第一方面是:应用纯化的(HPLC色谱纯)重组DBAT为催化剂,分别以10-去乙酰紫杉醇(DT)和乙酰辅酶A为乙酰基受体和供体进行催化反应,对产物进行LC-MS 鉴定,证明产物为紫杉醇,证明了重组DBAT可以催化非天然底物DT的C10位羟基乙酰化。
为了提高DBAT的乙酰化效率,本发明技术方案的第二方面是提供一种10-去乙酰巴卡亭 III 10β-O-乙酰转移酶DBAT的突变体蛋白,其特征在于,所述的突变体蛋白具有与SEQ ID NO1所示的氨基酸序列至少90%以上的一致性,但不包括SEQ ID NO1。优选的突变体蛋白具有与SEQ ID NO1所示的氨基酸序列至少95%以上的一致性。最优选的突变体蛋白的氨基酸序列选自SEQ ID NO2~SEQ ID NO23所示的氨基酸序列。
以上所述的突变体蛋白上可进行常规修饰;或者在这些突变体蛋白上连接有用于检测或纯化的标签;所述的常规修饰包括乙酰化、酰胺化、环化、糖基化、磷酸化、烷基化、生物素化、荧光基团修饰、聚乙二醇PEG修饰、固定化修饰;所述的标签包括6×His、GST、EGFP、MBP、Nus、HA、IgG、FLAG、c-Myc、Profinity eXact。
以上所述的突变体蛋白与野生型蛋白DBAT相比,其氨基酸突变包括:G38R、G38W、G38Y、G38I、G38T、G38E、G38M、G38Q、G38C、G38S、G38D、G38H、G38A、F301C、 F301V、F301A、F301M、F301L、F301T、F301S、C216R,以及以上氨基酸突变的组合;所述的组合包括但不限于G38R/F301V双突变。
本发明技术方案的第三方面是:提供了编码第二方面所述突变体蛋白的核苷酸序列,优选SEQ ID NO 25~SEQ ID 46所示的核苷酸序列。
本发明技术方案的第四方面是:提供含有第三方面所述核苷酸序列的重组质粒。
本发明技术方案的第五方面是:提供含有第三方面所述核苷酸序列或第四方面所述重组质粒的重组细胞。
本发明技术方案的第六方面是:提供本发明第二方面所述突变体蛋白、第三方面所述核苷酸序列、第四方面所述重组质粒、第五方面所述重组细胞在催化合成紫杉醇或其类似物方面的应用;进一步的,在催化10-去乙酰紫杉醇及其类似物的C10位羟基酰基化生成紫杉醇或其类似物中的应用;所述的突变体蛋白可以与能够专一性水解7-木糖-10-去乙酰紫杉烷的糖基水解酶相偶联,以7-木糖-10-去乙酰紫杉烷为底物,以酰基辅酶A为酰基供体生成紫杉醇或其类似物;所述的酰基受体包括但不限于10-去乙酰紫杉醇、10-去乙酰巴卡亭III;所述的酰基供体包括但不限于乙酰辅酶A、丙酰辅酶A和丁酰辅酶A。
优选的酰基供体底物为乙酰辅酶A,优选的酰基受体底物为10-去乙酰紫杉醇(DT);
本发明技术方案的第六方面是提供一种酶促反应偶联体系,其特征在于,所述的酶促反应偶联体系是由权利要求1-7任一项的突变体蛋白与糖基水解酶系列蛋白相偶联形成的,所述的糖基水解酶系列蛋白包括克隆自香菇的LXYL-P1蛋白及其一系列活性突变体;所述的偶联形式包括:两种酶在同一反应体系中各自独立存在、或通过连接子形成的融合蛋白形式;优选的糖基水解酶包括LXYL-P1-1(见GenBank Accession:AET31457.1)、LXYL-P1-2(见 GenBank Accession:AET31459.1)、或其系列突变蛋白(即申请号201510268487.6的专利申请中提到的系列突变蛋白)。最优选的糖基水解酶为糖基水解酶LXYL-P1-2系列蛋白,将该突变体蛋白与糖基水解酶LXYL-P1-2系列蛋白相偶联,通过“一锅法”反应,以7-木糖-10-去乙酰紫杉醇(XDT)或其类似物为前体,生物合成紫杉醇或其类似物。本发明还可用于规模化制备紫杉醇中间体巴卡亭III或其类似物。
有益技术效果
本发明利用蛋白质工程对10-去乙酰巴卡亭III 10β-O-乙酰转移酶(DBAT)进行改造,获得13个对于非天然酰基受体底物DT等的催化活性比野生型DBAT有显著提高的突变体蛋白 (DBATm系列),其中的一些突变体蛋白对于天然酰基受体底物10-DAB的催化活性也有显著提高。将这些突变体蛋白与一种糖基水解酶相偶联,以乙酰辅酶A为酰基供体,通过“一锅法”反应,还可以将天然含量较为丰富的7-木糖-10-去乙酰紫杉醇(XDT)直接转变为紫杉醇。本发明可以简化紫杉醇或其类似物的合成步骤,解决紫杉醇或其类似物资源少,合成难的问题。
附图说明
图-1重组DABT表达载体构建示意图
图-2.DBAT催化天然底物10-DAB及非天然底物DT的HPLC和LC-MS分析 (注:在以DT为底物时DBAT的用量是以10-DAB为底物时的25倍)
图-3.DBAT一段氨基酸序列的比对结果
图-5.DBAT突变体全质粒扩增示意图
图-6.DBAT-C216R热稳定性测定结果
图-7.DBAT-G38R/F301V催化DT的物质浓度-时间曲线
图-8.DBAT-G38R/F301V催化DT体系中补加DBAT-G38R/F301V的物质浓度-时间曲线
图-9.双酶催化体系中XDT、DT及紫杉醇含量变化情况
具体实施方式
本发明通过下列实施例予以进一步阐明,这些实施例是仅用于说明性的,而不是以任何方式限制本发明权利要求的范围。
实施例1:DBAT原核表达、纯化及催化天然底物10-DAB及非天然底物DT的HPLC-MS分析
人工合成东北红豆杉dbat基因序列(GenBank Accession:Q9M6E2.1),利用引物F:GAATTCATGCATCATCATCATCATCATGCAGGCTCAAC及引物R: GCGGCCGCTCAAGGCTTAGT进行dbat的基因扩增,同时在DBAT的N-端引入His标签, PCR扩增的片段经Nde I及Xba I进行双酶切后与经同样双酶切的载体连接,转化大肠杆菌 JM109感受态,经菌落PCR筛选阳性转化子JM109-pCWori-dbat,提取阳性转化子的质粒DNA 并进行测序验证。基因dbat cDNA扩增及重组质粒构建过程见图1。
重组菌株的诱导培养:
1)挑取单菌落于含有氨苄青霉素(Amp)的10mL LB(Amp终浓度为100μg/mL)液体培养基中,37℃、200rpm摇瓶培养约12h;
2)将过夜培养的重组菌按1%的比例转接于含有Amp的100mL TB(Amp终浓度为100μg/mL)液体培养基中,37℃、摇瓶培养(200rpm)约2-3h;
3)待OD600≈0.8时,加入IPTG至终浓度为1mmol/L,诱导培养条件:18℃、200r/min、 18h;
4)诱导结束后培养物于8000rpm离心3min,菌体沉淀用ddH2O洗涤2次;得到的菌体沉淀进行超生破碎或-20℃保存备用。
镍亲和层析法纯化目的蛋白:
1)样品准备:将诱导表达后的菌体重悬于破碎缓冲液(同平衡缓冲液,1L菌液收集的菌体沉淀用50mL缓冲液悬浮)中,经高压破碎后(800bar,3次),在4℃条件下12000rpm 离心30min,上清液用0.45μm滤膜过滤。
2)镍亲和层析柱平衡:2mL镍亲和层析柱经去离子水水洗后,用20mL平衡缓冲液平衡 (20mM咪唑、100mM NaCl、20mM Tris-HCl,pH7.5),流速2mL/min。
3)上样:蛋白样品反复上样5次,流速2mL/min。
4)洗脱:用20mL平衡缓冲液洗脱非特异结合蛋白;用20mL含20mM咪唑的缓冲液洗脱非特异结合蛋白;用20mL含200mM咪唑的缓冲液洗脱目的蛋白。
5)样品浓缩:将得到的目的蛋白洗脱液用截留分子量(Molecular WeightCutoff,MWCO) 为30kDa的超滤管在4000g、30min的离心条件下进行浓缩;浓缩后的样品进行蛋白浓度测定。
DBAT催化天然底物10-DAB及非天然底物DT的HPLC-MS分析:
100μL反应体系中包含终浓度为0.02mg/mL(10-DAB测定体系)或0.5mg/mL(DT测定体系)的DBAT、500μM(404.5μg/mL)乙酰辅酶A、500μM底物(相当于10-DAB 272.30 μg/mL或DT 405.94μg/mL),用pH 5.5的醋酸钠-醋酸缓冲液补齐至100μL,于37.5℃条件下反应12h后加入500μL甲醇终止反应,HPLC-MS检测转化产物(结果见图2)。
实施例2:DBAT蛋白一级序列同源比对及三维结构预测分析
将不同种红豆杉来源的DBAT进行一级序列比对分析(图3),发现本研究所采用的东北红豆杉(Taxus cuspidata)来源的DBAT中第216位点在不同种属红豆杉中存在差异,该位点在Taxus baccata、Taxus canadensis、Taxus fauna及Taxus globosa等中均为精氨酸(Arg或R),而在东北红豆杉中为半胱氨酸(Cys或C)。通过对预测的三维结构(图4)分析发现该位点的 Cys在空间上位于蛋白表面,且未与其他Cys形成二硫键,处于游离状态。文献报道蛋白中单独的Cys容易产生自氧化等导致蛋白失稳[Argos P,Rossmann MG,Grau UM,etal.Thermal stability and protein structure.Biochemistry,1979,18(25):5698-5703];另据统计发现嗜热蛋白中带电荷氨基酸Glu、Arg、Asp、Lys的含量明显高于中温蛋白,而更多的带电残基可以为嗜热蛋白提供更多的盐桥[Kumar S,Tsai CJ,NussinovR.Factors enhancing protein thermostability. Protein Eng,2000,13(3):179-191]。因此,本发明尝试对此位点进行216位点Cys→Arg突变(见实施例3)。
目前,DBAT三维结构未知,为进一步研究DBAT的结构与功能关系,选用与DBAT一致性最高的HCT(GenBank Accession:ABO47805.1,与DBAT的一致性为30%)三维结构为模板,利用蛋白质三维结构在线预测软件Swissmodel(http://swissmodel.expasy.org/)对DBAT结构进行预测,结果见图4。通过对DBAT进行三维结构分析,推测距离DBAT活性中心内的氨基酸位点(图4圆内表示范围内的氨基酸)如38、301等可能参与酶与底物的结合或催化。
实施例3:DBAT第38及第301位点饱和突变及组合突变菌株的构建
根据一级序列比对及预测的三维结构分析结果,利用全质粒PCR扩增的方法,以pCWori-dbat为模板分别进行第38及第301位点饱和突变[Parikh A,GuengerichFP.Random mutagenesis by whole-plasmid PCR amplification.Biotechniques,1998,24(3):428-431.]和C216R 定点突变。以pCWori-dbatm-G38R为模板引入F301V突变,构建G38R/F301V组合突变体。以38位饱和突变体重组质粒pCWori-dbat-38X的构建为例,如图5所示。突变体构建所用引物序列如下:
表1.突变体构建所用引物序列
PCR扩增体系如下:
PCR扩增条件:
PCR产物以1.0%的琼脂糖凝胶电泳检测,纯化回收PCR产物。
PCR产物于37℃用Dpn I酶切处理5h。酶切体系如下:
转化及筛选:将酶切产物全部转化大肠杆菌JM109感受态细胞。采用菌落PCR方法进行阳性转化子的筛选并进行DNA序列测定。
实施例4:DBAT突变体蛋白催化非天然底物DT比活力测定
反应体系中DBATm的终浓度为0.5mg/mL,DT及乙酰辅酶A终浓度均为500μM,溶于pH5.5的醋酸钠-醋酸缓冲液中(共100μL),37.5℃条件下反应3h后加入500μL甲醇终止反应,HPLC检测产物紫杉醇生成量。酶活力单位(U)定义为:在37.5℃、pH 5.5、以 DT为底物的条件下,每分钟产生1μmoL紫杉醇所需要的酶量。根据紫杉醇浓度-峰面积标准曲线,计算出酶反应体系中紫杉醇产生量,根据测得的蛋白质量浓度(mg/mL),求出单位为 U/mg的比活力。表2为经过筛选获得的催化DT活性或催化特性有显著改善的突变体(以DBAT 为对照),其中DBAT-G38R/F301V双突变的比活力是对照(DBAT)的3.7倍。
表2 DBAT及其突变体(DBATm)催化DT的比活力与相对酶活性
n=3,*P<0.05vs DBAT,**P<0.01vs DBAT.
实施例5:DBAT突变体蛋白催化天然底物10-DAB比活力测定
反应体系中DBAT或突变体蛋白终浓度为0.02mg/mL,10-DAB及乙酰辅酶A终浓度均为500μM,溶于pH 5.5的醋酸钠-醋酸缓冲液中(共100μL),40℃条件下反应20min后加入500μL甲醇终止反应,用HPLC检测产物生成量。催化10-DAB的酶活力单位(U)定义为:在40℃,pH5.5,以10-DAB为底物的条件下,每分钟产生1μmoL巴卡亭III所需要的酶量。根据巴卡亭III浓度-峰面积标准曲线,计算出酶反应体系中巴卡亭III生成量;根据测得的酶蛋白质量浓度(mg/mL),求出单位为U/mg的比活力。表3为经过筛选获得的催化10-DAB活性有显著提高的突变体。
表3.DBAT及其突变体(DBATm)催化10-DAB活性测定结果
n=3,*P<0.05vs DBAT,**P<0.01vs DBAT.注:反应温度45℃
实施例6:DBAT-C216R突变体蛋白热稳定性及最适催化温度分析
将重组DBAT及突变体DBAT-C216R蛋白用pH 5.5的缓冲液稀释至0.1mg/mL,置于37℃静置12h,每隔1h检测蛋白残留活性,活性检测方法同实施例5,结果见图6。结果显示野生型DBAT酶的半衰期为1.7h,突变体DBAT-C216R的热稳定性显著增强,半衰期延长至4.5 h。
突变体催化10-DAB及DT的最适温度分析:酶催化体系分别同实施例5和实施例6。反应温度分别为25、30、35、40、45和50℃。以野生型DBAT为对照。结果显示DBAT-C216R 催化10-DAB及DT的最适温度分别为45℃和40℃,比突变前分别增加约5℃。
实施例7:DBAT-G38R/F301V催化体系中底物DT与产物紫杉醇的时间-浓度变化曲线
⑴催化体系中不补加DBAT-G38R/F301V
催化体系组成:DBAT-G38R/F301V 1.5mg/mL,DT及乙酰辅酶A浓度均为2mM,DMSO(5% V/V),pH 5.5醋酸-醋酸钠缓冲液补齐至1mL。
反应条件:37.5℃,分别于3h、6h、9h、12h及15h分别检测DT转化情况。
结果见图7,结果显示:反应6h后趋于平衡,紫杉醇产量最高为452.09±2.52μg/mL。
⑵催化体系中补加DBAT-G38R/F301V
催化体系组成:DBAT-G38R/F301V 1.5mg/mL,DT及乙酰辅酶A浓度均为2mM,DMSO(5% V/V),pH 5.5醋酸-醋酸钠缓冲液补齐至1mL,分别于3h、6h、9h补加DBAT-G38R/F301V150μL(酶溶液10mg/mL)。
反应条件:37.5℃,分别于3h、6h、9h、12h及15h分别检测DT转化情况。
结果见图8,结果显示:反应12h后趋于平衡,反应15h时,紫杉醇产量达到640.76±5.05μg/mL。
实施例8:LXYL-P1-2及DBAT突变体偶联反应催化XDT为紫杉醇(显示前体XDT、中间体DT和产物紫杉醇的时间-浓度变化曲线)
所用酶溶液及底物母液:LXYL-P1-2 5mg/mL,DBAT-G38R/F301V 10mg/mL,乙酰辅酶 A 100mM,XDT 100mM;反应体积为10mL。
催化体系组成:LXYL-P1-2 1mL,DBAT-G38R/F301V 1.5mL,XDT 200μL,乙酰辅酶A200 μL,DMSO 500μL,pH 5.5醋酸钠-醋酸缓冲液6.6mL。
分别于3h、6h、9h补加DBAT-G38R/F301V 1.5mL。
反应条件:37.5℃,分别于3h、6h、9h、12h及15h分别检测各物质浓度。
结果见图9。结果显示:反应12h后趋于平衡,反应15h时,紫杉醇产量达到637.24±5.10 μg/mL。
<110> 中国医学科学院药物研究所
<120> 10-去乙酰巴卡亭Ⅲ 10β-O-乙酰转移酶突变体及其在催化合成紫杉醇及其类似物中的应用
<210> SEQ ID NO 1 DBAT
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工合成序列
<400> SEQUENCE: 1
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 2 DBAT-G38R
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 2
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Arg Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 3 DBAT-G38W
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 3
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Trp Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 4 DBAT-G38Y
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 4
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Tyr Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 5 DBAT-G38I
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 5
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Ile Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 6 DBAT-G38T
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 6
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Thr Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 7 DBAT-G38E
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 7
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Glu Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 8 DBAT-G38M
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 8
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Met Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 9 DBAT-G38Q
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 9
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gln Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 10 DBAT-G38C
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 10
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Cys Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 11 DBAT-G38S
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 11
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Ser Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 12 DBAT-G38D
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 12
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Asp Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 13 DBAT-G38H
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 13
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro His Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 14 DBAT-G38A
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 14
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Ala Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 15 DBAT-F301C
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 15
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Cys Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 16 DBAT-F301V
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 16
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Arg Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 17 DBAT-F301A
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 17
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Ala Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 18 DBAT-F301M
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 18
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Met Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 19 DBAT-F301L
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 19
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Leu Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 20 DBAT-F301T
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 20
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Thr Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 21 DBAT-F301S
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 21
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Ser Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 22 DBAT-C216R
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 22
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Arg Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 23 DBAT-G38R/F301V
<211> LENGTH: 440
<212> TYPE: PRT
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 23
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met Val Ala Pro Ser Gln Pro
5 10 15 20
Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr Leu Asp Asn Leu Pro Arg Val Arg Glu Asn Ile Phe
25 30 35 40
Asn Thr Leu Leu Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile Arg Gln
45 50 55 60 65
Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly Arg Leu Arg Lys Lys Glu Asn Gly
70 75 80 85
Asp Leu Glu Val Glu Cys Thr Gly Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu
90 95 100 105 110
Ser Val Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe Cys Leu Pro Pro
115 120 125 130
Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val Gln Val Thr Arg Phe Thr Cys Gly Gly Phe
135 140 145 150
Val Val Gly Val Ser Phe Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
155 160 165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile Trp Lys Arg Glu Leu Leu
180 185 190 195
Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr
200 205 210 215 220
Phe Gly Lys Ile Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys Gln Cys
225 230 235 240
Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val Val Ser Ala Leu Ala Trp Ile Ala
245 250 255 260
Arg Thr Arg Ala Leu Gln Ile Pro His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg
265 270 275 280 285
Lys Leu Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr Val Cys Ala Met
290 295 300 305
Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu Arg Val Val Arg Ile Ile Lys Lys Ala Lys
310 315 320 325 330
Val Ser Leu Asn Glu His Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
335 340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe Asp Glu Val Asp Phe Gly
355 360 365 370
Trp Gly His Ala Asp Asn Val Ser Leu Val Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser
375 380 385 390 395
Tyr Phe Leu Phe Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe Met Pro
400 405 410 415
Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met Thr Asn Lys Tyr Val Thr Lys Pro
420 425 430 435 440
<210> SEQ ID NO 24 DBAT
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工合成序列
<400> SEQUENCE: 24
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 25 DBAT-G38R
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 25
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc acgggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 26 DBAT-G38W
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 26
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc atgggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 27 DBAT-G38Y
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 27
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc atatgtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 28 DBAT-G38I
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 28
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aattgtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 29 DBAT-G38T
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 29
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aacggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 30 DBAT-G38E
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 30
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc agaggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 31 DBAT-G38M
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 31
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aatggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 32 DBAT-G38Q
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 32
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc acaggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 33 DBAT-G38C
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 33
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc atgtgtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 34 DBAT-G38S
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 34
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc atcggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 35 DBAT-G38D
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 35
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc agatgtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 36 DBAT-G38H
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 36
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc acatgtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 37 DBAT-G38A
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 37
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc agcggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 38 DBAT-F301C
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 38
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tgtgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 39 DBAT-F301V
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 39
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat gttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 40 DBAT-F301A
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 40
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat gctgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 41 DBAT-F301M
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 41
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat atggttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 42 DBAT-F301L
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 42
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat cttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 43 DBAT-F301T
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 43
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat actgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 44 DBAT-F301S
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 44
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat agtgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 45 DBAT-C216R
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 45
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgattcgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat tttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
<210> SEQ ID NO 46 DBAT-G38R/F301V
<211> LENGTH: 1321
<212> TYPE: DNA
<213> ORGANISM: Taxus cuspidata人工突变序列
<400> SEQUENCE: 46
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc cagccatcgc 70
ccaaagcttt cctgcagctc tccacccttg acaatctacc acgggtgaga gaaaacattt ttaacacctt 140
gttagtctac aatgcctcag acagagtttc cgtagatcct gcaaaagtaa ttcggcaggc tctctccaag 210
gtgttggtgt actattcccc ttttgcaggg cgtctcagga aaaaagaaaa tggagatctt gaagtggagt 280
gcacagggga gggtgctctg tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga 350
tgactacagt ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc tgccatggta 490
tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg gcaaggggag agattaagcc 560
ctcctcggag ccaatatgga agagagaatt gctgaagccg gaagaccctt tataccggtt ccagtattat 630
cactttcaat tgatttgccc gccttcaaca ttcgggaaaa tagttcaagg atctcttgtt ataacctctg 700
agacaataaa ttgtatcaaa caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt 770
atctgcattg gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat gttgttggta 910
ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg cgtgttgtaa ggattataaa 980
gaaagcaaag gtctctttaa atgagcattt cacgtcaaca atcgtgacac cccgttctgg atcagatgag 1050
agtatcaatt atgaaaacat agttggattt ggtgatcgaa ggcgattggg atttgatgaa gtagactttg 1120
ggtgggggca tgcagataat gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta 1190
ttttcttttc atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct tga
Claims (12)
1.一种10-去乙酰巴卡亭III 10β-O-乙酰转移酶DBAT的突变体蛋白,其特征在于,与氨基酸序列如SEQ ID NO 1所示的野生型蛋白DBAT相比,所述的突变体蛋白在301位氨基酸处具有选自下组中的一种突变:F301V、F301C、F301A、F301M、F301L、F301T、F301S。
2.根据权利要求1的突变体蛋白,其特征在于,所述的突变体蛋白具有F301V氨基酸突变。
3.根据权利要求1或2的突变体蛋白,其特征在于,所述的突变体蛋白与野生型蛋白DBAT相比,其氨基酸突变还包括:G38W、G38Y、G38I、G38T、G38E、G38M、G38Q、G38C、G38S、G38D、G38H、G38A、C216R,以及以上氨基酸突变的组合。
4.根据权利要求1-3任一项的突变体蛋白,其特征在于,所述的突变体蛋白的氨基酸序列选自SEQ ID NO 3~SEQ ID NO 14所示的氨基酸序列。
5.一种编码权利要求1-4任一项所述突变体蛋白的核苷酸序列。
6.根据权利要求6的核苷酸序列,其特征在于,所述的核苷酸序列选自SEQ ID NO 26~SEQ ID NO 37所示的核苷酸序列。
7.一种含有权利要求5-6任一项所述的核苷酸序列的重组质粒。
8.一种含有权利要求5-6任一项所述的核苷酸序列或权利要求7所述的重组质粒的重组细胞。
9.权利要求1-4任一项所述的突变体蛋白或权利要求5-6任一项所述的核苷酸序列或权利要求7所述的重组质粒或权利要求8所述的重组细胞在催化10-去乙酰紫杉醇及其类似物的C10位羟基酰基化生成紫杉醇或其类似物中的应用。
10.根据权利要求9的应用,其特征在于,所述的突变体蛋白可以与能够专一性水解7-木糖-10-去乙酰紫杉烷的糖基水解酶相偶联,以7-木糖-10-去乙酰紫杉烷为底物,以酰基辅酶A为酰基供体生成紫杉醇或其类似物。
11.根据权利要求10的应用,其特征在于,所述的酰基受体包括10-去乙酰紫杉醇、10-去乙酰巴卡亭III;所述的酰基供体包括乙酰辅酶A、丙酰辅酶A和丁酰辅酶A。
12.一种酶促反应偶联体系,其特征在于,所述的酶促反应偶联体系是由权利要求1-4任一项的突变体蛋白与糖基水解酶系列蛋白相偶联形成的,所述的糖基水解酶系列蛋白包括克隆自香菇的LXYL-P1蛋白及其一系列活性突变体;所述的偶联形式包括:两种酶在同一反应体系中各自独立存在、或通过连接子形成的融合蛋白形式;优选的糖基水解酶包括LXYL-P1-1或LXYL-P1-2或突变体蛋白。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110316454.XA CN115125222A (zh) | 2021-03-24 | 2021-03-24 | 利用10-去乙酰巴卡亭III10β-O-乙酰转移酶突变体催化合成紫杉醇及其类似物 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110316454.XA CN115125222A (zh) | 2021-03-24 | 2021-03-24 | 利用10-去乙酰巴卡亭III10β-O-乙酰转移酶突变体催化合成紫杉醇及其类似物 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115125222A true CN115125222A (zh) | 2022-09-30 |
Family
ID=83374288
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110316454.XA Pending CN115125222A (zh) | 2021-03-24 | 2021-03-24 | 利用10-去乙酰巴卡亭III10β-O-乙酰转移酶突变体催化合成紫杉醇及其类似物 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115125222A (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113980925A (zh) * | 2016-05-24 | 2022-01-28 | 中国医学科学院药物研究所 | 利用10-去乙酰巴卡亭III 10β-O-乙酰转移酶突变体催化合成紫杉醇及其衍生物 |
-
2021
- 2021-03-24 CN CN202110316454.XA patent/CN115125222A/zh active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113980925A (zh) * | 2016-05-24 | 2022-01-28 | 中国医学科学院药物研究所 | 利用10-去乙酰巴卡亭III 10β-O-乙酰转移酶突变体催化合成紫杉醇及其衍生物 |
CN113980925B (zh) * | 2016-05-24 | 2024-05-14 | 中国医学科学院药物研究所 | 利用10-去乙酰巴卡亭III 10β-O-乙酰转移酶突变体催化合成紫杉醇及其衍生物 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108823179A (zh) | 一种源自放线菌的转氨酶、突变体、重组菌及应用 | |
CN108103039B (zh) | 一组岩藻糖基转移酶突变体及其筛选方法和应用 | |
CN109266630A (zh) | 一种脂肪酶及其在制备布瓦西坦中间体中的应用 | |
CN113736763B (zh) | 黑芥子酶Rmyr及其在制备萝卜硫素、莱菔素中的应用 | |
CN111808829B (zh) | 一种γ-谷氨酰甲胺合成酶突变体及其应用 | |
CN114703158B (zh) | 一种蔗糖磷酸化酶突变体、编码基因及其应用 | |
CN107418938B (zh) | 10-去乙酰巴卡亭III 10β-O-乙酰转移酶突变体及其在催化合成紫杉醇及其类似物中的应用 | |
CN115125222A (zh) | 利用10-去乙酰巴卡亭III10β-O-乙酰转移酶突变体催化合成紫杉醇及其类似物 | |
CN112980906B (zh) | 一种用于制备β-烟酰胺单核苷酸的酶组合物及其应用 | |
CN113151232A (zh) | 忽地笑1-氨基环丙烷-1-羧酸合成酶及其编码基因与应用 | |
CN115433721B (zh) | 一种羰基还原酶突变体及其应用 | |
CN116240249A (zh) | 一种生物酶法水解核苷的方法 | |
CN109355271A (zh) | 一种海洋红酵母来源的环氧化物水解酶及其应用 | |
CN112831532B (zh) | 一种酶促合成d-亮氨酸的方法 | |
CN114657160B (zh) | 一种糖基转移酶突变体及其应用 | |
CN106893748B (zh) | 一种l-茶氨酸的合成方法 | |
CN108410850B (zh) | L-鼠李树胶糖-1-磷酸醛缩酶及其在催化合成稀有糖d-山梨糖中的应用 | |
CN114717170B (zh) | 异源合成黄酮类化合物的宿主细胞及其应用 | |
CN110699345A (zh) | 一种卤醇脱卤酶突变体及其应用 | |
CN114317631B (zh) | 单胺氧化酶在制备托品酮中的应用 | |
US20030175913A1 (en) | Compositions and methods for altering biosynthesis of taxanes and taxane-related compounds | |
CN112779235B (zh) | 一种生物催化合成多种黄酮苷的方法 | |
CN116987681A (zh) | 10-去乙酰巴卡亭Ⅲ10β-O-乙酰转移酶突变体及其应用 | |
CN118308332A (zh) | 一种重组酶VthBga突变体及其应用 | |
CN117965481A (zh) | 一种重组o-琥珀酰-l-高丝氨酸巯基转移酶突变体及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |