WO2021219124A1 - Modified threonine transaldolase and application thereof - Google Patents
Modified threonine transaldolase and application thereof Download PDFInfo
- Publication number
- WO2021219124A1 WO2021219124A1 PCT/CN2021/091392 CN2021091392W WO2021219124A1 WO 2021219124 A1 WO2021219124 A1 WO 2021219124A1 CN 2021091392 W CN2021091392 W CN 2021091392W WO 2021219124 A1 WO2021219124 A1 WO 2021219124A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- ltta
- substituted
- modified
- amino acid
- present
- Prior art date
Links
- 102100028601 Transaldolase Human genes 0.000 title claims abstract description 10
- 108020004530 Transaldolase Proteins 0.000 title claims abstract description 10
- 125000000341 threoninyl group Chemical class [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 title 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims abstract description 59
- HUMNYLRZRPPJDN-UHFFFAOYSA-N benzaldehyde Chemical compound O=CC1=CC=CC=C1 HUMNYLRZRPPJDN-UHFFFAOYSA-N 0.000 claims abstract description 34
- 239000004473 Threonine Substances 0.000 claims abstract description 31
- 229960002898 threonine Drugs 0.000 claims abstract description 31
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 29
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 29
- 239000002157 polynucleotide Substances 0.000 claims abstract description 29
- 238000006243 chemical reaction Methods 0.000 claims abstract description 28
- VHVGNTVUSQUXPS-JGVFFNPUSA-N (2s,3r)-2-amino-3-hydroxy-3-phenylpropanoic acid Chemical compound OC(=O)[C@@H](N)[C@H](O)C1=CC=CC=C1 VHVGNTVUSQUXPS-JGVFFNPUSA-N 0.000 claims abstract description 27
- QNGNSVIICDLXHT-UHFFFAOYSA-N para-ethylbenzaldehyde Natural products CCC1=CC=C(C=O)C=C1 QNGNSVIICDLXHT-UHFFFAOYSA-N 0.000 claims abstract description 17
- 239000013598 vector Substances 0.000 claims abstract description 16
- 238000004519 manufacturing process Methods 0.000 claims abstract description 9
- 150000008551 L-threonines Chemical class 0.000 claims abstract description 5
- 238000006467 substitution reaction Methods 0.000 claims description 63
- 230000000694 effects Effects 0.000 claims description 59
- 150000001413 amino acids Chemical group 0.000 claims description 32
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 19
- 239000013604 expression vector Substances 0.000 claims description 8
- 229910052739 hydrogen Inorganic materials 0.000 claims description 7
- 239000002253 acid Substances 0.000 claims description 4
- 230000003197 catalytic effect Effects 0.000 abstract 1
- 235000001014 amino acid Nutrition 0.000 description 60
- 125000003275 alpha amino acid group Chemical group 0.000 description 53
- 238000000034 method Methods 0.000 description 40
- 108090000765 processed proteins & peptides Proteins 0.000 description 30
- 229920001184 polypeptide Polymers 0.000 description 28
- 102000004196 processed proteins & peptides Human genes 0.000 description 28
- 102000004190 Enzymes Human genes 0.000 description 27
- 108090000790 Enzymes Proteins 0.000 description 27
- 229940088598 enzyme Drugs 0.000 description 27
- 150000007523 nucleic acids Chemical group 0.000 description 27
- 210000004027 cell Anatomy 0.000 description 25
- 229940024606 amino acid Drugs 0.000 description 24
- 102000039446 nucleic acids Human genes 0.000 description 23
- 108020004707 nucleic acids Proteins 0.000 description 23
- IKHGUXGNUITLKF-UHFFFAOYSA-N Acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 description 22
- 239000000047 product Substances 0.000 description 18
- 108090000623 proteins and genes Proteins 0.000 description 18
- 125000003729 nucleotide group Chemical group 0.000 description 14
- 239000002773 nucleotide Substances 0.000 description 13
- 102000004169 proteins and genes Human genes 0.000 description 11
- 238000003786 synthesis reaction Methods 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 9
- 241000588724 Escherichia coli Species 0.000 description 9
- 239000000843 powder Substances 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 9
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 8
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 8
- 125000000539 amino acid group Chemical group 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 8
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 description 8
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 7
- PSVPUHBSBYJSMQ-UHFFFAOYSA-N 4-methylsulfonylbenzaldehyde Chemical compound CS(=O)(=O)C1=CC=C(C=O)C=C1 PSVPUHBSBYJSMQ-UHFFFAOYSA-N 0.000 description 7
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 7
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 7
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 7
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 7
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 7
- 239000002953 phosphate buffered saline Substances 0.000 description 7
- 239000012429 reaction media Substances 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 6
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 6
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- 108090000854 Oxidoreductases Proteins 0.000 description 6
- 102000004316 Oxidoreductases Human genes 0.000 description 6
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- 102220306518 rs149808406 Human genes 0.000 description 6
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 5
- 238000009396 hybridization Methods 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 4
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 4
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 229910001629 magnesium chloride Inorganic materials 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 229960005190 phenylalanine Drugs 0.000 description 4
- 239000008363 phosphate buffer Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 235000007682 pyridoxal 5'-phosphate Nutrition 0.000 description 4
- 239000011589 pyridoxal 5'-phosphate Substances 0.000 description 4
- 229960001327 pyridoxal phosphate Drugs 0.000 description 4
- 229960001153 serine Drugs 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 229960004441 tyrosine Drugs 0.000 description 4
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108090000698 Formate Dehydrogenases Proteins 0.000 description 3
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- 125000003118 aryl group Chemical group 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 238000006555 catalytic reaction Methods 0.000 description 3
- 239000005515 coenzyme Substances 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 229960003136 leucine Drugs 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 239000004474 valine Substances 0.000 description 3
- 229960004295 valine Drugs 0.000 description 3
- -1 (2S Chemical compound 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 241000485755 Chitiniphilus shinanonensis Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 102220490279 DNA mismatch repair protein Mlh3_N35A_mutation Human genes 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- 102220572775 Lamina-associated polypeptide 2, isoform alpha_F59Y_mutation Human genes 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 239000001888 Peptone Substances 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 241000589540 Pseudomonas fluorescens Species 0.000 description 2
- 241000589774 Pseudomonas sp. Species 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 102220566198 Survival motor neuron protein_F70S_mutation Human genes 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 150000003935 benzaldehydes Chemical class 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 229960002743 glutamine Drugs 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 239000013615 primer Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 102220042631 rs587780925 Human genes 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- AYIRNRDRBQJXIF-NXEZZACHSA-N (-)-Florfenicol Chemical compound CS(=O)(=O)C1=CC=C([C@@H](O)[C@@H](CF)NC(=O)C(Cl)Cl)C=C1 AYIRNRDRBQJXIF-NXEZZACHSA-N 0.000 description 1
- CXIYBDIJKQJUMN-QMMMGPOBSA-N (2s)-2-anilino-3-hydroxypropanoic acid Chemical compound OC[C@@H](C(O)=O)NC1=CC=CC=C1 CXIYBDIJKQJUMN-QMMMGPOBSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 102220466836 HLA class II histocompatibility antigen, DR beta 4 chain_N35H_mutation Human genes 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- SBKRTALNRRAOJP-BWSIXKJUSA-N N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methylheptanamide (6S)-N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methyloctanamide sulfuric acid Polymers OS(O)(=O)=O.CC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O.CC[C@H](C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O SBKRTALNRRAOJP-BWSIXKJUSA-N 0.000 description 1
- BAWFJGJZGIEFAR-NNYOXOHSSA-N NAD zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 108010093965 Polymyxin B Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102220552581 Protein disulfide-isomerase A3_C57A_mutation Human genes 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000657354 Pseudomonas sp. 34 E 7 Species 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102220640822 Retinoid isomerohydrolase_F70V_mutation Human genes 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 239000004280 Sodium formate Substances 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 102220506113 V-set and transmembrane domain-containing protein 2B_N35Q_mutation Human genes 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000222124 [Candida] boidinii Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229940124350 antibacterial drug Drugs 0.000 description 1
- 150000001541 aziridines Chemical class 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 102220362887 c.171C>G Human genes 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 108010011035 endodeoxyribonuclease DpnI Proteins 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 229960003760 florfenicol Drugs 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229960002449 glycine Drugs 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 230000031700 light absorption Effects 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 229950006238 nadide Drugs 0.000 description 1
- 229940101270 nicotinamide adenine dinucleotide (nad) Drugs 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229960003548 polymyxin b sulfate Drugs 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 102200042754 rs121912898 Human genes 0.000 description 1
- 102220008853 rs193922308 Human genes 0.000 description 1
- 102220033163 rs281865287 Human genes 0.000 description 1
- 102200091337 rs587777474 Human genes 0.000 description 1
- 102220075080 rs796052773 Human genes 0.000 description 1
- 102220096697 rs876660029 Human genes 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000012807 shake-flask culturing Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- HLBBKKJFGFRGMU-UHFFFAOYSA-M sodium formate Chemical compound [Na+].[O-]C=O HLBBKKJFGFRGMU-UHFFFAOYSA-M 0.000 description 1
- 235000019254 sodium formate Nutrition 0.000 description 1
- 239000002910 solid waste Substances 0.000 description 1
- 238000012409 standard PCR amplification Methods 0.000 description 1
- 230000000707 stereoselective effect Effects 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 150000003457 sulfones Chemical class 0.000 description 1
- 230000009469 supplementation Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- OTVAEFIXJLOWRX-NXEZZACHSA-N thiamphenicol Chemical compound CS(=O)(=O)C1=CC=C([C@@H](O)[C@@H](CO)NC(=O)C(Cl)Cl)C=C1 OTVAEFIXJLOWRX-NXEZZACHSA-N 0.000 description 1
- 229960003053 thiamphenicol Drugs 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 239000000273 veterinary drug Substances 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1022—Transferases (2.) transferring aldehyde or ketonic groups (2.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y202/00—Transferases transferring aldehyde or ketonic groups (2.2)
- C12Y202/01—Transketolases and transaldolases (2.2.1)
Definitions
- the invention relates to the field of enzyme engineering. Specifically, the present invention relates to a modified L-threonine transaldolase (LTTA) and its application in the production of 3-phenyl-L-serine and its derivatives.
- LTTA modified L-threonine transaldolase
- 3-Phenyl-L-serine and its derivatives are a very important class of organic synthesis intermediates, which are widely used in drug synthesis.
- (2S,3R)-p-methylsulfonylphenylserine is a key intermediate for the synthesis of veterinary drugs florfenicol and thiamphenicol
- (2S,3R)-p-nitrophenylserine can be used as an antibacterial drug chloramphenicol Key intermediates.
- 3-phenyl-L-serine and its derivatives can also prepare chiral aziridines through simple transformation (David Tanner, Chiral Aziridines-Their Synthesis and Use in Stereoselective Transformations, Angew. Chem., Int.
- 3-phenyl-L-serine derivatives are mainly prepared by organic synthesis. Such methods often have disadvantages such as many steps, low yield, and poor stereoselectivity. Biologists have also made some attempts to prepare such compounds. For example, Steinreiber et al. (Johannes Steinreiber et al., Threonine aldolases—an emerging tool for organic synthesis, Tetrahedron, 63, 918-926 (2007)) used benzaldehyde derivatives. Synthesize 3-phenyl-L-serine derivatives with glycine under the action of aldolase, the reaction formula is as follows:
- LTTA is used to catalyze the reaction between p-methylsulfonyl benzaldehyde and L-threonine, such as (2S,3R)-p-methylsulfonyl phenylserine, see CN 109836362A.
- LTTA can catalyze the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives and acetaldehyde.
- the general reaction formula is as shown in formula (I) ( See, for example, PCT/CN2019/123974):
- the present invention provides a modified L-threonine transaldolase (LTTA), wherein the modified LTTA comprises amino acid substitutions at one or more positions compared to its starting LTTA, and It has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to generate 3-phenyl-L-serine or its derivatives.
- LTTA modified L-threonine transaldolase
- the modified LTTA includes an amino acid substitution at position 70, preferably substitution to H, N or S, more preferably substitution to H, and the position is numbered with reference to SEQ ID NO: 2.
- the modified LTTA further comprises amino acids at one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229, and 407 replace.
- position 35 is substituted with A, G or S, more preferably S.
- position 38 is substituted with F.
- position 48 is substituted with A or C.
- position 57 is substituted with S, M or T, more preferably M.
- position 59 is substituted with A or Y.
- position 94 is substituted with E.
- position 116 is substituted with R, S or N.
- position 141 is substituted with C.
- position 181 is substituted with Q.
- position 185 is substituted with G.
- position 205 is selected from S, Q or A.
- position 229 is substituted with C.
- position 407 is substituted with R.
- the modified LTTA comprises the amino acid sequence of one of SEQ ID NO: 13-72 or consists of the amino acid sequence of one of SEQ ID NO: 13-72; or is the same as SEQ ID NO: 13-65 Compared with the amino acid sequence of one of 69-72, the modified LTTA has positions 35, 38, 48, 57, 59, 70, 94, 116, 141, 181, 185, 205, 229 and 407. The position further contains 1-10 amino acid substitutions; or compared with the amino acid sequence of one of SEQ ID NO: 66-68, the modified LTTA also contains 1- 10 amino acid substitutions.
- the present invention provides a polynucleotide encoding the modified LTTA of the present invention, and a vector comprising the polynucleotide of the present invention.
- the present invention provides a host cell containing the modified LTTA of the present invention, its encoding polynucleotide or a vector containing the polynucleotide.
- the present invention also provides a method for producing 3-phenyl-L-serine and its derivatives, which comprises combining the modified LTTA of the present invention or the host cell of the present invention with benzaldehyde or its derivatives, and L-threonine contact.
- L-threonine transaldolase and "LTTA” have the ability to catalyze the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or derivatives and acetaldehyde (ie The reaction of formula (I)) activity.
- the present invention provides a modified LTTA polypeptide. Compared with the original LTTA polypeptide, the modified LTTA has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine.
- peptide means a chain of at least two amino acids connected by peptide bonds.
- polypeptide is used interchangeably with the term “protein” herein and refers to a chain containing ten or more amino acid residues. All peptide and polypeptide chemical formulas or sequences herein are written from left to right, indicating the direction from the amino terminal to the carboxy terminal.
- amino acid includes naturally occurring amino acids and unnatural amino acids in proteins.
- the one-letter and three-letter names of amino acids naturally occurring in proteins are commonly used in the field, and can be found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989).
- modification refers to any chemical modification of a polypeptide, such as amino acid substitutions, deletions, insertions, and/or additions.
- the modified LTTA of the present invention contains 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 compared to its starting LTTA. , 16, 17, 18, 19, 20 or more amino acid substitutions, wherein the modified LTTA of the present invention has an improved catalyzed reaction between benzaldehyde or its derivatives and L-threonine compared with the original LTTA Activity of 3-phenyl-L-serine or derivatives.
- the modified LTTA comprises 1-20, 1-15, 1-14, 1-13, 1-12, 1-11, 1-10, 1- 9. 1-8, 1-7, 1-6, 1-5, 1-4, 1-3 or 1-2 amino acid substitutions, wherein the modified LTTA of the present invention has an improvement compared with the original LTTA
- the modified LTTA of the present invention contains an amino acid substitution at position 70 compared to its starting LTTA, and the position is numbered with reference to SEQ ID NO: 2, preferably the substitution is H, N or S, more preferably Replaced by H.
- the modified LTTA of the present invention further comprises one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407.
- Amino acid substitution Preferably, position 35 is substituted with A, G or S, more preferably S.
- position 38 is substituted with F.
- position 48 is substituted with A or C.
- position 57 is substituted with S, M or T, more preferably M.
- position 59 is substituted with A or Y.
- position 94 is substituted with E.
- position 116 is substituted with R, S or N.
- position 141 is substituted with C.
- position 181 is substituted with Q.
- position 185 is substituted with G.
- position 205 is selected from S, Q or A.
- position 229 is substituted with C.
- position 407 is substituted with R.
- the modified LTTA of the present invention contains amino acid substitutions at positions 35 and 70, and the positions are numbered with reference to SEQ ID NO: 2, wherein position 35 is substituted with S and position 70 is substituted with H. More preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57 and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H.
- the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein position 35 is substituted with S, position 57 is substituted with M, position 59 is substituted with A, and position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C.
- the modified LTTA of the present invention has an amino acid substitution or a combination of amino acid substitutions selected from the group consisting of the following amino acid substitutions or combinations of amino acid substitutions (positions are numbered with reference to SEQ ID NO: 2):
- the LTTA polypeptide on which amino acid modifications are made is referred to as the starting LTTA.
- the starting LTTA can be wild-type LTTA or a variant of wild-type LTTA.
- the polypeptide of SEQ ID NO: 2 is the "starting LTTA"; and if the polypeptide of SEQ ID NO: 2 is modified (for example, SEQ ID NOs: 13-65 and 69-72) start to be modified, compared to the modified LTTA, the variant polypeptide is the "starting LTTA".
- wild-type LTTA refers to naturally occurring LTTA.
- the wild-type LTTA is LTTA from Pseudomonas.
- the wild-type LTTA is shown in SEQ ID NO: 2, 4, or 6.
- SEQ ID NO: 2 is the amino acid sequence of LTTA from Pseudomonas fluorescens (Genbank accession number AQZ26585.1)
- SEQ ID NO: 4 is the amino acid sequence of LTTA from Pseudomonas sp.34E 7 (Genbank accession No. CRN02517.1)
- SEQ ID NO: 6 is the amino acid sequence of LTTA from Pseudomonas sp.
- the wild-type LTTA is an LTTA from Chitiniphilus shinanonensis, and its sequence is shown in SEQ ID NO: 8 (Genbank accession number WP_018749561).
- the sequences are aligned for the purpose of optimal comparison (for example, gaps can be introduced in the first amino acid or nucleic acid sequence to match the second amino acid sequence). Or nucleic acid sequence for optimal alignment).
- the amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide in the corresponding position in the second sequence, then these molecules are the same at this position.
- percent identity number of identical positions/total number of positions (i.e. overlapping positions) x 100).
- the two sequences are the same length.
- Percent amino acid identity or “percent amino acid sequence identity” refers to comparing the amino acids of two polypeptides, and when optimally aligned, the two polypeptides have approximately the specified percentage of identical amino acids. For example, “95% amino acid identity” refers to comparing the amino acids of two polypeptides. When the two polypeptides are optimally aligned, 95% of the amino acids of the two polypeptides are identical.
- the modified LTTA polypeptide of the present invention has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85% compared to SEQ ID NO: 2, 4, 6 or 8. , 90% or 95%, particularly preferably at least 96%, 97%, 98% or 99% sequence identity.
- the modified LTTA of the present invention includes 1, 2, 3, 4, 5, 6, 7, 8 compared to its starting LTTA (for example, SEQ ID NO: 2, 4, 6, or 8). , 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more amino acid substitutions, wherein the modified LTTA of the present invention has improved catalysis compared with the original LTTA The activity of reacting benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives.
- the modified LTTA contains 1-20, 1-15, 1-14, 1-13, 1-20, 1-15, 1-14, 1-13, 1-12, 1-11, 1-10, 1-9, 1-8, 1-7, 1-6, 1-5, 1-4, 1-3 or 1-2 amino acid substitutions.
- the modified LTTA of the present invention contains an amino acid substitution at position 70 compared to its starting LTTA, and the position is numbered with reference to SEQ ID NO: 2, preferably the substitution is H, N or S, more preferably Replaced by H.
- the modified LTTA has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85%, 90% or 95%, particularly preferably at least 96%, 97%, 98% or 99% sequence identity.
- the modified LTTA of the present invention further comprises one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407.
- Amino acid substitution Preferably, position 35 is substituted with A, G or S, more preferably S.
- position 38 is substituted with F.
- position 48 is substituted with A or C.
- position 57 is substituted with S, M or T, more preferably M.
- position 59 is substituted with A or Y.
- position 94 is substituted with E.
- position 116 is substituted with R, S or N.
- position 141 is substituted with C.
- position 181 is substituted with Q.
- position 185 is substituted with G.
- position 205 is selected from S, Q or A.
- position 229 is substituted with C.
- position 407 is substituted with R.
- the modified LTTA has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85%, 90% or 95%, particularly preferably at least 96%, 97%, 98% or 99% sequence identity.
- the modified LTTA of the present invention contains amino acid substitutions at positions 35 and 70, and the positions are numbered with reference to SEQ ID NO: 2, wherein position 35 is substituted with S and position 70 is substituted with H. More preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57 and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H.
- the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein position 35 is substituted with S, position 57 is substituted with M, position 59 is substituted with A, and position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C.
- the modified LTTA has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85%, 90% or 95%, particularly preferably at least 96%, compared with its starting LTTA. 97%, 98% or 99% sequence identity.
- the modified LTTA of the present invention contains an amino acid substitution at position 70 compared to its starting LTTA, and the position is numbered with reference to SEQ ID NO: 2, preferably the substitution is H, N or S, more preferably Substituted to H, wherein the modified LTTA of the present invention has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives compared to its starting LTTA .
- the initial LTTA and SEQ ID NO: 2, 4, 6, or 8 have at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% % Or 99% sequence identity.
- the modified LTTA of the present invention further comprises one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407.
- Amino acid substitution Preferably, position 35 is substituted with A, G or S, more preferably S.
- position 38 is substituted with F.
- position 48 is substituted with A or C.
- position 57 is substituted with S, M or T, more preferably M.
- position 59 is substituted with A or Y.
- position 94 is substituted with E.
- position 116 is substituted with R, S or N.
- position 141 is substituted with C.
- position 181 is substituted with Q.
- position 185 is substituted with G.
- position 205 is selected from S, Q or A.
- position 229 is substituted with C.
- position 407 is substituted with R.
- the modified LTTA of the present invention contains amino acid substitutions at positions 35 and 70, and the positions are numbered with reference to SEQ ID NO: 2, wherein position 35 is substituted with S and position 70 is substituted with H. More preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57 and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H.
- the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein position 35 is substituted with S, position 57 is substituted with M, position 59 is substituted with A, and position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C.
- amino acid residue substitution refers to a substitution in which the amino acid residue is replaced by an amino acid residue having a similar side chain, for example, an amino acid with a basic side chain (e.g., lysine) , Arginine and histidine), acidic side chain amino acids (e.g. aspartic acid, glutamic acid), non-charged electroactive side chain amino acids (e.g. glycine, asparagine, glutamine, serine, threonine) Acid, tyrosine, cysteine), non-polar side chain amino acids (e.g.
- a basic side chain e.g., lysine
- acidic side chain amino acids e.g. aspartic acid, glutamic acid
- non-charged electroactive side chain amino acids e.g. glycine, asparagine, glutamine, serine, threonine
- Acid tyrosine, cysteine
- non-polar side chain amino acids
- ⁇ -branched side chain amino acids e.g. threonine, valine, isoleucine
- aromatic side chain amino acids e.g. tyrosine, phenylalanine, tryptophan, histidine
- Conservative amino acid substitutions generally have the least impact on the activity of the resulting protein. This substitution is described below. Conservative substitution is to replace an amino acid with an amino acid that is similar in size, hydrophobicity, charge, polarity, spatial characteristics, and aromaticity. When it is desired to fine-tune the properties of the protein, such substitutions are usually conservative.
- homologous amino acid residues refer to amino acid residues with similar chemical properties related to hydrophobicity, charge, polarity, steric characteristics, aromatic characteristics, and the like.
- amino acids that are homologous to each other include positively charged lysine, arginine, histidine, negatively charged glutamic acid, aspartic acid, hydrophobic glycine, alanine, valine, and leucine Acid, isoleucine, proline, phenylalanine, polar serine, threonine, cysteine, methionine, tryptophan, tyrosine, asparagine, glutamine , Aromatic phenylalanine, tyrosine, tryptophan, serine and threonine with chemically similar side chain groups, or glutamine and asparagine, or leucine and isoleucine.
- Examples of conservative amino acid substitutions in proteins include: Ser replaces Ala, Lys replaces Arg, Gln or His replaces Asn, Glu replaces Asp, Ser replaces Cys, Asn replaces Gln, Asp replaces Glu, Pro replaces Gly, Asn or Gln replaces His, Leu Or Val replaces Ile, Ile or Val replaces Leu, Arg or Gln replaces Lys, Leu or Ile replaces Met, Met, Leu or Tyr replaces Phe, Thr replaces Ser, Ser replaces Thr, Tyr replaces Tr, Trp or Phe replaces Tyr, and Ile or Leu replaces Val.
- the modified LTTA comprises the amino acid sequence of one of SEQ ID NO: 13-72 or consists of the amino acid sequence of one of SEQ ID NO: 13-72; or is the same as SEQ ID NO: 13-65 Compared with the amino acid sequence of one of 69-72, the modified LTTA has positions 35, 38, 48, 57, 59, 70, 94, 116, 141, 181, 185, 205, 229 and 407.
- the position contains 1-10 amino acid substitutions; or compared with the amino acid sequence of one of SEQ ID NO: 66-68, the modified LTTA contains 1-10 positions other than positions 35, 57 and 70 Amino acid substitutions, wherein the modified LTTA of the present invention has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives compared with the original LTTA.
- the modified LTTA is at positions 35, 38, 48, 57, 59, 70, 94, 116.
- the positions other than, 141, 181, 185, 205, 229 and 407 also contain 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acid substitutions; or with SEQ ID NO: Compared with the amino acid sequence of one of 66-68, the modified LTTA contains 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 at positions other than positions 35, 57, and 70. Or more amino acid substitutions.
- the activity of an enzyme refers to a decrease in substrate or an increase in product per unit time in a chemical reaction catalyzed by an enzyme per unit mass under certain conditions.
- the activity of the modified LTTA of the present invention under certain conditions (such as the reaction conditions listed in the following examples), under the catalysis of a unit mass of the modified LTTA, 3-phenyl-L generated in a unit time -Serine or its derivatives are expressed in terms of amount.
- the activity of an enzyme can also refer to the relative activity of the enzyme, expressed as the ratio of the activity of the enzyme of interest to the activity of a given enzyme that catalyzes the same reaction, such as percentage relative activity.
- the activity of the modified LTTA of the present invention is expressed as a ratio to the activity of the LTTA of SEQ ID NO: 2.
- the modified LTTA has an activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives is at least that of the LTTA of SEQ ID NO: 2 1, 1.05, 1.1, 1.2, 1.3, 1.5, 1.7, 2, 2.5, 3, 3.5, 4 times or more.
- product selectivity means that when the reaction product contains two or more possible stereoisomers, one of the products is produced more. If the reaction is an enzymatic reaction, the product selectivity is also referred to as the product selectivity of the enzyme.
- the carbons at positions 2 and 3 of the derivatives of 3-phenyl-L-serine are all chiral carbons, which will produce four stereoisomers, namely P(2S,3R ), P(2R,3S), (P(2S,3S) and P(2R,3R).
- the product selectivity of LTTA can be used under certain reaction conditions (such as the reaction conditions listed in the following examples) diastereomeric ratio (DR) and enantiomers
- DR diastereomeric ratio
- ee% are calculated using the following formulas (I) and (II).
- the DR of the modified LTTA of the present invention is increased, for example, to at least 95.5:4.5, at least 96:4, at least 96.5:3.5, at least 97:3, at least 97.5: 2.5, at least 98:2, at least 98.5:1.5, at least 99:1 or higher.
- the ee% of the modified LTTA of the present invention is >99.9%.
- the benzaldehyde derivative is p-methylsulfonyl benzaldehyde.
- the derivative of 3-phenyl-L-serine is (2S,3R)-p-methylsulfonylphenylserine.
- nucleic acid molecule includes DNA molecules (such as cDNA or genomic DNA) and RNA molecules (such as mRNA) and analogs of DNA or RNA produced using nucleotide analogs.
- the nucleic acid molecule may be single-stranded or double-stranded, preferably double-stranded DNA.
- the synthesis of the nucleic acid may use nucleotide analogs or derivatives (for example, inosine or phosphorothioate nucleotides). Such nucleotides can be used, for example, to prepare nucleic acids with altered base pairing ability or increased nuclease resistance.
- the present invention also provides polynucleotides encoding the modified LTTA of the present invention. Therefore, in the present invention, the term modification also includes genetic manipulation of the polynucleotide encoding the LTTA polypeptide of the present invention. The modification may be a substitution, deletion, insertion and/or addition of nucleotides.
- the term "encoding" refers to the amino acid sequence of a polynucleotide directly specifying its protein product.
- the boundaries of the coding sequence are generally determined by an open reading frame, which usually starts with the ATG start codon or other start codons such as GTG and TTG, and ends with stop codons such as TAA, TAG, and TGA.
- the coding sequence can be a DNA, cDNA or recombinant nucleotide sequence.
- nucleic acid molecules covering all or part of the nucleic acid sequence of the present invention can be separated by polymerase chain reaction (PCR), which uses the design of synthetic oligonucleotide primers based on the sequence information contained in the sequence.
- PCR polymerase chain reaction
- the polynucleotide of the present invention can be amplified using cDNA, mRNA or genomic DNA as a template and suitable oligonucleotide primers according to standard PCR amplification techniques.
- the nucleic acid so amplified can be cloned into a suitable vector and characterized by DNA sequence analysis.
- the polynucleotide of the present invention can be prepared by standard synthesis techniques, for example, by using an automated DNA synthesizer.
- the invention also relates to the complementary strands of the nucleic acid molecules described herein.
- a nucleic acid molecule that is complementary to other nucleotide sequences is a molecule that is sufficiently complementary to the nucleotide sequence so that it can hybridize with other nucleotide sequences to form a stable duplex.
- hybridization refers to nucleotides that are at least about 90%, preferably at least about 95%, more preferably at least about 96%, and more preferably at least 98% homologous to each other under given stringent hybridization and washing conditions. The sequences generally remain hybridized to each other.
- the polynucleotide of the present invention does not include a polynucleotide that only hybridizes to a poly A sequence (such as the 3'end poly (A) of mRNA) or a complementary stretch of poly T (or U) residues.
- nucleic acid constructs and vectors containing the polynucleotide of the present invention are also provided.
- expression includes any step involved in the production of a polypeptide, including but not limited to transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- nucleic acid construct refers to a single-stranded or double-stranded nucleic acid molecule, which is isolated from a naturally occurring gene or modified to contain a nucleic acid segment that does not occur in nature.
- nucleic acid construct contains the control sequences required to express the coding sequence of the present invention, the term nucleic acid construct is synonymous with the term "expression cassette”.
- expression vector refers herein to a linear or circular DNA molecule, which comprises a polynucleotide encoding a polypeptide of the present invention, the polynucleotide and additional nucleotides provided for the expression of the polynucleotide, for example, Control sequence, operably connected.
- the expression vector includes a viral vector or a plasmid vector.
- control sequence refers herein to include all elements necessary or advantageous for the expression of the polynucleotide encoding the polypeptide of the present invention.
- Each control sequence may be natural or foreign to the nucleotide sequence encoding the polypeptide, or natural or foreign to each other.
- control sequences include, but are not limited to, leader sequences, polyadenylation sequences, propeptide sequences, promoters, signal peptide sequences, and transcription terminator. At a minimum, control sequences include promoters and transcription and translation termination signals.
- control sequence may be a suitable promoter sequence, a nucleotide sequence recognized by the host cell to express the polynucleotide encoding the polypeptide of the present invention.
- the promoter sequence contains transcription control sequences that mediate the expression of the polypeptide.
- the promoter may be any nucleotide sequence that exhibits transcriptional activity in the selected host cell, for example, the Escherichia coli lac operon.
- the promoters also include mutated, truncated and hybrid promoters, and can be obtained from genes encoding extracellular or intracellular polypeptides homologous or heterologous to the host cell.
- operably linked refers to a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence, whereby the control sequence directs the expression of the polypeptide coding sequence.
- the polynucleotide encoding the polypeptide of the present invention can be subjected to various operations to allow expression of the polypeptide. Before inserting it into the vector, it is desirable or necessary to manipulate the polynucleotide according to the expression vector. Techniques for modifying polynucleotide sequences using recombinant DNA methods are well known in the art.
- the vector of the present invention preferably contains one or more selectable markers, which allow simple selection of transformed, transfected, transduced, etc. cells.
- a selectable marker is a gene whose product provides biocide or virus resistance, heavy metal resistance, supplementation of auxotrophs, etc.
- the bacterial selectable marker is the dal gene from Bacillus subtilis or Bacillus licheniformis, or a marker that confers antibiotic resistance such as ampicillin, kanamycin, chloramphenicol, or tetracycline resistance.
- the vector of the present invention can be integrated into the genome of the host cell or can replicate autonomously in the cell without relying on the genome. Elements required for integration into the host cell genome or autonomous replication are known in the art (see, for example, the aforementioned Sambrook et al., 1989).
- Vector DNA can be introduced into prokaryotic or eukaryotic cells by conventional transformation or transfection techniques.
- transformation and “transfection” refer to various art-recognized techniques for introducing foreign nucleic acids (such as DNA) into host cells, which can be found in, for example, the aforementioned Sambrook et al., 1989; Davis et al. .,Basic Methods in Molecular Biology (1986) and other laboratory manuals.
- the present invention also relates to a recombinant host cell, which contains the polynucleotide of the present invention, which is advantageously used for the recombinant production of LTTA polypeptides.
- the vector containing the polynucleotide of the present invention is introduced into a host cell, whereby the vector is retained as a chromosomal integrant or as a self-replicating extrachromosomal vector.
- Those skilled in the art know conventional vectors and host cells for expressing proteins.
- the host cell of the present invention is an E. coli cell, such as E. coli BL21 (DE3).
- the expression vector is pET-30a(+).
- the present invention provides a method for preparing 3-phenyl-L-serine or a derivative thereof, which comprises contacting the modified LTTA or host cell of the present invention with benzaldehyde or a derivative thereof and L-threonine.
- the method for preparing 3-phenyl-L-serine or a derivative thereof of the present invention includes the following steps:
- acetaldehyde reductase and reduced nicotinamide adenine dinucleotide (NADH) are added to the reaction medium to reduce acetaldehyde to ethanol.
- the acetaldehyde reductase is an acetaldehyde reductase from Escherichia coli, and its amino acid sequence is shown in SEQ ID NO: 10.
- glucose dehydrogenase and glucose are added to the reaction medium to achieve coenzyme (NADH) regeneration.
- NADH coenzyme
- the glucose dehydrogenase is derived from Bacillus subtilis (Bacillus subtilis), and its amino acid sequence is shown in SEQ ID NO: 12.
- formate or formate and formate dehydrogenase are added to the reaction medium to achieve coenzyme (NADH) regeneration.
- the formate dehydrogenase is derived from Candida boidinii, and its amino acid sequence is shown in SEQ ID NO: 73.
- the reaction medium is a buffer, such as PBS or Tris-HCl buffer.
- the reaction medium is a PBS buffer, such as 0.1M, pH 7.0 PBS buffer.
- the incubation is performed at 25-40°C, preferably 28-35°C, such as 30°C.
- DNA polymerase (PrimeSTAR Max DNA Polymerase) and DpnI endonuclease were purchased from TaKaRa;
- the plasmid extraction kit was purchased from Axygen;
- P-Methylsulfonyl benzaldehyde was purchased from Aladdin, item number M185093, purity 98%;
- L-threonine was purchased from Macleans, catalog number C10393311, analytically pure;
- Pyridoxal phosphate was purchased from Aladdin, item number P136795, purity ⁇ 98%;
- Magnesium chloride was purchased from Aladdin, catalog number A2006034, analytically pure;
- Oxidized nicotinamide adenine dinucleotide (NAD) was purchased from Aladdin, catalog number N196974, with a purity of 95%.
- the expression vector used was pET-30a(+), the plasmid was purchased from Novagen, and the host cell used was E. coli BL21 (DE3), purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.
- the E. coli cells transformed with the plasmid containing the target gene were inoculated into LB liquid medium containing 50mg/L of kanamycin (50mL medium in a 250mL bottle, peptone 10g/L, yeast powder 5g/L, NaCl 10g/L, pH7.0), incubate overnight at 37°C with shaking.
- kanamycin 50mL medium in a 250mL bottle, peptone 10g/L, yeast powder 5g/L, NaCl 10g/L, pH7.0
- E. coli cells were collected.
- the collected E. coli cells were resuspended in 20 mL of pre-cooled phosphate buffered saline (PBS) at pH 7.0, and the E. coli cells were sonicated at 4°C.
- PBS phosphate buffered saline
- the cell disruption solution was centrifuged at 6,000 g at 4°C for 15 min to remove the precipitate, and the supernatant obtained was a solution containing the recombinant enzyme, which was used to catalyze the reaction.
- the enzyme solution can also be freeze-dried into enzyme powder and stored at 4°C for later use.
- the E. coli cells transformed with the plasmid containing the target gene were inoculated into the wells of a 96-well shallow-well plate, and each well was filled with 100 ⁇ L of LB medium containing 50 ⁇ g/mL kanamycin.
- the culture was incubated in a shaker for 18 hours (250 rpm, 30°C and 85% relative humidity). Transfer 20 ⁇ L of the incubated culture to the wells of a 96-well deep well plate filled with 180 ⁇ L of TB medium containing 50 ⁇ g/mL kanamycin.
- the plate was incubated at 30°C and 250 rpm for 2 hours, and then IPTG (final concentration of 0.4 mM) was added for induction, and incubated in a shaker at 30°C and 250 rpm for 20 hours.
- the E. coli cells were collected by centrifugation, and 300 ⁇ L of lysate (20mM phosphate buffer pH 7.0, 1mM MgSO 4 , 1mg/mL lysozyme and 0.5mg/mL polymyxin B sulfate) was added. The mixture was shaken at room temperature for 2h, centrifuged and the supernatant was taken for enzyme activity determination.
- the reactants were added to the wells of a 96-well plate, mixed briefly, and the SpectraMax190 (Molecular Devices) light absorption microplate reader was used to detect the activity of LTTA and its mutants by changing the absorbance at 340 nm.
- HPLC parameters are as follows:
- HPLC parameters are as follows:
- LTTA01 SEQ ID NO:1
- a mutant was prepared according to the method of Example 1.
- the resulting mutants are shown in Table 1.
- the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared.
- the activities of LTTA01 and its mutants were measured according to Method 2 in section vi) of Example 1, and the results are shown in Table 1, where the relative activity refers to the activity of the mutant/the activity of LTTA01.
- N35S-Y38F-G48A-Q94E-K116S-F70H 59 2.65 N35S-C57T-F59A-F70H-Q94E-M141C 60 4.02 N35S-C57S-F59A-F70H-Q94E-M141C 61 3.95 N35S-C57M-F59A-F70H-Q94E-M141C 62 4.32 N35S-F59A-F70H-Q94E-M141C-W185G 63 2.19 N35S-F59A-F70H-Q94E-M141C-K407R 64 2.69 N35S-F59A-F70H-Q94E-M141C-M205A 65 4.02
- Example 3 Preparation and detection of mutants of L-threonine transaldolase from different sources
- a mutant containing the substitution combination N35S-C57M-F70H was prepared based on the following LTTA encoding nucleic acid:
- LTTA02 encoding nucleic acid SEQ ID NO: 3;
- LTTA04 Chitiniphilus shinanonensis: LTTA04, encoding nucleic acid SEQ ID NO: 7.
- Example 2 According to the method of Example 1, the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared. The activity of each wild-type LTTA and its mutants was measured according to method 2 in section vi) of Example 1, and the results are shown in Table 2, where the relative activity refers to the activity of the mutant/the activity of LTTA01.
- Example 4 Introduce different substitutions at positions 35, 57 and 70 to prepare single mutants and compare their enzyme activities
- LTTA01 SEQ ID NO:1
- SEQ ID NO:1 nucleic acid encoding LTTA01
- Table 1 the expression of LTTA01 and its mutants were cultured in a 96-well plate.
- the activity of LTTA01 and the mutant was measured according to Method 1 in Section vi) of Example 1, and the results are shown in Table 3, where the relative activity refers to the activity of the mutant/the activity of LTTA01.
- Example 5 The effect of introducing different mutations in LTTA01 on enzyme activity and product selectivity
- LTTA01 SEQ ID NO:1
- a mutant was prepared according to the method of Example 1.
- the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared.
- the activity and product selectivity of LTTA01 and the mutant were determined according to Method 3 in Article vi) of Example 1, and the results are shown in Table 4.
- the relative activity refers to the activity of the mutant/the activity of LTTA01, and the product selectivity is represented by DR , And for each enzyme, ee%>99.9%.
- mutants containing F70H or N35S are higher than that of wild type, and the product selectivity of mutants containing N35S and F70H mutations is further improved.
- the mutant containing the single mutation of N35S was not as active as WT under the above reaction conditions, but it did not significantly affect the enzyme activity after being combined with F70H.
- Example 6 The effect of introducing different mutations at position 70 of LTTA01 on enzyme activity and product selectivity
- LTTA01 SEQ ID NO:1
- a mutant was prepared according to the method of Example 1, and a substitution was introduced at position 70 of LTTA01.
- the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared.
- the activity and product selectivity of LTTA01 and the mutant were determined according to Method 3 in Article vi) of Example 1, and the results are shown in Table 5.
- the relative activity refers to the activity of the mutant/the activity of LTTA01, and the product selectivity is represented by DR , And for each enzyme, ee%>99.9%.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Provided is a modified L-threonine transaldolase (LTTA). Compared with the beginning, the modified LTTA has improved catalytic activity for a reaction between benzaldehyde or a derivative thereof and L-threonine by getting in contact. Also provided are a polynucleotide for encoding the modified LTTA of the present invention, a vector and host cell expressing the modified LTTA of the present invention, and a method for producing 3-phenyl-L-serine and a derivative thereof by using the modified LTTA of the present invention and the host cell.
Description
本发明涉及酶工程领域。具体而言,本发明涉及经修饰的L-苏氨酸转醛酶(LTTA)及其在生产3-苯基-L-丝氨酸及其衍生物中的应用。The invention relates to the field of enzyme engineering. Specifically, the present invention relates to a modified L-threonine transaldolase (LTTA) and its application in the production of 3-phenyl-L-serine and its derivatives.
3-苯基-L-丝氨酸及其衍生物,其结构通式如下所示:The general structural formula of 3-phenyl-L-serine and its derivatives is as follows:
3-苯基-L-丝氨酸及其衍生物是一类很重要的有机合成中间体,其广泛应用于药物合成中。比如:(2S,3R)-对甲砜基苯丝氨酸是合成兽药氟苯尼考跟甲砜霉素的关键中间体,(2S,3R)-对硝基苯丝氨酸可作为抗菌药物氯霉素的关键中间体。另外,3-苯基-L-丝氨酸及其衍生物还可以通过简单的转化制备手性氮杂环丙烷(David Tanner,ChiralAziridines-Their Synthesis and Use in Stereoselective Transformations,Angew.Chem.,Int.Ed.Engl.,33,599(1994)),其广泛应用于手性药物的合成。目前3-苯基-L-丝氨酸衍生物主要是通过有机合成制备,此类方法往往存在步骤多,收率低,立体选择性差等缺点。生物工作者对此类化合物的制备也做了一些尝试,例如Steinreiber等人(Johannes Steinreiber et al.,Threonine aldolases—an emerging tool for organic synthesis,Tetrahedron,63,918-926(2007))利用苯甲醛衍生物与甘氨酸在醛缩酶的作用下合成3-苯基-L-丝氨酸衍生物,其反应式如下所示:3-Phenyl-L-serine and its derivatives are a very important class of organic synthesis intermediates, which are widely used in drug synthesis. For example: (2S,3R)-p-methylsulfonylphenylserine is a key intermediate for the synthesis of veterinary drugs florfenicol and thiamphenicol, (2S,3R)-p-nitrophenylserine can be used as an antibacterial drug chloramphenicol Key intermediates. In addition, 3-phenyl-L-serine and its derivatives can also prepare chiral aziridines through simple transformation (David Tanner, Chiral Aziridines-Their Synthesis and Use in Stereoselective Transformations, Angew. Chem., Int. Ed. Engl., 33, 599 (1994)), which is widely used in the synthesis of chiral drugs. At present, 3-phenyl-L-serine derivatives are mainly prepared by organic synthesis. Such methods often have disadvantages such as many steps, low yield, and poor stereoselectivity. Biologists have also made some attempts to prepare such compounds. For example, Steinreiber et al. (Johannes Steinreiber et al., Threonine aldolases—an emerging tool for organic synthesis, Tetrahedron, 63, 918-926 (2007)) used benzaldehyde derivatives. Synthesize 3-phenyl-L-serine derivatives with glycine under the action of aldolase, the reaction formula is as follows:
目前此方法还是存在转化率低、立体选择性差、产品的分离纯化难度大等缺陷,不具备产业化的条件。因此探索一种温和、高效、经济的方法制备此类化合物受到了广大化学生物工作者的关注。At present, this method still has defects such as low conversion rate, poor stereoselectivity, and difficulty in product separation and purification, and does not have the conditions for industrialization. Therefore, exploring a mild, efficient, and economical method to prepare such compounds has attracted the attention of the majority of chemical and biological workers.
发明人发现,利用LTTA催化对甲砜基苯甲醛和L-苏氨酸反应,如(2S,3R)-对甲砜基苯丝氨酸,参见CN 109836362 A。The inventor found that LTTA is used to catalyze the reaction between p-methylsulfonyl benzaldehyde and L-threonine, such as (2S,3R)-p-methylsulfonyl phenylserine, see CN 109836362A.
进一步,发明人发现,LTTA可以催化苯甲醛或其衍生物和L-苏氨酸反应生成3-苯基-L-丝氨酸或其衍生物和乙醛,反应通式如式(I)所示(参见,例如,PCT/CN2019/123974):Furthermore, the inventors found that LTTA can catalyze the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives and acetaldehyde. The general reaction formula is as shown in formula (I) ( See, for example, PCT/CN2019/123974):
通过此方法,整个3-苯基-L-丝氨酸或其衍生物的制备、提纯工艺操作简单、条件温和、大幅度降低了固体废弃物和生产的成本。然而,仍需要具有更高的催化式(I)反应的活性的LTTA。Through this method, the entire preparation and purification process of 3-phenyl-L-serine or its derivatives has simple operation, mild conditions, and greatly reduces solid waste and production costs. However, there is still a need for LTTA with higher activity to catalyze the reaction of formula (I).
发明内容Summary of the invention
在第一方面,本发明提供一种经修饰的L-苏氨酸转醛酶(LTTA),其中与其起始LTTA相比,所述经修饰的LTTA包含一或多个位置的氨基酸取代,并且具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或其衍生物的活性。In the first aspect, the present invention provides a modified L-threonine transaldolase (LTTA), wherein the modified LTTA comprises amino acid substitutions at one or more positions compared to its starting LTTA, and It has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to generate 3-phenyl-L-serine or its derivatives.
在一些实施方案中,所述经修饰的LTTA包含位置70的氨基酸取代,优选取代为H、N或S,更优选取代为H,所述位置参照SEQ ID NO:2进行编号。在一些实施方案中,所述经修饰的LTTA还包含选自位置35、38、48、57、59、94、116、141、181、185、205、229和407的一或多个位置的氨基酸取代。优选地,位置35取代为A、G或S,更优选S。优选地,位置38取代为F。优选地,位置48取代为A或C。优选地,位置57取代为S、M或T,更优选M。优选地,位置59取代为A或Y。优选地,位置94取代为E。优选地,位置116取代为R、S或N。优选地,位置141取代为C。优选地,位置181取代为Q。优选地,位置185取代为G。优选地,位置205选自S、Q或A。优选地,位置229取代为C。优选地,位置407取代为R。In some embodiments, the modified LTTA includes an amino acid substitution at position 70, preferably substitution to H, N or S, more preferably substitution to H, and the position is numbered with reference to SEQ ID NO: 2. In some embodiments, the modified LTTA further comprises amino acids at one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229, and 407 replace. Preferably, position 35 is substituted with A, G or S, more preferably S. Preferably, position 38 is substituted with F. Preferably, position 48 is substituted with A or C. Preferably, position 57 is substituted with S, M or T, more preferably M. Preferably, position 59 is substituted with A or Y. Preferably, position 94 is substituted with E. Preferably, position 116 is substituted with R, S or N. Preferably, position 141 is substituted with C. Preferably, position 181 is substituted with Q. Preferably, position 185 is substituted with G. Preferably, position 205 is selected from S, Q or A. Preferably, position 229 is substituted with C. Preferably, position 407 is substituted with R.
在一些实施方案中,所述经修饰的LTTA包含SEQ ID NO:13-72之一的氨基酸序列或者由SEQ ID NO:13-72之一的氨基酸序列组成;或者与SEQ ID NO:13-65和69-72之一的氨基酸序列相比,所述经修饰的LTTA在除位置35、38、48、57、59、70、94、116、141、181、185、205、229和407之外的位置还包含1-10个氨基酸取代;或者与SEQ ID NO:66-68之一的氨基酸序列相比,所述经修饰的LTTA在除位置35、57和70之外的位置还包含1-10个氨基酸取代。In some embodiments, the modified LTTA comprises the amino acid sequence of one of SEQ ID NO: 13-72 or consists of the amino acid sequence of one of SEQ ID NO: 13-72; or is the same as SEQ ID NO: 13-65 Compared with the amino acid sequence of one of 69-72, the modified LTTA has positions 35, 38, 48, 57, 59, 70, 94, 116, 141, 181, 185, 205, 229 and 407. The position further contains 1-10 amino acid substitutions; or compared with the amino acid sequence of one of SEQ ID NO: 66-68, the modified LTTA also contains 1- 10 amino acid substitutions.
在第二方面,本发明提供编码本发明的经修饰的LTTA的多核苷酸,以及包含本发明的多核苷酸的载体。In a second aspect, the present invention provides a polynucleotide encoding the modified LTTA of the present invention, and a vector comprising the polynucleotide of the present invention.
在第三方面,本发明提供包含本发明的的经修饰的LTTA、其编码多核苷酸或包含所述多核苷酸的载体的宿主细胞。In a third aspect, the present invention provides a host cell containing the modified LTTA of the present invention, its encoding polynucleotide or a vector containing the polynucleotide.
在第四方面,本发明还提供一种生产3-苯基-L-丝氨酸及其衍生物的方法,包括使本发明的经修饰的LTTA或本发明的宿主细胞与苯甲醛或其衍生物以及L-苏氨酸接触。发明详述In the fourth aspect, the present invention also provides a method for producing 3-phenyl-L-serine and its derivatives, which comprises combining the modified LTTA of the present invention or the host cell of the present invention with benzaldehyde or its derivatives, and L-threonine contact. Detailed description of the invention
现在将详细参考本发明的某些实施方案,其示例在所附的结构和反应式中示出。尽管将结合所列举的实施方案描述本发明,但是应当理解,它们并不旨在将本发明限制于 那些实施方案。相反,本发明旨在覆盖可包括在如权利要求所限定的本发明的范围内的所有替代、修改和等同形式。本领域技术人员将认识到许多与本文描述的方法或材料相似或等同的方法和材料可用于实施本发明。本发明不限于所描述的方法和材料。如果所引用的文献、专利和类似材料中的一个或多个与本申请不同或相矛盾,包括但不限于所定义的术语、术语用法、所描述的技术等,则以本申请为准。除非另有定义,否则本文使用的所有技术和科学术语具有与本发明所属领域的普通技术人员通常理解的相同含义。尽管下面描述了合适的方法和材料,但是与本文描述的那些类似或等同的方法和材料可以用于本发明的实践或测试中。本文提及的所有出版物、专利申请、专利和其他参考文献通过引用整体并入本文。除非另有说明,否则本申请中使用的命名基于IUPAC系统命名。Reference will now be made in detail to certain embodiments of the present invention, examples of which are shown in the attached structures and reaction formulas. Although the invention will be described in conjunction with the enumerated embodiments, it should be understood that they are not intended to limit the invention to those embodiments. On the contrary, the present invention is intended to cover all alternatives, modifications, and equivalents that may be included within the scope of the present invention as defined by the claims. Those skilled in the art will recognize that many methods and materials similar or equivalent to those described herein can be used in the practice of the present invention. The invention is not limited to the described methods and materials. If one or more of the cited documents, patents and similar materials is different or contradictory to this application, including but not limited to defined terms, term usage, described technology, etc., this application shall prevail. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those of ordinary skill in the art to which the present invention belongs. Although suitable methods and materials are described below, methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety. Unless otherwise stated, the nomenclature used in this application is based on the IUPAC system nomenclature.
一、经修饰的L-苏氨酸转醛酶1. Modified L-threonine transaldolase
如本文所用,“L-苏氨酸转醛酶”和“LTTA”具有催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物和乙醛(即式(I)的反应)的活性。本发明提供经修饰的LTTA多肽,与其起始LTTA多肽相比,所述经修饰的LTTA具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应的活性。As used herein, "L-threonine transaldolase" and "LTTA" have the ability to catalyze the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or derivatives and acetaldehyde (ie The reaction of formula (I)) activity. The present invention provides a modified LTTA polypeptide. Compared with the original LTTA polypeptide, the modified LTTA has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine.
如本文所用,术语“肽”表示通过肽键连接的至少两个氨基酸的链。术语“多肽”在本文中可以与术语“蛋白质”互换使用,是指含有十个或更多个氨基酸残基的链。本文中的所有肽和多肽化学式或序列均是从左至右书写的,表示从氨基末端至羧基末端的方向。As used herein, the term "peptide" means a chain of at least two amino acids connected by peptide bonds. The term "polypeptide" is used interchangeably with the term "protein" herein and refers to a chain containing ten or more amino acid residues. All peptide and polypeptide chemical formulas or sequences herein are written from left to right, indicating the direction from the amino terminal to the carboxy terminal.
术语“氨基酸”包括蛋白质中天然存在的氨基酸和非天然氨基酸。蛋白质中天然存在的氨基酸的单字母和三字母命名采用本领域惯用名,可见于Sambrook,et al.(Molecular Cloning:A Laboratory Manual,2nd,ed.Cold Spring Harbor Laboratory,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,N.Y.,1989)。The term "amino acid" includes naturally occurring amino acids and unnatural amino acids in proteins. The one-letter and three-letter names of amino acids naturally occurring in proteins are commonly used in the field, and can be found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989).
如本文所用,术语“修饰”是指对多肽的任何化学修饰,例如氨基酸的取代、缺失、插入和/或添加。As used herein, the term "modification" refers to any chemical modification of a polypeptide, such as amino acid substitutions, deletions, insertions, and/or additions.
在一些实施方案中,本发明的经修饰的LTTA与其起始LTTA相比,包含1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20或更多个氨基酸取代,其中本发明的经修饰的LTTA与其起始LTTA相比,具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物的活性。In some embodiments, the modified LTTA of the present invention contains 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 compared to its starting LTTA. , 16, 17, 18, 19, 20 or more amino acid substitutions, wherein the modified LTTA of the present invention has an improved catalyzed reaction between benzaldehyde or its derivatives and L-threonine compared with the original LTTA Activity of 3-phenyl-L-serine or derivatives.
在一些实施方案中,所述经修饰的LTTA与其起始LTTA相比,包含1-20、1-15、1-14、1-13、1-12、1-11、1-10、1-9、1-8、1-7、1-6、1-5、1-4、1-3或1-2个氨基酸取代,其中本发明的经修饰的LTTA与其起始LTTA相比,具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物的活性。In some embodiments, the modified LTTA comprises 1-20, 1-15, 1-14, 1-13, 1-12, 1-11, 1-10, 1- 9. 1-8, 1-7, 1-6, 1-5, 1-4, 1-3 or 1-2 amino acid substitutions, wherein the modified LTTA of the present invention has an improvement compared with the original LTTA The activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives.
在一些实施方案中,本发明的经修饰的LTTA与其起始LTTA相比,包含位置70的氨基酸取代,所述位置参照SEQ ID NO:2进行编号,优选取代为H、N或S,更优选取代为H。In some embodiments, the modified LTTA of the present invention contains an amino acid substitution at position 70 compared to its starting LTTA, and the position is numbered with reference to SEQ ID NO: 2, preferably the substitution is H, N or S, more preferably Replaced by H.
在一些实施方案中,本发明的经修饰的LTTA还包含选自位置35、38、48、57、59、94、116、141、181、185、205、229和407的一或多个位置的氨基酸取代。优选地,位置35取代为A、G或S,更优选S。优选地,位置38取代为F。优选地,位置48取代为A或C。优选地,位置57取代为S、M或T,更优选M。优选地,位置59取代为A或Y。优选地,位置94取代为E。优选地,位置116取代为R、S或N。优选地,位置141取代为C。优选地,位置181取代为Q。优选地,位置185取代为G。优选地,位置205选自S、Q或A。优选地,位置229取代为C。优选地,位置407取代为R。In some embodiments, the modified LTTA of the present invention further comprises one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407. Amino acid substitution. Preferably, position 35 is substituted with A, G or S, more preferably S. Preferably, position 38 is substituted with F. Preferably, position 48 is substituted with A or C. Preferably, position 57 is substituted with S, M or T, more preferably M. Preferably, position 59 is substituted with A or Y. Preferably, position 94 is substituted with E. Preferably, position 116 is substituted with R, S or N. Preferably, position 141 is substituted with C. Preferably, position 181 is substituted with Q. Preferably, position 185 is substituted with G. Preferably, position 205 is selected from S, Q or A. Preferably, position 229 is substituted with C. Preferably, position 407 is substituted with R.
在优选的实施方案中,本发明的经修饰的LTTA包含位置35和70的氨基酸取代,所述位置参照SEQ ID NO:2进行编号,其中位置35取代为S,位置70取代为H。更优选地,所述经修饰的LTTA包含位置35、57和70的氨基酸取代,其中位置35取代为S,位置57取代为M,且位置70取代为H。进一步优选地,所述经修饰的LTTA包含位置35、57、59、70、94和141的氨基酸取代,其中位置35取代为S,位置57取代为M,位置59取代为A,位置70取代为H,位置94取代为E,且位置141取代为C。In a preferred embodiment, the modified LTTA of the present invention contains amino acid substitutions at positions 35 and 70, and the positions are numbered with reference to SEQ ID NO: 2, wherein position 35 is substituted with S and position 70 is substituted with H. More preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57 and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H. Further preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein position 35 is substituted with S, position 57 is substituted with M, position 59 is substituted with A, and position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C.
在一些实施方案中,本发明的经修饰的LTTA与其起始相比,具有选自如下的氨基酸取代或氨基酸取代的组合(位置参照SEQ ID NO:2进行编号):In some embodiments, the modified LTTA of the present invention has an amino acid substitution or a combination of amino acid substitutions selected from the group consisting of the following amino acid substitutions or combinations of amino acid substitutions (positions are numbered with reference to SEQ ID NO: 2):
-35A;-35A;
-35S;-35S;
-35G;-35G;
-57M;-57M;
-57T;-57T;
-57W;-57W;
-59A;-59A;
-59Y;-59Y;
-70H;-70H;
-70N;-70N;
-70S;-70S;
-94E;-94E;
-141C;-141C;
-205S;-205S;
-205Q;-205Q;
-205A;-205A;
-229C;-229C;
-59A、70H;-59A, 70H;
-181Q、70H;-181Q, 70H;
-116N、70H;-116N, 70H;
-35S、70H;-35S, 70H;
-38F、70H;-38F, 70H;
-48A、70H;-48A, 70H;
-48C、70H;-48C, 70H;
-59Y、70H;-59Y, 70H;
-70H、94E;-70H, 94E;
-70H、116S;-70H, 116S;
-70H、116N;-70H, 116N;
-70H、141C;-70H, 141C;
-35S、70H、181L;-35S, 70H, 181L;
-35S、70H、181Q;-35S, 70H, 181Q;
-35S、38F、70H;-35S, 38F, 70H;
-35S、57M、70H;-35S, 57M, 70H;
-35S、70H、94E;-35S, 70H, 94E;
-35S、70H、116N;-35S, 70H, 116N;
-35S、70H、141C;-35S, 70H, 141C;
-35S、48A、70H;-35S, 48A, 70H;
-35S、38F、70H、94E;-35S, 38F, 70H, 94E;
-35S、38F、70H、116S;-35S, 38F, 70H, 116S;
-35S、38F、48A、70H;-35S, 38F, 48A, 70H;
-35S、48A、59A、70H;-35S, 48A, 59A, 70H;
-35S、59A、70H、181Q;-35S, 59A, 70H, 181Q;
-35S、59Y、70H、181Q;-35S, 59Y, 70H, 181Q;
-35S、48A、70H、94E;-35S, 48A, 70H, 94E;
-35S、70H、116S、141C;-35S, 70H, 116S, 141C;
-35S、70H、116N、141C;-35S, 70H, 116N, 141C;
-35S、48A、59A、70H、94E;-35S, 48A, 59A, 70H, 94E;
-35S、59A、70H、94E、141C;-35S, 59A, 70H, 94E, 141C;
-35S、38F、48A、70H、94E;-35S, 38F, 48A, 70H, 94E;
-35S、38F、48A、59A、70H、116R;-35S, 38F, 48A, 59A, 70H, 116R;
-35S、38F、48A、94E、116S、70H;-35S, 38F, 48A, 94E, 116S, 70H;
-35S、57T、59A、70H、94E、141C;-35S, 57T, 59A, 70H, 94E, 141C;
-35S、57S、59A、70H、94E、141C;-35S, 57S, 59A, 70H, 94E, 141C;
-35S、57M、59A、70H、94E、141C;-35S, 57M, 59A, 70H, 94E, 141C;
-35S、59A、70H、94E、141C、185G;-35S, 59A, 70H, 94E, 141C, 185G;
-35S、59A、70H、94E、141C、407R;和-35S, 59A, 70H, 94E, 141C, 407R; and
-35S、59A、70H、94E、141C、205A。-35S, 59A, 70H, 94E, 141C, 205A.
在本文中,在其基础上进行氨基酸修饰的LTTA多肽称为起始LTTA。所述起始LTTA可以是野生型LTTA,也可以是野生型LTTA的变体。例如,从SEQ ID NO:2的多肽开始进行修饰,则相对于经修饰的LTTA,SEQ ID NO:2的多肽是“起始LTTA”;而如果从SEQ ID NO:2的变体多肽(例如SEQ ID NO:13-65和69-72)开始进行修饰,则相对于经修饰的LTTA,所述变体多肽是“起始LTTA”。In this article, the LTTA polypeptide on which amino acid modifications are made is referred to as the starting LTTA. The starting LTTA can be wild-type LTTA or a variant of wild-type LTTA. For example, starting from the polypeptide of SEQ ID NO: 2 for modification, compared to the modified LTTA, the polypeptide of SEQ ID NO: 2 is the "starting LTTA"; and if the polypeptide of SEQ ID NO: 2 is modified (for example, SEQ ID NOs: 13-65 and 69-72) start to be modified, compared to the modified LTTA, the variant polypeptide is the "starting LTTA".
如本文所用,术语“野生型LTTA”是指天然存在的LTTA。在一些实施方案中,所述野生型LTTA是来自假单胞菌属的LTTA。在一些实施方案中,所述野生型LTTA如SEQ ID NO:2、4或6所示。SEQ ID NO:2是来自荧光假单胞菌(Pseudomonas fluorescens)的LTTA的氨基酸序列(Genbank登录号AQZ26585.1),SEQ ID NO:4是来自Pseudomonas sp.34E 7的LTTA的氨基酸序列(Genbank登录号CRN02517.1),而SEQ ID NO:6是来自Pseudomonas sp.Irchel s3a18的LTTA的氨基酸序列(Genbank登录号WP_095149064)。在一些实施方案中,所述野生型LTTA是来自Chitiniphilus shinanonensis的LTTA,其序列如SEQ ID NO:8所示(Genbank登录号WP_018749561)。As used herein, the term "wild-type LTTA" refers to naturally occurring LTTA. In some embodiments, the wild-type LTTA is LTTA from Pseudomonas. In some embodiments, the wild-type LTTA is shown in SEQ ID NO: 2, 4, or 6. SEQ ID NO: 2 is the amino acid sequence of LTTA from Pseudomonas fluorescens (Genbank accession number AQZ26585.1), SEQ ID NO: 4 is the amino acid sequence of LTTA from Pseudomonas sp.34E 7 (Genbank accession No. CRN02517.1), and SEQ ID NO: 6 is the amino acid sequence of LTTA from Pseudomonas sp. Irchel s3a18 (Genbank accession number WP_095149064). In some embodiments, the wild-type LTTA is an LTTA from Chitiniphilus shinanonensis, and its sequence is shown in SEQ ID NO: 8 (Genbank accession number WP_018749561).
对于本发明,为确定两个氨基酸序列或两个核酸序列的相同性百分比,以最佳比较为目的比对序列(例如在第一个氨基酸或核酸序列中可导入缺口,以与第二个氨基酸或核酸序列进行最佳比对)。然后比较在相应氨基酸位置或核苷酸位置的氨基酸残基或核苷酸。当第一个序列中的位置在第二个序列中相应位置由相同氨基酸残基或核苷酸占据时,则这些分子在这个位置是相同的。两个序列之间的相同性百分比是所述序列共有的 相同位置的数量的函数(即相同性百分比=相同位置的数量/位置(即重叠位置)的总数量×100)。优选地,这两个序列是相同长度的。For the present invention, in order to determine the percent identity of two amino acid sequences or two nucleic acid sequences, the sequences are aligned for the purpose of optimal comparison (for example, gaps can be introduced in the first amino acid or nucleic acid sequence to match the second amino acid sequence). Or nucleic acid sequence for optimal alignment). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide in the corresponding position in the second sequence, then these molecules are the same at this position. The percent identity between two sequences is a function of the number of identical positions shared by the sequences (i.e., percent identity = number of identical positions/total number of positions (i.e. overlapping positions) x 100). Preferably, the two sequences are the same length.
本领域技术人员知晓,可以使用不同的计算机程序确定两个序列之间的相同性。Those skilled in the art know that different computer programs can be used to determine the identity between two sequences.
“氨基酸相同性百分比”或者“氨基酸序列相同性百分比”是指比较两个多肽的氨基酸,当最佳比对时,所述两个多肽具有大约指定的相同氨基酸百分比。例如,“95%的氨基酸相同性”是指比较两个多肽的氨基酸,当最佳比对时,所述两个多肽有95%的氨基酸相同。"Percent amino acid identity" or "percent amino acid sequence identity" refers to comparing the amino acids of two polypeptides, and when optimally aligned, the two polypeptides have approximately the specified percentage of identical amino acids. For example, "95% amino acid identity" refers to comparing the amino acids of two polypeptides. When the two polypeptides are optimally aligned, 95% of the amino acids of the two polypeptides are identical.
在一些实施方案中,本发明的经修饰的LTTA多肽与SEQ ID NO:2、4、6或8相比,具有至少65%,优选至少70%、75%或80%,更优选至少85%、90%或95%,特别优选至少96%、97%、98%或99%的序列相同性。In some embodiments, the modified LTTA polypeptide of the present invention has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85% compared to SEQ ID NO: 2, 4, 6 or 8. , 90% or 95%, particularly preferably at least 96%, 97%, 98% or 99% sequence identity.
在一些实施方案中,本发明的经修饰的LTTA与其起始LTTA(例如SEQ ID NO:2、4、6或8)相比,包含1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20或更多个氨基酸取代,其中本发明的经修饰的LTTA与其起始LTTA相比,具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物的活性。在一些实施方案中,所述经修饰的LTTA与其起始LTTA(例如SEQ ID NO:2、4、6或8)相比,包含1-20、1-15、1-14、1-13、1-12、1-11、1-10、1-9、1-8、1-7、1-6、1-5、1-4、1-3或1-2个氨基酸取代。在一些实施方案中,本发明的经修饰的LTTA与其起始LTTA相比,包含位置70的氨基酸取代,所述位置参照SEQ ID NO:2进行编号,优选取代为H、N或S,更优选取代为H。在一些实施方案中,所述经修饰的LTTA与其起始LTTA相比,具有至少65%,优选至少70%、75%或80%,更优选至少85%、90%或95%,特别优选至少96%、97%、98%或99%的序列相同性。In some embodiments, the modified LTTA of the present invention includes 1, 2, 3, 4, 5, 6, 7, 8 compared to its starting LTTA (for example, SEQ ID NO: 2, 4, 6, or 8). , 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more amino acid substitutions, wherein the modified LTTA of the present invention has improved catalysis compared with the original LTTA The activity of reacting benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives. In some embodiments, the modified LTTA contains 1-20, 1-15, 1-14, 1-13, 1-20, 1-15, 1-14, 1-13, 1-12, 1-11, 1-10, 1-9, 1-8, 1-7, 1-6, 1-5, 1-4, 1-3 or 1-2 amino acid substitutions. In some embodiments, the modified LTTA of the present invention contains an amino acid substitution at position 70 compared to its starting LTTA, and the position is numbered with reference to SEQ ID NO: 2, preferably the substitution is H, N or S, more preferably Replaced by H. In some embodiments, the modified LTTA has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85%, 90% or 95%, particularly preferably at least 96%, 97%, 98% or 99% sequence identity.
在一些实施方案中,本发明的经修饰的LTTA还包含选自位置35、38、48、57、59、94、116、141、181、185、205、229和407的一或多个位置的氨基酸取代。优选地,位置35取代为A、G或S,更优选S。优选地,位置38取代为F。优选地,位置48取代为A或C。优选地,位置57取代为S、M或T,更优选M。优选地,位置59取代为A或Y。优选地,位置94取代为E。优选地,位置116取代为R、S或N。优选地,位置141取代为C。优选地,位置181取代为Q。优选地,位置185取代为G。优选地,位置205选自S、Q或A。优选地,位置229取代为C。优选地,位置407取代为R。在一些实施方案中,所述经修饰的LTTA与其起始LTTA相比,具有至少65%,优选至少70%、75%或80%,更优选至少85%、90%或95%,特别优选至少96%、97%、98%或99%的序列相同性。In some embodiments, the modified LTTA of the present invention further comprises one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407. Amino acid substitution. Preferably, position 35 is substituted with A, G or S, more preferably S. Preferably, position 38 is substituted with F. Preferably, position 48 is substituted with A or C. Preferably, position 57 is substituted with S, M or T, more preferably M. Preferably, position 59 is substituted with A or Y. Preferably, position 94 is substituted with E. Preferably, position 116 is substituted with R, S or N. Preferably, position 141 is substituted with C. Preferably, position 181 is substituted with Q. Preferably, position 185 is substituted with G. Preferably, position 205 is selected from S, Q or A. Preferably, position 229 is substituted with C. Preferably, position 407 is substituted with R. In some embodiments, the modified LTTA has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85%, 90% or 95%, particularly preferably at least 96%, 97%, 98% or 99% sequence identity.
在优选的实施方案中,本发明的经修饰的LTTA包含位置35和70的氨基酸取代,所述位置参照SEQ ID NO:2进行编号,其中位置35取代为S,位置70取代为H。更优选地,所述经修饰的LTTA包含位置35、57和70的氨基酸取代,其中位置35取代为S,位置57取代为M,且位置70取代为H。进一步优选地,所述经修饰的LTTA包含位置35、57、59、70、94和141的氨基酸取代,其中位置35取代为S,位置57取代为M, 位置59取代为A,位置70取代为H,位置94取代为E,且位置141取代为C。优选地,所述经修饰的LTTA与其起始LTTA相比,具有至少65%,优选至少70%、75%或80%,更优选至少85%、90%或95%,特别优选至少96%、97%、98%或99%的序列相同性。In a preferred embodiment, the modified LTTA of the present invention contains amino acid substitutions at positions 35 and 70, and the positions are numbered with reference to SEQ ID NO: 2, wherein position 35 is substituted with S and position 70 is substituted with H. More preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57 and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H. Further preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein position 35 is substituted with S, position 57 is substituted with M, position 59 is substituted with A, and position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C. Preferably, the modified LTTA has at least 65%, preferably at least 70%, 75% or 80%, more preferably at least 85%, 90% or 95%, particularly preferably at least 96%, compared with its starting LTTA. 97%, 98% or 99% sequence identity.
在一些实施方案中,本发明的经修饰的LTTA与其起始LTTA相比,包含位置70的氨基酸取代,所述位置参照SEQ ID NO:2进行编号,优选取代为H、N或S,更优选取代为H,其中本发明的经修饰的LTTA与其起始LTTA相比,具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物的活性。在一些实施方案中,所述起始LTTA与SEQ ID NO:2、4、6或8具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%的序列相同性。在一些实施方案中,本发明的经修饰的LTTA还包含选自位置35、38、48、57、59、94、116、141、181、185、205、229和407的一或多个位置的氨基酸取代。优选地,位置35取代为A、G或S,更优选S。优选地,位置38取代为F。优选地,位置48取代为A或C。优选地,位置57取代为S、M或T,更优选M。优选地,位置59取代为A或Y。优选地,位置94取代为E。优选地,位置116取代为R、S或N。优选地,位置141取代为C。优选地,位置181取代为Q。优选地,位置185取代为G。优选地,位置205选自S、Q或A。优选地,位置229取代为C。优选地,位置407取代为R。在优选的实施方案中,本发明的经修饰的LTTA包含位置35和70的氨基酸取代,所述位置参照SEQ ID NO:2进行编号,其中位置35取代为S,位置70取代为H。更优选地,所述经修饰的LTTA包含位置35、57和70的氨基酸取代,其中位置35取代为S,位置57取代为M,且位置70取代为H。进一步优选地,所述经修饰的LTTA包含位置35、57、59、70、94和141的氨基酸取代,其中位置35取代为S,位置57取代为M,位置59取代为A,位置70取代为H,位置94取代为E,且位置141取代为C。In some embodiments, the modified LTTA of the present invention contains an amino acid substitution at position 70 compared to its starting LTTA, and the position is numbered with reference to SEQ ID NO: 2, preferably the substitution is H, N or S, more preferably Substituted to H, wherein the modified LTTA of the present invention has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives compared to its starting LTTA . In some embodiments, the initial LTTA and SEQ ID NO: 2, 4, 6, or 8 have at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% % Or 99% sequence identity. In some embodiments, the modified LTTA of the present invention further comprises one or more positions selected from positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407. Amino acid substitution. Preferably, position 35 is substituted with A, G or S, more preferably S. Preferably, position 38 is substituted with F. Preferably, position 48 is substituted with A or C. Preferably, position 57 is substituted with S, M or T, more preferably M. Preferably, position 59 is substituted with A or Y. Preferably, position 94 is substituted with E. Preferably, position 116 is substituted with R, S or N. Preferably, position 141 is substituted with C. Preferably, position 181 is substituted with Q. Preferably, position 185 is substituted with G. Preferably, position 205 is selected from S, Q or A. Preferably, position 229 is substituted with C. Preferably, position 407 is substituted with R. In a preferred embodiment, the modified LTTA of the present invention contains amino acid substitutions at positions 35 and 70, and the positions are numbered with reference to SEQ ID NO: 2, wherein position 35 is substituted with S and position 70 is substituted with H. More preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57 and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H. Further preferably, the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein position 35 is substituted with S, position 57 is substituted with M, position 59 is substituted with A, and position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C.
术语“保守取代”也称为由“同源”氨基酸残基取代,是指其中氨基酸残基由具有相似侧链的氨基酸残基置换的取代,例如,碱性侧链的氨基酸(例如赖氨酸、精氨酸和组氨酸)、酸性侧链的氨基酸(例如天冬氨酸、谷氨酸)、非荷电极性侧链氨基酸(例如甘氨酸、天冬酰胺、谷氨酰胺、丝氨酸、苏氨酸、酪氨酸、半胱氨酸)、非极性侧链氨基酸(例如丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、苯丙氨酸、甲硫氨酸、色氨酸)、β-分支的侧链氨基酸(例如苏氨酸、缬氨酸、异亮氨酸)及芳香侧链氨基酸(例如酪氨酸、苯丙氨酸、色氨酸、组氨酸)。The term "conservative substitution" is also referred to as "homologous" amino acid residue substitution, and refers to a substitution in which the amino acid residue is replaced by an amino acid residue having a similar side chain, for example, an amino acid with a basic side chain (e.g., lysine) , Arginine and histidine), acidic side chain amino acids (e.g. aspartic acid, glutamic acid), non-charged electroactive side chain amino acids (e.g. glycine, asparagine, glutamine, serine, threonine) Acid, tyrosine, cysteine), non-polar side chain amino acids (e.g. alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine) , Tryptophan), β-branched side chain amino acids (e.g. threonine, valine, isoleucine) and aromatic side chain amino acids (e.g. tyrosine, phenylalanine, tryptophan, histidine) acid).
保守氨基酸取代通常对所得蛋白质的活性的影响最小。这种取代在下文描述。保守取代是用大小、疏水性、电荷、极性、空间特征、芳香性等相似的氨基酸置换一个氨基酸。当希望精细调节蛋白质的特性时,这种取代通常是保守的。Conservative amino acid substitutions generally have the least impact on the activity of the resulting protein. This substitution is described below. Conservative substitution is to replace an amino acid with an amino acid that is similar in size, hydrophobicity, charge, polarity, spatial characteristics, and aromaticity. When it is desired to fine-tune the properties of the protein, such substitutions are usually conservative.
如本文所用,“同源”氨基酸残基是指具有相似化学性质的氨基酸残基,所述化学性质涉及疏水性、电荷、极性、空间特征、芳香性特征等。彼此同源的氨基酸的例子包括正电荷的赖氨酸、精氨酸、组氨酸,负电荷的谷氨酸、天冬氨酸,疏水性的甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、苯丙氨酸,极性的丝氨酸、苏氨酸、半胱 氨酸、甲硫氨酸、色氨酸、酪氨酸、天冬酰胺、谷氨酰胺,芳香性的苯丙氨酸、酪氨酸、色氨酸,化学相似侧链基团的丝氨酸与苏氨酸,或者谷氨酰胺和天冬酰胺,或者亮氨酸和异亮氨酸。As used herein, "homologous" amino acid residues refer to amino acid residues with similar chemical properties related to hydrophobicity, charge, polarity, steric characteristics, aromatic characteristics, and the like. Examples of amino acids that are homologous to each other include positively charged lysine, arginine, histidine, negatively charged glutamic acid, aspartic acid, hydrophobic glycine, alanine, valine, and leucine Acid, isoleucine, proline, phenylalanine, polar serine, threonine, cysteine, methionine, tryptophan, tyrosine, asparagine, glutamine , Aromatic phenylalanine, tyrosine, tryptophan, serine and threonine with chemically similar side chain groups, or glutamine and asparagine, or leucine and isoleucine.
蛋白质中氨基酸保守取代的例子包括:Ser取代Ala,Lys取代Arg,Gln或His取代Asn,Glu取代Asp,Ser取代Cys,Asn取代Gln,Asp取代Glu,Pro取代Gly,Asn或Gln取代His,Leu或Val取代Ile,Ile或Val取代Leu,Arg或Gln取代Lys,Leu或Ile取代Met,Met、Leu或Tyr取代Phe,Thr取代Ser,Ser取代Thr,Tyr取代Trp,Trp或Phe取代Tyr,及Ile或Leu取代Val。Examples of conservative amino acid substitutions in proteins include: Ser replaces Ala, Lys replaces Arg, Gln or His replaces Asn, Glu replaces Asp, Ser replaces Cys, Asn replaces Gln, Asp replaces Glu, Pro replaces Gly, Asn or Gln replaces His, Leu Or Val replaces Ile, Ile or Val replaces Leu, Arg or Gln replaces Lys, Leu or Ile replaces Met, Met, Leu or Tyr replaces Phe, Thr replaces Ser, Ser replaces Thr, Tyr replaces Tr, Trp or Phe replaces Tyr, and Ile or Leu replaces Val.
在一些实施方案中,所述经修饰的LTTA包含SEQ ID NO:13-72之一的氨基酸序列或者由SEQ ID NO:13-72之一的氨基酸序列组成;或者与SEQ ID NO:13-65和69-72之一的氨基酸序列相比,所述经修饰的LTTA在除位置35、38、48、57、59、70、94、116、141、181、185、205、229和407之外的位置包含1-10个氨基酸取代;或者与SEQ ID NO:66-68之一的氨基酸序列相比,所述经修饰的LTTA在除位置35、57和70之外的位置包含1-10个氨基酸取代,其中本发明的经修饰的LTTA与其起始LTTA相比,具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物的活性。在一些实施方案中,与SEQ ID NO:13-65和69-72之一的氨基酸序列相比,所述经修饰的LTTA在除位置35、38、48、57、59、70、94、116、141、181、185、205、229和407之外的位置还包含1、2、3、4、5、6、7、8、9、10或更多个氨基酸取代;或者与SEQ ID NO:66-68之一的氨基酸序列相比,所述经修饰的LTTA在除位置35、57和70之外的位置还包含1、2、3、4、5、6、7、8、9、10或更多个氨基酸取代。In some embodiments, the modified LTTA comprises the amino acid sequence of one of SEQ ID NO: 13-72 or consists of the amino acid sequence of one of SEQ ID NO: 13-72; or is the same as SEQ ID NO: 13-65 Compared with the amino acid sequence of one of 69-72, the modified LTTA has positions 35, 38, 48, 57, 59, 70, 94, 116, 141, 181, 185, 205, 229 and 407. The position contains 1-10 amino acid substitutions; or compared with the amino acid sequence of one of SEQ ID NO: 66-68, the modified LTTA contains 1-10 positions other than positions 35, 57 and 70 Amino acid substitutions, wherein the modified LTTA of the present invention has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives compared with the original LTTA. In some embodiments, compared with the amino acid sequence of one of SEQ ID NOs: 13-65 and 69-72, the modified LTTA is at positions 35, 38, 48, 57, 59, 70, 94, 116. The positions other than, 141, 181, 185, 205, 229 and 407 also contain 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acid substitutions; or with SEQ ID NO: Compared with the amino acid sequence of one of 66-68, the modified LTTA contains 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 at positions other than positions 35, 57, and 70. Or more amino acid substitutions.
如本文所用,酶的活性指在一定条件下,在单位质量的酶催化的化学反应中,单位时间内底物的减少量或产物的增加量。例如,本发明的经修饰的LTTA的活性,用一定条件下(如以下实施例中列举的反应条件),在单位质量的经修饰的LTTA催化下,单位时间内生成的3-苯基-L-丝氨酸或其衍生物量来表示。As used herein, the activity of an enzyme refers to a decrease in substrate or an increase in product per unit time in a chemical reaction catalyzed by an enzyme per unit mass under certain conditions. For example, the activity of the modified LTTA of the present invention, under certain conditions (such as the reaction conditions listed in the following examples), under the catalysis of a unit mass of the modified LTTA, 3-phenyl-L generated in a unit time -Serine or its derivatives are expressed in terms of amount.
在本文中,酶的活性也可以指酶的相对活性,以感兴趣的酶的活性与催化相同反应的给定的酶的活性的比值表示,如百分比相对活性。In this context, the activity of an enzyme can also refer to the relative activity of the enzyme, expressed as the ratio of the activity of the enzyme of interest to the activity of a given enzyme that catalyzes the same reaction, such as percentage relative activity.
在一些实施方案中,本发明的经修饰的LTTA的活性以与SEQ ID NO:2的LTTA的活性的比值表示。在一些实施方案中,所述经修饰的LTTA催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸或衍生物的活性是SEQ ID NO:2的LTTA的至少1、1.05、1.1、1.2、1.3、1.5、1.7、2、2.5、3、3.5、4倍或更高。In some embodiments, the activity of the modified LTTA of the present invention is expressed as a ratio to the activity of the LTTA of SEQ ID NO: 2. In some embodiments, the modified LTTA has an activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to produce 3-phenyl-L-serine or its derivatives is at least that of the LTTA of SEQ ID NO: 2 1, 1.05, 1.1, 1.2, 1.3, 1.5, 1.7, 2, 2.5, 3, 3.5, 4 times or more.
如本文所用,产物选择性是指当反应产物包含两个或多个可能的立体异构体时,其中一种产物生成较多。如果所述反应是酶促反应,则所述产物选择性也称为酶的产物选择性。例如,在式(I)的反应中,3-苯基-L-丝氨酸的衍生物的2、3位的碳都是手性碳,会产生四种立体异构体,即P(2S,3R)、P(2R,3S)、(P(2S,3S)和P(2R,3R)。As used herein, product selectivity means that when the reaction product contains two or more possible stereoisomers, one of the products is produced more. If the reaction is an enzymatic reaction, the product selectivity is also referred to as the product selectivity of the enzyme. For example, in the reaction of formula (I), the carbons at positions 2 and 3 of the derivatives of 3-phenyl-L-serine are all chiral carbons, which will produce four stereoisomers, namely P(2S,3R ), P(2R,3S), (P(2S,3S) and P(2R,3R).
在本文中,LTTA(野生型或经修饰的)的产物选择性可以用一定反应条件下(如以下实施例中列举的反应条件)非对映异构比(diastereomeric ratio,DR)和对映体过量百分比 (enantiomeric excess percentage,ee%)表征,其中DR和ee%用以下公式(I)和(II)计算。In this article, the product selectivity of LTTA (wild-type or modified) can be used under certain reaction conditions (such as the reaction conditions listed in the following examples) diastereomeric ratio (DR) and enantiomers The excess percentage (enantiomeric excess percentage, ee%) is characterized, where DR and ee% are calculated using the following formulas (I) and (II).
DR=(P(2S,3R)的产量+P(2R,3S)的产量):(P(2S,3S)的产量+P(2R,3R)的产量) 公式(I)DR=(P(2S,3R) output+P(2R,3S) output):(P(2S,3S) output+P(2R,3R) output) Formula (I)
ee%=P(2S,3R)的产量/(P(2S,3R)的产量+P(2R,3S)的产量)×100% 公式(II)ee%=P(2S,3R) output/(P(2S,3R) output+P(2R,3S) output)×100% Formula (II)
在一些实施方案中,与野生型LTTA相比,本发明的经修饰的LTTA的DR提高,例如提高到至少95.5:4.5、至少96:4、至少96.5:3.5、至少97:3、至少97.5:2.5、至少98:2、至少98.5:1.5、至少99:1或更高。In some embodiments, compared to wild-type LTTA, the DR of the modified LTTA of the present invention is increased, for example, to at least 95.5:4.5, at least 96:4, at least 96.5:3.5, at least 97:3, at least 97.5: 2.5, at least 98:2, at least 98.5:1.5, at least 99:1 or higher.
在一些实施方案中,本发明的经修饰的LTTA的ee%>99.9%。In some embodiments, the ee% of the modified LTTA of the present invention is >99.9%.
在一些实施方案中,所述苯甲醛的衍生物是对甲砜基苯甲醛。在一些实施方案中,所述3-苯基-L-丝氨酸的衍生物是(2S,3R)-对甲砜基苯丝氨酸。In some embodiments, the benzaldehyde derivative is p-methylsulfonyl benzaldehyde. In some embodiments, the derivative of 3-phenyl-L-serine is (2S,3R)-p-methylsulfonylphenylserine.
二、编码经修饰的LTTA的多核苷酸。2. The polynucleotide encoding the modified LTTA.
如本文所用,术语“多核苷酸”或者“核酸分子”包括DNA分子(例如cDNA或基因组DNA)和RNA分子(例如mRNA)及使用核苷酸类似物产生的DNA或RNA的类似物。所述核酸分子可以是单链或双链的,优选双链DNA。所述核酸的合成可以使用核苷酸类似物或衍生物(例如肌苷或硫代磷酸核苷酸)。这种核苷酸可以用于,例如,制备具有改变的碱基配对能力或者增加的核酸酶抗性的核酸。As used herein, the term "polynucleotide" or "nucleic acid molecule" includes DNA molecules (such as cDNA or genomic DNA) and RNA molecules (such as mRNA) and analogs of DNA or RNA produced using nucleotide analogs. The nucleic acid molecule may be single-stranded or double-stranded, preferably double-stranded DNA. The synthesis of the nucleic acid may use nucleotide analogs or derivatives (for example, inosine or phosphorothioate nucleotides). Such nucleotides can be used, for example, to prepare nucleic acids with altered base pairing ability or increased nuclease resistance.
本发明还提供编码本发明的经修饰的LTTA的多核苷酸。因此,在本发明中,术语修饰还包括对编码本发明的LTTA多肽的多核苷酸的遗传操作。所述修饰可以是核苷酸的取代、缺失、插入和/或添加。The present invention also provides polynucleotides encoding the modified LTTA of the present invention. Therefore, in the present invention, the term modification also includes genetic manipulation of the polynucleotide encoding the LTTA polypeptide of the present invention. The modification may be a substitution, deletion, insertion and/or addition of nucleotides.
如本文所用,术语“编码”是指多核苷酸直接指定其蛋白质产物的氨基酸序列。编码序列的边界一般由开放读框确定,所述开放读框通常以ATG起始密码子或另外的起始密码子如GTG和TTG开始,以终止密码子如TAA、TAG和TGA结束。所述编码序列可以是DNA、cDNA或重组核苷酸序列。As used herein, the term "encoding" refers to the amino acid sequence of a polynucleotide directly specifying its protein product. The boundaries of the coding sequence are generally determined by an open reading frame, which usually starts with the ATG start codon or other start codons such as GTG and TTG, and ends with stop codons such as TAA, TAG, and TGA. The coding sequence can be a DNA, cDNA or recombinant nucleotide sequence.
此外,涵盖本发明的全部或部分核酸序列的核酸分子可以通过聚合酶链反应(PCR)分离,所述PCR使用基于所述序列中包含的序列信息设计合成的寡核苷酸引物。In addition, nucleic acid molecules covering all or part of the nucleic acid sequence of the present invention can be separated by polymerase chain reaction (PCR), which uses the design of synthetic oligonucleotide primers based on the sequence information contained in the sequence.
本发明的多核苷酸可以使用cDNA、mRNA或者基因组DNA作为模板及合适的寡核苷酸引物根据标准PCR扩增技术进行扩增。如此扩增的核酸可以克隆进合适的载体中,并通过DNA序列分析进行表征。The polynucleotide of the present invention can be amplified using cDNA, mRNA or genomic DNA as a template and suitable oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into a suitable vector and characterized by DNA sequence analysis.
本发明的多核苷酸可以通过标准的合成技术制备,例如使用自动化DNA合成仪制备。The polynucleotide of the present invention can be prepared by standard synthesis techniques, for example, by using an automated DNA synthesizer.
本发明还涉及本文描述的核酸分子的互补链。与其它核苷酸序列互补的核酸分子是与该核苷酸序列充分互补的分子,使得其可以与其他核苷酸序列杂交,从而形成稳定双 链体。The invention also relates to the complementary strands of the nucleic acid molecules described herein. A nucleic acid molecule that is complementary to other nucleotide sequences is a molecule that is sufficiently complementary to the nucleotide sequence so that it can hybridize with other nucleotide sequences to form a stable duplex.
如本文所用,术语“杂交”是在给定的严格杂交和洗涤条件下,彼此至少大约90%、优选至少大约95%、更优选至少大约96%、更优选至少98%同源的核苷酸序列通常保持彼此杂交。As used herein, the term "hybridization" refers to nucleotides that are at least about 90%, preferably at least about 95%, more preferably at least about 96%, and more preferably at least 98% homologous to each other under given stringent hybridization and washing conditions. The sequences generally remain hybridized to each other.
本领域技术人员知道各种用于杂交的条件,如严格杂交条件和高度严格杂交条件。参见,例如,Sambrook et al.,1989,Molecular Cloning,A Laboratory Manual,Cold Spring Harbor Press,N.Y.;和Ausubel et al.(eds.),1995,Current Protocols in Molecular Biology,John Wiley&Sons,N.Y.。Those skilled in the art know various conditions for hybridization, such as stringent hybridization conditions and highly stringent hybridization conditions. See, for example, Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, N.Y.; and Ausubel et al. (eds.), 1995, Current Protocols in Molecular Biology, John Wiley & Sons, N.Y.
当然,本发明的多核苷酸不包括仅与poly A序列(如mRNA的3’末端poly(A))或者与互补的一段poly T(或U)残基杂交的多核苷酸。Of course, the polynucleotide of the present invention does not include a polynucleotide that only hybridizes to a poly A sequence (such as the 3'end poly (A) of mRNA) or a complementary stretch of poly T (or U) residues.
三、表达和生产经修饰的LTTA3. Expression and production of modified LTTA
为表达本发明的经修饰的LTTA,还提供包含本发明的多核苷酸的核酸构建体和载体,如表达载体。In order to express the modified LTTA of the present invention, nucleic acid constructs and vectors containing the polynucleotide of the present invention, such as expression vectors, are also provided.
如本文所用,术语“表达”包括多肽生产中包含的任何步骤,包括但不限于转录、转录后修饰、翻译、翻译后修饰和分泌。As used herein, the term "expression" includes any step involved in the production of a polypeptide, including but not limited to transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
术语“核酸构建体”是指单链或双链的核酸分子,其分离自天然存在的基因或者被修饰为含有天然不存在的核酸区段。当所述核酸构建体含有表达本发明编码序列所需的控制序列时,术语核酸构建体与术语“表达盒”同义。The term "nucleic acid construct" refers to a single-stranded or double-stranded nucleic acid molecule, which is isolated from a naturally occurring gene or modified to contain a nucleic acid segment that does not occur in nature. When the nucleic acid construct contains the control sequences required to express the coding sequence of the present invention, the term nucleic acid construct is synonymous with the term "expression cassette".
术语“表达载体”在本文是指线性或环形DNA分子,其包含编码本发明多肽的多核苷酸,所述多核苷酸与为所述多核苷酸表达而提供的另外的核苷酸,例如,控制序列,可操纵地连接。所述表达载体包括病毒载体或质粒载体。The term "expression vector" refers herein to a linear or circular DNA molecule, which comprises a polynucleotide encoding a polypeptide of the present invention, the polynucleotide and additional nucleotides provided for the expression of the polynucleotide, for example, Control sequence, operably connected. The expression vector includes a viral vector or a plasmid vector.
术语“控制序列”在本文是指包括表达编码本发明多肽的多核苷酸所需或有利的所有元件。各控制序列对于编码多肽的核苷酸序列可以是天然的或者是外来的,或者彼此是天然或者外来的。这种控制序列包括但不限于前导序列、聚腺苷酸化序列、前肽序列、启动子、信号肽序列及转录终止子。最低限度,控制序列包括启动子和转录及翻译终止信号。The term "control sequence" refers herein to include all elements necessary or advantageous for the expression of the polynucleotide encoding the polypeptide of the present invention. Each control sequence may be natural or foreign to the nucleotide sequence encoding the polypeptide, or natural or foreign to each other. Such control sequences include, but are not limited to, leader sequences, polyadenylation sequences, propeptide sequences, promoters, signal peptide sequences, and transcription terminator. At a minimum, control sequences include promoters and transcription and translation termination signals.
例如,所述控制序列可以是合适的启动子序列,一种由宿主细胞识别以表达编码本发明多肽的多核苷酸的核苷酸序列。所述启动子序列含有介导所述多肽的表达的转录控制序列。所述启动子可以是在所选择的宿主细胞中表现出转录活性的任何核苷酸序列,例如,大肠杆菌(Escherichia coli)lac操纵子。所述启动子还包括突变的、截短的和杂合的启动子,并且可以从与宿主细胞同源或异源的编码胞外或胞内多肽的基因获得。For example, the control sequence may be a suitable promoter sequence, a nucleotide sequence recognized by the host cell to express the polynucleotide encoding the polypeptide of the present invention. The promoter sequence contains transcription control sequences that mediate the expression of the polypeptide. The promoter may be any nucleotide sequence that exhibits transcriptional activity in the selected host cell, for example, the Escherichia coli lac operon. The promoters also include mutated, truncated and hybrid promoters, and can be obtained from genes encoding extracellular or intracellular polypeptides homologous or heterologous to the host cell.
术语“可操纵地连接”在本文是指这样的构型,其中控制序列置于相对于多核苷酸序列的编码序列的适当位置,由此所述控制序列指导多肽编码序列的表达。The term "operably linked" herein refers to a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence, whereby the control sequence directs the expression of the polypeptide coding sequence.
编码本发明多肽的多核苷酸可以进行各种操作,以使得多肽表达。在将其插入载体之前,根据表达载体对多核苷酸的操作是可取的或必需的。利用重组DNA方法修饰多 核苷酸序列的技术为本领域熟知。The polynucleotide encoding the polypeptide of the present invention can be subjected to various operations to allow expression of the polypeptide. Before inserting it into the vector, it is desirable or necessary to manipulate the polynucleotide according to the expression vector. Techniques for modifying polynucleotide sequences using recombinant DNA methods are well known in the art.
为了鉴定和选择包含本发明的表达载体的宿主细胞,本发明的载体优选含有一或多个可选择标记,其使得可以对转化、转染、转导等的细胞进行简单的选择。可选择标记是一种基因,其产物提供生物杀灭剂或病毒抗性、重金属抗性、补充营养缺陷型等。例如,细菌的可选择标记是来自枯草芽孢杆菌或地衣芽孢杆菌的dal基因,或者赋予抗生素抗性如氨苄青霉素、卡那霉素、氯霉素或四环素抗性的标记。In order to identify and select host cells containing the expression vector of the present invention, the vector of the present invention preferably contains one or more selectable markers, which allow simple selection of transformed, transfected, transduced, etc. cells. A selectable marker is a gene whose product provides biocide or virus resistance, heavy metal resistance, supplementation of auxotrophs, etc. For example, the bacterial selectable marker is the dal gene from Bacillus subtilis or Bacillus licheniformis, or a marker that confers antibiotic resistance such as ampicillin, kanamycin, chloramphenicol, or tetracycline resistance.
本发明的载体可整合进宿主细胞基因组中或者在细胞中不依赖于基因组而自主复制。为了整合进宿主细胞基因组中或者自主复制所需的元件是本领域已知的(参见例如前述Sambrook et al.,1989)。The vector of the present invention can be integrated into the genome of the host cell or can replicate autonomously in the cell without relying on the genome. Elements required for integration into the host cell genome or autonomous replication are known in the art (see, for example, the aforementioned Sambrook et al., 1989).
载体DNA可以通过常规转化或转染技术导入原核或真核细胞中。如本文所用,术语“转化”和“转染”是指将外源核酸(例如DNA)导入宿主细胞中的各种本领域公认的技术,可见于例如前述Sambrook et al.,1989;Davis et al.,Basic Methods in Molecular Biology(1986)及其它实验室手册。Vector DNA can be introduced into prokaryotic or eukaryotic cells by conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" refer to various art-recognized techniques for introducing foreign nucleic acids (such as DNA) into host cells, which can be found in, for example, the aforementioned Sambrook et al., 1989; Davis et al. .,Basic Methods in Molecular Biology (1986) and other laboratory manuals.
本发明还涉及重组宿主细胞,其包含本发明的多核苷酸,所述多核苷酸有利地用于LTTA多肽的重组产生。包含本发明多核苷酸的载体被导入宿主细胞中,由此所述载体作为染色体整合体或作为自身复制染色体外载体被保留。本领域技术人员知晓表达蛋白质的常规载体和宿主细胞。The present invention also relates to a recombinant host cell, which contains the polynucleotide of the present invention, which is advantageously used for the recombinant production of LTTA polypeptides. The vector containing the polynucleotide of the present invention is introduced into a host cell, whereby the vector is retained as a chromosomal integrant or as a self-replicating extrachromosomal vector. Those skilled in the art know conventional vectors and host cells for expressing proteins.
在一些实施方案中,本发明的宿主细胞是大肠杆菌细胞,如大肠杆菌BL21(DE3)。在一些实施方案中,所述表达载体是pET-30a(+)。In some embodiments, the host cell of the present invention is an E. coli cell, such as E. coli BL21 (DE3). In some embodiments, the expression vector is pET-30a(+).
四、生产3-苯基-L-丝氨酸及其衍生物4. Production of 3-phenyl-L-serine and its derivatives
此外,本发明提供一种制备3-苯基-L-丝氨酸或其衍生物的方法,包括使本发明的经修饰LTTA或宿主细胞与苯甲醛或其衍生物以及L-苏氨酸接触。In addition, the present invention provides a method for preparing 3-phenyl-L-serine or a derivative thereof, which comprises contacting the modified LTTA or host cell of the present invention with benzaldehyde or a derivative thereof and L-threonine.
在一些实施方案中,本发明的制备3-苯基-L-丝氨酸或其衍生物的方法包括如下步骤:In some embodiments, the method for preparing 3-phenyl-L-serine or a derivative thereof of the present invention includes the following steps:
(a)向反应介质提供本发明的经修饰的LTTA的活性和苯甲醛或其衍生物以及L-苏氨酸;(a) Providing the activity of the modified LTTA of the present invention, benzaldehyde or its derivatives, and L-threonine to the reaction medium;
(b)温育所述反应介质使得苯甲醛或其衍生物与L-苏氨酸反应,生成3-苯基-L-丝氨酸或其衍生物和乙醛;(b) incubating the reaction medium so that benzaldehyde or its derivative reacts with L-threonine to generate 3-phenyl-L-serine or its derivative and acetaldehyde;
(c)任选地,回收3-苯基-L-丝氨酸或其衍生物。(c) Optionally, recover 3-phenyl-L-serine or a derivative thereof.
在一些实施方案中,向所述反应介质添加乙醛还原酶和还原型烟酰胺腺嘌呤双核苷酸(NADH)将乙醛还原为乙醇。在一些实施方案中,所述乙醛还原酶是来自大肠杆菌的乙醛还原酶,其氨基酸序列如SEQ ID NO:10所示。In some embodiments, acetaldehyde reductase and reduced nicotinamide adenine dinucleotide (NADH) are added to the reaction medium to reduce acetaldehyde to ethanol. In some embodiments, the acetaldehyde reductase is an acetaldehyde reductase from Escherichia coli, and its amino acid sequence is shown in SEQ ID NO: 10.
在一些实施方案中,向所述反应介质添加葡萄糖脱氢酶和葡萄糖,以实现辅酶(NADH)再生。在一些实施方案中,所述葡萄糖脱氢酶来源于枯草芽孢杆菌(Bacillus subtilis),其氨基酸序列如SEQ ID NO:12所示。In some embodiments, glucose dehydrogenase and glucose are added to the reaction medium to achieve coenzyme (NADH) regeneration. In some embodiments, the glucose dehydrogenase is derived from Bacillus subtilis (Bacillus subtilis), and its amino acid sequence is shown in SEQ ID NO: 12.
乙醛还原和辅酶再生的过程如式(II)所示:The process of acetaldehyde reduction and coenzyme regeneration is shown in formula (II):
在一些实施方案中,向所述反应介质添加甲酸或甲酸盐和甲酸脱氢酶,以实现辅酶(NADH)再生。在一些实施方案中,所述甲酸脱氢酶来源于博伊丁假丝酵母(Candida boidinii),其氨基酸序列如SEQ ID NO:73所示。In some embodiments, formate or formate and formate dehydrogenase are added to the reaction medium to achieve coenzyme (NADH) regeneration. In some embodiments, the formate dehydrogenase is derived from Candida boidinii, and its amino acid sequence is shown in SEQ ID NO: 73.
在一些实施方案中,所述反应介质是缓冲液,例如PBS或Tris-HCl缓冲液。在一个实施方案中,所述反应介质是PBS缓冲液,如0.1M、pH7.0的PBS缓冲液。In some embodiments, the reaction medium is a buffer, such as PBS or Tris-HCl buffer. In one embodiment, the reaction medium is a PBS buffer, such as 0.1M, pH 7.0 PBS buffer.
在一些实施方案中,所述温育在25-40℃,优选28-35℃,例如30℃进行。In some embodiments, the incubation is performed at 25-40°C, preferably 28-35°C, such as 30°C.
通过以下实施例,本领域技术人员会更清楚地理解本发明。应理解,实施例只是用于说明,而非限制本发明的范围。Through the following examples, those skilled in the art will understand the present invention more clearly. It should be understood that the embodiments are only for illustration, but not for limiting the scope of the present invention.
实施例1:材料和方法Example 1: Materials and methods
如无特别说明,本发明中使用的实验方法均为常规方法,基因克隆操作具体可参见前述Sambrook et al.,1989。Unless otherwise specified, the experimental methods used in the present invention are all conventional methods. For specific gene cloning operations, please refer to the aforementioned Sambrook et al., 1989.
i)试剂和仪器:i) Reagents and instruments:
DNA聚合酶(PrimeSTAR Max DNA Polymerase)和DpnI内切酶购自TaKaRa公司;DNA polymerase (PrimeSTAR Max DNA Polymerase) and DpnI endonuclease were purchased from TaKaRa;
质粒提取试剂盒购自Axygen公司;The plasmid extraction kit was purchased from Axygen;
对甲砜基苯甲醛购自阿拉丁,货号M185093,纯度98%;P-Methylsulfonyl benzaldehyde was purchased from Aladdin, item number M185093, purity 98%;
L-苏氨酸购自麦克林,货号C10393311,分析纯;L-threonine was purchased from Macleans, catalog number C10393311, analytically pure;
磷酸吡哆醛购自阿拉丁,货号P136795,纯度≥98%;Pyridoxal phosphate was purchased from Aladdin, item number P136795, purity ≥98%;
氯化镁购自阿拉丁,货号A2006034,分析纯;Magnesium chloride was purchased from Aladdin, catalog number A2006034, analytically pure;
氧化型烟酰胺腺嘌呤双核苷酸(NAD)购自阿拉丁,货号N196974,纯度95%。Oxidized nicotinamide adenine dinucleotide (NAD) was purchased from Aladdin, catalog number N196974, with a purity of 95%.
ii)载体和菌株:所使用的表达载体为pET-30a(+),质粒购自Novagen公司,所使用的宿主细胞为大肠杆菌BL21(DE3),购自天根生化科技(北京)有限公司。ii) Vector and strain: The expression vector used was pET-30a(+), the plasmid was purchased from Novagen, and the host cell used was E. coli BL21 (DE3), purchased from Tiangen Biochemical Technology (Beijing) Co., Ltd.
iii)测序、引物合成和基因合成由苏州泓迅生物科技股份有限公司完成。其中基因(SEQ ID NO:1、3、5、7、9和11)合成后,构建到载体pET-30a中。iii) Sequencing, primer synthesis and gene synthesis were completed by Suzhou Hongxun Biotechnology Co., Ltd. The genes (SEQ ID NO: 1, 3, 5, 7, 9 and 11) were synthesized and constructed into the vector pET-30a.
iv)定点突变:iv) Site-directed mutation:
设计特异性引物对,在所需突变的氨基酸位置对应碱基引入所需的取代。用提取的突变前质粒(包含野生型LTTA编码序列,pET-30a(+)骨架)为模版,利用Packer和Liu描述的方法制备突变体(Methods for the directed evolution of proteins.Nat Rev Genet,2015,16(7):379-394)。Design specific primer pairs, and introduce the required substitutions at the corresponding bases at the amino acid positions of the required mutations. Using the extracted pre-mutation plasmid (containing the wild-type LTTA coding sequence, pET-30a(+) skeleton) as a template, the mutant was prepared using the method described by Packer and Liu (Methods for the directed evolution of proteins. Nat Rev Genet, 2015, 16(7):379-394).
v)蛋白表达及酶液的制备:v) Protein expression and preparation of enzyme solution:
1、摇瓶培养进行表达1. Shake flask culture for expression
将用含有目的基因的质粒转化的大肠杆菌细胞接种至含有50mg/L的卡那霉素的LB液体培养基(250mL的瓶中的50mL培养基,蛋白胨10g/L,酵母粉5g/L,NaCl 10g/L,pH7.0)中,37℃震荡温育过夜。将1mL培养物转移至TB液体培养基(250mL的瓶中的50mL培养基,蛋白胨12g/L,酵母提取物24g/L,甘油4mL/L,磷酸二氢钾2.31g/L,磷酸氢二钾12.54g/L)中,37℃震荡温育至OD600达到0.6-0.8,加入IPTG(终浓度为0.4mM)在30℃温育过夜以诱导蛋白质表达。The E. coli cells transformed with the plasmid containing the target gene were inoculated into LB liquid medium containing 50mg/L of kanamycin (50mL medium in a 250mL bottle, peptone 10g/L, yeast powder 5g/L, NaCl 10g/L, pH7.0), incubate overnight at 37°C with shaking. Transfer 1mL culture to TB liquid medium (50mL medium in a 250mL bottle, peptone 12g/L, yeast extract 24g/L, glycerol 4mL/L, potassium dihydrogen phosphate 2.31g/L, dipotassium hydrogen phosphate 12.54g/L), incubate with shaking at 37°C until OD600 reaches 0.6-0.8, add IPTG (final concentration of 0.4mM) and incubate at 30°C overnight to induce protein expression.
温育后,将培养物以4,000g在4℃离心10min,弃上清,收集大肠杆菌细胞。将收集的大肠杆菌细胞重悬于预冷的20mL pH 7.0的磷酸盐缓冲液(PBS)中,在4℃超声破碎大肠杆菌细胞。细胞破碎液以6,000g在4℃离心15min去除沉淀,得到的上清为含有重组酶的溶液,用于催化反应。也可以将酶液冷冻干燥成酶粉,保存于4℃备用。After incubation, the culture was centrifuged at 4,000 g at 4°C for 10 min, the supernatant was discarded, and E. coli cells were collected. The collected E. coli cells were resuspended in 20 mL of pre-cooled phosphate buffered saline (PBS) at pH 7.0, and the E. coli cells were sonicated at 4°C. The cell disruption solution was centrifuged at 6,000 g at 4°C for 15 min to remove the precipitate, and the supernatant obtained was a solution containing the recombinant enzyme, which was used to catalyze the reaction. The enzyme solution can also be freeze-dried into enzyme powder and stored at 4°C for later use.
2、在96孔板上培养进行表达2. Cultivate on 96-well plates for expression
将用含有目的基因的质粒转化的大肠杆菌细胞接种至96孔浅孔板的孔中,每孔填充有100μL含有50μg/mL卡那霉素的LB培养基。将培养物在摇床中温育18小时(250rpm,30℃和85%相对湿度)。将经温育的培养物20μL转移到填充有180μL的含有50μg/mL卡那霉素的TB培养基的96孔深孔板的孔中。将板在30℃,250rpm温育2小时,然后加入IPTG(终浓度为0.4mM)诱导,并在摇床中30℃,250rpm温育20小时。离心收集大肠杆菌细胞,加入300μL裂解液(20mM磷酸缓冲液pH7.0,1mM MgSO
4,1mg/mL溶菌酶和0.5mg/mL多粘菌素B硫酸盐)。将混合物在室温下震荡2h,离心并取上清液用于酶活性测定。
The E. coli cells transformed with the plasmid containing the target gene were inoculated into the wells of a 96-well shallow-well plate, and each well was filled with 100 μL of LB medium containing 50 μg/mL kanamycin. The culture was incubated in a shaker for 18 hours (250 rpm, 30°C and 85% relative humidity). Transfer 20 μL of the incubated culture to the wells of a 96-well deep well plate filled with 180 μL of TB medium containing 50 μg/mL kanamycin. The plate was incubated at 30°C and 250 rpm for 2 hours, and then IPTG (final concentration of 0.4 mM) was added for induction, and incubated in a shaker at 30°C and 250 rpm for 20 hours. The E. coli cells were collected by centrifugation, and 300 μL of lysate (20mM phosphate buffer pH 7.0, 1mM MgSO 4 , 1mg/mL lysozyme and 0.5mg/mL polymyxin B sulfate) was added. The mixture was shaken at room temperature for 2h, centrifuged and the supernatant was taken for enzyme activity determination.
vi)酶活性测定vi) Enzyme activity determination
方法1:method 1:
配制以下体系用于测定LTTA突变体的酶活性(200μl):The following system was prepared to determine the enzyme activity of the LTTA mutant (200μl):
对甲砜基苯甲醛0.7g/L,L-苏氨酸0.6g/L,磷酸吡哆醛0.025g/L,氯化镁0.1g/L,NADH 0.2g/L,加入乙醛还原酶(终浓度为0.05g/L)和L-苏氨酸转醛酶或其突变体(终浓度为0.2g/L),100mM磷酸缓冲液(pH6.5)。P-methylsulfonyl benzaldehyde 0.7g/L, L-threonine 0.6g/L, pyridoxal phosphate 0.025g/L, magnesium chloride 0.1g/L, NADH 0.2g/L, add acetaldehyde reductase (final concentration 0.05g/L) and L-threonine transaldolase or its mutant (final concentration 0.2g/L), 100mM phosphate buffer (pH6.5).
将各反应物加入96孔板的孔中,短暂混合,使用SpectraMax190(Molecular Devices)光吸收酶标仪,通过在340nm处吸光度变化,从而检测LTTA及其突变体活性。The reactants were added to the wells of a 96-well plate, mixed briefly, and the SpectraMax190 (Molecular Devices) light absorption microplate reader was used to detect the activity of LTTA and its mutants by changing the absorbance at 340 nm.
方法2:Method 2:
配制以下体系用于测定LTTA突变体的酶活性(20mL):The following system was prepared to determine the enzyme activity of the LTTA mutant (20 mL):
对甲砜基苯甲醛50g/L,L-苏氨酸42g/L,氯化镁1g/L,NAD
+0.1g/L,磷酸吡哆醛0.15g/L,D-葡萄糖65g/L,100mM磷酸缓冲液(pH6.5)。
P-methylsulfonyl benzaldehyde 50g/L, L-threonine 42g/L, magnesium chloride 1g/L, NAD + 0.1g/L, pyridoxal phosphate 0.15g/L, D-glucose 65g/L, 100mM phosphate buffer Liquid (pH6.5).
加热至30℃磁力搅拌均匀,加入L-苏氨酸转醛酶酶粉20mg,乙醛还原酶酶粉20mg和葡萄糖脱氢酶酶粉8mg,开始搅拌反应,在反应1h后,HPLC检测对甲砜基苯丝氨酸。Heat to 30℃ and stir evenly with magnetic force. Add 20mg of L-threonine transaldolase enzyme powder, 20mg of acetaldehyde reductase enzyme powder and 8mg of glucose dehydrogenase enzyme powder. Sulfone phenylserine.
方法3:Method 3:
配制以下体系用于测定LTTA突变体的酶活性(1mL):The following system was prepared to determine the enzyme activity of the LTTA mutant (1mL):
对甲砜基苯甲醛28.4g/L,L-苏氨酸23.8g/L,氯化镁1g/L,磷酸吡哆醛0.066g/L,甲酸钠20.4g/L,NADH 0.325g/L,磷酸缓冲液100mM,pH7.0。P-methylsulfonyl benzaldehyde 28.4g/L, L-threonine 23.8g/L, magnesium chloride 1g/L, pyridoxal phosphate 0.066g/L, sodium formate 20.4g/L, NADH 0.325g/L, phosphate buffer 100mM, pH7.0.
加热至30℃混合均匀,加入LTTA至终浓度0.6mg/mL,甲酸脱氢酶至终浓度0.6mg/mL和乙醛还原酶至终浓度0.3mg/mL,开始搅拌反应,反应1小时后通过HPLC检测产物。Heat to 30℃ and mix well, add LTTA to a final concentration of 0.6mg/mL, formate dehydrogenase to a final concentration of 0.6mg/mL and acetaldehyde reductase to a final concentration of 0.3mg/mL, start the stirring reaction, and pass after 1 hour of reaction The product was detected by HPLC.
vii)HPLC分析vii) HPLC analysis
对于含量分析,HPLC参数如下:For content analysis, the HPLC parameters are as follows:
柱 Aliglent TC-C18柱(250mm*4.6mm*5μm);Column Aliglent TC-C18 column (250mm*4.6mm*5μm);
流动相 PBS(50mM,pH8.0):乙腈=85:15(0-2min),50:50(7-8min),85:15(9-13min);Mobile phase PBS (50mM, pH8.0): Acetonitrile=85:15 (0-2min), 50:50 (7-8min), 85:15 (9-13min);
流速 1mL/min;Flow rate 1mL/min;
柱温 30℃;Column temperature 30°C;
检测波长 235nm;Detection wavelength 235nm;
保留时间 对甲砜基苯甲醛:9.91min,Retention time p-methylsulfonyl benzaldehyde: 9.91min,
对甲砜基苯丝氨酸:3.25min。P-Methylsulfonylphenylserine: 3.25min.
对于立体异构体分析,HPLC参数如下:For stereoisomer analysis, the HPLC parameters are as follows:
柱 Aliglent TC-C18柱(250mm*4.6mm*5μm);Column Aliglent TC-C18 column (250mm*4.6mm*5μm);
流动相 PBS(50mM,pH8.0):乙腈=80:20;Mobile phase PBS (50mM, pH8.0): Acetonitrile=80:20;
流速 1mL/min;Flow rate 1mL/min;
柱温 30℃;Column temperature 30°C;
检测波长 340nm;Detection wavelength 340nm;
保留时间 (2R,3S)-对甲砜基苯丝氨酸:5.4min,Retention time (2R,3S)-p-methylsulfonyl phenylserine: 5.4min,
(2S,3R)-对甲砜基苯丝氨酸:5.9min,(2S,3R)-p-methylsulfonylphenylserine: 5.9min,
(2R,3R)-对甲砜基苯丝氨酸:6.3min,(2R, 3R)-p-methylsulfonyl phenylserine: 6.3min,
(2S,3S)-对甲砜基苯丝氨酸:6.8min。(2S, 3S)-p-methylsulfonylphenylserine: 6.8min.
实施例2、制备和检测荧光假单胞菌的LTTA(LTTA01)的突变体Example 2: Preparation and detection of mutants of LTTA (LTTA01) of Pseudomonas fluorescens
以LTTA01的编码核酸(SEQ ID NO:1)为模板,根据实施例1的方法制备突变体。所得突变体如表1所示。根据实施例1的方法,通过摇瓶培养表达LTTA01和其突变体,并制备酶粉。根据实施例1第vi)条中的方法2测定LTTA01和其突变体的活性,结果示于表1,其中相对活性是指突变体的活性/LTTA01的活性。Using the nucleic acid encoding LTTA01 (SEQ ID NO:1) as a template, a mutant was prepared according to the method of Example 1. The resulting mutants are shown in Table 1. According to the method of Example 1, the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared. The activities of LTTA01 and its mutants were measured according to Method 2 in section vi) of Example 1, and the results are shown in Table 1, where the relative activity refers to the activity of the mutant/the activity of LTTA01.
表1Table 1
突变体中的取代Substitutions in mutants | SEQ ID NO:SEQ ID NO: | 相对活性Relative activity |
N35AN35A | 1313 | 1.31.3 |
N35SN35S | 1414 | 1.151.15 |
C57MC57M | 1515 | 1.611.61 |
C57TC57T | 1616 | 1.361.36 |
F59AF59A | 1717 | 1.011.01 |
F59YF59Y | 1818 | 1.081.08 |
F70HF70H | 1919 | 1.381.38 |
Q94EQ94E | 2020 | 1.181.18 |
M141CM141C | 21twenty one | 1.121.12 |
M205SM205S | 22twenty two | 1.271.27 |
M205QM205Q | 23twenty three | 1.221.22 |
M205AM205A | 24twenty four | 1.31.3 |
H229CH229C | 2525 | 1.261.26 |
F59A-F70HF59A-F70H | 2626 | 1.711.71 |
F181Q-F70HF181Q-F70H | 2727 | 1.681.68 |
K116N-F70HK116N-F70H | 2828 | 1.731.73 |
N35S-F70HN35S-F70H | 2929 | 3.193.19 |
Y38F-F70HY38F-F70H | 3030 | 1.601.60 |
G48A-F70HG48A-F70H | 3131 | 1.591.59 |
G48C-F70HG48C-F70H | 3232 | 1.421.42 |
F59Y-F70HF59Y-F70H | 3333 | 1.701.70 |
F70H-Q94EF70H-Q94E | 3434 | 1.611.61 |
F70H-K116SF70H-K116S | 3535 | 1.531.53 |
F70H-K116NF70H-K116N | 3636 | 1.631.63 |
F70H-M141CF70H-M141C | 3737 | 1.451.45 |
N35S-F70H-F181LN35S-F70H-F181L | 3838 | 2.362.36 |
N35S-F70H-F181QN35S-F70H-F181Q | 3939 | 2.492.49 |
N35S-Y38F-F70HN35S-Y38F-F70H | 4040 | 3.513.51 |
N35S-C57M-F70HN35S-C57M-F70H | 4141 | 4.234.23 |
N35S-F70H-Q94EN35S-F70H-Q94E | 4242 | 3.453.45 |
N35S-F70H-K116NN35S-F70H-K116N | 4343 | 3.063.06 |
N35S-F70H-M141CN35S-F70H-M141C | 4444 | 3.093.09 |
N35S-G48A-F70HN35S-G48A-F70H | 4545 | 3.803.80 |
N35S-Y38F-F70H-Q94EN35S-Y38F-F70H-Q94E | 4646 | 3.513.51 |
N35S-Y38F-F70H-K116SN35S-Y38F-F70H-K116S | 4747 | 2.872.87 |
N35S-Y38F-G48A-F70HN35S-Y38F-G48A-F70H | 4848 | 3.323.32 |
N35S-G48A-F59A-F70HN35S-G48A-F59A-F70H | 4949 | 3.603.60 |
N35S-F59A-F70H-F181QN35S-F59A-F70H-F181Q | 5050 | 2.362.36 |
N35S-F59Y-F70H-F181QN35S-F59Y-F70H-F181Q | 5151 | 2.622.62 |
N35S-G48A-F70H-Q94EN35S-G48A-F70H-Q94E | 5252 | 3.833.83 |
N35S-F70H-K116S-M141CN35S-F70H-K116S-M141C | 5353 | 2.462.46 |
N35S-F70H-K116N-M141CN35S-F70H-K116N-M141C | 5454 | 2.622.62 |
N35S-G48A-F59A-F70H-Q94EN35S-G48A-F59A-F70H-Q94E | 5555 | 3.763.76 |
N35S-F59A-F70H-Q94E-M141CN35S-F59A-F70H-Q94E-M141C | 5656 | 3.323.32 |
N35S-Y38F-G48A-F70H-Q94EN35S-Y38F-G48A-F70H-Q94E | 5757 | 3.703.70 |
N35S-Y38F-G48A-F59A-F70H-K116RN35S-Y38F-G48A-F59A-F70H-K116R | 5858 | 3.733.73 |
N35S-Y38F-G48A-Q94E-K116S-F70HN35S-Y38F-G48A-Q94E-K116S-F70H | 5959 | 2.652.65 |
N35S-C57T-F59A-F70H-Q94E-M141CN35S-C57T-F59A-F70H-Q94E-M141C | 6060 | 4.024.02 |
N35S-C57S-F59A-F70H-Q94E-M141CN35S-C57S-F59A-F70H-Q94E-M141C | 6161 | 3.953.95 |
N35S-C57M-F59A-F70H-Q94E-M141CN35S-C57M-F59A-F70H-Q94E-M141C | 6262 | 4.324.32 |
N35S-F59A-F70H-Q94E-M141C-W185GN35S-F59A-F70H-Q94E-M141C-W185G | 6363 | 2.192.19 |
N35S-F59A-F70H-Q94E-M141C-K407RN35S-F59A-F70H-Q94E-M141C-K407R | 6464 | 2.692.69 |
N35S-F59A-F70H-Q94E-M141C-M205AN35S-F59A-F70H-Q94E-M141C-M205A | 6565 | 4.024.02 |
实施例3:制备和检测不同来源的L-苏氨酸转醛酶的突变体Example 3: Preparation and detection of mutants of L-threonine transaldolase from different sources
根据实施例1的方法,基于以下LTTA编码核酸制备包含取代组合N35S-C57M-F70H的突变体:According to the method of Example 1, a mutant containing the substitution combination N35S-C57M-F70H was prepared based on the following LTTA encoding nucleic acid:
-Pseudomonas sp.34 E 7的LTTA:LTTA02,编码核酸SEQ ID NO:3;-Pseudomonas sp.34 E 7's LTTA: LTTA02, encoding nucleic acid SEQ ID NO: 3;
-Pseudomonas sp.Irchel s3a18的LTTA:LTTA03,编码核酸SEQ ID NO:5;和-LTTA of Pseudomonas sp. Irchel s3a18: LTTA03, encoding nucleic acid SEQ ID NO: 5; and
-Chitiniphilus shinanonensis的LTTA:LTTA04,编码核酸SEQ ID NO:7。-LTTA of Chitiniphilus shinanonensis: LTTA04, encoding nucleic acid SEQ ID NO: 7.
根据实施例1的方法,通过摇瓶培养表达LTTA01和其突变体,并制备酶粉。根据实施例1第vi)条中的方法2测定各野生型LTTA和其突变体的活性,结果示于表2,其中相对活性是指突变体的活性/LTTA01的活性。According to the method of Example 1, the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared. The activity of each wild-type LTTA and its mutants was measured according to method 2 in section vi) of Example 1, and the results are shown in Table 2, where the relative activity refers to the activity of the mutant/the activity of LTTA01.
表2Table 2
LTTA或突变体LTTA or mutant | SEQ ID NO:SEQ ID NO: | 相对活性Relative activity |
LTTA01LTTA01 | 22 | 11 |
LTTA01-N35S-C57M-F70HLTTA01-N35S-C57M-F70H | 4141 | 4.234.23 |
LTTA02LTTA02 | 44 | 1.021.02 |
LTTA02-N35S-C57M-F70HLTTA02-N35S-C57M-F70H | 6666 | 4.304.30 |
LTTA03LTTA03 | 66 | 0.490.49 |
LTTA03-N35S-C57M-F70HLTTA03-N35S-C57M-F70H | 6767 | 2.792.79 |
LTTA04LTTA04 | 88 | 0.130.13 |
LTTA04-N35S-C57M-F70HLTTA04-N35S-C57M-F70H | 6868 | 0.170.17 |
实施例4:在位置35、57和70引入不同取代制备单突变体,并比较其酶活性Example 4: Introduce different substitutions at positions 35, 57 and 70 to prepare single mutants and compare their enzyme activities
以LTTA01的编码核酸(SEQ ID NO:1)为模板,根据实施例1的方法制备突变体,在位置35、57和70引入不同取代。所得突变体如表1所示。根据实施例1的方法,通过96孔板培养表达LTTA01和其突变体。根据实施例1第vi)条中的方法1测定LTTA01和突变体的活性,结果示于表3,其中相对活性是指突变体的活性/LTTA01的活性。Using the nucleic acid encoding LTTA01 (SEQ ID NO:1) as a template, a mutant was prepared according to the method of Example 1, and different substitutions were introduced at positions 35, 57, and 70. The resulting mutants are shown in Table 1. According to the method of Example 1, the expression LTTA01 and its mutants were cultured in a 96-well plate. The activity of LTTA01 and the mutant was measured according to Method 1 in Section vi) of Example 1, and the results are shown in Table 3, where the relative activity refers to the activity of the mutant/the activity of LTTA01.
表3table 3
突变体中的取代Substitutions in mutants | SEQ ID NO:SEQ ID NO: | 相对活性Relative activity |
N35AN35A | 1313 | 2.12.1 |
N35EN35E | To | 0.060.06 |
N35GN35G | 6969 | 1.771.77 |
N35HN35H | To | 0.690.69 |
N35MN35M | To | 0.410.41 |
N35PN35P | To | 0.220.22 |
N35QN35Q | To | 0.420.42 |
N35SN35S | 1414 | 1.651.65 |
N35VN35V | To | 0.080.08 |
C57AC57A | To | 0.070.07 |
C57EC57E | To | 0.320.32 |
C57FC57F | To | 0.60.6 |
C57GC57G | To | 0.810.81 |
C57MC57M | 1515 | 1.31.3 |
C57PC57P | To | 0.20.2 |
C57TC57T | 1616 | 1.161.16 |
C57WC57W | 7070 | 1.541.54 |
C57YC57Y | To | 0.360.36 |
F70EF70E | To | 0.330.33 |
F70HF70H | 1919 | 1.881.88 |
F70LF70L | To | 0.50.5 |
F70NF70N | 7171 | 1.291.29 |
F70QF70Q | To | 0.710.71 |
F70SF70S | 7272 | 1.531.53 |
F70TF70T | To | 0.80.8 |
F70VF70V | To | 0.60.6 |
实施例5、在LTTA01中引入不同突变对酶活性和产物选择性的影响Example 5. The effect of introducing different mutations in LTTA01 on enzyme activity and product selectivity
以LTTA01的编码核酸(SEQ ID NO:1)为模板,根据实施例1的方法制备突变体。根据实施例1的方法,通过摇瓶培养表达LTTA01和其突变体,并制备酶粉。根据实施例1第vi)条中的方法3测定LTTA01和突变体的活性和产品选择性,结果示于表4,其中相对活性是指突变体的活性/LTTA01的活性,产品选择性以DR表示,并且对于每种酶,ee%>99.9%。Using the nucleic acid encoding LTTA01 (SEQ ID NO:1) as a template, a mutant was prepared according to the method of Example 1. According to the method of Example 1, the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared. The activity and product selectivity of LTTA01 and the mutant were determined according to Method 3 in Article vi) of Example 1, and the results are shown in Table 4. The relative activity refers to the activity of the mutant/the activity of LTTA01, and the product selectivity is represented by DR , And for each enzyme, ee%>99.9%.
表4Table 4
LTTA或突变体(以其中取代表示)LTTA or mutant (indicated by its substitution) | 相对活性Relative activity | DRDR |
WT(LTTA01)WT(LTTA01) | 1.001.00 | 95.2:4.895.2:4.8 |
N35SN35S | 0.410.41 | 96.8:3.296.8:3.2 |
C57NC57N | 1.171.17 | 94.0:6.094.0:6.0 |
N35S-C57NN35S-C57N | 0.490.49 | 97.2:2.897.2:2.8 |
F70HF70H | 1.671.67 | 96.7:3.396.7:3.3 |
N35S-F70HN35S-F70H | 1.711.71 | 98.1:1.998.1:1.9 |
C57N-F70HC57N-F70H | 1.661.66 | 95.5:4.595.5:4.5 |
N35S-C57N-F70HN35S-C57N-F70H | 2.132.13 | 98.1:1.998.1:1.9 |
N35S-C57M-F59A-F70H-Q94E-M141CN35S-C57M-F59A-F70H-Q94E-M141C | 3.253.25 | 99.4:0.699.4:0.6 |
可见,包含F70H或N35S的突变体的产品选择性均高于野生型,并且包含N35S和F70H突变的突变体的产品选择性的进一步提高。包含N35S单突变的突变体在上述反应条件下酶活性不如WT,但与F70H组合后并没有显著影响酶活性。It can be seen that the product selectivity of mutants containing F70H or N35S is higher than that of wild type, and the product selectivity of mutants containing N35S and F70H mutations is further improved. The mutant containing the single mutation of N35S was not as active as WT under the above reaction conditions, but it did not significantly affect the enzyme activity after being combined with F70H.
实施例6、在LTTA01的位置70引入不同突变对酶活性和产物选择性的影响Example 6. The effect of introducing different mutations at position 70 of LTTA01 on enzyme activity and product selectivity
以LTTA01的编码核酸(SEQ ID NO:1)为模板,根据实施例1的方法制备突变体,对LTTA01的位置70引入取代。根据实施例1的方法,通过摇瓶培养表达LTTA01和其突变体,并制备酶粉。根据实施例1第vi)条中的方法3测定LTTA01和突变体的活性和产品选择性,结果示于表5,其中相对活性是指突变体的活性/LTTA01的活性,产品选择性以DR表示,并且对于每种酶,ee%>99.9%。Using the nucleic acid encoding LTTA01 (SEQ ID NO:1) as a template, a mutant was prepared according to the method of Example 1, and a substitution was introduced at position 70 of LTTA01. According to the method of Example 1, the expression LTTA01 and its mutants were cultured by shaking flasks, and enzyme powder was prepared. The activity and product selectivity of LTTA01 and the mutant were determined according to Method 3 in Article vi) of Example 1, and the results are shown in Table 5. The relative activity refers to the activity of the mutant/the activity of LTTA01, and the product selectivity is represented by DR , And for each enzyme, ee%>99.9%.
表5table 5
LTTA和突变体LTTA and mutants | 相对活性Relative activity | DRDR |
WTWT | 1.001.00 | 95.3:4.795.3:4.7 |
F70GF70G | 0.000.00 | -- |
F70HF70H | 2.692.69 | 97.1:2.997.1:2.9 |
F70NF70N | 3.623.62 | 97.7:2.397.7:2.3 |
F70PF70P | 0.110.11 | 91.4:8.691.4:8.6 |
F70RF70R | 0.140.14 | 88.2:11.888.2:11.8 |
F70SF70S | 3.173.17 | 97.8:2.297.8:2.2 |
F70WF70W | 0.080.08 | 91.5:8.591.5:8.5 |
F70YF70Y | 0.220.22 | 92.4:7.692.4:7.6 |
Claims (10)
- 一种经修饰的L-苏氨酸转醛酶(LTTA),A modified L-threonine transaldolase (LTTA),与其起始LTTA相比,包含位置70的氨基酸取代,所述位置70的氨基酸取代为H、N或S,且任选地,所述经修饰的LTTA还包含选自位置35、38、48、57、59、94、116、141、181、185、205、229和407的一或多个位置的取代,其中氨基酸位置参照SEQ ID NO:2进行编号,Compared with its starting LTTA, the amino acid substitution at position 70 is included, and the amino acid substitution at position 70 is H, N, or S, and optionally, the modified LTTA also includes positions 35, 38, 48, 57, 59, 94, 116, 141, 181, 185, 205, 229 and 407 substitutions at one or more positions, wherein the amino acid positions are numbered with reference to SEQ ID NO: 2,其中与其起始LTTA相比,所述经修饰的LTTA具有改进的催化苯甲醛或其衍生物与L-苏氨酸反应生成3-苯基-L-丝氨酸及其衍生物的活性。Wherein, compared with the starting LTTA, the modified LTTA has an improved activity of catalyzing the reaction of benzaldehyde or its derivatives with L-threonine to generate 3-phenyl-L-serine and its derivatives.
- 权利要求1的经修饰的LTTA,其包含选自以下的一或多个氨基酸取代:位置35取代为A、G或S,位置38取代为F,位置48取代为A或C,位置57取代为S、M、T或W,位置59取代为A或Y,位置94取代为E,位置116取代为R、S或N,位置141取代为C,位置181取代为Q或L,位置185取代为G,位置205选自S、Q或A,位置229取代为C,位置407取代为R。The modified LTTA of claim 1, comprising one or more amino acid substitutions selected from the group consisting of: A, G, or S at position 35, F at position 38, A or C at position 48, and A or C at position 57, S, M, T or W, position 59 is substituted with A or Y, position 94 is substituted with E, position 116 is substituted with R, S or N, position 141 is substituted with C, position 181 is substituted with Q or L, position 185 is substituted with G, position 205 is selected from S, Q or A, position 229 is substituted with C, and position 407 is substituted with R.
- 权利要求1或2的经修饰的LTTA,其中所述经修饰的LTTA包含位置35和70的氨基酸取代,其中位置35取代为S,且位置70取代为H。The modified LTTA of claim 1 or 2, wherein the modified LTTA comprises amino acid substitutions at positions 35 and 70, wherein position 35 is substituted to S and position 70 is substituted to H.
- 权利要求1-3任一项的经修饰的LTTA,其中所述经修饰的LTTA包含位置35、57和70的氨基酸取代,其中位置35取代为S,位置57取代为M,且位置70取代为H。The modified LTTA of any one of claims 1-3, wherein the modified LTTA comprises amino acid substitutions at positions 35, 57, and 70, wherein position 35 is substituted with S, position 57 is substituted with M, and position 70 is substituted with H.
- 权利要求1-4任一项的的经修饰的LTTA,其中所述经修饰的LTTA包含位置35、57、59、70、94和141的氨基酸取代,其中位置35取代为S,位置57取代为M,位置59取代为A,位置70取代为H,位置94取代为E,且位置141取代为C。The modified LTTA of any one of claims 1 to 4, wherein the modified LTTA comprises amino acid substitutions at positions 35, 57, 59, 70, 94 and 141, wherein the substitution at position 35 is S and the substitution at position 57 is M, position 59 is substituted with A, position 70 is substituted with H, position 94 is substituted with E, and position 141 is substituted with C.
- 一种经修饰的LTTA,其包含SEQ ID NO:13-72之一的氨基酸序列。A modified LTTA comprising the amino acid sequence of one of SEQ ID NO: 13-72.
- 一种多核苷酸,编码权利要求1-6任一项的经修饰的LTTA。A polynucleotide encoding the modified LTTA of any one of claims 1-6.
- 一种表达载体,包含权利要求7的多核苷酸。An expression vector comprising the polynucleotide of claim 7.
- 一种宿主细胞,包含权利要求1-6任一项的经修饰的LTTA、权利要求7多核苷酸或权利要求8的载体。A host cell comprising the modified LTTA of any one of claims 1-6, the polynucleotide of claim 7 or the vector of claim 8.
- 一种生产的3-苯基-L-丝氨酸及其衍生物方法,包括使1-6任一项的经修饰的LTTA或权利要求9的宿主细胞与苯甲醛或其衍生物以及L-苏氨酸接触。A method for producing 3-phenyl-L-serine and its derivatives, comprising combining the modified LTTA of any one of 1-6 or the host cell of claim 9 with benzaldehyde or its derivatives and L-threonine Acid contact.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010361933 | 2020-04-30 | ||
CN202010361933.9 | 2020-04-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021219124A1 true WO2021219124A1 (en) | 2021-11-04 |
Family
ID=78243031
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/091392 WO2021219124A1 (en) | 2020-04-30 | 2021-04-30 | Modified threonine transaldolase and application thereof |
Country Status (2)
Country | Link |
---|---|
CN (2) | CN113583989B (en) |
WO (1) | WO2021219124A1 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024077428A1 (en) * | 2022-10-10 | 2024-04-18 | 武汉远大弘元股份有限公司 | Enzyme with d-amino acid synthesis activity and use thereof |
CN118240795B (en) * | 2024-05-28 | 2024-08-27 | 中国科学院天津工业生物技术研究所 | L-threonine transaldolase mutant and application thereof in synthesis of chiral beta-hydroxy-alpha-amino acid |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109836362A (en) * | 2017-11-29 | 2019-06-04 | 苏州引航生物科技有限公司 | A kind of method preparing chiral (2S, 3R)-D-4-methylsulfonylphserine serine ethyl ester |
CN110540977A (en) * | 2019-09-05 | 2019-12-06 | 福建昌生生物科技发展有限公司 | Application of L-threonine transaldolase in synthesis of florfenicol chiral intermediate |
CN110904188A (en) * | 2019-12-23 | 2020-03-24 | 福州大学 | High-throughput rapid screening method of L-threonine transaldolase mutant |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10655153B2 (en) * | 2017-03-14 | 2020-05-19 | Washington University | Chemoenzymatic synthesis of peptide beta-lactones and beta-hydroxy acids |
CN110951799B (en) * | 2019-12-23 | 2021-07-30 | 福州大学 | Method for synthesizing (2S,3R) -p-methylsulfonyl phenyl serine by whole cell asymmetry of' one bacterium multienzyme |
-
2021
- 2021-04-30 CN CN202110480009.7A patent/CN113583989B/en active Active
- 2021-04-30 CN CN202410785300.9A patent/CN118530962A/en active Pending
- 2021-04-30 WO PCT/CN2021/091392 patent/WO2021219124A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109836362A (en) * | 2017-11-29 | 2019-06-04 | 苏州引航生物科技有限公司 | A kind of method preparing chiral (2S, 3R)-D-4-methylsulfonylphserine serine ethyl ester |
CN110540977A (en) * | 2019-09-05 | 2019-12-06 | 福建昌生生物科技发展有限公司 | Application of L-threonine transaldolase in synthesis of florfenicol chiral intermediate |
CN110904188A (en) * | 2019-12-23 | 2020-03-24 | 福州大学 | High-throughput rapid screening method of L-threonine transaldolase mutant |
Non-Patent Citations (2)
Title |
---|
DATABASE PROTEIN 19 December 2017 (2017-12-19), ANONYMOUS : "MULTISPECIES: hypothetical protein [Pseudomonas]", XP055867098, retrieved from NCBI Database accession no. WP_065949345 * |
SCOTT THOMAS A., HEINE DANIEL, QIN ZHIWEI, WILKINSON BARRIE: "An L-threonine transaldolase is required for L-threo-β-hydroxy-α-amino acid assembly during obafluorin biosynthesis", NATURE COMMUNICATIONS, vol. 8, no. 1, 1 August 2017 (2017-08-01), XP055867095, DOI: 10.1038/ncomms15935 * |
Also Published As
Publication number | Publication date |
---|---|
CN113583989A (en) | 2021-11-02 |
CN118530962A (en) | 2024-08-23 |
CN113583989B (en) | 2024-06-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11667896B2 (en) | Modified DAAO enzyme and application thereof | |
WO2021219124A1 (en) | Modified threonine transaldolase and application thereof | |
CN111944775B (en) | Modified 7 beta-hydroxysteroid dehydrogenase and application thereof | |
JP6279478B2 (en) | Hydantoinase mutants | |
JP2022547523A (en) | Peroxidase activity against 10-acetyl-3,7-dihydroxyphenoxazine | |
CN111100851B (en) | Alcohol dehydrogenase mutant and application thereof in synthesis of chiral diaryl alcohol compound | |
WO2022007881A1 (en) | Modified glutamate dehydrogenase and use thereof | |
TWI852256B (en) | Modified glutamate dehydrogenase and its application | |
WO2023125769A1 (en) | Modified glutamate dehydrogenase and application thereof | |
WO2024183112A1 (en) | Method for expressing d-amino acid oxidase and use thereof | |
KR101617371B1 (en) | Novel Baeyer-Villiger monooxygenase BVMOgm1 from Metagenome | |
KR101619864B1 (en) | Novel Baeyer-Villiger monooxygenase BVMOsm1 from Metagenome | |
CN118667785A (en) | Modified homoserine O-acetyltransferase and its use | |
WO2024010785A1 (en) | Ketoreductase enzymes for the synthesis of 1,3-diol substituted indanes | |
TW202413637A (en) | Mutant ketoreductase with increased ketoreductase activity as well as methods and uses involving the same | |
JP2010187573A (en) | Method for producing (r)-3-hydroxyisobutyric acid |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21797898 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 202217067950 Country of ref document: IN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21797898 Country of ref document: EP Kind code of ref document: A1 |