CN114410665A - Gene for efficiently catalyzing biosynthesis of gallic acid methyl ester and application thereof
- Publication number
- CN114410665A (application CN202111610349.3A)
- Authority
- CN
- China
- Prior art keywords
- gene
- mrta
- csta
- methyl ester
- gallic acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- FBSFWRHWHYMIOG-UHFFFAOYSA-N methyl 3,4,5-trihydroxybenzoate Chemical compound COC(=O)C1=CC(O)=C(O)C(O)=C1 FBSFWRHWHYMIOG-UHFFFAOYSA-N 0.000 title claims abstract description 85
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 65
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 30
- 101150084890 cstA gene Proteins 0.000 claims abstract description 39
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 27
- 239000002773 nucleotide Substances 0.000 claims abstract description 21
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 21
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims abstract description 16
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims abstract description 16
- 241000196324 Embryophyta Species 0.000 claims abstract description 13
- 241000894006 Bacteria Species 0.000 claims abstract description 7
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 25
- 239000000758 substrate Substances 0.000 claims description 17
- 238000000034 method Methods 0.000 claims description 14
- IBKQQKPQRYUGBJ-UHFFFAOYSA-N methyl gallate Natural products CC(=O)C1=CC(O)=C(O)C(O)=C1 IBKQQKPQRYUGBJ-UHFFFAOYSA-N 0.000 claims description 14
- WMBWREPUVVBILR-UHFFFAOYSA-N GCG Natural products C=1C(O)=C(O)C(O)=CC=1C1OC2=CC(O)=CC(O)=C2CC1OC(=O)C1=CC(O)=C(O)C(O)=C1 WMBWREPUVVBILR-UHFFFAOYSA-N 0.000 claims description 12
- WMBWREPUVVBILR-WIYYLYMNSA-N (-)-Epigallocatechin-3-o-gallate Chemical compound O([C@@H]1CC2=C(O)C=C(C=C2O[C@@H]1C=1C=C(O)C(O)=C(O)C=1)O)C(=O)C1=CC(O)=C(O)C(O)=C1 WMBWREPUVVBILR-WIYYLYMNSA-N 0.000 claims description 11
- 238000006911 enzymatic reaction Methods 0.000 claims description 9
- 239000013598 vector Substances 0.000 claims description 7
- 230000014509 gene expression Effects 0.000 claims description 5
- 239000013604 expression vector Substances 0.000 claims description 4
- QJYNZEYHSMRWBK-NIKIMHBISA-N 1,2,3,4,6-pentakis-O-galloyl-beta-D-glucose Chemical compound OC1=C(O)C(O)=CC(C(=O)OC[C@@H]2[C@H]([C@H](OC(=O)C=3C=C(O)C(O)=C(O)C=3)[C@@H](OC(=O)C=3C=C(O)C(O)=C(O)C=3)[C@H](OC(=O)C=3C=C(O)C(O)=C(O)C=3)O2)OC(=O)C=2C=C(O)C(O)=C(O)C=2)=C1 QJYNZEYHSMRWBK-NIKIMHBISA-N 0.000 claims description 3
- 235000008375 Decussocarpus nagi Nutrition 0.000 claims description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 claims description 3
- 244000132436 Myrica rubra Species 0.000 claims description 3
- 235000014631 Myrica rubra Nutrition 0.000 claims description 3
- 235000006468 Thea sinensis Nutrition 0.000 claims description 3
- 244000052707 Camellia sinensis Species 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 235000007270 Gaultheria hispida Nutrition 0.000 abstract description 10
- 235000009134 Myrica cerifera Nutrition 0.000 abstract description 10
- 235000012851 Myrica pensylvanica Nutrition 0.000 abstract description 10
- 235000009024 Ceanothus sanguineus Nutrition 0.000 abstract description 4
- 240000003553 Leptospermum scoparium Species 0.000 abstract description 4
- 235000015459 Lycium barbarum Nutrition 0.000 abstract description 4
- 239000013612 plasmid Substances 0.000 abstract description 3
- 244000024215 Myrica gale Species 0.000 abstract description 2
- 238000012269 metabolic engineering Methods 0.000 abstract description 2
- 230000009261 transgenic effect Effects 0.000 abstract 1
- 239000000047 product Substances 0.000 description 13
- 239000000243 solution Substances 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 9
- 108090000790 Enzymes Proteins 0.000 description 9
- 244000269152 Myrica pensylvanica Species 0.000 description 8
- 244000269722 Thea sinensis Species 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 238000010367 cloning Methods 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- LNTHITQWFMADLM-UHFFFAOYSA-N gallic acid Chemical compound OC(=O)C1=CC(O)=C(O)C(O)=C1 LNTHITQWFMADLM-UHFFFAOYSA-N 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- XMOCLSLCDHWDHP-IUODEOHRSA-N epi-Gallocatechin Chemical compound C1([C@H]2OC3=CC(O)=CC(O)=C3C[C@H]2O)=CC(O)=C(O)C(O)=C1 XMOCLSLCDHWDHP-IUODEOHRSA-N 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 239000012160 loading buffer Substances 0.000 description 3
- methyl gallate Chemical class 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000035484 reaction time Effects 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 2
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- XMOCLSLCDHWDHP-UHFFFAOYSA-N L-Epigallocatechin Natural products OC1CC2=C(O)C=C(O)C=C2OC1C1=CC(O)=C(O)C(O)=C1 XMOCLSLCDHWDHP-UHFFFAOYSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 244000052616 bacterial pathogen Species 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000007123 defense Effects 0.000 description 2
- 229940074391 gallic acid Drugs 0.000 description 2
- 235000004515 gallic acid Nutrition 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 238000009630 liquid culture Methods 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 238000001819 mass spectrum Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 239000008055 phosphate buffer solution Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000005303 weighing Methods 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- VLAFRQCSFRYCLC-FXQIFTODSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-2-aminopropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VLAFRQCSFRYCLC-FXQIFTODSA-N 0.000 description 1
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- TVTIDSMADMIHEU-KKUMJFAQSA-N His-Cys-Phe Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(O)=O TVTIDSMADMIHEU-KKUMJFAQSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- 241000588770 Proteus mirabilis Species 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 208000018569 Respiratory Tract disease Diseases 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- 241001147691 Staphylococcus saprophyticus Species 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- 208000012931 Urologic disease Diseases 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- RJINLRBSXMOGAQ-KMNHUKDVSA-N [(2r,3r,4s,5r)-6-hydroxy-3,4,5-tris[(3,4,5-trihydroxybenzoyl)oxy]oxan-2-yl]methyl 3,4,5-trihydroxybenzoate Chemical compound C([C@H]1OC([C@@H]([C@@H](OC(=O)C=2C=C(O)C(O)=C(O)C=2)[C@@H]1OC(=O)C=1C=C(O)C(O)=C(O)C=1)OC(=O)C=1C=C(O)C(O)=C(O)C=1)O)OC(=O)C1=CC(O)=C(O)C(O)=C1 RJINLRBSXMOGAQ-KMNHUKDVSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- PQLVXDKIJBQVDF-UHFFFAOYSA-N acetic acid;hydrate Chemical compound O.CC(O)=O PQLVXDKIJBQVDF-UHFFFAOYSA-N 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000000843 anti-fungal effect Effects 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000002153 concerted effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- DZYNKLUGCOSVKS-UHFFFAOYSA-N epigallocatechin Natural products OC1Cc2cc(O)cc(O)c2OC1c3cc(O)c(O)c(O)c3 DZYNKLUGCOSVKS-UHFFFAOYSA-N 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 229930003935 flavonoid Natural products 0.000 description 1
- 150000002215 flavonoids Chemical class 0.000 description 1
- 235000017173 flavonoids Nutrition 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 239000007986 glycine-NaOH buffer Substances 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 239000012452 mother liquor Substances 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 235000013824 polyphenols Nutrition 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 229930000044 secondary metabolite Natural products 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 208000017520 skin disease Diseases 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 208000018556 stomach disease Diseases 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 208000014001 urinary system disease Diseases 0.000 description 1
- 210000001635 urinary tract Anatomy 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1003—Transferases (2.) transferring one-carbon groups (2.1)
- C12N9/1007—Methyltransferases (general) (2.1.1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/62—Carboxylic acid esters
Abstract
The invention discloses a gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester and an application thereof, belonging to the technical field of molecular biology and metabolic engineering. The gene is a plant TA family gene, and the TA family gene is any one of the following sequences: (1) the CsTA gene with the nucleotide sequence shown as SEQ ID NO.1 or the MrTA gene with the nucleotide sequence shown as SEQ ID NO.2; (2) a nucleotide sequence in which one or more nucleotides are substituted, deleted and/or added in the nucleotide sequence of (1) and which expresses the same functional protein; (3) a nucleotide sequence which has more than 90% homology with the nucleotide sequence of any one of (1) and (2) and expresses the same functional protein. The invention provides the genes CsTA and MrTA for catalyzing the biosynthesis of gallic acid methyl ester, which were cloned from tea plant and waxberry, respectively, and functionally verified, and also provides recombinant plasmids, transgenic engineering bacteria and recombinant proteins containing the genes CsTA and MrTA respectively.
Description
Technical Field
The invention relates to the fields of molecular biology and metabolic engineering, in particular to a gene for efficiently catalyzing biosynthesis of gallic acid methyl ester and application thereof.
Background
Methyl gallate is a gallic acid derivative found in many plants and has strong antifungal activity against the pathogens of two major tea plant diseases, anthracnose and wheel spot. Medically, some phenolic compounds, including methyl gallate, have been found to be most effective against the urinary tract pathogens Proteus mirabilis and Staphylococcus saprophyticus, which provides a scientific basis for the treatment of gastric, skin, respiratory and urinary tract diseases in traditional medicine. In addition, methyl gallate is a main anti-HIV-1 compound in the bark of the pith and has potential for the development of novel HIV-1 inhibitors. Methyl gallate, as an important flavonoid substance, can induce plant defense reactions in tobacco and improve plant immunity; deliberately stimulating plant immunity in this way offers a new idea for green agriculture and for crop disease management and control.
However, methyl gallate has so far been synthesized only by chemical methods, which involve a complicated process, long reaction times, many kinds of raw materials and the generation of environmentally polluting chemicals. The low content of methyl gallate in plants, together with difficult and inefficient purification, also limits access to the compound, and no gallic acid-related methyltransferase has been reported in plants. Therefore, the invention adopts a biosynthetic approach and obtains two proteins that can efficiently catalyze the synthesis of methyl gallate, laying a foundation for its commercial production.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide a gene for efficiently catalyzing biosynthesis of gallic acid methyl ester and application thereof.
The invention is realized by adopting the following technical scheme:
the invention provides a gene for efficiently catalyzing biosynthesis of gallic acid methyl ester, and the gene is a plant TA family gene.
The further improvement is that the gene is CsTA or MrTA or the combination of the two, and the TA family gene is any one of the following sequences:
(1) CsTA gene with a nucleotide sequence shown as SEQ ID NO.1 or MrTA gene with a nucleotide sequence shown as SEQ ID NO. 2;
(2) a nucleotide sequence in which one or more nucleotides are substituted, deleted and/or added in the nucleotide sequence in (1) and the same functional protein is expressed;
(3) a nucleotide sequence which has more than 90% of homology with the nucleotide sequence of any one of (1) and (2) and expresses the same functional protein.
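The 90% homology criterion in item (3) can be illustrated with a minimal sketch. The text does not state how homology is computed, so the following assumes simple percent identity over two sequences that have already been aligned to equal length (gaps written as "-"); the function names and the toy sequences are hypothetical and are not SEQ ID NO.1 or SEQ ID NO.2.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns (assumed definition of homology)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as a match only if both bases are identical and not a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def exceeds_homology(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if identity is strictly above the threshold ("more than 90%")."""
    return percent_identity(aligned_a, aligned_b) > threshold


# Toy example: 16 aligned positions with a single substitution.
reference = "ATGGATTCAATAGCCC"
variant = "ATGGATTCTATAGCCC"
print(percent_identity(reference, variant), exceeds_homology(reference, variant))
```

In practice the alignment itself would come from a dedicated alignment tool; this sketch only covers the comparison step under the stated assumption.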
A further improvement is that the genes CsTA and MrTA are isolated and cloned from fresh tea leaves and fresh waxberry leaves, respectively.
The further improvement is that the amino acid sequences of the coding proteins of the gene CsTA and the gene MrTA are respectively shown as SEQ ID NO.3 and SEQ ID NO. 4.
The invention also provides an expression cassette containing the gene for efficiently catalyzing the biosynthesis of the gallic acid methyl ester.
The invention also provides a recombinant plant expression vector, which comprises the recombinant vector pRSFDuet-1-CsTA or pRSFDuet-1-MrTA, or a combination of the two, obtained by recombining the gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester into the pRSFDuet-1 vector.
The invention also provides an engineering bacterium, which is characterized in that the engineering bacterium is an escherichia coli BL21(DE3) strain containing the recombinant vector pRSFDuet-1-CsTA or pRSFDuet-1-MrTA.
The invention also provides an application of the gene in catalyzing biosynthesis of gallic acid methyl ester.
The invention also provides a biosynthesis method for gallic acid methyl ester, characterized in that the co-expressed recombinant protein of the gene combination, or either recombinant protein CsTA or MrTA separated and purified from the engineering bacteria, is added into a reaction system containing EGCG or PGG with methanol added as a substrate, and gallic acid methyl ester is biosynthesized through the enzyme-catalyzed reaction.
The invention has the beneficial effects that:
The invention provides two genes, CsTA and MrTA, for efficiently catalyzing the biosynthesis of gallic acid methyl ester, together with their encoded proteins and applications; the genes were cloned from tea plant and waxberry, respectively, and functionally verified. The invention provides two efficient and safe technologies for the biosynthesis of gallic acid methyl ester, determines the optimal substrate addition amount and the optimal enzyme reaction conditions by bioengineering methods, and provides a foundation for the commercial production of gallic acid methyl ester. Gallic acid methyl ester is an important secondary metabolite with antifungal, anti-HIV-1, anti-inflammatory and antioxidant properties and the ability to enhance plant defense; it has been widely accepted and used in various fields of scientific research, and the invention lays a solid foundation for the biosynthesis of medicinal compounds.
Drawings
FIG. 1 is an SDS-PAGE protein electrophoretic analysis diagram of CsTA and MrTA recombinant proteins in an example of the present invention; wherein, M is a protein Marker; the two bands are the purified CsTA and MrTA recombinant proteins, respectively.
FIG. 2 is a result diagram of HPLC analysis of the enzymatic activity products catalyzed by CsTA and MrTA recombinant proteins using EGCG as a substrate in the embodiment of the present invention.
FIG. 3 is a result diagram of HPLC analysis of the enzymatic activity products catalyzed by CsTA and MrTA recombinant proteins using PGG as a substrate in the embodiment of the present invention.
FIG. 4 is a graph showing the pH gradient of the optimal reaction of the recombinant proteins CsTA and MrTA under different buffer conditions in the examples of the present invention.
FIG. 5 is a graph showing the optimal reaction temperature gradient of each of the recombinant proteins CsTA and MrTA under the optimal pH condition in the example of the present invention.
FIG. 6 is a graph showing the reaction time gradient when the recombinant proteins CsTA and MrTA of the present invention each use EGCG as a substrate.
FIG. 7 is a graph showing the reaction time gradient when the recombinant proteins CsTA and MrTA of the present invention each use PGG as a substrate.
FIG. 8 is a flow chart of the method for producing gallic acid methyl ester by catalyzing EGCG or PGG and methanol reaction with recombinant proteins CsTA and MrTA in vitro through enzyme reaction in the embodiment of the present invention.
FIG. 9 is a structural map of recombinant expression vector pRSFDuet-1 in the example of the present invention.
Detailed Description
In order to make the technical means, the creation characteristics, the achievement purposes and the effects of the invention easy to understand, the invention is further explained below by combining the specific drawings.
1. Material
(1) Plant material: fresh tea leaves and fresh waxberry leaves were collected from Shucha (Camellia sinensis (L.) O. Kuntze var. sinensis cultivar Shucha) and waxberry (Myrica rubra (Lour.) S. et Zucc.), immediately frozen in liquid nitrogen and stored in a refrigerator at -80 ℃ for later use.
(2) Competent E. coli cells: cloning strain DH5α and expression host strain BL21(DE3).
(3) LB culture medium: weigh 5 g of yeast extract, 10 g of tryptone and 10 g of sodium chloride, add 950 mL of pure water, sonicate until fully dissolved, adjust the pH to 7.0 with 1 mol/L NaOH solution, bring the volume to 1 L with water, and autoclave to obtain the liquid medium. For solid medium, add 3 g of agar powder per 200 mL of LB liquid medium, mix, and sterilize with high-pressure steam at 121 ℃ for 15 min.
(4) Kanamycin mother liquor (50 mg/mL): 0.5g kanamycin is weighed, dissolved in 10mL of sterilized water, filtered and sterilized, and then subpackaged into small tubes to be preserved at-20 ℃.
(5) Isopropyl β-D-thiogalactoside (IPTG, 1 mol/L): 2.383 g of the powder was weighed and dissolved in 10 mL of sterile water, filter-sterilized, then aliquoted into small tubes and stored at -20 ℃.
(6) Protein purification buffer (0.1 mol/L phosphate buffer): weigh 17.907 g of Na2HPO4·12H2O and 2.925 g of NaCl and bring to 500 mL with pure water (solution I); weigh 3.1202 g of NaH2PO4·2H2O and 1.17 g of NaCl and bring to 200 mL with pure water (solution II). Adjust solution I to pH 7.4 with solution II to obtain the loading buffer; add 1.02 g of imidazole per 100 mL of loading buffer to obtain the elution buffer.
Unless otherwise indicated, the examples were carried out under conventional experimental conditions or according to the manufacturer's instructions. Reagents and instruments for which no manufacturer is indicated are conventional, commercially available products.
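The stock solution recipes above follow from mass = molar mass × molarity × volume. The sketch below is only an illustrative check: the molar masses are standard textbook values assumed here (they are not given in the text), and the helper names are hypothetical.

```python
# Molar masses in g/mol: assumed standard values, not taken from the patent text.
MOLAR_MASS = {
    "Na2HPO4.12H2O": 358.14,
    "NaH2PO4.2H2O": 156.01,
    "NaCl": 58.44,
    "IPTG": 238.30,
    "imidazole": 68.08,
}


def grams_needed(compound: str, molarity: float, volume_ml: float) -> float:
    """Mass (g) required for a solution of the given molarity (mol/L) and volume (mL)."""
    return MOLAR_MASS[compound] * molarity * volume_ml / 1000.0


def molarity_from_mass(compound: str, grams: float, volume_ml: float) -> float:
    """Molarity (mol/L) obtained when `grams` of compound are dissolved to `volume_ml`."""
    return grams / MOLAR_MASS[compound] / (volume_ml / 1000.0)


# Recipes from the materials list, recomputed under the assumed molar masses.
print(grams_needed("Na2HPO4.12H2O", 0.1, 500))  # solution I phosphate salt
print(grams_needed("NaH2PO4.2H2O", 0.1, 200))   # solution II phosphate salt
print(grams_needed("IPTG", 1.0, 10))            # IPTG stock
print(molarity_from_mass("imidazole", 1.02, 100))  # imidazole in elution buffer
```

Under these assumed molar masses, 1.02 g of imidazole per 100 mL corresponds to roughly 0.15 mol/L in the elution buffer.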
2. Methods
2.1 cloning and expression of Gene combinations for concerted catalysis of methyl gallate biosynthesis
2.1.1 cloning of CsTA and MrTA genes
(1) Based on the open reading frame sequences of the two genes CsTA and MrTA, specific primers carrying the BamH I and Pst I sites of the multiple cloning site of the expression vector pRSFDuet-1 were designed.
The primer sequences are shown below:
CsTA forward primer:
5’-CATCACCATCATCACCACAGCCAGGATCCGATGGATTCAATAGCCC-3’;
CsTA reverse primer:
5’-GCCGCAAGCTTGTCGACCTGCAGTCAATTTTCATTTACG-3’;
MrTA forward primer:
5’-CATCACCATCATCACCACAGCCAGGATCCGATGGCGTCAAGCACTGG-3’;
MrTA reverse primer:
5’-GCCGCAAGCTTGTCGACCTGCAGTCAAGTTATG-3’;
(2) Following the instructions of the polysaccharide and polyphenol total RNA extraction kit, total RNA was extracted separately from fresh Shucha early leaves and fresh waxberry leaves, and reverse-transcribed with reverse transcriptase to obtain a tea plant cDNA template and a waxberry cDNA template.
(3) Respectively taking tea tree cDNA and waxberry cDNA as templates, and respectively amplifying by using a CsTA primer and an MrTA primer, wherein the amplification procedure is as follows: denaturation at 98 ℃ for 10s, annealing at 62 ℃ for 15s, extension at 72 ℃ for 30s, 30 cycles, and extension at 72 ℃ for 6min to obtain a PCR product.
(4) The PCR product was purified using the DNA purification recovery kit, and the product was finally ligated to expression plasmid pRSFDuet-1 by one-step cloning to obtain pRSFDuet-1-CsTA and pRSFDuet-1-MrTA recombinant plasmids, and the structural maps thereof are shown in FIG. 9.
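The primer design in step (1) can be sanity-checked in a few lines. The sketch below is illustrative and not part of the described method: it confirms that each forward primer carries the BamH I site (GGATCC) and each reverse primer the Pst I site (CTGCAG), and that the gene-specific 3' portion of each reverse primer is the reverse complement of the ORF 3' end. The ORF fragments are copied from the ends of SEQ ID NO.1 and SEQ ID NO.2 in the sequence listing; the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
SITES = {"BamHI": "GGATCC", "PstI": "CTGCAG"}

PRIMERS = {
    "CsTA_F": "CATCACCATCATCACCACAGCCAGGATCCGATGGATTCAATAGCCC",
    "CsTA_R": "GCCGCAAGCTTGTCGACCTGCAGTCAATTTTCATTTACG",
    "MrTA_F": "CATCACCATCATCACCACAGCCAGGATCCGATGGCGTCAAGCACTGG",
    "MrTA_R": "GCCGCAAGCTTGTCGACCTGCAGTCAAGTTATG",
}

# Last bases of each ORF (including the stop codon), from the sequence listing.
ORF_3P_END = {"CsTA": "CGTAAATGAAAATTGA", "MrTA": "CATAACTTGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gene_specific_part(primer, site):
    """Return the part of the primer that lies 3' of the restriction site."""
    return primer[primer.index(site) + len(site):]


def check_pair(gene):
    """Check restriction sites and the reverse primer's match to the ORF end."""
    fwd, rev = PRIMERS[gene + "_F"], PRIMERS[gene + "_R"]
    has_sites = SITES["BamHI"] in fwd and SITES["PstI"] in rev
    tail = reverse_complement(gene_specific_part(rev, SITES["PstI"]))
    return has_sites and tail == ORF_3P_END[gene]


for gene in ("CsTA", "MrTA"):
    print(gene, "primer pair consistent:", check_pair(gene))
```

The bases 5' of each restriction site are presumably the vector-matching overhangs needed for one-step cloning; that reading is an inference and is not checked here.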
2.1.2 Prokaryotic expression and protein purification of the CsTA and MrTA genes
The prokaryotic expression and functional verification techniques used in this embodiment are those commonly used or fully understood by those skilled in the art.
(1) The recombinant plasmids pRSFDuet-1-CsTA and pRSFDuet-1-MrTA were each transformed into the Escherichia coli BL21(DE3) expression host; 200 μL of LB liquid medium was added and the cells were cultured at 37 °C and 180 r/min for 60 min, then spread on LB plates containing 50 μg/mL kanamycin (Kan) and incubated inverted at 37 °C;
(2) positive colonies verified by PCR were picked, inoculated into 100 mL of sterilized LB liquid medium, and grown to OD600 ≈ 0.6;
(3) 90 μL of IPTG was added to the expanded culture, which was then incubated at 16 °C for 24 h. The cells were collected, fully resuspended in 10-15 mL of loading buffer, disrupted by sonication at 25% power for 30 min, and centrifuged at 12000 rpm to collect the supernatant. The His-tagged recombinant proteins were purified on an amylose resin affinity column (affinity chromatography on amylose resin, New England Biolabs, MA, USA), and protein expression and purification were checked by SDS-PAGE as commonly used in the art; the results are shown in FIG. 1. The two relatively pure protein bands were MrTA and CsTA, respectively, and the purified proteins were used for further enzymatic analysis.
2.2 Verification of the enzyme activity of the CsTA and MrTA co-expression recombinant proteins
2.2.1 Detection of the enzymatic activity of the CsTA and MrTA proteins and determination of the optimal condition ranges
(1) 50 μL enzyme activity reaction system: 50 mM phosphate buffer (pH 7.0), 1-3 μg of purified CsTA or MrTA protein, 0.8 mM EGCG or 0.4 mM PGG, and 2 M methanol. The reaction was run in a water bath at 40-45 °C for 5 min; an equal volume of methanol was then added, and the mixture was shaken to denature the enzyme, centrifuged at 13000 g for 20 min, and analyzed by ultra-high-performance liquid chromatography and mass spectrometry. The synthesis process with EGCG as the substrate is shown in the upper panel of FIG. 8, and that with PGG as the substrate in the lower panel of FIG. 8.
The enzyme reaction products were detected as follows: Agilent HPLC system; mobile phase A, 1% aqueous acetic acid; mobile phase B, pure acetonitrile; flow rate, 0.4 mL/min; column, Poroshell HPH-C18 (2.7 μm, 4.6 × 100 mm); detection wavelength, 280 nm; injection volume, 5 μL. Gradient elution: 0-5 min, phase B increased from 1% to 10%; 5-20 min, phase B increased from 10% to 35%; 20-21 min, phase B decreased from 35% to 10%; 21-23 min, phase B decreased from 10% to 1%; 23-25 min, phase B held at 1% to re-equilibrate the column. The enzyme reaction products were identified from the retention times and maximum ultraviolet absorption peaks of the standards.
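The gradient program above is a piecewise-linear function of time. The sketch below encodes the breakpoints from the text and interpolates the percentage of phase B at any time point. It assumes linear ramps between breakpoints, which is the usual meaning of such a program but is not stated explicitly in the source; the function name is hypothetical.

```python
# (time in min, % phase B) breakpoints taken from the gradient program above.
GRADIENT = [(0, 1), (5, 10), (20, 35), (21, 10), (23, 1), (25, 1)]
FLOW_ML_PER_MIN = 0.4


def percent_b(t_min):
    """Linearly interpolate the percentage of phase B at time t_min."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time outside the 0-25 min program")
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


# Total mobile phase consumed per injection over the whole program.
run_volume_ml = FLOW_ML_PER_MIN * GRADIENT[-1][0]

for t in (0, 2.5, 12.5, 20.5, 24):
    print(f"{t:>5} min: {percent_b(t):.1f}% B")
print(f"solvent per run: {run_volume_ml:.1f} mL")
```

At 0.4 mL/min the 25 min program uses about 10 mL of mobile phase per injection.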
For mass spectrometry, the chromatographic method was the same as above. Mass spectrometry conditions: electrospray ionization, negative ion mode, acquisition of compounds with mass-to-charge ratios of 100-1700. Product peaks were identified from the standards and characteristic fragment ions; the results are shown in FIG. 2 and FIG. 3. With EGCG and methanol, or PGG and methanol, as substrates, no product was formed when boiled protein was added to the reaction system. When CsTA or MrTA protein was added, the substrate EGCG was almost completely consumed, and the products methyl gallate, gallic acid (GA), and epigallocatechin (EGC) or tetragalloyl glucose (TeGG) were formed. The retention time and mass spectrum of the methyl gallate product were fully consistent with those of the standard, confirming that the substance is methyl gallate.
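For orientation, the deprotonated ions expected in negative-ion electrospray can be estimated from the molecular formulas of the compounds named above. The sketch below is an illustration, not data from the source: the formulas and monoisotopic atomic masses are standard reference values, and the calculation simply confirms that each [M-H]- ion falls inside the 100-1700 acquisition window.

```python
import re

# Monoisotopic atomic masses (u) and the proton mass; standard reference values.
MONO = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}
PROTON = 1.00727647

# Molecular formulas (standard chemistry, not given in the source).
FORMULAS = {
    "methyl gallate": "C8H8O5",
    "gallic acid (GA)": "C7H6O5",
    "EGCG": "C22H18O11",
    "EGC": "C15H14O7",
    "PGG": "C41H32O26",
    "TeGG": "C34H28O22",
}


def monoisotopic_mass(formula):
    """Sum the monoisotopic masses of the atoms in a simple formula string."""
    atoms = re.findall(r"([A-Z][a-z]?)(\d*)", formula)
    return sum(MONO[element] * int(count or 1) for element, count in atoms)


def mz_deprotonated(formula):
    """m/z of the singly charged [M-H]- ion."""
    return monoisotopic_mass(formula) - PROTON


MZ = {name: mz_deprotonated(f) for name, f in FORMULAS.items()}
for name, mz in MZ.items():
    in_window = 100 <= mz <= 1700
    print(f"{name}: [M-H]- = {mz:.4f} (in 100-1700 window: {in_window})")
```

Methyl gallate, for instance, is expected near m/z 183.03, which differs from gallic acid (near m/z 169.01) by one CH2 unit.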
(2) Determining the optimal reaction pH range: to exclude the influence of buffer differences on enzyme activity, four buffers with different buffering ranges were selected to find the optimal buffer and pH range for each of the two proteins: sodium citrate buffer (pH 4.0-6.0), phosphate buffer (pH 6.0-8.0), Tris-HCl buffer (pH 7.0-10.5), and glycine-NaOH buffer (pH 9.0-11.0). Each reaction contained 1-3 μg of purified CsTA or MrTA protein, 0.8 mM EGCG or 0.4 mM PGG, and 2 M methanol, and was run in a water bath at 40-45 °C for 3-5 min. An equal volume of methanol was then added, the mixture was shaken to denature the enzyme and centrifuged at 13000 g for 20 min, and peak areas were measured by ultra-high-performance liquid chromatography. The results are shown in FIG. 4; the optimal pH range for both proteins is 8.0-9.0.
(3) Determining the optimal reaction temperature range: to maximize the efficiency of the enzyme reaction, reactions were run at 10, 20, 30, 35, 40, 45, 50, 60, and 70 °C. The results are shown in FIG. 5; the optimal temperature range for both proteins is 45-50 °C.
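The reaction mixtures in steps (1) to (3) are specified by final concentrations in a 50 μL volume. The sketch below converts these into absolute amounts and estimates the volume of neat methanol needed. The methanol density and molar mass are standard values and not from the source, and the function name is hypothetical.

```python
REACTION_VOLUME_UL = 50.0
METHANOL_DENSITY_G_PER_ML = 0.791  # approximate value at 20 °C (assumption)
METHANOL_MOLAR_MASS = 32.04        # g/mol


def amount_nmol(conc_mol_per_l, volume_ul):
    """Amount of solute in nmol: mol/L x uL gives umol, x 1000 gives nmol."""
    return conc_mol_per_l * volume_ul * 1000.0


egcg_nmol = amount_nmol(0.8e-3, REACTION_VOLUME_UL)
pgg_nmol = amount_nmol(0.4e-3, REACTION_VOLUME_UL)
methanol_nmol = amount_nmol(2.0, REACTION_VOLUME_UL)

# Neat methanol is about 24.7 mol/L, so 2 M needs only a few microlitres.
neat_methanol_molar = METHANOL_DENSITY_G_PER_ML * 1000.0 / METHANOL_MOLAR_MASS
methanol_volume_ul = 2.0 * REACTION_VOLUME_UL / neat_methanol_molar

print(f"EGCG: {egcg_nmol:.0f} nmol, PGG: {pgg_nmol:.0f} nmol")
print(f"methanol: {methanol_nmol / 1000:.0f} umol, about {methanol_volume_ul:.1f} uL neat")
print(f"methanol:EGCG molar ratio = {methanol_nmol / egcg_nmol:.0f}")
```

Methanol is therefore present in roughly 2500-fold molar excess over EGCG and 5000-fold over PGG, so the galloyl donor rather than methanol is the limiting substrate in these reactions.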
2.2.2 Substrate conversion assay for the CsTA and MrTA recombinant proteins
Under the optimal conditions, in vitro enzyme activity experiments were run over a series of reaction times to determine the conversion of substrate by the two proteins. The substrate and product curves in FIG. 6 show that, with EGCG as the substrate, 1 μg of CsTA protein reached a substrate conversion rate of more than 90% within 8 min of reaction, and 1 μg of MrTA protein reached more than 90% within 5 min.
The substrate and product curves in FIG. 7 show that, with PGG as the substrate, 3 μg of CsTA protein reached a substrate conversion rate of more than 200% after 15 min of reaction, and 3 μg of MrTA protein reached more than 300% after 15 min. This provides a basis for the subsequent production of gallic acid methyl ester by enzyme engineering.
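The source does not define how the conversion rate is calculated. One reading that would allow values above 100% for PGG is to express the galloyl products released as a percentage of the initial moles of substrate, since PGG carries five galloyl groups while EGCG carries one. The sketch below illustrates that reading only; the definition is an assumption and the input amounts are hypothetical, not measured data.

```python
# Galloyl groups per substrate molecule (standard structures).
GALLOYL_GROUPS = {"EGCG": 1, "PGG": 5}


def conversion_percent(product_nmol, initial_substrate_nmol):
    """Galloyl products formed, as a percentage of initial substrate (assumed definition)."""
    return 100.0 * product_nmol / initial_substrate_nmol


def max_conversion_percent(substrate):
    """Upper bound if every galloyl group of the substrate were released."""
    return 100.0 * GALLOYL_GROUPS[substrate]


# Hypothetical example: 20 nmol PGG yielding 45 nmol of galloyl products.
example = conversion_percent(product_nmol=45.0, initial_substrate_nmol=20.0)
print(f"example conversion: {example:.0f}%")
print(f"ceiling for PGG: {max_conversion_percent('PGG'):.0f}%")
print(f"ceiling for EGCG: {max_conversion_percent('EGCG'):.0f}%")
```

Under this reading, the reported values above 200% and 300% for PGG would mean that on average more than two or three galloyl groups per PGG molecule were released within 15 min.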
The invention has been described in detail above by way of a general description and specific embodiments, but it will be apparent to those skilled in the art that modifications and improvements can be made on the basis of the invention. Those skilled in the art will understand that the invention is not limited to the embodiments described above, which serve only to illustrate its principle; various changes and modifications may be made without departing from the spirit and scope of the invention, and these fall within the scope of the invention as claimed.
Sequence listing
<110> Anhui Agricultural University
<120> gene for efficiently catalyzing biosynthesis of gallic acid methyl ester and application thereof
<141> 2021-12-27
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 724
<212> DNA
<213> tea (Camellia sinensis)
<400> 1
caagatcaac ggaccagatc acaagctccc cctcctcgtt cactaccacg gtggaggctt 60
ctgcatggga tcctccctcg acaccgtcac tctcagattc ctcacctccc tcgcctccca 120
agcccacttg atcgctatct ccgttgacta caggctcgcc ccggagcacc cattacccat 180
cgcgtatgaa gactcctggt ccgcgttgca gtggatcgct acccactcta acggccaagg 240
acccgatccc tggctaaacc agtacgcgga tttcggtcgg gttttcctgg cgggagagag 300
tgccggggca aatatagccc accaagtggc ggttcgggtt ggcactgtcg gattggaggg 360
tttcatgcca cgtggagtga ttataattca cccctatttt gcgggtagtg aaccggacaa 420
gatgatccag tatttgtatc cggggagtag tgggtcggag gatgacccga atttgagccc 480
caaggaggat ccgaatttga cgaagatggg ttgctccaag gtgattgtgt tcgtggcaga 540
gaaggatcgg ttgaaaccga gaggcgtgga ttactatgag acattgaaaa atagtggttg 600
ggaaggtagg gtggaatttg tggaggataa aggggaggac cactgtttcc atatgtttag 660
tcccaatagt gataaggttg tgggtttgat gcaaaagttg ggtactttcg taaatgaaaa 720
ttga 724
<210> 2
<211> 906
<212> DNA
<213> waxberry (Myrica rubra)
<400> 2
atggcgtcaa gcactggtga gataacccat gatttcccac ctttcttcaa agtatacaaa 60
gatggccgca tagagaggta catgacccat gacccagtcc ccgcgaggct agacccaagc 120
acgggaatac aatccaaaga cgtggtgatc tcgccggaag ccggtgtgtc ggcccgaatc 180
ttcatcccca aaatcaacgg cccggatcaa aagctgccgc taatcgtcca ctaccacggg 240
ggaggcttct gcgttggatc tccgttcgat acgatttccg agagatttct cacatctttg 300
gtctcccaag cgaacgtaat agttgtctct gtcgactaca ggttagcccc agagcaccca 360
ctaccgattg catatgaaga ctcctgggcc gcgctgcagt ggattgcgtc tcattctaac 420
gggcaaggac ccgaaccgag tctcaacgag tacgtggatt tcggtcgggt tttcttgttg 480
ggcgagagcg ctggcgccaa catagccaac tacgtggcag tccaagcagg tgcgattgga 540
ttgcctggcc tgaagatcgt tggggcactt atagtgcacc ccttctttgg gagcaaggag 600
cgggatgaaa tgtacacgtt tctgtgtccc acgagctcag ggtgtgagga tgacccgaaa 660
ctgaacccgg cggttgatcc gaatttgtcg accatggcgt gcgcgaaggt cctggtatgt 720
gtggcggaga aagacgggct gaaagataga ggggtggcct actgcgagac tttacgtaag 780
agcggttggg gcggttctgt ggaattattc gaaaccaaag gagaagacca ctgctttcat 840
atgttcggcg aaggggaaaa cgtcgtgtcg ttgacgaaaa agtttgtcga ctttatcata 900
acttga 906
<210> 3
<211> 299
<212> PRT
<213> tea (Camellia sinensis)
<400> 3
Met Asp Ser Ile Ala His Asp Phe Pro Pro Phe Phe Arg Val His Lys
1 5 10 15
Asp Gly Arg Val Glu Arg Phe Met Val Ser Asp Tyr Val Pro Pro Ala
20 25 30
Val Asp Pro Lys Thr Gly Val Glu Phe Lys Asp Thr Leu Ile Ser Pro
35 40 45
Glu Thr Gly Val Lys Ala Arg Ile Phe Leu Pro Lys Ile Asn Gly Pro
50 55 60
Asp His Lys Leu Pro Leu Leu Val His Tyr His Gly Gly Gly Phe Cys
65 70 75 80
Met Gly Ser Ser Leu Asp Thr Val Thr Leu Arg Phe Leu Thr Ser Leu
85 90 95
Ala Ser Gln Ala His Leu Ile Ala Ile Ser Val Asp Tyr Arg Leu Ala
100 105 110
Pro Glu His Pro Leu Pro Ile Ala Tyr Glu Asp Ser Trp Ser Ala Leu
115 120 125
Gln Trp Ile Ala Thr His Ser Asn Gly Gln Gly Pro Asp Pro Trp Leu
130 135 140
Asn Gln Tyr Ala Asp Phe Gly Arg Val Phe Leu Ala Gly Glu Ser Ala
145 150 155 160
Gly Ala Asn Ile Ala His Gln Val Ala Val Arg Val Gly Thr Val Gly
165 170 175
Leu Glu Gly Phe Met Pro Arg Gly Val Ile Ile Ile His Pro Tyr Phe
180 185 190
Ala Gly Ser Glu Pro Asp Lys Met Ile Gln Tyr Leu Tyr Pro Gly Ser
195 200 205
Ser Gly Ser Glu Asp Asp Pro Asn Leu Ser Pro Lys Glu Asp Pro Asn
210 215 220
Leu Thr Lys Met Gly Cys Ser Lys Val Ile Val Phe Val Ala Glu Lys
225 230 235 240
Asp Arg Leu Lys Pro Arg Gly Val Asp Tyr Tyr Glu Thr Leu Lys Asn
245 250 255
Ser Gly Trp Glu Gly Arg Val Glu Phe Val Glu Asp Lys Gly Glu Asp
260 265 270
His Cys Phe His Met Phe Ser Pro Asn Ser Asp Lys Val Val Gly Leu
275 280 285
Met Gln Lys Leu Gly Thr Phe Val Asn Glu Asn
290 295
<210> 4
<211> 301
<212> PRT
<213> waxberry (Myrica rubra)
<400> 4
Met Ala Ser Ser Thr Gly Glu Ile Thr His Asp Phe Pro Pro Phe Phe
1 5 10 15
Lys Val Tyr Lys Asp Gly Arg Ile Glu Arg Tyr Met Thr His Asp Pro
20 25 30
Val Pro Ala Arg Leu Asp Pro Ser Thr Gly Ile Gln Ser Lys Asp Val
35 40 45
Val Ile Ser Pro Glu Ala Gly Val Ser Ala Arg Ile Phe Ile Pro Lys
50 55 60
Ile Asn Gly Pro Asp Gln Lys Leu Pro Leu Ile Val His Tyr His Gly
65 70 75 80
Gly Gly Phe Cys Val Gly Ser Pro Phe Asp Thr Ile Ser Glu Arg Phe
85 90 95
Leu Thr Ser Leu Val Ser Gln Ala Asn Val Ile Val Val Ser Val Asp
100 105 110
Tyr Arg Leu Ala Pro Glu His Pro Leu Pro Ile Ala Tyr Glu Asp Ser
115 120 125
Trp Ala Ala Leu Gln Trp Ile Ala Ser His Ser Asn Gly Gln Gly Pro
130 135 140
Glu Pro Ser Leu Asn Glu Tyr Val Asp Phe Gly Arg Val Phe Leu Leu
145 150 155 160
Gly Glu Ser Ala Gly Ala Asn Ile Ala Asn Tyr Val Ala Val Gln Ala
165 170 175
Gly Ala Ile Gly Leu Pro Gly Leu Lys Ile Val Gly Ala Leu Ile Val
180 185 190
His Pro Phe Phe Gly Ser Lys Glu Arg Asp Glu Met Tyr Thr Phe Leu
195 200 205
Cys Pro Thr Ser Ser Gly Cys Glu Asp Asp Pro Lys Leu Asn Pro Ala
210 215 220
Val Asp Pro Asn Leu Ser Thr Met Ala Cys Ala Lys Val Leu Val Cys
225 230 235 240
Val Ala Glu Lys Asp Gly Leu Lys Asp Arg Gly Val Ala Tyr Cys Glu
245 250 255
Thr Leu Arg Lys Ser Gly Trp Gly Gly Ser Val Glu Leu Phe Glu Thr
260 265 270
Lys Gly Glu Asp His Cys Phe His Met Phe Gly Glu Gly Glu Asn Val
275 280 285
Val Ser Leu Thr Lys Lys Phe Val Asp Phe Ile Ile Thr
290 295 300
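The nucleotide and protein entries of the listing can be cross-checked by translation. The sketch below is an illustration only: it translates the first and last lines of the waxberry nucleotide sequence (SEQ ID NO.2) with the standard genetic code and compares the result with the ends of the MrTA protein (SEQ ID NO.4). The fragments are copied from the listing; the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Positions 1-60 and 841-906 of SEQ ID NO.2 (both fragments start in frame).
MRTA_START = "atggcgtcaa gcactggtga gataacccat gatttcccac ctttcttcaa agtatacaaa"
MRTA_END = ("atgttcggcg aaggggaaaa cgtcgtgtcg ttgacgaaaa agtttgtcga "
            "ctttatcata acttga")

n_terminus = translate(MRTA_START)
c_terminus = translate(MRTA_END)
expected_length = 906 // 3 - 1  # codons minus the stop codon

print("N-terminus:", n_terminus)
print("C-terminus:", c_terminus)
print("expected protein length:", expected_length)
```

The translated ends agree with the first 20 and last 21 residues of SEQ ID NO.4, and 906 nucleotides correspond to 301 residues plus a stop codon, matching the length given for that entry.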
Claims (9)
1. A gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester, characterized in that the gene is a plant TA family gene.
2. The gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester according to claim 1, wherein the TA family gene has any one of the following sequences:
(1) the CsTA gene, whose nucleotide sequence is shown in SEQ ID NO.1, or the MrTA gene, whose nucleotide sequence is shown in SEQ ID NO.2;
(2) a nucleotide sequence derived from the nucleotide sequence in (1) by substitution, deletion and/or addition of one or more nucleotides and expressing a protein with the same function;
(3) a nucleotide sequence having more than 90% homology with the nucleotide sequence of either (1) or (2) and expressing a protein with the same function.
3. The gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester according to claim 1, wherein the gene CsTA and the gene MrTA are isolated and cloned from fresh leaves of Camellia sinensis and fresh leaves of Myrica rubra, respectively.
4. The gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester according to claim 1, wherein the amino acid sequences of the proteins encoded by the gene CsTA and the gene MrTA are shown in SEQ ID NO.3 and SEQ ID NO.4, respectively.
5. An expression cassette comprising the gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester according to any one of claims 1 to 4.
6. A recombinant plant expression vector, characterized in that the vector comprises the recombinant vector pRSFDuet-1-CsTA or pRSFDuet-1-MrTA, or a combination thereof, obtained by recombining the gene for efficiently catalyzing the biosynthesis of gallic acid methyl ester according to any one of claims 1 to 4 into the pRSFDuet-1 vector.
7. An engineered bacterium, which is an Escherichia coli BL21(DE3) strain comprising the recombinant vector pRSFDuet-1-CsTA or pRSFDuet-1-MrTA of claim 6.
8. Use of the gene of any one of claims 1 to 4 in catalyzing the biosynthesis of gallic acid methyl ester.
9. A method for the biosynthesis of methyl gallate, characterized in that the recombinant protein CsTA or MrTA encoded by the gene of any one of claims 1 to 4 is added to a reaction system containing EGCG or PGG and methanol as substrates, and methyl gallate is biosynthesized by an enzymatic reaction.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111610349.3A CN114410665B (en) | 2021-12-27 | 2021-12-27 | Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114410665A true CN114410665A (en) | 2022-04-29 |
CN114410665B CN114410665B (en) | 2024-01-16 |
Family
ID=81270309
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111610349.3A Active CN114410665B (en) | 2021-12-27 | 2021-12-27 | Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114410665B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004087909A1 (en) * | 2003-03-28 | 2004-10-14 | Nihon University | Polynucleotide encoding 2-hydorxyisoflavanone dehydratase and application of the same |
CN101993861A (en) * | 2009-08-31 | 2011-03-30 | 同济大学 | Recombinant expression of carboxyl esterase |
CN102719413A (en) * | 2012-04-28 | 2012-10-10 | 中山大学 | Novel tannase and application thereof |
US20120301950A1 (en) * | 2009-03-11 | 2012-11-29 | Baynes Brian M | Biological synthesis of difunctional hexanes and pentanes from carbohydrate feedstocks |
CN104761627A (en) * | 2015-04-20 | 2015-07-08 | 吉林大学 | Application of GmbHLH transcription factor in promotion of synthesizing soy isoflavone |
Non-Patent Citations (4)
Title |
---|
DAI, X. ET AL.: "Camellia sinensis tannase mRNA, complete cds GenBank: MK381269.1", 《GENBANK》, pages 1 * |
JIA, H.M. ET AL.: "2-hydroxyisoflavanone dehydratase [Morella rubra] GenBank: KAB1207115.1", 《GENBANK》, pages 1 - 2 * |
XINLONG DAI ET AL.: "Discovery and characterization of tannase genes in plants: roles in hydrolysis of tannins", 《NEW PHYTOLOGIST》, vol. 226, pages 1104 - 1116 * |
YIFAN CHEN ET AL.: "New insights into the function of plant tannase with promiscuous acyltransferase activity", 《THE PLANT JOURNAL》, vol. 113, pages 576 - 594 * |
Also Published As
Publication number | Publication date |
---|---|
CN114410665B (en) | 2024-01-16 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||