CN114410665B - Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof - Google Patents
Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof Download PDFInfo
- Publication number
- CN114410665B CN114410665B CN202111610349.3A CN202111610349A CN114410665B CN 114410665 B CN114410665 B CN 114410665B CN 202111610349 A CN202111610349 A CN 202111610349A CN 114410665 B CN114410665 B CN 114410665B
- Authority
- CN
- China
- Prior art keywords
- gene
- mrta
- csta
- gly
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- FBSFWRHWHYMIOG-UHFFFAOYSA-N methyl 3,4,5-trihydroxybenzoate Chemical compound COC(=O)C1=CC(O)=C(O)C(O)=C1 FBSFWRHWHYMIOG-UHFFFAOYSA-N 0.000 title claims abstract description 70
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 51
- IBKQQKPQRYUGBJ-UHFFFAOYSA-N methyl gallate Natural products CC(=O)C1=CC(O)=C(O)C(O)=C1 IBKQQKPQRYUGBJ-UHFFFAOYSA-N 0.000 title claims abstract description 30
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 23
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims abstract description 16
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims abstract description 16
- 239000002773 nucleotide Substances 0.000 claims abstract description 15
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 15
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 24
- 239000000758 substrate Substances 0.000 claims description 17
- WMBWREPUVVBILR-UHFFFAOYSA-N GCG Natural products C=1C(O)=C(O)C(O)=CC=1C1OC2=CC(O)=CC(O)=C2CC1OC(=O)C1=CC(O)=C(O)C(O)=C1 WMBWREPUVVBILR-UHFFFAOYSA-N 0.000 claims description 12
- WMBWREPUVVBILR-WIYYLYMNSA-N (-)-Epigallocatechin-3-o-gallate Chemical compound O([C@@H]1CC2=C(O)C=C(C=C2O[C@@H]1C=1C=C(O)C(O)=C(O)C=1)O)C(=O)C1=CC(O)=C(O)C(O)=C1 WMBWREPUVVBILR-WIYYLYMNSA-N 0.000 claims description 11
- 102000004190 Enzymes Human genes 0.000 claims description 10
- 108090000790 Enzymes Proteins 0.000 claims description 10
- 238000006555 catalytic reaction Methods 0.000 claims description 4
- QJYNZEYHSMRWBK-NIKIMHBISA-N 1,2,3,4,6-pentakis-O-galloyl-beta-D-glucose Chemical compound OC1=C(O)C(O)=CC(C(=O)OC[C@@H]2[C@H]([C@H](OC(=O)C=3C=C(O)C(O)=C(O)C=3)[C@@H](OC(=O)C=3C=C(O)C(O)=C(O)C=3)[C@H](OC(=O)C=3C=C(O)C(O)=C(O)C=3)O2)OC(=O)C=2C=C(O)C(O)=C(O)C=2)=C1 QJYNZEYHSMRWBK-NIKIMHBISA-N 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 101150084890 cstA gene Proteins 0.000 abstract description 34
- 102000004169 proteins and genes Human genes 0.000 abstract description 19
- 241000196324 Embryophyta Species 0.000 abstract description 11
- 235000009024 Ceanothus sanguineus Nutrition 0.000 abstract description 5
- 240000003553 Leptospermum scoparium Species 0.000 abstract description 5
- 235000015459 Lycium barbarum Nutrition 0.000 abstract description 5
- 235000009134 Myrica cerifera Nutrition 0.000 abstract description 5
- 244000269152 Myrica pensylvanica Species 0.000 abstract description 5
- 241000894006 Bacteria Species 0.000 abstract description 4
- 235000007270 Gaultheria hispida Nutrition 0.000 abstract description 4
- 235000012851 Myrica pensylvanica Nutrition 0.000 abstract description 4
- 239000013612 plasmid Substances 0.000 abstract description 3
- 238000012269 metabolic engineering Methods 0.000 abstract description 2
- 230000009261 transgenic effect Effects 0.000 abstract 1
- 238000000034 method Methods 0.000 description 15
- 239000000047 product Substances 0.000 description 13
- 239000000243 solution Substances 0.000 description 10
- 239000000872 buffer Substances 0.000 description 8
- 238000006911 enzymatic reaction Methods 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- 239000007788 liquid Substances 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 235000008375 Decussocarpus nagi Nutrition 0.000 description 6
- 244000132436 Myrica rubra Species 0.000 description 6
- 235000014631 Myrica rubra Nutrition 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 235000013616 tea Nutrition 0.000 description 6
- 244000269722 Thea sinensis Species 0.000 description 5
- 150000001875 compounds Chemical class 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- LNTHITQWFMADLM-UHFFFAOYSA-N gallic acid Chemical compound OC(=O)C1=CC(O)=C(O)C(O)=C1 LNTHITQWFMADLM-UHFFFAOYSA-N 0.000 description 4
- 230000014509 gene expression Effects 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- XMOCLSLCDHWDHP-IUODEOHRSA-N epi-Gallocatechin Chemical compound C1([C@H]2OC3=CC(O)=CC(O)=C3C[C@H]2O)=CC(O)=C(O)C(O)=C1 XMOCLSLCDHWDHP-IUODEOHRSA-N 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 239000012160 loading buffer Substances 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000008363 phosphate buffer Substances 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 238000005303 weighing Methods 0.000 description 3
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- XMOCLSLCDHWDHP-UHFFFAOYSA-N L-Epigallocatechin Natural products OC1CC2=C(O)C=C(O)C=C2OC1C1=CC(O)=C(O)C(O)=C1 XMOCLSLCDHWDHP-UHFFFAOYSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000000843 anti-fungal effect Effects 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 244000052616 bacterial pathogen Species 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 230000007123 defense Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 229940074391 gallic acid Drugs 0.000 description 2
- 235000004515 gallic acid Nutrition 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 238000001819 mass spectrum Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- -1 methyl gallate Chemical class 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 230000001954 sterilising effect Effects 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- VLAFRQCSFRYCLC-FXQIFTODSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-2-aminopropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VLAFRQCSFRYCLC-FXQIFTODSA-N 0.000 description 1
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 241000722948 Apocynum cannabinum Species 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 235000015928 Hibiscus cannabinus Nutrition 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- TVTIDSMADMIHEU-KKUMJFAQSA-N His-Cys-Phe Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(O)=O TVTIDSMADMIHEU-KKUMJFAQSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- 241000588770 Proteus mirabilis Species 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 208000018569 Respiratory Tract disease Diseases 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- 241001147691 Staphylococcus saprophyticus Species 0.000 description 1
- 235000006468 Thea sinensis Nutrition 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- 208000012931 Urologic disease Diseases 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- RJINLRBSXMOGAQ-KMNHUKDVSA-N [(2r,3r,4s,5r)-6-hydroxy-3,4,5-tris[(3,4,5-trihydroxybenzoyl)oxy]oxan-2-yl]methyl 3,4,5-trihydroxybenzoate Chemical compound C([C@H]1OC([C@@H]([C@@H](OC(=O)C=2C=C(O)C(O)=C(O)C=2)[C@@H]1OC(=O)C=1C=C(O)C(O)=C(O)C=1)OC(=O)C=1C=C(O)C(O)=C(O)C=1)O)OC(=O)C1=CC(O)=C(O)C(O)=C1 RJINLRBSXMOGAQ-KMNHUKDVSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- PQLVXDKIJBQVDF-UHFFFAOYSA-N acetic acid;hydrate Chemical compound O.CC(O)=O PQLVXDKIJBQVDF-UHFFFAOYSA-N 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 235000005607 chanvre indien Nutrition 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- DZYNKLUGCOSVKS-UHFFFAOYSA-N epigallocatechin Natural products OC1Cc2cc(O)cc(O)c2OC1c3cc(O)c(O)c(O)c3 DZYNKLUGCOSVKS-UHFFFAOYSA-N 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 229930003935 flavonoid Natural products 0.000 description 1
- 150000002215 flavonoids Chemical class 0.000 description 1
- 235000017173 flavonoids Nutrition 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 239000007986 glycine-NaOH buffer Substances 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000012452 mother liquor Substances 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 235000013824 polyphenols Nutrition 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 208000017520 skin disease Diseases 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 208000018556 stomach disease Diseases 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 208000014001 urinary system disease Diseases 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1003—Transferases (2.) transferring one-carbon groups (2.1)
- C12N9/1007—Methyltransferases (general) (2.1.1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/62—Carboxylic acid esters
Abstract
The invention discloses a gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof, belonging to the technical field of molecular biology and metabolic engineering, wherein the gene is a plant TA family gene, and the TA family gene is any one of the following sequences: (1) A CsTA gene with a nucleotide sequence shown as SEQ ID NO.1 or an MrTA gene with a nucleotide sequence shown as SEQ ID NO. 2; (2) A nucleotide sequence having the nucleotide sequence of (1) substituted, deleted and/or added with one or more nucleotides and expressing the same functional protein; (3) A nucleotide sequence which has a homology of 90% or more with the nucleotide sequence of any one of (1) and (2) and expresses the same functional protein. The invention provides genes CsTA and MrTA for catalyzing biosynthesis of methyl gallate, which are cloned and verified from waxberry of tea trees, and recombinant plasmids, transgenic engineering bacteria and recombinant proteins respectively containing the CsTA and MrTA genes.
Description
Technical Field
The invention relates to the fields of molecular biology and metabolic engineering, in particular to a gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof.
Background
Gallic acid methyl ester is a gallized derivative in many plants, and has stronger antifungal activity on two main tea plant diseases, namely anthracnose pathogenic bacteria and round spot pathogenic bacteria; some phenolic compounds, including methyl gallate, have been found to be most effective against Proteus mirabilis and Staphylococcus saprophyticus pathogens in medicine, providing a scientific basis for the treatment of gastric, skin, respiratory and urinary tract diseases in traditional medicine; in addition, the gallic acid methyl ester is also a main anti-HIV-1 compound in the stem bark of the Indian hemp stem, and has the potential of developing novel HIV-1 inhibitors; the gallic acid methyl ester is used as an important flavonoid substance which can induce plant defense reaction in tobacco, and the compound can improve the immunity of plants, so that the deliberate stimulation of plant immunity can provide a new idea for green agriculture, crop disease management and crop disease control.
However, until now, the methyl gallate can only be synthesized by a chemical method, the procedures are complicated, the time is long, and the raw materials are used in various types and are accompanied with chemical substances which pollute the environment; the compound is limited to be obtained due to the low content of the methyl gallate in the plant body, high purification difficulty, low efficiency and other problems; gallic acid-related methyltransferases have not been reported in plants. Therefore, the invention adopts a biosynthesis mode to obtain two proteins, which can efficiently catalyze the synthesis of the methyl gallate, thereby laying a foundation for realizing the commercial production of the methyl gallate.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide a gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof.
The invention is realized by adopting the following technical scheme:
the invention provides a gene for catalyzing biosynthesis of methyl gallate with high efficiency, wherein the gene is a plant TA family gene.
The further improvement is that the gene is CsTA, or MrTA, or the combination of the gene and the gene, and the TA family gene is any one of the following sequences:
(1) A CsTA gene with a nucleotide sequence shown as SEQ ID NO.1 or an MrTA gene with a nucleotide sequence shown as SEQ ID NO. 2;
(2) A nucleotide sequence having the nucleotide sequence of (1) substituted, deleted and/or added with one or more nucleotides and expressing the same functional protein;
(3) A nucleotide sequence which has a homology of 90% or more with the nucleotide sequence of any one of (1) and (2) and expresses the same functional protein.
The further improvement is that the gene CsTA and the gene MrTA are isolated and cloned from fresh leaves of tea trees and waxberries.
The further improvement is that the amino acid sequences of the encoding proteins of the gene CsTA and the gene MrTA are respectively shown as SEQ ID NO.3 and SEQ ID NO. 4.
The invention also provides an expression cassette comprising the gene for efficiently catalyzing the biosynthesis of the methyl gallate.
The invention also provides a recombinant plant expression vector, which comprises a recombinant vector pRSFDuet-1-CsTA obtained by recombining the gene for efficiently catalyzing the biosynthesis of the methyl gallate onto the pRSFDuet-1 vector, or pRSFDuet-1-MrTA, or a combination of the two.
The invention also provides an engineering bacterium, which is characterized by comprising the recombinant vector pRSFDuet-1-CsTA or pRSFDuet-1-MrTA and being an escherichia coli BL21 (DE 3) strain.
The invention also provides application of the gene in catalyzing biosynthesis of methyl gallate.
The invention also provides a biosynthesis method of the methyl gallate, which is characterized in that the co-expression recombinant protein of the gene combination or any recombinant protein CsTA or MrTA separated and purified in the engineering bacteria combination is added into a reaction system containing EGCG or PGG and methanol as a substrate, and the methyl gallate is biosynthesized through an enzyme catalytic reaction.
The beneficial effects of the invention are as follows:
the invention provides two genes CsTA and MrTA which catalyze the biosynthesis of methyl gallate at high efficiency, encoding proteins and application thereof, and the CsTA and MrTA genes which catalyze the biosynthesis of methyl gallate are cloned and verified from waxberry of tea trees. The invention provides two high-efficiency and safe methyl gallate biosynthesis technologies, optimizes the optimal addition amount of a substrate for enzyme reaction and the optimal reaction condition of the enzyme by utilizing a bioengineering method, and provides a basis for realizing commercial production of methyl gallate. The gallic acid methyl ester is taken as an important secondary metabolic substance, has the characteristics of antifungal, anti-HIV-1, anti-inflammatory, antioxidant, plant defense enhancement and the like, has been widely accepted and utilized in various scientific research fields, and lays a solid foundation for biosynthesis of medicinal compounds.
Drawings
FIG. 1 is a diagram showing SDS-PAGE analysis of CsTA and MrTA recombinant proteins according to an embodiment of the present invention; wherein M is a protein Marker; the two bands are the purified CsTA and MrTA recombinant proteins, respectively.
FIG. 2 is a graph showing the results of HPLC analysis of the enzymatic activity product catalyzed by CsTA and MrTA recombinant proteins using EGCG as a substrate in the examples of the present invention.
FIG. 3 is a graph showing the results of HPLC analysis of the enzymatic activity product catalyzed by CsTA and MrTA recombinant proteins using PGG as a substrate in the examples of the present invention.
FIG. 4 is a graph showing the pH gradient of the optimal reaction of recombinant proteins CsTA and MrTA under different buffer conditions, respectively, in an example of the present invention.
FIG. 5 is a graph showing the optimal reaction temperature gradient of recombinant proteins CsTA and MrTA under optimal pH conditions, respectively, in an embodiment of the present invention.
FIG. 6 is a graph showing the time gradient of the reaction of recombinant proteins CsTA and MrTA each using EGCG as a substrate in the examples of the present invention.
FIG. 7 is a graph showing the time gradient of the reaction of recombinant proteins CsTA and MrTA using PGG as a substrate, respectively, according to the embodiment of the present invention.
FIG. 8 is a flow chart of the recombinant proteins CsTA and MrTA in vitro for catalyzing EGCG or PGG to react with methanol to generate methyl gallate by enzyme reaction in an embodiment of the invention.
FIG. 9 is a structural diagram of recombinant expression vector pRSFDuet-1 in the examples of the present invention.
Detailed Description
The invention is further described with reference to the following detailed drawings in order to make the technical means, the creation characteristics, the achievement of the purpose and the effect of the implementation of the invention easy to understand.
1. Material
(1) Plant material: early tea leaf comfort (l.) o.kuntze.var.sinenscultiva shochazao), fresh tea leaves and Yang Meixian leaves of bayberry (myricaubra) s.etzucc were collected, immediately frozen with liquid nitrogen and stored in a-80 ℃ refrigerator for later use.
(2) Competent E.coli cells DH 5. Alpha. Were cloned and host strain BL21 (DE 3) was expressed.
(3) LB medium: weighing 5g of yeast extract, 10g of tryptone, 10g of sodium chloride, adding 950mL of pure water, performing ultrasonic treatment until the yeast extract is fully dissolved, adjusting the pH to 7.0 by using 1mol/LNaOH solution, adding water to a constant volume of 1L, and performing high-pressure sterilization to obtain a liquid culture medium; LB solid medium, 3g agar powder is added into each 200mLLB liquid medium, and the mixture is uniformly mixed, and the mixture is sterilized by high-pressure steam at 121 ℃ for 15min.
(4) Kanamycin mother liquor (50 mg/mL): 0.5g of kanamycin is weighed, dissolved in 10mL of sterilized water, filtered, sterilized, packaged in small tubes and stored at-20 ℃.
(5) Isopropyl thio-beta-D-galactoside (IPTG, 1 mol/L) is prepared by weighing 2.383g of powder, dissolving in 10mL of sterilized water, filtering, sterilizing, sub-packaging into small tubes, and preserving at-20deg.C.
(6) Protein purification buffer (0.1 mol/L phosphate buffer): weighing 17.907g of Na 2 HPO 4 ·12H 2 O and 2.925g NaCl, and the volume was set to 500ml with pure water, which is solution I, 3.1202g NaH was weighed 2 PO 4 ·2H 2 O and 1.17g NaCl, the volume is fixed to 200ml by pure water, the solution is solution II, the pH value is adjusted to 7.4 by II, the finally obtained solution is a loading buffer solution, 1.02g imidazole is added into every 100ml loading buffer solution, and the obtained solution is an elution buffer solution.
Unless otherwise indicated, the examples were conducted under conventional experimental conditions or under conditions recommended by the manufacturer's instructions. The reagents or apparatus used were conventional products commercially available without the manufacturer's attention.
2. Method of
2.1 cloning and expression of Gene combinations that synergistically catalyze the biosynthesis of methyl gallate
2.1.1 Cloning of CsTA and MrTA genes
(1) Based on the open reading frame sequences of the two genes CsTA and MrTA, the specific primers of BamHI and PstI were designed for the multiple cloning site with the expression vector pRSFDuet-1.
The primer sequences are shown below:
CsTA forward primer:
5’-CATCACCATCATCACCACAGCCAGGATCCGATGGATTCAATAGCCC-3’;
CsTA reverse primer:
5’-GCCGCAAGCTTGTCGACCTGCAGTCAATTTTCATTTACG-3’;
MrTA forward primer:
5’-CATCACCATCATCACCACAGCCAGGATCCGATGGCGTCAAGCACTGG-3’;
MrTA reverse primer:
5’-GCCGCAAGCTTGTCGACCTGCAGTCAAGTTATG-3’;
(2) According to the specification of the polysaccharide polyphenol total RNA extraction kit, total RNA is respectively extracted from the early fresh leaves of Shucha and the fresh leaves of Myrica rubra, and reverse transcription is carried out on the total RNA by reverse transcriptase to obtain a tea tree cDNA template and a Myrica rubra cDNA template.
(3) Respectively using tea tree cDNA and waxberry cDNA as templates, and respectively amplifying with a primer of CsTA and a primer of MrTA, wherein the amplification procedure is as follows: denaturation at 98℃for 10s, annealing at 62℃for 15s, extension at 72℃for 30s,30 cycles, and further extension at 72℃for 6min to obtain PCR products.
(4) The PCR product was purified using a DNA purification recovery kit, and the product was finally ligated to the expression plasmid pRSFDuet-1 by a one-step cloning method to obtain pRSFDuet-1-CsTA and pRSFDuet-1-MrTA recombinant plasmids, the structure of which is shown in FIG. 9.
2.1.2 prokaryotic expression of the genes CsTA and MrTA and protein purification
The prokaryotic expression and functional verification techniques used in this example are those commonly used or well understood by those of ordinary skill in the art.
(1) Respectively transferring pRSFDuet-1-CsTA and pRSFDuet-1-MrTA recombinant plasmids into escherichia coli BL21 (DE 3) expression host bacteria, adding 200 mu L of LB liquid culture medium, culturing at 37 ℃ for 60min at 180r/min, coating the bacterial liquid on an LB plate containing 50 mu g/mL Kan+, and culturing at 37 ℃ in an inverted manner;
(2) After PCR verification, positive colonies are selected, inoculated into 100mL of sterilized LB liquid medium for expansion culture until OD600 is approximately equal to 0.6;
(3) Adding 90uL of IPTG into the successfully expanded bacterial liquid, culturing for 24 hours at 16 ℃, collecting bacterial cells, adding 10-15mL of loading buffer solution, fully suspending bacterial cells, placing in an ultrasonic crusher, ultrasonically crushing for 30 minutes at 25% power, and centrifugally collecting supernatant at 12000 rpm; recombinant proteins were purified using His-tagged amylose resin affinity columns (affinity chromatography on an amylase resin, new England Biolabs, MA, USA) and protein expression and purification were tested by SDS-PAGE methods commonly used in the art, and the results are shown in FIG. 1. The two purer protein bands were MrTA and CsTA, respectively, and the purified protein was used for further enzymatic analysis.
2.2 Enzyme activity function verification of CsTA and MrTA coexpression recombinant protein
2.2.1 Detection of enzymatic Activity of CsTA and MrTA proteins and determination of optimal condition Range
(1) Enzyme activity reaction system 50 μl: comprises 50mM phosphate buffer (pH 7.0), 1-3 μg CsTA or 1-3 μg MrTA purified protein, 0.8mM EGCG or 0.4mM PGG and 2M methanol, reacting in water bath at 40-45deg.C for 5min, adding equal volume of methanol, shaking, mixing, thoroughly denaturing the enzyme, centrifuging at 13000g for 20min, and detecting with ultra high performance liquid chromatograph and mass spectrum. The EGCG synthesis process is shown in the upper diagram in FIG. 8, and the PGG synthesis process is shown in the lower diagram in FIG. 8.
The method for detecting the enzyme reaction product comprises the following steps: agilent HPLC system, mobile phase: phase A1% acetic acid water, phase B pure acetonitrile, flow rate 0.4mL/min, column model PooshellHPH-C18 column (2.7 μm, 4.6X100 mm), detector wavelength 280nm, sample injection amount 5 μl, gradient elution method: from 0min to 5min, phase b rises from 1% to 10%; from 5min to 20min, phase b rises from 10% to 35%; from 20min to 21min, phase B was reduced from 35% to 10%, from 21min to 23min, phase B was reduced from 10% to 1%, from 23min to 25min, and phase B was maintained at 1% for equilibration of the column. And determining the enzyme reaction product according to the peak emergence time and the maximum ultraviolet absorption peak of the standard substance.
Mass spectrometry method is the same as above, mass spectrometry conditions: electrospray, negative ion mode, collecting compounds with mass-to-charge ratio of 100-1700. According to the results of identifying and analyzing the product peaks by the standard substance and the characteristic ion fragments, when EGCG and methanol or PGG and methanol are respectively used as substrates, no product is produced after boiling protein is added into a reaction system, and when CsTA protein or MrTA protein is added, the substrate EGCG is almost completely consumed to produce the product of methyl gallate, gallic acid GA and epigallocatechin EGC or tetragalloylglucose TeGG, and the peak-out time and mass spectrum information of the obtained product of methyl gallate and the standard substance completely agree with each other to prove that the product is the gallic acid methyl ester.
(2) Determination of the optimal reaction pH range: in order to eliminate the influence on the enzyme activity caused by different buffers, 4 buffers with different buffer intervals were selected to find the optimal buffers and pH ranges of the two proteins. Sodium citrate buffer (pH 4.0-6.0), phosphate buffer (pH 6.0-8.0), tris-hydrochloric acid buffer (pH 7.0-10.5) and glycine-NaOH buffer (pH 9.0-11.0), wherein 1-3 μg CsTA or 1-3 μg MrTA purified protein, 0.8mM EGCG or 0.4mM PGG and 2M methanol are reacted in a water bath at 40-45 ℃ for 3-5min, added with equal volume of methanol, and mixed uniformly by shaking to fully denature enzyme, 13000g is centrifuged for 20min, and peak area detection is carried out by an ultra-high performance liquid chromatograph, and the result is shown in figure 4, wherein the optimal pH range of the two proteins is 8.0-9.0.
(3) Determination of the optimal reaction temperature range: in order to maximize the efficiency of the enzyme reaction, the enzyme reaction was detected at 10, 20, 30, 35, 40, 45, 50, 60, and 70℃respectively, and the optimal temperature ranges of both proteins were 45-50℃as shown in FIG. 5.
2.2.2 Substrate conversion detection of CsTA and MrTA recombinant proteins
Under the optimal condition, the conversion rate of the two proteins to the substrate is judged by setting different reaction time gradients for in vitro enzyme activity experiments. From the curves of substrate and product changes in the enzyme reaction system of FIG. 6, it can be seen that: when EGCG is used as a substrate, 1 mug protein is in a reaction system of CsTA, and the substrate conversion rate can reach more than 90% when the reaction is carried out for 8 min; in the reaction system of MrTA, 1 mug protein can reach more than 90% in 5min.
From the curves of substrate and product changes in the enzyme reaction system of FIG. 7, it can be seen that: when PGG is used as a substrate, 3 mug protein is in a reaction system of CsTA, and the substrate conversion rate can reach more than 200% after 15min of reaction; in the reaction system of MrTA, the conversion rate can reach more than 300% when the reaction is carried out for 15min by 3 mug protein. The method provides a basic condition for the subsequent production of the methyl gallate by utilizing enzyme engineering.
While the invention has been described in detail with respect to the general description and specific embodiments thereof, it will be apparent to those skilled in the art that various modifications and improvements can be made thereto. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, and that the above embodiments and descriptions are merely illustrative of the principles of the present invention, and various changes and modifications may be made without departing from the spirit and scope of the invention, which is defined in the appended claims.
Sequence listing
<110> Anhui university of agriculture
<120> gene for high-efficiency catalysis of methyl gallate biosynthesis and application thereof
<141> 2021-12-27
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 724
<212> DNA
<213> tea (teas)
<400> 1
caagatcaac ggaccagatc acaagctccc cctcctcgtt cactaccacg gtggaggctt 60
ctgcatggga tcctccctcg acaccgtcac tctcagattc ctcacctccc tcgcctccca 120
agcccacttg atcgctatct ccgttgacta caggctcgcc ccggagcacc cattacccat 180
cgcgtatgaa gactcctggt ccgcgttgca gtggatcgct acccactcta acggccaagg 240
acccgatccc tggctaaacc agtacgcgga tttcggtcgg gttttcctgg cgggagagag 300
tgccggggca aatatagccc accaagtggc ggttcgggtt ggcactgtcg gattggaggg 360
tttcatgcca cgtggagtga ttataattca cccctatttt gcgggtagtg aaccggacaa 420
gatgatccag tatttgtatc cggggagtag tgggtcggag gatgacccga atttgagccc 480
caaggaggat ccgaatttga cgaagatggg ttgctccaag gtgattgtgt tcgtggcaga 540
gaaggatcgg ttgaaaccga gaggcgtgga ttactatgag acattgaaaa atagtggttg 600
ggaaggtagg gtggaatttg tggaggataa aggggaggac cactgtttcc atatgtttag 660
tcccaatagt gataaggttg tgggtttgat gcaaaagttg ggtactttcg taaatgaaaa 720
ttga 724
<210> 2
<211> 906
<212> DNA
<213> Myrica rubra (Myrica rubra)
<400> 2
atggcgtcaa gcactggtga gataacccat gatttcccac ctttcttcaa agtatacaaa 60
gatggccgca tagagaggta catgacccat gacccagtcc ccgcgaggct agacccaagc 120
acgggaatac aatccaaaga cgtggtgatc tcgccggaag ccggtgtgtc ggcccgaatc 180
ttcatcccca aaatcaacgg cccggatcaa aagctgccgc taatcgtcca ctaccacggg 240
ggaggcttct gcgttggatc tccgttcgat acgatttccg agagatttct cacatctttg 300
gtctcccaag cgaacgtaat agttgtctct gtcgactaca ggttagcccc agagcaccca 360
ctaccgattg catatgaaga ctcctgggcc gcgctgcagt ggattgcgtc tcattctaac 420
gggcaaggac ccgaaccgag tctcaacgag tacgtggatt tcggtcgggt tttcttgttg 480
ggcgagagcg ctggcgccaa catagccaac tacgtggcag tccaagcagg tgcgattgga 540
ttgcctggcc tgaagatcgt tggggcactt atagtgcacc ccttctttgg gagcaaggag 600
cgggatgaaa tgtacacgtt tctgtgtccc acgagctcag ggtgtgagga tgacccgaaa 660
ctgaacccgg cggttgatcc gaatttgtcg accatggcgt gcgcgaaggt cctggtatgt 720
gtggcggaga aagacgggct gaaagataga ggggtggcct actgcgagac tttacgtaag 780
agcggttggg gcggttctgt ggaattattc gaaaccaaag gagaagacca ctgctttcat 840
atgttcggcg aaggggaaaa cgtcgtgtcg ttgacgaaaa agtttgtcga ctttatcata 900
acttga 906
<210> 3
<211> 299
<212> PRT
<213> tea (teas)
<400> 3
Met Asp Ser Ile Ala His Asp Phe Pro Pro Phe Phe Arg Val His Lys
1 5 10 15
Asp Gly Arg Val Glu Arg Phe Met Val Ser Asp Tyr Val Pro Pro Ala
20 25 30
Val Asp Pro Lys Thr Gly Val Glu Phe Lys Asp Thr Leu Ile Ser Pro
35 40 45
Glu Thr Gly Val Lys Ala Arg Ile Phe Leu Pro Lys Ile Asn Gly Pro
50 55 60
Asp His Lys Leu Pro Leu Leu Val His Tyr His Gly Gly Gly Phe Cys
65 70 75 80
Met Gly Ser Ser Leu Asp Thr Val Thr Leu Arg Phe Leu Thr Ser Leu
85 90 95
Ala Ser Gln Ala His Leu Ile Ala Ile Ser Val Asp Tyr Arg Leu Ala
100 105 110
Pro Glu His Pro Leu Pro Ile Ala Tyr Glu Asp Ser Trp Ser Ala Leu
115 120 125
Gln Trp Ile Ala Thr His Ser Asn Gly Gln Gly Pro Asp Pro Trp Leu
130 135 140
Asn Gln Tyr Ala Asp Phe Gly Arg Val Phe Leu Ala Gly Glu Ser Ala
145 150 155 160
Gly Ala Asn Ile Ala His Gln Val Ala Val Arg Val Gly Thr Val Gly
165 170 175
Leu Glu Gly Phe Met Pro Arg Gly Val Ile Ile Ile His Pro Tyr Phe
180 185 190
Ala Gly Ser Glu Pro Asp Lys Met Ile Gln Tyr Leu Tyr Pro Gly Ser
195 200 205
Ser Gly Ser Glu Asp Asp Pro Asn Leu Ser Pro Lys Glu Asp Pro Asn
210 215 220
Leu Thr Lys Met Gly Cys Ser Lys Val Ile Val Phe Val Ala Glu Lys
225 230 235 240
Asp Arg Leu Lys Pro Arg Gly Val Asp Tyr Tyr Glu Thr Leu Lys Asn
245 250 255
Ser Gly Trp Glu Gly Arg Val Glu Phe Val Glu Asp Lys Gly Glu Asp
260 265 270
His Cys Phe His Met Phe Ser Pro Asn Ser Asp Lys Val Val Gly Leu
275 280 285
Met Gln Lys Leu Gly Thr Phe Val Asn Glu Asn
290 295
<210> 4
<211> 301
<212> PRT
<213> Myrica rubra (Myrica rubra)
<400> 4
Met Ala Ser Ser Thr Gly Glu Ile Thr His Asp Phe Pro Pro Phe Phe
1 5 10 15
Lys Val Tyr Lys Asp Gly Arg Ile Glu Arg Tyr Met Thr His Asp Pro
20 25 30
Val Pro Ala Arg Leu Asp Pro Ser Thr Gly Ile Gln Ser Lys Asp Val
35 40 45
Val Ile Ser Pro Glu Ala Gly Val Ser Ala Arg Ile Phe Ile Pro Lys
50 55 60
Ile Asn Gly Pro Asp Gln Lys Leu Pro Leu Ile Val His Tyr His Gly
65 70 75 80
Gly Gly Phe Cys Val Gly Ser Pro Phe Asp Thr Ile Ser Glu Arg Phe
85 90 95
Leu Thr Ser Leu Val Ser Gln Ala Asn Val Ile Val Val Ser Val Asp
100 105 110
Tyr Arg Leu Ala Pro Glu His Pro Leu Pro Ile Ala Tyr Glu Asp Ser
115 120 125
Trp Ala Ala Leu Gln Trp Ile Ala Ser His Ser Asn Gly Gln Gly Pro
130 135 140
Glu Pro Ser Leu Asn Glu Tyr Val Asp Phe Gly Arg Val Phe Leu Leu
145 150 155 160
Gly Glu Ser Ala Gly Ala Asn Ile Ala Asn Tyr Val Ala Val Gln Ala
165 170 175
Gly Ala Ile Gly Leu Pro Gly Leu Lys Ile Val Gly Ala Leu Ile Val
180 185 190
His Pro Phe Phe Gly Ser Lys Glu Arg Asp Glu Met Tyr Thr Phe Leu
195 200 205
Cys Pro Thr Ser Ser Gly Cys Glu Asp Asp Pro Lys Leu Asn Pro Ala
210 215 220
Val Asp Pro Asn Leu Ser Thr Met Ala Cys Ala Lys Val Leu Val Cys
225 230 235 240
Val Ala Glu Lys Asp Gly Leu Lys Asp Arg Gly Val Ala Tyr Cys Glu
245 250 255
Thr Leu Arg Lys Ser Gly Trp Gly Gly Ser Val Glu Leu Phe Glu Thr
260 265 270
Lys Gly Glu Asp His Cys Phe His Met Phe Gly Glu Gly Glu Asn Val
275 280 285
Val Ser Leu Thr Lys Lys Phe Val Asp Phe Ile Ile Thr
290 295 300
Claims (2)
1.MrTAThe application of the gene in catalyzing the biosynthesis of the methyl gallate is characterized in thatMrTAThe nucleotide sequence of the gene is shown as SEQ ID NO. 2.
2. The use according to claim 1, wherein,MrTAthe gene catalysis methyl gallate biosynthesis steps are as follows: adding into a reaction system containing EGCG or PGG and methanol as substrateMrTAThe recombinant protein MrTA of the gene is used for biologically synthesizing the methyl gallate through an enzyme catalytic reaction.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111610349.3A CN114410665B (en) | 2021-12-27 | 2021-12-27 | Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111610349.3A CN114410665B (en) | 2021-12-27 | 2021-12-27 | Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114410665A CN114410665A (en) | 2022-04-29 |
CN114410665B true CN114410665B (en) | 2024-01-16 |
Family
ID=81270309
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111610349.3A Active CN114410665B (en) | 2021-12-27 | 2021-12-27 | Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114410665B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004087909A1 (en) * | 2003-03-28 | 2004-10-14 | Nihon University | Polynucleotide encoding 2-hydorxyisoflavanone dehydratase and application of the same |
CN101993861A (en) * | 2009-08-31 | 2011-03-30 | 同济大学 | Recombinant expression of carboxyl esterase |
CN102719413A (en) * | 2012-04-28 | 2012-10-10 | 中山大学 | Novel tannase and application thereof |
CN104761627A (en) * | 2015-04-20 | 2015-07-08 | 吉林大学 | Application of GmbHLH transcription factor in promotion of synthesizing soy isoflavone |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8404465B2 (en) * | 2009-03-11 | 2013-03-26 | Celexion, Llc | Biological synthesis of 6-aminocaproic acid from carbohydrate feedstocks |
-
2021
- 2021-12-27 CN CN202111610349.3A patent/CN114410665B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004087909A1 (en) * | 2003-03-28 | 2004-10-14 | Nihon University | Polynucleotide encoding 2-hydorxyisoflavanone dehydratase and application of the same |
CN101993861A (en) * | 2009-08-31 | 2011-03-30 | 同济大学 | Recombinant expression of carboxyl esterase |
CN102719413A (en) * | 2012-04-28 | 2012-10-10 | 中山大学 | Novel tannase and application thereof |
CN104761627A (en) * | 2015-04-20 | 2015-07-08 | 吉林大学 | Application of GmbHLH transcription factor in promotion of synthesizing soy isoflavone |
Non-Patent Citations (4)
Title |
---|
Dai,X. et al..Camellia sinensis tannase mRNA, complete cds GenBank: MK381269.1.《genbank》.2019,第1页. * |
Discovery and characterization of tannase genes in plants: roles in hydrolysis of tannins;Xinlong Dai et al.;《New Phytologist》;第226卷;第1104-1116页 * |
Jia,H.M. et al..2-hydroxyisoflavanone dehydratase [Morella rubra] GenBank: KAB1207115.1.《genbank》.2019,第1-2页. * |
New insights into the function of plant tannase with promiscuous acyltransferase activity;Yifan Chen et al.;《The Plant Journal》;第113卷;第576-594页 * |
Also Published As
Publication number | Publication date |
---|---|
CN114410665A (en) | 2022-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Cheng et al. | Molecular cloning and function assay of a chalcone isomerase gene (GbCHI) from Ginkgo biloba | |
CN113481185B (en) | Salt-tolerant beta-galactosidase GalNC2-13 and preparation method and application thereof | |
Hua et al. | Expression patterns of an isoflavone reductase-like gene and its possible roles in secondary metabolism in Ginkgo biloba | |
JP2021168646A (en) | Hydroxylase gene and use thereof | |
CN111420037B (en) | Application of phage lyase Lysep3 in preparation of broad-spectrum antibacterial drugs | |
JP2002535346A (en) | Methods for controlling fungal pathogens and substances useful therefor | |
CN114410665B (en) | Gene for efficiently catalyzing biosynthesis of methyl gallate and application thereof | |
JP2023514687A (en) | Glycosyltransferases, polynucleotides encoding them and methods of use | |
JP4350930B2 (en) | Novel gene containing DNA sequence encoding hydroxynitrile lyase, recombinant protein derived from the same gene and having hydroxynitrile lyase activity and use thereof | |
CN110616205B (en) | Flavone synthase for synthesis and preparation of flavone glycoside | |
CN108977455B (en) | Recombinant plasmid for producing oxalate decarboxylase, escherichia coli expression system, method and application | |
CA2151257A1 (en) | Dna encoding chitin deacetylase | |
CN108424937B (en) | Method for synthesizing tanshinol by enzyme method | |
CN106906189B (en) | P-coumaric acid-3-hydroxylase and coding gene and application thereof | |
CN113151232B (en) | 1-aminocyclopropane-1-carboxylic acid synthetase of lycoris aurea, and coding gene and application thereof | |
Li et al. | Effects of N-terminal deletions on 1-aminocyclopropane-1-carboxylate synthase activity | |
CN111411099B (en) | Hemsleya amabilis acetyl transferase, coding gene thereof and application of hemsleya amabilis acetyl transferase in preparation of cucurbitacin | |
Luo et al. | Proteolytic activation and characterization of recombinant polyphenol oxidase from Rosa chinensis for efficient synthesis of theaflavins | |
CN108265068B (en) | Recombinant arginine deiminase and industrial preparation method and application thereof | |
Li et al. | High-level expression of a sika deer (Cervus nippon) Cu/Zn superoxide dismutase in Pichia pastoris and its characterization | |
CN111394382B (en) | Recombinant expression vector and recombinant bacterium of feruloyl esterase BpFae gene, and recombinant expression method | |
Chen et al. | Molecular characterization, recombinant expression in Escherichia coli and biological activity of (S)-Tetrahydroberberine oxidase from Corydalis saxicola Bunt | |
CN113528552B (en) | Gene combination for synergetic catalysis of tea tree ester type catechin biosynthesis and application thereof | |
CN114181920B (en) | Rhodobacter capsulatus 5-aminoacetylpropionic acid synthetase mutant and application thereof | |
CN114657195B (en) | Gene capable of hydrolyzing fruit-bound benzaldehyde and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |