CN113582803A - 5-11 bicyclic sesterterpene skeleton compounds and preparation thereof - Google Patents
5-11 bicyclic sesterterpene skeleton compounds and preparation thereof Download PDFInfo
- Publication number
- CN113582803A CN113582803A CN202110709524.8A CN202110709524A CN113582803A CN 113582803 A CN113582803 A CN 113582803A CN 202110709524 A CN202110709524 A CN 202110709524A CN 113582803 A CN113582803 A CN 113582803A
- Authority
- CN
- China
- Prior art keywords
- nnns
- sesterterpene
- leu
- bicyclic
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- -1 bicyclic sesterterpene skeleton compounds Chemical class 0.000 title claims abstract description 44
- 238000002360 preparation method Methods 0.000 title abstract description 5
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 43
- 108090000364 Ligases Proteins 0.000 claims abstract description 16
- 102000003960 Ligases Human genes 0.000 claims abstract description 16
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 12
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 9
- 238000007363 ring formation reaction Methods 0.000 claims abstract description 7
- 239000000758 substrate Substances 0.000 claims abstract description 4
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 3
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 3
- 239000002157 polynucleotide Substances 0.000 claims abstract description 3
- 241001226034 Nectria <echinoderm> Species 0.000 claims abstract 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 2
- 230000014509 gene expression Effects 0.000 claims description 23
- 238000000034 method Methods 0.000 claims description 17
- 150000003505 terpenes Chemical class 0.000 claims description 14
- 239000013612 plasmid Substances 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 11
- 210000001938 protoplast Anatomy 0.000 claims description 11
- 238000003259 recombinant expression Methods 0.000 claims description 8
- 238000012216 screening Methods 0.000 claims description 7
- 235000007586 terpenes Nutrition 0.000 claims description 7
- 101710135150 (+)-T-muurolol synthase ((2E,6E)-farnesyl diphosphate cyclizing) Proteins 0.000 claims description 6
- 240000006439 Aspergillus oryzae Species 0.000 claims description 6
- 235000002247 Aspergillus oryzae Nutrition 0.000 claims description 6
- 101710119400 Geranylfarnesyl diphosphate synthase Proteins 0.000 claims description 6
- 101710107752 Geranylgeranyl diphosphate synthase Proteins 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 6
- 238000012795 verification Methods 0.000 claims description 6
- 108091092195 Intron Proteins 0.000 claims description 5
- 230000003321 amplification Effects 0.000 claims description 5
- 239000013613 expression plasmid Substances 0.000 claims description 5
- 238000000855 fermentation Methods 0.000 claims description 5
- 230000004151 fermentation Effects 0.000 claims description 5
- 239000001963 growth medium Substances 0.000 claims description 5
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 5
- 238000004321 preservation Methods 0.000 claims description 5
- 239000013598 vector Substances 0.000 claims description 5
- 238000005516 engineering process Methods 0.000 claims description 4
- 238000009630 liquid culture Methods 0.000 claims description 4
- 230000009465 prokaryotic expression Effects 0.000 claims description 4
- 238000002390 rotary evaporation Methods 0.000 claims description 4
- 210000004027 cell Anatomy 0.000 claims description 3
- 239000002299 complementary DNA Substances 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims description 3
- 239000000287 crude extract Substances 0.000 claims description 3
- 239000012634 fragment Substances 0.000 claims description 3
- 230000006801 homologous recombination Effects 0.000 claims description 3
- 238000002744 homologous recombination Methods 0.000 claims description 3
- 238000009629 microbiological culture Methods 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 238000010898 silica gel chromatography Methods 0.000 claims description 3
- 238000012546 transfer Methods 0.000 claims description 3
- 230000009466 transformation Effects 0.000 claims description 3
- 238000005303 weighing Methods 0.000 claims description 3
- 241001302160 Escherichia coli str. K-12 substr. DH10B Species 0.000 claims description 2
- 238000002156 mixing Methods 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 241000616582 Neivamyrmex nigrescens Species 0.000 claims 1
- 210000004899 c-terminal region Anatomy 0.000 claims 1
- 150000001875 compounds Chemical class 0.000 abstract description 29
- 102000004169 proteins and genes Human genes 0.000 abstract description 11
- 108700005078 Synthetic Genes Proteins 0.000 abstract description 5
- 229930004725 sesquiterpene Natural products 0.000 abstract description 3
- 238000010367 cloning Methods 0.000 abstract 1
- 229930002368 sesterterpene Natural products 0.000 description 16
- UHOVQNZJYSORNB-MZWXYZOWSA-N benzene-d6 Chemical compound [2H]C1=C([2H])C([2H])=C([2H])C([2H])=C1[2H] UHOVQNZJYSORNB-MZWXYZOWSA-N 0.000 description 14
- 238000004090 dissolution Methods 0.000 description 14
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 9
- 241000647196 Nectria nigrescens Species 0.000 description 9
- 238000001228 spectrum Methods 0.000 description 9
- WEVYAHXRMPXWCK-UHFFFAOYSA-N acetonitrile Substances CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 8
- 239000000047 product Substances 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 150000002653 sesterterpene derivatives Chemical class 0.000 description 6
- 241000233866 Fungi Species 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 229930014626 natural product Natural products 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 108020004414 DNA Proteins 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 238000005481 NMR spectroscopy Methods 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- 238000006555 catalytic reaction Methods 0.000 description 4
- 230000002538 fungal effect Effects 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical compound CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- RRHGJUQNOFWUDK-UHFFFAOYSA-N Isoprene Chemical compound CC(=C)C=C RRHGJUQNOFWUDK-UHFFFAOYSA-N 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 238000001026 1H--1H correlation spectroscopy Methods 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000001588 bifunctional effect Effects 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 238000001460 carbon-13 nuclear magnetic resonance spectrum Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 229930004069 diterpene Natural products 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- HQVFCQRVQFYGRJ-UHFFFAOYSA-N formic acid;hydrate Chemical compound O.OC=O HQVFCQRVQFYGRJ-UHFFFAOYSA-N 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 238000001052 heteronuclear multiple bond coherence spectrum Methods 0.000 description 2
- 238000000990 heteronuclear single quantum coherence spectrum Methods 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- QWTDNUCVQCZILF-UHFFFAOYSA-N isopentane Chemical compound CCC(C)C QWTDNUCVQCZILF-UHFFFAOYSA-N 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 229920001550 polyprenyl Polymers 0.000 description 2
- 125000001185 polyprenyl group Polymers 0.000 description 2
- CBIDRCWHNCKSTO-UHFFFAOYSA-N prenyl diphosphate Chemical compound CC(C)=CCO[P@](O)(=O)OP(O)(O)=O CBIDRCWHNCKSTO-UHFFFAOYSA-N 0.000 description 2
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 108010087432 terpene synthase Proteins 0.000 description 2
- 238000005160 1H NMR spectroscopy Methods 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- GHBSKQGCIYSCNS-NAKRPEOUSA-N Ala-Leu-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GHBSKQGCIYSCNS-NAKRPEOUSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- 241001306278 Diaporthe amygdali Species 0.000 description 1
- 108010006731 Dimethylallyltranstransferase Proteins 0.000 description 1
- 102000005454 Dimethylallyltranstransferase Human genes 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 1
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- PFOUFRJYHWZJKW-NKIYYHGXSA-N His-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O PFOUFRJYHWZJKW-NKIYYHGXSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- DJJBHQHOZLUBCN-WDSOQIARSA-N Met-Lys-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DJJBHQHOZLUBCN-WDSOQIARSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- NEHSHYOUIWBYSA-DCPHZVHLSA-N Phe-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NEHSHYOUIWBYSA-DCPHZVHLSA-N 0.000 description 1
- UNZNFYLFNRNPBS-UHFFFAOYSA-N Phe-Gln-Ala-Arg Chemical compound NC(N)=NCCCC(C(O)=O)NC(=O)C(C)NC(=O)C(CCC(N)=O)NC(=O)C(N)CC1=CC=CC=C1 UNZNFYLFNRNPBS-UHFFFAOYSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- 101100226889 Phomopsis amygdali PaFS gene Proteins 0.000 description 1
- ZRZNJUXESFHSIO-UHFFFAOYSA-N Pleuromutilin Natural products CC1C(O)C(C)(C=C)CC(OC(=O)CO)C2(C)C(C)CCC31C2C(=O)CC3 ZRZNJUXESFHSIO-UHFFFAOYSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- 102000001435 Synapsin Human genes 0.000 description 1
- 108050009621 Synapsin Proteins 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- SJWLQICJOBMOGG-PMVMPFDFSA-N Trp-Tyr-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O)N SJWLQICJOBMOGG-PMVMPFDFSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- XBJFCYDKBDVADW-UHFFFAOYSA-N acetonitrile;formic acid Chemical compound CC#N.OC=O XBJFCYDKBDVADW-UHFFFAOYSA-N 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000001348 anti-glioma Effects 0.000 description 1
- 230000000078 anti-malarial effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000003430 antimalarial agent Substances 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 125000002619 bicyclic group Chemical group 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 244000144987 brood Species 0.000 description 1
- 239000012159 carrier gas Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000005100 correlation spectroscopy Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- AFABGHUZZDYHJO-UHFFFAOYSA-N dimethyl butane Natural products CCCC(C)C AFABGHUZZDYHJO-UHFFFAOYSA-N 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 150000004141 diterpene derivatives Chemical class 0.000 description 1
- 125000000567 diterpene group Chemical group 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000002451 electron ionisation mass spectrometry Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 239000004519 grease Substances 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 238000003919 heteronuclear multiple bond coherence Methods 0.000 description 1
- 238000005570 heteronuclear single quantum coherence Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000008811 mitochondrial respiratory chain Effects 0.000 description 1
- 229930003658 monoterpene Natural products 0.000 description 1
- 150000002773 monoterpene derivatives Chemical class 0.000 description 1
- 235000002577 monoterpenes Nutrition 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 1
- 229930184030 ophiobolin Natural products 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- ZRZNJUXESFHSIO-VYTKZBNOSA-N pleuromutilin Chemical compound C([C@H]([C@]1(C)[C@@H](C[C@@](C)(C=C)[C@@H](O)[C@@H]2C)OC(=O)CO)C)C[C@]32[C@H]1C(=O)CC3 ZRZNJUXESFHSIO-VYTKZBNOSA-N 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 150000004354 sesquiterpene derivatives Chemical class 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 150000003648 triterpenes Chemical class 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C13/00—Cyclic hydrocarbons containing rings other than, or in addition to, six-membered aromatic rings
- C07C13/28—Polycyclic hydrocarbons or acyclic hydrocarbon derivatives thereof
- C07C13/32—Polycyclic hydrocarbons or acyclic hydrocarbon derivatives thereof with condensed rings
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/001—Oxidoreductases (1.) acting on the CH-CH group of donors (1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P5/00—Preparation of hydrocarbons or halogenated hydrocarbons
- C12P5/002—Preparation of hydrocarbons or halogenated hydrocarbons cyclic
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C2602/00—Systems containing two condensed rings
- C07C2602/02—Systems containing two condensed rings the rings having only two atoms in common
- C07C2602/14—All rings being cycloaliphatic
- C07C2602/32—All rings being cycloaliphatic the ring system containing at least eleven carbon atoms
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Mycology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention provides a 5-11 double-ring sesterterpene skeleton compound and a preparation method thereof, wherein the structural formula of Nigtetraene is shown as follows, the amino acid sequence of synthetase NnNS is shown as SEQ ID NO.3, a synthetic gene is obtained by cloning from a CS12199 strain (Nectria. nigrescens 12199) genome, and a polynucleotide sequence is shown as SEQ ID NO. 1. The NnNS protein has the functions of catalyzing the chain length extension and structural cyclization of a substrate, and can assist the synthesis of the mother nucleus of the Nigtetraene compound. The invention is 5-11 bicyclicThe biosynthesis of the sesquiterpene compound provides a new resource, and a choice is provided for the synthesis of the compound types.
Description
Technical Field
The invention belongs to the technical field of genetic engineering, and particularly relates to a 5-11 bicyclic sesterterpene skeleton compound Nigtetraene synthetic gene NnNS cloned from Nectria nigrescens 12199 and application thereof.
Background
Terpenoids are a small molecule compound formed by combining isoprene or isopentane in various ways, and are the largest group of compounds in natural products, and more than 8 ten thousand compounds have been discovered from plants, animals and microorganisms to date1. Terpenoids can generally be classified as monoterpenes (C) according to the number of isoprene units contained therein10) Sesquiterpenes (C)15) Diterpenes (C)20) Sesterterpene (C)25) Triterpene (C)30) And the like. Wherein the amount of sesterterpene is less than 2% of total terpenes, especially attention is paid2. At present, most sesterterpenes are extracted and separated from marine organism sponges, but also exist in lichens, fungi, insect secretions and plants. Sesterterpene compounds have a broad spectrum of physiological activities, and as an important calmodulin inhibitor, the sesterterpene compounds, such as ophiobolins A, are useful3And simultaneously has antimalarial and anti-glioma activities, the derivative 3-anhydro-6-hydroxyyophiobolin A can promote the degradation of alpha synapsin of PC12 cells, and has potential application in the treatment of Parkinson's disease4. Teretonins sesterterpene compoundsThe teretonins E and F have the function of inhibiting mitochondrial respiratory chain5. In addition, it also has antibacterial effect6Inhibiting receptors7And the like. In 1965, more than 70 sesterterpene compounds derived from filamentous fungi are discovered, and the application prospect in biomedicine and other aspects is good due to the simple structure and various biological activities8。
Terpenoids start with two C5The unit isomers of Dimethylallyl pyrophosphate (DMAPP) and Isopentenyl diphosphate (IPP) are subjected to chain extension under the catalysis of isoamyltransferase (PTs) to form a chain polyprenyl pyrophosphate precursor, and then are subjected to cyclization under the catalysis of Terpene Cyclase (TCs) to form a Terpene skeleton, and end products with different cyclization structures are formed under the catalytic reaction of various post-modifying enzymes. Most of sesterterpenes from fungi are generated by catalysis of special Bifunctional terpene synthases (BFTSs), and the enzymes simultaneously have prenyl transferase and terpene cyclase and respectively catalyze chain extension and cyclization of polyprenyl pyrophosphate. The bifunctional terpene synthase PaFS was explored from the plant pathogenic fungus Phomopsis amygdali by the Sassa team in Japan from 20079Thereafter, a total of 22 BFTSs have been reported up to now.
The process of obtaining the terpenoid by the traditional chemical synthesis method has the defects of complicated operation steps, unstable reaction conditions, unfriendliness to the environment and the like, and the most important point is that the process has certain limitation in obtaining a new terpenoid skeleton. However, in recent years, with the recent iteration of genome sequencing technology and the rapid development of bioinformatics, microorganisms have been found to have great potential in the production of terpenoids. Therefore, the biosynthetic gene cluster is reconstructed in a heterologous host which is easy to operate and has clear genetic background for heterologous expression, and a large amount of new framework sesterterpene compounds and derivatives thereof can be obtained. Not only is the natural product resource library enriched, but also more choices are provided for finding new drugs. Is commonly used at present for fungus gradeHeterologous expression hosts for metabolites including prokaryotic expression systems, yeast expression systems, and filamentous fungal expression systems10. Prokaryotic expression systems such as E.coli, because they cannot cut introns in eukaryotic genes, can be applied to overexpression and functional identification of some enzymes in biosynthetic pathways. Yeast expression systems, such as Saccharomyces cerevisiae, whose secretory pathways enable correct protein processing and post-translational modifications, can be used for the analysis of the biosynthetic machinery of fungal natural products. Filamentous fungal expression systems, such as Aspergillus oryzae, can spontaneously accomplish the correct splicing of introns in a gene and can be used for biosynthetic pathway resolution and heterologous production of fungal natural products. Subject group of professor Oikawa successfully expressed diterpene compound pleuromutilin in Aspergillus oryzae11The sesquiterpene compound (-) -Terestacin12And the like natural terpene products.
Disclosure of Invention
The invention is carried out by the research, and provides 5-11 bicyclic sesterterpene skeleton compounds, synthetases thereof, coding genes of the synthetases and a heterologous expression method of the compounds.
The idea of the invention is as follows: in the biosynthesis process of the 5-11 bicyclic sesterterpene compounds serving as metabolites of the excavated strain N.nigrescens 12199, the functional analysis of gene genes shows that the NnNS gene is related to terpene synthesis, and the NnNS gene is involved in the biosynthesis of the 5-11 bicyclic sesterterpene compounds. Further obtains the NnNS protein through heterogeneously expressing the NnNS gene to carry out in vitro enzymatic reaction, and verifies that the NnNS gene is a synthetic gene of a 5-11 bicyclic sesterterpene compound Nigtetraene.
The invention aims at providing the structure of 5-11 bicyclic sesterterpene skeleton compounds; the second purpose is to provide an enzyme and a gene for synthesizing the compound; the third objective is to provide a method for heterologous expression of 5-11 bicyclic sesterterpene skeleton compounds.
In order to achieve the purpose, the technical scheme adopted by the invention is as follows:
in a first aspect of the invention, there is provided a 5-11 bicyclic sesterterpene skeleton compound, nitterlene,the molecular formula is C25H40The structural formula is as follows:
in a second aspect of the invention, the synthetase NnNS of the 5-11 bicyclic sesterterpene skeleton compound is provided, the amino acid sequence of the synthetase is shown in SEQ ID No.3, and the N end and the C end of the synthetase are respectively responsible for terpene cyclization and isopentenyl transfer functions.
Further, the synthetase contains two conserved domains: the terpene cyclase domain contains two recognition domains for Mg2+And the characteristic conserved motifs of the substrates DDVIE and NDYFSWDKE, the E-IPPS domain also contains two characteristic conserved motifs with similar functions DDVED and DDYLN.
In a second aspect of the present invention, there is provided a gene encoding the above-mentioned synthetase cloned from the genome of the CS12199 strain (N. nigrescens 12199), and the polynucleotide sequence thereof is shown in SEQ ID NO. 1. The CS12199 strain is preserved in China general microbiological culture Collection center (CGMCC) with the preservation number of CGMCC No. 21943. The gene contains 7 introns, the cDNA size is 2205bp, and the sequence is shown in SEQ ID NO. 2.
In a third aspect of the present invention, there is provided a recombinant expression vector of 5-11 bicyclic sesterterpene skeleton compounds, wherein the recombinant expression vector is a eukaryotic or prokaryotic expression vector carrying the above-mentioned synthetase or gene, such as E.coli, yeast system and filamentous fungi.
The fourth aspect of the invention provides a recombinant expression host cell of 5-11 bicyclic sesterterpene skeleton compounds, which contains the recombinant expression vector to realize the heterologous expression of the compounds.
In a fifth aspect of the invention, the application of the above-mentioned synthetase or gene in the synthesis of terpenoid, in particular to the application in the synthesis of 5-11 bicyclic sesterterpene skeleton compounds is provided.
In a sixth aspect of the invention, a method for heterologous expression of a 5-11 bicyclic sesterterpene skeleton compound is provided, which comprises the following steps:
A. construction of NnNS gene heterologous expression vector
Amplifying by using a PCR technology and using a N.nigrescens 12199 genome as a template to obtain a gene sequence containing NnNS, wherein primer sequences used for amplification are respectively shown as SEQ ID NO.4 and SEQ ID NO. 5;
primer sequences used for amplification:
NnNS-F:cgGAATTCGAGCTCGATGGCTCCATTGTCGATCAT;
NnNS-R:actacaGATCCCCGGTCATCGAGCTTCAATTTCCAGCCG。
connecting the amplified fragment with a pUARA2 vector through homologous recombination to construct a pUARA2-NnNS expression plasmid, transforming the ligation product into Escherichia coli DH10B, screening positive transformants, extracting plasmid PCR verification after culture to obtain a pUARA2-NnNS plasmid,
B. protoplast transformation
Culturing Aspergillus oryzae NSAR1, collecting protoplast, mixing with pUARA2-NnNS plasmid, culturing, performing PCR verification on the grown transformant, wherein the positive transformant is the NnNS heterologous expression strain AO-NnNS,
C. culture of heterologous expression strain AO-NnNS and product separation
Inoculating a heterologous expression strain AO-NnNS, screening by a pUARA2 plasmid screening liquid culture medium, performing fermentation culture, separating and purifying the obtained fermentation crude extract by adopting a forward silica gel column chromatography method, detecting each flow part rapidly by TLC, combining the same flow parts with spots, concentrating under reduced pressure, performing rotary evaporation to dryness, transferring into a weighed sample bottle, weighing the sample and recording the weight.
The invention has the following beneficial effects:
the invention discovers the NnNS gene for synthesizing the 5-11 bicyclic sesterterpene skeleton compound Nigtetraene, and the encoded NnNS protein can assist the synthesis of the mother nucleus of the Nigtetraene compound. The invention provides a new resource for the biosynthesis of the 5-11 bicyclic sesterterpene compounds and provides a choice for the synthesis of the compounds.
The NnNS gene discovered by the invention catalyzes and generates a new 5-11 bicyclic sesterterpene skeleton compound, and provides a valuable lead compound resource for enriching a natural product compound library and discovering new antibiotics.
Drawings
FIG. 1 is a HR-EI-MS spectrum of the compound Nigtetraene of the present invention.
FIG. 2 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In (1)1H-NMR spectrum.
FIG. 3 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In (1)13C-NMR spectrum.
FIG. 4 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In (1)1H-1H COSY spectra.
FIG. 5 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6HSQC spectrum in (1).
FIG. 6 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6HMBC spectrum in (1).
FIG. 7 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6NOESY spectrum of (1).
FIG. 8 is a graph showing the alignment of the amino acid sequences of the protein encoded by the NnNS gene in Nectria nigrescens 12199 and the reported proteins.
Strain preservation information: CS12199(Nectria nigrescens 12199) is preserved in China general microbiological culture Collection center (CGMCC), with the preservation address of No.3, No. 04/02 at 2021 years and the preservation number of CGMCC No. 21943.
Detailed Description
The following embodiments are implemented on the premise of the technical scheme of the invention, and give detailed implementation modes and specific operation procedures, but the protection scope of the invention is not limited to the following embodiments.
The experimental procedures used in the following examples are all conventional procedures unless otherwise specified.
Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
The 5-11 bicyclic sesterterpene compound Nigtetraene synthetic gene provided by the invention is cloned from Nectria nigrescens 12199 and named as NnNS gene, and the gene sequence of the gene is shown in SEQ ID NO. 1. The NnNS gene contains 7 introns, the cDNA size is 2205bp, and the sequence is shown in SEQ ID NO. 2. The NnNS gene coded protein is named as NnNS protein, and the amino acid sequence of the NnNS protein is shown as SEQ ID NO. 3.
The NnNS protein belongs to a chimeric terpene synthetase, and the N end and the C end of the NnNS protein are respectively responsible for the functions of terpene cyclization and isopentenyl group transfer. The NnNS protein contains two conserved domains, wherein the terpene cyclase domain contains two domains for recognizing Mg2+And the characteristic conserved motifs of the substrate, DDVIE and NDYFSWDKE, the E-IPPS domain also contains two characteristic conserved motifs with similar functions, DDVED and DDYLN (FIG. 8).
Example 1: heterologous expression of 5-11 bicyclic sesterterpene compound Nigtetraene synthetic gene and structural identification of 5-11 bicyclic bi-sesquiterpene skeleton compound
By using a heterologous expression method, the NnNS gene in the Nectria nigrescens 12199 thallus is transferred into a host Aspergillus oryzae by constructing an expression plasmid, and the production condition of a heterologous expression strain product is detected. The medium formulation used is shown in Table 1.
TABLE 1 culture Medium formulation used in the examples
Construction of NnNS Gene Source expression vector
(1) The gene sequence containing NnNS is obtained by PCR technology and amplification by using the N.nigrescens 12199 genome as a template. Primer sequences used for amplification:
NnNS-F:cgGAATTCGAGCTCGATGGCTCCATTGTCGATCAT(SEQ ID NO.4);
NnNS-R:actacaGATCCCCGGTCATCGAGCTTCAATTTCCAGCCG(SEQ ID NO.5)。
(2) the amplified fragment is connected with a pUARA2 vector through homologous recombination to construct a pUARA2-NnNS expression plasmid, and the two sides of the vector NnNS have a homologous sequence which is consistent with the pUARA2 vector.
(3) The ligation product was transformed into E.coli DH10B and positive transformants were selected by ampicillin screening. Liquid culture of positive transformant, extraction of plasmid PCR verification, and obtaining of pUARA2-NnNS plasmid.
2. Transformation of protoplasts
(1) Aspergillus oryzae NSAR1 was spread on PDA plates and cultured at 30 ℃ for 7 days.
(2) Spores were collected in 10mL of 0.1% Tween-80 (1 plate of the brood is typically collected) and counted on a hemocytometer. Inoculation about 107The spores were cultured in 50mL DPY at 30 ℃ and 220rpm for 2-3 days.
(3) 100mg of Yatalase was weighed, dissolved by addition of dissolution 0, 20ml was sterilized by filtration through a 0.22 μm filter and added to a 50ml centrifuge tube.
(4) And collecting the thallus. Pouring 100ml of cultured mycelia into a P250 glass filter, removing a culture medium, washing with sterile water (or 0.8M NaCl) for 3-5 times, squeezing out water with a sterile medicine spoon, and adding the pressed dry mycelia into a Yatalase solution. Culturing at 30 deg.C and 200rpm under shaking for 1-2 hr until the spherical mycelium disappears and the supernatant is clear and dirty.
(5) The digested bacterial solution was filtered through Miracloth, protoplasts were collected and transferred to a new 50ml centrifuge tube, centrifuged at 4 ℃ at 800g for 5 min.
(6) The supernatant was removed, 20ml of 0.8M NaCl was added for resuspension washing, and centrifugation was carried out at 4 ℃ at 800g for 5min (twice washing). The supernatant was removed and 10ml of 0.8M NaCl was added. The number of protoplasts was counted under a microscope using a bacteria counter. Protoplast count-total count/80X 400ml X104X dilution factor.
(7) Adjusting the protoplast concentration to 2X 108cell/ml. (sol 2/sol 3 ═ 4/1), according toThe protoplast of 0.5ml-2ml can be obtained according to the growth condition of the thallus.
(8) 200. mu.l of the protoplast solution was transferred to a new 50ml centrifuge tube, 10. mu.g of the expression plasmid pUARA2-NnNS was added, and gently mixed. Standing on ice for 20 min. During this time, the sterilized Top agar was incubated in a water bath at 50 ℃.
(9) To the suspension of (8) was added 1ml of sol 3 and gently mixed with a tip. Standing at room temperature for 20 min. Add 10ml of sol 2 and mix gently.
(10) Centrifugation was carried out at 4 ℃ and 800g for 10min to remove the supernatant, 1ml of sol 2 was added, the suspension was gently suspended by a pipette gun, and 200. mu.l of the suspension was added to the center of the pUARA2 plasmid selection solid medium (X3 plate). 5ml of top agar incubated at 50 ℃ was rapidly added around the petri dish and mixed rapidly. After the plate surface was sufficiently dried, it was wound with parafilm, and cultured at 30 ℃ for 3 to 7 days with the lid facing downward.
(11) Each plate was picked 2-3 clones, 8 in total. And carrying out PCR verification on the grown transformant, wherein a positive transformant is the NnNS heterologous expression strain AO-NnNS.
3. Detection of expression product of heterologous expression strain AO-NnNS
(1) Inoculating heterologous expression strain AO-NnNS into pUARA2 plasmid to screen liquid culture medium, and culturing at 30 deg.C for 3 d.
(2) Centrifuging at 8000rpm for 10min to collect fermented thallus, adding 100ml of 80% acetone with the same volume, ultrasonicating for 20min, centrifuging at 8000rpm for 10min, and collecting supernatant.
(3) The mixture was extracted 1 time with 2 volumes of ethyl acetate, dried by rotary evaporation and dissolved in 15mL of methanol (chromatographic grade).
(4) Taking 1mL of methanol solution, filtering the solution by a 0.22 mu m filter membrane, and placing the filtered solution in a chromatographic bottle to obtain GC-MS and LC-MS samples.
(5) The samples were subjected to GC-MS detection: an Agilent-HP-5MS chromatographic column is adopted, the initial temperature is 60 ℃, the temperature is increased to 310 ℃ at the speed of 15 ℃/min, then the temperature is increased to 310 ℃ at the speed of 5 ℃/min, and the temperature is kept for 13 min. The GC-MS process parameters were as follows: a Sample module: the needle washing times before and after sample injection are 5 times, the needle washing times of the sample are 2 times, the viscosity compensation time is 0.2s, and the sample injection mode is normal. A GC module: the column temperature is 50 ℃, the sample injection temperature is 270 ℃, the sample injection mode is non-split flow (split), the carrier gas is helium, the flow rate control mode is linear, and the total flow rate is 10 mL/min. An MS module: the MS ion source temperature is 230 ℃, the interface temperature is 270 ℃, the solvent removal time is 2.5min, the acquisition time is 3min-60min, the acquisition mode is full scanning, the event time is set to be 0.3s, the scanning speed is 2000, and the scanning nuclear-to-mass ratio is 40-600 Da.
(7) Subjecting the sample to LC-MS detection: adopting a Cholester chromatographic column, and adopting mobile phase A-0.1% formic acid water and mobile phase B-acetonitrile with the flow rate of 1mL/min, wherein the proportion of the mobile phase acetonitrile in 30min is increased from 5% to 100%, then maintaining for 6min, then reducing the proportion of the mobile phase acetonitrile in 10s to 5%, and then maintaining for 4min and 50 s.
4. Separation, purification and identification of heterologously expressed recombinant strain AO-NnNS sesterterpene skeleton product
AO-NnNS co-ferments 10L, yielding about 2g of crude extract. The obtained crude fermentation extract is firstly separated and purified by adopting a forward silica gel column chromatography method. Loading the column by adopting a dry method, isocratic eluting by using petroleum ether, collecting one tube of effluent liquid every 10mL, collecting 18 flow parts, quickly detecting each flow part by TLC, combining the flow parts with the same spots, concentrating under reduced pressure, performing rotary evaporation to dryness, transferring the mixture into a weighed sample bottle, weighing the sample and recording the weight. HPLC is used for analyzing the components of each flow part and accurately positioning the target flow part. Optimizing the preparation conditions, selecting a Cholester semi-preparative chromatographic column to prepare a target compound, and carrying out mobile phase: phase a-0.1% formic acid water; and (3) phase B-acetonitrile, wherein the flow rate is 4mL/min, the acetonitrile formic acid with 95 percent is equal, the initial sample introduction amount is 10 mu L, the sample introduction amount is gradually increased to 80 mu L on the basis of ensuring that the peak type is not changed, the target sesterterpene compound peak appears in about 20min, and when the peak appears, the outflow solution is connected into a conical flask. The purity of the prepared compound was checked by TLC and HPLC.
NMR measurements of the isolated sesterterpene skeleton compounds were carried out using Bruker 600MHz (R) ((R))1H 600MHz;13C150 MHz). The solvent of the sesterterpene skeleton compound is Benzene-d6The resolution of NMR spectrometer is 600MHz, the first step is1H NMR and13c NMR measurement, comparing with data in database, and supplementing if it is new structureAnd (4) performing full HSQC, COSY and HMBC spectrogram decomposition spectrum to determine a specific structure.
5. Identifying the sesterterpene skeleton compound Nigtetraene.
Identifying the sesterterpene skeleton compound Nigtetraene obtained by the method:
(1) appearance: is in the form of transparent grease.
(2) Solubility: is easily dissolved in methanol and hardly dissolved in water.
(3) Nuclear magnetic resonance spectroscopy: FIG. 1 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In (1)1H-NMR spectrum.
FIG. 2 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In (1)13C-NMR spectrum. FIG. 3 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In13C-DEPT 135 Spectrum. FIG. 4 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6In (1)1H-1H COSY spectra. FIG. 5 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6HSQC spectrum in (1). FIG. 6 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6HMBC spectrum in (1). FIG. 7 shows the dissolution of the compound Nigtetraene of the present invention in Benzene-d6NOESY spectrum of (1). The nuclear magnetic resonance spectrum of the compound Nigtetraene of the invention was studied and the signals of the 1D and 2D spectra were assigned, see Table 2. And finally the structure was determined as follows:
TABLE 2 assignment of peaks in the 1D and 2D spectra of the compound fusaoxyspeneA
The references involved in the background of the invention are as follows:
1 Mitsuhashi,T.&Abe,I.Chimeric Terpene Synthases Possessing both Terpene Cyclization and Prenyltransfer Activities.Chembiochem 19,1106-1114(2018).
2 Wang,L.,Yang,B.,Lin,X.-P.,Zhou,X.-F.&Liu,Y.Sesterterpenoids.Natural product reports 30,455-473(2013).
3 Guan,Z.et al.Metabolic engineering of Bacillus subtilis for terpenoid production. Applied Microbiology biotechnology 99,9395-9406(2015).
4 Xue,D.et al.3-Anhydro-6-hydroxy-ophiobolin A,a fungal sesterterpene from Bipolaris oryzae induced autophagy and promoted the degradation ofα-synuclein in PC12 cells. Bioorganic Medicinal Chemistry Letters 25,1464-1470(2015).
5 López-Gresa,M.P.et al.Terretonins E and F,inhibitors of the mitochondrial respiratory chain from the marine-derived fungus Aspergillus insuetus.Journal of natural products 72,1348-1351(2009).
6 Amagata,T.et al.Unusual C25 Steroids Produced by a Sponge-Derived Penicillium c itrinum.Organic letters 5,4393-4396(2003).
7 Hensens,O.D.et al.Variecolin,a sesterterpenoid of novel skeleton from Aspergillus variecolor MF138.The Journal of Organic Chemistry 56,3399-3403(1991).
8 Yin,R.&Hong,K.Filamentous fungal sesterterpenoids and their synthases.Chinese Journal of Biotechnology 32,1631-1641(2016).
9 Toyomasu,T.et al.Fusicoccins are biosynthesized by an unusual chimera diterpene synthase in fungi.Proceedings of the national academy of sciences 104,3084-3088 (2007).
10 Malus floribunda, Li Wei & Yi England. research progress on heterologous production of natural products of fungi, microbiological report 56, 429-.
11 Nagamine,S.et al.Ascomycete Aspergillus oryzae is an efficient expression host for production of basidiomycete terpenes by using genomic DNA sequences.Applied environmental microbiology 85,e00409-00419(2019).
12 Narita,K.et al.Total biosynthesis of antiangiogenic agent(-)-terpestacin by artificial reconstitution of the biosynthetic machinery in Aspergillus oryzae.The Journal of organic chemistry 83,7042-7048(2018).
While the preferred embodiments of the present invention have been described in detail, it will be understood by those skilled in the art that the invention is not limited thereto, and that various changes and modifications may be made without departing from the spirit of the invention, and the scope of the appended claims is to be accorded the full scope of the invention.
Sequence listing
<110> university of east China's college of science
<120> 5-11 bicyclic sesterterpene skeleton compounds and preparation thereof
<130> claims, specification
<160> 5
<170> SIPOSequenceListing 1.0
<210> 1
<211> 2607
<212> DNA
<213> Artificial sequence (Artificial sequence)
<400> 1
atggctccat tgtcgatcat ccgcaactct gtccctctcc caccgcatac atacgaaaca 60
gacgagtact tttgtcgatt caccccacgt atccatcgtg acgtccgcct tgctgatgcc 120
ggctcgtggc aatgccaggt cgactttctg ggatctagta cggccgctcg ggctggtgca 180
acgcgcaaca aggatgtttc tagttacgcc gtgggctgca ttaacccagt cgtcggcaac 240
ttcactgcgc tgtgtgcttg cgaagccttg agtgaccggc ttgccctgac tacttatatg 300
gttgagtatg cttatattca tgatgatggt atgtacttgg agcattgctt cccaccctca 360
tttaacatgt gttatgctag tgatcgagta tgccgagaac aaagacgaat ctcaggtaaa 420
tccaattaga ttgtttctct agacatgtct catcgttatg ctaacatttt aaacagcttc 480
tggagacaaa ccagcagctc attgaaggtc tcagtctcga agaagacgtc agcgctggct 540
cgaaagacca tgtcagaaga cgacagttgc aagctaaaat ggtcatggag cttattgaaa 600
cagacaagaa gcaggcaaaa gagtgcctcc gcttatggag ggaaatgtca cacgtatttg 660
tccagatcag agacatgcag ttcactgaac tgaatgacta tctcaagttc cgtgtggtag 720
acgcgggatg cccgtaagtc tacaatcaac actataagta ataagataat ccagtcgcta 780
acactgtctt tcagttggac tatgagcctc ttatgctttt ccatggactt cacactgaat 840
tccagtgaag aggagagagc ttccgccgtc acccaggcag cgtacgacgc ctgggttcta 900
gtcaacgact acttttcttg ggataaggag tggaacaacc accaatctcg tggtggcacc 960
ggtgtgattg ccaattccat cttcctcttc atgaaatggt actccgttga cgcaaaggag 1020
ggcaaaacga tgctgcgaaa ggagattctg gctcgcgaag agaagtattg caaggccaaa 1080
gaagatcttg aggcaaatgg ttccatgtcc gacaagataa cacagtggct cgagttgctc 1140
gatcttgtaa cagcaggtaa ctttgcttgg agcatgacaa ctgcccgcta tcgccttggc 1200
gctgaagacg catatatagc tttacggaat gcgtatacag agacccctgg ttctggtaca 1260
actgacagtc ttgggagccc catttcgcaa aacgctcttg cgatggcaga taaaatcgat 1320
atcgtgctga aggatcggag gtacctggat cttagcatcc gcgagcgcaa gataatcaaa 1380
cccgttgata gacccattgg ccgacaaaca actcatcctc ccgaggatgc gaacttcaag 1440
aagtcgcagg tcagccaagt ttggtctctc caccaatacg aagaggtctg taagcaaata 1500
agtgtgacag agagaatgtt tacctaataa tatcccagat gattctacaa cctccaaagt 1560
acttggagat gatgccatca aaggaagtga ggaacgctgt tatagacggc ctagagactt 1620
ggtaccatgt ttcagagaag tcactcgcag ccattcgaga aattgtaaac ttattgcata 1680
gctcttccct catgtacgta tctatctcta aaccaaaaaa aaaaaaaaca ctaactcctt 1740
tgtaggctag acgatgttga agataactcg cggctcagac gaggatttcc agcaacccac 1800
attatatttg gagtcagcca gactataaat tcagccaacc tactcatcat gaaggctctt 1860
aaagcagcgg aaaccctctc gcctctcgcc gtgcgcatcc tcatcgaaag actcattgac 1920
gggcatattg gtcaagggct ggatctctac tggacacacc acactcagac acccaccgaa 1980
gaagaatatt tcactatggt cgatggaagt tagtattgga cctctgctca gtcaccgcac 2040
agtcaacact aacgctctcg gtctagaaac cggtagtctt ttcattctta tcgccgaact 2100
gatgcgttct gaagctacga agcataaaac actcgacgct ggccttctca tgaagcttgt 2160
gggccgcttc ttccaagcac gagatgatta tctaaatctc cagagtgaag aggtttgtaa 2220
ttaaaatatt ctgcattcat tggttcagtg gtgctaattt tgcacgtgtc aacacagtat 2280
acccagaaga aaggcatcgc ggaagatatc aacgaaggga aattctcgct gccgctcatt 2340
cacgctttaa ggagcaagtc accgcaccgc gatcgccttt tgagcattct gcagcaacga 2400
aagaggtacg acgatctttc ccctgaaata cgcaagctcg ctctcgacga tatcaaagcc 2460
actggagggc tggagtatgc ggagaagacg gctatagagc tacaggaggc ggttagcgag 2520
acgcttacta cgtatgagga gagggtagga gagaagaatt ggctcctgag attggcgcag 2580
aagcggctgg aaattgaagc tcgatga 2607
<210> 2
<211> 2205
<212> DNA
<213> Artificial sequence (Artificial sequence)
<400> 2
atggctccat tgtcgatcat ccgcaactct gtccctctcc caccgcatac atacgaaaca 60
gacgagtact tttgtcgatt caccccacgt atccatcgtg acgtccgcct tgctgatgcc 120
ggctcgtggc aatgccaggt cgactttctg ggatctagta cggccgctcg ggctggtgca 180
acgcgcaaca aggatgtttc tagttacgcc gtgggctgca ttaacccagt cgtcggcaac 240
ttcactgcgc tgtgtgcttg cgaagccttg agtgaccggc ttgccctgac tacttatatg 300
gttgagtatg cttatattca tgatgatgtg atcgagtatg ccgagaacaa agacgaatct 360
cagcttctgg agacaaacca gcagctcatt gaaggtctca gtctcgaaga agacgtcagc 420
gctggctcga aagaccatgt cagaagacga cagttgcaag ctaaaatggt catggagctt 480
attgaaacag acaagaagca ggcaaaagag tgcctccgct tatggaggga aatgtcacac 540
gtatttgtcc agatcagaga catgcagttc actgaactga atgactatct caagttccgt 600
gtggtagacg cgggatgccc ttggactatg agcctcttat gcttttccat ggacttcaca 660
ctgaattcca gtgaagagga gagagcttcc gccgtcaccc aggcagcgta cgacgcctgg 720
gttctagtca acgactactt ttcttgggat aaggagtgga acaaccacca atctcgtggt 780
ggcaccggtg tgattgccaa ttccatcttc ctcttcatga aatggtactc cgttgacgca 840
aaggagggca aaacgatgct gcgaaaggag attctggctc gcgaagagaa gtattgcaag 900
gccaaagaag atcttgaggc aaatggttcc atgtccgaca agataacaca gtggctcgag 960
ttgctcgatc ttgtaacagc aggtaacttt gcttggagca tgacaactgc ccgctatcgc 1020
cttggcgctg aagacgcata tatagcttta cggaatgcgt atacagagac ccctggttct 1080
ggtacaactg acagtcttgg gagccccatt tcgcaaaacg ctcttgcgat ggcagataaa 1140
atcgatatcg tgctgaagga tcggaggtac ctggatctta gcatccgcga gcgcaagata 1200
atcaaacccg ttgatagacc cattggccga caaacaactc atcctcccga ggatgcgaac 1260
ttcaagaagt cgcaggtcag ccaagtttgg tctctccacc aatacgaaga gatgattcta 1320
caacctccaa agtacttgga gatgatgcca tcaaaggaag tgaggaacgc tgttatagac 1380
ggcctagaga cttggtacca tgtttcagag aagtcactcg cagccattcg agaaattgta 1440
aacttattgc atagctcttc cctcatgcta gacgatgttg aagataactc gcggctcaga 1500
cgaggatttc cagcaaccca cattatattt ggagtcagcc agactataaa ttcagccaac 1560
ctactcatca tgaaggctct taaagcagcg gaaaccctct cgcctctcgc cgtgcgcatc 1620
ctcatcgaaa gactcattga cgggcatatt ggtcaagggc tggatctcta ctggacacac 1680
cacactcaga cacccaccga agaagaatat ttcactatgg tcgatggaaa aaccggtagt 1740
cttttcattc ttatcgccga actgatgcgt tctgaagcta cgaagcataa aacactcgac 1800
gctggccttc tcatgaagct tgtgggccgc ttcttccaag cacgagatga ttatctaaat 1860
ctccagagtg aagagtatac ccagaagaaa ggcatcgcgg aagatatcaa cgaagggaaa 1920
ttctcgctgc cgctcattca cgctttaagg agcaagtcac cgcaccgcga tcgccttttg 1980
agcattctgc agcaacgaaa gaggtacgac gatctttccc ctgaaatacg caagctcgct 2040
ctcgacgata tcaaagccac tggagggctg gagtatgcgg agaagacggc tatagagcta 2100
caggaggcgg ttagcgagac gcttactacg tatgaggaga gggtaggaga gaagaattgg 2160
ctcctgagat tggcgcagaa gcggctggaa attgaagctc gatga 2205
<210> 3
<211> 734
<212> PRT
<213> Artificial sequence (Artificial sequence)
<400> 3
Met Ala Pro Leu Ser Ile Ile Arg Asn Ser Val Pro Leu Pro Pro His
1 5 10 15
Thr Tyr Glu Thr Asp Glu Tyr Phe Cys Arg Phe Thr Pro Arg Ile His
20 25 30
Arg Asp Val Arg Leu Ala Asp Ala Gly Ser Trp Gln Cys Gln Val Asp
35 40 45
Phe Leu Gly Ser Ser Thr Ala Ala Arg Ala Gly Ala Thr Arg Asn Lys
50 55 60
Asp Val Ser Ser Tyr Ala Val Gly Cys Ile Asn Pro Val Val Gly Asn
65 70 75 80
Phe Thr Ala Leu Cys Ala Cys Glu Ala Leu Ser Asp Arg Leu Ala Leu
85 90 95
Thr Thr Tyr Met Val Glu Tyr Ala Tyr Ile His Asp Asp Val Ile Glu
100 105 110
Tyr Ala Glu Asn Lys Asp Glu Ser Gln Leu Leu Glu Thr Asn Gln Gln
115 120 125
Leu Ile Glu Gly Leu Ser Leu Glu Glu Asp Val Ser Ala Gly Ser Lys
130 135 140
Asp His Val Arg Arg Arg Gln Leu Gln Ala Lys Met Val Met Glu Leu
145 150 155 160
Ile Glu Thr Asp Lys Lys Gln Ala Lys Glu Cys Leu Arg Leu Trp Arg
165 170 175
Glu Met Ser His Val Phe Val Gln Ile Arg Asp Met Gln Phe Thr Glu
180 185 190
Leu Asn Asp Tyr Leu Lys Phe Arg Val Val Asp Ala Gly Cys Pro Trp
195 200 205
Thr Met Ser Leu Leu Cys Phe Ser Met Asp Phe Thr Leu Asn Ser Ser
210 215 220
Glu Glu Glu Arg Ala Ser Ala Val Thr Gln Ala Ala Tyr Asp Ala Trp
225 230 235 240
Val Leu Val Asn Asp Tyr Phe Ser Trp Asp Lys Glu Trp Asn Asn His
245 250 255
Gln Ser Arg Gly Gly Thr Gly Val Ile Ala Asn Ser Ile Phe Leu Phe
260 265 270
Met Lys Trp Tyr Ser Val Asp Ala Lys Glu Gly Lys Thr Met Leu Arg
275 280 285
Lys Glu Ile Leu Ala Arg Glu Glu Lys Tyr Cys Lys Ala Lys Glu Asp
290 295 300
Leu Glu Ala Asn Gly Ser Met Ser Asp Lys Ile Thr Gln Trp Leu Glu
305 310 315 320
Leu Leu Asp Leu Val Thr Ala Gly Asn Phe Ala Trp Ser Met Thr Thr
325 330 335
Ala Arg Tyr Arg Leu Gly Ala Glu Asp Ala Tyr Ile Ala Leu Arg Asn
340 345 350
Ala Tyr Thr Glu Thr Pro Gly Ser Gly Thr Thr Asp Ser Leu Gly Ser
355 360 365
Pro Ile Ser Gln Asn Ala Leu Ala Met Ala Asp Lys Ile Asp Ile Val
370 375 380
Leu Lys Asp Arg Arg Tyr Leu Asp Leu Ser Ile Arg Glu Arg Lys Ile
385 390 395 400
Ile Lys Pro Val Asp Arg Pro Ile Gly Arg Gln Thr Thr His Pro Pro
405 410 415
Glu Asp Ala Asn Phe Lys Lys Ser Gln Val Ser Gln Val Trp Ser Leu
420 425 430
His Gln Tyr Glu Glu Met Ile Leu Gln Pro Pro Lys Tyr Leu Glu Met
435 440 445
Met Pro Ser Lys Glu Val Arg Asn Ala Val Ile Asp Gly Leu Glu Thr
450 455 460
Trp Tyr His Val Ser Glu Lys Ser Leu Ala Ala Ile Arg Glu Ile Val
465 470 475 480
Asn Leu Leu His Ser Ser Ser Leu Met Leu Asp Asp Val Glu Asp Asn
485 490 495
Ser Arg Leu Arg Arg Gly Phe Pro Ala Thr His Ile Ile Phe Gly Val
500 505 510
Ser Gln Thr Ile Asn Ser Ala Asn Leu Leu Ile Met Lys Ala Leu Lys
515 520 525
Ala Ala Glu Thr Leu Ser Pro Leu Ala Val Arg Ile Leu Ile Glu Arg
530 535 540
Leu Ile Asp Gly His Ile Gly Gln Gly Leu Asp Leu Tyr Trp Thr His
545 550 555 560
His Thr Gln Thr Pro Thr Glu Glu Glu Tyr Phe Thr Met Val Asp Gly
565 570 575
Lys Thr Gly Ser Leu Phe Ile Leu Ile Ala Glu Leu Met Arg Ser Glu
580 585 590
Ala Thr Lys His Lys Thr Leu Asp Ala Gly Leu Leu Met Lys Leu Val
595 600 605
Gly Arg Phe Phe Gln Ala Arg Asp Asp Tyr Leu Asn Leu Gln Ser Glu
610 615 620
Glu Tyr Thr Gln Lys Lys Gly Ile Ala Glu Asp Ile Asn Glu Gly Lys
625 630 635 640
Phe Ser Leu Pro Leu Ile His Ala Leu Arg Ser Lys Ser Pro His Arg
645 650 655
Asp Arg Leu Leu Ser Ile Leu Gln Gln Arg Lys Arg Tyr Asp Asp Leu
660 665 670
Ser Pro Glu Ile Arg Lys Leu Ala Leu Asp Asp Ile Lys Ala Thr Gly
675 680 685
Gly Leu Glu Tyr Ala Glu Lys Thr Ala Ile Glu Leu Gln Glu Ala Val
690 695 700
Ser Glu Thr Leu Thr Thr Tyr Glu Glu Arg Val Gly Glu Lys Asn Trp
705 710 715 720
Leu Leu Arg Leu Ala Gln Lys Arg Leu Glu Ile Glu Ala Arg
725 730
<210> 4
<211> 35
<212> DNA
<213> Artificial sequence (Artificial sequence)
<400> 4
cggaattcga gctcgatggc tccattgtcg atcat 35
<210> 5
<211> 39
<212> DNA
<213> Artificial sequence (Artificial sequence)
<400> 5
actacagatc cccggtcatc gagcttcaat ttccagccg 39
Claims (9)
2. the 5-11 bicyclic sesterterpene skeleton compound synthase NnNS of claim 1, wherein the amino acid sequence of the synthase is shown in SEQ ID NO.3, and the N-terminal and the C-terminal of the synthase are respectively responsible for terpene cyclization and isopentenyl group transfer functions.
3. The synthetase according to claim 2, characterized in that the synthetase comprises two conserved domains: the terpene cyclase domain contains two recognition domains for Mg2+And the characteristic conserved motifs of the substrates DDVIE and NDYFSWDKE, the E-IPPS domain also contains two characteristic conserved motifs with similar functions DDVED and DDYLN.
4. A gene encoding the synthetase according to claim 2, characterized in that: the polynucleotide sequence of the strain is shown as SEQ ID NO.1 in order to be cloned from the genome of a CS12199 strain (Nectria. nigrescens 12199), wherein the CS12199 strain is preserved in the China general microbiological culture Collection center with the preservation number of CGMCC No. 21943.
5. The gene as claimed in claim 4, which contains 7 introns, and has a cDNA size of 2205bp and a sequence shown in SEQ ID NO. 2.
6. A recombinant expression vector of 5-11 bicyclic sesterterpene skeleton compound, wherein the recombinant expression vector is a eukaryotic or prokaryotic expression vector carrying the synthetase of claim 2 or 3 or carrying the gene of claim 3 or 4.
7. A recombinant expression host cell of a 5-11 bicyclic sesterterpene skeleton compound, characterized in that: comprising the recombinant expression vector of claim 6.
8. Use of the synthetase of claim 2 or 3 or the gene of claim 4 or 5 for the synthesis of 5-11 bicyclic sesterterpene skeleton compounds.
9. A method for heterologously expressing a 5-11 bicyclic sesterterpene skeleton compound, comprising the steps of:
A. construction of NnNS gene heterologous expression vector
Amplifying by using a PCR technology and using a N.nigrescens 12199 genome as a template to obtain a gene sequence containing NnNS, wherein primer sequences used for amplification are respectively shown as SEQ ID NO.4 and SEQ ID NO. 5;
connecting the amplified fragment with a pUARA2 vector through homologous recombination to construct a pUARA2-NnNS expression plasmid, transforming the ligation product into Escherichia coli DH10B, screening positive transformants, extracting plasmid PCR verification after culture to obtain a pUARA2-NnNS plasmid,
B. protoplast transformation
Culturing Aspergillus oryzae NSAR1, collecting protoplast, mixing with pUARA2-NnNS plasmid, culturing, performing PCR verification on the grown transformant, wherein the positive transformant is the NnNS heterologous expression strain AO-NnNS,
C. culture of heterologous expression strain AO-NnNS and product separation
Inoculating a heterologous expression strain AO-NnNS, screening by a pUARA2 plasmid screening liquid culture medium, performing fermentation culture, separating and purifying the obtained fermentation crude extract by adopting a forward silica gel column chromatography method, detecting each flow part rapidly by TLC, combining the same flow parts with spots, concentrating under reduced pressure, performing rotary evaporation to dryness, transferring into a weighed sample bottle, weighing the sample and recording the weight.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110709524.8A CN113582803B (en) | 2021-06-25 | 2021-06-25 | 5-11 Bicyclo-sesquiterpene skeleton compound and preparation thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110709524.8A CN113582803B (en) | 2021-06-25 | 2021-06-25 | 5-11 Bicyclo-sesquiterpene skeleton compound and preparation thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113582803A true CN113582803A (en) | 2021-11-02 |
CN113582803B CN113582803B (en) | 2024-09-17 |
Family
ID=78244403
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110709524.8A Active CN113582803B (en) | 2021-06-25 | 2021-06-25 | 5-11 Bicyclo-sesquiterpene skeleton compound and preparation thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113582803B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112142585A (en) * | 2020-09-08 | 2020-12-29 | 武汉大学 | Mangicols sesterterpene compounds, synthetic method, gene cluster, nucleic acid molecule, construct and application thereof |
CN112592259A (en) * | 2020-12-14 | 2021-04-02 | 武汉大学 | Compound, synthetic gene cluster and application thereof |
-
2021
- 2021-06-25 CN CN202110709524.8A patent/CN113582803B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112142585A (en) * | 2020-09-08 | 2020-12-29 | 武汉大学 | Mangicols sesterterpene compounds, synthetic method, gene cluster, nucleic acid molecule, construct and application thereof |
CN112592259A (en) * | 2020-12-14 | 2021-04-02 | 武汉大学 | Compound, synthetic gene cluster and application thereof |
Non-Patent Citations (1)
Title |
---|
GUANGKAI BIAN等: "A Clade II-D Fungal Chimeric Diterpene Synthase from Colletotrichum gloeosporioides Produces Dolasta-1(15), 8-diene", ANGEW. CHEM. INT. ED., vol. 57, 26 October 2018 (2018-10-26), pages 15887 - 15890 * |
Also Published As
Publication number | Publication date |
---|---|
CN113582803B (en) | 2024-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ye et al. | Coupling cell growth and biochemical pathway induction in Saccharomyces cerevisiae for production of (+)-valencene and its chemical conversion to (+)-nootkatone | |
Guo et al. | Tafuketide, a phylogeny-guided discovery of a new polyketide from Talaromyces funiculosus Salicorn 58 | |
CN115197172B (en) | Sesterterpene compound, synthetic gene cluster and synthetic method thereof | |
Surup et al. | New terpenoids from the fermentation broth of the edible mushroom Cyclocybe aegerita | |
CN113186183B (en) | Difunctional sesterterpene/diterpene synthase LcTPS2, coding gene, product and application thereof | |
CN112142585B (en) | Mangicols sesterterpene compounds, synthetic method, gene cluster, nucleic acid molecule, construct and application thereof | |
Okoye et al. | Two new cytochalasins from an endophytic fungus, KL-1.1 isolated from Psidium guajava leaves | |
CN113416748A (en) | Expression vector for synthesizing cannabidiol, heterologous expression method and application | |
CN113046332B (en) | Sesterterpene skeleton compound, synthetic gene and preparation method thereof | |
US11920178B2 (en) | Use of type III polyketide synthases from bacteria as phloroglucinol synthases | |
CN117105886A (en) | Tetracyclic sesterterpenoid and synthetic gene cluster thereof | |
US11512293B2 (en) | Use of type III polyketide synthases as phloroglucinol synthases | |
CN113582803B (en) | 5-11 Bicyclo-sesquiterpene skeleton compound and preparation thereof | |
CN110317765B (en) | Escherichia coli expression strain for high-yield geraniol glucoside and application thereof | |
CN107880134B (en) | Method for enzymatic synthesis of kaempferol | |
CN113444737B (en) | Cytochrome P450 enzyme and application thereof in synthesis of ganoderma lucidum triterpenoid | |
CN113402357B (en) | 5-12-5 Tricyclic diterpene skeleton compound and preparation thereof | |
CN113045410A (en) | Bicyclic norditerpenoid compound and synthetic gene and preparation method thereof | |
CN111411099B (en) | Hemsleya amabilis acetyl transferase, coding gene thereof and application of hemsleya amabilis acetyl transferase in preparation of cucurbitacin | |
CN107903227B (en) | Succinic anhydride compound, gene and protein related to succinic anhydride compound and preparation method of succinic anhydride compound | |
Venkata Dasu et al. | Studies on production of griseofulvin | |
CN112646834A (en) | Lupeol derivative and synthesis method and application thereof | |
CN111363753A (en) | Method for the heterologous production of linear triterpenes | |
CN111778222B (en) | NRPS-PKS hybrid protein capable of producing flavonoid compounds in fungi and encoding gene and application thereof | |
CN108504640A (en) | Steroidal compounds side chain modifier and its application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |