CN117143838A - 对棘白菌素类化合物氧磺酰化的酶及其应用 - Google Patents
对棘白菌素类化合物氧磺酰化的酶及其应用 Download PDFInfo
- Publication number
- CN117143838A CN117143838A CN202210570357.8A CN202210570357A CN117143838A CN 117143838 A CN117143838 A CN 117143838A CN 202210570357 A CN202210570357 A CN 202210570357A CN 117143838 A CN117143838 A CN 117143838A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- compound
- mobile phase
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010049047 Echinocandins Proteins 0.000 title claims abstract description 39
- 102000004190 Enzymes Human genes 0.000 title claims abstract description 20
- 108090000790 Enzymes Proteins 0.000 title claims abstract description 20
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 85
- 150000001875 compounds Chemical class 0.000 claims abstract description 72
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 claims abstract description 41
- 101710198130 NADPH-cytochrome P450 reductase Proteins 0.000 claims abstract description 30
- 239000013598 vector Substances 0.000 claims abstract description 24
- 230000033444 hydroxylation Effects 0.000 claims abstract description 15
- 238000005805 hydroxylation reaction Methods 0.000 claims abstract description 15
- 230000006103 sulfonylation Effects 0.000 claims abstract description 13
- 238000005694 sulfonylation reaction Methods 0.000 claims abstract description 13
- 239000012620 biological material Substances 0.000 claims abstract description 12
- -1 L-homotyrosine benzene ring Chemical group 0.000 claims description 41
- 241001158911 Coleophoma sp. Species 0.000 claims description 33
- 238000000034 method Methods 0.000 claims description 24
- 241000233866 Fungi Species 0.000 claims description 8
- 241001503951 Phoma Species 0.000 claims description 7
- 230000000640 hydroxylating effect Effects 0.000 claims description 7
- 241000887193 Coleophoma crateriformis Species 0.000 claims description 6
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 abstract description 28
- 108090000992 Transferases Proteins 0.000 abstract description 15
- 102000004357 Transferases Human genes 0.000 abstract description 15
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 abstract description 7
- 239000001301 oxygen Substances 0.000 abstract description 7
- 229910052760 oxygen Inorganic materials 0.000 abstract description 7
- 102000004896 Sulfotransferases Human genes 0.000 abstract description 5
- 108090001033 Sulfotransferases Proteins 0.000 abstract description 5
- 239000013612 plasmid Substances 0.000 description 44
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 36
- HLIAVLHNDJUHFG-HOTGVXAUSA-N neotame Chemical compound CC(C)(C)CCN[C@@H](CC(O)=O)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 HLIAVLHNDJUHFG-HOTGVXAUSA-N 0.000 description 36
- 239000000243 solution Substances 0.000 description 34
- 241001460671 Glarea lozoyensis Species 0.000 description 30
- 210000004027 cell Anatomy 0.000 description 28
- 102000004169 proteins and genes Human genes 0.000 description 28
- 238000010828 elution Methods 0.000 description 27
- 238000004128 high performance liquid chromatography Methods 0.000 description 27
- 239000004384 Neotame Substances 0.000 description 26
- 108010070257 neotame Proteins 0.000 description 26
- 235000019412 neotame Nutrition 0.000 description 26
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 25
- 238000000855 fermentation Methods 0.000 description 24
- 230000004151 fermentation Effects 0.000 description 24
- 150000001413 amino acids Chemical group 0.000 description 23
- 238000004458 analytical method Methods 0.000 description 22
- 239000012634 fragment Substances 0.000 description 22
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 18
- 239000007788 liquid Substances 0.000 description 18
- 230000015572 biosynthetic process Effects 0.000 description 16
- KAPLTEIQCKDUAT-WAULFNKYSA-N chembl1161027 Chemical group C1([C@H](O)[C@@H](O)[C@H]2C(=O)N[C@H](C(=O)N3C[C@H](C)[C@H](O)[C@H]3C(=O)N[C@@H](O)[C@H](O)C[C@@H](C(N[C@H](C(=O)N3C[C@H](O)C[C@H]3C(=O)N2)[C@@H](C)O)=O)NC(=O)CCCCCCCCCCCCCCC)[C@H](O)CC(N)=O)=CC=C(O)C(OS(O)(=O)=O)=C1 KAPLTEIQCKDUAT-WAULFNKYSA-N 0.000 description 16
- 238000005406 washing Methods 0.000 description 16
- 241000776509 Coleophoma Species 0.000 description 14
- 238000012408 PCR amplification Methods 0.000 description 13
- 239000001963 growth medium Substances 0.000 description 13
- 239000002609 medium Substances 0.000 description 13
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 12
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 12
- 238000005119 centrifugation Methods 0.000 description 12
- 238000010276 construction Methods 0.000 description 12
- 210000001938 protoplast Anatomy 0.000 description 12
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 11
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 11
- 239000012148 binding buffer Substances 0.000 description 10
- 238000010367 cloning Methods 0.000 description 10
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 10
- UNILWMWFPHPYOR-KXEYIPSPSA-M 1-[6-[2-[3-[3-[3-[2-[2-[3-[[2-[2-[[(2r)-1-[[2-[[(2r)-1-[3-[2-[2-[3-[[2-(2-amino-2-oxoethoxy)acetyl]amino]propoxy]ethoxy]ethoxy]propylamino]-3-hydroxy-1-oxopropan-2-yl]amino]-2-oxoethyl]amino]-3-[(2r)-2,3-di(hexadecanoyloxy)propyl]sulfanyl-1-oxopropan-2-yl Chemical compound O=C1C(SCCC(=O)NCCCOCCOCCOCCCNC(=O)COCC(=O)N[C@@H](CSC[C@@H](COC(=O)CCCCCCCCCCCCCCC)OC(=O)CCCCCCCCCCCCCCC)C(=O)NCC(=O)N[C@H](CO)C(=O)NCCCOCCOCCOCCCNC(=O)COCC(N)=O)CC(=O)N1CCNC(=O)CCCCCN\1C2=CC=C(S([O-])(=O)=O)C=C2CC/1=C/C=C/C=C/C1=[N+](CC)C2=CC=C(S([O-])(=O)=O)C=C2C1 UNILWMWFPHPYOR-KXEYIPSPSA-M 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- 229940079593 drug Drugs 0.000 description 9
- 239000003814 drug Substances 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 238000011218 seed culture Methods 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- WCDLCPLAAKUJNY-UHFFFAOYSA-N 4-[4-[3-(1h-pyrazol-4-yl)pyrazolo[1,5-a]pyrimidin-6-yl]phenyl]morpholine Chemical compound C1COCCN1C1=CC=C(C2=CN3N=CC(=C3N=C2)C2=CNN=C2)C=C1 WCDLCPLAAKUJNY-UHFFFAOYSA-N 0.000 description 7
- 229940121375 antifungal agent Drugs 0.000 description 7
- 229930027917 kanamycin Natural products 0.000 description 7
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 7
- 229960000318 kanamycin Drugs 0.000 description 7
- 229930182823 kanamycin A Natural products 0.000 description 7
- 238000012795 verification Methods 0.000 description 7
- 241001587826 Coleophoma empetri Species 0.000 description 6
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 238000005481 NMR spectroscopy Methods 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- PMZXXNPJQYDFJX-UHFFFAOYSA-N acetonitrile;2,2,2-trifluoroacetic acid Chemical compound CC#N.OC(=O)C(F)(F)F PMZXXNPJQYDFJX-UHFFFAOYSA-N 0.000 description 6
- 230000000843 anti-fungal effect Effects 0.000 description 6
- 229940088598 enzyme Drugs 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- 101150010615 mcfS gene Proteins 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 239000012460 protein solution Substances 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 238000000926 separation method Methods 0.000 description 6
- 238000000108 ultra-filtration Methods 0.000 description 6
- 238000000825 ultraviolet detection Methods 0.000 description 6
- QFLWZFQWSBQYPS-AWRAUJHKSA-N (3S)-3-[[(2S)-2-[[(2S)-2-[5-[(3aS,6aR)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]pentanoylamino]-3-methylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-[1-bis(4-chlorophenoxy)phosphorylbutylamino]-4-oxobutanoic acid Chemical compound CCCC(NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)CCCCC1SC[C@@H]2NC(=O)N[C@H]12)C(C)C)P(=O)(Oc1ccc(Cl)cc1)Oc1ccc(Cl)cc1 QFLWZFQWSBQYPS-AWRAUJHKSA-N 0.000 description 5
- AQWSFUIGRSMCST-UHFFFAOYSA-N 3-pyridin-3-ylsulfonyl-5-(trifluoromethyl)chromen-2-one Chemical compound N1=CC(=CC=C1)S(=O)(=O)C=1C(OC2=CC=CC(=C2C=1)C(F)(F)F)=O AQWSFUIGRSMCST-UHFFFAOYSA-N 0.000 description 5
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 5
- 229920000936 Agarose Polymers 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- 108010021062 Micafungin Proteins 0.000 description 5
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 5
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 5
- XBJFCYDKBDVADW-UHFFFAOYSA-N acetonitrile;formic acid Chemical compound CC#N.OC=O XBJFCYDKBDVADW-UHFFFAOYSA-N 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000012258 culturing Methods 0.000 description 5
- 235000019253 formic acid Nutrition 0.000 description 5
- 230000006801 homologous recombination Effects 0.000 description 5
- 238000002744 homologous recombination Methods 0.000 description 5
- 101150046977 mcfP gene Proteins 0.000 description 5
- 229960002159 micafungin Drugs 0.000 description 5
- PIEUQSKUWLMALL-YABMTYFHSA-N micafungin Chemical compound C1=CC(OCCCCC)=CC=C1C1=CC(C=2C=CC(=CC=2)C(=O)N[C@@H]2C(N[C@H](C(=O)N3C[C@H](O)C[C@H]3C(=O)N[C@H](C(=O)N[C@H](C(=O)N3C[C@H](C)[C@H](O)[C@H]3C(=O)N[C@H](O)[C@H](O)C2)[C@H](O)CC(N)=O)[C@H](O)[C@@H](O)C=2C=C(OS(O)(=O)=O)C(O)=CC=2)[C@@H](C)O)=O)=NO1 PIEUQSKUWLMALL-YABMTYFHSA-N 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 4
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 239000001888 Peptone Substances 0.000 description 4
- 108010080698 Peptones Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 238000001914 filtration Methods 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 4
- 229940097277 hygromycin b Drugs 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 235000019319 peptone Nutrition 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 239000011347 resin Substances 0.000 description 4
- 229920005989 resin Polymers 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 239000000600 sorbitol Substances 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 238000002137 ultrasound extraction Methods 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 3
- 241000589158 Agrobacterium Species 0.000 description 3
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 3
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 3
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 3
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 3
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 3
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 3
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 108020004414 DNA Proteins 0.000 description 3
- 238000001712 DNA sequencing Methods 0.000 description 3
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 3
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 3
- CMQOGWZUKPHLHL-DCAQKATOSA-N His-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N CMQOGWZUKPHLHL-DCAQKATOSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 3
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 3
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 3
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 3
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 3
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 3
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 3
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 3
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 229940125773 compound 10 Drugs 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- ZLVXBBHTMQJRSX-VMGNSXQWSA-N jdtic Chemical compound C1([C@]2(C)CCN(C[C@@H]2C)C[C@H](C(C)C)NC(=O)[C@@H]2NCC3=CC(O)=CC=C3C2)=CC=CC(O)=C1 ZLVXBBHTMQJRSX-VMGNSXQWSA-N 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000002018 overexpression Effects 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- GHYOCDFICYLMRF-UTIIJYGPSA-N (2S,3R)-N-[(2S)-3-(cyclopenten-1-yl)-1-[(2R)-2-methyloxiran-2-yl]-1-oxopropan-2-yl]-3-hydroxy-3-(4-methoxyphenyl)-2-[[(2S)-2-[(2-morpholin-4-ylacetyl)amino]propanoyl]amino]propanamide Chemical group C1(=CCCC1)C[C@@H](C(=O)[C@@]1(OC1)C)NC([C@H]([C@@H](C1=CC=C(C=C1)OC)O)NC([C@H](C)NC(CN1CCOCC1)=O)=O)=O GHYOCDFICYLMRF-UTIIJYGPSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- GACDQMDRPRGCTN-KQYNXXCUSA-N 3'-phospho-5'-adenylyl sulfate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OS(O)(=O)=O)[C@@H](OP(O)(O)=O)[C@H]1O GACDQMDRPRGCTN-KQYNXXCUSA-N 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- 108010064760 Anidulafungin Proteins 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 2
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 2
- 238000009010 Bradford assay Methods 0.000 description 2
- 108010020326 Caspofungin Proteins 0.000 description 2
- 108010059892 Cellulase Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000752430 Coleophoma cylindrospora Species 0.000 description 2
- MKMKILWCRQLDFJ-DCAQKATOSA-N Cys-Lys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKMKILWCRQLDFJ-DCAQKATOSA-N 0.000 description 2
- 206010017533 Fungal infection Diseases 0.000 description 2
- 206010064571 Gene mutation Diseases 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 2
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 2
- 101150059802 KU80 gene Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- 208000031888 Mycoses Diseases 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 101100074054 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) mus-52 gene Proteins 0.000 description 2
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- 101100074057 Schizosaccharomyces pombe (strain 972 / ATCC 24843) pku80 gene Proteins 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- 238000010459 TALEN Methods 0.000 description 2
- ZJPSMXCFEKMZFE-IHPCNDPISA-N Trp-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O ZJPSMXCFEKMZFE-IHPCNDPISA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 229960003348 anidulafungin Drugs 0.000 description 2
- JHVAMHSQVVQIOT-MFAJLEFUSA-N anidulafungin Chemical compound C1=CC(OCCCCC)=CC=C1C1=CC=C(C=2C=CC(=CC=2)C(=O)N[C@@H]2C(N[C@H](C(=O)N3C[C@H](O)C[C@H]3C(=O)N[C@H](C(=O)N[C@H](C(=O)N3C[C@H](C)[C@H](O)[C@H]3C(=O)N[C@H](O)[C@H](O)C2)[C@@H](C)O)[C@H](O)[C@@H](O)C=2C=CC(O)=CC=2)[C@@H](C)O)=O)C=C1 JHVAMHSQVVQIOT-MFAJLEFUSA-N 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- JYIKNQVWKBUSNH-WVDDFWQHSA-N caspofungin Chemical compound C1([C@H](O)[C@@H](O)[C@H]2C(=O)N[C@H](C(=O)N3CC[C@H](O)[C@H]3C(=O)N[C@H](NCCN)[C@H](O)C[C@@H](C(N[C@H](C(=O)N3C[C@H](O)C[C@H]3C(=O)N2)[C@@H](C)O)=O)NC(=O)CCCCCCCC[C@@H](C)C[C@@H](C)CC)[C@H](O)CCN)=CC=C(O)C=C1 JYIKNQVWKBUSNH-WVDDFWQHSA-N 0.000 description 2
- 229960003034 caspofungin Drugs 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 229940106157 cellulase Drugs 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 229940125797 compound 12 Drugs 0.000 description 2
- 235000012343 cottonseed oil Nutrition 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- 238000011033 desalting Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 238000011067 equilibration Methods 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 239000004744 fabric Substances 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 238000012224 gene deletion Methods 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 238000009629 microbiological culture Methods 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- LMBFAGIMSUYTBN-MPZNNTNKSA-N teixobactin Chemical compound C([C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@H](CCC(N)=O)C(=O)N[C@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@H]1C(N[C@@H](C)C(=O)N[C@@H](C[C@@H]2NC(=N)NC2)C(=O)N[C@H](C(=O)O[C@H]1C)[C@@H](C)CC)=O)NC)C1=CC=CC=C1 LMBFAGIMSUYTBN-MPZNNTNKSA-N 0.000 description 2
- 238000002525 ultrasonication Methods 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 241000826286 Acidea extrema Species 0.000 description 1
- 241000138865 Acidothrix acidophila Species 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- 229920002498 Beta-glucan Polymers 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- LOOZZTFGSTZNRX-VIFPVBQESA-N L-Homotyrosine Chemical compound OC(=O)[C@@H](N)CCC1=CC=C(O)C=C1 LOOZZTFGSTZNRX-VIFPVBQESA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108010028921 Lipopeptides Proteins 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- 108010003266 Lys-Leu-Tyr-Asp Proteins 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- NAOVYENZCWFBDG-BZSNNMDCSA-N Phe-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 NAOVYENZCWFBDG-BZSNNMDCSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- 235000019764 Soybean Meal Nutrition 0.000 description 1
- 241000736131 Sphingomonas Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 1
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- FGJWNBBFAUHBEP-IHPCNDPISA-N Tyr-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FGJWNBBFAUHBEP-IHPCNDPISA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- KUXCBJFJURINGF-PXDAIIFMSA-N Tyr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N KUXCBJFJURINGF-PXDAIIFMSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 239000003429 antifungal agent Substances 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 238000009412 basement excavation Methods 0.000 description 1
- 238000007622 bioinformatic analysis Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 229940125904 compound 1 Drugs 0.000 description 1
- 229940125898 compound 5 Drugs 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 108010062092 echinocandin B Proteins 0.000 description 1
- FAUOJMHVEYMQQG-HVYQDZECSA-N echinocandin B Chemical compound C1([C@H](O)[C@@H](O)[C@H]2C(=O)N[C@H](C(=O)N3C[C@H](C)[C@H](O)[C@H]3C(=O)N[C@H](O)[C@H](O)C[C@@H](C(N[C@H](C(=O)N3C[C@H](O)C[C@H]3C(=O)N2)[C@@H](C)O)=O)NC(=O)CCCCCCC\C=C/C\C=C/CCCCC)[C@@H](C)O)=CC=C(O)C=C1 FAUOJMHVEYMQQG-HVYQDZECSA-N 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010052221 glucan synthase Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 238000012594 liquid chromatography nuclear magnetic resonance Methods 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- DQXPFAADCTZLNL-FXDJFZINSA-N pneumocandin B0 Chemical compound C1([C@H](O)[C@@H](O)[C@H]2C(=O)N[C@H](C(=O)N3CC[C@H](O)[C@H]3C(=O)N[C@H](O)[C@H](O)C[C@@H](C(N[C@H](C(=O)N3C[C@H](O)C[C@H]3C(=O)N2)[C@@H](C)O)=O)NC(=O)CCCCCCCC[C@@H](C)C[C@@H](C)CC)[C@H](O)CC(N)=O)=CC=C(O)C=C1 DQXPFAADCTZLNL-FXDJFZINSA-N 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000004455 soybean meal Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0077—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with a reduced iron-sulfur protein as one donor (1.14.15)
- C12N9/0081—Cholesterol monooxygenase (cytochrome P 450scc)(1.14.15.6)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/50—Cyclic peptides containing at least one abnormal peptide link
- C07K7/54—Cyclic peptides containing at least one abnormal peptide link with at least one abnormal peptide link in the ring
- C07K7/56—Cyclic peptides containing at least one abnormal peptide link with at least one abnormal peptide link in the ring the cyclisation not occurring through 2,4-diamino-butanoic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/13—Transferases (2.) transferring sulfur containing groups (2.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/15—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with reduced iron-sulfur protein as one donor, and incorporation of one atom of oxygen (1.14.15)
- C12Y114/15006—Cholesterol monooxygenase (side-chain-cleaving) (1.14.15.6), i.e. cytochrome P450scc
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y208/00—Transferases transferring sulfur-containing groups (2.8)
- C12Y208/02—Sulfotransferases (2.8.2)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了能催化棘白菌素类化合物氧磺酰化的酶及其应用,具体公开了一种具有氧磺酰化功能的酶或包含所述酶的生物材料在对棘白菌素类化合物进行羟基化、磺酰化、或氧磺酰化中的应用;所述具有氧磺酰化功能的酶选自细胞色素P450单加氧酶和/或磺酰基转移酶;所述细胞色素P450单加氧酶与SEQ ID No.2相比具有至少70%的序列同一性;所述磺酰基转移酶与SEQ ID No.4相比具有至少70%的序列同一性;所述生物材料选自:所述酶的编码基因,或者包含所述编码基因的载体,或者包含所述载体的宿主细胞。
Description
技术领域
本发明属于生物制药技术领域,涉及一种可以对棘白菌素类化合物具有氧磺酰化的P450酶和磺酰基转移酶及其应用;更进一步地,涉及氧磺酰化纽莫康定B0的形成。
背景技术
近年来随着老龄化人口的增多,器官移植治疗的临床应用推广,HIV病毒的蔓延等,免疫力低下患者数量不断增加,导致深部真菌感染率呈迅速上升的趋势。深部真菌感染逐渐成为免疫力低下患者发病和死亡的重要诱因,对人类社会健康造成了极大威胁。
目前临床上应用的传统抗真菌药物对人体存在毒副作用,并且真菌耐药性问题也日益突出,因此亟需开发新一代的高效、低毒且对耐药菌有效的抗真菌药物。棘白菌素类药物作为一类新型的环脂肽类抗真菌药物,作用机制独特,能够选择性地抑制真菌细胞壁中的β-1,3葡聚糖合成酶活性从而抑制真菌细胞壁的合成,导致真菌细胞裂解死亡,该类药物安全性高,抗菌谱宽且对耐药菌有效。
临床上应用的棘白菌素类抗真菌药物包括三种,分别是卡泊芬净、米卡芬净和阿尼芬净。其中,米卡芬净具有其独特性,与另外两种棘白菌素类抗真菌药物相比,米卡芬净具有磺酰基基团,磺酰基基团能赋予化合物极好的水溶性,进而增加其生物利用度。然而目前FR901379结构中的氧磺酰基基团形成机制仍然是未知的,这极大地限制了磺酰基转移酶在重要化合物生物活性修饰方面的应用。因此,解析米卡芬净前体FR901379中氧磺酰基基团合成机制,可以为生物信息学挖掘提供靶标,发现更多磺酰化的环脂肽类化合物,同时为棘白菌素类化合物氧磺酰化修饰提供了酶学元件和理论指导,有助于芬净类抗真菌药物的新药创制。
发明内容
一方面,本发明提供了一种细胞色素P450单加氧酶。
在一个实施方式中,所述细胞色素P450单加氧酶与SEQ ID No.2相比具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%或99.9%的序列同一性;优选的,所述细胞色素P450单加氧酶来源于鞘茎点霉属真菌(Coleophoma sp.);更优选的,所述细胞色素P450单加氧酶的氨基酸序列与SEQ ID No.2相比具有至少70%的序列同一性,并且所述细胞色素P450单加氧酶来源于鞘茎点霉属真菌;所述鞘茎点霉属真菌包括Coleophoma sp.或Coleophoma empetri,例如,鞘茎点霉(Coleophoma sp.)MEFC009。在其他的实施方式中,所述C.empetri为C.empetri F-11899。更优选的,所述细胞色素P450单加氧酶的氨基酸序列如SEQ ID No.2所示。
在其他的实施方式中,所述细胞色素P450单加氧酶的氨基酸序列如NCBI数据库中的序列号分别为RDW63434.1、RDW57263.1和XP_031866084.1所示的序列,分别来自于丝状真菌Coleophoma cylindrospora、Coleophoma crateriformis和Venustampullaechinocandica。
另一方面,本发明提供了一种磺酰基转移酶。
在一个实施方式中,所述磺酰基转移酶与SEQ ID No.4相比具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%或99.9%的序列同一性;优选的,所述磺酰基转移酶来源于鞘茎点霉属真菌(Coleophoma sp.);更优选的,所述磺酰基转移酶的氨基酸序列与SEQ ID No.4相比具有至少70%的序列同一性,并且所述磺酰基转移酶来源于鞘茎点霉属真菌;所述鞘茎点霉属真菌包括Coleophoma sp.或C.empetri,例如,鞘茎点霉(Coleophoma sp.)MEFC009。在其他的实施方式中,所述C.empetri为C.empetri F-11899。更优选的,所述磺酰基转移酶的氨基酸序列如SEQ ID No.4所示。
在其他的实施方式中,所述磺酰基转移酶的氨基酸序列如NCBI数据库中的序列号分别为RDW57264.1和XP_031866072.1所示的序列;其中,RDW57264.1的氨基酸序列如SEQID No.5所示,其来源于C.crateriformis;XP_031866072.1的氨基酸序列如SEQ ID No.6所示,其来源于V.echinocandica。
在其他的实施方式中,本发明所述磺酰基转移酶与SEQ ID No.5相比具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%或99.9%的序列同一性;优选的,所述磺酰基转移酶来源于C.crateriformis;更优选的,所述磺酰基转移酶的氨基酸序列与SEQ ID No.5相比具有至少70%的序列同一性,并且所述磺酰基转移酶来源于C.crateriformis。更优选的,所述磺酰基转移酶的氨基酸序列如SEQ ID No.5所示。
因此,在其他的实施方式中,本发明所述磺酰基转移酶与SEQ ID No.6相比具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%或99.9%的序列同一性;优选的,所述磺酰基转移酶来源于V.echinocandica;更优选的,所述磺酰基转移酶的氨基酸序列与SEQ ID No.6相比具有至少70%的序列同一性,并且所述磺酰基转移酶来源于V.echinocandica。更优选的,所述磺酰基转移酶的氨基酸序列如SEQ ID No.6所示。
本发明中,鞘茎点霉(Coleophoma sp.)MEFC009,保藏于中国微生物菌种保藏管理委员会普通微生物中心(CGMCC),保藏编号为CGMCC No.21058,保藏日期为2020年11月18日,地址:北京市朝阳区北辰西路1号院3号中国科学院微生物研究所,电话:010-64807355。
另一方面,本发明还提供了包含上述细胞色素P450单加氧酶或磺酰基转移酶或其编码基因的生物材料。所述生物材料选自:包含上述细胞色素P450单加氧酶或磺酰基转移酶的载体,或者,包含上述细胞色素P450单加氧酶或磺酰基转移酶的宿主细胞。
另一方面,本发明还提供了编码上述细胞色素P450单加氧酶或磺酰基转移酶的基因。
另一方面,本发明还提供了包含上述基因的载体,或者,包含所述载体的宿主细胞。
在一个实施方式中,所述载体包括克隆载体和表达载体,例如,pET系列载体(如pET-14、pET-21、pET-22、pET-28、pET-30、pET-42、pET-GST、pET-His、pET-Trx、pET-GST、pET-CKS、pET-DsbA)、pMAL系列载体(如pMAL-2c)、pGEX系列载体(如pGEX-4T-2、pGEX-6T-1)、pBAD系列载体(如pBAD-His、pBAD-Myc)、pMBP系列载体(pMBP-P、pMBP-C)、pTYB2、pQE-9、pACYCDuet-1、pCDFDuet-1、pColADuet-1、pRSFDuet-1、pllP-OmpA、pUC系列载体(如pUC18、pUC19)、pQE-30、pXH2-1、pXH-43、pTRII、pGSF957。
在一个实施方式中,所述宿主细胞选自大肠杆菌(例如,大肠杆菌DH5α、大肠杆菌BL21(DE3)、Rosetta(DE3)、Codon Plus(DE3)-RIPL、BL21 Codon plus(DE3)、Top10、JM109)、酵母菌(例如,酿酒酵母、毕赤酵母、解酯耶氏酵母)、鞘茎点霉真菌、纽莫康定产生菌(Glarea lozoyensis)。
另一方面,本发明还提供了上述细胞色素P450单加氧酶和/或磺酰基转移酶、其编码基因、包含基因的载体、上述宿主细胞、或上述生物材料在棘白菌素类化合物羟基化、磺酰化或氧磺酰化中的应用。
在一个实施方式中,本发明提供了上述细胞色素P450单加氧酶、其编码基因、包含基因的载体、上述宿主细胞、或上述生物材料在棘白菌素类化合物羟基化中的应用。
在一个实施方式中,本发明提供了上述磺酰基转移酶、其编码基因、包含基因的载体、上述宿主细胞、或上述生物材料在棘白菌素类化合物磺酰化中的应用。
在一个实施方式中,本发明提供了上述细胞色素P450单加氧酶和磺酰基转移酶、其编码基因、包含基因的载体、上述宿主细胞、或上述生物材料在棘白菌素类化合物氧磺酰化中的应用。
本领域公知,棘白菌素类化合物的结构中通常包含L-高酪氨酸苯环,在抗真菌药物卡泊芬净和阿尼芬净前体纽莫康定B0和棘白菌素B中都含有L-高酪氨酸苯环,同时在本发明中的FR901379、FR133302、化合物13a、化合物13、纽莫康定B0、羟化纽莫康定B0、氧磺酰化纽莫康定B0、化合物4。
例如,FR901379为米卡芬净前体,其结构式如式(I)所示:
羟化纽莫康定B0的结构式如式(II)所示:
氧磺酰化纽莫康定B0的结构式如式(III)所示:
纽莫康定B0的结构式如式(IV)所示:
FR133302的结构式如式(V)所示:
化合物13a的结构式如式(VI)所示:
化合物13的结构式如式(VII)所示:
化合物4的结构式如式(VIII)所示:
上述FR901379、FR133302、化合物13a、化合物13、纽莫康定B0、羟基化纽莫康定B0、氧磺酰化纽莫康定B0或化合物4包括L-高酪氨酸苯环,并且,在L-高酪氨酸苯环上的C3’位置存在不同的修饰。例如,FR901379、化合物13和氧磺酰化纽莫康定B0在L-高酪氨酸苯环上的C3’位置存在氧磺酰基修饰;FR133302、化合物13a和羟基化纽莫康定B0在L-高酪氨酸苯环上的C3’位置存在羟基修饰;化合物4和纽莫康定B0则在L-高酪氨酸苯环上的C3’位置上不存在任何修饰。
在一个实施方式中,所述对棘白菌素类化合物羟基化、磺酰化或氧磺酰化为对棘白菌素类化合物的L-高酪氨酸苯环上的C3’位进行羟基化、磺酰化或氧磺酰化。
在一个实施方式中,所述羟基化为在棘白菌素类化合物L-高酪氨酸苯环未羟基化的C3’位上添加羟基;优选的,所述未羟基化的C3’位上无任何修饰;
在一个实施方式中,所述磺酰化为在棘白菌素类化合物L-高酪氨酸苯环羟基化C3’位的羟基上添加磺酰基(SO3 -)。
在一个实施方式中,所述氧磺酰化为在棘白菌素类化合物的L-高酪氨酸苯环无修饰的C3’位上添加氧磺酰基(OSO3 -)。
上述棘白菌素类化合物包括但不限于FR901379、FR133302、化合物13a、化合物13、纽莫康定B0、羟基化纽莫康定B0、氧磺酰化纽莫康定B0、化合物4中的一种或任意几种,上述化合物的结构域如式I-式VIII所示。
在一个实施方式中,所述羟基化为将式VIII所示的化合物羟基化为式V所示的化合物,或者,将纽莫康定B0羟基化为式II所示的化合物。
在一个实施方式中,所述磺酰化为将式V所示的化合物磺酰化为式I所示的化合物,或者,将式II所示的化合物磺酰化为式III所示的化合物,或者,将式VI所示的化合物磺酰化为式VII所示的化合物。
在一个实施方式中,所述氧磺酰化为将式VIII所示的化合物氧磺酰化为式I所示的化合物,或者,将纽莫康定B0氧磺酰化为式III所示的化合物。
在一个实施方式中,本发明提供了上述细胞色素P450单加氧酶在FR901379形成过程中L-高酪氨酸苯环上的C3’位的羟基化中的用途。
在一个实施方式中,本发明提供了上述磺酰基转移酶在FR901379形成过程中将磺酰基转移到L-高酪氨酸苯环上的C3’位的羟基中的用途。
在一个实施方式中,本发明提供了上述细胞色素P450单加氧酶和上述磺酰基转移酶在FR901379形成过程中在L-高酪氨酸苯环上的C3’位形成氧磺酰基团中的用途。
具体的,本发明提供了上述细胞色素P450单加氧酶和上述磺酰基转移酶在如式(I)所示的FR901379形成过程中催化形成式(I)中的氧磺酰基团(OSO3-)中的用途。
在一个实施方式中,本发明提供了上述细胞色素P450单加氧酶在催化纽莫康定B0形成羟基化纽莫康定B0中的用途。
在一个实施方式中,本发明提供了上述细胞色素P450单加氧酶和磺酰基转移酶在催化纽莫康定B0形成氧磺酰化纽莫康定B0中的用途。
本发明中,所述细胞色素P450单加氧酶又称为P450酶。
另一方面,本发明还提供了一种对棘白菌素类化合物的L-高酪氨酸苯环上的C3’位进行羟基化、磺酰化或氧磺酰化的方法,所述方法包括利用上述细胞色素P450单加氧酶和/或磺酰基转移酶、其编码基因、包含基因的载体、上述宿主细胞、或上述生物材料对棘白菌素类化合物的L-高酪氨酸苯环上的C3’位进行羟基化、磺酰化或氧磺酰化的步骤。
另一方面,本发明还提供了一种纽莫康定产生菌(G.lozoyensis)基因工程菌株,所述工程菌株为在纽莫康定产生菌中引入上述细胞色素P450单加氧酶和/或磺酰基转移酶所得到的工程菌株。
优选的,所述引入为过表达。
所述的“引入”包括将上述目的基因在出发菌株中进行表达的步骤,优选,过表达。例如,将目的基因构建到表达载体上,将表达载体转入到宿主细胞中以表达目的基因,优选,过表达。在其他的实施方式中,所述的“引入”包括将目的基因插入到宿主细胞的基因组中;优选的,所述插入到宿主细胞的基因组中可以采用同源重组双交换的方法;在一个实施方式中,可以采用将目的基因以及同源臂插入到载体上,然后将载体转入到宿主细胞中,利用同源臂与宿主细胞基因组发生同源重组双交换从而将目的基因插入到合适的基因组的位置;在其他的实施方式中,还可以采用基因编辑的方式,例如,利用CRISPR/Cas系统在期望的基因组位点上进行切割,同时将目的基因作为外源供体插入到切割位点上。
另一方面,本发明还提供了上述基因工程菌株在生产羟基化纽莫康定B0和/或氧磺酰化纽莫康定B0中的应用。
所述羟基化纽莫康定B0的结构式如式(II)所示,所述氧磺酰化纽莫康定B0的结构式如式(III)所示。
另一方面,本发明还提供了一种制备羟基化纽莫康定B0和/或氧磺酰化纽莫康定B0的方法,所述方法包括利用上述基因工程菌株进行发酵的步骤。
本发明中的“过表达”是指基因工程菌与野生型的出发菌株相比,所述目的基因的表达量或活性要高于野生型的出发菌株。在一个实施方式中,上述过表达可以通过引入表达载体来过表达目的基因来实现;其他的实施方式中,上述过表达也可以通过在出发菌株中引入额外的目的基因的拷贝、通过增加目的基因的拷贝数来实现;其他的实施方式中,还可以通过对目的基因的启动子优化来实现,比如,通过将目的基因的原始启动子替换为启动子活性更高的启动子来实现目的基因的过表达。
本发明所述的突变包括通过基因缺失、基因插入或基因取代的方式导致基因功能或活性丧失。
在优选的实施方式中,基因突变为将目标基因进行基因敲除。
在一个实施方式中,所述基因突变可以采用本领域常规的技术操作来实现,例如,通过同源重组的方式进行基因敲入或基因敲除从而导致基因功能或活性丧失;或者,采用基因编辑的方式,如锌指核酸内切酶(ZFN)、类转录激活因子效应物核酸酶(TALEN)或CRIspR技术对上述基因进行突变从而导致基因功能或活性丧失。
另一方面,本发明还提供了上述基因工程菌的构建方法。
附图说明
图1为敲除mcfP基因所获转化子的基因组PCR验证结果;其中6#,8#,9#是基因mcfP缺失的转化子,WT-1为对照菌株Coleophoma sp.-Δku80。
图2为基因mcfP缺失菌株Coleophoma sp.-ΔmcfP发酵产物HPLC分析结果;其中Coleophoma sp.-ΔmcfP为基因mcfP缺失菌株,WT-1为Coleophoma sp.-Δku80。
图3为化合物4,5,6,7和8的LC-MS分析结果,其中A为化合物4,B为化合物5,C为化合物6,D为化合物7,E为化合物8。
图4为化合物4,5,6,7和8的结构。
图5为敲除mcfS基因所获转化子的基因组PCR验证结果;其中1#,3#,7#是基因mcfS缺失的转化子,WT-1为对照菌株Coleophoma sp.-Δku80。
图6为基因mcfS缺失菌株Coleophoma sp.-ΔmcfS发酵产物HPLC分析结果;其中Coleophoma sp.-ΔmcfS为基因mcfS缺失菌株,WT-1为Coleophoma sp.-Δku80。
图7为化合物9的LC-MS分析结果。
图8为重组质粒pCAMBIA1300-mcfP和pCAMBIA1300-mcfS图谱示意图;其中A为质粒pCAMBIA1300-mcfP,B为pCAMBIA1300-mcfS。
图9为异源表达P450酶McfP和磺酰基转移酶McfS的G.lozoyensis ATCC 74030转化子的基因组PCR验证结果;其中1-9为转化子基因组;WT-2为对照菌株G.lozoyensis ATCC74030基因组。
图10为工程菌株G.lozoyensis::mcfP::mcfS发酵产物HPLC分析结果;其中,G.lozoyensis::mcfP::mcfS为同时异源表达基因mcfP和mcfS的重组菌株,G.lozoyensisATCC74030为对照菌株。
图11为化合物11和12的LC-MS分析结果,其中A为化合物11,B为化合物12。
图12为化合物11和12的化学结构。
图13为异源表达P450酶McfP的G.lozoyensis ATCC 74030转化子的基因组PCR验证结果;其中1-18为转化子基因组;WT-2为对照菌株Glarea lozoyensis ATCC 74030基因组。
图14为工程菌株G.lozoyensis::mcfP发酵产物HPLC分析结果;其中,G.lozoyensis::mcfP为异源插入基因mcfP的重组菌株,G.lozoyensis ATCC 74030为对照菌株。
图15为质粒pET28a-SUMO-McfS图谱。
图16为蛋白SUMO-McfS的SDS-PAGE分析结果。
图17为磺酰基转移酶McfS催化FR133302的HPLC分析结果;i:FR901379标准品;ii:McfS催化的酶反应;iii:(ii)的对照,McfS煮沸失活;iv:(ii)的对照,不加供体PAPS。图18为磺酰基转移酶McfS催化FR133302功能示意图。
图19为蛋白RDW57264.1、XP_031866072.1和McfS催化化合物13a的HPLC分析结果。图20为化合物13a和13的LC-MS分析结果。A为化合物13a,B为化合物13。
图21为磺酰基转移酶McfS、RDW57264.1和XP_031866072.1催化化合物13a的功能示意图。
具体实施方式
下面结合具体实施例对本发明做进一步说明,但本发明不受实施例的限制。以下实施例中所用材料、试剂、仪器和方法,未经特殊说明,均为本领域中的常规材料、试剂、仪器和方法,均可通过商业渠道获得。
本发明中质粒提取采用OMEGA公司的Plasmid Mini Kit I试剂(D6942-01);PCR片段纯化采用OMEGA公司DNA片段回收Cycle-Pure Kit试剂盒(D6492-01);一步克隆酶Ultra One Step Cloning Kit购自南京Vazyme公司;限制性内切酶购自Thermo公司;T4连接酶购自New England Biolabs;RNA提取采用TAKARA公司的Mini BESTPlant RNA Extraction Kit试剂盒;cNDA反转录试剂盒购自TAKARA公司;大肠杆菌感受态细胞DH5α和BL21(DE3)购自南京Vazyme公司;农杆菌感受态细胞LBA4404购自上海唯地生物公司。
大肠杆菌培养基LB培养基:1%蛋白胨,0.5%酵母粉,1%NaCl,pH 7.0。
Coleophoma sp.MEFC009的种子培养基:15g/L可溶性淀粉,10g/L蔗糖,5g/L棉籽饼粉,10g/L蛋白胨,1g/L KH2PO4,2g/L CaCO3,pH 6.0-8.0。
Coleophoma sp.MEFC009的发酵培养基:30g/L玉米淀粉,30g/L蛋白胨,6g/L(NH4)2SO4,1g/L KH2PO4,0.3g/L FeSO4·7H2O,0.01g/L ZnSO4·7H2O,2g/L CaCO3,pH6.0-8.0。
G.lozoyensis ATCC 74030的种子培养基:20g/L大豆粉,40g/L葡萄糖,1g/LKH2PO4,pH 5.0-8.0。
G.lozoyensis ATCC 74030的发酵培养基:100g/L甘露醇,20g/L葡萄糖,10g/L棉籽饼粉,10g/L蛋白胨,2.5g/L K2HPO4·3H2O,pH 5.0-8.0。
STC:1M山梨醇,50mM Tris-HCl(pH 8.0),50mM CaCl2。
PSTC:40%PEG4000,1M山梨醇,50mM Tris-HCl(pH 8.0),50mM CaCl2。
顶层琼脂:PDB、1M山梨醇和4g/L琼脂糖,灭菌后48℃保温。
再生筛选培养基平板PDA-SH:PDA平板、1M山梨醇和100mg/L潮霉素B。
筛选培养基PDA-H:PDA平板和100mg/L潮霉素B。
质粒pXH2-1记载在Xuenian Huang,Xuefeng Lu,Jian-Jun Li.Cloning,characterization and application of a glyceraldehyde-3-phosphatedehydrogenase promoter from Aspergillus terreus,J Ind Microbiol Biotechnol(2014)41:585-592。
质粒pCAMBIAMBIA1300记载在HanaMartina Hujslová,MilanGryndler.Genetic transformation of extremophilic fungi Acidea extrema andAcidothrix acidophila,Folia Microbiol(Praha).2015,60(4),365-71.
质粒pPM-3记载在Ping Men,Min Wang,Jinda Li,Xuenian Huang,XuefengLu.Establishing an efficient genetic manipulation system for sulfatedechinocandin producing fungus Coleophoma empetri.Frontiers inMicrobiology.2021,12,734780。
质粒pCAMBIA1300-mcfP(本实验室自主构建)
质粒pCAMBIA1300-mcfS(本实验室自主构建)
鞘茎点霉属真菌(Coleophoma sp.)保藏于中国普通微生物菌种保藏管理中心,(该菌株保藏号是CGMCC NO:21058,地址:北京市朝阳区北辰西路1号院3号,中国科学院微生物研究所)。
G.lozoyensis ATCC 74030购自American type culture collection。
实施例1.构建敲除mcfP基因的工程菌株Coleophoma sp.-ΔmcfP
以野生型Coleophoma sp.MEFC009的基因组为模板,采用pfu DNA聚合酶(Fermentas,Catalog No.:EP0501)进行PCR扩增,用引物UmcfP-F(5’-tctcaaggagataactcccacac-3’)和UmcfP-R(5’-ctttacgcttgcgatcccgaaTCATTGGGATTGATGCGGATGATAGG-3’)可以扩增获得大小约为1.2kb的mcfP的上游序列U-mcfP,用引物DmcfP-F(5’-ccctgggttcgcaaagataattgCGTATCTTTCCACTAATACTGC-3’)和DmcfP-R(5’-caccgtacctgaatcctcat-3’)可以扩增获得大小为1.2kb的mcfP的下游序列D-mcfP。以质粒pXH2-1为模板,用引物hph-F(5’-ttcgggatcgcaagcgtaaag-3’)和hph-R(5’-caattatctttgcgaacccagg-3’)进行PCR扩增,获得大小约为2.2kb的潮霉素抗性筛选片段hph;通过融合PCR将hph片段、上游序列U-mcfP和下游序列D-mcfP进行融合,再以融合产物为模板,用巢式引物UmcfP-CS-F(5’-ggacaacgaatagctaaatgaaga-3’)和DmcfP-CS-R(5’-gctctgctattcataactcg-3’)通过PCR扩增出大小为4.4kb的敲除打靶元件UmcfP-hph-DmcfP。所述mcfP基因序列如SEQ ID No.1所示,所述McfP的氨基酸序列如SEQ ID No.2所示。
以Coleophoma sp.-Δku80为出发菌,首先从PDA平板上取少量菌丝,用手持匀浆器破碎,取1mL种子液接种到50mL的种子培养基,于250mL的三角瓶中,220rpm,25℃进行摇床培养。2天后,离心收集菌丝。5000rpm,4℃,5min。将菌丝体再次用匀浆器破碎,取0.5mL-2mL种子液接种到50mL的种子培养基,相同的条件下培养1天,将培养基与菌丝体一起倒入50mL的无菌离心管中,5000rpm,离心收集菌丝。用0.6M MgSO4对菌丝体清洗2次。称取1g菌丝,加入10mL酶解液,30℃,100rpm处理1-4h。酶解液成分为:1%纤维素酶、0.6%溶壁酶、0.6%蜗牛酶、0.6M MgSO4,经由0.22μm的无菌过滤器过滤除菌。将上述原生质体反应液通过无菌神奇滤布进行过滤。5000rpm,4℃离心收集原生质体。用冰预冷的STC,洗涤一次,把原生质体重悬于预冷的STC中,并用STC将原生质体浓度调整为5×107个/mL,得到原生质体悬液。
向140μL上述原生质体悬液中加入10μL的UmcfP-hph-DmcfP片段,再加入50μL的PSTC,轻轻混匀,冰浴30min。加入1mL PSTC,混匀后室温放置20min;然后与10mL顶层琼脂混合后倾注于3块再生筛选培养基平板PDA-SH上,在30℃、黑暗条件下培养5-7天,得到转化子。
从转化筛选平板上挑取具有潮霉素抗性转化子转移至PDA-H上,在25℃培养4-6天进行传代培养,连续传代3代。选取稳定传代的3个转化子(6#,8#,9#)进行单孢分离纯化,并提取单孢分离之后的转化子基因组。利用外部引物UmcfP-F(5’-tctcaaggagataactcccacac-3’)和DmcfP-R(5’-caccgtacctgaatcctcat-3’)对转化子基因组进行PCR验证,能扩增出大小约为4.6kb的条带的为阳性转化子,而Coleophoma sp.-Δku80只能扩增出大小约为2.9kb的条带,图1说明6#,8#,9#转化子为阳性转化子,说明在基因mcfP位置发生了同源重组,整合了外源片段UmcfP-hph-DmcfP。
实施例2.mcfP基因缺失工程菌株Coleophoma sp.-ΔmcfP的发酵及产物分析
将3株mcfP基因缺失工程菌株Coleophoma sp.-ΔmcfP 6#,8#,9#和对照菌株Coleophoma sp.-Δku80接种于PDA固体平板上,25℃培养4-6天。挑取少量的菌丝,利用核酸提取仪(-24)对菌丝进行破碎,将破碎后的菌丝接种于50mL Coleophoma sp.的种子培养基(250mL三角瓶),25℃,220rpm,摇床培养48h。将上述培养的种子液取5mL到Coleophoma sp.的发酵培养基,25℃,220rpm,摇床培养8天,每个菌株设置3个平行。从每一瓶发酵液中取1mL,加入等体积的甲醇,超声萃取1h,离心,取上清。用0.22μm的有机滤器过滤处理样品,并通过HPLC和LC-MS对处理后的样品进行分析。
HPLC分析方法为:液相色谱柱为Agilent C-18反向柱883975-902(4.6×150mm,5μm);流动相为A:0.05%(体积比)三氟乙酸水溶液,流动相B:0.05%(体积比)三氟乙酸乙腈溶液,流速为1mL/min,紫外检测波长:210nm,30℃,总洗脱时间为37min。梯度洗脱条件:0-5min,流动相B占流动相的体积由5%线性上升到24%,5-35min,流动相B占流动相的体积由24%线性上升到62%,35-37min,流动相B占流动相的体积由62%线性上升到100%。结果如图2所示;与出发菌株Coleophoma sp.-Δku80相比,化合物1,2,3消失,相应的出现了紫外吸收与化合物1,2,3相同的另外4个化合物。进而对化合物4,5,6,7和8进行分离纯化,通过液质联用(LC-MS)和核磁共振(NMR)进行分析。LC-MS分析方法为:Agilent公司1290高效液相色谱,色谱柱为Agilent Zorbax Extend-C18的C18柱(2.1×50mm,1.8μm);流动相总流速为0.6mL/min;流动相A:0.05%(体积比)甲酸水溶液,流动相B:0.05%(体积比)甲酸乙腈溶液,总洗脱时间为7.0min;洗脱条件为:梯度洗脱条件:0-1min,流动相B占流动相的体积由5%线性上升到20%,1-6min,流动相B占流动相的体积由20%线性上升到60%,6-7min,流动相B占流动相的体积由60%线性上升到100%。结果如图3所示;结合NMR分析结果可知,4(分子式:C51H82N8O17,理论值:[M+H]+1079.5871,实际值:1079.5873),5(分子式:C50H80N8O16,理论值:[M+H]+1049.5765,实际值:1049.5766),6(分子式:C51H82N8O16,理论值:[M+H]+1063.5922,实际值1063.5921),7(分子式:C51H82N8O15,理论值:[M+H]+1047.5972,实际值:1047.5969)和8(分子式:C51H82N8O14,理论值:[M+H]+1031.6023,实际值:1031.6022),根据分子量推测这些中间体少了氧磺酰基和一些羟基。进一步通过NMR对化合物4,5,6,7和8的结构进行了鉴定。他们有一个共同的特征:氧磺酰基消失,进而说明基因mcfP编码的P450酶负责FR901379结构中氧磺酰化模块形成的第一步——L-高酪氨酸苯环C3’位的羟基化。
以McfP的氨基酸序列作为参照,在另外3株产其他磺酰化的棘白菌素类化合物生产菌株Coleophoma cylindrospora、Coleophoma crateriform和Venustampullaechinocandica中找到了同源性大于75%的同源序列,在NCBI数据库中的序列号分别为RDW63434.1、RDW57263.1和XP_031866084.1。这3个蛋白很有可能与McfP具有相同的功能,负责氧磺酰基模块中第一步羟基的形成。
实施例3.构建敲除mcfS基因的工程菌株Coleophoma sp.-ΔmcfS
以野生型Coleophoma sp.MEFC009的基因组为模板,采用pfu DNA聚合酶(Fermentas,Catalog No.:EP0501)进行PCR扩增,用引物UmcfS-F(5’-gcgccttcgaagcgggcaac-3’)和UmcfS-R(5’-ctttacgcttgcgatcccgaaTCGAAGGCCTCTTTCCACAAC-3’)可以扩增获得大小约为1.2kb的mcfS的上游序列U-mcfS,用引物DmcfS-F(5’-cctgggttcgcaaagataattgACATATTCAAGTACAGCCCCC-3’)和DmcfS-R(5’-tagtccagaggatgacttcc-3’)可以扩增获得大小为1.2kb的mcfS的下游序列D-mcfS。以质粒pXH2-1为模板,用引物hph-F(5’-ttcgggatcgcaagcgtaaag-3’)和hph-R(5’-caattatctttgcgaacccagg-3’)进行PCR扩增,获得大小约为2.2kb的潮霉素抗性筛选片段hph;通过融合PCR将hph片段、上游序列U-mcfS和下游序列D-mcfS进行融合,再以融合产物为模板,用巢式引物UmcfS-CS-F(5’-gaatactttgctcgcaggtg-3’)和DmcfS-CS-R(5’-gccaatctataaagggaaagg-3’)通过PCR扩增出大小为4.4kb的敲除打靶元件UmcfS-hph-DmcfS。
以Coleophoma sp.-Δku80为出发菌,首先从PDA平板上取少量菌丝,用手持匀浆器破碎,取1mL种子液接种到50mL的种子培养基,于250mL的三角瓶中,220rpm,25℃进行摇床培养。2天后,离心收集菌丝。5000rpm,4℃,5min。将菌丝体再次用匀浆器破碎,取0.5mL-2mL种子液接种到50mL的种子培养基,相同的条件下培养1天,将培养基与菌丝体一起倒入50mL的无菌离心管中,5000rpm,离心收集菌丝。用0.6M MgSO4对菌丝体清洗2次。称取1g菌丝,加入10mL酶解液,30℃,100rpm处理1-4h。酶解液成分为:1%纤维素酶、0.6%溶壁酶、0.6%蜗牛酶、0.6M MgSO4,经由0.22μm的无菌过滤器过滤除菌。将上述原生质体反应液通过无菌神奇滤布进行过滤。5000rpm,4℃离心收集原生质体。用冰预冷的STC,洗涤一次,把原生质体重悬于预冷的STC中,并用STC将原生质体浓度调整为5×107个/mL,得到原生质体悬液。
向140μL上述原生质体悬液中加入10μL的UmcfS-hph-DmcfS片段,再加入50μL的PSTC,轻轻混匀,冰浴30min。加入1mL PSTC,混匀后室温放置20min;然后与10mL顶层琼脂混合后倾注于3块再生筛选培养基平板PDA-SH上,在30℃黑暗条件下培养5-7天,得到转化子。
从转化筛选平板上挑取具有潮霉素抗性转化子转移至PDA-H上,在25℃培养4-6天进行传代培养,连续传代3代。选取稳定传代的3个转化子(1#,3#,7#)进行单孢分离纯化,并提取单孢分离之后的转化子基因组。利用外部引物UmcfS-F(5’-gcgccttcgaagcgggcaac-3’)和DmcfS-R(5’-tagtccagaggatgacttcc-3’)对转化子基因组进行PCR验证,能扩增出大小约为4.9kb的条带的为阳性转化子,而Coleophoma sp.-Δku80只能扩增出大小约为3.1kb的条带,图5说明1#,3#,7#转化子为阳性转化子,说明在基因mcfS位置发生了同源重组,整合了外源片段UmcfS-hph-DmcfS。
实施例4.mcfS基因缺失工程菌株Coleophoma sp.-ΔmcfS的发酵及产物分析
将3株mcfS基因缺失工程菌株Coleophoma sp.-ΔmcfS 1#,3#,7#和对照菌株Coleophoma sp.-Δku80接种于PDA固体平板上,25℃培养4-6天。挑取少量的菌丝,利用核酸提取仪(-24)对菌丝进行破碎,将破碎后的菌丝接种于50mL Coleophoma sp的种子培养基(250mL三角瓶),25℃,220rpm,摇床培养48h。将上述培养的种子液取5mL到Coleophoma sp的发酵培养基,25℃,220rpm,摇床培养8天,每个菌株设置3个平行。从每一瓶发酵液中取1mL,加入等体积的甲醇,超声萃取1h,离心,取上清。用0.22μm的有机滤器过滤处理样品,并通过HPLC和LC-MS对处理后的样品进行分析。
HPLC分析方法为:液相色谱柱为Agilent C-18反向柱883975-902(4.6×150mm,5μm);流动相为A:0.05%(体积比)三氟乙酸水溶液,流动相B:0.05%(体积比)三氟乙酸乙腈溶液,流速为1mL/min,紫外检测波长:210nm,30℃,总洗脱时间为37min。梯度洗脱条件:0-5min,流动相B占流动相的体积由5%线性上升到24%,5-35min,流动相B占流动相的体积由24%线性上升到62%,35-37min,流动相B占流动相的体积由62%线性上升到100%。结果如图6所示;与出发菌株Coleophoma sp.-Δku80相比,化合物1,2,3消失,产生了少量的化合物6、7、8和9。通过LC-MS对Coleophoma sp.-ΔmcfS发酵产物进行分析。LC-MS分析方法为:Agilent公司1290高效液相色谱,色谱柱为Agilent Zorbax Extend-C18的C18柱(2.1×50mm,1.8μm);流动相总流速为0.6mL/min;流动相A:0.05%(体积比)甲酸水溶液,流动相B:0.05%(体积比)甲酸乙腈溶液,总洗脱时间为7.0min;梯度洗脱条件:0-1min,流动相B占流动相的体积由5%线性上升到20%,1-6min,流动相B占流动相的体积由20%线性上升到60%,6-7min,流动相B占流动相的体积由60%线性上升到100%。结果如图7所示;通过LC-MS分析结果可知,当敲除基因mcfS之后,FR901379结构中的磺酰基消失,产生了化合物6、7、8和9。化合物6、7和8同样也在敲除株Coleophoma sp.-ΔmcfP中存在,这3个化合物有一个共同的特征,L-高酪氨酸苯环C3’位的氧磺酰基缺失。通过LC-MS对化合物9进行分析,化合物9分子式:C51H82N8O18,理论值:[M+H]+1095.5820实际值1095.5823,与化合物1相比,分子量小了80,推测应该少了磺酰基(SO3 -);进一步通过NMR对其进行确认,发现化合物9在L-高酪氨酸苯环的C3’位只有羟基。以上结果表明McfS在FR901379生物合成中负责将磺酰基转移到L-高酪氨酸苯环的C3’位的羟基上。mcfS基因序列如SEQ ID No.3所示,McfS的氨基酸序列如SEQ ID No.4所示。
以McfS的氨基酸序列作为参照,在其他磺酰化的棘白菌素类化合物生产菌株Coleophoma crateriformis和Venustampulla echinocandica中找到了同源性大于75%的同源序列,在NCBI数据库中的序列号分别为RDW57264.1和XP_031866072.1。这2个蛋白很有可能与McfS具有相同的功能,负责氧磺酰基模块中第二步磺酰基基团的形成。
实施例5产氧磺酰化纽莫康定B0的G.lozoyensis ATCC 74030工程菌株的构建
1.P450酶编码基因mcfP和磺酰基转移酶编码基因mcfS表达质粒构建
所述P450酶编码基因mcfP的基因序列如SEQ ID No.1所示,其氨基酸序列如SEQID No.2所示。所述磺酰基转移酶编码基因mcfS的基因序列如SEQ ID No.3所示,其氨基酸序列如SEQ ID No.4所示。
提取Coleophoma sp.MEFC009的RNA,对其RNA进行反转录获得cDNA。以反转录后的cDNA为模板,利用引物mcfPCDS-F(5’-cttattcctttgaacctttcaATGATAAATCTTGCAAGTCCCCTC-3’)和mcfPCDS-R(5’-caaaattcttcatttatttattatgcttccacaagtattcttaa-3’)进行PCR扩增,获得大小约为1.5kb的mcfP的编码序列(mcfPCDS);以G.lozoyensis ATCC 74030基因组为模板,利用引物PgpdGL-F(5’-ctgggttcgcaaagataattgtgttactcatatggattgaggg-3’)和PgpdGL-R(5’-GGGGACTTGCAAGATTTATCATattgttttctggtgaagattag-3’)进行PCR扩增,获得大小约为1.0kb的启动子片段PgpdGL;以质粒pPM3为模板,利用引物Tpgk-F(5’-taaataaatgaagaattttgtgaaacgag-3’)和Tpgk-R(5’-cacacattattatggagaaacattgcagcgcacaagtcagt-3’)进行PCR扩增,获得大小约为0.5kb大小的终止子片段Tpgk;以质粒pXH2-1为模板,利用引物hph-F(5’-ttcgggatcgcaagcgtaaag-3’)和hph-R(5’-caattatctttgcgaacccagg-3’)进行PCR扩增,获得大小约为2.2kb的潮霉素抗性筛选片段hph;利用限制性内切酶Bam H I和Xho I对质粒pCAMBIA1300进行双酶切,纯化回收后获得线性的质粒pCAMBIA1300。利用一步克隆试剂盒(Ultra One Step CloningKit)将线性质粒pCAMBIA1300与片段hph,PgpdGL,mcfPCDS和Tpgk进行连接,获得重组质粒pCAMBIA1300-mcfP。将重组后的质粒pCAMBIA1300-mcfP转化到大肠杆菌DH5α感受态细胞,通过卡那霉素抗性筛选阳性转化子,通过PCR和DNA测序获得正确的重组质粒pCAMBIA1300-mcfP,质粒图谱如图8A。
参照上述相同的方法构建质粒pCAMBIA1300-mcfS。以反转录后的cDNA为模板,利用引物mcfSCDS-F(5’-caactcatcaatcatcacaacATGGCTTTAGACCGCCAGAATGC-3’)和mcfSCDS-R(5’-cacaaaattcttcatttatttaCTACTTCCTAGCTAGCCAAACAGCC-3’)进行PCR扩增,获得大小约为0.8kb的mcfS的编码序列(mcfSCDS);以质粒pXH2-1为模板,利用引物PgpdAt-F(5’-ccctgggttcgcaaagataattggttacactctgggaggatcc-3’)和PgpdAt-R(5’-gttgtgatgattgatgagttg-3’)进行PCR扩增,获得大小约为0.7kb的启动子片段PgpdAt;利用一步克隆试剂盒(Ultra One Step Cloning Kit)将线性质粒pCAMBIA1300与片段hph,PgpdAt,mcfSCDS和Tpgk进行连接,获得重组质粒pCAMBIA1300-mcfS。将重组后的质粒pCAMBIA1300-mcfS转化到大肠杆菌DH5α感受态细胞,通过卡那霉素抗性筛选阳性转化子,通过PCR和DNA测序获得正确的重组质粒pCAMBIA1300-mcfS,质粒图谱如图8B。
2.产氧磺酰化纽莫康定B0菌株的构建
分别将重组质粒pCAMBIA1300-mcfP和pCAMBIA1300-mcfS转到农杆菌LBA4404感受态细胞,获得重组菌株LBA4404-pCAMBIA1300-mcfP和LBA4404-pCAMBIA1300-mcfS。通过根瘤农杆菌介导的转化,将片段hph-PgpdA-mcfP-Tpgk和hph-PgpdAt-mcfS-Tpgk共同转到两种方式转到G.lozoyensis ATCC 74030菌株中。
将具有潮霉素B抗性的转化子进行传代培养,连续传代3次,选取稳定传代的9个转化子进行分离纯化,25℃培养7-10天,对纯化后的单菌落提取基因组,分别利用引物PgpdGL-F(5’-ctgggttcgcaaagataattgtgttactcatatggattgaggg-3’)和mcfPCDS-R(5’-caaaattcttcatttatttattatgcttccacaagtattcttaa-3’);PgpdAt-F(5’-ccctgggttcgcaaagataattggttacactctgggaggatcc-3’)和mcfSCDS-R(5’-cacaaaattcttcatttatttaCTACTTCCTAGCTAGCCAAACAGCC-3’)进行PCR验证,能同时扩增出大小约为3.0kb和2.1kb的条带为阳性转化子,由图9可知利用该方法获得的9个转化子为阳性转化子,基因组上同时整合了表达元件PgpdGL-mcfP-Tpgk和PgpdAt-mcfS-Tpgk,证明我们得到了表达mcfP和mcfS的G.lozoyensis菌株G.lozoyensis::mcfP::mcfS。
实施例6产氧磺酰化纽莫康定B0的G.lozoyensis::mcfP::mcfS工程菌株的发酵验证
将工程菌株G.lozoyensis::mcfP::mcfS和对照菌株G.lozoyensis ATCC 74030接种于PDA固体平板上,25℃培养7-10天。挑取少量的菌丝,利用核酸提取仪(-24)对菌丝进行破碎,将破碎后的菌丝接种于50mL G.lozoyensis ATCC 74030的种子培养基(250mL三角瓶),25℃,220rpm,摇床培养4-5天。将上述培养的种子液取5mL到G.lozoyensisATCC 74030的发酵培养基,25℃,220rpm,摇床培养12天,每个菌株设置3个平行。从每一瓶发酵液中取1mL,加入等体积的甲醇,超声萃取1h,离心,取上清。用0.22μm的有机滤器过滤处理样品,并通过HPLC和LC-MS对处理后的样品进行分析。
HPLC分析方法为:液相色谱柱为Agilent C-18反向柱883975-902(4.6×150mm,5μm);流动相为A:0.05%(体积比)三氟乙酸水溶液,流动相B:0.05%(体积比)三氟乙酸乙腈溶液,流速为1mL/min,紫外检测波长:210nm,30℃,总洗脱时间为25min。梯度洗脱条件:0-5min,流动相B占流动相的体积由5%线性上升到40%,5-20min,流动相B占流动相的体积由40%线性上升到62%,20-25min,流动相B占流动相的体积由62%线性上升到100%。结果如图10所示;从HPLC结果可知,除了化合物10(纽莫康定B0)外,在12.2min和13min出现了两个新的化合物,命名为化合物11和12。并且11和12和10具有相同的紫外吸收。猜测可能是羟化的纽莫康定B0和氧磺酰化纽莫康定B0的。为了进一步确认化合物的11和12结构,通过LC-MS进行分析。
LC-MS分析方法为:Agilent公司1290高效液相色谱,色谱柱为Agilent ZorbaxExtend-C18的C18柱(2.1×50mm,1.8μm);流动相总流速为0.6mL/min;流动相A:0.05%(体积比)甲酸水溶液,流动相B:0.05%(体积比)甲酸乙腈溶液,总洗脱时间为7.5min;洗脱条件为:0-1min,流动相B占流动相的体积由5%线性上升到20%,1-6min,流动相B占流动相的体积由20%线性上升到60%,6-7min,流动相B占流动相的体积由60%线性上升到100%。结果如图11所示。从LC-MS分析结果可知,化合物11的质荷比[M+H]+为1081.5664(C50H80N8O18,理论值:1081.5663),化合物12的质荷比[M+H]+为1161.5229(C50H80N8O21S,理论值:1161.5231)。进一步的NMR结果表明化合物11和12与化合物10相比(图12),分别在L-高酪氨酸苯环的C3’位置多了一个羟基和一个氧磺酰基。说明P450酶McfP和McfS能够在G.lozoyensis ATCC 74030内催化纽莫康定B0生成氧磺酰化的纽莫康定B0。
实施例7产羟化纽莫康定B0的G.lozoyensis ATCC 74030工程菌株的构建
1.P450酶编码基因mcfP表达质粒构建
所述P450酶编码基因mcfP的基因序列如SEQ ID No.1所示,其氨基酸序列如SEQID No.2所示。提取Coleophoma sp.MEFC009的RNA,对其RNA进行反转录获得cDNA。以反转录后的cDNA为模板,利用引物mcfPCDS-F(5’-cttattcctttgaacctttcaATGATAAATCTTGCAAGTCCCCTC-3’)和mcfPCDS-R(5’-caaaattcttcatttatttattatgcttccacaagtattcttaa-3’)进行PCR扩增,获得大小约为1.5kb的mcfP的编码序列(mcfPCDS);以G.lozoyensis ATCC 74030基因组为模板,利用引物PgpdGL-F(5’-ctgggttcgcaaagataattgtgttactcatatggattgaggg-3’)和PgpdGL-R(5’-GGGGACTTGCAAGATTTATCATattgttttctggtgaagattag-3’)进行PCR扩增,获得大小约为1.0kb的启动子片段PgpdGL;以质粒pPM3为模板,利用引物Tpgk-F(5’-taaataaatgaagaattttgtgaaacgag-3’)和Tpgk-R(5’-cacacattattatggagaaacattgcagcgcacaagtcagt-3’)进行PCR扩增,获得大小约为0.5kb大小的终止子片段Tpgk;以质粒pXH2-1为模板,利用引物hph-F(5’-ttcgggatcgcaagcgtaaag-3’)和hph-R(5’-caattatctttgcgaacccagg-3’)进行PCR扩增,获得大小约为2.2kb的潮霉素抗性筛选片段hph;利用限制性内切酶Bam HI和Xho I对质粒pCAMBIA1300进行双酶切,纯化回收后获得线性的质粒pCAMBIA1300。利用一步克隆试剂盒(Ultra One Step CloningKit)将线性质粒pCAMBIA1300与片段hph、PgpdGL、mcfPCDS和Tpgk进行连接,获得重组质粒pCAMBIA1300-mcfP。将重组后的质粒pCAMBIA1300-mcfP转化到大肠杆菌DH5α感受态细胞,通过卡那霉素抗性筛选阳性转化子,通过PCR和DNA测序获得正确的重组质粒pCAMBIA1300-mcfP,质粒图谱如图8A。
2.产羟化纽莫康定B0菌株的构建
将重组质粒pCAMBIA1300-mcfP转到农杆菌LBA4404感受态细胞,获得重组菌株LBA4404-pCAMBIA1300-mcfP通过根瘤农杆菌介导的转化,将片段hph-PgpdGL-mcfP-Tpgk转到G.lozoyensis ATCC 74030菌株中。
将具有潮霉素B抗性的转化子进行传代培养,连续传代3次,选取稳定传代的9个转化子进行分离纯化,25℃培养7-10天,对纯化后的单菌落提取基因组,分别利用引物PgpdGL-F(5’-ctgggttcgcaaagataattgtgttactcatatggattgaggg-3’)和mcfPCDS-R(5’-caaaattcttcatttatttattatgcttccacaagtattcttaa-3’)进行PCR验证,能同时扩增出大小约为2.5kb的条带为阳性转化子,由图13可知利用该方法获得15个阳性转化子,基因组上整合了表达元件PgpdGL-mcfP-Tpgk,证明我们得到了表达mcfP的G.lozoyensis菌株G.lozoyensis::mcfP。
实施例8产羟化纽莫康定B0的G.lozoyensis::mcfP工程菌株的发酵验证
将工程菌株G.lozoyensis::mcfP和对照菌株G.lozoyensis ATCC 74030接种于PDA固体平板上,25℃培养7-10天。挑取少量的菌丝,利用核酸提取仪(-24)对菌丝进行破碎,将破碎后的菌丝接种于50mL G.lozoyensis ATCC 74030的种子培养基(250mL三角瓶),25℃,220rpm,摇床培养4-5天。将上述培养的种子液取5mL到G.lozoyensisATCC74030的发酵培养基,25℃,220rpm,摇床培养12天,每个菌株设置3个平行。从每一瓶发酵液中取1mL,加入等体积的甲醇,超声萃取1h,离心,取上清。用0.22μm的有机滤器过滤处理样品,并通过HPLC和LC-MS对处理后的样品进行分析。
HPLC分析方法为:液相色谱柱为Agilent C-18反向柱883975-902(4.6×150mm,5μm);流动相为A:0.05%(体积比)三氟乙酸水溶液,流动相B:0.05%(体积比)三氟乙酸乙腈溶液,流速为1mL/min,紫外检测波长:210nm,30℃,总洗脱时间为25min。梯度洗脱条件:0-5min,流动相B占流动相的体积由5%线性上升到40%,5-20min,流动相B占流动相的体积由40%线性上升到62%,20-25min,流动相B占流动相的体积由62%线性上升到100%。结果如图14所示;从HPLC结果可知,除了化合物10(纽莫康定B0)外,在12.2min出现了一个新的化合物,命名为化合物11。并且11和10具有相同的紫外吸收。猜测可能是羟化的纽莫康定B0。为了进一步确认化合物的11结构,通过LC-MS进行分析。
LC-MS分析方法为:Agilent公司1290高效液相色谱,色谱柱为Agilent ZorbaxExtend-C18的C18柱(2.1×50mm,1.8μm);流动相总流速为0.6mL/min;流动相A:0.05%(体积比)甲酸水溶液,流动相B:0.05%(体积比)甲酸乙腈溶液,总洗脱时间为7.5min;洗脱条件为:0-1min,流动相B占流动相的体积由5%线性上升到20%,1-6min,流动相B占流动相的体积由20%线性上升到60%,6-7min,流动相B占流动相的体积由60%线性上升到100%。结果如图11所示。从LC-MS分析结果可知,化合物11的质荷比[M+H]+为1081.5664(C50H80N8O18,理论值:1081.5663)。进一步的NMR结果表明化合物11与纽莫康定B0相比(图12),在L-高酪氨酸苯环的C3’位置多了一个羟基。说明P450酶McfP能够在G.lozoyensis ATCC 74030内催化纽莫康定B0生成羟化的纽莫康定B0。
实施例9.磺酰基转移酶McfS的制备
1.磺酰基转移酶McfS表达菌株构建
将Coleophoma sp.MEFC009中基因mcfS编码的磺酰基转移酶的核苷酸序列进行合成,并克隆到载体pET28a-His-SUMO上,获得质粒pET28a-His-SUMO-McfS,质粒图谱如图15所示。将质粒pET28a-His-SUMO-McfS转化到大肠杆菌BL21感受态细胞中,得到能够表达磺酰基转移酶McfS的重组菌株BL21(DE3)。
2.磺酰基转移酶McfS的表达与纯化
挑取重组菌株BL21(DE3)的阳性单克隆于LB液体培养基中(含100μg/mL的卡那霉素)中,37℃,220rpm摇床培养26-30h获得种子发酵液;按照1%接种量将种子液接种到LB液体培养基中(含100μg/mL的卡那霉素)中,37℃,220rpm摇床培养至OD600=0.6-1.0,加入终浓度为0.2mM的IPTG进行诱导表达,并在18℃,180rpm摇床继续培养发酵24h。将发酵液于4℃,8000rpm进行离心,收集菌体。菌体用结合缓冲液(50mM Tris-HCl)洗涤一次,将菌体保存于-80℃超低温冰箱中。分别在用IPTG诱导前,诱导后进行取样,并对诱导表达后的样品超声破碎之后,进行离心,13000rpm,10min,将上清与菌体分离。
采用Ni-NTA琼脂糖(QIAGEN,Cat No.30230)对目的蛋白进行纯化,纯化过程中温度控制在4℃,步骤如下:(1)用蒸馏水洗净玻璃层析柱,将Ni-NTA琼脂糖摇匀后取5mL装柱,使溶液流出;(2)平衡:用10倍体积的结合缓冲液洗柱,对树脂进行平衡;(3)上样:将上清液用0.22μm滤器过滤后,以1mL/min的流速将样品流加至层析柱中;(4)洗涤:先流加10倍体积的结合缓冲液(50mM Tris-HCl,5mM咪唑,500mM NaCl,pH 8.0)对层析柱进行洗涤,然后用20倍体积的60mM咪唑溶液(结合缓冲液中溶解有5mM咪唑)进行洗涤,将吸附物质、杂蛋白以及结合力较弱蛋白清洗除去,洗涤过程中溶液流速为1.0mL/min;(5)洗脱:用20mL的250mM咪唑溶液(结合缓冲液中溶解有5mM咪唑)将结合在树脂上的蛋白进行洗脱,洗脱过程中流速为1.0mL/min,收集洗脱的蛋白液;(6)超滤:将收集的蛋白于超滤管中进行浓缩,浓缩后的蛋白利用去盐缓冲液(50mM Tris-HCl,10%甘油)进行置换,直至蛋白溶液中的咪唑浓度小于0.2mM。超滤全过程都于4℃进行;(7)浓度测定:将透析后的蛋白液进行收集,用Bradford方法测定蛋白质浓度,用SDS-PAGE分析蛋白质纯化结果(图16)。
实施例10磺酰基转移酶McfS的活性分析
分析磺酰基转移酶McfS是否可以对化合物FR133302(9)(其结构如图18所示)L-高酪氨酸苯环上的C3'位进行磺酰化。反应体系(200μL):底物浓度为75μM,酶浓度5μM,供体3'-磷酸腺苷-5'-磷酰硫酸(PAPS)浓度0.4mM,MgCl2浓度1mM,缓冲体系为50mM Tris-HCl,pH 8.0;对照组1:用50mM Tris-HCl,pH 8.0的缓冲液代替酶液;对照组2:用50mM Tris-HCl,pH 8.0缓冲液代替底物;30℃反应1-5h。加入等体积的甲醇终止反应,用0.22μm的有机过滤器过滤之后,通过HPLC和LC-MS对样品进行分析。HPLC分析方法为:液相色谱柱为Agilent C-18反向柱883975-902(4.6×150mm,5μm);流动相为A:0.05%(体积比)三氟乙酸水溶液,流动相B:0.05%(体积比)三氟乙酸乙腈溶液,流速为1mL/min,紫外检测波长:210nm,30℃,总洗脱时间为25min。梯度洗脱条件:0-5min,流动相B占流动相的体积由5%线性上升到40%,5-20min,流动相B占流动相的体积由40%线性上升到62%,20-25min,流动相B占流动相的体积由62%线性上升到100%,HPLC分析结果如图17所示。结果表明磺酰基转移酶McfS可以对化合物FR133302进行催化,在L-高酪氨酸苯环上的C3’位的羟基上加上磺酰基,从而生成化合物FR901379,反应如图18所示。
实施例11磺酰基转移酶McfS同源蛋白的制备及活性分析
1、磺酰基转移酶McfS同源蛋白的制备
以McfS的氨基酸作为参照序列,通过生物信息学分析,在其他磺酰化棘白菌素生产菌株Coleophoma crateriformis和Venustampulla echinocandica菌株中找到了McfS的同源蛋白,编号分别是RDW57264.1和XP_031866072.1,这2个蛋白在NCBI数据库中都被注释为未知功能蛋白。通过Cluster W对这两个蛋白及McfS的氨基酸序列进行了对比,这3个蛋白的氨基酸序列具有超过75%的同源性。
将蛋白RDW57264.1和XP_031866072.1的氨基酸序列(分别如SEQ ID No.5和6所示)所编码的核苷酸序列进行合成,并克隆到载体pET28a-His-SUMO上,得到重组质粒pET28a-His-SUMO-RDW57264.1和pET28a-His-SUMO-XP_031866072.1。
挑取重组菌株BL21(DE3)的阳性单克隆于LB液体培养基中(含100μg/mL的卡那霉素)中,37℃,220rpm摇床培养26-30h获得种子发酵液;按照1%接种量将种子液接种到LB液体培养基中(含100μg/mL的卡那霉素)中,37℃,220rpm摇床培养至OD600=0.6-0.8,加入终浓度为0.2mM的IPTG进行诱导表达,并在18℃,180rpm摇床继续培养发酵24h。将发酵液于4℃,8000rpm进行离心,收集菌体。菌体用结合缓冲液(50mM Tris-HCl)洗涤一次,将菌体保存于-80℃超低温冰箱中。分别在用IPTG诱导前,诱导后进行取样,并对诱导表达后的样品超声破碎之后,进行离心,13000rpm,10min,将上清与菌体分离。
采用Ni-NTA琼脂糖(QIAGEN,Cat No.30230)对目的蛋白进行纯化,纯化过程中温度控制在4℃,步骤如下:(1)用蒸馏水洗净玻璃层析柱,将Ni-NTA琼脂糖摇匀后取5mL装柱,使溶液流出;(2)平衡:用10倍体积的结合缓冲液洗柱,对树脂进行平衡;(3)上样:将上清液用0.22μm滤器过滤后,以1mL/min的流速将样品流加至层析柱中;(4)洗涤:先流加10倍体积的结合缓冲液(50mM Tris-HCl,5mM咪唑,500mM NaCl,pH 8.0)对层析柱进行洗涤,然后用20倍体积的60mM咪唑溶液(结合缓冲液中溶解有5mM咪唑)进行洗涤,将吸附物质、杂蛋白以及结合力较弱蛋白清洗除去,洗涤过程中溶液流速为1.0mL/min;(5)洗脱:用20mL的250mM咪唑溶液(结合缓冲液中溶解有5mM咪唑)将结合在树脂上的蛋白进行洗脱,洗脱过程中流速为1.0mL/min,收集洗脱的蛋白液;(6)超滤:将收集的蛋白于超滤管中进行浓缩,浓缩后的蛋白利用去盐缓冲液(50mM Tris-HCl,10%甘油)进行置换,直至蛋白溶液中的咪唑浓度小于0.2mM。超滤全过程都于4℃进行;(7)浓度测定:将透析后的蛋白液进行收集,用Bradford方法测定蛋白质浓度。
2、磺酰基转移酶McfS同源蛋白的活性分析
以化合物13a作为底物,分别利用蛋白RDW57264.1和XP_031866072.1对其进行催化,反应体系与McfS的催化体系相同,反应体系(200μL):底物浓度为75μM,酶浓度5μM,供体3'-磷酸腺苷-5'-磷酰硫酸(PAPS)浓度0.4mM,MgCl2浓度1mM,缓冲体系为50mM Tris-HCl,pH 8.0;30℃反应1-5h。加入等体积的甲醇终止反应,用0.22μm的有机过滤器过滤之后,通过HPLC和LC-MS对样品进行分析。HPLC分析方法为:液相色谱柱为Agilent C18反向柱883975-902(4.6×150mm,5μm);流动相为A:0.05%(体积比)三氟乙酸水溶液,流动相B:0.05%(体积比)三氟乙酸乙腈溶液,流速为1mL/min,紫外检测波长:210nm,30℃,总洗脱时间为25min。梯度洗脱条件:0-5min,流动相B占流动相的体积由5%线性上升到40%,5-20min,流动相B占流动相的体积由40%线性上升到62%,20-25min,流动相B占流动相的体积由62%线性上升到100%,HPLC分析结果如图19所示。进而通过LC-MS对酶反应产物进行分析。LC-MS分析方法为:Agilent公司1290高效液相色谱,色谱柱为Agilent ZorbaxExtend-C18的C18柱(2.1×50mm,1.8μm);流动相总流速为0.6mL/min;流动相A:0.05%(体积比)甲酸水溶液,流动相B:0.05%(体积比)体积比甲酸乙腈溶液,总洗脱时间为7.0min;梯度洗脱条件:0-1min,流动相B占流动相的体积由5%线性上升到20%,1-6min,流动相B占流动相的体积由20%线性上升到60%,6-7min,流动相B占流动相的体积由60%线性上升到100%。化合物13a和13的LC-MS分析结果如图20所示。
磺酰基转移酶McfS、RDW57264.1和XP_031866072.1均可催化13a生产13;即是说,McfS、RDW57264.1和XP_031866072.1可以催化化合物13a,在L-高酪氨酸苯环上的C3’位的羟基上加上磺酰基,从而生成化合物13,其催化机理如图21所示。
虽然本发明已以较佳的实施例公开如上,但其并非用以限定本发明,在不脱离本发明精神和范围内,本领域技术人员都可以在此基础上做出各种改动与变型,因此,本发明的保护范围应该以权利要求书所界定的为准。
SEQUENCE LISTING
<110> 中国科学院青岛生物能源与过程研究所
<120> 对棘白菌素类化合物氧磺酰化的酶及其应用
<130> 11
<160> 6
<170> PatentIn version 3.5
<210> 1
<211> 1503
<212> DNA
<213> Artificial Sequence
<220>
<223> mcfP
<400> 1
atgataaatc ttgcaagtcc cctcttcgca acaacagcag ttctagtctg gctcagcagt 60
ctcataatct atcgcctata tctctctcca ctatctcgat ttcccggccc aaaactcgct 120
gctctaacag gatggtacga gacatacttc gacctcttta aacggggtcg ctactggatc 180
gagattgaac gcatgcacga agtctatggc cctatcatcc gcatcaatcc caatgagcta 240
catgttaatg acccagaatg gaatgagccc tacaagatca gcggccgcgt tgacaagtat 300
gactggtact acacctttgt tggtagttcc ggatcctcat ctgcattcgg aaccatagac 360
cacgacgttc atcgtggccg ccggaaagct caacagggct atttcaccac cgacgccatc 420
acgcgctttg aaccacattt agaaaccctg acagcaaagt tctgcgcaag actagacggc 480
ttcaagggga cgggaaagca tgttaatctc tccgatgcgt tccgatcaat cgcggtggat 540
gtggccgcga tgtttacatt gaatcaatcg tatggtttca tcgatgaccc ggatttcaag 600
gccgaggtcc atcaagggat ccgggcattt ccggatattg gagtgctgaa tcgccatttt 660
acgggtttgt tcgtggtttt ggagtcaatc catagatggg tgttgagtgt tatcaacccg 720
tcagaagaag ataatgggtt actcacaagt agaataaacc tgcattgtaa agctattatt 780
gccgactacg ccagtaagaa aggcgacgtc aagcccaata tcattcacag aatgctagac 840
gcaccagaac tatcgatgaa agataagaca gcgtggcgcc ttcaattgga ggcgcgcacc 900
cttataggag ctggaactga aacgacagga cacacattag ccgtcatagc attccatctg 960
ctagcaaatc cggagaaggc aaagaggttg aaggaggaga tcttagctac gaaagaaggg 1020
cgggaaaagc ctttaactta tcaggagtta caaatgcttc cgtatttatc ttctgtggtc 1080
cttgaaggtc atcgcatttc tagtgttgta tcaggtcgtc tgccacgggt caatacaaaa 1140
gagccgctca gatatggtga ctatagtatc cctattggca cacccgtcag caccacccaa 1200
cggttaacac actacaatgc caccatattc ccctccccaa acacattcct ccccgaacgt 1260
tggcttcagc cctcggaacg aaagcgcctg gagaaataca tccagccgtt cgggcgtggc 1320
tcaagatctt gtataggcat gcatcttgca aatgcagaga tttacaaaac attggcggag 1380
atgtttgcaa ggtttgacat gaagttatat gatacggagt tcgaggatat tatgcaagtg 1440
catgactttt ttacttcgtt tccatcgagc gagaggggtt taagaatact tgtggaagca 1500
taa 1503
<210> 2
<211> 500
<212> PRT
<213> Artificial Sequence
<220>
<223> mcfP
<400> 2
Met Ile Asn Leu Ala Ser Pro Leu Phe Ala Thr Thr Ala Val Leu Val
1 5 10 15
Trp Leu Ser Ser Leu Ile Ile Tyr Arg Leu Tyr Leu Ser Pro Leu Ser
20 25 30
Arg Phe Pro Gly Pro Lys Leu Ala Ala Leu Thr Gly Trp Tyr Glu Thr
35 40 45
Tyr Phe Asp Leu Phe Lys Arg Gly Arg Tyr Trp Ile Glu Ile Glu Arg
50 55 60
Met His Glu Val Tyr Gly Pro Ile Ile Arg Ile Asn Pro Asn Glu Leu
65 70 75 80
His Val Asn Asp Pro Glu Trp Asn Glu Pro Tyr Lys Ile Ser Gly Arg
85 90 95
Val Asp Lys Tyr Asp Trp Tyr Tyr Thr Phe Val Gly Ser Ser Gly Ser
100 105 110
Ser Ser Ala Phe Gly Thr Ile Asp His Asp Val His Arg Gly Arg Arg
115 120 125
Lys Ala Gln Gln Gly Tyr Phe Thr Thr Asp Ala Ile Thr Arg Phe Glu
130 135 140
Pro His Leu Glu Thr Leu Thr Ala Lys Phe Cys Ala Arg Leu Asp Gly
145 150 155 160
Phe Lys Gly Thr Gly Lys His Val Asn Leu Ser Asp Ala Phe Arg Ser
165 170 175
Ile Ala Val Asp Val Ala Ala Met Phe Thr Leu Asn Gln Ser Tyr Gly
180 185 190
Phe Ile Asp Asp Pro Asp Phe Lys Ala Glu Val His Gln Gly Ile Arg
195 200 205
Ala Phe Pro Asp Ile Gly Val Leu Asn Arg His Phe Thr Gly Leu Phe
210 215 220
Val Val Leu Glu Ser Ile His Arg Trp Val Leu Ser Val Ile Asn Pro
225 230 235 240
Ser Glu Glu Asp Asn Gly Leu Leu Thr Ser Arg Ile Asn Leu His Cys
245 250 255
Lys Ala Ile Ile Ala Asp Tyr Ala Ser Lys Lys Gly Asp Val Lys Pro
260 265 270
Asn Ile Ile His Arg Met Leu Asp Ala Pro Glu Leu Ser Met Lys Asp
275 280 285
Lys Thr Ala Trp Arg Leu Gln Leu Glu Ala Arg Thr Leu Ile Gly Ala
290 295 300
Gly Thr Glu Thr Thr Gly His Thr Leu Ala Val Ile Ala Phe His Leu
305 310 315 320
Leu Ala Asn Pro Glu Lys Ala Lys Arg Leu Lys Glu Glu Ile Leu Ala
325 330 335
Thr Lys Glu Gly Arg Glu Lys Pro Leu Thr Tyr Gln Glu Leu Gln Met
340 345 350
Leu Pro Tyr Leu Ser Ser Val Val Leu Glu Gly His Arg Ile Ser Ser
355 360 365
Val Val Ser Gly Arg Leu Pro Arg Val Asn Thr Lys Glu Pro Leu Arg
370 375 380
Tyr Gly Asp Tyr Ser Ile Pro Ile Gly Thr Pro Val Ser Thr Thr Gln
385 390 395 400
Arg Leu Thr His Tyr Asn Ala Thr Ile Phe Pro Ser Pro Asn Thr Phe
405 410 415
Leu Pro Glu Arg Trp Leu Gln Pro Ser Glu Arg Lys Arg Leu Glu Lys
420 425 430
Tyr Ile Gln Pro Phe Gly Arg Gly Ser Arg Ser Cys Ile Gly Met His
435 440 445
Leu Ala Asn Ala Glu Ile Tyr Lys Thr Leu Ala Glu Met Phe Ala Arg
450 455 460
Phe Asp Met Lys Leu Tyr Asp Thr Glu Phe Glu Asp Ile Met Gln Val
465 470 475 480
His Asp Phe Phe Thr Ser Phe Pro Ser Ser Glu Arg Gly Leu Arg Ile
485 490 495
Leu Val Glu Ala
500
<210> 3
<211> 849
<212> DNA
<213> Artificial Sequence
<220>
<223> mcfS
<400> 3
atggctttag accgccagaa tgcgaaagtt acaactttcg gtctgtcaaa gccgaaaacc 60
aatatagatc gccgatcatg tcagagaact gtccccatga aggttctctg cctaggacta 120
tgtcgaaccg gcacttcctc attgcgtgcg gctctctttg agcttggcct tgatgatgtc 180
tatcacatgt gtagtgtgac ggaagagaat cccctcgact ccaagttgtg gaaagaggcc 240
ttcgacgcga aatatgaagg gatcggcaag ccctacggaa gagctgaatt tgacgcactc 300
ttgggtcatt gcatggcaac ctcggatttc cccagcgttg ccttcgctcc agaactcatc 360
gccgcttacc ccgaggcaaa gataattctc actgtacgag ataacgccga tgtctggtat 420
gactccgttc tcaacacgat ctggagagtc tccaacttcc ttcgcgctcc tccgagaact 480
ttaacccaac gagtcgttca agcgattctt cccaagccgg atttcaacat attcaagtac 540
agcccccttg gcaactttcc tgaggaaggc tgtcagtggt atagtgactg gaatgaagag 600
attagaactc tagccaaagg gagggacttc ttggaattca atgtaaagga gggatggggt 660
ccactctgta gattcttgga ggtggagcag ccggagacgc catttccaag agtcaatgat 720
tcaaatacat tcaaggaatt tcatgataag ggtttggagc aggatattca aagactggta 780
ggcataagta ctaagcttgt cgccgctgtt ggtgtattgg gtttggctgt ttggctagct 840
aggaagtag 849
<210> 4
<211> 282
<212> PRT
<213> Artificial Sequence
<220>
<223> mcfS
<400> 4
Met Ala Leu Asp Arg Gln Asn Ala Lys Val Thr Thr Phe Gly Leu Ser
1 5 10 15
Lys Pro Lys Thr Asn Ile Asp Arg Arg Ser Cys Gln Arg Thr Val Pro
20 25 30
Met Lys Val Leu Cys Leu Gly Leu Cys Arg Thr Gly Thr Ser Ser Leu
35 40 45
Arg Ala Ala Leu Phe Glu Leu Gly Leu Asp Asp Val Tyr His Met Cys
50 55 60
Ser Val Thr Glu Glu Asn Pro Leu Asp Ser Lys Leu Trp Lys Glu Ala
65 70 75 80
Phe Asp Ala Lys Tyr Glu Gly Ile Gly Lys Pro Tyr Gly Arg Ala Glu
85 90 95
Phe Asp Ala Leu Leu Gly His Cys Met Ala Thr Ser Asp Phe Pro Ser
100 105 110
Val Ala Phe Ala Pro Glu Leu Ile Ala Ala Tyr Pro Glu Ala Lys Ile
115 120 125
Ile Leu Thr Val Arg Asp Asn Ala Asp Val Trp Tyr Asp Ser Val Leu
130 135 140
Asn Thr Ile Trp Arg Val Ser Asn Phe Leu Arg Ala Pro Pro Arg Thr
145 150 155 160
Leu Thr Gln Arg Val Val Gln Ala Ile Leu Pro Lys Pro Asp Phe Asn
165 170 175
Ile Phe Lys Tyr Ser Pro Leu Gly Asn Phe Pro Glu Glu Gly Cys Gln
180 185 190
Trp Tyr Ser Asp Trp Asn Glu Glu Ile Arg Thr Leu Ala Lys Gly Arg
195 200 205
Asp Phe Leu Glu Phe Asn Val Lys Glu Gly Trp Gly Pro Leu Cys Arg
210 215 220
Phe Leu Glu Val Glu Gln Pro Glu Thr Pro Phe Pro Arg Val Asn Asp
225 230 235 240
Ser Asn Thr Phe Lys Glu Phe His Asp Lys Gly Leu Glu Gln Asp Ile
245 250 255
Gln Arg Leu Val Gly Ile Ser Thr Lys Leu Val Ala Ala Val Gly Val
260 265 270
Leu Gly Leu Ala Val Trp Leu Ala Arg Lys
275 280
<210> 5
<211> 282
<212> PRT
<213> Coleophoma crateriformis
<400> 5
Met Ala Leu Asp Arg Gln Asn Ala Asn Ile Thr Thr Phe Gly Leu Ala
1 5 10 15
Arg Pro Lys Thr Asn Ile Asp Arg Arg Ser Cys Lys Arg Asn Val Pro
20 25 30
Met Lys Val Leu Cys Leu Gly Leu Cys Arg Thr Gly Thr Ser Ser Leu
35 40 45
Arg Ala Ala Leu Leu Glu Leu Gly Leu Asp Asp Val Tyr His Met Cys
50 55 60
Ser Val Thr Glu Glu Asn Pro Pro Asp Ala Asn Leu Trp Lys Glu Ala
65 70 75 80
Phe Asp Ala Lys Tyr Glu Gly Ile Gly Lys Pro Tyr Gly Lys Asp Glu
85 90 95
Phe Asp Ala Leu Leu Gly His Cys Met Ala Thr Ala Asp Phe Pro Ser
100 105 110
Ile Ser Phe Ala Pro Glu Leu Leu Ala Ala Tyr Pro Asp Ala Lys Val
115 120 125
Ile Leu Thr Val Arg Asp Asn Ala Asp Val Trp Tyr Asp Ser Val Leu
130 135 140
Asn Thr Ile Trp Lys Val Ser Asn Phe Leu Arg Ala Pro Pro Arg Thr
145 150 155 160
Leu Thr Gln Arg Ile Val Gln Ala Ile Leu Pro Lys Pro Ala Phe Asn
165 170 175
Ile Phe Lys Tyr Ser Pro Leu Gly Asn Phe Pro Glu Glu Gly Arg Gln
180 185 190
Trp Tyr Ser Asp Trp Asn Glu Glu Ile Lys Thr Leu Ala Lys Gly Arg
195 200 205
Glu Phe Leu Glu Phe Asn Val Lys Gln Gly Trp Gly Pro Leu Cys Lys
210 215 220
Phe Leu Glu Val Glu Gln Pro Lys Thr Ala Phe Pro Arg Val Asn Asp
225 230 235 240
Ser Asn Thr Phe Lys Glu Phe His His Lys Gly Leu Trp Leu Asp Val
245 250 255
Gln Arg Leu Val Gly Ile Ser Thr Lys Leu Val Ala Ala Leu Gly Val
260 265 270
Leu Gly Leu Ala Val Trp Leu Ala Lys Lys
275 280
<210> 6
<211> 275
<212> PRT
<213> Venustampulla echinocandica
<400> 6
Met Ala Ser Asp Leu Gln Asn Gly Gln Leu Thr Thr Met Gly Leu Leu
1 5 10 15
Arg Pro Lys Thr Asn Ile Asp Arg Arg Ser Cys Lys Arg Val Val Pro
20 25 30
Met Lys Val Ile Cys Leu Gly Leu Cys Arg Thr Gly Thr Ser Ser Leu
35 40 45
Arg Ala Ala Leu Phe Glu Leu Gly Leu Asn Asp Val Tyr His Met Phe
50 55 60
Ser Val Thr Thr Glu Asn Pro Leu Asp Ala Glu Leu Trp Lys Glu Ala
65 70 75 80
Tyr Asp Ala Lys Tyr Lys Gly Ile Gly Lys Pro Tyr Gly Lys Glu Glu
85 90 95
Phe Asp Ala Leu Leu Gly His Cys Met Ala Thr Thr Asp Phe Pro Gly
100 105 110
Ile Ser Phe Ala Pro Glu Leu Leu Ala Ala Tyr Pro Asp Ala Lys Val
115 120 125
Ile Leu Thr Val Arg Asp Asn Gly Asp Val Trp Tyr Asp Ser Val Phe
130 135 140
Asn Thr Ile Trp Thr Val Ser Asn Phe Leu Arg Ala Pro Pro Lys Thr
145 150 155 160
Leu Thr Gln Arg Leu Val Gln Ala Ile Leu Pro Lys Pro His Phe Asn
165 170 175
Val Phe Glu His Thr Pro Leu Gly Asn Phe Pro Val Glu Gly Arg Gln
180 185 190
Trp Tyr Asp Asp Trp Asn Glu Asp Ile Arg Thr Arg Ala Lys Gly Arg
195 200 205
Glu Phe Leu Glu Phe Asn Val Lys Gln Gly Trp Gly Pro Leu Cys Glu
210 215 220
Phe Leu Gly Val Glu Gln Pro Lys Ala Lys Phe Pro Arg Val Asn Asp
225 230 235 240
Ser Ala Ser Phe Lys Glu Thr His Asn Asn Asp Leu Leu Arg Val Gly
245 250 255
Ala Lys Val Val Ala Ala Leu Ser Val Leu Gly Leu Ala Val Trp Leu
260 265 270
Ala Lys Lys
275
Claims (10)
1.选自以下i-ii任意一种或两种的酶,或包含所述酶的生物材料在对棘白菌素类化合物进行羟基化、磺酰化或氧磺酰化中的应用;
i、细胞色素P450单加氧酶,所述细胞色素P450单加氧酶与SEQ ID No.2相比具有至少75%的序列同一性;
ii、磺酰基转移酶,所述磺酰基转移酶与SEQ ID No.4相比具有至少70%的序列同一性;
所述生物材料选自:所述酶的编码基因,或者包含所述编码基因的载体,或者包含所述载体的宿主细胞。
2.根据权利要求1所述的应用,其特征在于,所述对棘白菌素类化合物进行羟基化、磺酰化或氧磺酰化包括对棘白菌素类化合物的L-高酪氨酸苯环上的C3’位进行羟基化、磺酰化或氧磺酰化。
3.根据权利要求1或2所述的应用,其特征在于,所述酶为细胞色素P450单加氧酶,所述应用为对棘白菌素类化合物进行羟基化。
4.根据权利要求1或2所述的应用,其特征在于,所述酶为磺酰基转移酶,所述应用为对棘白菌素类化合物进行磺酰化。
5.根据权利要求1或2所述的应用,其特征在于,所述酶为细胞色素P450单加氧酶和磺酰基转移酶,所述应用为对棘白菌素类化合物进行氧磺酰化。
6.根据权利要求1所述的应用,其特征在于,
所述细胞色素P450单加氧酶与SEQ ID No.2相比具有至少70%的序列同一性,优选的,所述细胞色素P450单加氧酶来源于鞘茎点霉属真菌(Coleophoma sp.);
所述磺酰基转移酶与SEQ ID No.4相比具有至少70%的序列同一性,优选的,所述磺酰基转移酶来源于鞘茎点霉属真菌(Coleophoma sp.);或者,所述磺酰基转移酶与SEQ IDNo.5相比具有至少70%的序列同一性,优选的,所述磺酰基转移酶来源于Coleophomacrateriformis;或者,所述磺酰基转移酶与SEQ ID No.6相比具有至少70%的序列同一性,优选的,所述磺酰基转移酶来源于Venustampulla echinocandica。
7.根据权利要求1所述的应用,其特征在于,所述棘白菌素类化合物选自FR133302或纽莫康定B0或其衍生物。
8.根据权利要求1所述的应用,其特征在于,所述棘白菌素类化合物选自式I-式VIII中的一种或任意几种。
9.一种对棘白菌素类化合物进行羟基化、磺酰化或氧磺酰化的方法,所述方法包括利用权利要求1-8任意权利要求中的酶或包含所述酶的生物材料对棘白菌素类化合物进行羟基化、磺酰化或氧磺酰化的步骤。
10.根据权利要求9所述的方法,其特征在于,所述方法为对棘白菌素类化合物的L-高酪氨酸苯环上的C3’位进行羟基化、磺酰化或氧磺酰化。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210570357.8A CN117143838A (zh) | 2022-05-24 | 2022-05-24 | 对棘白菌素类化合物氧磺酰化的酶及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210570357.8A CN117143838A (zh) | 2022-05-24 | 2022-05-24 | 对棘白菌素类化合物氧磺酰化的酶及其应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117143838A true CN117143838A (zh) | 2023-12-01 |
Family
ID=88884842
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210570357.8A Pending CN117143838A (zh) | 2022-05-24 | 2022-05-24 | 对棘白菌素类化合物氧磺酰化的酶及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117143838A (zh) |
-
2022
- 2022-05-24 CN CN202210570357.8A patent/CN117143838A/zh active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2022204012C1 (en) | Methods and materials for biosynthesis of mogroside compounds | |
Proctor et al. | Co-expression of 15 contiguous genes delineates a fumonisin biosynthetic gene cluster in Gibberella moniliformis | |
US11965181B2 (en) | Increased biosynthesis of benzylisoquinoline alkaloids and benzylisoquinoline alkaloid precursors in a recombinant host cell | |
Liu et al. | Identification of virulence genes in the crucifer anthracnose fungus Colletotrichum higginsianum by insertional mutagenesis | |
WO2018229283A1 (en) | Production of mogroside compounds in recombinant hosts | |
WO2013058655A1 (en) | Methods and compositions for producing drimenol | |
Ruocco et al. | Polyketide synthases of Diaporthe helianthi and involvement of DhPKS1 in virulence on sunflower | |
US20230193333A1 (en) | Norcoclaurine Synthases With Increased Activity | |
CN112011470A (zh) | 一种高产反式乌头酸的基因工程菌及其构建方法与应用 | |
CN117143838A (zh) | 对棘白菌素类化合物氧磺酰化的酶及其应用 | |
CN117143199A (zh) | 氧磺酰化纽莫康定b0及其制备方法和应用 | |
CN114854714A (zh) | 一种菜豆源环氧化物酶突变体、基因、载体、工程菌及制备方法和应用 | |
CN112410353B (zh) | 一种fkbS基因、含其的基因工程菌及其制备方法和用途 | |
CN117143748A (zh) | 高产fr901379的菌株及其构建方法与应用 | |
CN107723308B (zh) | 一种化合物balanol的生物合成方法及基因簇 | |
CN102260644B (zh) | 一株淡黄色链霉菌突变菌株,构建方法及其用途 | |
CN117143749A (zh) | 一株高产fr901379的菌株及其构建方法与应用 | |
KR101164726B1 (ko) | 지랄레논 유도성 프로모터를 포함하는 재조합벡터 및 이를 이용한 단백질 생산방법 및 지랄레논 검출방법 | |
Choi et al. | Genetic localization of epicoccamide biosynthetic gene cluster in Epicoccum nigrum KACC 40642 | |
WO2024017105A1 (zh) | 提高棘白菌素类化合物产量的转录因子及其应用 | |
KR20120127617A (ko) | 효소 공법에 의한 피리피로펜 유도체의 제조 방법 | |
CN112094330B (zh) | 多硫代二酮哌嗪合成相关蛋白质及其相关生物材料与应用 | |
CN109593740A (zh) | 一种糖基转移酶及其应用 | |
CN118063531B (zh) | 大环内酯类化合物PA-46101s C-E的制备及其应用 | |
CN114410604B (zh) | 环氧化物水解酶及其编码基因和应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |