CN114277024A - 一种新型三萜合酶及其应用 - Google Patents
一种新型三萜合酶及其应用 Download PDFInfo
- Publication number
- CN114277024A CN114277024A CN202011032594.6A CN202011032594A CN114277024A CN 114277024 A CN114277024 A CN 114277024A CN 202011032594 A CN202011032594 A CN 202011032594A CN 114277024 A CN114277024 A CN 114277024A
- Authority
- CN
- China
- Prior art keywords
- polypeptide
- cell
- gene
- polynucleotide
- oxidosqualene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 150000003648 triterpenes Chemical class 0.000 title abstract description 24
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 21
- 229920001184 polypeptide Polymers 0.000 claims description 101
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 101
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 101
- 210000004027 cell Anatomy 0.000 claims description 86
- 108091033319 polynucleotide Proteins 0.000 claims description 36
- 102000040430 polynucleotide Human genes 0.000 claims description 36
- 239000002157 polynucleotide Substances 0.000 claims description 36
- 108090000623 proteins and genes Proteins 0.000 claims description 35
- 238000000034 method Methods 0.000 claims description 32
- QYIMSPSDBYKPPY-RSKUXYSASA-N (S)-2,3-epoxysqualene Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C=C(/C)CC\C=C(/C)CC[C@@H]1OC1(C)C QYIMSPSDBYKPPY-RSKUXYSASA-N 0.000 claims description 30
- QYIMSPSDBYKPPY-UHFFFAOYSA-N OS Natural products CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC1OC1(C)C QYIMSPSDBYKPPY-UHFFFAOYSA-N 0.000 claims description 30
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 20
- 238000004519 manufacturing process Methods 0.000 claims description 19
- 239000012634 fragment Substances 0.000 claims description 16
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 13
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 13
- 239000013598 vector Substances 0.000 claims description 13
- 239000002773 nucleotide Substances 0.000 claims description 11
- 125000003729 nucleotide group Chemical group 0.000 claims description 11
- 241000208422 Rhododendron Species 0.000 claims description 10
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 9
- 210000005253 yeast cell Anatomy 0.000 claims description 9
- 239000000203 mixture Substances 0.000 claims description 8
- 125000000539 amino acid group Chemical group 0.000 claims description 7
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 7
- 101150051269 ERG10 gene Proteins 0.000 claims description 6
- 101150071502 ERG12 gene Proteins 0.000 claims description 6
- 101150014913 ERG13 gene Proteins 0.000 claims description 6
- 101150045041 ERG8 gene Proteins 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 6
- 101150075592 idi gene Proteins 0.000 claims description 6
- 210000004962 mammalian cell Anatomy 0.000 claims description 6
- 101150023788 ERG19 gene Proteins 0.000 claims description 5
- 241000238631 Hexapoda Species 0.000 claims description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 5
- 230000003197 catalytic effect Effects 0.000 claims description 5
- 244000063299 Bacillus subtilis Species 0.000 claims description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 4
- 241000588724 Escherichia coli Species 0.000 claims description 4
- 230000002538 fungal effect Effects 0.000 claims description 4
- 101150084750 1 gene Proteins 0.000 claims description 2
- 239000006143 cell culture medium Substances 0.000 claims description 2
- 230000000295 complement effect Effects 0.000 claims description 2
- 239000004615 ingredient Substances 0.000 claims description 2
- -1 triterpene compound Chemical class 0.000 abstract description 18
- SFUCGABQOMYVJW-MRVPVSSYSA-N 4-[(3r)-3-hydroxybutyl]phenol Chemical compound C[C@@H](O)CCC1=CC=C(O)C=C1 SFUCGABQOMYVJW-MRVPVSSYSA-N 0.000 abstract description 11
- SFUCGABQOMYVJW-UHFFFAOYSA-N rac-rhododendrol Natural products CC(O)CCC1=CC=C(O)C=C1 SFUCGABQOMYVJW-UHFFFAOYSA-N 0.000 abstract description 11
- 239000002243 precursor Substances 0.000 abstract description 5
- 238000002360 preparation method Methods 0.000 abstract description 4
- 101100339482 Colletotrichum orbiculare (strain 104-T / ATCC 96160 / CBS 514.97 / LARS 414 / MAFF 240422) HOG1 gene Proteins 0.000 description 26
- 239000000047 product Substances 0.000 description 19
- 102000004169 proteins and genes Human genes 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 15
- 230000006870 function Effects 0.000 description 14
- 150000001413 amino acids Chemical class 0.000 description 13
- 241000196324 Embryophyta Species 0.000 description 10
- 108091026890 Coding region Proteins 0.000 description 9
- 238000000855 fermentation Methods 0.000 description 9
- 230000004151 fermentation Effects 0.000 description 9
- XVXPXUMUGATHPD-JMJRLLIOSA-N Simiarenol Chemical compound C([C@@H]1[C@@]2(C)CC[C@@]3(C)[C@H]([C@@]2(CC[C@]11C)C)CC[C@@H]3C(C)C)C=C2[C@H]1CC[C@H](O)C2(C)C XVXPXUMUGATHPD-JMJRLLIOSA-N 0.000 description 8
- XVXPXUMUGATHPD-UHFFFAOYSA-N Simiarol Natural products CC(C)C1CCC(C2(CCC34C)C)C1(C)CCC2(C)C3CC=C1C4CCC(O)C1(C)C XVXPXUMUGATHPD-UHFFFAOYSA-N 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 240000008042 Zea mays Species 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- 238000005481 NMR spectroscopy Methods 0.000 description 4
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 235000005822 corn Nutrition 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000001262 western blot Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 241001052560 Thallis Species 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000002708 enhancing effect Effects 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 229930014626 natural product Natural products 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 238000010188 recombinant method Methods 0.000 description 3
- 238000007363 ring formation reaction Methods 0.000 description 3
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 2
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 2
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- HEDRZPFGACZZDS-MICDWDOJSA-N Trichloro(2H)methane Chemical compound [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 235000007244 Zea mays Nutrition 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- QYIMSPSDBYKPPY-BANQPHDMSA-N 2,3-epoxysqualene Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C=C(/C)CC\C=C(/C)CCC1OC1(C)C QYIMSPSDBYKPPY-BANQPHDMSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 102000057234 Acyl transferases Human genes 0.000 description 1
- 108700016155 Acyl transferases Proteins 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- 241001435059 Artemisia argyi Species 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- BUUVFIAZIOIEIN-UBHSHLNASA-N Cys-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N BUUVFIAZIOIEIN-UBHSHLNASA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 1
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 1
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 101100286286 Dictyostelium discoideum ipi gene Proteins 0.000 description 1
- 241001057636 Dracaena deremensis Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 101100025321 Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) ERG19 gene Proteins 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000222722 Leishmania <genus> Species 0.000 description 1
- 208000004554 Leishmaniasis Diseases 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 101100445407 Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) erg10B gene Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- WQUURFHRUAZQHU-VGWMRTNUSA-N Pro-Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 WQUURFHRUAZQHU-VGWMRTNUSA-N 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 101100025327 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MVD1 gene Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- BLRPTPMANUNPDV-UHFFFAOYSA-N Silane Chemical group [SiH4] BLRPTPMANUNPDV-UHFFFAOYSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 1
- DTPARJBMONKGGC-IHPCNDPISA-N Trp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N DTPARJBMONKGGC-IHPCNDPISA-N 0.000 description 1
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 1
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 1
- VOCHZIJXPRBVSI-XIRDDKMYSA-N Trp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VOCHZIJXPRBVSI-XIRDDKMYSA-N 0.000 description 1
- RQLNEFOBQAVGSY-WDSOQIARSA-N Trp-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQLNEFOBQAVGSY-WDSOQIARSA-N 0.000 description 1
- HJWLQSFTGDQSRX-BPUTZDHNSA-N Trp-Met-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HJWLQSFTGDQSRX-BPUTZDHNSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- LYPKCSYAKLTBHJ-ILWGZMRPSA-N Tyr-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N)C(=O)O LYPKCSYAKLTBHJ-ILWGZMRPSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010084217 alanyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010028144 alpha-Glucosidases Proteins 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 101150014423 fni gene Proteins 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000005802 health problem Effects 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011090 industrial biotechnology method and process Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 230000001678 irradiating effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 239000005445 natural material Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 235000017807 phytochemicals Nutrition 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 108010038196 saccharide-binding proteins Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 229910000077 silane Inorganic materials 0.000 description 1
- 238000010898 silica gel chromatography Methods 0.000 description 1
- MFFMDFFZMYYVKS-SECBINFHSA-N sitagliptin Chemical compound C([C@H](CC(=O)N1CC=2N(C(=NN=2)C(F)(F)F)CC1)N)C1=CC(F)=C(F)C=C1F MFFMDFFZMYYVKS-SECBINFHSA-N 0.000 description 1
- 229960004034 sitagliptin Drugs 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供了一种新型三萜合酶、其制备及其应用。本发明的三萜合酶能有效地催化三萜化合物前体形成西米杜鹃醇。本发明的三萜合酶能够在酵母等工程细胞中表达以及进行西米杜鹃醇合酶的生产,具有非常好的应用前景。
Description
技术领域
本发明涉及生物技术和植物生物学领域;更具体地,本发明涉及一种新型三萜合酶及其应用。
背景技术
三萜化合物是植物中广泛存在的一大类天然产物,其结构复杂,功能多样,在众多领域发挥着重要功能。三萜合酶作为三萜化合物合成途径中的关键酶,形成结构多样的核心骨架,是三萜化合物结构多样性的基础。
三萜合酶催化2,3环氧角鲨烯形成不同的碳骨架,进一步由细胞色素P450、糖基转移酶和酰基转移酶等修饰酶的修饰,产生结构多样的三萜化合物,广泛应用于食品、医药和工业生物技术等领域。因此探索新功能的三萜合酶或具有新异三萜碳骨架的三萜合酶元件有助于开发自然界中丰富的三萜化合物资源。
目前植物中已鉴定了100余种三萜骨架,包含85条不同功能的三萜合酶,还有一些具有生物活性的三萜化合物是通过植物化学法分离到并鉴定结构的,然而相关的三萜合酶元件并未分离到,导致无法大量合成此类化合物,阻碍了其功能的全面解析与应用。因此本领域有必要开发三萜合酶元件的方法,加快解析自然界中丰富的三萜合酶。
综上,新型的三萜合酶的开发以及将之应用于生产工业上、药物学上有用的产品,是本领域迫切所需的。
发明内容
本发明的目的在于提供一种新型三萜合酶及其应用。
在本发明的第一方面,提供一种分离的多肽,该多肽选自下组:
(a)SEQ ID NO:2所示氨基酸序列的多肽;
(b)与SEQ ID NO:2所示的氨基酸序列有至少70%相同性(较佳地75%以上;更佳地80%以上;更佳85%以上,如90%,95%,98%或99%以上),且具有(a)所限定的多肽的功能(包括催化2,3环氧角鲨烯生成西米杜鹃醇功能)的多肽;
(c)将(a)所限定的多肽的氨基酸序列经过一个或多个(如1-20个,较佳地1-10个;更佳地1-5个;更佳地1-3个)氨基酸残基的取代、缺失或添加而形成的,且具有(a)所限定的多肽的功能的多肽;或
(d)(a)~(c)任一所述多肽的片段,其包含有该多肽的催化结构域,且具有(a)所限定的多肽的功能;
(e)(a)~(d)任一所述多肽的N或C末端添加标签序列,或在其N末端添加信号肽序列后形成的多肽。
在本发明的另一方面,提供一种分离的多核苷酸,它包含一核苷酸序列,该核苷酸序列选自下组:(1)编码如所述多肽的多核苷酸;(2)与多核苷酸(1)互补的多核苷酸。
在一个优选例中,该多核苷酸编码如SEQ ID NO:2所示氨基酸序列的多肽;较佳地,该多核苷酸的核苷酸序列如SEQ ID NO:1所示。
在本发明的另一方面,提供一种载体,它含有前面任一所述的多核苷酸。
在本发明的另一方面,提供一种遗传工程化的宿主细胞,它含有所述的载体,或其基因组中整合有前面任一所述的多核苷酸。
在另一优选例中,所述的整合包括定向整合或随机整合。
在另一优选例中,所述宿主细胞包括(但不限于):原核细胞或真核细胞;较佳地,所述原核细胞包括大肠杆菌、枯草杆菌;所述真核细胞包括(但不限于):酵母细胞、真菌细胞、昆虫细胞、哺乳动物细胞;较佳地,所述宿主细胞为酵母细胞。
在本发明的另一方面,提供一种制备所述的多肽的方法,包括:(i)培养所述的宿主细胞;(ii)收集含有所述的多肽的培养物;(iii)从培养物中分离出所述的多肽。
在本发明的另一方面,提供所述的多肽的用途,用于催化2,3环氧角鲨烯生成西米杜鹃醇。
在本发明的另一方面,提供一种组合物,其包含所述的多肽;以及工业学或微生物学上可接受的载体。
在本发明的另一方面,提供一种组合物,其包含所述的宿主细胞;以及工业学或微生物学上可接受的载体。
在本发明的另一方面,提供一种生产西米杜鹃醇的方法,包括:应用所述的多肽、所述的宿主细胞或述的组合物催化2,3环氧角鲨烯,生成西米杜鹃醇。
在本发明的另一方面,提供一种生产西米杜鹃醇的方法,包括:
(1)提供一工程细胞,该细胞:包含2,3环氧角鲨烯,能吸收胞外的2,3环氧角鲨烯,或具有2,3环氧角鲨烯生产途径;
(2)在(1)所述工程细胞中表达所述的多肽;
(3)培养(2)的工程细胞,生产西米杜鹃醇。
在一个优选例中,所述方法还包括:在所述工程细胞中过表达ERG10基因,ERG13基因,ERG12基因,ERG8基因,ERG19基因,IDI基因,tHMG1基因。
在本发明的另一方面,提供一种用于生产西米杜鹃醇的工程细胞,该细胞:包含2,3环氧角鲨烯,能吸收胞外的2,3环氧角鲨烯或具有2,3环氧角鲨烯生产途径;且,该细胞表达所述的多肽。
在一个优选例中,所述工程细胞中过表达ERG10基因,ERG13基因,ERG12基因,ERG8基因,ERG19基因,IDI基因,tHMG1基因。
在另一优选例中,所述的工程细胞包括(但不限于):原核细胞或真核细胞;较佳地,所述原核细胞包括大肠杆菌、枯草杆菌;所述真核细胞包括(但不限于):酵母细胞、真菌细胞、昆虫细胞、哺乳动物细胞;较佳地,所述工程细胞为酵母细胞;更佳地,所述工程细胞为酿酒酵母细胞。
在本发明的另一方面,提供一种用于生产西米杜鹃醇的试剂盒,其中包括:前面任一所述的工程细胞;较佳地,其中还包括细胞培养基或培养组分;较佳地,其中还包括使用说明书。
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。
附图说明
图1、两对引物SEQ ID NO:1和2的PCR产物的琼脂糖凝胶电泳检测结果。其中,泳道1为翊圣生物公司的1kb DNA ladder Marker,泳道2为ZmOSC1的PCR产物。
图2、西米杜鹃醇合酶ZmOSC1 Western印迹分析。
图3、重组酿酒酵母pESC-Leu-ZmOSC1/WP7发酵产物的HPLC图。其中,编号为1号的HPLC峰图为pESC-Leu/WP7空载质粒作为阴性对照的测定峰图,编号为2号的HPLC峰图为pESC-Leu-ZmOSC1/WP7的测定峰图。
具体实施方式
本发明人经过大规模的筛选和深入的研究,从玉米植物中分离到了一种新颖的三萜合酶,为西米杜鹃醇合酶(Simiarenol Synthase),本发明人将之命名为(Zm)OSC1。本发明的OSC1能有效地催化三萜化合物前体形成西米杜鹃醇(Simiarenol)。本发明的三萜合酶能够在酵母等工程细胞中表达以及进行西米杜鹃醇合酶的生产,具有非常好的应用前景。
新型多肽、其核酸、构建体及制备
如本文所用,术语“本发明的多肽”、“本发明的蛋白”、“西米杜鹃醇合酶”、“三萜合酶”、“OSC1”可互换使用,都指具有SEQ ID NO:2或其片段或其变异形式或衍生物的蛋白或多肽。
如本文所用,“分离的”是指物质从其原始环境中分离出来(如果是天然的物质,原始环境即是天然环境)。如活体细胞内的天然状态下的多聚核苷酸和多肽是没有分离纯化的,但同样的多聚核苷酸或多肽如从天然状态中同存在的其他物质中分开,则为分离纯化的。
如本文所用,“分离的多肽(本发明中为OSC1)”是指本发明所述OSC1基本上不含天然与其相关的其它蛋白、脂类、糖类或其它物质。本领域的技术人员能用标准的蛋白质纯化技术纯化所述OSC1。基本上纯的多肽在非还原聚丙烯酰胺凝胶上能产生单一的主带。所述OSC1的纯度能用氨基酸序列分析。
本发明中,所述“含有”表示各种成分可一起应用于本发明的混合物或组合物中。因此,术语“主要由...组成”和“由...组成”包含在术语“含有”中。
本发明的多肽(OSC1)可以是重组多肽、天然多肽、合成多肽,优选重组多肽。本发明的多肽可以是天然纯化的产物,或是化学合成的产物,或使用重组技术从真核或原核宿主(例如,细菌、酵母、高等植物、昆虫和哺乳动物细胞)中产生。根据重组生产方案所用的宿主,本发明的多肽可以是糖基化的,或可以是非糖基化的。本发明的多肽还可包括或不包括起始的甲硫氨酸残基。
本发明还包括所述OSC1的片段、衍生物和类似物。如本文所用,术语“片段”、“衍生物”和“类似物”是指基本上保持本发明的天然OSC1相同的生物学功能或活性的多肽。本发明的多肽片段、衍生物或类似物可以是(i)有一个或多个保守或非保守性氨基酸残基(优选保守性氨基酸残基)被取代的多肽,而这样的取代的氨基酸残基可以是也可以不是由遗传密码编码的,或(ii)在一个或多个氨基酸残基中具有取代基团的多肽,或(iii)成熟多肽与另一个化合物(比如延长多肽半衰期的化合物,例如聚乙二醇)融合所形成的多肽,或(iv)附加的氨基酸序列融合到此多肽序列而形成的多肽(如前导序列或分泌序列或用来纯化此多肽的序列或蛋白原序列,或与抗原IgG片段的形成的融合蛋白)。根据本文的教导,这些片段、衍生物和类似物属于本领域熟练技术人员公知的范围。
在本发明中,术语“所述OSC1”指具有所述OSC1活性的SEQ ID NO:2序列的多肽。该术语还包括具有与所述OSC1相同功能的、SEQ ID NO:2序列的变异形式。这些变异形式包括(但并不限于):一个或多个(通常为1-50个,较佳地1-30个,更佳地1-20个,更佳地1-10个,最佳地1-5个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加或缺失一个或数个(通常为20个以内,较佳地为10个以内,更佳地为5个以内)氨基酸。例如,在本领域中,用性能相近或相似的氨基酸进行取代时,通常不会改变蛋白质的功能。比如,在C末端和/或N末端添加或缺失一个或数个氨基酸通常也不会改变蛋白质的功能;又比如,仅表达该蛋白的催化结构域,而不表达碳水化合物结合结构域也能获得和完整蛋白同样的催化功能。因此该术语还包括所述OSC1的活性片段和活性衍生物。例如,变异可以发生在SEQ ID NO:2的催化功能域之外。
该多肽的变异形式包括:同源序列、保守性变异体、等位变异体、天然突变体、诱导突变体、在高或低的严谨度条件下能与所述OSC1DNA杂交的DNA所编码的蛋白、以及利用抗所述OSC1的抗体获得的多肽或蛋白。本发明还提供了其他多肽,如包含所述OSC1或其片段的融合蛋白。除了几乎全长的多肽外,本发明还包括了所述OSC1的片段。通常,该片段具有所述OSC1序列的至少约10个连续氨基酸,通常至少约30个连续氨基酸,较佳地至少约50个连续氨基酸,更佳地至少约80个连续氨基酸,最佳地至少约100个连续氨基酸。
发明还提供所述OSC1蛋白或多肽的类似物。这些类似物与天然所述OSC1的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。这些多肽包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的多肽并不限于上述例举的代表性的多肽。
在本发明中,“所述OSC1的保守性变异体”指与SEQ ID NO:2的氨基酸序列相比,有至多30个,较佳地至多20个,更佳地至多10个,更佳地至多5个氨基酸被性质相似或相近的氨基酸所替换而形成多肽。这些保守性变异体较佳地根据表1进行氨基酸替换而产生。
表1
本发明的OSC1的氨基端或羧基端还可含有一个或多个多肽片段,作为蛋白标签。任何合适的标签都可以用于本发明。例如,所述的标签可以是FLAG,HA,HA1,c-Myc,Poly-His,Poly-Arg,Strep-TagII,AU1,EE,T7,4A6,ε,B,gE以及Ty1。
为了使翻译的蛋白分泌表达(如分泌到细胞外),还可在所述所述OSC1的氨基酸氨基末端用宿主适用的信号肽替换本身信号肽序列。信号肽在多肽从细胞内分泌出来的过程中可被切去。
本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码成熟多肽的编码区序列可以与OSC1的天然编码序列或与SEQ ID NO:1所示编码序列相同或者是简并的变异体。如本文所用,“简并的变异体”在本发明中是指编码具有SEQ ID NO:2的蛋白质,但与OSC1的天然编码序列或SEQ ID NO:1所示编码序列有差别的核酸序列。
编码SEQ ID NO:2的成熟多肽的多核苷酸包括:只编码成熟多肽的编码序列;成熟多肽的编码序列和各种附加编码序列;成熟多肽的编码序列(和任选的附加编码序列)以及非编码序列。术语“编码多肽的多核苷酸”可以是包括编码此多肽的多核苷酸,也可以是还包括附加编码和/或非编码序列的多核苷酸。
本发明还涉及上述多核苷酸的变异体,其编码与本发明有相同的氨基酸序列的多肽或多肽的片段、类似物和衍生物。此多核苷酸的变异体可以是天然发生的等位变异体或非天然发生的变异体。这些核苷酸变异体包括取代变异体、缺失变异体和插入变异体。如本领域所知的,等位变异体是一个多核苷酸的替换形式,它可能是一个或多个核苷酸的取代、缺失或插入,但不会从实质上改变其编码的多肽的功能。
本发明还涉及与上述的序列杂交且两个序列之间具有至少50%,较佳地至少70%,更佳地至少80%相同性的多核苷酸。本发明特别涉及在严格条件(或严谨条件)下与本发明所述多核苷酸可杂交的多核苷酸。在本发明中,“严格条件”是指:(1)在较低离子强度和较高温度下的杂交和洗脱,如0.2×SSC,0.1%SDS,60℃;或(2)杂交时加有变性剂,如50%(v/v)甲酰胺,0.1%小牛血清/0.1%Ficoll,42℃等;或(3)仅在两条序列之间的相同性至少在90%以上,更好是95%以上时才发生杂交。并且,可杂交的多核苷酸编码的多肽与SEQ ID NO:2所示的成熟多肽有相同的生物学功能和活性。
本发明中的多肽和多核苷酸优选以分离的形式提供,更佳地被纯化至均质。
所述OSC1核苷酸全长序列或其片段通常可以用PCR扩增法、重组法或人工合成的方法获得。目前,已经可以完全通过化学合成来得到编码本发明蛋白(或其片段,或其衍生物)的DNA序列。然后可将该DNA序列引入本领域中已知的各种现有的DNA分子(或如载体)和细胞中。
本发明也提供了包含本发明的多核苷酸的载体,以及用本发明的载体或所述OSC1编码序列经基因工程产生的宿主细胞,以及经重组技术产生本发明所述多肽的方法。
通过常规的重组DNA技术,可利用本发明的多聚核苷酸序列可用来表达或生产重组的所述OSC1。一般来说有以下步骤:(1).用编码所述OSC1的多核苷酸(或变异体),或用含有该多核苷酸的重组表达载体转化或转导合适的宿主细胞;(2).在合适的培养基中培养的宿主细胞;(3).从培养基或细胞中分离、纯化蛋白质。
本发明中,所述OSC1多核苷酸序列可插入到重组表达载体中。术语“重组表达载体”指本领域熟知的细菌质粒、噬菌体、酵母质粒、植物细胞病毒、哺乳动物细胞病毒如腺病毒、逆转录病毒或其他载体。只要能在宿主体内复制和稳定,任何质粒和载体都可以用。表达载体的一个重要特征是通常含有复制起点、启动子、标记基因和翻译控制元件。
本领域的技术人员熟知的方法能用于构建含编码所述OSC1的DNA序列和合适的转录/翻译控制信号的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等。所述的DNA序列可有效连接到表达载体中的适当启动子上,以指导mRNA合成。表达载体还包括翻译起始用的核糖体结合位点和转录终止子。此外,表达载体优选地包含一个或多个选择性标记基因,以提供用于选择转化的宿主细胞的表型性状。包含上述的适当DNA序列以及适当启动子或者控制序列的载体,可以用于转化适当的宿主细胞,以使其能够表达蛋白质。
宿主细胞可以是原核细胞,如细菌细胞;或是低等真核细胞,如酵母细胞;或是高等真核细胞,如哺乳动物细胞。代表性例子有:大肠杆菌,枯草杆菌,链霉菌属、鼠伤寒沙门氏菌的细菌细胞;真菌细胞如酵母;植物细胞;昆虫细胞(如果蝇S2或Sf9);CHO、COS、293细胞、或Bowes黑素瘤细胞的动物细胞等。在本发明的优选方式中,所述的宿主细胞为真核细胞,较佳地为酵母细胞,如酿酒酵母细胞。
获得的转化子可以用常规方法培养,表达本发明的基因所编码的多肽。根据所用的宿主细胞,培养中所用的培养基可选自各种常规培养基。在适于宿主细胞生长的条件下进行培养。当宿主细胞生长到适当的细胞密度后,用合适的方法(如温度转换或化学诱导)诱导选择的启动子,将细胞再培养一段时间。
在上面的方法中的重组多肽可在细胞内、或在细胞膜上表达、或分泌到细胞外。如果需要,可利用其物理的、化学的和其它特性通过各种分离方法分离和纯化重组的蛋白。这些方法是本领域技术人员所熟知的。这些方法的例子包括但并不限于:常规的复性处理、用蛋白沉淀剂处理(盐析方法)、离心、渗透破菌、超处理、超离心、分子筛层析(凝胶过滤)、吸附层析、离子交换层析、高效液相层析(HPLC)和其它各种液相层析技术及这些方法的结合。
作为本发明的优选方式,采用酵母细胞作为宿主细胞来表达所述OSC1。本发明通过构建表达系统,在大酵母工程细胞中异源表达所述的OSC1。
三萜化合物的生产
经过大量筛选,从玉米中获得了一种新型西米杜鹃醇合酶,其具有催化2,3环氧角鲨烯的环化产生三萜化合物西米杜鹃醇的功能。
本发明的OSC1或其保守性变异体,可应用于特异和高效地催化2,3环氧角鲨烯的环化,从而产生三萜化合物西米杜鹃醇产物。
西米杜鹃醇是一个五环三萜化合物,具有显著的α葡糖苷酶抑制活性,有助于减弱餐后高血糖。此外,西米杜鹃醇可能具有抗利什曼原虫活性,利什曼病是一个世界性的健康问题,在发展中国家高度流行,目前所用药物严重的副作用与耐药性问题使替代药物的研发迫在眉睫。
西米杜鹃醇合酶(Simiarenol Synthase)是合成三萜化合物西米杜鹃醇(Simiarenol)的关键酶,催化三萜化合物前体2,3环氧角鲨烯环化形成三萜化合物西米杜鹃醇。其反应式示意如下:
作为一种可选的方式,可利用本发明的OSC1进行体外生产,可以通过规模化生产本发明的OSC1,将之与前体2,3环氧角鲨烯进行反应,以获得西米杜鹃醇。
作为本发明的优选方式,利用生物合成的方法进行生产。这通常包括:(1)提供一工程细胞,该细胞具有选自下组的至少一方面特征:包含2,3环氧角鲨烯,能吸收胞外的2,3环氧角鲨烯,或具有2,3环氧角鲨烯生产途径;(2)在(1)所述工程细胞中表达所述的多肽;以及(3)培养(2)的工程细胞,生产西米杜鹃醇。在更为优选的方式中,所述方法还包括:从工程细胞的培养物中,分离纯化西米杜鹃醇的步骤。
在利用生物合成的方法进行生产时,作为本发明的优选方式,还包括在细胞中强化2,3环氧角鲨烯的生产途径。更优选地,通过在细胞中引入一系列的基因来进行强化,包括:ERG10基因,ERG13基因,ERG12基因,ERG8基因,ERG19基因,IDI基因,tHMG1基因。也可以通过强化2,3环氧角鲨烯的上游途径的化合物的产生,来提供更多的2,3环氧角鲨烯,作为西米杜鹃醇的前体。应理解,其它一些强化细胞中2,3环氧角鲨烯的产生的方法也可被包含于本发明中。
本发明也提供了用于生物合成西米杜鹃醇的试剂盒,其中包括:本发明所述的基因工程细胞。较佳地,所述试剂盒中还可包括适于进行所述基因工程细胞的培养的培养基或培养组分。较佳地,所述试剂盒中还包括说明进行生物合成的方法的使用说明书,以指导本领域技术人员以适当的方法来进行生产。
本发明为将来利用合成生物学技术来大量合成西米杜鹃醇提供了必须的元件。
相对于传统的植物提取手段,微生物发酵具有速度快、受外界因素影响较小等优势;部分化合物通过微生物合成的产量远高于植物提取,已经成为天然产物获得的一种重要手段。西米杜鹃醇天然丰度低,并且在植物(例如来自植物艾叶)提取物中分离纯化的程序繁琐复杂。本发明中使用微生物发酵的方式来定向得合成西米杜鹃醇,可极为有效地降低分离纯化这类化合物的成本。
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如J.萨姆布鲁克等编著,分子克隆实验指南,第三版,科学出版社,2002中所述的条件,或按照制造厂商所建议的条件。
生物序列信息本发明后续实施例中所涉及的基因、多肽及引物如表1。
表1
实施例1、西米杜鹃醇合酶(ZmOSC1)的克隆
合成两条引物分别具有序列表中SEQ ID NO:3和SEQ ID NO:4的核苷酸序列。
以从玉米中提取的RNA反转录获得的cDNA为模板,利用如上两对引物SEQ ID NO:3和SEQ ID NO:4进行PCR。DNA聚合酶选用宝生物工程(大连)有限公司(Takara)高保真的PrimeStar DNA聚合酶。PCR扩增程序为:98℃2min;98℃10s,58℃15s,68℃3min,共35个循环;68℃7min,降至10℃。PCR产物经琼脂糖凝胶电泳检测,结果如图1。
在紫外下照射,切下目标DNA条带。然后采用Axygen Gel Extraction Kit(AXYGEN公司)从琼脂糖凝胶中回收DNA即为扩增出的三萜合酶基因的DNA片段。利用全式金有限公司的-Blunt Simple Cloning Kit克隆试剂盒,将回收的PCR产物克隆到pEASY载体,所构建的载体分别命名为pEASY-ZmOSC1,经测序获得ZmOSC1的基因序列。
ZmOSC1基因具有序列表中SEQ ID NO:1的核苷酸序列。自SEQ ID NO:1的5’端第1-2277位核苷酸为ZmOSC1的开放阅读框(Open Reading Frame,ORF),自SEQ ID NO:1的5’端的第1-3位核苷酸为ZmOSC1基因的起始密码子ATG,自SEQ ID NO:1的5’端的第2275-2277位核苷酸为ZmOSC1基因的终止密码子TGA。西米杜鹃醇合酶基因ZmOSC1编码一个含有758个氨基酸的蛋白质ZmOSC1,具有SEQ ID NO:2的氨基酸残基序列,用软件预测到该蛋白质的理论分子量大小为87.21kDa,等电点pI为6.40。
实施例2、重组酿酒酵母WP7-pESC-Leu-ZmOSC1的构建
1、工程细胞的建立
强化酵母CENPK-II(购自Euroscarf)的ERG10,ERG13,ERG12,ERG8,ERG19,IDI和tHMG1基因,将这些基因在酵母CENPK-II中过表达,获得高产角鲨烯烃和2,3环氧角鲨烯的酵母工程细胞WP7。其中各个基因的序列信息见表1中。
2、重组酿酒酵母WP7-pESC-Leu-ZmOSC1的建立
合成分别具有序列表中SEQ ID NO:5和SEQ ID NO:6核苷酸序列的两条引物。
在合成的引物SEQ ID NO:5和SEQ ID NO:6(扩增ZmOSC1)两端分别设置有同源臂序列,以从玉米中提取的RNA反转录的cDNA为模板进行PCR。PCR扩增程序同实施例1。PCR产物经琼脂糖凝胶电泳分离、回收后,与经SmalI酶切的pESC-Leu质粒(购自Invitrogen公司)连接,所获得的重组质粒命名为pESC-Leu-ZmOSC1。
将重组质粒pESC-Leu-ZmOSC1及空载体pESC-Leu分别转化酿酒酵母三萜重组细胞WP7中,分别建立重组酿酒酵母WP7-pESC-Leu-ZmOSC1和WP7-pESC-Leu。
实施例3、利用重组酿酒酵母诱导生产西米杜鹃醇
1、诱导培养方法
挑取在固体SCO培养基平板上划线的酵母:WP7-pESC-Leu-ZmOSC1和WP7-pESC-Leu分别于含有4mL液体种子培养基的试管震荡培养过夜(30℃,250rpm,16h);按1%的接种量转接于含有液体合成培养基的250ml锥形瓶中进行摇瓶发酵,30℃,250rpm震荡培养4天得到发酵产物。
2、Western印迹分析
13000g离心30秒收集菌体,加入PH8.0的Tris-HCl缓冲液重悬菌体,转入Fastprep管中6M/s,35s/次震荡破碎4次,4℃12000g离心收集细胞裂解液上清于新的EP管中,再15000离心20分钟收集上清。取样品进行Western blot分析。
西米杜鹃醇合酶ZmOSC1 Western印迹分析结果如附图2。
3、西米杜鹃醇产物提取及检测
取1mL的WP7-pESC-Leu-ZmOSC1/WP7-pESC-Leu(Control)的发酵液离心收集菌体,加入800μL抽提液(20%KOH、50%无水乙醇),用Fastprep震荡裂解酵母,沸水煮30分钟。加入等体积(800μL)的正己烷抽提,而后在真空条件下使正己烷蒸干。用100μL无水乙醇溶解后通过HPLC检测目的产物西米杜鹃醇。
HPLC结果见图3。
从图3中结果可以看出,相对于阴性对照,重组酿酒酵母WP7-pESC-Leu-ZmOSC1发酵产物有两个新峰。经核磁分析证明红色箭头标注的产物峰为西米杜鹃醇,表明本发明中发现的三萜合酶西米杜鹃醇合酶ZmOSC1能催化2,3环氧角鲨烯产生五环三萜化合物西米杜鹃醇。
实施例4、重组酿酒酵母WP7-pESC-Leu-ZmOSC1诱导生产西米杜鹃醇的结构鉴定
西米杜鹃醇产物的纯化:先用硅胶柱层析初步纯化WP7-pESC-Leu-ZmOSC1的发酵液,再用大连依利特公司的C18半制备柱SinoChrom ODS-BP柱(5μm,10.0mm×250mm)进行液相半制备,得到纯化的西米杜鹃醇产物。
西米杜鹃醇产物的鉴定:用Bruker公司的Bruker AdvanceIII 500(1H:500MHz,13C:125MHz)的核磁仪器采集西米杜鹃醇产物的核磁共振数据,以CDCl3为溶剂,三级甲硅烷的C和H的化学位移作为零点参照。
ZmOSC1和simiarenol的13C NMR数据对比见表2。其中,第一列为13C NMR的C,第二列为ZmOSC1的13C NMR数据,第三列为Simiarenol的核磁数据。
表2、ZmOSC1合酶发酵产物的NMR鉴定结果
根据表1,其NMR氢谱和碳谱与已报道的天然产物Simiarenol的NMR氢谱和碳谱数据均完全吻合。
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。
序列表
<110> 中国科学院分子植物科学卓越创新中心
<120> 一种新型三萜合酶及其应用
<130> 204412
<160> 6
<170> SIPOSequenceListing 1.0
<210> 1
<211> 2277
<212> DNA
<213> 玉米(Zea mays L.)
<400> 1
atgtggaggt taaagatcgg agagggggga gggccgtgga tgcagacggt gagcggcttc 60
catggccggc aggtttggga gtacgacccg gacgccggca ccgacgagga gcgcagcaag 120
gtggagcagc ttcgccggga gttcacagag aaccgcttcc gaaggaggga atcgcaggac 180
ctcctcatgc gtatgcagtt aactggacaa aaacatctgc atgcggacga aatgggagcg 240
gccaccaaga tagaggatgg cgacgaggtg acggaggaga ggttgcggga gtcgctgagg 300
cgagcgctgg ggtggatgtc tgctctccaa gctgaagatg gccactggcc tggtgatttc 360
agtgggatta cgtacattat gccgttctgg attttcgccc tgcacatcac aggatccatc 420
gatgttgtcc tatcgaaaga gcatagacgt gagatctgcc gccatatcta caaccatcag 480
aacgaggacg gaggatgggg cttcaacatc ttagatgaga gcgccatgtt ttccacatgc 540
ttaaactaca ccgccctgag acttctgggt gaggtgcagc aggaggagaa cgatgggcta 600
gctaaaggtc gagcatggat cctatcccat ggaactgcaa ctgcagcacc gcagtgggca 660
aaaatactac tctcggttat tggtgtatat gattggcgtg gcaataatcc agttgtacct 720
gaactatggt tggtaccacg tttcctccca attcacccag ggcggttctg gtgtttcact 780
aggattacat acatgtcaat agctttcctt tatggcaaaa aattcgttgg tcccattaca 840
ccaactattt tagaactaag ggaggaactc tacagtttgc cttatgttca gatagattgg 900
agtaaagctc gtaattcttg tgccaaggag gacatgcgca ataaaccttc agaaatattc 960
aaatttattt caacatgctt gaacatgttc gtggagcctg tgttgaacta ttggccactt 1020
aacaagttaa gagagagagc tttgaaccac gttctggaac atatccacta cgaagatgaa 1080
actactcaat atattggcat atctcccgtt acaaaggcac taaacatgat ttgttgctgg 1140
gttgaaaacc caaattcaga tgcgctcaag cgacatattc caaggataca tgattatttg 1200
tggattgctg aagatggaat gaacaccaaa atatacgatg gtacccacaa ctgggagttg 1260
gcacttataa tccaggccat attatcagca gatgctgcca atgagtatgg tccaaccatt 1320
cagagggcta tggaatactt aaagagagca caagttacta caaaccctcc agggaaccca 1380
agttattggt tccgccatag gtcaaaagga tcatggccac tttccactat agacaatggc 1440
tggggctcat cggatactag tgcggaagca actaaggcac ttttgatgat ttccaaggta 1500
tatcccaacc tcgttgaaaa ttcaaatgga gatgagtgga tgttgaatgc tgttgattgc 1560
cttctttcat ttatgaacaa agatggttca gtttctacat ttgaatgtca aagaacttat 1620
tcttggctag agattctgaa tcctttggag agttttcgaa atattgtcgc tgattacccg 1680
acggttgaat gtacatcctc ggtgttgcag gcattggtat tatttgagga attcaattca 1740
gagtaccgca gtaaagaaat aaaagaaaat gttaagaaag ctgccatata tattgagaat 1800
aatcaaaaca aggatggatc ctggtacggc acatggggaa tatgttttgt ctatgggact 1860
ctctatgcaa taaaaggctt agttgctgct gggagaaatt atgagaacag tatttgcatt 1920
aggaaagcat gtaacttttt gctctcaata caactcaaaa ctggtggctg ggcagagagc 1980
tatcattcct gtgaaagaca ggtttatgta gagggtcata gcactcatgt tgtacaaact 2040
gcttgggcga tgctagcttt gatttatact ggacagatgg aacgggaccc aacaccatta 2100
catcgagcag caaaagtatt gattaatatg caactggaga caggagacta tgctcaacaa 2160
gaacatgttg gtagcaccaa ctgctctgta tacttcaact accccaacta tcgtattttg 2220
ttccctgttt gggctcttgg agagtatcat cgcaaagtat gtaccaaaag taattga 2277
<210> 2
<211> 758
<212> PRT
<213> 玉米(Zea mays L.)
<400> 2
Met Trp Arg Leu Lys Ile Gly Glu Gly Gly Gly Pro Trp Met Gln Thr
1 5 10 15
Val Ser Gly Phe His Gly Arg Gln Val Trp Glu Tyr Asp Pro Asp Ala
20 25 30
Gly Thr Asp Glu Glu Arg Ser Lys Val Glu Gln Leu Arg Arg Glu Phe
35 40 45
Thr Glu Asn Arg Phe Arg Arg Arg Glu Ser Gln Asp Leu Leu Met Arg
50 55 60
Met Gln Leu Thr Gly Gln Lys His Leu His Ala Asp Glu Met Gly Ala
65 70 75 80
Ala Thr Lys Ile Glu Asp Gly Asp Glu Val Thr Glu Glu Arg Leu Arg
85 90 95
Glu Ser Leu Arg Arg Ala Leu Gly Trp Met Ser Ala Leu Gln Ala Glu
100 105 110
Asp Gly His Trp Pro Gly Asp Phe Ser Gly Ile Thr Tyr Ile Met Pro
115 120 125
Phe Trp Ile Phe Ala Leu His Ile Thr Gly Ser Ile Asp Val Val Leu
130 135 140
Ser Lys Glu His Arg Arg Glu Ile Cys Arg His Ile Tyr Asn His Gln
145 150 155 160
Asn Glu Asp Gly Gly Trp Gly Phe Asn Ile Leu Asp Glu Ser Ala Met
165 170 175
Phe Ser Thr Cys Leu Asn Tyr Thr Ala Leu Arg Leu Leu Gly Glu Val
180 185 190
Gln Gln Glu Glu Asn Asp Gly Leu Ala Lys Gly Arg Ala Trp Ile Leu
195 200 205
Ser His Gly Thr Ala Thr Ala Ala Pro Gln Trp Ala Lys Ile Leu Leu
210 215 220
Ser Val Ile Gly Val Tyr Asp Trp Arg Gly Asn Asn Pro Val Val Pro
225 230 235 240
Glu Leu Trp Leu Val Pro Arg Phe Leu Pro Ile His Pro Gly Arg Phe
245 250 255
Trp Cys Phe Thr Arg Ile Thr Tyr Met Ser Ile Ala Phe Leu Tyr Gly
260 265 270
Lys Lys Phe Val Gly Pro Ile Thr Pro Thr Ile Leu Glu Leu Arg Glu
275 280 285
Glu Leu Tyr Ser Leu Pro Tyr Val Gln Ile Asp Trp Ser Lys Ala Arg
290 295 300
Asn Ser Cys Ala Lys Glu Asp Met Arg Asn Lys Pro Ser Glu Ile Phe
305 310 315 320
Lys Phe Ile Ser Thr Cys Leu Asn Met Phe Val Glu Pro Val Leu Asn
325 330 335
Tyr Trp Pro Leu Asn Lys Leu Arg Glu Arg Ala Leu Asn His Val Leu
340 345 350
Glu His Ile His Tyr Glu Asp Glu Thr Thr Gln Tyr Ile Gly Ile Ser
355 360 365
Pro Val Thr Lys Ala Leu Asn Met Ile Cys Cys Trp Val Glu Asn Pro
370 375 380
Asn Ser Asp Ala Leu Lys Arg His Ile Pro Arg Ile His Asp Tyr Leu
385 390 395 400
Trp Ile Ala Glu Asp Gly Met Asn Thr Lys Ile Tyr Asp Gly Thr His
405 410 415
Asn Trp Glu Leu Ala Leu Ile Ile Gln Ala Ile Leu Ser Ala Asp Ala
420 425 430
Ala Asn Glu Tyr Gly Pro Thr Ile Gln Arg Ala Met Glu Tyr Leu Lys
435 440 445
Arg Ala Gln Val Thr Thr Asn Pro Pro Gly Asn Pro Ser Tyr Trp Phe
450 455 460
Arg His Arg Ser Lys Gly Ser Trp Pro Leu Ser Thr Ile Asp Asn Gly
465 470 475 480
Trp Gly Ser Ser Asp Thr Ser Ala Glu Ala Thr Lys Ala Leu Leu Met
485 490 495
Ile Ser Lys Val Tyr Pro Asn Leu Val Glu Asn Ser Asn Gly Asp Glu
500 505 510
Trp Met Leu Asn Ala Val Asp Cys Leu Leu Ser Phe Met Asn Lys Asp
515 520 525
Gly Ser Val Ser Thr Phe Glu Cys Gln Arg Thr Tyr Ser Trp Leu Glu
530 535 540
Ile Leu Asn Pro Leu Glu Ser Phe Arg Asn Ile Val Ala Asp Tyr Pro
545 550 555 560
Thr Val Glu Cys Thr Ser Ser Val Leu Gln Ala Leu Val Leu Phe Glu
565 570 575
Glu Phe Asn Ser Glu Tyr Arg Ser Lys Glu Ile Lys Glu Asn Val Lys
580 585 590
Lys Ala Ala Ile Tyr Ile Glu Asn Asn Gln Asn Lys Asp Gly Ser Trp
595 600 605
Tyr Gly Thr Trp Gly Ile Cys Phe Val Tyr Gly Thr Leu Tyr Ala Ile
610 615 620
Lys Gly Leu Val Ala Ala Gly Arg Asn Tyr Glu Asn Ser Ile Cys Ile
625 630 635 640
Arg Lys Ala Cys Asn Phe Leu Leu Ser Ile Gln Leu Lys Thr Gly Gly
645 650 655
Trp Ala Glu Ser Tyr His Ser Cys Glu Arg Gln Val Tyr Val Glu Gly
660 665 670
His Ser Thr His Val Val Gln Thr Ala Trp Ala Met Leu Ala Leu Ile
675 680 685
Tyr Thr Gly Gln Met Glu Arg Asp Pro Thr Pro Leu His Arg Ala Ala
690 695 700
Lys Val Leu Ile Asn Met Gln Leu Glu Thr Gly Asp Tyr Ala Gln Gln
705 710 715 720
Glu His Val Gly Ser Thr Asn Cys Ser Val Tyr Phe Asn Tyr Pro Asn
725 730 735
Tyr Arg Ile Leu Phe Pro Val Trp Ala Leu Gly Glu Tyr His Arg Lys
740 745 750
Val Cys Thr Lys Ser Asn
755
<210> 3
<211> 24
<212> DNA
<213> 引物(Primer)
<400> 3
atgtggaggt taaagatcgg agag 24
<210> 4
<211> 27
<212> DNA
<213> 引物(Primer)
<400> 4
tcaattactt ttggtacata ctttgcg 27
<210> 5
<211> 59
<212> DNA
<213> 引物(Primer)
<400> 5
aatatacctc tatactttaa cgtcaaggag aaaaaaccca tgtggaggtt aaagatcgg 59
<210> 6
<211> 59
<212> DNA
<213> 引物(Primer)
<400> 6
ttttcggtta gagcggattt aatgatgatg atgatgatgt caattacttt tggtacata 59
Claims (15)
1.一种分离的多肽,其特征在于,该多肽选自下组:
(a)SEQ ID NO:2所示氨基酸序列的多肽;
(b)与SEQ ID NO:2所示的氨基酸序列有至少70%相同性,且具有(a)所限定的多肽的功能的多肽;
(c)将(a)所限定的多肽的氨基酸序列经过一个或多个氨基酸残基的取代、缺失或添加而形成的,且具有(a)所限定的多肽的功能的多肽;或
(d)(a)~(c)任一所述多肽的片段,其包含有该多肽的催化结构域,且具有(a)所限定的多肽的功能;
(e)(a)~(d)任一所述多肽的N或C末端添加标签序列,或在其N末端添加信号肽序列后形成的多肽。
2.一种分离的多核苷酸,其特征在于,它包含一核苷酸序列,该核苷酸序列选自下组:
(1)编码如权利要求1所述多肽的多核苷酸;
(2)与多核苷酸(1)互补的多核苷酸。
3.如权利要求2所述的多核苷酸,其特征在于,该多核苷酸编码如SEQ ID NO:2所示氨基酸序列的多肽;较佳地,该多核苷酸的核苷酸序列如SEQ ID NO:1所示。
4.一种载体,其特征在于,它含有权利要求2~3任一所述的多核苷酸。
5.一种遗传工程化的宿主细胞,其特征在于,它含有权利要求4所述的载体,或其基因组中整合有权利要求2~3任一所述的多核苷酸。
6.一种制备权利要求1所述的多肽的方法,包括:
(i)培养权利要求5所述的宿主细胞;
(ii)收集含有权利要求1所述的多肽的培养物;
(iii)从培养物中分离出权利要求1所述的多肽。
7.权利要求1所述的多肽的用途,用于催化2,3环氧角鲨烯生成西米杜鹃醇。
8.一种组合物,其包含选自下组的成分:权利要求1所述的多肽;或权利要求5所述的宿主细胞;以及,工业学或微生物学上可接受的载体。
9.一种生产西米杜鹃醇的方法,包括:应用权利要求1所述的多肽、权利要求5所述的宿主细胞或权利要求8所述的组合物催化2,3环氧角鲨烯,生成西米杜鹃醇。
10.一种生产西米杜鹃醇的方法,包括:
(1)提供一工程细胞,该细胞:
包含2,3环氧角鲨烯,
能吸收胞外的2,3环氧角鲨烯,或
具有2,3环氧角鲨烯生产途径;
(2)在(1)所述工程细胞中表达权利要求1所述的多肽;
(3)培养(2)的工程细胞,生产西米杜鹃醇。
11.如权利要求10所述的方法,其特征在于,还包括:在所述工程细胞中过表达ERG10基因,ERG13基因,ERG12基因,ERG8基因,ERG19基因,IDI基因,tHMG1基因。
12.一种用于生产西米杜鹃醇的工程细胞,该细胞:
包含2,3环氧角鲨烯,
能吸收胞外的2,3环氧角鲨烯,或
具有2,3环氧角鲨烯生产途径;
且,该细胞表达权利要求1所述的多肽。
13.如权利要求12所述的工程细胞,其特征在于,所述工程细胞中过表达ERG10基因,ERG13基因,ERG12基因,ERG8基因,ERG19基因,IDI基因,tHMG1基因。
14.如权利要求10~13任一所述,其特征在于,所述的工程细胞包括:原核细胞或真核细胞;较佳地,所述原核细胞包括大肠杆菌、枯草杆菌;所述真核细胞包括:酵母细胞、真菌细胞、昆虫细胞、哺乳动物细胞;较佳地,所述工程细胞为酵母细胞;更佳地,所述工程细胞为酿酒酵母细胞。
15.一种用于生产西米杜鹃醇的试剂盒,其中包括:权利要求12~14任一所述的工程细胞;较佳地,其中还包括细胞培养基或培养组分;较佳地,其中还包括使用说明书。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011032594.6A CN114277024B (zh) | 2020-09-27 | 2020-09-27 | 一种新型三萜合酶及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011032594.6A CN114277024B (zh) | 2020-09-27 | 2020-09-27 | 一种新型三萜合酶及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114277024A true CN114277024A (zh) | 2022-04-05 |
CN114277024B CN114277024B (zh) | 2024-02-13 |
Family
ID=80867809
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011032594.6A Active CN114277024B (zh) | 2020-09-27 | 2020-09-27 | 一种新型三萜合酶及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114277024B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112063647A (zh) * | 2020-09-17 | 2020-12-11 | 云南农业大学 | 酿酒酵母重组菌Cuol01的构建方法、酿酒酵母重组菌Cuol02及应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070107081A1 (en) * | 2005-11-07 | 2007-05-10 | Mcgonigle Brian | Compositions with increased phytosterol levels obtained from plants with decreased triterpene saponin levels |
CN104138384A (zh) * | 2013-05-07 | 2014-11-12 | 中国药科大学 | 五环三萜类化合物作为氧化角鲨烯环化酶抑制剂的用途 |
-
2020
- 2020-09-27 CN CN202011032594.6A patent/CN114277024B/zh active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070107081A1 (en) * | 2005-11-07 | 2007-05-10 | Mcgonigle Brian | Compositions with increased phytosterol levels obtained from plants with decreased triterpene saponin levels |
CN104138384A (zh) * | 2013-05-07 | 2014-11-12 | 中国药科大学 | 五环三萜类化合物作为氧化角鲨烯环化酶抑制剂的用途 |
Non-Patent Citations (2)
Title |
---|
UNKNOWN: "NCBI Reference Sequence: XP_008661272.1,achilleol B synthase isoform X1 [Zea mays]", 《NCBI》 * |
樊震鋆: "三萜合酶和倍半萜合酶元件的挖掘与鉴定", 《中国优秀硕士论文电子期刊网》, pages 17 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112063647A (zh) * | 2020-09-17 | 2020-12-11 | 云南农业大学 | 酿酒酵母重组菌Cuol01的构建方法、酿酒酵母重组菌Cuol02及应用 |
CN112063647B (zh) * | 2020-09-17 | 2023-05-02 | 云南农业大学 | 酿酒酵母重组菌Cuol01的构建方法、酿酒酵母重组菌Cuol02及应用 |
Also Published As
Publication number | Publication date |
---|---|
CN114277024B (zh) | 2024-02-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110551701B (zh) | 羰基还原酶突变体及其在环戊二酮类化合物还原中的应用 | |
CN111051515B (zh) | 来自细菌的iii型聚酮合酶作为间苯三酚合酶的用途 | |
CN113322288A (zh) | 新型黄酮羟基化酶、合成黄酮碳苷类化合物的微生物及其应用 | |
KR102418138B1 (ko) | 글리코실트랜스퍼라제, 돌연변이체 및 이의 응용 | |
CN111032875B (zh) | Iii型聚酮合酶作为间苯三酚合酶的用途 | |
CN105985938A (zh) | 糖基转移酶突变蛋白及其应用 | |
CN113136373A (zh) | 新型碳苷糖基转移酶及其应用 | |
CN110885846B (zh) | 合成黄芩素和野黄芩素的微生物、其制备方法及其应用 | |
EP4108777A1 (en) | Bifunctional c-glycoside glycosyltransferases and application thereof | |
CN109207448B (zh) | 新型黄酮异戊烯基转移酶及其应用 | |
CN114277024B (zh) | 一种新型三萜合酶及其应用 | |
CN110951705B (zh) | 胺脱氢酶突变体、酶制剂、重组载体、重组细胞及其制备方法和应用 | |
CN114196640B (zh) | 一种源于新型贝类的l-肌肽合成酶atpgd及其应用 | |
CN106318920B (zh) | 黄酮-6-羟化酶及其在灯盏乙素合成中的应用 | |
US20240035062A1 (en) | Cytochrome P450 Mutant Protein and Use Thereof | |
KR20240032944A (ko) | 람노스가 고도로 특이적인 글리코실트랜스퍼라제 및 이의 응용 | |
CN112553175B (zh) | 糖基转移酶ugt76g1突变体的制备及其应用 | |
CN116478973A (zh) | 梅片树倍半萜合酶CbTPS6及其相关生物材料与应用 | |
CN113046333B (zh) | 黄酮-8-羟化酶及其在8-羟基黄酮化合物合成中的应用 | |
CN113755464B (zh) | 一种参与桂叶苷B和毛蕊花糖苷生物合成的LrUGT2蛋白及其编码基因与应用 | |
CN108410905A (zh) | 调节棉花的棉酚性状的基因以及调节方法 | |
CN111926000B (zh) | 绞股蓝糖基转移酶及其应用 | |
CN112322592B (zh) | 一种参与紫草素生物合成的cyp76b100蛋白及其编码基因与应用 | |
CN108707590A (zh) | 一种Pictet-Spengler酶及其编码基因与应用 | |
KR102238654B1 (ko) | 오명사마이신 a의 생산능이 증대된 스트렙토마이세스 속 변이 균주 및 이를 이용한 오명사마이신 a의 생산 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |