KR102618015B1 - Production of steviol or a precursor thereof - Google Patents
Production of steviol or a precursor thereof
- Publication number
- KR102618015B1 (application number KR1020180167932A)
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- glu
- ser
- ile
- lys
- 239000002243 precursor Substances 0.000 title claims abstract description 31
- QFVOYBUQQBFCRH-VQSWZGCSSA-N steviol Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-VQSWZGCSSA-N 0.000 title abstract description 80
- 229940032084 steviol Drugs 0.000 title abstract description 78
- QFVOYBUQQBFCRH-UHFFFAOYSA-N Steviol Natural products C1CC2(C3)CC(=C)C3(O)CCC2C2(C)C1C(C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-UHFFFAOYSA-N 0.000 title abstract description 75
- 238000004519 manufacturing process Methods 0.000 title description 37
- 108090000790 Enzymes Proteins 0.000 claims abstract description 59
- 102000004190 Enzymes Human genes 0.000 claims abstract description 58
- ONVABDHFQKWOSV-UHFFFAOYSA-N 16-Phyllocladene Natural products C1CC(C2)C(=C)CC32CCC2C(C)(C)CCCC2(C)C31 ONVABDHFQKWOSV-UHFFFAOYSA-N 0.000 claims description 66
- ONVABDHFQKWOSV-HPUSYDDDSA-N ent-kaur-16-ene Chemical compound C1C[C@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-HPUSYDDDSA-N 0.000 claims description 65
- 108090000623 proteins and genes Proteins 0.000 claims description 59
- 210000004027 cell Anatomy 0.000 claims description 46
- OINNEUNVOZHBOX-QIRCYJPOSA-K 2-trans,6-trans,10-trans-geranylgeranyl diphosphate(3-) Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP([O-])(=O)OP([O-])([O-])=O OINNEUNVOZHBOX-QIRCYJPOSA-K 0.000 claims description 32
- OINNEUNVOZHBOX-XBQSVVNOSA-N Geranylgeranyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)O OINNEUNVOZHBOX-XBQSVVNOSA-N 0.000 claims description 32
- JCAIWDXKLCEQEO-PGHZQYBFSA-N 5beta,9alpha,10alpha-labda-8(20),13-dien-15-yl diphosphate Chemical compound CC1(C)CCC[C@@]2(C)[C@H](CCC(/C)=C/COP(O)(=O)OP(O)(O)=O)C(=C)CC[C@@H]21 JCAIWDXKLCEQEO-PGHZQYBFSA-N 0.000 claims description 22
- 108010045510 NADPH-Ferrihemoprotein Reductase Proteins 0.000 claims description 17
- 108010067758 ent-kaurene oxidase Proteins 0.000 claims description 17
- 239000000284 extract Substances 0.000 claims description 16
- 235000019202 steviosides Nutrition 0.000 claims description 15
- 239000004383 Steviol glycoside Substances 0.000 claims description 14
- 230000001580 bacterial effect Effects 0.000 claims description 14
- 235000019411 steviol glycoside Nutrition 0.000 claims description 14
- 229930182488 steviol glycoside Natural products 0.000 claims description 14
- 150000008144 steviol glycosides Chemical class 0.000 claims description 14
- VWFJDQUYCIWHTN-YFVJMOTDSA-N 2-trans,6-trans-farnesyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-YFVJMOTDSA-N 0.000 claims description 12
- VWFJDQUYCIWHTN-UHFFFAOYSA-N Farnesyl pyrophosphate Natural products CC(C)=CCCC(C)=CCCC(C)=CCOP(O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-UHFFFAOYSA-N 0.000 claims description 12
- 239000000203 mixture Substances 0.000 claims description 12
- 230000009977 dual effect Effects 0.000 claims description 9
- NIKHGUQULKYIGE-UHFFFAOYSA-N kaurenoic acid Natural products C1CC2(CC3=C)CC3CCC2C2(C)C1C(C)(C(O)=O)CCC2 NIKHGUQULKYIGE-UHFFFAOYSA-N 0.000 claims description 8
- 210000005253 yeast cell Anatomy 0.000 claims description 8
- 239000002253 acid Substances 0.000 claims description 4
- 230000002538 fungal effect Effects 0.000 claims description 4
- 229920001184 polypeptide Polymers 0.000 claims description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 4
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 108010074633 Mixed Function Oxygenases Proteins 0.000 claims description 3
- 102000008109 Mixed Function Oxygenases Human genes 0.000 claims description 3
- 241000894006 Bacteria Species 0.000 claims description 2
- 241000238631 Hexapoda Species 0.000 claims description 2
- 210000004962 mammalian cell Anatomy 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- NIKHGUQULKYIGE-SHAPNJEPSA-N ent-kaur-16-en-19-oic acid Chemical compound C([C@H]1C[C@]2(CC1=C)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 NIKHGUQULKYIGE-SHAPNJEPSA-N 0.000 claims 1
- KWVKUAKMOIEELN-UHFFFAOYSA-N ent-kaur-16-en-19-oic acid Natural products CC1(C)CCCC2(C)C1CCC34CC(=C(C3)C(=O)O)CCC24 KWVKUAKMOIEELN-UHFFFAOYSA-N 0.000 claims 1
- 238000000034 method Methods 0.000 abstract description 21
- 230000015572 biosynthetic process Effects 0.000 abstract description 13
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 40
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 39
- SNRUBQQJIBEYMU-UHFFFAOYSA-N dodecane Chemical compound CCCCCCCCCCCC SNRUBQQJIBEYMU-UHFFFAOYSA-N 0.000 description 30
- 244000228451 Stevia rebaudiana Species 0.000 description 24
- 108091033319 polynucleotide Proteins 0.000 description 17
- 102000040430 polynucleotide Human genes 0.000 description 17
- 239000002157 polynucleotide Substances 0.000 description 17
- 150000001413 amino acids Chemical group 0.000 description 14
- CRDAMVZIKSXKFV-FBXUGWQNSA-N (2-cis,6-cis)-farnesol Chemical compound CC(C)=CCC\C(C)=C/CC\C(C)=C/CO CRDAMVZIKSXKFV-FBXUGWQNSA-N 0.000 description 13
- 239000000260 (2E,6E)-3,7,11-trimethyldodeca-2,6,10-trien-1-ol Substances 0.000 description 13
- OJISWRZIEWCUBN-QIRCYJPOSA-N (E,E,E)-geranylgeraniol Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CO OJISWRZIEWCUBN-QIRCYJPOSA-N 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 13
- 229930002886 farnesol Natural products 0.000 description 13
- 229940043259 farnesol Drugs 0.000 description 13
- XWRJRXQNOHXIOX-UHFFFAOYSA-N geranylgeraniol Natural products CC(C)=CCCC(C)=CCOCC=C(C)CCC=C(C)C XWRJRXQNOHXIOX-UHFFFAOYSA-N 0.000 description 13
- OJISWRZIEWCUBN-UHFFFAOYSA-N geranylnerol Natural products CC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCO OJISWRZIEWCUBN-UHFFFAOYSA-N 0.000 description 13
- CRDAMVZIKSXKFV-UHFFFAOYSA-N trans-Farnesol Natural products CC(C)=CCCC(C)=CCCC(C)=CCO CRDAMVZIKSXKFV-UHFFFAOYSA-N 0.000 description 13
- 108010007508 Farnesyltranstransferase Proteins 0.000 description 11
- 102000007317 Farnesyltranstransferase Human genes 0.000 description 11
- HELXLJCILKEWJH-NCGAPWICSA-N rebaudioside A Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HELXLJCILKEWJH-NCGAPWICSA-N 0.000 description 11
- 101150080339 BTS1 gene Proteins 0.000 description 10
- 235000006092 Stevia rebaudiana Nutrition 0.000 description 10
- RPYRMTHVSUWHSV-CUZJHZIBSA-N rebaudioside D Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RPYRMTHVSUWHSV-CUZJHZIBSA-N 0.000 description 10
- 238000000605 extraction Methods 0.000 description 9
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 9
- 239000000543 intermediate Substances 0.000 description 9
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 9
- 230000008569 process Effects 0.000 description 9
- 101150084072 ERG20 gene Proteins 0.000 description 8
- 241000195930 Jungermannia subulata Species 0.000 description 8
- KAH Proteins 0.000 description 8
- 230000001851 biosynthetic effect Effects 0.000 description 8
- 210000004748 cultured cell Anatomy 0.000 description 7
- NIKHGUQULKYIGE-OTCXFQBHSA-N ent-kaur-16-en-19-oic acid Chemical compound C([C@@H]1C[C@]2(CC1=C)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 NIKHGUQULKYIGE-OTCXFQBHSA-N 0.000 description 7
- 108010064739 ent-kaurene synthetase B Proteins 0.000 description 7
- 235000013305 food Nutrition 0.000 description 7
- 235000003599 food sweetener Nutrition 0.000 description 7
- 239000003765 sweetening agent Substances 0.000 description 7
- 239000013598 vector Substances 0.000 description 7
- 241000196324 Embryophyta Species 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 230000001588 bifunctional effect Effects 0.000 description 6
- 238000005119 centrifugation Methods 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- QSRAJVGDWKFOGU-WBXIDTKBSA-N rebaudioside c Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]1(CC[C@H]2[C@@]3(C)[C@@H]([C@](CCC3)(C)C(=O)O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)CC3)C(=C)C[C@]23C1 QSRAJVGDWKFOGU-WBXIDTKBSA-N 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 238000004113 cell culture Methods 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 101000841399 Arabidopsis thaliana ERBB-3 BINDING PROTEIN 1 Proteins 0.000 description 4
- 239000001512 FEMA 4601 Substances 0.000 description 4
- 102100028501 Galanin peptides Human genes 0.000 description 4
- 102100039291 Geranylgeranyl pyrophosphate synthase Human genes 0.000 description 4
- 108010066605 Geranylgeranyl-Diphosphate Geranylgeranyltransferase Proteins 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- HELXLJCILKEWJH-SEAGSNCFSA-N Rebaudioside A Natural products O=C(O[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1)[C@@]1(C)[C@@H]2[C@](C)([C@H]3[C@@]4(CC(=C)[C@@](O[C@H]5[C@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@H](O)[C@@H](CO)O5)(C4)CC3)CC2)CCC1 HELXLJCILKEWJH-SEAGSNCFSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- 241000222057 Xanthophyllomyces dendrorhous Species 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 239000000356 contaminant Substances 0.000 description 4
- HELXLJCILKEWJH-UHFFFAOYSA-N entered according to Sigma 01432 Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC(C1OC2C(C(O)C(O)C(CO)O2)O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O HELXLJCILKEWJH-UHFFFAOYSA-N 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 239000002035 hexane extract Substances 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 239000002207 metabolite Substances 0.000 description 4
- 150000007523 nucleic acids Chemical group 0.000 description 4
- 229930188195 rebaudioside Natural products 0.000 description 4
- 235000019203 rebaudioside A Nutrition 0.000 description 4
- GSGVXNMGMKBGQU-PHESRWQRSA-N rebaudioside M Chemical compound C[C@@]12CCC[C@](C)([C@H]1CC[C@@]13CC(=C)[C@@](C1)(CC[C@@H]23)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@H]1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O)C(=O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@H]1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O GSGVXNMGMKBGQU-PHESRWQRSA-N 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- 101710145281 Ent-kaur-16-ene synthase Proteins 0.000 description 3
- 239000001776 FEMA 4720 Substances 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- 101100114901 Streptomyces griseus crtI gene Proteins 0.000 description 3
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 3
- 239000006227 byproduct Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 101150000046 crtE gene Proteins 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 239000000796 flavoring agent Substances 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- QRGRAFPOLJOGRV-UHFFFAOYSA-N rebaudioside F Natural products CC12CCCC(C)(C1CCC34CC(=C)C(CCC23)(C4)OC5OC(CO)C(O)C(OC6OCC(O)C(O)C6O)C5OC7OC(CO)C(O)C(O)C7O)C(=O)OC8OC(CO)C(O)C(O)C8O QRGRAFPOLJOGRV-UHFFFAOYSA-N 0.000 description 3
- HYLAUKAHEAUVFE-AVBZULRRSA-N rebaudioside f Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)CO1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HYLAUKAHEAUVFE-AVBZULRRSA-N 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 2
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 description 2
- CONKBQPVFMXDOV-QHCPKHFHSA-N 6-[(5S)-5-[[4-[2-(2,3-dihydro-1H-inden-2-ylamino)pyrimidin-5-yl]piperazin-1-yl]methyl]-2-oxo-1,3-oxazolidin-3-yl]-3H-1,3-benzoxazol-2-one Chemical compound C1C(CC2=CC=CC=C12)NC1=NC=C(C=N1)N1CCN(CC1)C[C@H]1CN(C(O1)=O)C1=CC2=C(NC(O2)=O)C=C1 CONKBQPVFMXDOV-QHCPKHFHSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- 241000219195 Arabidopsis thaliana Species 0.000 description 2
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 2
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 2
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 101150094690 GAL1 gene Proteins 0.000 description 2
- 101150037782 GAL2 gene Proteins 0.000 description 2
- 102100021735 Galectin-2 Human genes 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 2
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- 229910009891 LiAc Inorganic materials 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- ISYSEOWLRQKQEQ-JYJNAYRXSA-N Phe-His-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISYSEOWLRQKQEQ-JYJNAYRXSA-N 0.000 description 2
- RLLCWNUIHGPAJY-RYBZXKSASA-N Rebaudioside E Natural products O=C(O[C@H]1[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O2)[C@@H](O)[C@@H](O)[C@H](CO)O1)[C@]1(C)[C@@H]2[C@@](C)([C@@H]3[C@@]4(CC(=C)[C@@](O[C@@H]5[C@@H](O[C@@H]6[C@@H](O)[C@H](O)[C@@H](O)[C@H](CO)O6)[C@H](O)[C@@H](O)[C@H](CO)O5)(C4)CC3)CC2)CCC1 RLLCWNUIHGPAJY-RYBZXKSASA-N 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 239000004376 Sucralose Substances 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000036983 biotransformation Effects 0.000 description 2
- 239000012930 cell culture fluid Substances 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 229930004069 diterpene Natural products 0.000 description 2
- 150000004141 diterpene derivatives Chemical class 0.000 description 2
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 2
- 230000027721 electron transport chain Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 229930182470 glycoside Natural products 0.000 description 2
- 150000002338 glycosides Chemical class 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 238000012269 metabolic engineering Methods 0.000 description 2
- 235000021096 natural sweeteners Nutrition 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- RLLCWNUIHGPAJY-SFUUMPFESA-N rebaudioside E Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RLLCWNUIHGPAJY-SFUUMPFESA-N 0.000 description 2
- 235000019204 saccharin Nutrition 0.000 description 2
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 2
- 229940081974 saccharin Drugs 0.000 description 2
- 239000000901 saccharin and its Na,K and Ca salt Substances 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 229940031439 squalene Drugs 0.000 description 2
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 2
- 229940013618 stevioside Drugs 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 235000019408 sucralose Nutrition 0.000 description 2
- BAQAVOSOZGMPRM-QBMZZYIRSA-N sucralose Chemical compound O[C@@H]1[C@@H](O)[C@@H](Cl)[C@@H](CO)O[C@@H]1O[C@@]1(CCl)[C@@H](O)[C@H](O)[C@@H](CCl)O1 BAQAVOSOZGMPRM-QBMZZYIRSA-N 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- DRSKVOAJKLUMCL-MMUIXFKXSA-N u2n4xkx7hp Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O DRSKVOAJKLUMCL-MMUIXFKXSA-N 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 239000007222 ypd medium Substances 0.000 description 2
- RMLYXMMBIZLGAQ-UHFFFAOYSA-N (-)-monatin Natural products C1=CC=C2C(CC(O)(CC(N)C(O)=O)C(O)=O)=CNC2=C1 RMLYXMMBIZLGAQ-UHFFFAOYSA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- RMLYXMMBIZLGAQ-HZMBPMFUSA-N (2s,4s)-4-amino-2-hydroxy-2-(1h-indol-3-ylmethyl)pentanedioic acid Chemical compound C1=CC=C2C(C[C@](O)(C[C@H](N)C(O)=O)C(O)=O)=CNC2=C1 RMLYXMMBIZLGAQ-HZMBPMFUSA-N 0.000 description 1
- OINNEUNVOZHBOX-QIRCYJPOSA-N 2-trans,6-trans,10-trans-geranylgeranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP(O)(=O)OP(O)(O)=O OINNEUNVOZHBOX-QIRCYJPOSA-N 0.000 description 1
- 241000208140 Acer Species 0.000 description 1
- WBZFUFAFFUEMEI-UHFFFAOYSA-M Acesulfame k Chemical compound [K+].CC1=CC(=O)[N-]S(=O)(=O)O1 WBZFUFAFFUEMEI-UHFFFAOYSA-M 0.000 description 1
- 241000589220 Acetobacter Species 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 101000998372 Arabidopsis thaliana NADPH-cytochrome P450 reductase 2 Proteins 0.000 description 1
- IJPNNYWHXGADJG-GUBZILKMSA-N Arg-Ala-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O IJPNNYWHXGADJG-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 108010011485 Aspartame Proteins 0.000 description 1
- 241000680806 Blastobotrys adeninivorans Species 0.000 description 1
- 241000195940 Bryophyta Species 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000235646 Cyberlindnera jadinii Species 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- CANAPGLEBDTCAF-NTIPNFSCSA-N Dulcoside A Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@]23C(C[C@]4(C2)[C@H]([C@@]2(C)[C@@H]([C@](CCC2)(C)C(=O)O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)CC4)CC3)=C)O[C@H](CO)[C@@H](O)[C@@H]1O CANAPGLEBDTCAF-NTIPNFSCSA-N 0.000 description 1
- CANAPGLEBDTCAF-QHSHOEHESA-N Dulcoside A Natural products C[C@@H]1O[C@H](O[C@@H]2[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]2O[C@]34CC[C@H]5[C@]6(C)CCC[C@](C)([C@H]6CC[C@@]5(CC3=C)C4)C(=O)O[C@@H]7O[C@H](CO)[C@@H](O)[C@H](O)[C@H]7O)[C@H](O)[C@H](O)[C@H]1O CANAPGLEBDTCAF-QHSHOEHESA-N 0.000 description 1
- 239000004278 EU approved seasoning Substances 0.000 description 1
- 108030000406 Ent-copalyl diphosphate synthases Proteins 0.000 description 1
- 241001465328 Eremothecium gossypii Species 0.000 description 1
- 239000004386 Erythritol Substances 0.000 description 1
- UNXHWFMMPAWVPI-UHFFFAOYSA-N Erythritol Natural products OCC(O)C(O)CO UNXHWFMMPAWVPI-UHFFFAOYSA-N 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 101001091269 Escherichia coli Hygromycin-B 4-O-kinase Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- CTEMYIWDSVICKS-WDSOQIARSA-N His-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N CTEMYIWDSVICKS-WDSOQIARSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- ATBJCCFCJXCNGZ-UFYCRDLUSA-N Met-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 ATBJCCFCJXCNGZ-UFYCRDLUSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- CKXMGSJPDQXBPG-JYJNAYRXSA-N Pro-Cys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O CKXMGSJPDQXBPG-JYJNAYRXSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 101100209986 Rattus norvegicus Slc18a1 gene Proteins 0.000 description 1
- YWPVROCHNBYFTP-UHFFFAOYSA-N Rubusoside Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC1OC(CO)C(O)C(O)C1O YWPVROCHNBYFTP-UHFFFAOYSA-N 0.000 description 1
- 101100499952 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) DPP1 gene Proteins 0.000 description 1
- 101100459905 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) NCP1 gene Proteins 0.000 description 1
- 241000198072 Saccharomyces mikatae Species 0.000 description 1
- 241000235342 Saccharomycetes Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- UEDUENGHJMELGK-HYDKPPNVSA-N Stevioside Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UEDUENGHJMELGK-HYDKPPNVSA-N 0.000 description 1
- 101001091268 Streptomyces hygroscopicus Hygromycin-B 7''-O-kinase Proteins 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 235000007303 Thymus vulgaris Nutrition 0.000 description 1
- 240000002657 Thymus vulgaris Species 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- OQMQBYOEAHVCGD-GQGQLFGLSA-N Trp-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OQMQBYOEAHVCGD-GQGQLFGLSA-N 0.000 description 1
- OBAMASZCXDIXSS-SZMVWBNQSA-N Trp-Glu-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N OBAMASZCXDIXSS-SZMVWBNQSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- RQKMZXSRILVOQZ-GMVOTWDCSA-N Trp-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RQKMZXSRILVOQZ-GMVOTWDCSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 1
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- NKMFRGPKTIEXSK-ULQDDVLXSA-N Tyr-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NKMFRGPKTIEXSK-ULQDDVLXSA-N 0.000 description 1
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- 241000222124 [Candida] boidinii Species 0.000 description 1
- 241000222126 [Candida] glabrata Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 235000010358 acesulfame potassium Nutrition 0.000 description 1
- 229960004998 acesulfame potassium Drugs 0.000 description 1
- 239000000619 acesulfame-K Substances 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 239000008122 artificial sweetener Substances 0.000 description 1
- 235000021311 artificial sweeteners Nutrition 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 239000000605 aspartame Substances 0.000 description 1
- 235000010357 aspartame Nutrition 0.000 description 1
- IAOZJIPTCAWIRG-QWRGUYRKSA-N aspartame Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 IAOZJIPTCAWIRG-QWRGUYRKSA-N 0.000 description 1
- 229960003438 aspartame Drugs 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 230000003570 biosynthesizing effect Effects 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 208000032343 candida glabrata infection Diseases 0.000 description 1
- 230000005465 channeling Effects 0.000 description 1
- 238000012777 commercial manufacturing Methods 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 101150046305 cpr-1 gene Proteins 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-N diphosphoric acid Chemical compound OP(O)(=O)OP(O)(O)=O XPPKVPWEQAFLFU-UHFFFAOYSA-N 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- JCAIWDXKLCEQEO-MSVCPBRZSA-N ent-Copalyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC[C@H]1C(=C)CC[C@@H]2C(C)(C)CCC[C@]12C)/C)O JCAIWDXKLCEQEO-MSVCPBRZSA-N 0.000 description 1
- ONVABDHFQKWOSV-YQXATGRUSA-N ent-Kaur-16-ene Natural products C1C[C@@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-YQXATGRUSA-N 0.000 description 1
- JCAVDWHQNFTFBW-GNVSMLMZSA-N ent-kaur-16-en-19-al Chemical compound C1C[C@@H](C2)C(=C)C[C@@]32CC[C@@H]2[C@](C)(C=O)CCC[C@@]2(C)[C@@H]31 JCAVDWHQNFTFBW-GNVSMLMZSA-N 0.000 description 1
- TUJQVRFWMWRMIO-GNVSMLMZSA-N ent-kaur-16-en-19-ol Chemical compound C1C[C@@H](C2)C(=C)C[C@@]32CC[C@@H]2[C@](C)(CO)CCC[C@@]2(C)[C@@H]31 TUJQVRFWMWRMIO-GNVSMLMZSA-N 0.000 description 1
- JCAVDWHQNFTFBW-UHFFFAOYSA-N ent-kaurenal Natural products C1CC(C2)C(=C)CC32CCC2C(C)(C=O)CCCC2(C)C31 JCAVDWHQNFTFBW-UHFFFAOYSA-N 0.000 description 1
- UIXMIBNGPQGJJJ-UHFFFAOYSA-N ent-kaurene Natural products CC1CC23CCC4C(CCCC4(C)C)C2CCC1C3 UIXMIBNGPQGJJJ-UHFFFAOYSA-N 0.000 description 1
- 108010064741 ent-kaurene synthetase A Proteins 0.000 description 1
- 108010026539 ent-kaurenoic acid 13-hydroxylase Proteins 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- UNXHWFMMPAWVPI-ZXZARUISSA-N erythritol Chemical compound OC[C@H](O)[C@H](O)CO UNXHWFMMPAWVPI-ZXZARUISSA-N 0.000 description 1
- 235000019414 erythritol Nutrition 0.000 description 1
- 229940009714 erythritol Drugs 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 235000011194 food seasoning agent Nutrition 0.000 description 1
- 229960002737 fructose Drugs 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 229960001031 glucose Drugs 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 235000019534 high fructose corn syrup Nutrition 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010041601 histidyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 235000012907 honey Nutrition 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010075702 lysyl-valyl-aspartyl-leucine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000012092 media component Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 235000013379 molasses Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- YWPVROCHNBYFTP-OSHKXICASA-N rubusoside Chemical compound O([C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YWPVROCHNBYFTP-OSHKXICASA-N 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000010421 standard material Substances 0.000 description 1
- QSIDJGUAAUSPMG-CULFPKEHSA-N steviolmonoside Chemical compound O([C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O QSIDJGUAAUSPMG-CULFPKEHSA-N 0.000 description 1
- OHHNJQXIOPOJSC-UHFFFAOYSA-N stevioside Natural products CC1(CCCC2(C)C3(C)CCC4(CC3(CCC12C)CC4=C)OC5OC(CO)C(O)C(O)C5OC6OC(CO)C(O)C(O)C6O)C(=O)OC7OC(CO)C(O)C(O)C7O OHHNJQXIOPOJSC-UHFFFAOYSA-N 0.000 description 1
- 229960004793 sucrose Drugs 0.000 description 1
- 235000019605 sweet taste sensations Nutrition 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 239000001585 thymus vulgaris Substances 0.000 description 1
- OQPOFZJZPYRNFF-CULFPKEHSA-N tkd5uc898q Chemical compound O=C([C@]1(C)CCC[C@@]2([C@@H]1CC[C@]13C[C@](O)(C(=C)C1)CC[C@@H]23)C)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O OQPOFZJZPYRNFF-CULFPKEHSA-N 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L27/00—Spices; Flavouring agents or condiments; Artificial sweetening agents; Table salts; Dietetic salt substitutes; Preparation or treatment thereof
- A23L27/30—Artificial sweetening agents
- A23L27/33—Artificial sweetening agents containing sugars or derivatives
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/96—Stabilising an enzyme by forming an adduct or a composition; Forming enzyme conjugates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/44—Preparation of O-glycosides, e.g. glucosides
- C12P19/56—Preparation of O-glycosides, e.g. glucosides having an oxygen atom of the saccharide radical directly bound to a condensed ring system having three or more carbocyclic rings, e.g. daunomycin, adriamycin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y402/00—Carbon-oxygen lyases (4.2)
- C12Y402/03—Carbon-oxygen lyases (4.2) acting on phosphates (4.2.3)
- C12Y402/03019—Ent-kaurene synthase (4.2.3.19)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y505/00—Intramolecular lyases (5.5)
- C12Y505/01—Intramolecular lyases (5.5.1)
- C12Y505/01013—Ent-copalyl diphosphate synthase (5.5.1.13)
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2002/00—Food compositions, function of food ingredients or processes for food or foodstuffs
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2250/00—Food ingredients
- A23V2250/24—Non-sugar sweeteners
- A23V2250/262—Stevioside
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Mycology (AREA)
- Polymers & Plastics (AREA)
- Food Science & Technology (AREA)
- Nutrition Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
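The present invention relates to the biosynthesis of steviol or a precursor thereof, and more particularly to an enzyme that produces a steviol precursor, a recombinant strain expressing the enzyme, a recombinant strain that produces steviol or a precursor thereof, and a method for producing steviol using the enzyme or the recombinant strain.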
Description
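The present invention relates to the biosynthesis of steviol or a precursor thereof, and more particularly to an enzyme that produces a steviol precursor, a recombinant strain expressing the enzyme, a recombinant strain that produces steviol or a precursor thereof, and a method for producing steviol using the enzyme or the recombinant strain.
Sweeteners are among the ingredients most commonly used in the food, beverage, and confectionery industries. A sweetener can be incorporated into a final food product during production or, when appropriately diluted for stand-alone use, can serve as a tabletop sweetener or as a household replacement for sugar in baking. Sweeteners include natural sweeteners such as sucrose, high fructose corn syrup, molasses, maple syrup, and honey, and artificial sweeteners such as aspartame, saccharin, and sucralose.
Stevia extract is a natural sweetener that can be extracted from the perennial shrub Stevia rebaudiana. Stevia extracts purified to various degrees are sold as high-intensity sweeteners in foods and blends, or alone as tabletop sweeteners.
Extracts of the Stevia plant contain rebaudiosides and other steviol glycosides that contribute to sweetness, but the amount of each glycoside often varies between production batches. Existing commercial products are mainly rebaudioside A, with smaller amounts of other glycosides such as rebaudioside C, D, and F. Stevia extracts can also contain contaminants, such as plant-derived compounds that cause off-flavors. These off-flavors can be more or less problematic depending on the food system or application chosen.
The composition of a Stevia extract can vary widely with the soil and climate in which the plant is grown. Depending on the source plant, the climatic conditions, and the extraction process, the amount of rebaudioside A in commercial preparations is reported to vary from 20 to 97% of the total steviol glycoside content. Other steviol glycosides are present in Stevia extracts in varying amounts.
Because recovery and purification of steviol glycosides from the Stevia plant have proven labor-intensive and inefficient, there remains a need for a recombinant production system that can accumulate high yields of desired steviol glycosides such as RebD and RebM. There also remains a need for improved production of steviol glycosides in recombinant hosts for commercial use.
The present invention relates to the biosynthesis of steviol or a precursor thereof, and more particularly to an enzyme that produces a steviol precursor, a recombinant strain expressing the enzyme, a recombinant strain that produces steviol, and a method for producing steviol using the enzyme or the recombinant strain.
Hereinafter, the present invention is described in more detail.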
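One embodiment of the present invention relates to a CDPS/KS bifunctional enzyme protein having at least 40% sequence identity to the amino acid sequence of SEQ ID NO: 1; the enzyme protein may be derived from Jungermannia subulata. The amino acid sequence identity of the enzyme protein may be at least 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 99%.
The CDPS/KS bifunctional enzyme protein according to the present invention is a bifunctional enzyme that has both ent-copalyl diphosphate synthase (CDPS) and ent-kaurene synthase (KS) activity. The bifunctional enzyme forms a substrate channel, so that the two reactions proceed in rapid succession, which improves the yield and production rate of the product made with the enzyme. The bifunctional enzyme according to the present invention can therefore carry the reaction from the substrate through to kaurene.
Another embodiment of the present invention relates to a polynucleotide encoding a CDPS/KS bifunctional enzyme protein having at least 40% sequence identity to the amino acid sequence of SEQ ID NO: 1, for example a polynucleotide having at least 70% sequence identity to the nucleic acid sequence of SEQ ID NO: 2. The nucleic acid sequence identity of the gene may be at least 70%, 80%, 90%, 95%, or 99%.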
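The identity thresholds above can be made concrete with a small calculation. The sketch below is only an illustration under stated assumptions: the two sequences are already aligned to equal length with `-` for gaps, identity is counted over aligned columns, and the function name `percent_identity` and the `variant` sequence are hypothetical. The patent does not state which alignment algorithm or denominator is used.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns (assumed definition, see lead-in)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# First ten residues of SEQ ID NO: 1 in one-letter code.
query = "MSFEKSAPGS"
# Hypothetical variant: one substitution (S->T) and one gap.
variant = "MSFEKTAP-S"

identity = percent_identity(query, variant)
meets_40_percent = identity >= 40.0
```

A real comparison would first align the full-length sequences with an established alignment tool; only the counting step is shown here.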
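A further embodiment of the present invention relates to a recombinant vector comprising a polynucleotide encoding a CDPS/KS bifunctional enzyme protein having at least 40% sequence identity to the amino acid sequence of SEQ ID NO: 1. In the recombinant vector, the polynucleotide may be operably linked to a regulatory sequence, and the regulatory sequence includes a transcriptional promoter and the like. An example of a recombinant vector according to the present invention is shown in FIG. 1.
Another embodiment of the present invention relates to a recombinant host cell comprising a polynucleotide encoding a CDPS/KS bifunctional enzyme protein having at least 40% sequence identity to the amino acid sequence of SEQ ID NO: 1.
The recombinant strain according to the present invention may be a recombinant strain that produces steviol or a precursor thereof and comprises (a) a gene encoding the CDPS/KS bifunctional enzyme protein; the strain may further comprise (b) a gene encoding a GGPP synthase protein.
Another recombinant strain according to the present invention may be a steviol-producing recombinant strain that comprises, in addition to (a) the gene encoding the CDPS/KS bifunctional enzyme protein, (c) a gene encoding kaurenoic acid hydroxylase (KAH), (d) a gene encoding kaurene oxidase (KO), and (e) a gene encoding cytochrome P450 reductase (CPR), which completes the electron transport chain by transferring electrons to KO and KAH. The strain may further comprise (b) a gene encoding a GGPP synthase protein.
Using the CDPS/KS bifunctional enzyme protein according to the present invention, steviol and/or a steviol precursor can be produced in a recombinant host, in vitro (that is, enzymatically), or by whole-cell bioconversion.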
In some embodiments, steviol or a steviol precursor produced by the method or recombinant host described herein accumulates to a detectable concentration when cultured under the stated conditions. The synthetic pathway for producing steviol and its precursors according to the present invention is outlined in Scheme 1. The enzymes involved in the pathway are as follows: GGPP (geranylgeranyl pyrophosphate) is obtained from mevalonate via FPP (farnesyl pyrophosphate); copalyl diphosphate synthase (CDPS) converts GGPP to CPP (copalyl pyrophosphate); kaurene synthase (KS) converts CPP to kaurene; kaurene oxidase (KO) converts kaurene to kaurenoic acid; and kaurenoic acid hydroxylase (KAH) converts kaurenoic acid to steviol.
[Scheme 1]
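The enzyme order described for the pathway can be written down as a small data structure. This is a minimal sketch for illustration, not part of the patent: the names `PATHWAY` and `enzymes_between` are hypothetical, the list starts at GGPP, and CPR is left out because the text describes it as an electron donor to KO and KAH, not as a step of its own.

```python
# (substrate, enzyme, product) for each step described in the text.
PATHWAY = [
    ("GGPP", "CDPS", "CPP"),
    ("CPP", "KS", "kaurene"),
    ("kaurene", "KO", "kaurenoic acid"),
    ("kaurenoic acid", "KAH", "steviol"),
]


def enzymes_between(start: str, end: str, steps=PATHWAY) -> list[str]:
    """Return the enzymes needed to go from `start` to `end`, in order."""
    enzymes = []
    current = start
    while current != end:
        step = next((s for s in steps if s[0] == current), None)
        if step is None:
            raise ValueError(f"no step leads from {current!r} towards {end!r}")
        enzymes.append(step[1])
        current = step[2]
    return enzymes


# The two activities that the bifunctional CDPS/KS enzyme combines.
to_kaurene = enzymes_between("GGPP", "kaurene")
# The full route from GGPP to steviol.
to_steviol = enzymes_between("GGPP", "steviol")
```

Read this way, the bifunctional enzyme replaces the first two entries of the list with a single protein.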
Steviol or a steviol precursor produced by the method or recombinant host described herein has an undetectable concentration of Stevia plant-derived contaminants. In some embodiments, a steviol or steviol precursor composition produced in vivo, in vitro, or by whole-cell bioconversion contains fewer contaminants than a Stevia extract obtained from the Stevia plant. The contaminants include plant-derived compounds that cause off-flavors.
When steviol is produced in yeast, GGPP (geranylgeranyl pyrophosphate) is formed first, CPP (copalyl pyrophosphate) is then produced from it by copalyl pyrophosphate (CPP) synthase, and steviol is biosynthesized via kaurene, produced in turn by kaurene synthase (KS), and kaurenoic acid. Production of the precursor kaurene is therefore important in yeast-based steviol biosynthesis, and it is also necessary to increase both GGPP production and kaurene production. Accordingly, the enzymes that produce CPP and kaurene from GGPP are built as a module, and more preferably a bifunctional enzyme in which the CDPS and KS activities act together to produce kaurene is used.
Kaurene can be biosynthesized in yeast from GGPP via CPP through the mevalonate pathway. The enzymes acting in this process are copalyl pyrophosphate synthase (CDPS) and ent-kaurene synthase (KS); conventionally, two separate enzymes from Stevia rebaudiana, KS (SrKS) and CDPS (SrCDPS), have been used.
When the CDPS/KS bifunctional enzyme protein according to the present invention is used, kaurene production and steviol production increase compared with the conventionally used Stevia-derived SrCDPS and SrKS.
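As used herein, steviol glycosides include steviol-13-O-glucoside (13-SMG), steviol-1,2-bioside, steviol-1,3-bioside, steviol-19-O-glucoside (19-SMG), stevioside, 1,3-stevioside, rubusoside, rebaudioside A (RebA), rebaudioside B (RebB), rebaudioside C (RebC), rebaudioside D (RebD), rebaudioside E (RebE), rebaudioside F (RebF), rebaudioside M (RebM), rebaudioside Q (RebQ), rebaudioside I (RebI), dulcoside A, di-glycosylated steviol, tri-glycosylated steviol, tetra-glycosylated steviol, penta-glycosylated steviol, hexa-glycosylated steviol, hepta-glycosylated steviol, or isomers thereof.
As used herein, the terms "steviol precursor", "steviol glycoside precursor", and "steviol precursor compound" refer to intermediate compounds of the steviol biosynthesis pathway. Steviol precursors include, without limitation, geranylgeranyl diphosphate (GGPP), ent-copalyl diphosphate, copalyl pyrophosphate, ent-kaurene, ent-kaurenol, ent-kaurenal, ent-kaurenoic acid, and steviol. Preferably, the steviol precursor may be one or more selected from the group consisting of FPP (farnesyl pyrophosphate), GGPP (geranylgeranyl pyrophosphate), CPP (copalyl pyrophosphate), kaurene, and kaurenoic acid, but is not limited thereto.
In the recombinant strain according to the present invention that produces steviol or a precursor thereof and comprises (a) a gene encoding the CDPS/KS bifunctional enzyme protein, or in the recombinant strain comprising (a) and (b) a gene encoding a GGPP synthase protein, polypeptides suitable for producing steviol include functional homologs of KO, KAH, and CPR. For example, methods for altering the substrate specificity of KO, KAH, and CPR are known to those skilled in the art and include, without limitation, site-directed/rational mutagenesis approaches, random directed evolution approaches, and combinations in which random mutagenesis/saturation techniques are applied near the active site of the enzyme.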
For (b) the gene encoding a GGPP synthase protein, (c) the gene encoding kaurenoic acid hydroxylase (KAH), (d) the gene encoding kaurene oxidase (KO), and (e) the gene encoding cytochrome P450 reductase (CPR), which completes the electron transport chain by transferring electrons to KO and KAH, as contained in the recombinant vector and recombinant host cell according to the present invention, genes known to those skilled in the art may be selected and used. For example, one gene encoding geranylgeranyl pyrophosphate synthase is BTS1 of Saccharomyces cerevisiae, with UniProtKB accession number Q12051 (amino acid sequence of SEQ ID NO: 15); another is XdCrtE, the crtE of Xanthophyllomyces dendrorhous, with UniProtKB accession number Q1L6K3 (amino acid sequence of SEQ ID NO: 16); and a further gene encoding GGPP synthase is ERG20F96C (DNA sequence of the GGPP synthase ERG20F96C from Saccharomyces cerevisiae), whose product has the amino acid sequence of SEQ ID NO: 17 and which is described in Metabolic Engineering 27 (2015) 65-75, "Efficient diterpene production in yeast by engineering Erg20p into a geranylgeranyl diphosphate synthase".
As specific examples of the genes (c), (d), and (e), KO (SrKO) and KAH (SrKAH) from S. rebaudiana and CPR from Arabidopsis thaliana and S. cerevisiae (AtCPR, yCPR) can be used. Specifically, the genes may be one or more selected from the group consisting of SrKO (Stevia rebaudiana KO1, GenBank accession number Q6UQ67), SrKAH (Stevia rebaudiana KA13H, GenBank accession number Q0NZP1), AtCPR (Arabidopsis thaliana CPR2, GenBank accession number Q9SUM3), and yCPR (Saccharomyces cerevisiae CPR1, GenBank accession number P16603).
The choice of regulatory regions depends on several factors, including, without limitation, efficiency, selectability, inducibility, desired expression level, and preferential expression during particular culture stages. Modulating the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence is routine for those skilled in the art. It will be understood that more than one regulatory region may be present, for example introns, enhancers, upstream activation regions, transcription terminators, and inducible elements.
Specifically, for kaurene biosynthesis within the steviol biosynthesis pathway, the genes other than CDPS/KS are designed to be expressed under the control of an operable promoter, for example a GAL promoter. The three GGPP synthesis enzymes BTS1, XdCrtE, and ERG20F96C are combined into GGPP biosynthesis modules designed to be expressed under the control of the GAL1 promoter. Specifically, BTS1 is the BTS1 of Saccharomyces cerevisiae with Genbank accession number Q12051, XdCrtE is the crtE of Xanthophyllomyces dendrorhous with Genbank accession number A0A0C4MWV0, and ERG20F96C is described in Metabolic Engineering 27 (2015) 65-75, "Efficient diterpene production in yeast by engineering Erg20p into a geranylgeranyl diphosphate synthase". For the kaurene production step, the two Stevia rebaudiana enzymes KS (SrKS) and CDPS (SrCDPS), and the Jungermannia subulata bifunctional enzyme JsCDPS/KS, which performs the roles of both CDPS and KS in one protein, are designed as kaurene biosynthesis modules expressed under the control of the GAL2 promoter. The cleavage map of the resulting vector is shown in FIG. 1.
The recombinant host described herein includes a plant cell, a mammalian cell, an insect cell, a fungal cell, or a bacterial cell. The bacterial cell includes an Escherichia cell, for example an Escherichia coli cell; a Lactobacillus cell; a Lactococcus cell; a Corynebacterium cell; an Acetobacter cell; an Acinetobacter cell; or a Pseudomonas cell.
The fungal cell includes a yeast cell. In one aspect, the yeast cell is a cell from the species Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Candida glabrata, Ashbya gossypii, Cyberlindnera jadinii, Pichia pastoris, Kluyveromyces lactis, Hansenula polymorpha, Candida boidinii, Arxula adeninivorans, Xanthophyllomyces dendrorhous, or Candida albicans. In one aspect, the yeast cell is a Saccharomycete. In one aspect, the yeast cell is a cell from the species Saccharomyces cerevisiae.
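The present invention provides a method for producing steviol or a steviol precursor, comprising (a) growing the recombinant host described herein in a culture medium under conditions in which any of the genes described herein is expressed, whereby steviol or a steviol precursor is synthesized by the host; and/or
(b) optionally quantifying the steviol or steviol precursor; and/or
(c) optionally isolating the steviol or steviol precursor.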
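The recombinant host cell according to the present invention can be cultured as follows. When the host cell is yeast, for example, the recombinant yeast is precultured in YPD medium, inoculated into YNB liquid medium to a specified OD600, for example 0.1 to 0.2, and cultured at 30 °C and 240 rpm for 5 days to produce steviol or a steviol precursor.
A composition produced by the recombinant microorganism described herein can be incorporated into food. Substantially pure steviol can be incorporated into food together with other sweeteners, for example saccharin, dextrose, sucrose, fructose, erythritol, aspartame, sucralose, monatin, or acesulfame potassium. The weight ratio of steviol to the other sweeteners can be varied as needed to reach a satisfactory taste in the final food product.
The present invention relates to an enzyme that produces a steviol precursor, a recombinant strain expressing the enzyme, a recombinant strain that produces steviol, and a method for producing steviol using the enzyme or the recombinant strain. Compared with production from the Stevia plant, it provides a recombinant production system that can accumulate, in high yield, steviol, the precursor of steviol glycosides such as RebD and RebM.
FIG. 1 is a cleavage map of the vector used to construct the kaurene-biosynthesizing recombinant strains.
The present invention is described in more detail with reference to the following examples, but its scope is not intended to be limited to these illustrative examples.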
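Example 1. Search for a bifunctional enzyme that produces kaurene from GGPP
Kaurene, an intermediate in steviol biosynthesis, is produced from GGPP via CPP. The enzymes that act in producing kaurene from GGPP are copalyl pyrophosphate synthase (CDPS) and ent-kaurene synthase (KS), and a bifunctional enzyme able to perform both steps was sought. The search first selected enzymes whose function had not previously been confirmed, based on publicly available information at NCBI (www.ncbi.nlm.nih.gov).
The bifunctional CDPS/KS enzyme from Jungermannia subulata (JsCDPS/KS) was finally selected. To remove the signal sequence from the gene, the N-terminal signal sequence (signal peptide) was identified with the plant network program TargetP (www.cbs.dtu.dk/services/TargetP) and removed. The amino acid sequence of the enzyme with the signal sequence (the N-terminal 105 amino acids) removed is shown in SEQ ID NO: 1.
A polynucleotide encoding the amino acid sequence of SEQ ID NO: 1, codon-optimized for Saccharomyces cerevisiae (SEQ ID NO: 2), was synthesized by Genescript (USA). The codon-optimized gene sequence was re-checked with the CAI Calculator 2 server.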
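As background for the codon-optimization check, the codon adaptation index (CAI) is commonly defined as the geometric mean of per-codon relative adaptiveness weights. The sketch below illustrates that general definition only; it is not the CAI Calculator 2 implementation, the function name `codon_adaptation_index` is hypothetical, and `TOY_WEIGHTS` contains made-up values, not a real Saccharomyces cerevisiae table.

```python
import math


def codon_adaptation_index(cds: str, weights: dict[str, float]) -> float:
    """Geometric mean of the weights of the codons in `cds`.

    Codons missing from `weights` are skipped, as is commonly done
    for codons without synonymous alternatives.
    """
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    scored = [weights[c] for c in codons if c in weights]
    if not scored:
        raise ValueError("no codon in the sequence has a weight")
    return math.exp(sum(math.log(w) for w in scored) / len(scored))


# Hypothetical weights for two alanine codons (illustration only).
TOY_WEIGHTS = {"GCT": 1.0, "GCC": 0.5}

all_preferred = codon_adaptation_index("GCTGCT", TOY_WEIGHTS)
mixed = codon_adaptation_index("GCTGCC", TOY_WEIGHTS)
```

A sequence built only from the most-used codons scores 1.0; any less-preferred codon pulls the index below 1.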
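Example 2. Construction of kaurene-biosynthesizing recombinant strains
To confirm the enzyme activity of the JsCDPS/KS selected in Example 1, kaurene biosynthesis in S. cerevisiae was examined.
Specifically, for kaurene biosynthesis within the steviol biosynthesis pathway shown in FIG. 1, the genes other than CDPS/KS were designed to be expressed under the control of GAL promoters. The three GGPP synthesis enzymes BTS1, XdCrtE, and ERG20F96C were combined into GGPP biosynthesis modules designed to be expressed under the control of the GAL1 promoter. Specifically, BTS1, which encodes geranylgeranyl pyrophosphate synthase, is the BTS1 of Saccharomyces cerevisiae with UniProtKB accession number Q12051; XdCrtE, which encodes geranylgeranyl pyrophosphate synthase, is the crtE of Xanthophyllomyces dendrorhous with UniProtKB accession number Q1L6K3; and ERG20F96C, which encodes GGPP synthase (DNA sequence of the GGPP synthase ERG20F96C from Saccharomyces cerevisiae), is described in Metabolic Engineering 27 (2015) 65-75, "Efficient diterpene production in yeast by engineering Erg20p into a geranylgeranyl diphosphate synthase". The amino acid sequences of the BTS1, XdCrtE, and ERG20F96C enzyme proteins are shown in SEQ ID NOs: 15, 16, and 17, respectively.
For the kaurene production step, the two Stevia rebaudiana enzymes KS (SrKS) and CDPS (SrCDPS), and the Jungermannia subulata bifunctional enzyme JsCDPS/KS, which performs the roles of both CDPS and KS in one protein, were designed as kaurene biosynthesis modules expressed under the control of the GAL2 promoter. The cleavage map of the resulting vector is shown in FIG. 1. The genes of the two modules were combined to design a total of six combinations covering the biosynthesis steps from GGPP to kaurene, and these were cloned into the E. coli-yeast shuttle vector pRS424 to give the six pSYK recombinant plasmids of Table 1.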
Recombinant plasmid | GGPP biosynthesis module: promoter | GGPP biosynthesis module: gene | Kaurene biosynthesis module: promoter 1 | Kaurene biosynthesis module: gene 1 | Kaurene biosynthesis module: promoter 2 | Kaurene biosynthesis module: gene 2 |
pSYK1R | ScGAL1 | BTS1-ERG20 | SmGAL2 | SrCDPS | ScGAL2 | SrKS |
pSYK2R | ScGAL1 | XdCrtE-ERG20 | SmGAL2 | SrCDPS | ScGAL2 | SrKS |
pSYK3R | ScGAL1 | ERG20F96C | SmGAL2 | SrCDPS | ScGAL2 | SrKS |
pSYK4 | ScGAL1 | BTS1-ERG20 | - | - | ScGAL2 | JsCDPS/KS |
pSYK5 | ScGAL1 | XdCrtE-ERG20 | - | - | ScGAL2 | JsCDPS/KS |
pSYK6 | ScGAL1 | ERG20F96C | - | - | ScGAL2 | JsCDPS/KS |
Sc: Saccharomyces cerevisiae
Sm: Saccharomyces mikatae
Sr: Stevia rebaudiana
Js: Jungermannia subulata
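The six plasmids are the full cross of three GGPP modules with two kaurene modules. A minimal sketch of that combinatorial design follows; the variable names are hypothetical, and the numbering rule (an "R" suffix for the designs carrying the two separate Stevia rebaudiana genes) is inferred from the table and not stated in the text.

```python
from itertools import product

# GGPP biosynthesis modules, in the order used in the table.
GGPP_MODULES = ["BTS1-ERG20", "XdCrtE-ERG20", "ERG20F96C"]

# Kaurene biosynthesis modules: (name suffix, genes in the module).
KAURENE_MODULES = [
    ("R", ("SrCDPS", "SrKS")),   # two separate enzymes
    ("", ("JsCDPS/KS",)),        # one bifunctional enzyme
]

designs = {}
combos = product(KAURENE_MODULES, GGPP_MODULES)
for number, ((suffix, kaurene_genes), ggpp_gene) in enumerate(combos, start=1):
    designs[f"pSYK{number}{suffix}"] = {
        "ggpp_module": ggpp_gene,
        "kaurene_module": kaurene_genes,
    }

for name, modules in designs.items():
    print(name, modules["ggpp_module"], "+", "/".join(modules["kaurene_module"]))
```

Laying the designs out this way makes it easy to pair strains that differ only in the kaurene module, for example pSYK3R and pSYK6.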
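To insert the six recombinant plasmids into the S. cerevisiae genome, they were treated with SwaI (NEB) and then transformed into S. cerevisiae strain CEN PK2-1c by the LiAc (lithium acetate) method. The six gene cassettes were inserted at the GAL7-GAL10-GAL1 locus of the S. cerevisiae genome by homologous recombination, and recombinant strains carrying the cassettes were selected with the LEU selection marker.
Example 3. Culture of recombinant strains
The kaurene-biosynthesizing recombinant yeast strains constructed in Example 2 were cultured in the medium shown in Table 2. The strains were first grown on solid medium, then inoculated into 5 mL of the Table 2 medium and precultured at 30 °C and 200 rpm for 9 hours. The precultured cells were finally inoculated into 25 mL of the Table 2 medium, 2 mL of sterile dodecane was added, and the cultures were shaken at 30 °C and 200 rpm for 72 hours. This gave a dodecane two-phase culture and the cultured cell mass.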
Medium component | Amount (g/L unless otherwise noted) |
YNB (yeast nitrogen base) | 6.7 |
MES | 100 mM |
Glucose | 20 |
Aqueous ammonia | adjust to pH 6.0 |
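For bench use, the recipe can be scaled to the 25 mL main-culture volume. The sketch below is an illustration only: the molar mass used for MES (195.24 g/mol, anhydrous free acid) is an assumption, since the text does not say which form of MES was weighed, and the names `RECIPE_G_PER_L` and `grams_needed` are hypothetical. Aqueous ammonia is omitted because it is added to pH 6.0, not by mass.

```python
# Assumption: MES weighed as the anhydrous free acid.
MES_MOLAR_MASS_G_PER_MOL = 195.24

# Concentrations from the table, converted to g/L.
RECIPE_G_PER_L = {
    "YNB": 6.7,
    "MES": 0.100 * MES_MOLAR_MASS_G_PER_MOL,  # 100 mM
    "Glucose": 20.0,
}


def grams_needed(volume_ml: float) -> dict[str, float]:
    """Mass of each component for a given medium volume."""
    return {name: conc * volume_ml / 1000.0
            for name, conc in RECIPE_G_PER_L.items()}


batch_25_ml = grams_needed(25)
for name, grams in batch_25_ml.items():
    print(f"{name}: {grams:.4f} g")
```

If the monohydrate or a salt form of MES is used, only the molar-mass constant needs to change.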
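Example 4. Quantitative analysis of intermediates by LC-MS
4-1: Culture of recombinant yeast for intermediate identification
To identify intermediate metabolites such as farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP), the six kaurene-biosynthesizing yeast strains obtained in Example 2 were cultured for 36 hours. Cells in the logarithmic growth phase were used to confirm intermediate production.
Specifically, 5 mL of the yeast culture was mixed into 45 mL of chilled saline solution and centrifuged (5000 g, 10 min, 0); the supernatant was quickly discarded, and the cells were washed again with chilled saline solution. The supernatant was removed by centrifugation, and the cells were kept on ice until used in the extraction step.
4-2: Extraction of metabolites from recombinant yeast cells
To extract the metabolites, 1.5 mL of 70% (v/v) aqueous ethanol preheated in a 95 °C water bath was added and the cells were thoroughly resuspended. The suspension was shaken at 120 rpm in the 95 °C bath for 3 minutes to extract the metabolites, cooled quickly on ice, held for 5 minutes, and centrifuged (15000 g, 5 minutes) to obtain the supernatant, which was used for LC-MS analysis.
The metabolites were analyzed by LC-MS on a Gemini-NX C18 column (150 mm x 2 mm, 3 um, 110 A particle; Phenomenex), with 7.5 mM tributylamine solution (pH 4.95) and acetonitrile as the mobile phases, run as a gradient. The LC-MS gradient conditions are shown in Table 3 below.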
Time (min) | Eluent B(%) |
0 | 0 |
10 | 15 |
35 | 30 |
45 | 70 |
47 | 100 |
49 | 100 |
55 | 100 |
56 | 0 |
60 | 0 |
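The gradient table can be read as a set of time points joined by straight lines. The sketch below computes the Eluent B percentage at an arbitrary time under that assumption; linear change between listed points is assumed here and is not stated in the text, and the names `GRADIENT` and `eluent_b_percent` are hypothetical.

```python
# (time in min, Eluent B in %) taken from the gradient table.
GRADIENT = [
    (0, 0), (10, 15), (35, 30), (45, 70), (47, 100),
    (49, 100), (55, 100), (56, 0), (60, 0),
]


def eluent_b_percent(t_min: float) -> float:
    """Eluent B (%) at time t, assuming linear segments between table rows."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time is outside the programmed gradient")
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise AssertionError("unreachable: gradient rows cover the whole range")


at_5_min = eluent_b_percent(5)    # inside the first shallow ramp
at_40_min = eluent_b_percent(40)  # inside the steeper 30 -> 70 % ramp
at_50_min = eluent_b_percent(50)  # during the 100 % wash
```

The last two rows describe the return to 0% B and the re-equilibration before the next injection.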
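LC-MS analysis of the extracts confirmed the presence of FPP and GGPP in the extracts of the recombinant yeast cells.
Example 5. Extraction of kaurene from the dodecane layer, culture broth, and cells of yeast cultures
5-1: Kaurene production and extraction
When recombinant yeast is cultured with dodecane added, kaurene accumulates in the dodecane layer. The kaurene content was therefore determined both in the dodecane layer and in cell extracts obtained by disrupting the cells and extracting with hexane.
Specifically, the recombinant strains were cultured for 12 hours in the same manner as in Example 3, and 200 uL of the dodecane layer on top of the culture was sampled, taking care not to include water, and used to determine kaurene production in the dodecane layer.
Kaurene was also determined in the culture broth and in the cultured cells. Specifically, to extract kaurene from the cultured cells, 3 mL of the dodecane two-phase yeast culture obtained in Example 3 (including all cultured cells) was centrifuged (13000 g, 5 minutes) to recover the cells.
From the centrifuged culture, 1 mL of culture supernatant was mixed with 400 uL of hexane, transferred to a 2 mL screw-capped tube, and extracted; the extract was used to determine the amount of kaurene present in the culture broth.
The cells recovered by centrifugation were washed twice with 1 mL of water, and 200 mg of glass beads of 0.5 mm diameter and 200 uL of saturated sodium chloride solution were added. Then 400 uL of hexane was added, the mixture was transferred to a screw-capped tube, and kaurene was extracted by bead-beating at 6500 cycles per minute; the extract was used to determine kaurene production.
5-2: Quantification of kaurene by GC-MS
The kaurene extracted in Example 5-1 was quantified by GC-MS. Specifically, the samples from Example 5-1 were analyzed: the dodecane layer of the culture, the hexane extract of the culture supernatant obtained by centrifugation, and the extract of the cultured cells. A 4 uL injection was made at 260 °C; the analytical column was 30 m 0.24 um 0.25 um + 10 m EZ guard (Agilent CP9013); helium was used as the mobile phase at 1 mL/min; and the GC-MS temperature gradient was run as shown in Table 4 below.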
Temperature (°C) | Time (min) | Note |
40 | 2 | - |
40 - 210 | 15.4 | +11 °C/min |
210 | 2 | - |
210 - 250 | 6.1 | +6.5 °C/min |
250 | 15 | - |
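The ramp times in the oven program follow from the temperature span divided by the ramp rate, which gives a quick consistency check on the table and the total run time. A minimal sketch, with hypothetical names `OVEN_PROGRAM` and `segment_minutes`:

```python
# (kind, start °C, end °C, value) where value is the hold time in min
# for "hold" segments and the ramp rate in °C/min for "ramp" segments.
OVEN_PROGRAM = [
    ("hold", 40, 40, 2.0),
    ("ramp", 40, 210, 11.0),
    ("hold", 210, 210, 2.0),
    ("ramp", 210, 250, 6.5),
    ("hold", 250, 250, 15.0),
]


def segment_minutes(kind: str, start_c: float, end_c: float, value: float) -> float:
    """Duration of one oven segment in minutes."""
    if kind == "hold":
        return value
    return (end_c - start_c) / value


durations = [segment_minutes(*segment) for segment in OVEN_PROGRAM]
total_minutes = sum(durations)

for segment, minutes in zip(OVEN_PROGRAM, durations):
    print(segment[0], f"{segment[1]}-{segment[2]} °C", f"{minutes:.1f} min")
print(f"total: {total_minutes:.1f} min")
```

The computed ramp times agree with the 15.4 and 6.1 minute entries in the table to within rounding.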
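5-3: Confirmation of by-product formation in recombinant yeast cells
Kaurene is produced via the intermediates FPP (farnesyl pyrophosphate), GGPP (geranylgeranyl pyrophosphate), and CPP (copalyl pyrophosphate). Kaurene formation during kaurene biosynthesis was confirmed by the analysis in 5-1. When the intermediates of kaurene production are hydrolyzed, by-products such as farnesol (the hydrolysis product of FPP) and geranylgeraniol (the hydrolysis product of GGPP) are formed. In metabolically engineered yeast, the distribution of farnesol, geranylgeraniol, kaurene, and the like indicates the direction of metabolic flux.
Specifically, the formation of farnesol, geranylgeraniol, and kaurene was examined in the cultures grown in Example 3. Using substantially the same methods as in Examples 5-1 and 5-2, the dodecane layer of the culture, the hexane extract of the culture supernatant obtained by centrifugation, and the extract of the cultured cells were analyzed by GC-MS.
The GC-MS results confirmed farnesol, geranylgeraniol, kaurene, and other compounds. Specifically, the hexane extract of the cells contained farnesol, kaurene, and squalene, and the dodecane layer contained farnesol, kaurene, and geranylgeraniol. Squalene was present only in the hexane extract of the cells. The GC-MS results for kaurene and by-product formation in the constructed kaurene-producing strains are given in Table 5 below.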
Strain | Product | Average titre (mg/L) | Standard deviation (mg/L) |
S1 | farnesol | 26.870 | 4.306 |
S1 | geranylgeraniol | 2.167 | 0.577 |
S1 | kaurene | 0.059 | 0.008 |
S2 | farnesol | 19.797 | 5.030 |
S2 | geranylgeraniol | 3.967 | 0.670 |
S2 | kaurene | 0.119 | 0.054 |
S3 | farnesol | 19.407 | 1.897 |
S3 | geranylgeraniol | 6.523 | 6.088 |
S3 | kaurene | 0.170 | 0.126 |
S4 | farnesol | 17.007 | 13.655 |
S4 | geranylgeraniol | 0.760 | 0.518 |
S4 | kaurene | 0.160 | 0.024 |
S5 | farnesol | 29.790 | 9.685 |
S5 | geranylgeraniol | 4.470 | 0.695 |
S5 | kaurene | 1.267 | 0.540 |
S6 | farnesol | 28.183 | 4.476 |
S6 | geranylgeraniol | 10.030 | 1.015 |
S6 | kaurene | 0.794 | 0.102 |
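As shown in Table 5, GC-MS analysis of the dodecane layer of the recombinant yeast cultures gave farnesol titres of about 17 to 30 mg/L, while geranylgeraniol production varied widely. The farnesol differences reflect differences in intracellular FPP concentration, and the geranylgeraniol differences track GGPP; as a GGPP biosynthesis module, ERG20F96C gave higher production than XdCrtE-ERG20 or BTS1-ERG20.
Kaurene titres were low for the combinations using the Stevia rebaudiana CDPS and KS and higher for the combinations using the bifunctional enzyme. The average kaurene production from the CDPS and KS gene modules was 0.116 mg/L for the Stevia rebaudiana SrCDPS and SrKS and 0.740 mg/L for JsCDPS/KS from the liverwort Jungermannia subulata. This shows that the liverwort-derived JsCDPS/KS converts its substrate to kaurene considerably more effectively.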
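The two averages quoted above can be reproduced from the kaurene rows of Table 5. The sketch below assumes that strains S1 to S3 carry the SrCDPS and SrKS module and S4 to S6 carry JsCDPS/KS; that mapping is inferred from the plasmid order in Table 1 and is not stated explicitly in the text. Variable names are hypothetical.

```python
from statistics import mean

# Average kaurene titres (mg/L) from Table 5.
KAURENE_MG_PER_L = {
    "S1": 0.059, "S2": 0.119, "S3": 0.170,
    "S4": 0.160, "S5": 1.267, "S6": 0.794,
}

# Assumed grouping of strains by kaurene module (see lead-in).
SR_STRAINS = ("S1", "S2", "S3")   # SrCDPS + SrKS
JS_STRAINS = ("S4", "S5", "S6")   # JsCDPS/KS

sr_mean = mean(KAURENE_MG_PER_L[s] for s in SR_STRAINS)
js_mean = mean(KAURENE_MG_PER_L[s] for s in JS_STRAINS)
fold_increase = js_mean / sr_mean

print(f"SrCDPS + SrKS: {sr_mean:.3f} mg/L")
print(f"JsCDPS/KS:     {js_mean:.3f} mg/L")
print(f"ratio:         {fold_increase:.1f}x")
```

Under this grouping the group means match the 0.116 and 0.740 mg/L figures in the text, which supports the assumed strain-to-plasmid mapping.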
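Example 6. Steviol production using steviol-producing strains
6-1: Construction of steviol-producing strains
The kaurene-producing strains S6 and S9 were selected, and, for steviol production, three enzyme genes in total were introduced: kaurene oxidase (KO), kaurenoic acid hydroxylase (ent-kaurenoic acid 13-hydroxylase, KAH), and cytochrome P450 reductase (CPR). The resulting recombinant yeast for steviol production was named S6-steviol. The literature-confirmed KO (SrKO) and KAH (SrKAH) from S. rebaudiana and CPR from Arabidopsis thaliana and S. cerevisiae (AtCPR, yCPR) were used.
Specifically, the KO, KAH, and CPR genes were introduced into the recombinant yeast by homologous recombination. Homology regions were attached so that the genes would be inserted at the yeast DPP1 (diphosphate phosphatase 1) locus; the SrKAH and AtCPR gene expression cassettes and the SrKO and yCPR genes were each built as one module; and Hph (hygromycin B phosphotransferase), a hygromycin resistance determinant, was introduced, giving the fragments DPPF1, DPPF2, DPPF3, and DPPF4 containing the expression cassettes. Each fragment was subcloned into pBluescript SK, and sequencing confirmed that the genes were in a form that can be expressed normally. The confirmed gene cassettes were amplified by PCR, and each amplified expression cassette was transformed into the recombinant yeast of Example 2 by the LiAc transformation method. Specifically, DPPF1, DPPF2, DPPF3, and DPPF4 were obtained with the primer pairs listed in Table 6 below, using 30 cycles of 95 °C for 30 seconds, 55 °C for 30 seconds, and 72 °C for 4 minutes. The nucleic acid sequences of DPPF1, DPPF2, DPPF3, and DPPF4 are shown in SEQ ID NOs: 3 to 6, respectively.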
Name | Nucleic acid sequence (5'->3') | SEQ ID NO |
Forward primer for DPPF1 | CGCCGAGGGTATTTTACTTCC | 7 |
Reverse primer for DPPF1 | GCACTCGAAACTTCAGGTTC | 8 |
Forward primer for DPPF2 | GGTGTTATCGTTGCTGGTGG | 9 |
Reverse primer for DPPF2 | GATAGTACTAGAGACACATATTC | 10 |
Forward primer for DPPF3 | GGCACTGGTCACTCTTTTGG | 11 |
Reverse primer for DPPF3 | CTGTCCTTGCCTGGTGGG | 12 |
Forward primer for DPPF4 | CTCTCGCCGCTCGCCATC | 13 |
Reverse primer for DPPF4 | CAACCGGCTCTTTGTCAACAG | 14 |
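Basic properties of the listed primers, and the cycling time implied by the PCR program, are easy to tabulate. The sketch below computes length and GC content for each primer and the time spent in the 30 cycles; the dictionary keys and function name are hypothetical labels, and the cycling total excludes any initial denaturation, final extension, or ramping, none of which are given in the text.

```python
# Primer sequences (5'->3') from the table, SEQ ID NOs 7 to 14.
PRIMERS = {
    "DPPF1_fwd": "CGCCGAGGGTATTTTACTTCC",
    "DPPF1_rev": "GCACTCGAAACTTCAGGTTC",
    "DPPF2_fwd": "GGTGTTATCGTTGCTGGTGG",
    "DPPF2_rev": "GATAGTACTAGAGACACATATTC",
    "DPPF3_fwd": "GGCACTGGTCACTCTTTTGG",
    "DPPF3_rev": "CTGTCCTTGCCTGGTGGG",
    "DPPF4_fwd": "CTCTCGCCGCTCGCCATC",
    "DPPF4_rev": "CAACCGGCTCTTTGTCAACAG",
}


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a primer."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# (length in nt, GC content in % rounded to one decimal) per primer.
summary = {name: (len(seq), round(gc_percent(seq), 1))
           for name, seq in PRIMERS.items()}

# 30 cycles of 95 °C 30 s, 55 °C 30 s, 72 °C 4 min.
cycle_seconds = 30 + 30 + 4 * 60
total_pcr_minutes = 30 * cycle_seconds / 60

for name, (length, gc) in summary.items():
    print(f"{name}: {length} nt, {gc}% GC")
print(f"cycling time: {total_pcr_minutes:.0f} min")
```

The 4-minute extension step dominates the program, which is consistent with amplifying multi-gene cassettes of several kilobases.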
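The genes introduced into the yeast were assembled in the cell by in vivo assembly and finally inserted into the yeast genome at the DPP1 locus by homologous recombination. Transformants carrying the KAH/CPR cassette were selected by resistance to hygromycin (200 mg/L), giving yeast strains carrying the KO, KAH, and CPR genes.
6-2: Culture of the recombinant yeast strain and steviol production
The transformant carrying the KO, KAH, and CPR genes (S6-steviol) was precultured in 3 mL of YPD medium, adjusted to an initial OD600 of 0.1 to 0.2, and grown in main culture in a flask with 50 mL of YPD (50 g/L glucose) at 30 °C and 250 rpm. The main culture was sampled after 72 hours: the OD600 was measured and 5 mL aliquots were taken. The OD600 of the cultured steviol-producing strains was measured to check the growth of each strain.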
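Setting the initial OD600 comes down to a simple dilution calculation. The sketch below assumes OD600 scales linearly with cell density over the range used and that the inoculum volume is counted as part of the final volume; the function name and the preculture OD600 of 5.0 are hypothetical, since the text does not report the preculture density.

```python
def inoculum_volume_ml(preculture_od: float, target_od: float,
                       final_volume_ml: float) -> float:
    """Preculture volume needed to reach target_od in final_volume_ml.

    Uses C1 * V1 = C2 * V2 with OD600 as the concentration measure.
    """
    if preculture_od <= 0 or target_od <= 0:
        raise ValueError("OD values must be positive")
    if target_od >= preculture_od:
        raise ValueError("preculture is not dense enough to dilute to target")
    return target_od * final_volume_ml / preculture_od


# Hypothetical preculture at OD600 = 5.0, 50 mL main culture.
low = inoculum_volume_ml(5.0, 0.1, 50.0)   # for initial OD600 of 0.1
high = inoculum_volume_ml(5.0, 0.2, 50.0)  # for initial OD600 of 0.2
print(f"inoculate between {low:.1f} and {high:.1f} mL of preculture")
```

In practice the preculture OD600 would be measured on a diluted sample so that the reading stays in the linear range of the spectrophotometer.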
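Cell extracts were prepared from 5 mL of culture of the steviol-producing strain to check whether steviol had been produced. Specifically, steviol was extracted by a Cold Spring Harbor (CSH) method for preparing yeast extracts for LC-MS analysis: cells were recovered from the culture by centrifugation, suspended in MeOH:ACN:H2O (2:2:1) solvent, left to stand at -70 °C for 30 minutes, and then stirred at room temperature for 30 minutes.
The extracted samples were filtered and analyzed by gradient HPLC. By comparison with a steviol standard, a substance with the same retention time (RT) as steviol was identified in the extract, and the amount of steviol was found to increase with culture time. The final steviol titre of the S6-steviol strain was 0.67 ± 0.22 mg/L.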
<110> SAMYANG CORPORATION
<120> production of steviol and their precursors
<130> DPP20184721KR
<160> 17
<170> KoPatentIn 3.0
<210> 1
<211> 782
<212> PRT
<213> Artificial Sequence
<220>
<223> JS CDPS/KS enzyme of Jungermannia subulata
<400> 1
Met Ser Phe Glu Lys Ser Ala Pro Gly Ser Val Val Glu Pro Asn Gly
1 5 10 15
Arg Ser Lys Pro Asp Ile Tyr Lys Asp Lys Gly Lys Glu Ala Glu Glu
20 25 30
Ile Lys Gln Trp Ile Glu Glu Ile Arg Ala Met Met Gly Ser Met Thr
35 40 45
Asp Gly Glu Ile Thr Asn Ser Pro Tyr Asp Thr Ala Trp Val Ala Leu
50 55 60
Val Pro Ala Leu Asp Gly Ser Asp Gly Pro Gln Phe Pro Lys Ser Leu
65 70 75 80
Gln Trp Ile Ile Glu Asn Gln Phe Ser Asp Gly Ser Trp Gly Asp Arg
85 90 95
Gly Tyr Phe Ser Tyr Tyr Asp Arg Val Cys Asn Thr Leu Ala Cys Ile
100 105 110
Ile Ala Leu Lys Thr Trp Lys Thr Gly Ser Ala Ala Val Glu Lys Gly
115 120 125
Val Glu Phe Ile Gln Lys Asn Leu Gln Ala Met Glu Thr Glu Glu Asp
130 135 140
Ala His Met Met Ile Gly Phe Glu Ile Val Phe Pro Ala Leu Ile Ser
145 150 155 160
Tyr Ala Lys Ser Leu Asp Leu Asp Leu Pro Phe Asp Ala Pro Ile Ile
165 170 175
Ala Lys Ile Ser Ala Glu Arg Glu Lys Lys Leu Ala Lys Ile Pro Met
180 185 190
Asp Ile Leu His Lys Val Pro Thr Thr Leu Leu His Ser Leu Glu Gly
195 200 205
Phe His Glu Glu Leu Asp Trp Glu Lys Leu Leu Lys Leu Gln Ser Glu
210 215 220
Asp Gly Ser Phe Leu Cys Ser Pro Ala Ser Thr Ala Ala Cys Leu Leu
225 230 235 240
His Thr Lys Asp Glu Lys Ala Leu Ser Tyr Leu Thr Ser Leu Leu Asp
245 250 255
Arg Phe Asn Asn Ala Val Pro Asn Val Tyr Pro Val Asp Leu Phe Glu
260 265 270
His Met Trp Thr Val Asp Arg Leu Gln Arg Leu Gly Ile Asp Arg Tyr
275 280 285
Phe Glu Lys Glu Ile Lys Asp Ser Leu Asp Tyr Val Tyr Lys Tyr Tyr
290 295 300
Lys Ser Val Gly Ile Gly Trp Ala Arg Gly Ser Val Val Gln Asp Leu
305 310 315 320
Asp Asp Thr Ala Met Gly Phe Arg Leu Leu Arg Gln Asn Gly Tyr Asp
325 330 335
Val Asn Glu Asp Val Phe Arg Gln Phe Lys Gly Lys Glu Ser Glu Phe
340 345 350
Phe Cys Phe Ala Gly Gln Ser Gly Gln Ala Val Thr Gly Leu Phe Asn
355 360 365
Phe Tyr Arg Ala Thr Gln Thr Arg Phe Pro Gly Glu Ser Leu Leu Ala
370 375 380
Thr Gly Glu His Phe Ala Arg Gly Phe Leu Val Glu Arg His Glu Lys
385 390 395 400
Asn Glu Cys Phe Asp Lys Trp Ile Ile Thr Lys Asp Leu Pro Gly Glu
405 410 415
Val Glu Tyr Ala Leu Ala Thr Pro Trp Tyr Cys Ser Leu Pro Arg Leu
420 425 430
Glu Thr Glu Ser Tyr Leu Ser His Tyr Gly Thr Asp Asp Ile Trp Ile
435 440 445
Gly Lys Ser Leu Tyr Arg Met Pro Phe Val Asn Asn Glu Thr Phe Leu
450 455 460
Ala Leu Ala Lys Ala Asp Phe Asn Leu Cys Gln Ala Lys His Gln Glu
465 470 475 480
Asp Leu Gln Asn Ile Thr Arg Trp Ser Glu Asp Cys Gly Phe Gly Lys
485 490 495
Leu Ser Phe Ala Arg Gln Lys Ala Ile Glu Gly Val Phe Ser Ala Ala
500 505 510
Cys Ile Leu Pro Gly Pro Glu Leu Ser Pro Ala Arg Leu Val Trp Ala
515 520 525
Gln Asn Cys Val Leu Thr Thr Val Val Asp Asp Tyr Phe Asp Val Gly
530 535 540
Gly Thr Leu Pro Asp Met Arg Arg Phe Leu Glu Ala Phe Lys Glu Trp
545 550 555 560
Asn Pro Ser Leu Met Asp Gly Thr Ala Glu Glu Ala Gln Ile Val Phe
565 570 575
Asn Gly Leu Tyr Asn Thr Leu Asn Ala Met Thr Gln Glu Gly Thr Leu
580 585 590
Ala Gln Gly Arg Asp Ile Gly Gln His Leu Gln Lys Ile Trp Leu Arg
595 600 605
Trp Leu Glu Ser Cys Leu Thr Glu Ala Glu Trp Thr Ala Ser Ser Phe
610 615 620
Ser Pro Ser Phe Asp Glu Tyr Met Lys Asn Ala Leu Pro Ser Ile Ala
625 630 635 640
Leu Glu Pro Ile Val Leu Cys Thr Leu Phe Phe Leu Gly Glu Pro Leu
645 650 655
Ser Asp Glu Phe Val Gly Asp Ser Gln Lys Leu Arg Leu Met Glu Leu
660 665 670
Thr Asn Arg Val Gly Arg Leu Leu Asn Asp Ser Gln Gly Trp Lys Arg
675 680 685
Glu Asp Ser Gln Asn Lys Pro Asn Ser Val Ser Ile Leu Leu Arg Glu
690 695 700
Asn Pro Gly Trp Thr Glu Glu Glu Ala Ile Ala Asn Val Arg Ser Thr
705 710 715 720
Val Glu Glu Ser Met Leu Glu Leu Val Arg Ala Val His Gln Arg Ser
725 730 735
Pro Ile Pro Asn Ser Ile Arg Gln Leu His Phe Asn Met Ala Arg Ile
740 745 750
Met His Leu Phe Tyr Gln Lys Thr Asp Gly Phe Thr Asp Arg Ser Ala
755 760 765
Met Ala Lys Lys Leu Lys Lys Val Leu Phe Gln Pro Val Val
770 775 780
<210> 2
<211> 2349
<212> DNA
<213> Artificial Sequence
<220>
<223> JS CDPS/KS enzyme of Jungermannia subulata
<400> 2
atgtcttttg aaaaatcagc tccaggttct gttgttgaac caaatggtag atcaaagcca 60
gatatatata aggataaggg taaagaagct gaagaaatca agcaatggat cgaagaaatc 120
agagcaatga tgggttcaat gactgatggt gaaattacaa attctccata cgatactgca 180
tgggttgctt tggttccagc tttagatggt tcagatggtc cacaatttcc aaaatcttta 240
caatggatca tcgaaaacca attttctgat ggttcatggg gtgacagagg ttacttctca 300
tactacgata gagtttgtaa cactttagca tgtatcatcg ctttgaagac ttggaagaca 360
ggttctgctg ctgttgaaaa gggtgttgag tttattcaaa agaatttgca agctatggaa 420
acagaagaag atgcacacat gatgatcggt ttcgaaatcg ttttcccagc tttgatttca 480
tacgcaaaat ctttggattt ggatttgcca ttcgatgcac caatcatcgc taaaatttca 540
gcagaaagag aaaagaaatt ggctaagatc ccaatggata ttttgcataa agttccaact 600
acattgttgc attctttaga aggtttccat gaagaattgg attgggaaaa attgttgaaa 660
ttacaatcag aagatggttc atttttgtgt tctccagctt caactgctgc atgtttgtta 720
catacaaagg atgaaaaagc attatcatat ttgacttctt tgttggatag attcaataac 780
gctgttccaa atgtttaccc agttgatttg tttgaacata tgtggacagt tgatagattg 840
caaagattgg gtatcgatag atacttcgaa aaggaaatca aagattcttt agattatgtt 900
tacaaatatt acaaatcagt tggtattggt tgggctagag gttctgttgt tcaagatttg 960
gatgatactg caatgggttt cagattgttg agacaaaacg gttacgatgt taacgaagat 1020
gtttttagac aattcaaagg taaagaatca gaatttttct gttttgctgg tcaatctggt 1080
caagcagtta ctggtttgtt taatttctac agagctactc aaacaagatt tcctggtgaa 1140
tctttgttag ctacaggtga acattttgca agaggtttct tggttgaaag acatgaaaag 1200
aatgaatgtt tcgataagtg gatcatcact aaagatttgc caggtgaagt tgaatatgct 1260
ttggcaacac cttggtactg ttcattgcca agattggaaa ctgaatctta tttgtcacat 1320
tacggtacag atgatatttg gatcggtaaa tctttgtaca gaatgccatt cgttaacaac 1380
gaaacatttt tggctttggc aaaggctgat ttcaatttgt gtcaagctaa gcatcaagaa 1440
gatttgcaaa acatcacaag atggtcagaa gattgtggtt tcggtaaatt atcttttgct 1500
agacaaaagg ctattgaagg tgttttctct gctgcttgta ttttgccagg tccagaatta 1560
tctccagcta gattggtttg ggcacaaaat tgtgttttga ctacagttgt tgatgattac 1620
tttgatgttg gtggtacatt accagatatg agaagatttt tggaagcctt taaagaatgg 1680
aatccatctt tgatggatgg tactgctgaa gaagctcaaa tcgtttttaa tggtttgtac 1740
aacacattga acgctatgac tcaagaaggt acattagcac aaggtagaga tattggtcaa 1800
catttgcaaa agatttggtt aagatggttg gaatcatgtt tgactgaagc tgaatggaca 1860
gcatcttcat tttctccatc atttgatgaa tacatgaaga acgctttacc atctattgca 1920
ttggaaccta ttgttttgtg tactttgttt ttcttgggtg aaccattatc agatgaattt 1980
gttggtgact ctcaaaaatt gagattgatg gaattgacaa acagagttgg tagattgttg 2040
aacgattcac aaggttggaa gagagaagat tctcaaaata agccaaattc tgtttcaatt 2100
ttgttgagag aaaacccagg ttggactgaa gaagaagcta ttgctaatgt tagatcaaca 2160
gttgaagaat ctatgttaga attggttaga gcagttcatc aaagatcacc aatcccaaat 2220
tctatcagac aattacattt caacatggct agaatcatgc atttgtttta ccaaaagact 2280
gatggtttta cagatagatc agctatggct aagaaattga agaaagtttt atttcaacca 2340
gttgtttaa 2349
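SEQ ID NO: 2 (2349 nt) and SEQ ID NO: 1 (782 residues) carry the same JS CDPS/KS label, so the nucleotide sequence should translate to the protein plus one stop codon. The following is a minimal Python sketch that checks this using the standard genetic code. It covers only the first 60 nt and the last 9 nt of SEQ ID NO: 2, copied from the listing above. The helper `translate` is illustrative and not part of the filing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 into one-letter amino acids ('*' marks a stop)."""
    dna = dna.lower()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


SEQ2_START = "atgtcttttgaaaaatcagctccaggttctgttgttgaaccaaatggtagatcaaagcca"
SEQ2_END = "gttgtttaa"

print(translate(SEQ2_START))   # compare with residues 1 to 20 of SEQ ID NO: 1
print(translate(SEQ2_END))     # last two residues followed by the stop codon
print(3 * (782 + 1) == 2349)   # 782 codons plus one stop codon
```

Extending the check to the full sequences only requires pasting the complete strings and converting the three-letter residues of SEQ ID NO: 1 to one-letter codes.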
<210> 3
<211> 5530
<212> DNA
<213> Artificial Sequence
<220>
<223> DPPF1 polynucleotide sequence
<400> 3
cgccgagggt attttacttc cgaatctcaa agaaaaaaat atgcttactg ttaatcctaa 60
aagaggtgac aatattcagc taaaactttc agagacttgc agttctcttc aaggcggtca 120
actttaacaa agaggtagca gattgttttc tttatttgtt cgctatttac aagtgaagaa 180
gcagctcttc ataaagggac aacacggctt atagcatttt ttacgaaaag tttgaccgtt 240
tagaacaaat atttaaaaac tagtactcga tttctggcgc agcaaaaata tagcattatg 300
tccgataaac acagttgtga tctgtcttgt gatcgcatac tctgcagata atcagttgaa 360
atagcagctt ttaagtgaga atcttattct tagtctacat cgttacattg tatcagtcac 420
aggtacggag agaaattata cttttcgatt tcattcaatg tagtttcttt tttacattaa 480
atatagtttt ccagtagtgc actattatta aggcgcttct gtttttagtc aacacttttt 540
cagatagtac ctttcaggtg gttagagtgc gatcccttta aaaaaaagta ttcgtcaacg 600
atgacagggt aaagaataaa tgcagcacgc ctggcgtata ctgctataat tgtacatcat 660
gttatcggcg ttgattctca attgtttggt gattagcttt tatatataga tagaaaccca 720
acgttggata acctcacgac taactttttt gtattttaga aataatttgt cgatcggttg 780
tatatttttg tcatatatta tctagaaacg ttagggaata aactgttatc tagggtccac 840
taacatacgc gcagttcgga aatcagcaaa catcacttaa aggacacctg ctataaactg 900
aattgtgtcc aatttttcga gtagttagca gttcaataaa gggcacgtta tcaattgtta 960
aaggcaaaga atcagaatta aatcatagca aacgaccaaa atgtcttgta aggcagtttc 1020
aaaggaatac tctgatttgt tacaaaagga tgaagcatct tttactaagt gggatgatga 1080
taaggttaag gatcatttgg atacaaataa gaatttgtat ccaaatgatg aaattaaaga 1140
atttgttgaa tcagttaaag caatgttcgg ttctatgaac gatggtgaaa ttaatgtttc 1200
tgcttacgat actgcatggg ttgctttggt tcaagatgtt gatggttcag gttctccaca 1260
atttccatct tcattggaat ggatcgcaaa caaccaatta tcagatggtt cttggggtga 1320
ccatttgtta ttttcagctc atgatagaat tattaacact ttggcatgtg ttattgcttt 1380
aacatcatgg aatgttcatc catctaagtg tgaaaagggt ttgaatttct tgagagaaaa 1440
tatttgtaaa ttggaagatg aaaatgcaga acacatgcct attggtttcg aagttacatt 1500
tccatctttg attgatattg ctaagaaatt gaacatcgaa gttccagaag atactccagc 1560
attgaaggaa atatatgcta gaagagatat taaattgaca aaaattccaa tggaagtttt 1620
gcataaagtt ccaactacat tgttgcattc tttggaaggt atgccagatt tggaatggga 1680
aaaattgttg aaattgcaat gcaaggatgg ttcattttta ttttctccat cttcaactgc 1740
attcgctttg atgcaaacaa aggatgaaaa gtgtttgcaa tatttgacta acatcgttac 1800
aaagtttaat ggtggtgttc caaatgttta cccagttgat ttgttcgaac atatttgggt 1860
tgttgataga ttgcaaagat taggtatcgc aagatacttc aagtctgaaa ttaaagattg 1920
tgttgaatac atcaataagt actggactaa aaatggtatt tgttgggcta gaaacactca 1980
tgttcaagat attgatgata cagcaatggg ttttagagtt ttgagagcac atggttatga 2040
tgttacacca gatgttttta gacaattcga aaaggatggt aaatttgttt gttttgcagg 2100
tcaatcaact caagctgtta caggcatgtt caatgtttac agagcatctc aaatgttgtt 2160
tccaggtgaa agaattttag aagatgctaa gaaattttct tacaactact taaaggaaaa 2220
gcaatctact aatgaattgt tagataaatg gattattgct aaagatttgc caggtgaagt 2280
tggttatgca ttagatattc cttggtacgc ttctttgcca agattagaaa caagatacta 2340
cttggaacaa tatggtggtg aagatgatgt ttggatcggt aaaactttat acagaatggg 2400
ttacgtttca aacaacacat acttggaaat ggcaaaattg gattacaaca actacgttgc 2460
tgttttgcaa ttagaatggt acactatcca acaatggtac gttgatattg gtatcgaaaa 2520
gttcgaatct gataacatca aatcagtttt ggtttcttat tacttagctg ctgcttctat 2580
tttcgaacca gaaagatcaa aggaaagaat tgcatgggct aaaactacaa ttttagttga 2640
taagatcact tcaatttttg attcttcaca atcttcaaag gaagatatta cagcttttat 2700
tgataagttt agaaataagt cttcttctaa gaaacattct attaatggtg aaccttggca 2760
tgaagttatg gttgctttga agaaaacttt gcatggtttt gcattggatg ctttaatgac 2820
acattcacaa gatattcatc cacaattaca tcaagcatgg gaaatgtggt tgactaaatt 2880
acaagatggt gttgatgtta cagcagaatt aatggttcaa atgattaata tgactgctgg 2940
tagatgggtt tctaaggaat tgttgacaca tccacaatac caaagattgt caactgttac 3000
aaattctgtt tgtcatgata ttactaaatt gcataacttc aaggaaaatt caactacagt 3060
tgattctaag gttcaagaat tggttcaatt agttttctct gatacaccag atgatttgga 3120
tcaagatatg aagcaaacat ttttgacagt tatgaaaact ttttactaca aagcatggtg 3180
tgatccaaat actattaacg atcatatctc aaaagttttc gaaatcgtta tataacaata 3240
agcgatttaa tctctaatta ttagttaaag ttttataagc atttttatgt aacgaaaaat 3300
aaattggttc atattattac tgcactgtca cttaccatgg aaagaccaga caagaagttg 3360
ccgacagtct gttgaattgg cctggttagg cttaagtctg ggtccgcttc tttacaaatt 3420
tggagaattt ctcttaaacg atatgtatat tcttttcgtt ggaaaagatg tcttccaaaa 3480
aaaaaaccga tgaattagtg gaaccaagga aaaaaaaaga ggtatccttg attaaggaac 3540
actgtttaaa cagtgtggtt tccaaaaccc tgaaactgca ttagtgtaat agaagactag 3600
acacctcgat acaaataact gggcatgcat gtcgacaccc ttaatataac ttcgtataat 3660
gtatgctata cgaagttatt aggtctagag atctgtttag cttgcctcgt ccccgccggg 3720
tcacccggcc agcgacatgg aggcccagaa taccctcctt gacagtcttg acgtgcgcag 3780
ctcaggggca tgatgtgact gtcgcccgta catttagccc atacatcccc atgtataatc 3840
atttgcatcc atacattttg atggccgcac ggcgcgaagc aaaaattacg gctcctcgct 3900
gcagacctgc gagcagggaa acgctcccct cacagacgcg ttgaattgtc cccacgccgc 3960
gcccctgtag agaaatataa aaggttagga tttgccactg aggttcttct ttcatatact 4020
tccttttaaa atcttgctag gatacagttc tcacatcaca tccgaacata aacaaccatg 4080
cctgaactca ccgcgacgtc tgtcgagaag tttctgatcg aaaagttcga cagcgtctcc 4140
gacctgatgc agctctcgga gggcgaagaa tctcgtgctt tcagcttcga tgtaggaggg 4200
cgtggatatg tcctgcgggt aaatagctgc gccgatggtt tctacaaaga tcgttatgtt 4260
tatcggcact ttgcatcggc cgcgctcccg attccggaag tgcttgacat tggggaattc 4320
agcgagagcc tgacctattg catctcccgc cgtgcacagg gtgtcacgtt gcaagacctg 4380
cctgaaaccg aactgcccgc tgttctgcag ccggtcgcgg aggccatgga tgcgatcgct 4440
gcggccgatc ttagccagac gagcgggttc ggcccattcg gaccgcaagg aatcggtcaa 4500
tacactacat ggcgtgattt catatgcgcg attgctgatc cccatgtgta tcactggcaa 4560
actgtgatgg acgacaccgt cagtgcgtcc gtcgcgcagg ctctcgatga gctgatgctt 4620
tgggccgagg actgccccga agtccggcac ctcgtgcacg cggatttcgg ctccaacaat 4680
gtcctgacgg acaatggccg cataacagcg gtcattgact ggagcgaggc gatgttcggg 4740
gattcccaat acgaggtcgc caacatcttc ttctggaggc cgtggttggc ttgtatggag 4800
cagcagacgc gctacttcga gcggaggcat ccggagcttg caggatcgcc gcggctccgg 4860
gcgtatatgc tccgcattgg tcttgaccaa ctctatcaga gcttggttga cggcaatttc 4920
gatgatgcag cttgggcgca gggtcgatgc gacgcaatcg tccgatccgg agccgggact 4980
gtcgggcgta cacaaatcgc ccgcagaagc gcggccgtct ggaccgatgg ctgtgtagaa 5040
gtactcgccg atagtggaaa ccgacgcccc agcactcgtc cgagggcaaa ggaatagtca 5100
gtactgacaa taaaaagatt cttgttttca agaacttgtc atttgtatag tttttttata 5160
ttgtagttgt tctattttaa tcaaatgtta gcgtgattta tatttttttt cgcctcgaca 5220
tcatctgccc agatgcgaag ttaagtgcgc agaaagtaat atcatgcgtc aatcgtatgt 5280
gaatgctggt cgctatactg ctgtcgattc gatactaacg ccgccatcca gtgtcgaaaa 5340
cgagctctcg agaaccctta atataacttc gtataatgta tgctatacga agttattagg 5400
tgatatcaga tccactagtg gcctatgcgg gtgttatcgt tgctggtggt gcccatgggg 5460
ctgacgaggg gaattacgat gtttgctagt tccaccccct tcgggggttt gaacctgaag 5520
tttcgagtgc 5530
<210> 4
<211> 5185
<212> DNA
<213> Artificial Sequence
<220>
<223> DPPF2 polynucleotide sequence
<400> 4
ggtgttatcg ttgctggtgg tgcccatggg gctgacgagg ggaattacga tgtttgctag 60
ttccaccccc ttcgggggtt tgaacctgaa gtttcgagtg cgacatggag gcccagaata 120
ccctccttga cagtcttgac gtgcgcagct caggggcatg atgtgactgt cgcccgtaca 180
tttagcccat acatccccat gtataatcat ttgcatccat acattttgat ggccgcacgg 240
cgcgaagcaa aaattacggc tcctcgctgc agacctgcga gcagggaaac gctcccctca 300
cagacgcgtt gaattgtccc cacgccgcgc ccctgtagag aaatataaaa ggttaggatt 360
tgccactgag gttcttcttt catatacttc cttttaaaat cttgctagga tacagttctc 420
acatcacatc cgaacataaa caaccatgat ccagtttctg acccccgtgc tactatttat 480
cgtcttatac gtgttttgga aggtttacaa gactcagaaa actaaaatca acctgccccc 540
aggtagtttc gggtggccgt ttttaggcga aacgcttgcg ttccttcgtg cgaactggga 600
tggggttccg gaaaggtttg ttcaggaacg tgtcgaaaag tatgggagtc ccctggtctt 660
caagacctca cttttgggag acagaatggc agtgctgtgc ggtcatgctg gtaacaaatt 720
tctttttggg aacgagaata agctagttgc tgtctggtgg ccccttcctg taagaaaatt 780
gtttgggcgt tctctgataa cgatcagagg cgacgaagcc aagtggatgc gtaaaatgct 840
actttcatat ttagggccgg aggcttttgc gacacactat gcggcgacca tggacgctgt 900
cacccgtagg catatacaag tccattggca aggcaaagaa gaagtaaacg tgtttcaaac 960
tgtgaaggtc tatgcattcg agcttgcttg tcgtctgttt cttagccttg aggagccgaa 1020
ccatattgcg aaattggcga gcctgtttaa catctttatg aaaggaatta tagaattgcc 1080
cataaacttt ccgggaacga gattttacag ctctaaaaag gccgcggcgg cgataagaac 1140
agagcttaaa aaaataatca aagctaggcg tgttgagcta gaagagggca acgcgagtac 1200
aagccaagat ctacttagtc atctgttaac atcatctgat gagaacggta ggtatttaac 1260
cgaaaacgat atcgcgaaca acatacttgt cctgctgttc gccggccatg atacctcagc 1320
ggtcagtatc acgctactgt taaagtctct gggggagcat ccaaatgtgt atgacaaggt 1380
tttgaaggaa cagatagaaa tatccaatgc caaggaggcg tgggaacttc tgaaatggga 1440
ggatatccaa aagatgaaat attcctggaa cgtggtaagc gaagtaatga ggttgacacc 1500
tcctgtaacc ggcgcttatc gtgaagcgtt agttgatatt gagtacgcag gatacaccat 1560
accgaagggg tggaagttgc actggagtgg ctcacacacg catagggagg aagcgaattt 1620
tgaagatccc atgaggtttg atccgtccag attcgaggga gcgggacctc gtccattcac 1680
ttttgtccca tttgggggtg gccctaggat gtgtctaggt aaggaatttg caaggttgga 1740
agtactggcg tttcttcaca acatcgtaac taattttaaa tgggaccttt taatacctaa 1800
tgagaaaata gagtacgatc ccatgcctac tccagtcaag ggactaccta tacgtataca 1860
tccccaccag gtctaagtct gaagaatgaa tgatttgatg atttcttttt ccctccattt 1920
ttcttactga atatatcaat gatatagact tgtatagttt attatttcaa attaagtagc 1980
tatatatagt caagataacg tttgtttgac acgattacat tattcgtcga catctttttt 2040
cagcctgtcg tggtagcaat ttgaggagta ttattaattg aataggttca ttttgcgctc 2100
gcataaacag ttttcgtcag ggacagtatg ttggaatgag tggtaattaa tggtgacatg 2160
acatgttata gcaataacct tgatgtttac atcgtagttt aatgtacacc ccgcgaattc 2220
gttcaagtag gagtgcacca attgcaaagg gaaaagctga atgggcagtt cgaatagcgc 2280
gaacaaaaat cacgatctgg gtgggtgtgg gtgtattgga ttataggaag ccacgcgctc 2340
aacctggaat tacaggaagc tggtaatttt ttgggtttgc aatcatcacc atctgcacgt 2400
tgttataatg tcccgtgtct atatatatcc attgacggta ttctattttt ttgctattga 2460
aatgagcgtt ttttgttact acaattggtt ttacagacgg aattttccct atttgtttcg 2520
tcccattttt ccttttctca ttgttctcat atcttaaaaa ggtcctttct tcataatcaa 2580
tgctttcttt tacttaatat tttacttgca ttcagtgaat tttaatacat attcctctag 2640
tcttgcaaaa tcgatttaga atcaagatac cagcctaaaa atgaatgagc agtagctcct 2700
caagctccac gtctatgata gacttgatgg ccgcgataat taagggggaa ccagtgattg 2760
ttagtgaccc ggctaatgcg agtgcatacg aatcagtagc ggctgaactg tcatctatgt 2820
tgatcgagaa cagacaattc gctatgatag tgacgacgag catagccgtt cttatcggtt 2880
gtatcgtaat gttggtttgg aggcgtagtg ggagcgggaa tagcaaacgt gttgagcccc 2940
taaagcctct ggtgataaaa ccgagggagg aagaaataga tgatggtaga aagaaggtta 3000
ctattttttt cggaactcag acggggacag cggagggttt tgccaaggcc cttggagagg 3060
aggcaaaagc acgttacgaa aagacccgtt tcaagatagt agacttagac gactatgcgg 3120
cagacgatga tgagtatgaa gagaagctga agaaggaaga cgtcgcgttt ttcttcctgg 3180
cgacttatgg tgatggggaa cccacagata acgcggcaag gttttataaa tggttcaccg 3240
aaggcaatga cagaggagag tggttaaaga atcttaagta cggtgttttt gggcttggga 3300
acaggcaata tgagcacttt aacaaggttg ccaaagttgt cgatgatatc ctagttgaac 3360
aaggcgcaca gagattggtt caggtcggtc ttggagatga cgaccagtgt atcgaggatg 3420
atttcactgc gtggagagag gccttgtggc ctgagcttga cacaatctta agggaagagg 3480
gggatacagc cgtagctacg ccttataccg ccgcggtgtt ggagtacagg gtctccatcc 3540
acgactccga ggacgccaag ttcaatgaca taaacatggc gaatggcaat ggatatactg 3600
tctttgatgc acaacatcca tacaaggcca atgtggcggt taaaagagag ctgcacacgc 3660
ctgaaagcga tagaagctgt atccacttag agtttgacat agccggctca ggtttaacat 3720
acgaaactgg ggaccacgtg ggcgtgttgt gcgataacct aagtgaaact gtggatgaag 3780
ctttgcgttt attggatatg tcccccgaca cgtacttctc attacatgcg gaaaaggagg 3840
acggcactcc tatatcatcc tcacttcccc ctcccttccc gccttgcaat cttcgtaccg 3900
ccctaactag atatgcttgc ttattatcct ctccaaaaaa gtccgcactg gtcgcgcttg 3960
ccgcccacgc ttcagatccg accgaggccg agagattgaa acatttagcc agcccagcag 4020
gcaaagatga atattctaaa tgggtcgtgg agagtcagag aagcctgctt gaagtgatgg 4080
ctgaatttcc cagtgcgaaa ccgcctttgg gggtgttttt tgctggagta gcacccagac 4140
ttcaaccacg tttttattcc atctcatcta gtcccaaaat tgcagaaacg aggatacatg 4200
ttacctgtgc attggtatac gaaaagatgc ctacaggacg tattcacaag ggagtctgct 4260
ccacttggat gaaaaatgcg gttccatacg aaaaatctga aaattgttcc agcgcgccca 4320
tatttgtgcg tcagtcaaat tttaagcttc cctccgacag taaggtgccg atcatcatga 4380
tcggaccagg tactggatta gcgcccttca gaggttttct gcaagaaaga ctggccttgg 4440
ttgaatctgg agttgagttg ggaccttctg ttcttttttt tgggtgtcgt aacaggagga 4500
tggatttcat ctacgaagaa gaacttcagc gttttgtcga gtcaggtgca cttgctgaat 4560
taagcgtagc attctccaga gaagggccaa caaaggaata tgtgcaacac aaaatgatgg 4620
acaaagccag cgatatttgg aatatgatca gtcaaggcgc ttatttgtac gtctgtggtg 4680
atgctaaagg catggctcgt gatgtgcata ggtctctgca taccattgca caggagcagg 4740
gtagtatgga ttcaactaag gcggaagggt ttgtcaagaa cttacagacg tctggcagat 4800
atttaaggga cgtttggtaa ttgtcgatat catgtaatta gttatgtcac gcttacattc 4860
acgccctcct cccacatccg ctctaaccga aaaggaagga gttagacaac ctgaagtcta 4920
ggtccctatt tatttttttt aatagttatg ttagtattaa gaacgttatt tatatttcaa 4980
atttttcttt tttttctgta caaacgcgtg tacgcatgta acattatact gaaaaccttg 5040
cttgagaagg ttttgggacg ctcgaaggct ttaatttgca agggcactgg tcactctttt 5100
ggcattctct agcattcgtg catgcagatc atcttaggtc tgtctgccct agtcgctctt 5160
tagaatatgt gtctctagta ctatc 5185
<210> 5
<211> 3428
<212> DNA
<213> Artificial Sequence
<220>
<223> DPPF3 polynucleotide sequence
<400> 5
ggcactggtc actcttttgg cattctctag cattcgtgca tgcagatcat cttaggtctg 60
tctgccctag tcgctcttta gaatatgtgt ctctagtact atcataaaaa acacgctttt 120
tcagttcgag tttatcatta tcaatactgc catttcaaag aatacgtaaa taattaatag 180
tagtgatttt cctaacttta tttagtcaaa aaattagcct tttaattctg ctgtaacccg 240
tacatgccca aaataggggg cgggttacac agaatatata acatcgtagg tgtctgggtg 300
aacagtttat tcctggcatc cactaaatat aatggagccc gctttttaag ctggcatcca 360
gaaaaaaaaa gaatcccagc accaaaatat tgttttcttc accaaccatc agttcatagg 420
tccattctct tagcgcaact acagagaaca ggggcacaaa caggcaaaaa acgggcacaa 480
cctcaatgga gtgatgcaac ctgcctggag taaatgatga cacaaggcaa ttgacccacg 540
catgtatcta tctcattttc ttacaccttc tattaccttc tgctctctct gatttggaaa 600
aagctgaaaa aaaaggttga aaccagttcc ctgaaattat tcccctactt gactaataag 660
tatataaaga cggtaggtat tgattgtaat tctgtaaatc tatttcttaa acttcttaaa 720
ttctactttt atagttagtc ttttttttag ttttaaaaca ccaagaactt agtttcgaat 780
aaacacacat aaacaaacaa aatgacttct catggtggtc aaactaatcc aacaaatttg 840
attattgata ctacaaagga aagaatacaa aaattgttta aaaatgttga aatttctgtt 900
tcttcttatg atacagcatg ggttgctatg gttccatctc caaattcacc aaagtctcca 960
tgtttcccag aatgtttaaa ttggttgatt aataaccaat taaatgatgg ttcttggggt 1020
ttggttaacc atactcataa ccataaccat ccattgttga aggattcatt atcttcaaca 1080
ttggcatgta tcgttgcttt gaaaagatgg aatgttggtg aagatcaaat taataagggt 1140
ttgtctttta ttgaatctaa tttggcatca gctactgata aatcacaacc atctcctatt 1200
ggtttcgata ttatcttccc aggtttgtta gaatatgcta aaaatttgga tattaatttg 1260
ttgtctaaac aaacagattt ttcattgatg ttacataaaa gagaattaga acaaaagaga 1320
tgtcattcta acgaaattga tggttatttg gcttacatct cagaaggttt aggcaatttg 1380
tacgattgga acatggttaa gaaataccaa atgaagaacg gttctgtttt taattctcca 1440
tcagcaactg ctgctgcttt tattaatcat caaaatccag gttgtttaaa ttacttaaat 1500
tctttgttag ataaatttgg taatgctgtt ccaactgttt acccattgga tttgtacatc 1560
agattatcta tggttgatac tattgaaaga ttgggtattt cacatcattt tagagttgaa 1620
attaaaaatg ttttagatga aacttataga tgttgggttg aaagagatga acaaattttt 1680
atggatgttg ttacttgtgc attggctttt agattgttga gaatacatgg ttacaaagtt 1740
tctccagatc aattagcaga aatcactaac gaattggctt ttaaagatga atacgcagct 1800
ttagaaacat accatgcatc acaaatttta taccaagaag atttgtcttc aggtaaacaa 1860
attttgaagt ctgctgattt cttgaagggt attttgtcaa ctgattctaa tagattatca 1920
aaattgatcc ataaggaagt tgaaaatgct ttaaaatttc caattaatac tggtttggaa 1980
agaattaata caagaagaaa catccaatta tataatgttg ataatacaag aattttgaaa 2040
actacatacc attcttcaaa tatttctaac acttactact tgagattagc tgttgaagat 2100
ttctacactt gtcaatctat atatagagaa gaattaaaag gtttggaaag atgggttgtt 2160
caaaataagt tggatcaatt aaagttcgca agacaaaaga ctgcttattg ttacttttct 2220
gttgcagcta cattatcttc accagaattg tctgatgcaa gaatttcatg ggctaaaaat 2280
ggtattttaa ctacagttgt tgatgatttc tttgatattg gtggtactat tgatgaatta 2340
acaaatttga tccaatgtgt tgaaaagtgg aatgttgatg ttgataagga ttgttgttct 2400
gaacacgtta gaattttgtt tttagcattg aaggatgcta tttgttggat tggtgacgaa 2460
gcctttaaat ggcaagctag agatgttact tctcatgtta tccaaacatg gttggaattg 2520
atgaactcta tgttgagaga agcaatttgg actagagatg cttatgttcc aacattaaac 2580
gaatacatgg aaaacgcata cgtttctttt gctttgggtc ctattgttaa accagctatc 2640
tattttgttg gtccaaaatt gtcagaagaa atcgttgaat cttcagaata ccataatttg 2700
tttaaattga tgtctactca aggtagattg ttgaacgata ttcattcttt taaaagagag 2760
tttaaagagg gtaaattgaa tgcagttgct ttacatttgt ctaatggtga atctggtaaa 2820
gttgaagaag aagttgttga agaaatgatg atgatgatta aaaataagag aaaggaatta 2880
atgaaattga tcttcgaaga aaacggttct attgttccaa gagcatgtaa agatgctttt 2940
tggaatatgt gtcatgtttt gaatttcttt tatgctaacg atgatggttt tactggtaac 3000
acaattttgg atacagttaa agatattatc tataatccat tagttttggt taatgaaaat 3060
gaagaacaaa gataaatcat gtaattagtt atgtcacgct tacattcacg ccctcctccc 3120
acatccgctc taaccgaaaa ggaaggagtt agacaacctg aagtctaggt ccctatttat 3180
tttttttaat agttatgtta gtattaagaa cgttatttat atttcaaatt tttctttttt 3240
ttctgtacaa acgcgtgtac gcatgtaaca ttatactgaa aaccttgctt gagaaggttt 3300
tgggacgctc gaaggcttta atttgcctct cgccgctcgc catcgtctcc tccggacttg 3360
aacttgtccg ccattggtgc atgctgtcta aaaacccacc accatgtagg cccaccaggc 3420
aaggacag 3428
<210> 6
<211> 6561
<212> DNA
<213> Artificial Sequence
<220>
<223> DPPF4 polynucleotide sequence
<400> 6
ctctcgccgc tcgccatcgt ctcctccgga cttgaacttg tccgccattg gtgcatgctg 60
tctaaaaacc caccaccatg taggcccacc aggcaaggac agatccaact ggcaccgctg 120
gcttgaacaa caataccagc cttccaactt ctgtaaataa cggcggtacg ccagtgccac 180
cagtaccgtt acctttcggt atacctcctt tccccatgtt tccaatgccc ttcatgcctc 240
caacggctac tatcacaaat cctcatcaag ctgacgcaag ccctaagaaa tgaataacaa 300
tactgacagt actaaataat tgcctacttg gcttcacata cgttgcatac gtcgatatag 360
ataataatga taatgacagc aggattatcg taatacgtaa tagttgaaaa tctcaaaaat 420
gtgtgggtca ttacgtaaat aatgatagga atgggattct tctatttttc ctttttccat 480
tctagcagcc gtcgggaaaa cgtggcatcc tctctttcgg gctcaattgg agtcacgctg 540
ccgtgagcat cctctctttc catatctaac aactgagcac gtaaccaatg gaaaagcatg 600
agcttagcgt tgctccaaaa aagtattgga tggttaatac catttgtctg ttctcttctg 660
actttgactc ctcaaaaaaa aaaaatctac aatcaacaga tcgcttcaat tacgccctca 720
caaaaacttt tttccttctt cttcgcccac gttaaatttt atccctcatg ttgtctaacg 780
gatttctgca cttgatttat tataaaaaga caaagacata atacttctct atcaatttca 840
gttattgttc ttccttgcgt tattcttctg ttcttctttt tcttttgtca tatataacca 900
taaccaagta atacatattc aaaatggatg cagtaactgg attgctgacc gttccagcta 960
cggctataac tattgggggc actgcggttg cattggctgt tgctttgatt ttttggtatt 1020
tgaagtctta cacgagtgcc agaagatccc aaagcaacca tttgccacgg gtgcctgaag 1080
tgccaggggt tccactcttg ggtaatctgt tgcaattgaa agaaaaaaaa ccatatatga 1140
cttttaccag atgggcggct acttatggac ctatttactc tattaagacc ggtgctacga 1200
gtatggttgt cgtttcctca aatgagattg ctaaggaagc tttggttact cgttttcaat 1260
ctatatcgac taggaacttg agcaaggcct tgaaagtttt gactgctgac aaaactatgg 1320
ttgccatgtc tgattacgac gattatcata agactgttaa aagacatatt ttgactgctg 1380
ttttaggtcc caatgcccaa aaaaagcaca gaatccaccg cgatattatg atggataata 1440
tttcgactca attgcatgaa ttcgttaaaa ataacccaga gcaggaagaa gttgatttga 1500
gaaaaatatt tcagtcagaa ttgtttggtt tggctatgag acaagctttg ggcaaggatg 1560
ttgagtcttt atatgtggaa gatttgaaaa tcactatgaa ccgtgacgag atctttcaag 1620
ttttagttgt tgacccgatg atgggcgcga ttgatgttga ttggagagac ttcttccctt 1680
atcttaaatg ggtcccgaat aaaaagttcg agaacactat tcaacaaatg tacattagaa 1740
gagaagccgt tatgaaatcc ttgatcaagg agcacaagaa gagaatagct tctggagaaa 1800
aattgaattc ttatattgat tacttgttgt ctgaagcaca aactttgacc gaccagcaat 1860
tgttgatgtc tctctgggag ccaattattg aatcttctga tacaactatg gttacgactg 1920
aatgggccat gtacgaactg gctaaaaatc ctaagttgca ggatagattg tatagagata 1980
tcaaatctgt ttgtggttcc gagaaaataa ctgaagagca tttgtctcaa ttgccatata 2040
tcactgcaat tttccatgaa actcttagaa ggcacagtcc agtccctatt atcccattga 2100
gacatgttca cgaagacact gtccttggtg gctaccatgt tcccgctggc actgaattgg 2160
cggttaatat ttatggttgc aatatggata agaacgtttg ggaaaatcca gaagaatgga 2220
accctgaaag atttatgaaa gagaacgaaa ctattgactt tcaaaagact atggcattcg 2280
gtggtggaaa aagagtttgt gctggtagct tgcaggcctt gttgactgca tctattggga 2340
ttggaagaat ggtccaggaa tttgaatgga aattgaaaga tatgactcaa gaagaggtta 2400
atacaatcgg tttgactact caaatgctga gacctttacg agcaataata aaaccaagaa 2460
tttaagatta atataattat ataaaaatat tatcttcttt tctttatatc tagtgttatg 2520
taaaataaat tgatgactac ggaaagcttt tttatattgt ttctttttca ttctgagcca 2580
cttaaatttc gtgaatgttc ttgtaaggga cggtagattt acaagtgata caacaaaaag 2640
caaggcgctt tttctaataa aaagaagaaa agcatttaac aattgaacac ctctatatca 2700
acgaagaata ttactttgtc tctaaatcct tgtaaaatgt gtacgatctc tatatgggtt 2760
actcataagt gtaccgaaga ctgcattgaa agtttatgtt ttttcactgg aggcgtcatt 2820
ttcgcgttga gaagatgttc ttatccaaat ttcaactgtt atatagaaga gcgcgaaagt 2880
ttttccggca agctaaatgg aaaaaggaaa gattattgaa agagaaagaa agaaaaaaaa 2940
aaaatgtaca cccagacatc gggcttccac aatttcggct ctattgtttt ccatctctcg 3000
caacggcggg attcctctat ggcgtgtgat gtctgtatct gttacttaat ccagaaactg 3060
gcacttgacc caactctgcc acgtgggtcg ttttgccatc gacagattgg gagattttca 3120
tagtagaatt cagcatgata gctacgtaaa tgtgttccgc accgtcacaa agtgttttct 3180
actgttcttt cttctttcgt tcattcagtt gagttgagtg agtgctttgt tcaatggatc 3240
ttagctaaaa tgcatatttt ttctcttggt aaatgaatgc ttgtgatgtc ttccaagtga 3300
tttcctttcc ttcccatatg atgctaggta cctttagtgt cttcctaaaa aaaaaaaaag 3360
gctcgccatc aaaacgatat tcgttggctt ttttttctga attataaata ctctttggta 3420
acttttcatt tccaagaacc tcttttttcc agttatatca tggtcccctt tcaaagttat 3480
tctctactct ttttcatatt cattcttttt catcctttgg ttttttattc ttaacttgtt 3540
tattattctc tcttgtttct atttacaaga caccaatcaa aacaaataaa acatcatcac 3600
aatgccgttt ggaatagaca acaccgactt cactgtcctg gcggggctag tgcttgccgt 3660
gctactgtac gtaaagagaa actccatcaa ggaactgctg atgtccgatg acggagatat 3720
cacagctgtc agctcgggca acagagacat tgctcaggtg gtgaccgaaa acaacaagaa 3780
ctacttggtg ttgtatgcgt cgcagactgg gactgccgag gattacgcca aaaagttttc 3840
caaggagctg gtggccaagt tcaacctaaa cgtgatgtgc gcagatgttg agaactacga 3900
ctttgagtcg ctaaacgatg tgcccgtcat agtctcgatt tttatctcta catatggtga 3960
aggagacttc cccgacgggg cggtcaactt tgaagacttt atttgtaatg cggaagcggg 4020
tgcactatcg aacctgaggt ataatatgtt tggtctggga aattctactt atgaattctt 4080
taatggtgcc gccaagaagg ccgagaagca tctctccgcc gcgggcgcta tcagactagg 4140
caagctcggt gaagctgatg atggtgcagg aactacagac gaagattaca tggcctggaa 4200
ggactccatc ctggaggttt tgaaagacga actgcatttg gacgaacagg aagccaagtt 4260
cacctctcaa ttccagtaca ctgtgttgaa cgaaatcact gactccatgt cgcttggtga 4320
accctctgct cactatttgc cctcgcatca gttgaaccgc aacgcagacg gcatccaatt 4380
gggtcccttc gatttgtctc aaccgtatat tgcacccatc gtgaaatctc gcgaactgtt 4440
ctcttccaat gaccgtaatt gcatccactc tgaatttgac ttgtccggct ctaacatcaa 4500
gtactccact ggtgaccatc ttgctgtttg gccttccaac ccattggaaa aggtcgaaca 4560
gttcttatcc atattcaacc tggaccctga aaccattttt gacttgaagc ccctggatcc 4620
caccgtcaaa gtgcccttcc caacgccaac tactattggc gctgctatta aacactattt 4680
ggaaattaca ggacctgtct ccagacaatt gttttcatct ttgattcagt tcgcccccaa 4740
cgctgacgtc aaggaaaaat tgactctgct ttcgaaagac aaggaccaat tcgccgtcga 4800
gataacctcc aaatatttca acatcgcaga tgctctgaaa tatttgtctg atggcgccaa 4860
atgggacacc gtacccatgc aattcttggt cgaatcagtt ccccaaatga ctcctcgtta 4920
ctactctatc tcttcctctt ctctgtctga aaagcaaacc gtccatgtca cctccattgt 4980
ggaaaacttt cctaacccag aattgcctga tgctcctcca gttgttggtg ttacgactaa 5040
cttgttaaga aacattcaat tggctcaaaa caatgttaac attgccgaaa ctaacctacc 5100
tgttcactac gatttaaatg gcccacgtaa acttttcgcc aattacaaat tgcccgtcca 5160
cgttcgtcgt tctaacttca gattgccttc caacccttcc accccagtta tcatgatcgg 5220
tccaggtacc ggtgttgccc cattccgtgg gtttatcaga gagcgtgtcg cgttcctcga 5280
atcacaaaag aagggcggta acaacgtttc gctaggtaag catatactgt tttatggatc 5340
ccgtaacact gatgatttct tgtaccagga cgaatggcca gaatacgcca aaaaattgga 5400
tggttcgttc gaaatggtcg tggcccattc caggttgcca aacaccaaaa aagtttatgt 5460
tcaagataaa ttaaaggatt acgaagacca agtatttgaa atgattaaca acggtgcatt 5520
tatctacgtc tgtggtgatg caaagggtat ggccaagggt gtgtcaaccg cattggttgg 5580
catcttatcc cgtggtaaat ccattaccac tgatgaagca acagagctaa tcaagatgct 5640
caagacttca ggtagatacc aagaagatgt ctggtaatca gcccactgat caagccttcg 5700
gcgcggttgt tcaaccacac gatctgtatc aaagaaaaat aagttagata accaaaaaaa 5760
aaaaaaattt catactcact ataagaaatc atacgcagtt caacttttgc ttttacatac 5820
aattttatct atatattcgt gcttctgcga tgtccttatt tatccgatga aggtatgtaa 5880
gaataaaaaa gaatatatac tccacatgac atacgaaata tacgtattta ttgttctgta 5940
tggaataaca gcgattacat aaagatgaca tgttacttct ttattcaaat taatcttgac 6000
gtgcaagggc ctgcttgtta tttcatcgga caatcccaac atcactttac acgaaagcct 6060
tagaagttta ttatttgttt taagttggac tatagtgatg taggtagttt cttaggaagc 6120
agttgagtag ctgatttttg agataagaac ctggtgtaat caatctataa acagcctaga 6180
atctttttaa gcaaatttac ttttacattt atctctatct tctttcttac aagaagttat 6240
tttcattaca aaaggacatt aaatacacta aattttcaat ctttacattg ttggaaagcc 6300
tcgttgtctt ttaagatttt ataagcattg attttttttt tcaataattt tccgttcccc 6360
ttaacacata ctatgtataa atgtcattga gtcatctcac tttagatcaa tattatgaaa 6420
tacagtgcaa cgaacttgaa gcgatacgtt ccatttatat ggatgacttt actgacttaa 6480
ctaaaagaaa gtctagctgg gataagcagc cacagattat attcgaaatt acgcttcgat 6540
ctgttgacaa agagccggtt g 6561
<210> 7
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward PCR primer for DPPF1 polynucleotide sequence
<400> 7
cgccgagggt attttacttc c 21
<210> 8
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse PCR primer for DPPF1 polynucleotide sequence
<400> 8
gcactcgaaa cttcaggttc 20
<210> 9
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward PCR primer for DPPF2 polynucleotide sequence
<400> 9
ggtgttatcg ttgctggtgg 20
<210> 10
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse PCR primer for DPPF2 polynucleotide sequence
<400> 10
gatagtacta gagacacata ttc 23
<210> 11
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward PCR primer for DPPF3 polynucleotide sequence
<400> 11
ggcactggtc actcttttgg 20
<210> 12
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse PCR primer for DPPF3 polynucleotide sequence
<400> 12
ctgtccttgc ctggtggg 18
<210> 13
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward PCR primer for DPPF4 polynucleotide sequence
<400> 13
ctctcgccgc tcgccatc 18
<210> 14
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse PCR primer for DPPF4 polynucleotide sequence
<400> 14
caaccggctc tttgtcaaca g 21
<210> 15
<211> 335
<212> PRT
<213> Artificial Sequence
<220>
<223> Amino acid sequence of geranylgeranyl pyrophosphate synthase
(BTS1) of Saccharomyces cerevisiae
<400> 15
Met Glu Ala Lys Ile Asp Glu Leu Ile Asn Asn Asp Pro Val Trp Ser
1 5 10 15
Ser Gln Asn Glu Ser Leu Ile Ser Lys Pro Tyr Asn His Ile Leu Leu
20 25 30
Lys Pro Gly Lys Asn Phe Arg Leu Asn Leu Ile Val Gln Ile Asn Arg
35 40 45
Val Met Asn Leu Pro Lys Asp Gln Leu Ala Ile Val Ser Gln Ile Val
50 55 60
Glu Leu Leu His Asn Ser Ser Leu Leu Ile Asp Asp Ile Glu Asp Asn
65 70 75 80
Ala Pro Leu Arg Arg Gly Gln Thr Thr Ser His Leu Ile Phe Gly Val
85 90 95
Pro Ser Thr Ile Asn Thr Ala Asn Tyr Met Tyr Phe Arg Ala Met Gln
100 105 110
Leu Val Ser Gln Leu Thr Thr Lys Glu Pro Leu Tyr His Asn Leu Ile
115 120 125
Thr Ile Phe Asn Glu Glu Leu Ile Asn Leu His Arg Gly Gln Gly Leu
130 135 140
Asp Ile Tyr Trp Arg Asp Phe Leu Pro Glu Ile Ile Pro Thr Gln Glu
145 150 155 160
Met Tyr Leu Asn Met Val Met Asn Lys Thr Gly Gly Leu Phe Arg Leu
165 170 175
Thr Leu Arg Leu Met Glu Ala Leu Ser Pro Ser Ser His His Gly His
180 185 190
Ser Leu Val Pro Phe Ile Asn Leu Leu Gly Ile Ile Tyr Gln Ile Arg
195 200 205
Asp Asp Tyr Leu Asn Leu Lys Asp Phe Gln Met Ser Ser Glu Lys Gly
210 215 220
Phe Ala Glu Asp Ile Thr Glu Gly Lys Leu Ser Phe Pro Ile Val His
225 230 235 240
Ala Leu Asn Phe Thr Lys Thr Lys Gly Gln Thr Glu Gln His Asn Glu
245 250 255
Ile Leu Arg Ile Leu Leu Leu Arg Thr Ser Asp Lys Asp Ile Lys Leu
260 265 270
Lys Leu Ile Gln Ile Leu Glu Phe Asp Thr Asn Ser Leu Ala Tyr Thr
275 280 285
Lys Asn Phe Ile Asn Gln Leu Val Asn Met Ile Lys Asn Asp Asn Glu
290 295 300
Asn Lys Tyr Leu Pro Asp Leu Ala Ser His Ser Asp Thr Ala Thr Asn
305 310 315 320
Leu His Asp Glu Leu Leu Tyr Ile Ile Asp His Leu Ser Glu Leu
325 330 335
<210> 16
<211> 376
<212> PRT
<213> Artificial Sequence
<220>
<223> GGDP synthase (crtE) of Xanthophyllomyces dendrorhous
<400> 16
Met Asp Tyr Ala Asn Ile Leu Thr Ala Ile Pro Leu Glu Phe Thr Pro
1 5 10 15
Gln Asp Asp Ile Val Leu Leu Glu Pro Tyr His Tyr Leu Gly Lys Asn
20 25 30
Pro Gly Lys Glu Ile Arg Ser Gln Leu Ile Glu Ala Phe Asn Tyr Trp
35 40 45
Leu Asp Val Lys Lys Glu Asp Leu Glu Val Ile Gln Asn Val Val Gly
50 55 60
Met Leu His Thr Ala Ser Leu Leu Met Asp Asp Val Glu Asp Ser Ser
65 70 75 80
Val Leu Arg Arg Gly Ser Pro Val Ala His Leu Ile Tyr Gly Ile Pro
85 90 95
Gln Thr Ile Asn Thr Ala Asn Tyr Val Tyr Phe Leu Ala Tyr Gln Glu
100 105 110
Ile Phe Lys Leu Arg Pro Thr Pro Ile Pro Met Pro Val Ile Pro Pro
115 120 125
Ser Ser Ala Ser Leu Gln Ser Ser Val Ser Ser Ala Ser Ser Ser Ser
130 135 140
Ser Ala Ser Ser Glu Asn Gly Gly Thr Ser Thr Pro Asn Ser Gln Ile
145 150 155 160
Pro Phe Ser Lys Asp Thr Tyr Leu Asp Lys Val Ile Thr Asp Glu Met
165 170 175
Leu Ser Leu His Arg Gly Gln Gly Leu Glu Leu Phe Trp Arg Asp Ser
180 185 190
Leu Thr Cys Pro Ser Glu Glu Glu Tyr Val Lys Met Val Leu Gly Lys
195 200 205
Thr Gly Gly Leu Phe Arg Ile Ala Val Arg Leu Met Met Ala Lys Ser
210 215 220
Glu Cys Asp Ile Asp Phe Val Gln Leu Val Asn Leu Ile Ser Ile Tyr
225 230 235 240
Phe Gln Ile Arg Asp Asp Tyr Met Asn Leu Gln Ser Ser Glu Tyr Ala
245 250 255
His Asn Lys Asn Phe Ala Glu Asp Leu Thr Glu Gly Lys Phe Ser Phe
260 265 270
Pro Thr Ile His Ser Ile His Ala Asn Pro Ser Ser Arg Leu Val Ile
275 280 285
Asn Thr Leu Gln Lys Lys Ser Thr Ser Pro Glu Ile Leu His His Cys
290 295 300
Val Asn Tyr Met Arg Thr Glu Thr His Ser Phe Glu Tyr Thr Gln Glu
305 310 315 320
Val Leu Asn Thr Leu Ser Gly Ala Leu Glu Arg Glu Leu Gly Arg Leu
325 330 335
Gln Gly Glu Phe Ala Glu Ala Asn Ser Lys Ile Asp Leu Gly Asp Val
340 345 350
Glu Ser Glu Gly Arg Thr Gly Lys Asn Val Lys Leu Glu Ala Ile Leu
355 360 365
Lys Lys Leu Ala Asp Ile Pro Leu
370 375
<210> 17
<211> 352
<212> PRT
<213> Artificial Sequence
<220>
<223> Amino acid sequence of GGPP synthase ERG20F96C from Saccharomyces
cerevisiae
<400> 17
Met Ala Ser Glu Lys Glu Ile Arg Arg Glu Arg Phe Leu Asn Val Phe
1 5 10 15
Pro Lys Leu Val Glu Glu Leu Asn Ala Ser Leu Leu Ala Tyr Gly Met
20 25 30
Pro Lys Glu Ala Cys Asp Trp Tyr Ala His Ser Leu Asn Tyr Asn Thr
35 40 45
Pro Gly Gly Lys Leu Asn Arg Gly Leu Ser Val Val Asp Thr Tyr Ala
50 55 60
Ile Leu Ser Asn Lys Thr Val Glu Gln Leu Gly Gln Glu Glu Tyr Glu
65 70 75 80
Lys Val Ala Ile Leu Gly Trp Cys Ile Glu Leu Leu Gln Ala Tyr Cys
85 90 95
Leu Val Ala Asp Asp Met Met Asp Lys Ser Ile Thr Arg Arg Gly Gln
100 105 110
Pro Cys Trp Tyr Lys Val Pro Glu Val Gly Glu Ile Ala Ile Asn Asp
115 120 125
Ala Phe Met Leu Glu Ala Ala Ile Tyr Lys Leu Leu Lys Ser His Phe
130 135 140
Arg Asn Glu Lys Tyr Tyr Ile Asp Ile Thr Glu Leu Phe His Glu Val
145 150 155 160
Thr Phe Gln Thr Glu Leu Gly Gln Leu Met Asp Leu Ile Thr Ala Pro
165 170 175
Glu Asp Lys Val Asp Leu Ser Lys Phe Ser Leu Lys Lys His Ser Phe
180 185 190
Ile Val Thr Phe Lys Thr Ala Tyr Tyr Ser Phe Tyr Leu Pro Val Ala
195 200 205
Leu Ala Met Tyr Val Ala Gly Ile Thr Asp Glu Lys Asp Leu Lys Gln
210 215 220
Ala Arg Asp Val Leu Ile Pro Leu Gly Glu Tyr Phe Gln Ile Gln Asp
225 230 235 240
Asp Tyr Leu Asp Cys Phe Gly Thr Pro Glu Gln Ile Gly Lys Ile Gly
245 250 255
Thr Asp Ile Gln Asp Asn Lys Cys Ser Trp Val Ile Asn Lys Ala Leu
260 265 270
Glu Leu Ala Ser Ala Glu Gln Arg Lys Thr Leu Asp Glu Asn Tyr Gly
275 280 285
Lys Lys Asp Ser Val Ala Glu Ala Lys Cys Lys Lys Ile Phe Asn Asp
290 295 300
Leu Lys Ile Glu Gln Leu Tyr His Glu Tyr Glu Glu Ser Ile Ala Lys
305 310 315 320
Asp Leu Lys Ala Lys Ile Ser Gln Val Asp Glu Ser Arg Gly Phe Lys
325 330 335
Ala Asp Val Leu Thr Ala Phe Leu Asn Lys Val Tyr Lys Arg Ser Lys
340 345 350
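The protein entries use three-letter residue codes, which are awkward to search or align. As a minimal Python sketch (the function and variable names are illustrative assumptions, not part of the patent), the first 96 residues of SEQ ID NO: 17 can be converted to one-letter codes and residue 96 inspected; it should be cysteine if the sequence agrees with the F96C label in its <223> line.

```python
# Illustrative sketch; names are hypothetical. Residues are copied from
# the first six residue lines of SEQ ID NO: 17 in the listing.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert whitespace-separated three-letter residue codes to one-letter codes."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


SEQ17_FIRST_96 = """
Met Ala Ser Glu Lys Glu Ile Arg Arg Glu Arg Phe Leu Asn Val Phe
Pro Lys Leu Val Glu Glu Leu Asn Ala Ser Leu Leu Ala Tyr Gly Met
Pro Lys Glu Ala Cys Asp Trp Tyr Ala His Ser Leu Asn Tyr Asn Thr
Pro Gly Gly Lys Leu Asn Arg Gly Leu Ser Val Val Asp Thr Tyr Ala
Ile Leu Ser Asn Lys Thr Val Glu Gln Leu Gly Gln Glu Glu Tyr Glu
Lys Val Ala Ile Leu Gly Trp Cys Ile Glu Leu Leu Gln Ala Tyr Cys
"""

seq17_prefix = to_one_letter(SEQ17_FIRST_96)

# Positions in the listing are 1-based, so residue 96 is index 95.
residue_96 = seq17_prefix[95]

print(len(seq17_prefix), residue_96)
```

Passing the full residue block of any entry to `to_one_letter` and taking `len()` of the result gives a quick check against that entry's declared <211> length.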
Claims (11)
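- A composition for producing a steviol glycoside, comprising:
a recombinant host cell that comprises a gene encoding a CDPS/KS bifunctional enzyme protein having the amino acid sequence of SEQ ID NO: 1, and that further comprises at least one selected from the group consisting of a gene encoding an ent-kaurene oxidase (KO) polypeptide, a gene encoding an ent-kaurenoic acid hydroxylase (KAH) polypeptide, and a gene encoding a cytochrome P450 reductase (CPR) polypeptide;
cells of the recombinant host cell; a culture of the recombinant host cell; or an extract obtained from a culture of the recombinant host cell.
- The composition for producing a steviol glycoside of claim 1, wherein the enzyme protein is derived from a moss.
- The composition for producing a steviol glycoside of claim 1, wherein the enzyme protein is encoded by a gene comprising the nucleotide sequence of SEQ ID NO: 2.
- (Deleted)
- The composition for producing a steviol glycoside of claim 1, wherein the recombinant host cell produces a steviol glycoside or a precursor thereof.
- The composition for producing a steviol glycoside of claim 1, wherein the recombinant host cell further comprises a gene encoding a geranylgeranyl pyrophosphate (GGPP) synthase.
- (Deleted)
- The composition for producing a steviol glycoside of claim 1, wherein the recombinant host comprises a plant cell, a mammalian cell, an insect cell, a fungal cell, or a bacterial cell.
- The composition for producing a steviol glycoside of claim 8, wherein the fungal cell is a yeast cell.
- (Deleted)
- The composition of claim 5, wherein the precursor is at least one selected from the group consisting of FPP (farnesyl pyrophosphate), GGPP (geranylgeranyl pyrophosphate), CPP (copalyl pyrophosphate), kaurene, and kaurenoic acid.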
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020180167932A KR102618015B1 (ko) | 2018-12-21 | 2018-12-21 | Production of steviol or precursor thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020180167932A KR102618015B1 (ko) | 2018-12-21 | 2018-12-21 | Production of steviol or precursor thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20200078807A (ko) | 2020-07-02 |
KR102618015B1 (ko) | 2023-12-26 |
Family
ID=71599478
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020180167932A KR102618015B1 (ko) | 2018-12-21 | 2018-12-21 | Production of steviol or precursor thereof |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR102618015B1 (ko) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
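BR112014003037B1 (pt) * | 2011-08-08 | 2022-04-05 | Evolva Sa | Recombinant host and method for producing a steviol glycoside |
KR101669044B1 (ko) * | 2015-03-31 | 2016-10-25 | Ajou University Industry-Academic Cooperation Foundation | Recombinant microorganism capable of producing steviolbioside and method for producing steviolbioside using the same |
KR101669057B1 (ko) * | 2015-03-31 | 2016-10-25 | Ajou University Industry-Academic Cooperation Foundation | Recombinant microorganism capable of producing steviolmonoside and method for producing steviolmonoside using the same |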
- 2018-12-21: KR application KR1020180167932A, patent KR102618015B1 (ko), status: active, IP Right Grant
Non-Patent Citations (2)
Title |
---|
FEBS J.,278:123-133(2011.12.31.)* |
GenBank: BAJ39816.1(2011.1.15.)* |
Also Published As
Publication number | Publication date |
---|---|
KR20200078807A (ko) | 2020-07-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210198711A1 (en) | Production of steviol glycosides in recombinant hosts | |
CN107567492B (zh) | UDP-glycosyltransferases | |
US20210155966A1 (en) | Production of steviol glycosides in recombinant hosts | |
US20220195477A1 (en) | Production of steviol glycosides in recombinant hosts | |
CA2973674A1 (en) | Production of steviol glycosides in recombinant hosts | |
US20200291442A1 (en) | Production of steviol glycosides in recombinant hosts | |
US11396669B2 (en) | Production of steviol glycosides in recombinant hosts | |
Shrestha et al. | Expression of chitin deacetylase from Colletotrichum lindemuthianum in Pichia pastoris: purification and characterization | |
CN107922913B (zh) | Steviol glycoside transport | |
TW201107482A (en) | Method for producing itaconic acid in yeast | |
CN107922465B (zh) | Steviol glycoside transport | |
CN114207108A (zh) | Genetically modified host cells producing glycosylated cannabinoids | |
WO2018211032A1 (en) | Production of steviol glycosides in recombinant hosts | |
CN102361975A (zh) | Improved milk-clotting protease derived from a microorganism | |
EP3249044A1 (en) | Method for preparing mogroside | |
KR102618015B1 (ko) | Production of steviol or precursor thereof | |
US11268118B2 (en) | Method for producing steviol and steviol glycoside using AOBGL1 homolog | |
KR102379608B1 (ko) | Improved production of steviol glycoside compounds or precursors thereof | |
US20190048356A1 (en) | Production of steviol glycosides in recombinant hosts | |
KR102237465B1 (ko) | Yeast with introduced inulosucrase activity and method for producing fructooligosaccharides using the same | |
US20060099680A1 (en) | Yeast transformant into which genes associated with synthesis system of O-fucosylated protein are introduced | |
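KR102171224B1 (ko) | Yeast with introduced inulin fructotransferase activity and method for producing difructose anhydride III and fructooligosaccharides using the same | |
CN112920959A (zh) | Method for increasing L-menthol yield in yeast | |
KR20220091425A (ko) | Glycosyltransferase and method for producing steviol glycosides using the same |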
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant |