CN113195726A - 莎草奥酮的微生物生产 - Google Patents
莎草奥酮的微生物生产 Download PDFInfo
- Publication number
- CN113195726A CN113195726A CN201980060571.5A CN201980060571A CN113195726A CN 113195726 A CN113195726 A CN 113195726A CN 201980060571 A CN201980060571 A CN 201980060571A CN 113195726 A CN113195726 A CN 113195726A
- Authority
- CN
- China
- Prior art keywords
- leu
- glu
- ala
- ser
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000000813 microbial effect Effects 0.000 title claims abstract description 83
- YALFFHSIVPCNLF-QPSCCSFWSA-N Cyperolone Chemical compound C1[C@H](C(=C)C)CC[C@@]2(C)CC[C@H](O)[C@]21C(C)=O YALFFHSIVPCNLF-QPSCCSFWSA-N 0.000 title claims abstract description 63
- YALFFHSIVPCNLF-UHFFFAOYSA-N Cyperolone Natural products C1C(C(=C)C)CCC2(C)CCC(O)C21C(C)=O YALFFHSIVPCNLF-UHFFFAOYSA-N 0.000 title claims abstract description 63
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 33
- 102000004190 Enzymes Human genes 0.000 claims abstract description 81
- 108090000790 Enzymes Proteins 0.000 claims abstract description 81
- ADIDQIZBYUABQK-UHFFFAOYSA-N α-guaiene Chemical compound C1C(C(C)=C)CCC(C)C2=C1C(C)CC2 ADIDQIZBYUABQK-UHFFFAOYSA-N 0.000 claims abstract description 40
- 238000000034 method Methods 0.000 claims abstract description 27
- 229910052799 carbon Inorganic materials 0.000 claims abstract description 23
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims abstract description 21
- 238000006243 chemical reaction Methods 0.000 claims abstract description 15
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 71
- XURCUMFVQKJMJP-UHFFFAOYSA-N Dihydro-alpha-guaien Natural products C1C(C(C)C)CCC(C)C2=C1C(C)CC2 XURCUMFVQKJMJP-UHFFFAOYSA-N 0.000 claims description 38
- 229930000038 α-guaiene Natural products 0.000 claims description 38
- 238000006467 substitution reaction Methods 0.000 claims description 37
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims description 33
- 230000037361 pathway Effects 0.000 claims description 27
- 102000007698 Alcohol dehydrogenase Human genes 0.000 claims description 22
- 239000000758 substrate Substances 0.000 claims description 20
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 claims description 18
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 claims description 18
- 241000588724 Escherichia coli Species 0.000 claims description 16
- 102000004316 Oxidoreductases Human genes 0.000 claims description 15
- 108090000854 Oxidoreductases Proteins 0.000 claims description 15
- 108010045510 NADPH-Ferrihemoprotein Reductase Proteins 0.000 claims description 14
- 108090000623 proteins and genes Proteins 0.000 claims description 14
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 claims description 9
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 claims description 9
- 244000063299 Bacillus subtilis Species 0.000 claims description 8
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 8
- 241000894006 Bacteria Species 0.000 claims description 8
- 108010026318 Geranyltranstransferase Proteins 0.000 claims description 8
- 102100035111 Farnesyl pyrophosphate synthase Human genes 0.000 claims description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 7
- 239000000284 extract Substances 0.000 claims description 7
- 108030003441 Alpha-guaiene synthases Proteins 0.000 claims description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 claims description 6
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 claims description 6
- 108010029541 Laccase Proteins 0.000 claims description 6
- 102000004020 Oxygenases Human genes 0.000 claims description 6
- 108090000417 Oxygenases Proteins 0.000 claims description 6
- 239000008103 glucose Substances 0.000 claims description 6
- 230000001580 bacterial effect Effects 0.000 claims description 5
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 claims description 4
- 241000191025 Rhodobacter Species 0.000 claims description 4
- MAKBWIUHFAVVJP-HAXARLPTSA-N (2R,3S)-pentane-1,2,3,4-tetrol phosphoric acid Chemical compound OP(O)(O)=O.CC(O)[C@H](O)[C@H](O)CO MAKBWIUHFAVVJP-HAXARLPTSA-N 0.000 claims description 3
- 241000186226 Corynebacterium glutamicum Species 0.000 claims description 3
- 241000235058 Komagataella pastoris Species 0.000 claims description 3
- 241000235648 Pichia Species 0.000 claims description 3
- 241000589776 Pseudomonas putida Species 0.000 claims description 3
- 241000191043 Rhodobacter sphaeroides Species 0.000 claims description 3
- 241000235070 Saccharomyces Species 0.000 claims description 3
- 241000607598 Vibrio Species 0.000 claims description 3
- 241000235013 Yarrowia Species 0.000 claims description 3
- 241000235015 Yarrowia lipolytica Species 0.000 claims description 3
- 241000588901 Zymomonas Species 0.000 claims description 3
- 241000588902 Zymomonas mobilis Species 0.000 claims description 3
- 150000003278 haem Chemical class 0.000 claims description 3
- 229910052742 iron Inorganic materials 0.000 claims description 3
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 2
- 241000186216 Corynebacterium Species 0.000 claims description 2
- 241000588722 Escherichia Species 0.000 claims description 2
- 229930091371 Fructose Natural products 0.000 claims description 2
- 239000005715 Fructose Substances 0.000 claims description 2
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 claims description 2
- 241000589516 Pseudomonas Species 0.000 claims description 2
- 229930006000 Sucrose Natural products 0.000 claims description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 claims description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims description 2
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- 102220095362 rs876658893 Human genes 0.000 claims description 2
- 239000005720 sucrose Substances 0.000 claims description 2
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 claims 1
- 230000002255 enzymatic effect Effects 0.000 abstract description 6
- 239000000796 flavoring agent Substances 0.000 abstract description 6
- 235000019634 flavors Nutrition 0.000 abstract description 6
- 239000003205 fragrance Substances 0.000 abstract description 5
- 239000000203 mixture Substances 0.000 abstract description 4
- 102000040430 polynucleotide Human genes 0.000 abstract description 3
- 108091033319 polynucleotide Proteins 0.000 abstract description 3
- 239000002157 polynucleotide Substances 0.000 abstract description 3
- 235000000346 sugar Nutrition 0.000 abstract description 3
- 150000008163 sugars Chemical class 0.000 abstract description 3
- 210000004027 cell Anatomy 0.000 description 64
- 108010013835 arginine glutamate Proteins 0.000 description 63
- 241000984061 Aquilaria Species 0.000 description 58
- 108010050848 glycylleucine Proteins 0.000 description 46
- 108010031719 prolyl-serine Proteins 0.000 description 45
- 108010038633 aspartylglutamate Proteins 0.000 description 44
- 108010049041 glutamylalanine Proteins 0.000 description 44
- 108010051242 phenylalanylserine Proteins 0.000 description 44
- 108010034529 leucyl-lysine Proteins 0.000 description 41
- 108010003700 lysyl aspartic acid Proteins 0.000 description 39
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 36
- 239000000047 product Substances 0.000 description 36
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 35
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 34
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 34
- ADIDQIZBYUABQK-RWMBFGLXSA-N alpha-guaiene Chemical compound C1([C@H](CC[C@H](C2)C(C)=C)C)=C2[C@@H](C)CC1 ADIDQIZBYUABQK-RWMBFGLXSA-N 0.000 description 33
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 33
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 32
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 31
- 108010092854 aspartyllysine Proteins 0.000 description 29
- 108010005233 alanylglutamic acid Proteins 0.000 description 28
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 28
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 26
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 25
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 25
- VWFJDQUYCIWHTN-YFVJMOTDSA-N 2-trans,6-trans-farnesyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-YFVJMOTDSA-N 0.000 description 24
- VWFJDQUYCIWHTN-FBXUGWQNSA-N Farnesyl diphosphate Natural products CC(C)=CCC\C(C)=C/CC\C(C)=C/COP(O)(=O)OP(O)(O)=O VWFJDQUYCIWHTN-FBXUGWQNSA-N 0.000 description 24
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 24
- 108010008355 arginyl-glutamine Proteins 0.000 description 24
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 23
- 150000001413 amino acids Chemical class 0.000 description 23
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 22
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 22
- 108010068380 arginylarginine Proteins 0.000 description 22
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 22
- 108010040030 histidinoalanine Proteins 0.000 description 22
- 108010092114 histidylphenylalanine Proteins 0.000 description 22
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 21
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 21
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 21
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 21
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 21
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 21
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 21
- 108010066427 N-valyltryptophan Proteins 0.000 description 21
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 21
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 21
- 108010073969 valyllysine Proteins 0.000 description 21
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 20
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 20
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 20
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 20
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 20
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 20
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 20
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 20
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 20
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 20
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 20
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 20
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 20
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 20
- 108010037850 glycylvaline Proteins 0.000 description 20
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 19
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 19
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 19
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 19
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 19
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 19
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 19
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 19
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 19
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 19
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 19
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 19
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 19
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 19
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 19
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 19
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 19
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 19
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 19
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 19
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 19
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 19
- 108010068265 aspartyltyrosine Proteins 0.000 description 19
- 108010087823 glycyltyrosine Proteins 0.000 description 19
- 108010091871 leucylmethionine Proteins 0.000 description 19
- 108010087432 terpene synthase Proteins 0.000 description 19
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 18
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 18
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 18
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 18
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 18
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 18
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 18
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 18
- WAEWODAAWLGLMK-OYDLWJJNSA-N Arg-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WAEWODAAWLGLMK-OYDLWJJNSA-N 0.000 description 18
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 18
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 18
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 18
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 18
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 18
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 18
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 18
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 18
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 18
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 18
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 18
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 18
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 18
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 18
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 18
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 18
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 18
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 18
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 18
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 18
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 18
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 18
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 18
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 18
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 18
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 18
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 18
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 18
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 18
- 241000880493 Leptailurus serval Species 0.000 description 18
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 18
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 18
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 18
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 18
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 18
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 18
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 18
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 18
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 18
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 18
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 18
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 18
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 18
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 18
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 18
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 18
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 18
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 18
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 18
- OOLVTRHJJBCJKB-IHRRRGAJSA-N Met-Tyr-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OOLVTRHJJBCJKB-IHRRRGAJSA-N 0.000 description 18
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 18
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 18
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 18
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 18
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 18
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 18
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 18
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 18
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 18
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 18
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 18
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 18
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 18
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 18
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 18
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 18
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 18
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 18
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 18
- DVAAUUVLDFKTAQ-VHWLVUOQSA-N Trp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DVAAUUVLDFKTAQ-VHWLVUOQSA-N 0.000 description 18
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 18
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 description 18
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 18
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 18
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 18
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 18
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 18
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 18
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 18
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 18
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 18
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 18
- 108010070944 alanylhistidine Proteins 0.000 description 18
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 18
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 18
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 18
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical compound CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 description 18
- 108010053062 lysyl-arginyl-phenylalanyl-lysine Proteins 0.000 description 18
- 230000004048 modification Effects 0.000 description 18
- 238000012986 modification Methods 0.000 description 18
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 17
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 17
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 17
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 17
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 17
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 17
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 17
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 17
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 17
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 17
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 17
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 17
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 17
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 17
- XMWHRVNVKDKBRG-CRCLSJGQSA-N [(2s,3r)-2,3,4-trihydroxy-3-methylbutyl] dihydrogen phosphate Chemical compound OC[C@](O)(C)[C@@H](O)COP(O)(O)=O XMWHRVNVKDKBRG-CRCLSJGQSA-N 0.000 description 17
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- XMWHRVNVKDKBRG-UHNVWZDZSA-N 2-C-Methyl-D-erythritol 4-phosphate Natural products OC[C@@](O)(C)[C@H](O)COP(O)(O)=O XMWHRVNVKDKBRG-UHNVWZDZSA-N 0.000 description 15
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 15
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 15
- 239000000543 intermediate Substances 0.000 description 15
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 14
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 14
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 14
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 14
- 230000014509 gene expression Effects 0.000 description 14
- CBIDRCWHNCKSTO-UHFFFAOYSA-N prenyl diphosphate Chemical compound CC(C)=CCO[P@](O)(=O)OP(O)(O)=O CBIDRCWHNCKSTO-UHFFFAOYSA-N 0.000 description 14
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 13
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 13
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- 240000006365 Vitis vinifera Species 0.000 description 12
- 235000014787 Vitis vinifera Nutrition 0.000 description 12
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 12
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 11
- 238000012217 deletion Methods 0.000 description 11
- 230000037430 deletion Effects 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- 238000003780 insertion Methods 0.000 description 11
- 230000037431 insertion Effects 0.000 description 11
- 238000007254 oxidation reaction Methods 0.000 description 11
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 11
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 10
- 235000009754 Vitis X bourquina Nutrition 0.000 description 10
- 235000012333 Vitis X labruscana Nutrition 0.000 description 10
- 230000009471 action Effects 0.000 description 10
- 108010054155 lysyllysine Proteins 0.000 description 10
- 230000003647 oxidation Effects 0.000 description 10
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 9
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 9
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 9
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 9
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 9
- 108010062796 arginyllysine Proteins 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 9
- 108010090894 prolylleucine Proteins 0.000 description 9
- 241000219195 Arabidopsis thaliana Species 0.000 description 8
- 241000193755 Bacillus cereus Species 0.000 description 8
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 8
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 7
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 7
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 7
- 244000228451 Stevia rebaudiana Species 0.000 description 7
- 235000006092 Stevia rebaudiana Nutrition 0.000 description 7
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 108010078274 isoleucylvaline Proteins 0.000 description 7
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 7
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 7
- 239000002243 precursor Substances 0.000 description 7
- 108010004914 prolylarginine Proteins 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 229930004725 sesquiterpene Natural products 0.000 description 7
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 6
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 6
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 6
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 6
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 6
- 235000001405 Artemisia annua Nutrition 0.000 description 6
- 240000000011 Artemisia annua Species 0.000 description 6
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 6
- 241000196324 Embryophyta Species 0.000 description 6
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 6
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 6
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 6
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 6
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 6
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 6
- 244000203593 Piper nigrum Species 0.000 description 6
- 235000008184 Piper nigrum Nutrition 0.000 description 6
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 6
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 230000004907 flux Effects 0.000 description 6
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 6
- -1 sesquiterpene olefin Chemical class 0.000 description 6
- 108010051110 tyrosyl-lysine Proteins 0.000 description 6
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 5
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 5
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 5
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 5
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 5
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 5
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 5
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 5
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 5
- 102000056950 Gs GTP-Binding Protein alpha Subunits Human genes 0.000 description 5
- 108091006065 Gs proteins Proteins 0.000 description 5
- 244000020551 Helianthus annuus Species 0.000 description 5
- 235000003222 Helianthus annuus Nutrition 0.000 description 5
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 5
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 5
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 5
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 5
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 5
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 5
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 5
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 5
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 5
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 5
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 5
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 5
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 5
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 5
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 5
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 5
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 5
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 5
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 5
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 5
- 108010054813 diprotin B Proteins 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 150000004354 sesquiterpene derivatives Chemical class 0.000 description 5
- 150000003505 terpenes Chemical class 0.000 description 5
- YHOPXCAOTRUGLV-XAMCCFCMSA-N Ala-Ala-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YHOPXCAOTRUGLV-XAMCCFCMSA-N 0.000 description 4
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 4
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 4
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 4
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 4
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 4
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 4
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 4
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 4
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 4
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 4
- 241000743776 Brachypodium distachyon Species 0.000 description 4
- 240000002319 Citrus sinensis Species 0.000 description 4
- 235000005976 Citrus sinensis Nutrition 0.000 description 4
- 102000057412 Diphosphomevalonate decarboxylases Human genes 0.000 description 4
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 4
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 4
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 4
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 4
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 4
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 4
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 4
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 4
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 4
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 4
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- 101000860349 Helianthus annuus Germacrene A hydroxylase Proteins 0.000 description 4
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 4
- 108090000895 Hydroxymethylglutaryl CoA Reductases Proteins 0.000 description 4
- 102000004286 Hydroxymethylglutaryl CoA Reductases Human genes 0.000 description 4
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 4
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 4
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 4
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 4
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 4
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 4
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 4
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 4
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 4
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 4
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 4
- 108700040132 Mevalonate kinases Proteins 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 101000958834 Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) Diphosphomevalonate decarboxylase mvd1 Proteins 0.000 description 4
- 101000958925 Panax ginseng Diphosphomevalonate decarboxylase 1 Proteins 0.000 description 4
- 244000270673 Pelargonium graveolens Species 0.000 description 4
- 235000017927 Pelargonium graveolens Nutrition 0.000 description 4
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 4
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 4
- 102100024279 Phosphomevalonate kinase Human genes 0.000 description 4
- 240000002505 Pogostemon cablin Species 0.000 description 4
- 235000011751 Pogostemon cablin Nutrition 0.000 description 4
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 4
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 4
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 4
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 4
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 4
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 4
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 4
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 4
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 4
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 4
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 4
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 4
- FBGDDUKYOBNZJL-WDSOQIARSA-N Trp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FBGDDUKYOBNZJL-WDSOQIARSA-N 0.000 description 4
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 4
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 4
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 4
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 4
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 4
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 4
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 4
- 235000013614 black pepper Nutrition 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- YHAJBLWYOIUHHM-GUTXKFCHSA-N delta-guaiene Chemical compound C1C[C@@H](C(C)=C)C[C@H]2[C@@H](C)CCC2=C1C YHAJBLWYOIUHHM-GUTXKFCHSA-N 0.000 description 4
- 108010067758 ent-kaurene oxidase Proteins 0.000 description 4
- 230000009483 enzymatic pathway Effects 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 108010027338 isoleucylcysteine Proteins 0.000 description 4
- 101150018742 ispF gene Proteins 0.000 description 4
- 150000002576 ketones Chemical class 0.000 description 4
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 102000002678 mevalonate kinase Human genes 0.000 description 4
- 108091000116 phosphomevalonate kinase Proteins 0.000 description 4
- 108010077112 prolyl-proline Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- MDSIZRKJVDMQOQ-GORDUTHDSA-K (2E)-4-hydroxy-3-methylbut-2-enyl diphosphate(3-) Chemical compound OCC(/C)=C/COP([O-])(=O)OP([O-])([O-])=O MDSIZRKJVDMQOQ-GORDUTHDSA-K 0.000 description 3
- PAHHYDSPOXDASW-VGWMRTNUSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-3-hydroxypropanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO PAHHYDSPOXDASW-VGWMRTNUSA-N 0.000 description 3
- CABVTRNMFUVUDM-VRHQGPGLSA-N (3S)-3-hydroxy-3-methylglutaryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C[C@@](O)(CC(O)=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CABVTRNMFUVUDM-VRHQGPGLSA-N 0.000 description 3
- 108010006229 Acetyl-CoA C-acetyltransferase Proteins 0.000 description 3
- 102100037768 Acetyl-CoA acetyltransferase, mitochondrial Human genes 0.000 description 3
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 3
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 3
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 3
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 3
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 3
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 3
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 3
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 3
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 3
- 241000271307 Aquilaria malaccensis Species 0.000 description 3
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 3
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 3
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 3
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 3
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 3
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 3
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 3
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 3
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 3
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 3
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 3
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 3
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 3
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 3
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 3
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 3
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 3
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 3
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 3
- 101100152417 Bacillus spizizenii (strain ATCC 23059 / NRRL B-14472 / W23) tarI gene Proteins 0.000 description 3
- 101100397224 Bacillus subtilis (strain 168) isp gene Proteins 0.000 description 3
- 101100268668 Caenorhabditis elegans acc-2 gene Proteins 0.000 description 3
- 101100268670 Caenorhabditis elegans acc-3 gene Proteins 0.000 description 3
- 241000222511 Coprinus Species 0.000 description 3
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 3
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 3
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 3
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 3
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 3
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 3
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 3
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 3
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 3
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 3
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 3
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 3
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 3
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 3
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 3
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 3
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 3
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 3
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 3
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 3
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 3
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 3
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- 101100509110 Leifsonia xyli subsp. xyli (strain CTCB07) ispDF gene Proteins 0.000 description 3
- 241000222418 Lentinus Species 0.000 description 3
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 3
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 3
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 3
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 3
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 3
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 3
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 3
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 3
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 3
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 3
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 3
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 3
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 3
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 3
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 3
- KDBDVESGGJYVEH-PMVMPFDFSA-N Lys-Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)CCCCN)C(O)=O)C1=CC=CC=C1 KDBDVESGGJYVEH-PMVMPFDFSA-N 0.000 description 3
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 3
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 3
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 3
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 3
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 3
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 3
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 3
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 3
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 3
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 3
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 3
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 3
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 3
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 3
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 3
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 3
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 3
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 3
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 3
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 3
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 3
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- 101100052502 Shigella flexneri yciB gene Proteins 0.000 description 3
- 101100278777 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) dxs1 gene Proteins 0.000 description 3
- 101100126492 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) ispG1 gene Proteins 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 3
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 3
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 3
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 3
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 3
- XXJDYWYVZBHELV-TUSQITKMSA-N Trp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCCCN)C(=O)O)N XXJDYWYVZBHELV-TUSQITKMSA-N 0.000 description 3
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 3
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 3
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 3
- 108010064997 VPY tripeptide Proteins 0.000 description 3
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 3
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 3
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 3
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 3
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 3
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 230000004186 co-expression Effects 0.000 description 3
- 101150118992 dxr gene Proteins 0.000 description 3
- 101150056470 dxs gene Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 101150064873 ispA gene Proteins 0.000 description 3
- 101150014059 ispD gene Proteins 0.000 description 3
- 101150022203 ispDF gene Proteins 0.000 description 3
- 101150068863 ispE gene Proteins 0.000 description 3
- 101150081094 ispG gene Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 235000019515 salmon Nutrition 0.000 description 3
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 3
- 241000894007 species Species 0.000 description 3
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- OKZYCXHTTZZYSK-ZCFIWIBFSA-N (R)-5-phosphomevalonic acid Chemical compound OC(=O)C[C@@](O)(C)CCOP(O)(O)=O OKZYCXHTTZZYSK-ZCFIWIBFSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 2
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 2
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 2
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 2
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 2
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 2
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 2
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 2
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 2
- LWXJVHTUEDHDLG-XUXIUFHCSA-N Asn-Leu-Leu-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LWXJVHTUEDHDLG-XUXIUFHCSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 2
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241001465180 Botrytis Species 0.000 description 2
- 101100180240 Burkholderia pseudomallei (strain K96243) ispH2 gene Proteins 0.000 description 2
- 101100268671 Caenorhabditis elegans acc-4 gene Proteins 0.000 description 2
- 241000759905 Camptotheca acuminata Species 0.000 description 2
- KLWPJMFMVPTNCC-UHFFFAOYSA-N Camptothecin Natural products CCC1(O)C(=O)OCC2=C1C=C3C4Nc5ccccc5C=C4CN3C2=O KLWPJMFMVPTNCC-UHFFFAOYSA-N 0.000 description 2
- 235000002566 Capsicum Nutrition 0.000 description 2
- 241000222356 Coriolus Species 0.000 description 2
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 2
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 2
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 2
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 2
- VCPHQVQGVSKDHY-FXQIFTODSA-N Cys-Ser-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O VCPHQVQGVSKDHY-FXQIFTODSA-N 0.000 description 2
- XWTGTTNUCCEFJI-UBHSHLNASA-N Cys-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N XWTGTTNUCCEFJI-UBHSHLNASA-N 0.000 description 2
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 2
- 101710088194 Dehydrogenase Proteins 0.000 description 2
- 101100286286 Dictyostelium discoideum ipi gene Proteins 0.000 description 2
- 241000123326 Fomes Species 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 2
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 2
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 2
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 2
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 2
- 101150056978 HMGS gene Proteins 0.000 description 2
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 2
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 2
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 2
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 2
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 2
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 2
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 2
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 2
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 2
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 2
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 2
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 2
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 2
- 108010000775 Hydroxymethylglutaryl-CoA synthase Proteins 0.000 description 2
- 102100028888 Hydroxymethylglutaryl-CoA synthase, cytoplasmic Human genes 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 2
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 2
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 2
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 2
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 2
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 2
- RRHGJUQNOFWUDK-UHFFFAOYSA-N Isoprene Chemical compound CC(=C)C=C RRHGJUQNOFWUDK-UHFFFAOYSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- 240000008415 Lactuca sativa Species 0.000 description 2
- 235000003228 Lactuca sativa Nutrition 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 2
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 2
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 2
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 2
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 2
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 2
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 2
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 2
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 2
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 2
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 241000221960 Neurospora Species 0.000 description 2
- 241000221961 Neurospora crassa Species 0.000 description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 2
- 101150053185 P450 gene Proteins 0.000 description 2
- 239000006002 Pepper Substances 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 2
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 2
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 2
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- 235000016761 Piper aduncum Nutrition 0.000 description 2
- 235000017804 Piper guineense Nutrition 0.000 description 2
- 241000222350 Pleurotus Species 0.000 description 2
- 241000221945 Podospora Species 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 2
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 2
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- 108091007187 Reductases Proteins 0.000 description 2
- 241001361634 Rhizoctonia Species 0.000 description 2
- 241000813090 Rhizoctonia solani Species 0.000 description 2
- 241000187561 Rhodococcus erythropolis Species 0.000 description 2
- 101100011891 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ERG13 gene Proteins 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 2
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 2
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- FRQRWAMUESPWMT-HSHDSVGOSA-N Thr-Trp-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N)O FRQRWAMUESPWMT-HSHDSVGOSA-N 0.000 description 2
- 241000222354 Trametes Species 0.000 description 2
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 2
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 2
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 2
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 2
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 2
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 2
- IEESWNWYUOETOT-BVSLBCMMSA-N Trp-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccccc1)C(O)=O IEESWNWYUOETOT-BVSLBCMMSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- NKMFRGPKTIEXSK-ULQDDVLXSA-N Tyr-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NKMFRGPKTIEXSK-ULQDDVLXSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 2
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 2
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 241000607365 Vibrio natriegens Species 0.000 description 2
- 241000219094 Vitaceae Species 0.000 description 2
- OJFDKHTZOUZBOS-CITAKDKDSA-N acetoacetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 OJFDKHTZOUZBOS-CITAKDKDSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- VSJKWCGYPAHWDS-FQEVSTJZSA-N camptothecin Chemical compound C1=CC=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)[C@]5(O)CC)C4=NC2=C1 VSJKWCGYPAHWDS-FQEVSTJZSA-N 0.000 description 2
- 229940127093 camptothecin Drugs 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 2
- 108010044613 cytochrome P-450 CYP152A1 (Bacillus subtilis) Proteins 0.000 description 2
- YHAJBLWYOIUHHM-UHFFFAOYSA-N delta-guaiene Natural products C1CC(C(C)=C)CC2C(C)CCC2=C1C YHAJBLWYOIUHHM-UHFFFAOYSA-N 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- VSJKWCGYPAHWDS-UHFFFAOYSA-N dl-camptothecin Natural products C1=CC=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)C5(O)CC)C4=NC2=C1 VSJKWCGYPAHWDS-UHFFFAOYSA-N 0.000 description 2
- 101150014423 fni gene Proteins 0.000 description 2
- 229930001612 germacrene Natural products 0.000 description 2
- YDLBHMSVYMFOMI-SDFJSLCBSA-N germacrene Chemical compound CC(C)[C@H]1CC\C(C)=C\CC\C(C)=C\C1 YDLBHMSVYMFOMI-SDFJSLCBSA-N 0.000 description 2
- 108010069695 germacrene D synthase Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 235000002532 grape seed extract Nutrition 0.000 description 2
- 235000021021 grapes Nutrition 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 101150075592 idi gene Proteins 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 101150017044 ispH gene Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- JRZJOMJEPLMPRA-UHFFFAOYSA-N olefin Natural products CCCCCCCC=C JRZJOMJEPLMPRA-UHFFFAOYSA-N 0.000 description 2
- 239000012074 organic phase Substances 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 239000001931 piper nigrum l. white Substances 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 230000027756 respiratory electron transport chain Effects 0.000 description 2
- 238000007363 ring formation reaction Methods 0.000 description 2
- 238000000638 solvent extraction Methods 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 2
- XMRKUJJDDKYUHV-HNNXBMFYSA-N (1E,4E,7betaH)-germacra-1(10),4,11(12)-triene Chemical compound CC(=C)[C@H]1CCC(C)=CCCC(C)=CC1 XMRKUJJDDKYUHV-HNNXBMFYSA-N 0.000 description 1
- LIWOHUSRWUWRSX-ZJZGAYNASA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-methylbutanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LIWOHUSRWUWRSX-ZJZGAYNASA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 1
- AJPADPZSRRUGHI-RFZPGFLSSA-N 1-deoxy-D-xylulose 5-phosphate Chemical compound CC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O AJPADPZSRRUGHI-RFZPGFLSSA-N 0.000 description 1
- 108010068049 1-deoxy-D-xylulose 5-phosphate reductoisomerase Proteins 0.000 description 1
- ONVABDHFQKWOSV-UHFFFAOYSA-N 16-Phyllocladene Natural products C1CC(C2)C(=C)CC32CCC2C(C)(C)CCCC2(C)C31 ONVABDHFQKWOSV-UHFFFAOYSA-N 0.000 description 1
- 108030005203 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthases Proteins 0.000 description 1
- DQPMXYDFWRYWQV-UHFFFAOYSA-N 2-[[6-amino-2-[[2-[(2-amino-3-methylbutanoyl)amino]-3-hydroxybutanoyl]amino]hexanoyl]amino]acetic acid Chemical compound CC(C)C(N)C(=O)NC(C(C)O)C(=O)NC(CCCCN)C(=O)NCC(O)=O DQPMXYDFWRYWQV-UHFFFAOYSA-N 0.000 description 1
- 101710139854 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (ferredoxin) Proteins 0.000 description 1
- 101710088071 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (ferredoxin), chloroplastic Proteins 0.000 description 1
- 101710086072 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin) Proteins 0.000 description 1
- WEZDOYDDKIHCLM-UHFFFAOYSA-N Agarospirene Natural products CC1CCC=C(C)C11CC(C(C)=C)CC1 WEZDOYDDKIHCLM-UHFFFAOYSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 241000271309 Aquilaria crassna Species 0.000 description 1
- 101000998379 Arabidopsis thaliana NADPH-cytochrome P450 reductase 1 Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- IJPNNYWHXGADJG-GUBZILKMSA-N Arg-Ala-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O IJPNNYWHXGADJG-GUBZILKMSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- TZBJAXGYGSIUHQ-XUXIUFHCSA-N Asp-Leu-Leu-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O TZBJAXGYGSIUHQ-XUXIUFHCSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- XFQOQUWGVCVYON-DCAQKATOSA-N Asp-Met-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XFQOQUWGVCVYON-DCAQKATOSA-N 0.000 description 1
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 101100168476 Bacillus megaterium (strain ATCC 14581 / DSM 32 / JCM 2506 / NBRC 15308 / NCIMB 9376 / NCTC 10342 / NRRL B-14308 / VKM B-512) cyp106 gene Proteins 0.000 description 1
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 1
- 101100297770 Bacillus subtilis (strain 168) pksS gene Proteins 0.000 description 1
- 101100052471 Bacillus subtilis (strain 168) ycgG gene Proteins 0.000 description 1
- AEDUNTARNYTECW-NNONSXFYSA-N C\C\1=C\CC(C)(C)\C=C\C\C(\C)=C/CC1.C\C\1=C\CC(C)(C)\C=C\C\C(\C)=C/CC1 Chemical compound C\C\1=C\CC(C)(C)\C=C\C\C(\C)=C/CC1.C\C\1=C\CC(C)(C)\C=C\C\C(\C)=C/CC1 AEDUNTARNYTECW-NNONSXFYSA-N 0.000 description 1
- 101100268665 Caenorhabditis elegans acc-1 gene Proteins 0.000 description 1
- 241001239379 Calophysus macropterus Species 0.000 description 1
- 241000905957 Channa melasoma Species 0.000 description 1
- 241000123346 Chrysosporium Species 0.000 description 1
- 102000003780 Clusterin Human genes 0.000 description 1
- 108090000197 Clusterin Proteins 0.000 description 1
- ACTIUHUUMQJHFO-UHFFFAOYSA-N Coenzym Q10 Natural products COC1=C(OC)C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UHFFFAOYSA-N 0.000 description 1
- 241000186249 Corynebacterium sp. Species 0.000 description 1
- 241000415588 Costus plicatus Species 0.000 description 1
- 241000208947 Cynara Species 0.000 description 1
- 235000003198 Cynara Nutrition 0.000 description 1
- 244000019459 Cynara cardunculus Species 0.000 description 1
- 235000019106 Cynara scolymus Nutrition 0.000 description 1
- 241001044073 Cypa Species 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- XELISBQUZZAPQK-CIUDSAMLSA-N Cys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N XELISBQUZZAPQK-CIUDSAMLSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 1
- AFYGNOJUTMXQIG-FXQIFTODSA-N Cys-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N AFYGNOJUTMXQIG-FXQIFTODSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- ATFSDBMHRCDLBV-BPUTZDHNSA-N Cys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N ATFSDBMHRCDLBV-BPUTZDHNSA-N 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- YTMBNLHIDIKJIU-HCXYKTFWSA-N D-Arginyl-L-arginyl-D-glutaminyl-L-phenylalanine Chemical compound NC(=N)NCCC[C@@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](CCC(O)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YTMBNLHIDIKJIU-HCXYKTFWSA-N 0.000 description 1
- LXJXRIRHZLFYRP-VKHMYHEASA-N D-glyceraldehyde 3-phosphate Chemical compound O=C[C@H](O)COP(O)(O)=O LXJXRIRHZLFYRP-VKHMYHEASA-N 0.000 description 1
- 102100031515 D-ribitol-5-phosphate cytidylyltransferase Human genes 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 101100082612 Escherichia coli (strain K12) pdeG gene Proteins 0.000 description 1
- 241000488157 Escherichia sp. Species 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000016761 Haem oxygenases Human genes 0.000 description 1
- 108050006318 Haem oxygenases Proteins 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 1
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 1
- LDTJBEOANMQRJE-CIUDSAMLSA-N His-Cys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LDTJBEOANMQRJE-CIUDSAMLSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- UMBKDWGQESDCTO-KKUMJFAQSA-N His-Lys-Lys Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O UMBKDWGQESDCTO-KKUMJFAQSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- WPUAVVXYEJAWIV-KKUMJFAQSA-N His-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WPUAVVXYEJAWIV-KKUMJFAQSA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- HBGKOLSGLYMWSW-DCAQKATOSA-N His-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CS)C(=O)O HBGKOLSGLYMWSW-DCAQKATOSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 1
- UWNUQPZUSRFIIN-JUKXBJQTSA-N His-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N UWNUQPZUSRFIIN-JUKXBJQTSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- 101000994204 Homo sapiens D-ribitol-5-phosphate cytidylyltransferase Proteins 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- DQUHDYWUEKWRLN-UHFFFAOYSA-N Isophyllocladen Natural products C1CC2C3(C)CCCC(C)(C)C3CCC22C=C(C)C1C2 DQUHDYWUEKWRLN-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 240000005183 Lantana involucrata Species 0.000 description 1
- 235000013628 Lantana involucrata Nutrition 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- GSSMYQHXZNERFX-WDSOQIARSA-N Leu-Met-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N GSSMYQHXZNERFX-WDSOQIARSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000220225 Malus Species 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- KLFPZIUIXZNEKY-DCAQKATOSA-N Met-Gln-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O KLFPZIUIXZNEKY-DCAQKATOSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- SJLPOVNXMJFKHJ-ULQDDVLXSA-N Met-Phe-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N SJLPOVNXMJFKHJ-ULQDDVLXSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- GRKPXCKLOOUDFG-UFYCRDLUSA-N Met-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 GRKPXCKLOOUDFG-UFYCRDLUSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 235000006677 Monarda citriodora ssp. austromontana Nutrition 0.000 description 1
- 241000226677 Myceliophthora Species 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 241000208125 Nicotiana Species 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 241000579280 Nicotiana tomentosa Species 0.000 description 1
- 235000010676 Ocimum basilicum Nutrition 0.000 description 1
- 240000007926 Ocimum gratissimum Species 0.000 description 1
- 235000011203 Origanum Nutrition 0.000 description 1
- 240000000783 Origanum majorana Species 0.000 description 1
- 101710093888 Pentalenene synthase Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 1
- FGXIJNMDRCZVDE-KKUMJFAQSA-N Phe-Cys-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N FGXIJNMDRCZVDE-KKUMJFAQSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- 241001503951 Phoma Species 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241001495452 Podophyllum Species 0.000 description 1
- 241000222640 Polyporus Species 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 241000235527 Rhizopus Species 0.000 description 1
- 241000191021 Rhodobacter sp. Species 0.000 description 1
- 244000178231 Rosmarinus officinalis Species 0.000 description 1
- NUWMTBMCSQWPDG-SDDRHHMPSA-N Rotundone Chemical compound C1([C@H](CC[C@H](C2)C(C)=C)C)=C2[C@@H](C)CC1=O NUWMTBMCSQWPDG-SDDRHHMPSA-N 0.000 description 1
- NUWMTBMCSQWPDG-UHFFFAOYSA-N Rotundone Natural products C1C(C(C)=C)CCC(C)C2=C1C(C)CC2=O NUWMTBMCSQWPDG-UHFFFAOYSA-N 0.000 description 1
- 235000018519 Scolymus Nutrition 0.000 description 1
- 241000189131 Scolymus Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- DGPGKMKUNGKHPK-QEJZJMRPSA-N Ser-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGPGKMKUNGKHPK-QEJZJMRPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 101710115850 Sesquiterpene synthase Proteins 0.000 description 1
- 241001313536 Thermothelomyces thermophila Species 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 235000007303 Thymus vulgaris Nutrition 0.000 description 1
- 240000002657 Thymus vulgaris Species 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- HGEHWFGAKHSIDY-SRVKXCTJSA-N Tyr-Asp-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O HGEHWFGAKHSIDY-SRVKXCTJSA-N 0.000 description 1
- MOCXXGZHHSPNEJ-AVGNSLFASA-N Tyr-Cys-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O MOCXXGZHHSPNEJ-AVGNSLFASA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- OSMTVLSRTQDWHJ-JBACZVJFSA-N Tyr-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 OSMTVLSRTQDWHJ-JBACZVJFSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- VUVVMFSDLYKHPA-PMVMPFDFSA-N Tyr-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC3=CC=C(C=C3)O)N VUVVMFSDLYKHPA-PMVMPFDFSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- RHYOAUJXSRWVJT-GVXVVHGQSA-N Val-His-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RHYOAUJXSRWVJT-GVXVVHGQSA-N 0.000 description 1
- CHWRZUGUMAMTFC-IHRRRGAJSA-N Val-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CNC=N1 CHWRZUGUMAMTFC-IHRRRGAJSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- LZDNBBYBDGBADK-KBPBESRZSA-N Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-KBPBESRZSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 241000607284 Vibrio sp. Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 101100111939 Zea mays CYP71Z18 gene Proteins 0.000 description 1
- 240000000451 Zingiber zerumbet Species 0.000 description 1
- 235000014687 Zingiber zerumbet Nutrition 0.000 description 1
- 101100004915 Zingiber zerumbet CYP71BA1 gene Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 108010045649 agarase Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 239000012431 aqueous reaction media Substances 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 235000016520 artichoke thistle Nutrition 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 150000004334 bicyclic sesquiterpene derivatives Chemical class 0.000 description 1
- 230000036983 biotransformation Effects 0.000 description 1
- 239000007833 carbon precursor Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 210000004671 cell-free system Anatomy 0.000 description 1
- ACTIUHUUMQJHFO-UPTCCGCDSA-N coenzyme Q10 Chemical compound COC1=C(OC)C(=O)C(C\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UPTCCGCDSA-N 0.000 description 1
- 235000017471 coenzyme Q10 Nutrition 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 101150011645 cypC gene Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000006356 dehydrogenation reaction Methods 0.000 description 1
- 108010060155 deoxyxylulose-5-phosphate synthase Proteins 0.000 description 1
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 229930004069 diterpene Natural products 0.000 description 1
- 101150016796 djlA gene Proteins 0.000 description 1
- 229930000747 ent-isokaurene Natural products 0.000 description 1
- DQUHDYWUEKWRLN-HPUSYDDDSA-N ent-isokaurene Chemical compound C([C@@]1(C)[C@@H]2CC3)CCC(C)(C)[C@H]1CC[C@]21C=C(C)[C@H]3C1 DQUHDYWUEKWRLN-HPUSYDDDSA-N 0.000 description 1
- ONVABDHFQKWOSV-HPUSYDDDSA-N ent-kaur-16-ene Chemical compound C1C[C@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-HPUSYDDDSA-N 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000004508 fractional distillation Methods 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- IBJVPIJUFFVDBS-UHFFFAOYSA-N germacrene A Natural products CC1=CCC(C(=C)C(O)=O)CCC(C)=CCC1 IBJVPIJUFFVDBS-UHFFFAOYSA-N 0.000 description 1
- 102000018146 globin Human genes 0.000 description 1
- 108060003196 globin Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 235000015143 herbs and spices Nutrition 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 101150070011 lpxK gene Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 229930003658 monoterpene Natural products 0.000 description 1
- 235000002577 monoterpenes Nutrition 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 101150050698 nlpI gene Proteins 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 101150077351 pgaC gene Proteins 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010030237 phenylalanyl-arginyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 230000000865 phosphorylative effect Effects 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- YJGVMLPVUAXIQN-XVVDYKMHSA-N podophyllotoxin Chemical compound COC1=C(OC)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@H](O)[C@@H]3[C@@H]2C(OC3)=O)=C1 YJGVMLPVUAXIQN-XVVDYKMHSA-N 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229930000013 premnaspirodiene Natural products 0.000 description 1
- WEZDOYDDKIHCLM-RBSFLKMASA-N premnaspirodiene Chemical compound C[C@@H]1CCC=C(C)[C@@]11C[C@H](C(C)=C)CC1 WEZDOYDDKIHCLM-RBSFLKMASA-N 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- NPCOQXAVBJJZBQ-UHFFFAOYSA-N reduced coenzyme Q9 Natural products COC1=C(O)C(C)=C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)C(O)=C1OC NPCOQXAVBJJZBQ-UHFFFAOYSA-N 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 101150011963 sohB gene Proteins 0.000 description 1
- 235000013599 spices Nutrition 0.000 description 1
- 235000007586 terpenes Nutrition 0.000 description 1
- 239000001585 thymus vulgaris Substances 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 229940035936 ubiquinone Drugs 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010012050 valyl-aspartyl-prolyl-proline Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 101150040194 waaA gene Proteins 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 101150064056 ygdD gene Proteins 0.000 description 1
- 101150093426 yhcB gene Proteins 0.000 description 1
- 101150002761 ypfN gene Proteins 0.000 description 1
- 101150096853 zipA gene Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y101/00—Oxidoreductases acting on the CH-OH group of donors (1.1)
- C12Y101/01—Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
- C12Y101/01001—Alcohol dehydrogenase (1.1.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y205/00—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5)
- C12Y205/01—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5) transferring alkyl or aryl groups, other than methyl groups (2.5.1)
- C12Y205/0101—(2E,6E)-Farnesyl diphosphate synthase (2.5.1.10), i.e. geranyltranstransferase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y402/00—Carbon-oxygen lyases (4.2)
- C12Y402/03—Carbon-oxygen lyases (4.2) acting on phosphates (4.2.3)
- C12Y402/03087—Alpha-guaiene synthase (4.2.3.87)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本公开提供用于产生莎草奥酮的方法和组合物。在各个方面,本公开提供用于产生莎草奥酮的酶、编码所述酶的多核苷酸和重组微生物宿主细胞(或微生物宿主菌株)。在一些实施方案中,本公开提供微生物宿主细胞,用于从a‑愈创木烯的酶促转化或从糖或其他碳源以高纯度和/或高产率产生莎草奥酮。本公开还提供制备含有莎草奥酮的产品(包括香精或香料产品等)的方法。
Description
相关申请的交叉引用
本申请要求2018年9月6日提交的美国临时申请号62/727,815的权益,所述申请在此以引用的方式整体并入。
以电子方式提交的文本文件的说明
随此以电子方式提交的文本文件的内容以引用的方式整体并入本文。序列表的计算机可读格式副本(文件名:MAN-019PC Sequence Listing_ST25;记录日期:2019年9月6日;文件大小:235,920字节)。
背景技术
莎草奥酮(Rotundone)是一种氧化的倍半萜烯(类倍半萜烯),其在各种植物中负责令人愉悦的辛辣“胡椒”香气,包括葡萄(尤其是西拉(syrah)或设拉子(shiraz)、穆尔韦德(mourvèdre)、杜瑞夫(durif)、维斯琳娜(vespolina)和绿维特利纳(grüner veltliner)品种),和许多草药和香辛料,例如像黑胡椒和白胡椒、牛至、罗勒、百里香、马郁兰和迷迭香。鉴于其香气,莎草奥酮是应用于香料和香精的引人注目的分子。
α-愈创木烯(α-Guaiene)是(-)-莎草奥酮的前体。α-愈创木烯是存在于各种植物的油提取物中的倍半萜烯烃,并且通过空气氧化或酶促转化而转化为(-)-莎草奥酮。
鉴于莎草奥酮的商业价值,需要具有成本效益、可扩展和/或可持续的生产工艺。
发明内容
在各个方面,本公开提供用于产生莎草奥酮的方法和组合物。在各个方面,本公开提供用于产生莎草奥酮的酶、编码所述酶的多核苷酸和重组微生物宿主细胞(或微生物宿主菌株)。在一些实施方案中,本公开提供微生物宿主细胞,用于从α-愈创木烯的酶促转化或从糖或其他碳源以高纯度和/或高产率产生莎草奥酮。本公开还提供制备含有莎草奥酮的产品(包括香精或香料产品等)的方法。
在一些实施方案中,本公开提供一种微生物宿主细胞,其表达催化法呢基二磷酸(FPP)转化为莎草奥酮的酶途径,酶促途径包含α-愈创木烯萜烯合酶(αGTPS)和α-愈创木烯氧化酶(αGOX)。在这些实施方案中,微生物细胞可从任何合适的碳源合成莎草奥酮产物。在一些实施方案中,α-愈创木烯合酶的特异性使得能够在较少类萜副产物的情况下以高产率产生莎草奥酮。在一些实施方案中,αGOX产生莎草奥酮作为主要氧化产物。
在一些实施方案中,微生物宿主细胞进一步表达或过表达甲基赤藓糖醇磷酸(MEP)和/或甲羟戊酸(MVA)途径中的一种或多种酶,以催化葡萄糖或其他碳源转化为异戊烯基焦磷酸(IPP)和/或二甲基烯丙基焦磷酸(DMAPP)。在一些实施方案中,微生物宿主细胞进一步表达催化IPP和/或DMAPP转化为法呢基二磷酸(FPP)的酶,从而允许从糖或其他碳源(碳底物,诸如C1、C2、C3、C4、C5和/或C6碳底物)产生莎草奥酮。在一些实施方案中,宿主细胞是经工程改造以增加通过MEP途径的碳通量的细菌。
在一些实施方案中,微生物宿主细胞表达提供α-愈创木烯底物生物转化的α-愈创木烯氧化酶(αGOX),所述酶可为P450酶、非血红素铁加氧酶(NHIO)或漆酶。可将α-愈创木烯底物添加到全细胞或细胞提取物或纯化的酶中。在一些实施方案中,细胞进一步表达至少一种细胞色素P450还原酶,以支持全细胞生物转化过程中的P450酶活性。在一些实施方案中,αGOX产生莎草奥酮作为主要氧化产物。
在一些实施方案中,微生物宿主细胞进一步表达一种或多种醇脱氢酶(alcoholdehydrogenase)。在一些实施方案中,醇脱氢酶将由α-愈创木烯与αGOX反应产生的一种或多种醇中间体转化为莎草奥酮。
在一些实施方案中,微生物宿主细胞是原核的或真核的,并且可为细菌或酵母。
本发明的其他方面和实施方案从下面的详细公开内容来看将是显而易见的。
附图说明
图1示出了莎草奥酮的化学结构。
图2示出了用于产生莎草奥酮的生物合成途径。法呢基二磷酸通过α-愈创木烯萜烯合酶(αGTPS)转化为α-愈创木烯;并且α-愈创木烯通过α-愈创木烯氧化酶(αGOX)转化为(-)-莎草奥酮。
图3示出从α-愈创木烯产生(-)-莎草奥酮可直接从倍半萜烯前体进行,或可涉及产生一种或两种醇中间体,随后将中间体通过具有醇脱氢酶活性的酶转化为酮。
图4示出了示例性α-愈创木烯萜烯合酶(AcDGuaS3,SEQ ID NO:8)中氨基酸取代的筛选结果。测试了衍生物在大肠杆菌中产生α-愈创木烯的能力。图中示出了α-愈创木烯产量的提高倍数。
图5列出了对于AcDGuaS3中的几种氨基酸取代,在大肠杆菌中产生α-愈创木烯的改变的概况。列出了α-愈创木烯以总产物中百分比形式的提高倍数。
图6示出了在大肠杆菌中表达野生型AcDGuaS3(α-GS)和具有F406L突变的突变体AcDGuaS3(α-GS1)的α-愈创木烯的产生。
图7A和图7B示出了在表达α-GS1和示例性CYP450系统(工程改造的贝壳杉烯氧化酶;KOeng)的大肠杆菌中rotundol和莎草奥酮的产生。葡萄(Vitis vinifera)脱氢酶(VvDH)与α-GS1、作为α-愈创木烯氧化酶的KOeng和喜树(Camptotheca acuminata)细胞色素P450还原酶(CaCPR)一起的表达降低了rotundol的效价(图7A)并增加了莎草奥酮效价(图7B)。
图8A和图8B示出了从表达α-GS1、KOeng、CaCPR和VvDH的大肠杆菌菌株产生莎草奥酮的气相色谱/质谱(GC/MS)证实。图8A示出了GC/MS中莎草奥酮的丰度,并且图8B示出了莎草奥酮的气相色谱图。
图9示出了在表达α-GS1和替代的CYP450系统的大肠杆菌中rotundol和莎草奥酮的产生。α-GS1、被工程改造以在大肠杆菌中表达的向日葵(Helianthus annuus)大根香叶烯A单加氧酶(HaGAO)和AaCPR(黄花蒿(Artemisia annua)细胞色素P450还原酶)的表达主要产生莎草奥酮。
图10示出了包括用作α-愈创木烯氧化酶的KOeng(SEQ ID NO:51)的五种贝壳杉烯加氧酶的多序列比对。同源物是HaKO(SEQ ID NO:50)、AaKO(SEQ ID NO:49)、CcKO(SEQ IDNO:47)、LsKO(SEQ ID NO:46)和NtKO(SEQ ID NO:45)。
具体实施方式
在各个方面,本公开提供了用于产生莎草奥酮的微生物宿主细胞(或微生物宿主菌株)和方法以及制备含有莎草奥酮的产品(诸如香精和香料产品等)的方法。在其他方面,本发明提供了用于产生莎草奥酮的酶和编码酶的多核苷酸。
在一些实施方案中,本公开提供一种微生物宿主细胞,包括细菌和酵母,其表达催化法呢基二磷酸(FPP)转化为莎草奥酮的酶途径。在各种实施方案中,酶促途径包含α-愈创木烯合酶(αGTPS)和α-愈创木烯氧化酶(αGOX)。在这些实施方案中,微生物细胞可从任何合适的碳源合成莎草奥酮产物。在一些实施方案中,α-愈创木烯合酶的特异性使得能够在较少类萜副产物的情况下以高产率产生莎草奥酮。在一些实施方案中,微生物宿主细胞可进一步表达一种或多种醇脱氢酶(ADH)。在一些实施方案中,ADH将由α-愈创木烯与αGOX反应产生的一种或多种醇中间体转化为莎草奥酮。
在一些实施方案中,微生物宿主细胞表达提供α-愈创木烯底物生物转化的α-愈创木烯氧化酶(αGOX)。在一些实施方案中,αGOX是P450酶、非血红素铁加氧酶(NHIO)或漆酶。在一些实施方案中,细胞可进一步表达细胞色素P450还原酶以支持P450活性。在一些实施方案中,微生物宿主细胞可进一步表达一种或多种醇脱氢酶(ADH)。在一些实施方案中,ADH将由α-愈创木烯与αGOX反应产生的一种或多种醇中间体转化为莎草奥酮。
莎草奥酮包含在碳2位上具有单个酮基的愈创木烯碳骨架(参见图1)。α-愈创木烯是莎草奥酮的前体。α-愈创木烯是存在于多种植物的油提取物中的倍半萜烯烃。虽然α-愈创木烯可通过空气氧化或酶促转化而转化为莎草奥酮,但这些过程效率不高,部分原因是所用酶的特异性。
图2示出了莎草奥酮的生物合成途径。通过αGTPS萜烯合酶将C15倍半萜烯前体底物法呢基二磷酸(FPP)环化为α-愈创木烯。然后,通过αGOX酶将α-愈创木烯(即环化的FPP)氧化为莎草奥酮。导致形成莎草奥酮的α-愈创木烯中酮部分的产生可直接进行,或可替代地通过具有醇中间体的任一种立体化学(即,(2R)-rotundol或(2S)-rotundol)的醇中间体进行(参见图3)。
αGTPS酶是萜烯合酶(TPS)。TPS酶负责从两个异构的5-碳前体结构单元合成萜烯分子,从而生成5-碳异戊二烯、10-碳单萜烯、15-碳倍半萜烯和20-碳二萜烯。TPS酶的结构和功能描述于Chen等人,The Plant Journal,66:212-229(2011)中。已经描述了烟草5-表-马兜铃烯(5-epi-aristolochene)合酶(一种萜烯合酶),以及包括关键活性位点坐标的结构坐标。这些结构坐标可用于构建αGTPS酶的同源模型,所述同源模型可用于指导具有改善的特异性和生产率的αGTPS酶的工程改造。参见US 6,645,762、US 6,495,354和US 6,645,762,其在此以引用的方式整体并入。
在一些实施方案中,TPS酶选自葡萄GuaS(VvGuas)酶(SEQ ID NO:1)、来自广藿香(Pogostemon cablin)的广藿香(patchouli)合酶(PcPS)(Uniprot Q49SP3)(SEQ ID NO:2)、葡萄大根香叶烯D合酶(VvGDS;NCBI Ref#XP_002282488.1)(SEQ ID NO:21)或其变体。在一些实施方案中,TPS酶选自柯拉斯那沉香(Aquilaria crassna),例如,AcC1(UniprotD0VMR5);AcC2(Uniprot D0VMR6)(SEQ ID NO:3);AcC3(Uniprot D0VMR7)(SEQ ID NO:4);或AcC4(Uniprot D0VMR8)(SEQ ID NO:5)或其变体。在一些实施方案中,柯拉斯那沉香TPS是AcC1的突变体,例如AcC1mut1-M42(SEQ ID NO:6)和AcC1mut2-M50(SEQ ID NO:7)。其他TPS酶在本文中提供为SEQ ID NO:8(柯拉斯那沉香AcDGuaS3)、SEQ ID NO:9(柯拉斯那沉香AcDGuaS4)、SEQ ID NO:10(柯拉斯那沉香AcDGuaS2)、SEQ ID NO:11(柯拉斯那沉香AcDGuaS5)、SEQ ID NO:12(沉香属AmiDGuaS1)、SEQ ID NO:13(沉香属AmiDGuaS2)、SEQ IDNO:14(沉香属AmiDGuaS3)、SEQ ID NO:15(沉香属AmaDGuaS1)、SEQ ID NO:16(沉香属AmaDGuaS2)、SEQ ID NO:17(沉香属AsDGuaS1)、SEQ ID NO:18(沉香属AsDGuaS2)、SEQ IDNO:19(沉香属AsDGuaS3)和SEQ ID NO:20(沉香属AsDGuaS4)或其变体。
萜烯合酶变体包括包含与SEQ ID NO:1至21中的任一个具有50%或更高序列同一性的氨基酸序列的α-愈创木烯合酶。在一些实施方案中,变体包含与SEQ ID NO:1至21中的任一个的氨基酸序列具有至少约60%序列同一性、或至少约70%序列同一性、或至少约80%序列同一性、或至少约90%序列同一性、或至少约95%序列同一性、或至少约98%序列同一性或至少约99%序列同一性的氨基酸序列。在一些实施方案中,变体包括1个至约20个、或1个至约10个或1个至约5个氨基酸修饰,所述氨基酸修饰独立地选自对选自SEQ IDNO:1至21的氨基酸序列的取代、缺失和插入。在一些实施方案中,萜烯合酶包含对底物结合位点或活性位点中的一个或多个的取代。在一些实施方案中,可通过构建同源模型来了解对酶的修饰。在一些实施方案中,可选择氨基酸修饰以改善以下中的一项或多项:酶生产率、对所需底物和/或产物的选择性、稳定性、温度耐受性和表达。
在一些实施方案中,α-愈创木烯合酶包含与SEQ ID NO:1、3、4、6至10、11至15或19中的任一个具有至少50%序列同一性的氨基酸序列。在一些实施方案中,α-愈创木烯合酶包含与SEQ ID NO:1、3、4、6至10、11至15或19中的任一个的氨基酸序列具有至少约60%序列同一性、或至少约70%序列同一性、或至少约80%序列同一性、或至少约90%序列同一性、或至少约95%序列同一性、或至少约98%序列同一性或至少约99%序列同一性的氨基酸序列。在一些实施方案中,α-愈创木烯合酶包括1个至约20个、或1个至约10个或1个至约5个氨基酸修饰,所述氨基酸修饰独立地选自对选自SEQ ID NO:1、3、4、6至10、11至15或19的氨基酸序列的取代、缺失和插入。
在一些实施方案中,α-愈创木烯合酶包含与SEQ ID NO:8具有50%或更高序列同一性的氨基酸序列。在一些实施方案中,α-愈创木烯合酶包含与SEQ ID NO:8的氨基酸序列具有至少约60%序列同一性、或至少约70%序列同一性、或至少约80%序列同一性、或至少约90%序列同一性、或至少约95%序列同一性、或至少约98%序列同一性或至少约99%序列同一性的氨基酸序列。在一些实施方案中,α-愈创木烯合酶包括1个至约20个、或1个至约10个或1个至约5个氨基酸修饰,所述氨基酸修饰独立地选自对选自SEQ ID NO:8的氨基酸序列的取代、缺失和插入。
在一些实施方案中,α-愈创木烯合酶在与选自SEQ ID NO:8的以下的位置相对应的位置上可具有1个、2个、3个、4个、5个或更多个氨基酸取代:72、273、290、368、371、374、377、381、382、399、406、419、433、442、443、454、512和522。例如,在一些实施方案中,α-GTPS包含具有选自相对于SEQ ID NO:8的以下的氨基酸取代中的一个或多个(例如,2个、3个、4个、5个或所有)的氨基酸序列:T72I、M273L、R290K、F368M、I371L、S374A、R377V、Y381W、F382L、I399V、F406L、L419T、V433I、Y442L、I443M、E454K、F512L和K522D。在一些实施方案中,α-GTPS在对应于SEQ ID NO:8的位置406的位置处包括氨基酸取代,并且所述氨基酸取代任选地为F406L、F406A、F406I、F406V或F406G。在一些实施方案中,α-GTPS酶在对应于SEQID NO:8的位置443的位置处包括氨基酸取代,所述氨基酸取代任选地为I443M。在一些实施方案中,α-GTPS酶在对应于SEQ ID NO:8的位置512的位置处包括突变,所述突变任选地为F512L、F512A、F512I、F512V或F512G。
氨基酸取代可为保守的或非保守的取代。
“保守性取代”可例如基于所涉及的氨基酸残基在极性、电荷、大小、溶解度、疏水性、亲水性和/或两亲性质上的相似性来进行。20种天然存在的氨基酸可分为以下六个标准氨基酸组:
(1)疏水性的:Met、Ala、Val、Leu、Ile;
(2)中性亲水性的:Cys、Ser、Thr、Asn、Gln;
(3)酸性的:Asp、Glu;
(4)碱性的:His、Lys、Arg;
(5)影响链取向的残基:Gly、Pro;和
(6)芳族的:Trp、Tyr、Phe。
如本文所用,“保守性取代”定义为一个氨基酸被上面所示六个标准氨基酸组中的同一组中列出的另一个氨基酸交换。例如,Asp被Glu交换在这样修饰的多肽中保留一个负电荷。另外,基于甘氨酸和脯氨酸破坏a-螺旋的能力,它们可以彼此取代。以上六个组中的一些优选的保守性取代是以下亚组中的交换:(i)Ala、Val、Leu和Ile;(ii)Ser和Thr;(ii)Asn和Gln;(iv)Lys和Arg;以及(v)Tyr和Phe。
如本文所用,“非保守性取代”或“非保守性氨基酸交换”定义为一个氨基酸被上面所示六个标准氨基酸组(1)至(6)中的不同组中列出的另一个氨基酸交换。
α-GTPS酶的突变可通过使用公开于以下中的倍半萜烯合酶的分子结构/模型的同源模型来引导:Drew等人,J.of Exp.Botany,第67卷,第3期,第799-808页(2015)和/或Kumeta等人,Plant Physiology,第154卷,第1998-2007页(2010),其在此以引用的方式整体并入。
核苷酸和氨基酸序列的相似性,即序列同一性百分比,可以通过序列比对来测定。此类比对可以用几种本领域已知的算法,诸如用Karlin和Altschul的数学算法(Karlin和Altschul(1993)Proc.Natl.Acad.Sci.USA 90:5873-5877),用hmmalign(HMMER package,http://hmmer.wustl.edu/)或用CLUSTAL算法(Thompson,J.D.、Higgins,D.G.和Gibson,T.J.(1994)Nucleic Acids Res.22,4673-80)进行。可以使用例如BLAST、BLAT或BlastZ(或BlastX)来计算序列同一性(序列匹配)的等级。将类似算法并入到Altschul等人(1990)J.Mol.Biol.215:403-410的BLASTN和BLASTP程序中。BLAST蛋白质搜索可以用BLASTP程序执行,评分=50,字长=3。为了获得空位比对用于比较目的,如Altschul等人(1997)Nucleic Acids Res.25:3389-3402中所述利用空位BLAST。利用BLAST和空位BLAST程序时,使用各个程序的默认参数。可以通过已建立的同源性映射技术如Shuffle-LAGAN(BrudnoM.,Bioinformatics 2003b,19增刊1:154-162)或Markov随机场来补充序列匹配分析。
TPS酶可从FPP生成具有愈创木烯骨架的多种产物,其中通过不同的TPS酶产生不同数量的α-愈创木烯。在一些实施方案中,α-愈创木烯合酶(或工程改造的变体)主要从FPP底物产生α-愈创木烯(例如,大于50%)作为产物。在一些实施方案中,α-愈创木烯合酶从FPP产生大于约75%、或大于约80%、或大于约85%、或大于约90%或大于约95%的α-愈创木烯作为产物。可在产生FPP并表达α-愈创木烯合酶的宿主微生物细胞中确定酶的特异性,之后对总的类萜产物进行化学分析。在一些实施方案中,在αGTPS反应中产生的α-愈创木烯被氧化为莎草奥酮。在一些实施方案中,αGOX酶将α-愈创木烯氧化为莎草奥酮。在一些实施方案中,αGOX将至少一部分α-愈创木烯氧化为酮。在一些实施方案中,由α-GOX氧化α-愈创木烯导致产生一种或多种醇中间体。在一些实施方案中,通过一种或多种醇脱氢酶将醇中间体转化为莎草奥酮。
在一些实施方案中,αGOX酶为细胞色素P450(CYP450)酶。CYP450酶参与细胞内各种分子和化学物质的形成(合成)和分解(代谢)。CYP450酶已在所有生命王国(即动物、植物、真菌、原生生物、细菌、古细菌和甚至病毒)中被鉴定。CYP450酶的说明性结构和功能描述于Uracher等人,TRENDS in Biotechnology,24(7):324-330(2006)中。在一些实施方案中,已将P450酶工程改造以缺失野生型N-端跨膜区的全部或部分,并且添加了衍生自大肠杆菌内膜细胞质C端蛋白的跨膜结构域。在各种实施方案中,跨膜结构域是单次跨膜结构域。在各种实施方案中,跨膜结构域(或“N端锚”)衍生自选自以下中的大肠杆菌基因:waaA、ypfN、yhcB、yhbM、yhhm、zipA、ycgG、djlA、sohB、lpxK、F11O、motA、htpx、pgaC、ygdD、hemr和ycls。通过生物信息学预测以及实验验证,这些基因被鉴定为内膜细胞质C端蛋白。本发明可使用N端锚定序列,所述序列是大肠杆菌野生型跨膜结构域的衍生物,即,具有相对于野生型序列的一个或多个突变(例如,氨基酸取代)。制备此类工程改造的P450酶的方法以及工程改造的P450酶描述于美国专利公布号2018/0251738中,其在此以引用的方式整体并入。
在一些实施方案中,CYP450选自葡萄VvSTO2(CYP71BE5;Uniprot F6I534)(SEQ IDNO:22);枯草芽孢杆菌(Bacillus subtilis)CYP152A1(Uniprot O31440)(SEQ ID NO:23);枯草芽孢杆菌CYP107K1(Uniprot A5HNX5)(SEQ ID NO:24);蜡状芽孢杆菌(Bacilluscereus)CYP106(Uniprot Q737I9)(SEQ ID NO:25);和蜡状芽孢杆菌CYP107(UniprotQ737F2)(SEQ ID NO:26);或其变体。
αGOX变体包括酶,所述酶包含与SEQ ID NO:22至26中的任一个具有50%或更高序列同一性的氨基酸序列。在一些实施方案中,变体包含与SEQ ID NO:22至26中的任一个的氨基酸序列具有至少约60%序列同一性、或至少约70%序列同一性、或至少约80%序列同一性、或至少约90%序列同一性、或至少约95%序列同一性、或至少约98%序列同一性或至少约99%序列同一性的氨基酸序列。在一些实施方案中,变体包括1个至约20个、或1个至约10个或1个至约5个氨基酸修饰,所述氨基酸修饰独立地选自对选自SEQ ID NO:22至26的氨基酸序列的取代、缺失和插入。在一些实施方案中,加氧酶包含对底物结合位点或活性位点中的一个或多个的取代。在一些实施方案中,可通过构建同源模型来了解对酶的修饰。在一些实施方案中,通过测定对α-愈创木烯底物的活性来了解酶的选择和修饰。在一些实施方案中,可选择氨基酸修饰以改善以下中的一项或多项:酶生产率、对所需底物和/或产物的选择性、稳定性、温度耐受性和表达。
在一些实施方案中,αGOX酶是非血红素铁加氧酶(NHIO)或漆酶。在一些实施方案中,漆酶衍生自细菌或真菌(包括丝状真菌和酵母)。以举例的方式,在一些实施方案中,漆酶来自选自以下的种:曲霉属(Aspergillus)、链孢霉属(Neurospora)(例如粗糙链孢霉(N.crassa))、柄孢壳菌属(Podospora)、葡萄孢属(Botrytis)、金钱菌属(Collybia)、层孔菌属(Fomes)、香菇属(Lentinus)、香菇属、侧耳属(Pleurotus)、栓菌属(Trametes)、丝核菌属(Rhizoctonia)(例如,立枯丝核菌(R.solani))、鬼伞属(Coprinus)(例如,褶纹鬼伞(C.plicatilis))、小脆柄菇属(Psatyrella)、毁丝霉属(Myceliophtera)(例如,嗜热毁丝霉(M.thermophila))、柱顶孢属(Schytalidium)和多孔菌属(Polyporus)(例如,P.pinsitus)、射脉菌属(Phiebia)和革盖菌属(Coriolus)或其衍生物。
在一些实施方案中,CYP450(αGOX)包含与SEQ ID NO:51具有至少50%同一性的氨基酸序列。在一些实施方案中,CYP450酶包含与SEQ ID NO:51至少约55%、或至少约60%、或至少约65%、或至少约70%、或至少约75%、或至少约80%、或至少约85%、或至少约90%、或至少约95%、或至少约98%、或至少约99%同一性的氨基酸序列。例如,CYP450酶可包含相对于SEQ ID NO:51具有1至20个或1至10个氨基酸修饰的氨基酸序列,所述氨基酸修饰独立地选自相对于SEQ ID NO:51的相应位置的氨基酸取代、缺失和插入。
在一些实施方案中,CYP450包含与SEQ ID NO:52具有至少50%同一性的氨基酸序列。在一些实施方案中,CYP450酶包含与SEQ ID NO:52至少约55%、或至少约60%、或至少约65%、或至少约70%、或至少约75%、或至少约80%、或至少约85%、或至少约90%、或至少约95%、或至少约98%、或至少约99%同一性的氨基酸序列。例如,CYP450酶可包含相对于SEQ ID NO:52具有1至20个或1至10个氨基酸修饰的氨基酸序列,所述氨基酸修饰独立地选自相对于SEQ ID NO:52的相应位置的氨基酸取代、缺失和插入。
在一些实施方案中,CYP450包含与SEQ ID NO:54、55或56具有至少50%同一性的氨基酸序列。在一些实施方案中,CYP450酶包含与SEQ ID NO:54、55或56至少约55%、或至少约60%、或至少约65%、或至少约70%、或至少约75%、或至少约80%、或至少约85%、或至少约90%、或至少约95%、或至少约98%、或至少约99%同一性的氨基酸序列。例如,CYP450酶可包含相对于SEQ ID NO:54、55或56具有1至20个或1至10个氨基酸修饰的氨基酸序列,所述氨基酸修饰独立地选自相对于SEQ ID NO:54、55或56的相应位置的氨基酸取代、缺失和插入。
对CYP450酶的氨基酸修饰可通过可用的结构来指导,包括描述于以下中的结构:Pallan等人,“Structural and kinetic basis of steroid17α,20-lyase activity inteleost fish cytochrome P450 17A1 and its absence in cytochrome P450 17A2,”Journal of Biological Chemistry290.6(2015):3248-3268,其在此以引用的方式整体并入。Pallan等人描述了斑马鱼细胞色素P450 17A2以及结构坐标,包括关键活性位点坐标。这些结构坐标可用于构建CYP450酶的同源模型,所述同源模型可用于指导具有改善的特异性和生产率的CYP450酶的工程改造。
在一些实施方案中,CYP450酶需要存在能够将电子转移到CYP450蛋白的电子转移蛋白。在一些实施方案中,此电子转移蛋白是可由微生物宿主细胞表达的细胞色素P450还原酶(CPR)。可使用的各种还原酶描述于美国专利公布号2018/0135081中,其在此以引用的方式整体并入。
示例性P450还原酶包括在本文中显示为SEQ ID NO:27至34或53或其变体的那些。变体通常包括酶,所述酶包含与SEQ ID NO:27至34或53中的任一个具有50%或更高序列同一性的氨基酸序列。在一些实施方案中,变体包含与SEQ ID NO:27至34或53中的任一个的氨基酸序列具有至少约60%序列同一性、或至少约70%序列同一性、或至少约80%序列同一性、或至少约90%序列同一性、或至少约95%序列同一性、或至少约98%序列同一性或至少约99%序列同一性的氨基酸序列。在一些实施方案中,变体包括1个至约20个、或1个至约10个或1个至约5个氨基酸修饰,所述氨基酸修饰独立地选自对选自SEQ ID NO:27至34或53的氨基酸序列的取代、缺失和插入。
在一些实施方案中,CPR包含与SEQ ID NO:53(CaCPR)具有至少50%同一性的氨基酸序列。在一些实施方案中,CPR酶包含与SEQ ID NO:53至少约55%、或至少约60%、或至少约65%、或至少约70%、或至少约75%、或至少约80%、或至少约85%、或至少约90%、或至少约95%、或至少约98%、或至少约99%同一性的氨基酸序列。例如,CPR酶可包含相对于SEQ ID NO:53具有1至20个或1至10个氨基酸修饰的氨基酸序列,所述氨基酸修饰独立地选自相对于SEQ ID NO:53的相应位置的氨基酸取代、缺失和插入。
在一些实施方案中,αGOX反应导致α-愈创木烯羟基化,从而产生一种或多种醇中间体,例如,(2R)-rotundol或(2S)-rotundol(参见图3)。在一些实施方案中,αGOX进一步将至少一部分α-愈创木烯氧化为酮。在一些实施方案中,通过一种或多种醇脱氢酶(ADH)将醇中间体(例如,(2R)-rotundol或(2S)-rotundol)转化为莎草奥酮。因此,在一些实施方案中,微生物宿主细胞表达一种或多种醇脱氢酶(ADH)。以举例的方式,在一些实施方案中,ADH选自包含选自SEQ ID NO:35至44的氨基酸序列或其变体的酶。变体通常包括酶,所述酶包含与SEQ ID NO:35至44中的任一个具有50%或更高序列同一性的氨基酸序列。在一些实施方案中,变体包含与SEQ ID NO:35至44中的任一个的氨基酸序列具有至少约60%序列同一性、或至少约70%序列同一性、或至少约80%序列同一性、或至少约90%序列同一性、或至少约95%序列同一性、或至少约98%序列同一性或至少约99%序列同一性的氨基酸序列。在一些实施方案中,变体包括1个至约20个、或1个至约10个或1个至约5个氨基酸修饰,所述氨基酸修饰独立地选自对选自SEQ ID NO:35至44的氨基酸序列的取代、缺失和插入。在一些实施方案中,可选择氨基酸修饰以改善以下中的一项或多项:酶生产率、对所需底物和/或产物的选择性、稳定性、温度耐受性和表达。
在一些实施方案中,ADH包含与SEQ ID NO:43(VvDH)具有至少50%同一性的氨基酸序列。在一些实施方案中,ADH酶包含与SEQ ID NO:43至少约55%、或至少约60%、或至少约65%、或至少约70%、或至少约75%、或至少约80%、或至少约85%、或至少约90%、或至少约95%、或至少约98%、或至少约99%同一性的氨基酸序列。例如,ADH酶可包含相对于SEQ ID NO:43具有1至20个或1至10个氨基酸修饰的氨基酸序列,所述氨基酸修饰独立地选自相对于SEQ ID NO:43的相应位置的氨基酸取代、缺失和插入。
在一些实施方案中,微生物细胞表达αGOX,并且主要产生莎草奥酮(例如,至少75%的氧化产物是莎草奥酮),而不表达ADH酶。
在各种实施方案中,αGTPS和αGOX一起在操纵子中表达,或者单独表达。所述酶可由诸如质粒或细菌人工染色体的染色体外元件表达,或可在染色体上整合。
在一些实施方案中,细胞不表达αGTPS,但表达α-愈创木烯氧化酶(αGOX),从而允许α-愈创木烯的酶促生物转化,这可以对整个细胞或细胞的全部或部分纯化的提取物发生。
在一些实施方案中,以纯化的重组形式提供αGOX和/或ADH,以用于在无细胞系统中从α-愈创木烯、或(2R)-rotundol或(2S)-rotundol产生莎草奥酮。
在一些实施方案中,还将微生物宿主细胞工程改造以表达或过表达甲基赤藓糖醇磷酸(MEP)和/或甲羟戊酸(MVA)途径中的一种或多种酶,以催化从葡萄糖或其他碳源到异戊烯基焦磷酸(IPP)和二甲基烯丙基焦磷酸(DMAPP)。
在一些实施方案中,将微生物宿主细胞工程改造以表达或过表达MEP途径的一种或多种酶。在一些实施方案中,通过提供某些限速酶的重复拷贝,MEP途径得以增加并与下游途径平衡。MEP(2-C-甲基-D-赤藓糖醇4-磷酸)途径,也称为MEP/DOXP(2-C-甲基-D-赤藓糖醇4-磷酸/l-脱氧-D-木酮糖5-磷酸)途径或非甲羟戊酸途径或甲羟戊酸非依赖性途径是指将甘油醛-3-磷酸和丙酮酸转化为IPP和DMAPP的途径。所述途径通常涉及以下酶的作用:1-脱氧-D-木酮糖-5-磷酸合酶(Dxs)、1-脱氧-D-木酮糖-5-磷酸还原异构酶(IspC)、4-二磷酸胞苷-2-C-甲基-D-赤藓糖醇合酶(IspD)、4-二磷酸胞苷-2-C-甲基-D-赤藓糖醇激酶(IspE)、2C-甲基-D-赤藓糖醇2,4-环二磷酸合酶(IspF)、1-羟基-2-甲基-2-(E)-丁烯基4-二磷酸合酶(IspG)和异戊烯基二磷酸异构酶(IspH)。MEP途径以及构成MEP途径的基因和酶描述于US 8,512,988中,其在此以引用的方式整体并入。例如,构成MEP途径的基因包括dxs、ispC、ispD、ispE、ispF、ispG、ispH、idi和ispA。在一些实施方案中,微生物宿主细胞表达或过表达dxs、ispC、ispD、ispE、ispF、ispG、ispH、idi、ispA或其经修饰的变体中的一种或多种,从而导致IPP和DMAPP产量增加。在一些实施方案中,莎草奥酮至少部分地是通过经由MEP途径的代谢通量产生的,并且其中微生物宿主细胞具有dxs、ispC、ispD、ispE、ispF、ispG、ispH、idi、ispA或其经修饰的变体中的一种或多种的至少一个附加基因拷贝。
在一些实施方案中,将微生物宿主细胞工程改造以表达或过表达MVA途径的一种或多种酶。MVA途径是指将乙酰-CoA转化为IPP的生物合成途径。甲羟戊酸途径通常包含催化以下步骤的酶:(a)将两个乙酰-CoA分子缩合为乙酰乙酰-CoA(例如,通过乙酰乙酰-CoA硫解酶的作用);(b)将乙酰乙酰-CoA与乙酰-CoA缩合形成羟甲基戊二酰-辅酶A(HMG-CoA)(例如,通过HMG-CoA合酶(HMGS)的作用);(c)将HMG-CoA转化为甲羟戊酸(例如,通过HMG-CoA还原酶(HMGR)的作用);(d)将甲羟戊酸磷酸化为甲羟戊酸5-磷酸(例如,通过甲羟戊酸激酶(MK)的作用);(e)将甲羟戊酸5-磷酸转化为甲羟戊酸5-焦磷酸(例如,通过磷酸甲羟戊酸激酶(PMK)的作用);以及(f)将甲羟戊酸5-焦磷酸转化为异戊烯基焦磷酸(例如,通过甲羟戊酸焦磷酸脱羧酶(MPD)的作用)。MVA途径以及构成MVA途径的基因和酶描述于US 7,667,017中,其在此以引用的方式整体并入。在一些实施方案中,微生物宿主细胞表达或过表达乙酰乙酰-CoA硫解酶、HMGS、HMGR、MK、PMK和MPD或其经修饰的变体中的一种或多种,从而导致IPP和DMAPP产量增加。在一些实施方案中,莎草奥酮至少部分地是通过经由MVA途径的代谢通量产生的,并且其中微生物宿主细胞具有乙酰乙酰-CoA硫解酶、HMGS、HMGR、MK、PMK、MPD或其经修饰的变体中的一种或多种的至少一个附加基因拷贝。
在一些实施方案中,将微生物宿主细胞工程改造以增加由葡萄糖产生IPP和DMAPP的产量,如PCT申请号PCT/US2018/016848和PCT/US2018/015527所述,其内容在此以引用的方式整体并入。例如,在一些实施方案中,微生物宿主细胞过表达MEP途径的酶,并且平衡表达以向IPP和DMAPP推/拉碳通量。在一些实施方案中,将微生物宿主细胞工程改造以增加Fe-S簇蛋白的利用率或活性,以便支持作为Fe-S酶的IspG和IspH的更高活性。在一些实施方案中,将宿主细胞工程改造以过表达IspG和IspH,以便向1-羟基-2-甲基-2-(E)-丁烯基4-二磷酸(HMBPP)中间体提供增加的碳通量,但平衡表达以便防止HMBPP以降低细胞生长或活力的量,或以抑制MEP途径通量的量积聚。
IPP和DMAPP前体向法呢基二磷酸(FPP)的转化通常是通过法呢基二磷酸合酶(FPPS)的作用实现的。示例性FPPS酶公开于US2018/0135081中,其在此以引用的方式整体并入。
在一些实施方案中,将宿主细胞工程改造以例如通过降低使用IPP和FPP底物的IspB的表达或活性来下调泛醌生物合成途径。
在一些实施方案中,微生物宿主细胞是选自以下的细菌:埃希氏菌属(Escherichia spp.)、芽孢杆菌属(Bacillus spp.)、棒状杆菌属(Corynebacteriumspp.)、红细菌属(Rhodobacter spp.)、发酵单胞菌属(Zymomonas spp.)、弧菌属(Vibriospp.)和假单胞菌属(Pseudomonas spp.)。例如,在一些实施方案中,细菌宿主细胞是选自以下的种:大肠杆菌、枯草芽孢杆菌、谷氨酸棒杆菌(Corynebacterium glutamicum)、荚膜红细菌(Rhodobacter capsulatus)、类球红细菌(Rhodobacter sphaeroides)、运动发酵单胞菌(Zymomonas mobilis)、需钠弧菌(Vibrio natriegens)或恶臭假单胞菌(Pseudomonasputida)。在一些实施方案中,细菌宿主细胞是大肠杆菌。
在一些实施方案中,微生物宿主细胞是酵母属(Saccharomyces)、毕赤酵母属(Pichia)或耶氏酵母属(Yarrowia)的种,包括但不限于酿酒酵母(Saccharomycescerevisiae)、巴斯德毕赤酵母(Pichia pastoris)和解脂耶氏酵母(Yarrowialipolytica)。
在另一方面,本公开提供一种用于制备莎草奥酮的方法。方法包括提供如本文公开的微生物宿主细胞(或微生物宿主菌株)。微生物宿主细胞表达如本文所述的αGOX酶和任选地表达αGTPS酶。表达αGOX酶的细胞可用于使用全细胞或细胞提取物进行α-愈创木烯的生物转化。表达αGOX酶和αGTPS酶的细胞可从碳源产生莎草奥酮。
在一些实施方案中,微生物宿主细胞进一步表达本文公开的一种或多种醇脱氢酶(ADH)。表达ADH的细胞可将由αGOX反应产生的醇中间体转化为莎草奥酮。
在一些实施方案中,培养宿主细胞以产生莎草奥酮。在一些实施方案中,用碳底物(源)诸如C1、C2、C3、C4、C5和/或C6碳底物培养微生物细胞。在示例性实施方案中,碳源是葡萄糖、蔗糖、果糖、木糖和/或甘油。培养条件通常选自需氧、微需氧和厌氧条件。
在各种实施方案中,宿主细胞在22℃与37℃之间的温度下培养。虽然在细菌诸如大肠杆菌中的商业生物合成可受到过表达的酶和/或外来酶(例如,衍生自植物的酶)稳定时的温度的限制,但是可将重组酶(包括类萜合酶)工程改造以允许将培养物维持在更高的温度下,从而产生更高的产率和更高的总产生力。在一些实施方案中,宿主细胞是细菌宿主细胞,并且在约22℃或更高、约23℃或更高、约24℃或更高、约25℃或更高、约26℃或更高、约27℃或更高、约28℃或更高、约29℃或更高、约30℃或更高、约31℃或更高、约32℃或更高、约33℃或更高、约34℃或更高、约35℃或更高、约36℃或更高、或约37℃下进行培养。
莎草奥酮可从培养基和/或全细胞中提取并加以回收。在一些实施方案中,氧化的莎草奥酮产物被回收并任选地通过分馏(例如精馏)进行富集。氧化的产物可通过任何合适的方法回收,包括将所需产物分配至有机相中。可以例如通过气相色谱法(例如,GC-MS)测定和/或定量所需产物的产量。所需产物可以在分批或连续生物反应器系统中产生。产物的产生、回收和/或产物的分析可如US 2012/0246767中所述进行,所述专利在此以引用的方式整体并入。例如,在一些实施方案中,从含水反应介质中提取氧化的油,这可通过分配到有机相中,之后精馏来完成。馏分中的倍半萜烯和类倍半萜烯组分可通过GC/MS定量测量,之后将馏分掺混。
在一些实施方案中,本文公开的微生物宿主细胞和方法适合于商业生产莎草奥酮,即,微生物宿主细胞和方法可以商业规模生产。在一些实施方案中,培养物的大小为至少约100L、至少约200L、至少约500L、至少约1,000L、至少约10,000L、至少约100,000L或至少约1,000,000L。在一些实施方案中,培养可以分批培养、连续培养或半连续培养形式进行。
在一些方面,本公开提供了用于制备包含莎草奥酮的产品(包括香精和香料组合物或产品)的方法。在一些实施方案中,方法包括通过微生物培养产生如本文所述的莎草奥酮,回收莎草奥酮,以及将莎草奥酮掺入香精或香料组合物或消费品(例如食品)中。
除非内容另外明确指出,否则如本说明书和所附权利要求中所用,单数形式“一(a、an)”、和“所述”包括复数指示物。例如,对“一种细胞”的提及包括两种或更多种细胞的组合等。
如本文所用,关于数字的术语“约”通常是指包括落在所述数字任一方向(大于或小于)的10%范围内的数字。
实施例
莎草奥酮是双环倍半萜烯(图1),并且负责葡萄和葡萄酒以及草药和香辛料(尤其是黑胡椒和白胡椒)中的胡椒香气,其中莎草奥酮具有较高的气味活性值(OAV)。莎草奥酮的生物合成涉及通过α-GTPS萜烯合酶将C15倍半萜烯前体底物法呢基二磷酸(FPP)环化为α-愈创木烯(图2)。α-愈创木烯的酶促氧化可产生莎草奥酮,并且可通过醇中间体进行(图2和图3)。例如,可通过αGOX的作用将α-愈创木烯转化为(2S)-rotundol或(2R)-rotundol,并且可通过αGOX或醇脱氢酶的作用将一种或多种醇中间体(rotundol)转化为莎草奥酮。
实施例1:工程改造α-愈创木烯合酶以改善α-愈创木烯产量
可通过生物合成发酵工艺,使用产生高水平MEP途径产物的微生物菌株以及莎草奥酮生物合成酶的异源表达来产生α-愈创木烯前体、rotundol或莎草奥酮,所述莎草奥酮生物合成酶包括催化以下的酶:1)FPP环化为α-愈创木烯;2)α-愈创木烯氧化为莎草奥酮,其可包括3)rotundol脱氢为莎草奥酮。例如,在细菌诸如大肠杆菌中,异戊烯基焦磷酸(IPP)和二甲基烯丙基焦磷酸(DMAPP)可由葡萄糖或其他碳源产生,并且它们可通过重组法呢基二磷酸合酶(FPPS)转化为法呢基二磷酸(FPP)。由α-愈创木烯合酶(αGTPS)通过环化将FPP转化为α-愈创木烯。通过由α-愈创木烯氧化酶(αGOX)催化的氧化反应将α-愈创木烯转化为rotundol或莎草奥酮。在αGOX酶催化从α-愈创木烯产生(2S)-rotundol或(2R)-rotundol的情况下,可通过脱氢酶催化将rotundol转化为莎草奥酮。
使用产生高水平MEP途径产物IPP和DMAPP的大肠杆菌背景菌株(参见US 2018/0245103和US 2018/0216137,其在此以引用的方式并入),通过与FPPS共表达来筛选候选物αGTPS酶。在96孔板中进行发酵48小时。以下合酶证明了在大肠杆菌中产生α-愈创木烯:AcC1mut1_M42、AcC1mut2_M50、AcC2、AcC3、AcDGuaS2、AcDGuaS3、AcDGuaS4、AcDGuaS5、AmaDGuaS1、AmiDGuaS1、AmiDGuaS2、AmiDGuaS3、AsDGuaS3、PcPS和VvGuaS。除了所需的α-愈创木烯产物外,活性酶还具有不同的产物概况。例如,所有活性沉香酶都显示与α-愈创木烯一起作为主要产物的α-布藜烯(α-bulnesene)。VvGuaS所积累的α-布藜烯和球蛋白水平与α-愈创木烯相似。根据生产率和产物概况,对AcDGuaS3进行选择以供后续研究。
针对在大肠杆菌中将FPP转化为α-愈创木烯的能力筛选了一组对AcDGuaS3序列的氨基酸取代。在96孔板中进行发酵48小时。图4示出了几种突变体(即氨基酸取代)和在α-愈创木烯产生中的相关提高倍数。例如,AcDGuaS3中的F406L取代表明α-愈创木烯的效价显著提高(比野生型高1.71倍)。进一步针对使产物概况向α-愈创木烯偏移的取代对氨基酸取代进行了评估。参见图5。如图所示,野生型AcDGuaS3(I443M)中的单个取代显示出相对于其他产物,α-愈创木烯%的2.4倍提高。类似地,F406L取代显示出相对于其他产物,α-愈创木烯%的2.12倍提高。F512突变表明相对于其他产物,α-愈创木烯%的1.23倍提高。图6示出了与亲本酶相比,基于在AcDGuaS3(α-GS1)中具有F406L取代的变体的表达,α-愈创木烯效价的提高倍数。
实施例2:莎草奥酮的产生
通过在大肠杆菌中与FPPS和α-GS1共表达筛选了候选物α-愈创木烯氧化酶。在表达工程改造的贝壳杉烯氧化酶(KOeng)的情况下观察了rotundol和莎草奥酮的产生。参见US 2018/0135081,其在此以引用的方式整体并入。葡萄脱氢酶(VvDH)以及α-GS1、KOeng和喜树细胞色素P450还原酶(CaCPR)的共表达降低了rotundol的效价(图7A)并增加了莎草奥酮效价(图7B)。通过GC/MS确认了由细胞色素P450氧化α-愈创木烯产生的莎草奥酮(图8A和图8B)。可将KOeng进一步工程改造以改善α-愈创木烯底物的特异性。实施例10中示出了与野生型贝壳杉烯氧化酶的比对,这可有助于此工程改造。
图9示出了基于向日葵大根香叶烯A单加氧酶(HaGAO)的表达,使用替代的CYP450系统在体内产生rotundol和莎草奥酮。大肠杆菌菌株包括α-GS1的表达、用于在大肠杆菌中表达的工程改造的HaGAO(SEQ ID NO:52)和AaCPR(黄花蒿细胞色素P450还原酶;SEQ IDNO:33)。在96孔板中进行发酵48小时。如图9所示,氧化产物基本上是莎草奥酮,仅含有少量的rotundol中间体。
序列
萜烯合酶
SEQ ID NO:1
葡萄VvGuaS
MSVPLSVSVTPILSQRIDPEVARHEATYHPNFWGDRFLHYNPDDDFCGTHACKEQQIQELKEEVRKSLEATAGNTSQLLKLIDSIQRLGLAYHFEREIEEALKAMYQTYTLVDDNDHLTTVSLLFRLLRQEGYHIPSDVFKKFMDEGGNFKESLVGDLPGMLALYEAAHLMVHGEDILDEALGFTTAHLQSMAIDSDNPLTKQVIRALKRPIRKGLPRVEARHYITIYQEDDSHNESLLKLAKLDYNMLQSLHRKELSEITKWWKGLDFATKLPFARDRIVEGYFWILGVYFEPQYYLARRILMKVFGVLSIVDDIYDAYGTFEELKLFTEAIERWDASSIDQLPDYMKVCYQALLDVYEEMEEEMTKQGKLYRVHYAQAALKRQVQAYLLEAKWLKQEYIPTMEEYMSNALVTSACSMLTTTSFVGMGDMVTKEAFDWVFSDPKMIRASNVICRLMDDIVSHEFEQKRGHVASAVECYMKQYGVSKEEAYDEFKKQVESAWKDNNEEVLQPTAVPVPLLTRVLNFSRMVDVLYKDEDEYTLVGPLMKDLVAGMLIDPVPM
SEQ ID NO:2
广藿香PcPS(Q49SP3)
MELYAQSVGVGAASRPLANFHPCVWGDKFIVYNPQSCQAGEREEAEELKVELKRELKEASDNYMRQLKMVDAIQRLGIDYLFVEDVDEALKNLFEMFDAFCKNNHDMHATALSFRLLRQHGYRVSCEVFEKFKDGKDGFKVPNEDGAVAVLEFFEATHLRVHGEDVLDNAFDFTRNYLESVYATLNDPTAKQVHNALNEFSFRRGLPRVEARKYISIYEQYASHHKGLLKLAKLDFNLVQALHRRELSEDSRWWKTLQVPTKLSFVRDRLVESYFWASGSYFEPNYSVARMILAKGLAVLSLMDDVYDAYGTFEELQMFTDAIERWDASCLDKLPDYMKIVYKALLDVFEEVDEELIKLGAPYRAYYGKEAMKYAARAYMEEAQWREQKHKPTTKEYMKLATKTCGYITLIILSCLGVEEGIVTKEAFDWVFSRPPFIEATLIIARLVNDITGHEFEKKREHVRTAVECYMEEHKVGKQEVVSEFYNQMESAWKDINEGFLRPVEFPIPLLYLILNSVRTLEVIYKEGDSYTHVGPAMQNIIKQLYLHPVPY
SEQ ID NO:3
柯拉斯那沉香AcC2(D0VMR6)
MSSAKLGSASEDVNRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQRELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVEASSLIARYIDDLQTYKAEEERGETVSAVRCYMREFGVSEEQACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:4
柯拉斯那沉香AcC3(D0VMR7)
MSSAKLGSASEDVNRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQRELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVEASSLIARYIDDLQTYKAEEERGETVSAVRCYMREFGVSEEQACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:5
柯拉斯那沉香AcC4(D0VMR8)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSDFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDDIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQRELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYIGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYDVSEEEACKKMREMIEIEWKRLNKTTLEADEVSSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:6
柯拉斯那沉香AcC1mut1-M42
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDAWTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREFGVSEEQACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:7
柯拉斯那沉香AcC1mut2-M50
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREFGVSEEQACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:8
柯拉斯那沉香AcDGuaS3(F6LJD3)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQRELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREFGVSEEQACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:9
柯拉斯那沉香AcDGuaS4(F6LJD4)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDNILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKRLEALLPKLSFPLSECVRDALHIPYHRNVQRLAARQYIPQYDAEQTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYIGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYDVSEEEACKKMREMIEIEWKRLNKTTLEADEVSSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:10
柯拉斯那沉香AcDGuaS2(F6LJD2)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETIDLPSKIQLTDEIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHLNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:11
柯拉斯那沉香AcDGuaS5(F6LJD5)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDNILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKRLEALLPKLSFPLSECVRDALHIPYHRNVQRLAARQYIPQYDAEQTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYIGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKQTLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYDVSEEEACKKMREMIEIEWKRLNKTTLEADEVSSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:12
沉香属AmiDGuaS1(A0A0U3ACM2)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDNILEKHEELKQEVRNLLVVETSDLPSKIQLTDKIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAELTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTKAVERWDIEAVQDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEKRGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKKLNKTTLEANEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:13
沉香属AmiDGuaS2(A0A0U3A773)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDNILEKHEELKQEVTNLLVVETSDLPSKIQLTDEIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAELTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRAEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKGRIAALLRHAIEI
SEQ ID NO:14
沉香属AmiDGuaS3(A0A023J8Z5
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNHSILEKHEELKQEVRNLLVVETSDLPSKIQLTDKIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAELTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRAEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:15
沉香属AmaDGuaS1(A0A1B0U478)
MSSAKLGSAPEDVSRRDANYHPTVWGDFFLTHSSNFLENNHSILEKHEELKQEVRNLLVVETSDLPSKIQLTDKIIRLGVGYHFEMEIKAQLEKLQDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYIPQYDAELTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRAEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIATLLRHAIEI
SEQ ID NO:16
沉香属AmaDGuaS2(A0A0U2YQ77)
MSSAKLGSASEDVSRRDADYHPTVWGDFFLTHSSNFLENNHSILEKHEELKQEVRNLLVVETSDLPSKIQLTDKIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTSWCLYEATHLRVDGEDILEEAIQFSRKKLEALLPELSFPLNECVRDALHIPYHRNVQRLAARQYISQYDAELTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRAEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIA
SEQ ID NO:17
沉香属AsDGuaS1(K9MQ67)
MSSAKLGSTSEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFKTEDARTLWCLYEATHLRVDGEDVLEEAIQFSRKKLEALLPELSFPLSECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDSIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEKRGETVSAVRCYMREYGVSEEEACKKMKEMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:18
沉香属AsDGuaS2(K9MNV6)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFKTEDARTLWCLYEATHLRVDGEDVLEEAIQFSRKKLEALLPELSFPLSECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEKRGETVSAVRCYMREYGVSEEEACKKMKEMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:19
沉香属AsDGuaS3(K9MPP8)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDSILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFETEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFKTEDARTLWCLYEATHLRVDGEDVLEEAIQFSRKKLEALLPELSFPLSECVRDALHIPYHRNVQRLAARQYIPQYDAEPTKIESLSLFAKIDFNMLQALHQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREYGVSEEEACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:20
沉香属AsDGuaS4(M9SVT6)
MSSAKLGSASEDVSRRDANYHPTVWGDFFLTHSSNFLENNDNILEKHEELKQEVRNLLVVETSDLPSKIQLTDEIIRLGVGYHFEMEIKAQLEKLHDHQLHLNFDLLTTSVWFRLLRGHGFSISSDVFKRFKNTKGEFETEDARTLWCLYEATHLRVDGEDILEEAIQFSRKRLEALLPKLSFPLSECVRDALHIPYHRNVQRLAARQYIPQYDAEQTKIESLSLFAKIDFNMLQALRQSELREASRWWKEFDFPSKLPYARDRIAEGYYWMMGAHFEPKFSLSRKFLNRIIGITSLIDDTYDVYGTLEEVTLFTEAVERWDIEAVKDIPKYMQVIYTGMLGIFEDFKDNLINARGKDYCIDYAIEVFKEIVRSYQREAEYFHTGYVPSYDEYMENSIISGGYKMFIILMLIGRGEFELKETLDWASTIPEMVKASSLIARYIDDLQTYKAEEERGETVSAVRCYMREFGVSEEQACKKMREMIEIEWKRLNKTTLEADEISSSVVIPSLNFTRVLEVMYDKGDGYSDSQGVTKDRIAALLRHAIEI
SEQ ID NO:21
葡萄大根香叶烯D合酶(VvGDS)
MSVPLSVSVTPILSQRIDPEVARHEATYHPNFWGDRFLHYNPDDDFCGTHACKEQQIQELKEEVRKSLEATAGNTSQLLKLIDSIQRLGLAYHFEREIEEALKAMYQTYTLVDDNDHLTTVSLLFRLLRQEGYHIPSDVFKKFMDEGGNFKESLVGDLPGMLALYEAAHLMVHGEDILDEALGFTTAHLQSMAIDSDNPLTKQVIRALKRPIRKGLPRVEARHYITIYQEDDSHNESLLKLAKLDYNMLQSLHRKELSEITKWWKGLDFATKLPFARDRIVEGYFWILGVYFEPQYYLARRILMKVFGVLSIVDDIYDAYGTFEELKLFTEAIERWDASSIDQLPDYMKVCYQALLDVYEEMEEEMTKQGKLYRVHYAQAALKRQVQAYLLEAKWLKQEYIPRMDEYMSNALVSSACSMLTTTSFVGMGDIVTKEAFDWVFSDPKMIRASNVICRLMDDIVSHEFEQKRGHVASAVECYMKQYGVSKEEAYDEFKKQVESAWKDNNEEFLQPTAVPVPLLTRVLNFSRMMDVLYKDEDEYTLVGPLMKDLVAGMLIDPVPM
α-愈创木烯氧化酶
SEQ ID NO:22
葡萄VvSTO2(F6I534;工程改造的CYP71BE5α-愈创木烯2-氧化酶)
MELQFSFFPILCTFLLFIYLLKRLGKPSRTNHPAPKLPPGPWKLPIIGNMHQLVGSLPHRSLRSLAKKHGPLMHLQLGEVSAIVVSSREMAKEVMKTHDIIFSQRPCILAASIVSYDCTDIAFAPYGGYWRQIRKISVLELLSAKRVQSFRSVREEEVLNLVRSVSLQEGVLINLTKSIFSLTFSIISRTAFGKKCKDQEAFSVTLDKFADSAGGFTIADVFPSIKLLHVVSGMRRKLEKVHKKLDRILGNIINEHKARSAAKETCEAEVDDDLVDVLLKVQKQGDLEFPLTMDNIKAVLLDLFVAGTETSSTAVEWAMAEMLKNPRVMAKAQAEVRDIFSRKGNADETVVRELKFLKLVIKETLRLHPPVPLLIPRESRERCAINGYEIPVKTRVIINAWAIARDPKYWTDAESFNPERFLDSSIDYQGTNFEYIPFGAGRRMCPGILFGMANVELALAQLLYHFDWKLPNGARHEELDMTEGFRTSTKRKQDLYLIPITYRPLPVE
SEQ ID NO:23
枯草芽孢杆菌BsAGOX1(O31440;cypC CYP152A1)
MNEQIPHDKSLDNSLTLLKEGYLFIKNRTERYNSDLFQARLLGKNFICMTGAEAAKVFYDTDRFQRQNALPKRVQKSLFGVNAIQGMDGSAHIHRKMLFLSLMTPPHQKRLAELMTEEWKAAVTRWEKADEVVLFEEAKEILCRVACYWAGVPLKETEVKERADDFIDMVDAFGAVGPRHWKGRRARPRAEEWIEVMIEDARAGLLKTTSGTALHEMAFHTQEDGSQLDSRMAAIELINVLRPIVAISYFLVFSALALHEHPKYKEWLRSGNSREREMFVQEVRRYYPFGPFLGALVKKDFVWNNCEFKKGTSVLLDLYGTNHDPRLWDHPDEFRPERFAEREENLFDMIPQGGGHAEKGHRCPGEGITIEVMKASLDFLVHQIEYDVPEQSLHYSLARMPSLPESGFVMSGIRRKS
SEQ ID NO:24
枯草芽孢杆菌BsAGOX1(A5HNX5;pksS CYP107K1)
MQMEKLMFHPHGKEFHHNPFSVLGRFREEEPIHRFELKRFGATYPAWLITRYDDCMAFLKDNRITRDVKNVMNQEQIKMLNVSEDIDFVSDHMLAKDTPDHTRLRSLVHQAFTPRTIENLRGSIEQIAEQLLDEMEKENKADIMKSFASPLPFIVISELMGIPKEDRSQFQIWTNAMVDTSEGNRELTNQALREFKDYIAKLIHDRRIKPKDDLISKLVHAEENGSKLSEKELYSMLFLLVVAGLETTVNLLGSGTLALLQHKKECEKLKQQPEMIATAVEELLRYTSPVVMMANRWAIEDFTYKGHSIKRGDMIFIGIGSANRDPNFFENPEILNINRSPNRHISFGFGIHFCLGAPLARLEGHIAFKALLKRFPDIELAVAPDDIQWRKNVFLRGLESLPVSLSK
SEQ ID NO:25
蜡状芽孢杆菌BcAGOX1(Q737I9;BCE_2659CYP106)
MASPENVILVHEISKLKTKEELWNPYEWYQFMRDNHPVHYDEEQDVWNVFLYEDVNRVLSDYRLFSSRRERRQFSIPPLETRININSTDPPEHRNVRSIVSKAFTPRSLEQWKPRIQAIANELVQHIGKYSEVNIVEEFAAPLPVTVISDLLGVPTTDRKKIKAWSDILFMPYSKEKFNDLDVEKGIALNEFKAYLLPIVQEKRYHLTDDIISDLIRAEYEGERLTDEEIVTFSLGLLAAGNETTTNLIINSFYCFLVDSPGTYKELREEPTLISKAIEEVLRYRFPITLARRITEDTNIFGPLMKKDQMVVAWVSAANLDEKKFSQASKFNIHRIGNEKHLTFGKGPHFCLGAPLARLEAEIALTTFINAFEKIALSPSFNLEQCILENEQTLKFLPICLKTQ
SEQ ID NO:26
蜡状芽孢杆菌BcAGOX2(Q737F3;cypA BCE_2696CYP107)
MKKLTFNDLNSPETMRNPIMFYKNLMEQKERFFHIDDFYGMGGAWVVFHYDDVVAILKDSRFIKDLRKFTPPHYKQNPIEENTAVSKLFEWLMNMPNMLTVDPPDHTRLRRLVSKSFTPRMIEDLRPRIQQIADELLDVVQEQRKMEIIADFAYPLPIIVISEMLGIPATDRNQFRAWTQELMKASVDPGQGTTVTATLEKFINYIEILFNEKHLNPSDDLISALVQAKEQEDKLSKNELLSTIWLLIIAGHETTVNLISNGVLALLQHPEQMNLLRQDPSLLASAVDELLRYAGPIMFSSRFASEDVTIHGNRIRKGELVLLSLTAANIDPNIFPYPEELNISREENNHLAFGAGIHQCLGAPLARLEGQIALDTLLKRLPNLRLAIEADQLIYNHSKIRSLASLPVIF
SEQ ID NO:45
绒毛状烟草(Nicotiana tomentosiformis)NtKO
MDAILNLQTVPLGTALTIGGPAVALGGISLWFLKEYVNDQKRKSSNFLPPLPEVPGLPVIGNLLQLTEKKPHKTFTNWAETYGPIYSIKTGANTIVVLNTNELAKEAMVTRYSAISTRKLTNALKILTCDKSIVAISDYDEFHKTVKRHVLTSVLGPNAQKRHRIHRDTLIENVSKQLHDLVRKYPNEAVNLRKIFQSELFGLALKQALGKDIESIYVEGLDATLPREDVLKTLVLDIMEGAIDVDWRDFFPYLKWVPNKSFENRIQRKHLRREAVMKALIMEQRKRINSGEKLNSYIDYLSSEANTLTEKQILMLLWEAIIETSDTTVVSTEWAMYELAKDPKRQEQLFLEIQNVCGSNKITEEKLCQLPYLCAVFHETLRKHSPVPIVPLRYVHEDTQLGGYHIPKGAEIAINIYGCNRDKKVWESPEEWKPERFLDGKYDPVELQKTMAFGGGKRVCAGALQAMTITCTTIARLIQEFEWSLKDGEEENVATMGLTTHKLHPMQAHIKPRK
SEQ ID NO:46
莴苣(Lactuca sativa)LsKO
MDGVIDMQTIPLRTAIAIGGTAVALVVALYFWFLRSYASPSHHSNHLPPVPEVPGVPVLGNLLQLKEKKPYMTFTKWAEMYGPIYSIRTGATSMVVVSSNEIAKEVVVTRFPSISTRKLSYALKVLTEDKSMVAMSDYHDYHKTVKRHILTAVLGPNAQKKFRAHRDTMMENVSNELHAFFEKNPNQEVNLRKIFQSQLFGLAMKQALGKDVESIYVKDLETTMKREEIFEVLVVDPMMGAIEVDWRDFFPYLKWVPNKSFENIIHRMYTRREAVMKALIQEHKKRIASGENLNSYIDYLLSEAQTLTDKQLLMSLWEPIIESSDTTMVTTEWAMYELAKNPNMQDRLYEEIQSVCGSEKITEENLSQLPYLYAVFQETLRKHCPVPIMPLRYVHENTVLGGYHVPAGTEVAINIYGCNMDKKVWENPEEWNPERFLSEKESMDLYKTMAFGGGKRVCAGSLQAMVISCIGIGRLVQDFEWKLKDDAEEDVNTLGLTTQKLHPLLALINPRK
SEQ ID NO:47
洋蓟(Cynara cardunculus var.scolymus)CcKO
MDMQSIPAIAIGSTAVAIALGLFFWFFRRHVPDHIDHPNHLPSVPEVPGIPVLGNLLQLKEKKPYMTFTKWAETYGPIYSIRTGAISMVVVSSNAIAKEALVTRFPSISTRKLSKALEVLTADKTMVAMSDYNDYHKTVKRHILTAVLGPNAQKKHRVHRDIMMQNLSNQLHTFVQNSPQEEVNLRKVFQSELFGLAMRQTMGKDVESIYVEDLGTTMNRDEIFQVLVVDPLMGAIEVDWRDFFPYLKWIPNRNFENTIQQMYIRREAVMKALIQEHRKRIASGENLNSYIDYLLSEAQTLSEKQLXMSLWEPIIESSDTTMVTTEWAMYELAKNPKIQDRLYREIQGVCGSDKIXEENLGQLPYLSAIFNETLRRHGPVPIIPLRYVHEDTELGGYHIPAGTQIAVNIYGCNMEKAVWENPEEWNPERFFEVEGDQKTMAFGGGKRVCAGSLQAMLIACIGIGRMVQEFEWKLKDEAAQEDVNTLGLTTQKLRPLHAIIYPRKENDAKVWKC
SEQ ID NO:49
黄花蒿AaKO
MDALTDMLQIPPATPITVAITTVTIAVAIFLYIKSHASNHSRRSTHLPPVPEVPGVPVLGNLLQLKEKKPYLTFTRWAQTYGAIYSIRTGATSMVVVSSSEIAKEAMVTRFSSISTRNLSKALTILTADKTMVAMSDYNDYHRTVKRHILTAMLGPNAQRKQRVHRDFMIENISKQLHAFVENSPKEEVDLRKIFQSELFGLAMKQAVGKDVESLNVEDLGVTMKRDEIFQVLVVDPMMGAIEVDWRDFFPYLKWVPNKKFENTIQQMYIRRKAVMKALIKEHKKRIASGENLNSYIDYLLSEAQTFTDEQLIMSLWEPIIESSDTTMVTTEWAMYELAKNPKMQDRLYRDIQSVCGSDKITEENLSQLPYLSAIFHETLRRHSPVPIIPLRHVHEDTVLGGYHVPAGTELAVNIYGCNMEKNVWENPEEYNPDRFMKENETIDMQRTMAFGGGKRVCAGSLQAMLISCIGIGRMVQEFEWRFKDKAEEDINTLGLTTQRLNPLRAIIKPRN
SEQ ID NO:50
向日葵HaKO
MDALTGMLPIPPATALAIGGTAIALAVAISFWFLRSYTSGESNRLPRVPEVPGVPVLGNLLQLKEKKPYMTFTRWAETYGPIYSIRTGATSMVVVSSNEIAKEAFVTRFESISTRNLSKALKILTDDKTMVAMSDYNDYHKTVKRHILTAMLGPNAQKKHRIQRDIMMENLSNRLHAFVKTSTEQEEVDLREIFQSELFGLAMRQTMGKDVESIYVEDLKITMKRDEIFQVLVVDPMMGAIDVDWRDFFPYLKWVPNKKFENTIQQMYIRREAVMKALIKQHKERIASGEKLNSYIDYLLSEAQSLTDRQLLMSVWEPIIESSDTTMVTTEWAIYELAKNPHIQDRLYRDIQSVCGSDIIKEEHLSQLPFITAIFHETLRRHSPVPIIPLRYVHEDTVLGGYHVPAGTELAINIYGCNMEKSVWENPEEWNPERFMKENETIDFQKTMAFGGGKRVCAGSLQAMLISCVGIGRMVQEFKWELKNKAQEEVNTIGLTTQMLRPLRAIIKPRN
SEQ ID NO:51
工程改造的贝壳杉烯氧化酶(KOeng)
MAWEYALIGLVVGIIIGAVAMRWYLKSYTSARRSQSNHLPRVPEVPGVPLLGNLLQLKEKKPYMTFTKWAATYGPIYSIKTGATSVVVVSSNEIAKEALVTRFQSISTRNLSKALKVLTADKQMVAMSDYDDYHKTVKRHILTAVLGPNAQKKHRIHRDIMMDNISTQLHEFVKNNPEQEEVDLRKIFQSELFGLAMRQALGKDVESLYVEDLKITMNRDEILQVLVVDPMMGAIDVDWRDFFPYLKWVPNKKFENTIQQMYIRREAVMKSLIKEQKKRIASGEKLNSYIDYLLSEAQTLTDQQLLMSLWEPIIESSDTTMVTTEWAMYELAKNPKLQDRLYRDIKSVCGSEKITEEHLSQLPYITAIFHETLRKHSPVPILPLRHVHEDTVLGGYHVPAGTELAVNIYGCNMDKNVWENPEEWNPERFMKENETIDFQKTMAFGGGKRVCAGSLQALLIASIGIGRMVQEFEWKLKDMTQEEVNTIGLTNQMLRPLRAIIKPRI
SEQ ID NO:52
向日葵大根香叶烯A单加氧酶;工程改造的HaGAO
MAKPPLFFIVIIGLIVVAASFLYKLLTRPTSSKNRLPEPWRLPIIGHMHHLIGTMPHRGVMDLARKYGSLMHLQLGEVSAIVVSSPKWAKEILTTYDIPFANRPETLTGEIIAYHNTDIVLAPYGEYWRQLRKLCTLELLSVKKVKSFQSLREEECWNLVQEIKASGSGTPFNLSEGIFKVIATVLSRAAFGKGIKDQKQFTEIVKEILRETGGFDVADIFPSKKFLHHLSGKRGRLTSIHNKLDSLINNLVAEHTVSKSSKVNETLLDVLLRLKNSEEFPLTADNVKAIILDMFGAGTDTSSATVEWAISELIRCPRAMEKVQAELRQALNGKERIKEEEIQDLPYLNLVIRETLRLHPPLPLVMPRECRQAMNLAGYDVANKTKLIVNVFAINRDPEYWKDAESFNPERFENSNTTIMGADYEYLPFGAGRRMCPGSALGLANVQLPLANILYYFKWKLPNGASHDQLDMTESFGATVQRKTELMLVPSF
SEQ ID NO:54
α-律草烯(Alpha-humulene)10-羟化酶;工程改造的CYP71BA1
MAQDLRLILIIVGAIAIIALLVHGFLLIKRSSRSSVHKQQVLLASLPPSPPRLPLIGNIHQLVGGNPHRILLQLARTHGPLICLRLGQVDQVVASSVEAVEEIIKRHDLKFADRPRDLTFSRIFFYDGNAVVMTPYGGEWKQMRKIYAMELLNSRRVKSFAAIREDVARKLTGEIAHKAFAQTPVINLSEMVMSMINAIVIRVAFGDKCKQQAYFLHLVKEAMSYVSSFSVADMYPSLKFLDTLTGLKSKLEGVHGKLDKVFDEIIAQRQAALAAEQAEEDLIIDVLLKLKDEGNQEFPITYTSVKAIVMEIFLAGTETSSSVIDWVMSELIKNPKAMEKVQKEMREAMQGKTKLEESDIPKFSYLNLVIKETLRLHPPGPLLFPRECRETCEVMGYRVPAGARLLINAFALSRDEKYWGSDAESFKPERFEGISVDFKGSNFEFMPFGAGRRICPGMTFGISSVEVALAHLLFHFDWQLPQGMKIEDLDMMEVSGMSATRRSPLLVLAKLIIPLP
SEQ ID NO:55
对映-异贝壳杉烯(Ent-isokaurene)C2-羟化酶;工程改造的CYP71Z18
MAQDLRLILIIVGAIAIIALLVHGFLKSAVTKPKLNLPPGPWTLPLIGSIHHIVSNPLPYRAMRELAHKHGPLMMLWLGEVPTLVVSSPEAAQAITKTHDVSFADRHINSTVDILTFNGMDMVFGSYGEQWRQLRKLSVLELLSAARVQSFQRIREEEVARFMRSLAASASAGATVDLSKMISSFINDTFVRESIGSRCKYQDEYLAALDTAIRVAAELSVGNIFPSSRVLQSLSTARRKAIASRDEMARILGQIIRETKESMDQGDKTSNESMISVLLRLQKDAGLPIELTDNVVMALMFDLFGAGSDTSSTTLTWCMTELVRYPATMAKAQAEVREAFKGKTTITEDDLSTANLRYLKLVVKEALRLHCPVPLLLPRKCREACQVMGYDIPKGTCVFVNVWAICRDPRYWEDAEEFKPERFENSNLDYKGTYYEYLPFGSGRRMCPGANLGVANLELALASLLYHFDWKLPSGQEPKDVDVWEAAGLVAKKNIGLVLHPVSHIAPVNA
SEQ ID NO:56
普萘哌二烯(Premnaspirodiene)加氧酶;工程改造的CYP71D55_V482I/A484I
MAQDLRLILIIVGAIAIIALLVHGFFLLRKWKNSNSQSKKLPPGPWKLPLLGSMLHMVGGLPHHVLRDLAKKYGPLMHLQLGEVSAVVVTSPDMAKEVLKTHDIAFASRPKLLAPEIVCYNRSDIAFCPYGDYWRQMRKICVLEVLSAKNVRSFSSIRRDEVLRLVNFVRSSTSEPVNFTERLFLFTSSMTCRSAFGKVFKEQETFIQLIKEVIGLAGGFDVADIFPSLKFLHVLTGMEGKIMKAHHKVDAIVEDVINEHKKNLAMGKTNGALGGEDLIDVLLRLMNDGGLQFPITNDNIKAIIFDMFAAGTETSSSTLVWAMVQMMRNPTILAKAQAEVREAFKGKETFDENDVEELKYLKLVIKETLRLHPPVPLLVPRECREETEINGYTIPVKTKVMVNVWALGRDPKYWDDADNFKPERFEQCSVDFIGNNFEYLPFGGGRRICPGISFGLANVYLPLAQLLYHFDWKLPTGMEPKDLDLTELVGITIARKSDLMLVATPYQPSRE
细胞色素P450还原酶
SEQ ID NO:27
甜叶菊(Stevia rebaudiana)SrCPR
MAQSDSVKVSPFDLVSAAMNGKAMEKLNASESEDPTTLPALKMLVENRELLTLFTTSFAVLIGCLVFLMWRRSSSKKLVQDPVPQVIVVKKKEKESEVDDGKKKVSIFYGTQTGTAEGFAKALVEEAKVRYEKTSFKVIDLDDYAADDDEYEEKLKKESLAFFFLATYGDGEPTDNAANFYKWFTEGDDKGEWLKKLQYGVFGLGNRQYEHFNKIAIVVDDKLTEMGAKRLVPVGLGDDDQCIEDDFTAWKELVWPELDQLLRDEDDTSVTTPYTAAVLEYRVVYHDKPADSYAEDQTHTNGHVVHDAQHPSRSNVAFKKELHTSQSDRSCTHLEFDISHTGLSYETGDHVGVYSENLSEVVDEALKLLGLSPDTYFSVHADKEDGTPIGGASLPPPFPPCTLRDALTRYADVLSSPKKVALLALAAHASDPSEADRLKFLASPAGKDEYAQWIVANQRSLLEVMQSFPSAKPPLGVFFAAVAPRLQPRYYSISSSPKMSPNRIHVTCALVYETTPAGRIHRGLCSTWMKNAVPLTESPDCSQASIFVRTSNFRLPVDPKVPVIMIGPGTGLAPFRGFLQERLALKESGTELGSSIFFFGCRNRKVDFIYEDELNNFVETGALSELIVAFSREGTAKEYVQHKMSQKASDIWKLLSEGAYLYVCGDAKGMAKDVHRTLHTIVQEQGSLDSSKAELYVKNLQMSGRYLRDVW
SEQ ID NO:28
拟南芥(Arabidopsis thaliana)AtCPR1
MATSALYASDLFKQLKSIMGTDSLSDDVVLVIATTSLALVAGFVVLLWKKTTADRSGELKPLMIPKSLMAKDEDDDLDLGSGKTRVSIFFGTQTGTAEGFAKALSEEIKARYEKAAVKVIDLDDYAADDDQYEEKLKKETLAFFCVATYGDGEPTDNAARFYKWFTEENERDIKLQQLAYGVFALGNRQYEHFNKIGIVLDEELCKKGAKRLIEVGLGDDDQSIEDDFNAWKESLWSELDKLLKDEDDKSVATPYTAVIPEYRVVTHDPRFTTQKSMESNVANGNTTIDIHHPCRVDVAVQKELHTHESDRSCIHLEFDISRTGITYETGDHVGVYAENHVEIVEEAGKLLGHSLDLVFSIHADKEDGSPLESAVPPPFPGPCTLGTGLARYADLLNPPRKSALVALAAYATEPSEAEKLKHLTSPDGKDEYSQWIVASQRSLLEVMAAFPSAKPPLGVFFAAIAPRLQPRYYSISSSPRLAPSRVHVTSALVYGPTPTGRIHKGVCSTWMKNAVPAEKSHECSGAPIFIRASNFKLPSNPSTPIVMVGPGTGLAPFRGFLQERMALKEDGEELGSSLLFFGCRNRQMDFIYEDELNNFVDQGVISELIMAFSREGAQKEYVQHKMMEKAAQVWDLIKEEGYLYVCGDAKGMARDVHRTLHTIVQEQEGVSSSEAEAIVKKLQTEGRYLRDVW
SEQ ID NO:29
拟南芥AtCPR2
MASSSSSSSTSMIDLMAAIIKGEPVIVSDPANASAYESVAAELSSMLIENRQFAMIVTTSIAVLIGCIVMLVWRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDDFTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDINMANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTYETGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAGKDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSENCSSAPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEYVQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW
SEQ ID NO:30
拟南芥eATR2
MASSSSSSSTSMIDLMAAIIKGEPVIVSDPANASAYESVAAELSSMLIENRQFAMIVTTSIAVLIGCIVMLVWRRSGSGNSKRVEPLKPLVIKPREEEIDDGRKKVTIFFGTQTGTAEGFAKALGEEAKARYEKTRFKIVDLDDYAADDDEYEEKLKKEDVAFFFLATYGDGEPTDNAARFYKWFTEGNDRGEWLKNLKYGVFGLGNRQYEHFNKVAKVVDDILVEQGAQRLVQVGLGDDDQCIEDDFTAWREALWPELDTILREEGDTAVATPYTAAVLEYRVSIHDSEDAKFNDITLANGNGYTVFDAQHPYKANVAVKRELHTPESDRSCIHLEFDIAGSGLTMKLGDHVGVLCDNLSETVDEALRLLDMSPDTYFSLHAEKEDGTPISSSLPPPFPPCNLRTALTRYACLLSSPKKSALVALAAHASDPTEAERLKHLASPAGKDEYSKWVVESQRSLLEVMAEFPSAKPPLGVFFAGVAPRLQPRFYSISSSPKIAETRIHVTCALVYEKMPTGRIHKGVCSTWMKNAVPYEKSEKLFLGRPIFVRQSNFKLPSDSKVPIIMIGPGTGLAPFRGFLQERLALVESGVELGPSVLFFGCRNRRMDFIYEEELQRFVESGALAELSVAFSREGPTKEYVQHKMMDKASDIWNMISQGAYLYVCGDAKGMARDVHRSLHTIAQEQGSMDSTKAEGFVKNLQTSGRYLRDVW
SEQ ID NO:32
甜叶菊SrCPR3
MAQSNSVKISPLDLVTALFSGKVLDTSNASESGESAMLPTIAMIMENRELLMILTTSVAVLIGCVVVLVWRRSSTKKSALEPPVIVVPKRVQEEEVDDGKKKVTVFFGTQTGTAEGFAKALVEEAKARYEKAVFKVIDLDDYAADDDEYEEKLKKESLAFFFLATYGDGEPTDNAARFYKWFTEGDAKGEWLNKLQYGVFGLGNRQYEHFNKIAKVVDDGLVEQGAKRLVPVGLGDDDQCIEDDFTAWKELVWPELDQLLRDEDDTTVATPYTAAVAEYRVVFHEKPDALSEDYSYTNGHAVHDAQHPCRSNVAVKKELHSPESDRSCTHLEFDISNTGLSYETGDHVGVYCENLSEVVNDAERLVGLPPDTYFSIHTDSEDGSPLGGASLPPPFPPCTLRKALTCYADVLSSPKKSALLALAAHATDPSEADRLKFLASPAGKDEYSQWIVASQRSLLEVMEAFPSAKPSLGVFFASVAPRLQPRYYSISSSPKMAPDRIHVTCALVYEKTPAGRIHKGVCSTWMKNAVPMTESQDCSWAPIYVRTSNFRLPSDPKVPVIMIGPGTGLAPFRGFLQERLALKEAGTDLGLSILFFGCRNRKVDFIYENELNNFVETGALSELIVAFSREGPTKEYVQHKMSEKASDIWNLLSEGAYLYVCGDAKGMAKDVHRTLHTIVQEQGSLDSSKAELYVKNLQMSGRYLRDVWSEQ ID NO:33
黄花蒿AaCPR
MAQSTTSVKLSPFDLMTALLNGKVSFDTSNTSDTNIPLAVFMENRELLMILTTSVAVLIGCVVVLVWRRSSSAAKKAAESPVIVVPKKVTEDEVDDGRKKVTVFFGTQTGTAEGFAKALVEEAKARYEKAVFKVIDLDDYAAEDDEYEEKLKKESLAFFFLATYGDGEPTDNAARFYKWFTEGEEKGEWLDKLQYAVFGLGNRQYEHFNKIAKVVDEKLVEQGAKRLVPVGMGDDDQCIEDDFTAWKELVWPELDQLLRDEDDTSVATPYTAAVAEYRVVFHDKPETYDQDQLTNGHAVHDAQHPCRSNVAVKKELHSPLSDRSCTHLEFDISNTGLSYETGDHVGVYVENLSEVVDEAEKLIGLPPHTYFSVHADNEDGTPLGGASLPPPFPPCTLRKALASYADVLSSPKKSALLALAAHATDSTEADRLKFLASPAGKDEYAQWIVASHRSLLEVMEAFPSAKPPLGVFFASVAPRLQPRYYSISSSPRFAPNRIHVTCALVYEQTPSGRVHKGVCSTWMKNAVPMTESQDCSWAPIYVRTSNFRLPSDPKVPVIMIGPGTGLAPFRGFLQERLAQKEAGTELGTAILFFGCRNRKVDFIYEDELNNFVETGALSELVTAFSREGATKEYVQHKMTQKASDIWNLLSEGAYLYVCGDAKGMAKDVHRTLHTIVQEQGSLDSSKAELYVKNLQMAGRYLRDVW
SEQ ID NO:34
香叶天竺葵(Pelargonium graveolens)PgCPR
MAQSSSGSMSPFDFMTAIIKGKMEPSNASLGAAGEVTAMILDNRELVMILTTSIAVLIGCVVVFIWRRSSSQTPTAVQPLKPLLAKETESEVDDGKQKVTIFFGTQTGTAEGFAKALADEAKARYDKVTFKVVDLDDYAADDEEYEEKLKKETLAFFFLATYGDGEPTDNAARFYKWFLEGKERGEWLQNLKFGVFGLGNRQYEHFNKIAIVVDEILAEQGGKRLISVGLGDDDQCIEDDFTAWRESLWPELDQLLRDEDDTTVSTPYTAAVLEYRVVFHDPADAPTLEKSYSNANGHSVVDAQHPLRANVAVRRELHTPASDRSCTHLEFDISGTGIAYETGDHVGVYCENLAETVEEALELLGLSPDTYFSVHADKEDGTPLSGSSLPPPFPPCTLRTALTLHADLLSSPKKSALLALAAHASDPTEADRLRHLASPAGKDEYAQWIVASQRSLLEVMAEFPSAKPPLGVFFASVAPRLQPRYYSISSSPRIAPSRIHVTCALVYEKTPTGRVHKGVCSTWMKNSVPSEKSDECSWAPIFVRQSNFKLPADAKVPIIMIGPGTGLAPFRGFLQERLALKEAGTELGPSILFFGCRNSKMDYIYEDELDNFVQNGALSELVLAFSREGPTKEYVQHKMMEKASDIWNLISQGAYLYVCGDAKGMARDVHRTLHTIAQEQGSLDSSKAESMVKNLQMSGRYLRDVW
SEQ ID NO:53
喜树细胞色素P450还原酶;CaCPR
MAQSSSVKVSTFDLMSAILRGRSMDQTNVSFESGESPALAMLIENRELVMILTTSVAVLIGCFVVLLWRRSSGKSGKVTEPPKPLMVKTEPEPEVDDGKKKVSIFYGTQTGTAEGFAKALAEEAKVRYEKASFKVIDLDDYAADDEEYEEKLKKETLTFFFLATYGDGEPTDNAARFYKWFMEGKERGDWLKNLHYGVFGLGNRQYEHFNRIAKVVDDTIAEQGGKRLIPVGLGDDDQCIEDDFAAWRELLWPELDQLLQDEDGTTVATPYTAAVLEYRVVFHDSPDASLLDKSFSKSNGHAVHDAQHPCRANVAVRRELHTPASDRSCTHLEFDISGTGLVYETGDHVGVYCENLIEVVEEAEMLLGLSPDTFFSIHTDKEDGTPLSGSSLPPPFPPCTLRRALTQYADLLSSPKKSSLLALAAHCSDPSEADRLRHLASPSGKDEYAQWVVASQRSLLEVMAEFPSAKPPIGAFFAGVAPRLQPRYYSISSSPRMAPSRIHVTCALVFEKTPVGRIHKGVCSTWMKNAVPLDESRDCSWAPIFVRQSNFKLPADTKVPVLMIGPGTGLAPFRGFLQERLALKEAGAELGPAILFFGCRNRQMDYIYEDELNNFVETGALSELIVAFSREGPKKEYVQHKMMEKASDIWNMISQEGYIYVCGDAKGMARDVHRTLHTIVQEQGSLDSSKTESMVKNLQMNGRYLRDVW
醇脱氢酶
SEQ ID NO:35
二穗短柄草(Brachypodium distachyon)BdDH
MSAAAAVSSSSSPRLEGKVALVTGGASGIGEAIVRLFRQHGAKVCIADVQDEAGQQVRDSLGDDAGTDVLFVHCDVTVEEDVSRAVDAAAEKFGTLDIMVNNAGITGDKVTDIRNLDFAEVRKVFDINVHGMLLGMKHAARVMIPGKKGSIVSLASVASVMGGMGPHAYTASKHAVVGLTKSVALELGKHGIRVNCVSPYAVPTALSMPHLPQGEHKGDAVRDFLAFVGGEANLKGVDLLPKDVAQAVLYLASDEARYISALNLVVDGGFTSVNPNLKAFED
SEQ ID NO:36
橙子(Citrus sinensis)CsABA2
MSNSNSTDSSPAVQRLVGRVALITGGATGIGESTVRLFHKHGAKVCIADVQDNLGQQVCQSLGGEPDTFFCHCDVTKEEDVCSAVDLTVEKFGTLDIMVNNAGISGAPCPDIREADLSEFEKVFDINVKGVFHGMKHAARIMIPQTKGTIISICSVAGAIGGLGPHAYTGSKHAVLGLNKNVAAELGKYGIRVNCVSPYAVATGLALAHLPEEERTEDAMVGFRNFVARNANMQGTELTANDVANAVLFLASDEARYISGTNLMVDGGFTSVNHSLRVFR
SEQ ID NO:37
橙子CsDH
MATPPISSLISQRLLGKVALVTGGASGIGEGIVRLFHRHGAKVCFVDVQDELGYRLQESLVGDKDSNIFYSHCDVTVEDDVRRAVDLTVTKFGTLDIMVNNAGISGTPSSDIRNVDVSEFEKVFDINVKGVFMGMKYAASVMIPRKQGSIISLGSVGSVIGGIGPHHYISSKHAVVGLTRSIAAELGQHGIRVNCVSPYAVPTNLAVAHLPEDERTEDMFTGFREFAKKNANLQGVELTVEDVANAVLFLASEDARYISGDNLIVDGGFTRVNHSFRVFR
SEQ ID NO:38
橙子CsDH1
MSKPRLQGKVAIIMGAASGIGEATAKLFAEHGAFVIIADIQDELGNQVVSSIGPEKASYRHCDVRDEKQVEETVAYAIEKYGSLDIMYSNAGVAGPVGTILDLDMAQFDRTIATNLAGSVMAVKYAARVMVANKIRGSIICTTSTASTVGGSGPHAYTISKHGLLGLVRSAASELGKHGIRVNCVSPFGVATPFSAGTINDVEGFVCKVANLKGIVLKAKHVAEAALFLASDESAYVSGHDLVVDGGFTAVTNVMSMLEGHG
SEQ ID NO:39
橙子CsDH2
MSNPRMEGKVALITGAASGIGEAAVRLFAEHGAFVVAADVQDELGHQVAASVGTDQVCYHHCDVRDEKQVEETVRYTLEKYGKLDVLFSNAGIMGPLTGILELDLTGFGNTMATNVCGVAATIKHAARAMVDKNIRGSIICTTSVASSLGGTAPHAYTTSKHALVGLVRTACSELGAYGIRVNCISPFGVATPLSCTAYNLRPDEVEANSCALANLKGIVLKAKHIAEAALFLASDESAYISGHNLAVDGGFTVVNHSSSSAT
SEQ ID NO:40
橙子CsDH3
MTTAGSRDSPLVAQRLLGKVALVTGGATGIGESIVRLFHKHGAKVCVVDINDDLGQHLCQTLGPTTRFIHGDVAIEDDVSRAVDFTVANFGTLDIMVNNAGMGGPPCPDIREFPISTFEKVFDINTKGTFIGMKHAARVMIPSKKGSIVSISSVTSAIGGAGPHAYTASKHAVLGLTKSVAAELGQHGIRVNCVSPYAILTNLALAHLHEDERTDDARAGFRAFIGKNANLQGVDLVEDDVANAVLFLASDDARYISGDNLFVDGGFTCTNHSLRVFR
SEQ ID NO:41
红串红球菌(Rhodococcus erythropolis)ReCDH
MARVEGQVALITGAARGQGRSHAIKLAEEGADVILVDVPNDVVDIGYPLGTADELDQTAKDVENLGRKAIVIHADVRDLESLTAEVDRAVSTLGRLDIVSANAGIASVPFLSHDIPDNTWRQMIDINLTGVWHTAKVAVPHILAGERGGSIVLTSSAAGLKGYAQISHYSAAKHGVVGLMRSLALELAPHRVRVNSLHPTQVNTPMIQNEGTYRIFSPDLENPTREDFEIASTTTNALPIPWVESVDVSNALLFLVSEDARYITGAAIPVDAGTTLK
SEQ ID NO:42
VoDH1
MSTASSGDVSLLSQRLVGKVALITGGATGIGESIARLFYRHGAKVCIVDIQDNPGQNLCRELGTDDACFFHCDVSIEIDVIRAVDFVVNRFGKLDIMVNNAGIADPPCPDIRNTDLSIFEKVFDVNVKGTFQCMKHAARVMVPQKKGSIISLTSVASVIGGAGPHAYTGSKHAVLGLTKSVAAELGLHGIRVNCVSPYAVPTGMPLAHLPESEKTEDAMMGMRAFVGRNANLQGIELTVDDVANSVVFLASDEARYVSGLNLMLDGGFSCVNHSLRVFR
SEQ ID NO:43
葡萄VvDH
MAATSIDNSPLPSQRLLGKVALVTGGATGIGESIVRLFLKQGAKVCIVDVQDDLGQKLCDTLGGDPNVSFFHCDVTIEDDVCHAVDFTVTKFGTLDIMVNNAGMAGPPCSDIRNVEVSMFEKVFDVNVKGVFLGMKHAARIMIPLKKGTIISLCSVSSAIAGVGPHAYTGSKCAVAGLTQSVAAEMGGHGIRVNCISPYAIATGLALAHLPEDERTEDAMAGFRAFVGKNANLQGVELTVDDVAHAAVFLASDEARYISGLNLMLDGGFSCTNHSLRVFR
SEQ ID NO:44
红球姜(Zingiber zerumbet)ZzSDR
MRLEGKVALVTGGASGIGESIARLFIEHGAKICIVDVQDELGQQVSQRLGGDPHACYFHCDVTVEDDVRRAVDFTAEKYGTIDIMVNNAGITGDKVIDIRDADFNEFKKVFDINVNGVFLGMKHAARIMIPKMKGSIVSLASVSSVIAGAGPHGYTGAKHAVVGLTKSVAAELGRHGIRVNCVSPYAVPTRLSMPYLPESEMQEDALRGFLTFVRSNANLKGVDLMPNDVAEAVLYLATEESKYVSGLNLVIDGGFSIANHTLQVFE
序列表
<110> 马努斯生物合成股份有限公司 (Manus Bio, Inc.)
<120> 莎草薁酮的微生物生产
<130> AJ3171PT2102
<150> US 62/727,815
<151> 2018-09-06
<160> 56
<170> PatentIn version 3.5
<210> 1
<211> 561
<212> PRT
<213> 葡萄(Vitis vinifera)
<400> 1
Met Ser Val Pro Leu Ser Val Ser Val Thr Pro Ile Leu Ser Gln Arg
1 5 10 15
Ile Asp Pro Glu Val Ala Arg His Glu Ala Thr Tyr His Pro Asn Phe
20 25 30
Trp Gly Asp Arg Phe Leu His Tyr Asn Pro Asp Asp Asp Phe Cys Gly
35 40 45
Thr His Ala Cys Lys Glu Gln Gln Ile Gln Glu Leu Lys Glu Glu Val
50 55 60
Arg Lys Ser Leu Glu Ala Thr Ala Gly Asn Thr Ser Gln Leu Leu Lys
65 70 75 80
Leu Ile Asp Ser Ile Gln Arg Leu Gly Leu Ala Tyr His Phe Glu Arg
85 90 95
Glu Ile Glu Glu Ala Leu Lys Ala Met Tyr Gln Thr Tyr Thr Leu Val
100 105 110
Asp Asp Asn Asp His Leu Thr Thr Val Ser Leu Leu Phe Arg Leu Leu
115 120 125
Arg Gln Glu Gly Tyr His Ile Pro Ser Asp Val Phe Lys Lys Phe Met
130 135 140
Asp Glu Gly Gly Asn Phe Lys Glu Ser Leu Val Gly Asp Leu Pro Gly
145 150 155 160
Met Leu Ala Leu Tyr Glu Ala Ala His Leu Met Val His Gly Glu Asp
165 170 175
Ile Leu Asp Glu Ala Leu Gly Phe Thr Thr Ala His Leu Gln Ser Met
180 185 190
Ala Ile Asp Ser Asp Asn Pro Leu Thr Lys Gln Val Ile Arg Ala Leu
195 200 205
Lys Arg Pro Ile Arg Lys Gly Leu Pro Arg Val Glu Ala Arg His Tyr
210 215 220
Ile Thr Ile Tyr Gln Glu Asp Asp Ser His Asn Glu Ser Leu Leu Lys
225 230 235 240
Leu Ala Lys Leu Asp Tyr Asn Met Leu Gln Ser Leu His Arg Lys Glu
245 250 255
Leu Ser Glu Ile Thr Lys Trp Trp Lys Gly Leu Asp Phe Ala Thr Lys
260 265 270
Leu Pro Phe Ala Arg Asp Arg Ile Val Glu Gly Tyr Phe Trp Ile Leu
275 280 285
Gly Val Tyr Phe Glu Pro Gln Tyr Tyr Leu Ala Arg Arg Ile Leu Met
290 295 300
Lys Val Phe Gly Val Leu Ser Ile Val Asp Asp Ile Tyr Asp Ala Tyr
305 310 315 320
Gly Thr Phe Glu Glu Leu Lys Leu Phe Thr Glu Ala Ile Glu Arg Trp
325 330 335
Asp Ala Ser Ser Ile Asp Gln Leu Pro Asp Tyr Met Lys Val Cys Tyr
340 345 350
Gln Ala Leu Leu Asp Val Tyr Glu Glu Met Glu Glu Glu Met Thr Lys
355 360 365
Gln Gly Lys Leu Tyr Arg Val His Tyr Ala Gln Ala Ala Leu Lys Arg
370 375 380
Gln Val Gln Ala Tyr Leu Leu Glu Ala Lys Trp Leu Lys Gln Glu Tyr
385 390 395 400
Ile Pro Thr Met Glu Glu Tyr Met Ser Asn Ala Leu Val Thr Ser Ala
405 410 415
Cys Ser Met Leu Thr Thr Thr Ser Phe Val Gly Met Gly Asp Met Val
420 425 430
Thr Lys Glu Ala Phe Asp Trp Val Phe Ser Asp Pro Lys Met Ile Arg
435 440 445
Ala Ser Asn Val Ile Cys Arg Leu Met Asp Asp Ile Val Ser His Glu
450 455 460
Phe Glu Gln Lys Arg Gly His Val Ala Ser Ala Val Glu Cys Tyr Met
465 470 475 480
Lys Gln Tyr Gly Val Ser Lys Glu Glu Ala Tyr Asp Glu Phe Lys Lys
485 490 495
Gln Val Glu Ser Ala Trp Lys Asp Asn Asn Glu Glu Val Leu Gln Pro
500 505 510
Thr Ala Val Pro Val Pro Leu Leu Thr Arg Val Leu Asn Phe Ser Arg
515 520 525
Met Val Asp Val Leu Tyr Lys Asp Glu Asp Glu Tyr Thr Leu Val Gly
530 535 540
Pro Leu Met Lys Asp Leu Val Ala Gly Met Leu Ile Asp Pro Val Pro
545 550 555 560
Met
<210> 2
<211> 552
<212> PRT
<213> 广藿香(Pogostemnon cablin)
<400> 2
Met Glu Leu Tyr Ala Gln Ser Val Gly Val Gly Ala Ala Ser Arg Pro
1 5 10 15
Leu Ala Asn Phe His Pro Cys Val Trp Gly Asp Lys Phe Ile Val Tyr
20 25 30
Asn Pro Gln Ser Cys Gln Ala Gly Glu Arg Glu Glu Ala Glu Glu Leu
35 40 45
Lys Val Glu Leu Lys Arg Glu Leu Lys Glu Ala Ser Asp Asn Tyr Met
50 55 60
Arg Gln Leu Lys Met Val Asp Ala Ile Gln Arg Leu Gly Ile Asp Tyr
65 70 75 80
Leu Phe Val Glu Asp Val Asp Glu Ala Leu Lys Asn Leu Phe Glu Met
85 90 95
Phe Asp Ala Phe Cys Lys Asn Asn His Asp Met His Ala Thr Ala Leu
100 105 110
Ser Phe Arg Leu Leu Arg Gln His Gly Tyr Arg Val Ser Cys Glu Val
115 120 125
Phe Glu Lys Phe Lys Asp Gly Lys Asp Gly Phe Lys Val Pro Asn Glu
130 135 140
Asp Gly Ala Val Ala Val Leu Glu Phe Phe Glu Ala Thr His Leu Arg
145 150 155 160
Val His Gly Glu Asp Val Leu Asp Asn Ala Phe Asp Phe Thr Arg Asn
165 170 175
Tyr Leu Glu Ser Val Tyr Ala Thr Leu Asn Asp Pro Thr Ala Lys Gln
180 185 190
Val His Asn Ala Leu Asn Glu Phe Ser Phe Arg Arg Gly Leu Pro Arg
195 200 205
Val Glu Ala Arg Lys Tyr Ile Ser Ile Tyr Glu Gln Tyr Ala Ser His
210 215 220
His Lys Gly Leu Leu Lys Leu Ala Lys Leu Asp Phe Asn Leu Val Gln
225 230 235 240
Ala Leu His Arg Arg Glu Leu Ser Glu Asp Ser Arg Trp Trp Lys Thr
245 250 255
Leu Gln Val Pro Thr Lys Leu Ser Phe Val Arg Asp Arg Leu Val Glu
260 265 270
Ser Tyr Phe Trp Ala Ser Gly Ser Tyr Phe Glu Pro Asn Tyr Ser Val
275 280 285
Ala Arg Met Ile Leu Ala Lys Gly Leu Ala Val Leu Ser Leu Met Asp
290 295 300
Asp Val Tyr Asp Ala Tyr Gly Thr Phe Glu Glu Leu Gln Met Phe Thr
305 310 315 320
Asp Ala Ile Glu Arg Trp Asp Ala Ser Cys Leu Asp Lys Leu Pro Asp
325 330 335
Tyr Met Lys Ile Val Tyr Lys Ala Leu Leu Asp Val Phe Glu Glu Val
340 345 350
Asp Glu Glu Leu Ile Lys Leu Gly Ala Pro Tyr Arg Ala Tyr Tyr Gly
355 360 365
Lys Glu Ala Met Lys Tyr Ala Ala Arg Ala Tyr Met Glu Glu Ala Gln
370 375 380
Trp Arg Glu Gln Lys His Lys Pro Thr Thr Lys Glu Tyr Met Lys Leu
385 390 395 400
Ala Thr Lys Thr Cys Gly Tyr Ile Thr Leu Ile Ile Leu Ser Cys Leu
405 410 415
Gly Val Glu Glu Gly Ile Val Thr Lys Glu Ala Phe Asp Trp Val Phe
420 425 430
Ser Arg Pro Pro Phe Ile Glu Ala Thr Leu Ile Ile Ala Arg Leu Val
435 440 445
Asn Asp Ile Thr Gly His Glu Phe Glu Lys Lys Arg Glu His Val Arg
450 455 460
Thr Ala Val Glu Cys Tyr Met Glu Glu His Lys Val Gly Lys Gln Glu
465 470 475 480
Val Val Ser Glu Phe Tyr Asn Gln Met Glu Ser Ala Trp Lys Asp Ile
485 490 495
Asn Glu Gly Phe Leu Arg Pro Val Glu Phe Pro Ile Pro Leu Leu Tyr
500 505 510
Leu Ile Leu Asn Ser Val Arg Thr Leu Glu Val Ile Tyr Lys Glu Gly
515 520 525
Asp Ser Tyr Thr His Val Gly Pro Ala Met Gln Asn Ile Ile Lys Gln
530 535 540
Leu Tyr Leu His Pro Val Pro Tyr
545 550
<210> 3
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 3
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Asn Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Arg
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Glu Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Phe Gly Val Ser Glu Glu Gln Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 4
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 4
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Pro Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Arg Leu Glu Ala
165 170 175
Leu Leu Pro Lys Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Gln Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Val Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Ile Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Phe Gly Val Ser Glu Glu Gln Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 5
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 5
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asp Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Asp Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Arg
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Ile Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Asp Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Val Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 6
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 6
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Trp
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Phe Gly Val Ser Glu Glu Gln Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 7
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 7
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Phe Gly Val Ser Glu Glu Gln Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 8
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 8
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Arg
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Phe Gly Val Ser Glu Glu Gln Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 9
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 9
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Asn Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Arg Leu Glu Ala
165 170 175
Leu Leu Pro Lys Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Gln Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Ile Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Asp Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Val Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 10
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 10
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ile Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Leu Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 11
<211> 547
<212> PRT
<213> 柯拉斯那沉香(Aquilaria crassna)
<400> 11
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Asn Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Arg Leu Glu Ala
165 170 175
Leu Leu Pro Lys Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Gln Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Ile Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Gln Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Asp Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Val Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 12
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 12
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Asn Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Lys Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Leu Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Lys Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Gln Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Lys Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Lys Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asn Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 13
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 13
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Asn Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Thr Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Leu Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Ala Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Gly Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 14
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 14
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn His Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Lys Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Leu Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Ala Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 15
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 15
Met Ser Ser Ala Lys Leu Gly Ser Ala Pro Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn His Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Lys Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu Gln
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Leu Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Ala Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Thr Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 16
<211> 538
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 16
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asp Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn His Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Lys Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Ser Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Asn Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Ser Gln Tyr Asp Ala Glu Leu Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Ala Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala
530 535
<210> 17
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 17
Met Ser Ser Ala Lys Leu Gly Ser Thr Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Lys Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Val Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Ser Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Lys Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Lys Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 18
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 18
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Lys Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Val Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Lys Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Lys Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 19
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 19
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Ser Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Thr Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Lys Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Val Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Lys Leu Glu Ala
165 170 175
Leu Leu Pro Glu Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Pro Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu His Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Tyr Gly Val Ser Glu Glu Glu Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 20
<211> 547
<212> PRT
<213> 沉香属(Aquilaria spp.)
<400> 20
Met Ser Ser Ala Lys Leu Gly Ser Ala Ser Glu Asp Val Ser Arg Arg
1 5 10 15
Asp Ala Asn Tyr His Pro Thr Val Trp Gly Asp Phe Phe Leu Thr His
20 25 30
Ser Ser Asn Phe Leu Glu Asn Asn Asp Asn Ile Leu Glu Lys His Glu
35 40 45
Glu Leu Lys Gln Glu Val Arg Asn Leu Leu Val Val Glu Thr Ser Asp
50 55 60
Leu Pro Ser Lys Ile Gln Leu Thr Asp Glu Ile Ile Arg Leu Gly Val
65 70 75 80
Gly Tyr His Phe Glu Met Glu Ile Lys Ala Gln Leu Glu Lys Leu His
85 90 95
Asp His Gln Leu His Leu Asn Phe Asp Leu Leu Thr Thr Ser Val Trp
100 105 110
Phe Arg Leu Leu Arg Gly His Gly Phe Ser Ile Ser Ser Asp Val Phe
115 120 125
Lys Arg Phe Lys Asn Thr Lys Gly Glu Phe Glu Thr Glu Asp Ala Arg
130 135 140
Thr Leu Trp Cys Leu Tyr Glu Ala Thr His Leu Arg Val Asp Gly Glu
145 150 155 160
Asp Ile Leu Glu Glu Ala Ile Gln Phe Ser Arg Lys Arg Leu Glu Ala
165 170 175
Leu Leu Pro Lys Leu Ser Phe Pro Leu Ser Glu Cys Val Arg Asp Ala
180 185 190
Leu His Ile Pro Tyr His Arg Asn Val Gln Arg Leu Ala Ala Arg Gln
195 200 205
Tyr Ile Pro Gln Tyr Asp Ala Glu Gln Thr Lys Ile Glu Ser Leu Ser
210 215 220
Leu Phe Ala Lys Ile Asp Phe Asn Met Leu Gln Ala Leu Arg Gln Ser
225 230 235 240
Glu Leu Arg Glu Ala Ser Arg Trp Trp Lys Glu Phe Asp Phe Pro Ser
245 250 255
Lys Leu Pro Tyr Ala Arg Asp Arg Ile Ala Glu Gly Tyr Tyr Trp Met
260 265 270
Met Gly Ala His Phe Glu Pro Lys Phe Ser Leu Ser Arg Lys Phe Leu
275 280 285
Asn Arg Ile Ile Gly Ile Thr Ser Leu Ile Asp Asp Thr Tyr Asp Val
290 295 300
Tyr Gly Thr Leu Glu Glu Val Thr Leu Phe Thr Glu Ala Val Glu Arg
305 310 315 320
Trp Asp Ile Glu Ala Val Lys Asp Ile Pro Lys Tyr Met Gln Val Ile
325 330 335
Tyr Thr Gly Met Leu Gly Ile Phe Glu Asp Phe Lys Asp Asn Leu Ile
340 345 350
Asn Ala Arg Gly Lys Asp Tyr Cys Ile Asp Tyr Ala Ile Glu Val Phe
355 360 365
Lys Glu Ile Val Arg Ser Tyr Gln Arg Glu Ala Glu Tyr Phe His Thr
370 375 380
Gly Tyr Val Pro Ser Tyr Asp Glu Tyr Met Glu Asn Ser Ile Ile Ser
385 390 395 400
Gly Gly Tyr Lys Met Phe Ile Ile Leu Met Leu Ile Gly Arg Gly Glu
405 410 415
Phe Glu Leu Lys Glu Thr Leu Asp Trp Ala Ser Thr Ile Pro Glu Met
420 425 430
Val Lys Ala Ser Ser Leu Ile Ala Arg Tyr Ile Asp Asp Leu Gln Thr
435 440 445
Tyr Lys Ala Glu Glu Glu Arg Gly Glu Thr Val Ser Ala Val Arg Cys
450 455 460
Tyr Met Arg Glu Phe Gly Val Ser Glu Glu Gln Ala Cys Lys Lys Met
465 470 475 480
Arg Glu Met Ile Glu Ile Glu Trp Lys Arg Leu Asn Lys Thr Thr Leu
485 490 495
Glu Ala Asp Glu Ile Ser Ser Ser Val Val Ile Pro Ser Leu Asn Phe
500 505 510
Thr Arg Val Leu Glu Val Met Tyr Asp Lys Gly Asp Gly Tyr Ser Asp
515 520 525
Ser Gln Gly Val Thr Lys Asp Arg Ile Ala Ala Leu Leu Arg His Ala
530 535 540
Ile Glu Ile
545
<210> 21
<211> 561
<212> PRT
<213> 葡萄(Vitis vinifera)
<400> 21
Met Ser Val Pro Leu Ser Val Ser Val Thr Pro Ile Leu Ser Gln Arg
1 5 10 15
Ile Asp Pro Glu Val Ala Arg His Glu Ala Thr Tyr His Pro Asn Phe
20 25 30
Trp Gly Asp Arg Phe Leu His Tyr Asn Pro Asp Asp Asp Phe Cys Gly
35 40 45
Thr His Ala Cys Lys Glu Gln Gln Ile Gln Glu Leu Lys Glu Glu Val
50 55 60
Arg Lys Ser Leu Glu Ala Thr Ala Gly Asn Thr Ser Gln Leu Leu Lys
65 70 75 80
Leu Ile Asp Ser Ile Gln Arg Leu Gly Leu Ala Tyr His Phe Glu Arg
85 90 95
Glu Ile Glu Glu Ala Leu Lys Ala Met Tyr Gln Thr Tyr Thr Leu Val
100 105 110
Asp Asp Asn Asp His Leu Thr Thr Val Ser Leu Leu Phe Arg Leu Leu
115 120 125
Arg Gln Glu Gly Tyr His Ile Pro Ser Asp Val Phe Lys Lys Phe Met
130 135 140
Asp Glu Gly Gly Asn Phe Lys Glu Ser Leu Val Gly Asp Leu Pro Gly
145 150 155 160
Met Leu Ala Leu Tyr Glu Ala Ala His Leu Met Val His Gly Glu Asp
165 170 175
Ile Leu Asp Glu Ala Leu Gly Phe Thr Thr Ala His Leu Gln Ser Met
180 185 190
Ala Ile Asp Ser Asp Asn Pro Leu Thr Lys Gln Val Ile Arg Ala Leu
195 200 205
Lys Arg Pro Ile Arg Lys Gly Leu Pro Arg Val Glu Ala Arg His Tyr
210 215 220
Ile Thr Ile Tyr Gln Glu Asp Asp Ser His Asn Glu Ser Leu Leu Lys
225 230 235 240
Leu Ala Lys Leu Asp Tyr Asn Met Leu Gln Ser Leu His Arg Lys Glu
245 250 255
Leu Ser Glu Ile Thr Lys Trp Trp Lys Gly Leu Asp Phe Ala Thr Lys
260 265 270
Leu Pro Phe Ala Arg Asp Arg Ile Val Glu Gly Tyr Phe Trp Ile Leu
275 280 285
Gly Val Tyr Phe Glu Pro Gln Tyr Tyr Leu Ala Arg Arg Ile Leu Met
290 295 300
Lys Val Phe Gly Val Leu Ser Ile Val Asp Asp Ile Tyr Asp Ala Tyr
305 310 315 320
Gly Thr Phe Glu Glu Leu Lys Leu Phe Thr Glu Ala Ile Glu Arg Trp
325 330 335
Asp Ala Ser Ser Ile Asp Gln Leu Pro Asp Tyr Met Lys Val Cys Tyr
340 345 350
Gln Ala Leu Leu Asp Val Tyr Glu Glu Met Glu Glu Glu Met Thr Lys
355 360 365
Gln Gly Lys Leu Tyr Arg Val His Tyr Ala Gln Ala Ala Leu Lys Arg
370 375 380
Gln Val Gln Ala Tyr Leu Leu Glu Ala Lys Trp Leu Lys Gln Glu Tyr
385 390 395 400
Ile Pro Arg Met Asp Glu Tyr Met Ser Asn Ala Leu Val Ser Ser Ala
405 410 415
Cys Ser Met Leu Thr Thr Thr Ser Phe Val Gly Met Gly Asp Ile Val
420 425 430
Thr Lys Glu Ala Phe Asp Trp Val Phe Ser Asp Pro Lys Met Ile Arg
435 440 445
Ala Ser Asn Val Ile Cys Arg Leu Met Asp Asp Ile Val Ser His Glu
450 455 460
Phe Glu Gln Lys Arg Gly His Val Ala Ser Ala Val Glu Cys Tyr Met
465 470 475 480
Lys Gln Tyr Gly Val Ser Lys Glu Glu Ala Tyr Asp Glu Phe Lys Lys
485 490 495
Gln Val Glu Ser Ala Trp Lys Asp Asn Asn Glu Glu Phe Leu Gln Pro
500 505 510
Thr Ala Val Pro Val Pro Leu Leu Thr Arg Val Leu Asn Phe Ser Arg
515 520 525
Met Met Asp Val Leu Tyr Lys Asp Glu Asp Glu Tyr Thr Leu Val Gly
530 535 540
Pro Leu Met Lys Asp Leu Val Ala Gly Met Leu Ile Asp Pro Val Pro
545 550 555 560
Met
<210> 22
<211> 508
<212> PRT
<213> 葡萄(Vitis vinifera)
<400> 22
Met Glu Leu Gln Phe Ser Phe Phe Pro Ile Leu Cys Thr Phe Leu Leu
1 5 10 15
Phe Ile Tyr Leu Leu Lys Arg Leu Gly Lys Pro Ser Arg Thr Asn His
20 25 30
Pro Ala Pro Lys Leu Pro Pro Gly Pro Trp Lys Leu Pro Ile Ile Gly
35 40 45
Asn Met His Gln Leu Val Gly Ser Leu Pro His Arg Ser Leu Arg Ser
50 55 60
Leu Ala Lys Lys His Gly Pro Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Ala Ile Val Val Ser Ser Arg Glu Met Ala Lys Glu Val Met Lys
85 90 95
Thr His Asp Ile Ile Phe Ser Gln Arg Pro Cys Ile Leu Ala Ala Ser
100 105 110
Ile Val Ser Tyr Asp Cys Thr Asp Ile Ala Phe Ala Pro Tyr Gly Gly
115 120 125
Tyr Trp Arg Gln Ile Arg Lys Ile Ser Val Leu Glu Leu Leu Ser Ala
130 135 140
Lys Arg Val Gln Ser Phe Arg Ser Val Arg Glu Glu Glu Val Leu Asn
145 150 155 160
Leu Val Arg Ser Val Ser Leu Gln Glu Gly Val Leu Ile Asn Leu Thr
165 170 175
Lys Ser Ile Phe Ser Leu Thr Phe Ser Ile Ile Ser Arg Thr Ala Phe
180 185 190
Gly Lys Lys Cys Lys Asp Gln Glu Ala Phe Ser Val Thr Leu Asp Lys
195 200 205
Phe Ala Asp Ser Ala Gly Gly Phe Thr Ile Ala Asp Val Phe Pro Ser
210 215 220
Ile Lys Leu Leu His Val Val Ser Gly Met Arg Arg Lys Leu Glu Lys
225 230 235 240
Val His Lys Lys Leu Asp Arg Ile Leu Gly Asn Ile Ile Asn Glu His
245 250 255
Lys Ala Arg Ser Ala Ala Lys Glu Thr Cys Glu Ala Glu Val Asp Asp
260 265 270
Asp Leu Val Asp Val Leu Leu Lys Val Gln Lys Gln Gly Asp Leu Glu
275 280 285
Phe Pro Leu Thr Met Asp Asn Ile Lys Ala Val Leu Leu Asp Leu Phe
290 295 300
Val Ala Gly Thr Glu Thr Ser Ser Thr Ala Val Glu Trp Ala Met Ala
305 310 315 320
Glu Met Leu Lys Asn Pro Arg Val Met Ala Lys Ala Gln Ala Glu Val
325 330 335
Arg Asp Ile Phe Ser Arg Lys Gly Asn Ala Asp Glu Thr Val Val Arg
340 345 350
Glu Leu Lys Phe Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu His
355 360 365
Pro Pro Val Pro Leu Leu Ile Pro Arg Glu Ser Arg Glu Arg Cys Ala
370 375 380
Ile Asn Gly Tyr Glu Ile Pro Val Lys Thr Arg Val Ile Ile Asn Ala
385 390 395 400
Trp Ala Ile Ala Arg Asp Pro Lys Tyr Trp Thr Asp Ala Glu Ser Phe
405 410 415
Asn Pro Glu Arg Phe Leu Asp Ser Ser Ile Asp Tyr Gln Gly Thr Asn
420 425 430
Phe Glu Tyr Ile Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Ile
435 440 445
Leu Phe Gly Met Ala Asn Val Glu Leu Ala Leu Ala Gln Leu Leu Tyr
450 455 460
His Phe Asp Trp Lys Leu Pro Asn Gly Ala Arg His Glu Glu Leu Asp
465 470 475 480
Met Thr Glu Gly Phe Arg Thr Ser Thr Lys Arg Lys Gln Asp Leu Tyr
485 490 495
Leu Ile Pro Ile Thr Tyr Arg Pro Leu Pro Val Glu
500 505
<210> 23
<211> 417
<212> PRT
<213> 枯草芽孢杆菌(Bacillus subtilis)
<400> 23
Met Asn Glu Gln Ile Pro His Asp Lys Ser Leu Asp Asn Ser Leu Thr
1 5 10 15
Leu Leu Lys Glu Gly Tyr Leu Phe Ile Lys Asn Arg Thr Glu Arg Tyr
20 25 30
Asn Ser Asp Leu Phe Gln Ala Arg Leu Leu Gly Lys Asn Phe Ile Cys
35 40 45
Met Thr Gly Ala Glu Ala Ala Lys Val Phe Tyr Asp Thr Asp Arg Phe
50 55 60
Gln Arg Gln Asn Ala Leu Pro Lys Arg Val Gln Lys Ser Leu Phe Gly
65 70 75 80
Val Asn Ala Ile Gln Gly Met Asp Gly Ser Ala His Ile His Arg Lys
85 90 95
Met Leu Phe Leu Ser Leu Met Thr Pro Pro His Gln Lys Arg Leu Ala
100 105 110
Glu Leu Met Thr Glu Glu Trp Lys Ala Ala Val Thr Arg Trp Glu Lys
115 120 125
Ala Asp Glu Val Val Leu Phe Glu Glu Ala Lys Glu Ile Leu Cys Arg
130 135 140
Val Ala Cys Tyr Trp Ala Gly Val Pro Leu Lys Glu Thr Glu Val Lys
145 150 155 160
Glu Arg Ala Asp Asp Phe Ile Asp Met Val Asp Ala Phe Gly Ala Val
165 170 175
Gly Pro Arg His Trp Lys Gly Arg Arg Ala Arg Pro Arg Ala Glu Glu
180 185 190
Trp Ile Glu Val Met Ile Glu Asp Ala Arg Ala Gly Leu Leu Lys Thr
195 200 205
Thr Ser Gly Thr Ala Leu His Glu Met Ala Phe His Thr Gln Glu Asp
210 215 220
Gly Ser Gln Leu Asp Ser Arg Met Ala Ala Ile Glu Leu Ile Asn Val
225 230 235 240
Leu Arg Pro Ile Val Ala Ile Ser Tyr Phe Leu Val Phe Ser Ala Leu
245 250 255
Ala Leu His Glu His Pro Lys Tyr Lys Glu Trp Leu Arg Ser Gly Asn
260 265 270
Ser Arg Glu Arg Glu Met Phe Val Gln Glu Val Arg Arg Tyr Tyr Pro
275 280 285
Phe Gly Pro Phe Leu Gly Ala Leu Val Lys Lys Asp Phe Val Trp Asn
290 295 300
Asn Cys Glu Phe Lys Lys Gly Thr Ser Val Leu Leu Asp Leu Tyr Gly
305 310 315 320
Thr Asn His Asp Pro Arg Leu Trp Asp His Pro Asp Glu Phe Arg Pro
325 330 335
Glu Arg Phe Ala Glu Arg Glu Glu Asn Leu Phe Asp Met Ile Pro Gln
340 345 350
Gly Gly Gly His Ala Glu Lys Gly His Arg Cys Pro Gly Glu Gly Ile
355 360 365
Thr Ile Glu Val Met Lys Ala Ser Leu Asp Phe Leu Val His Gln Ile
370 375 380
Glu Tyr Asp Val Pro Glu Gln Ser Leu His Tyr Ser Leu Ala Arg Met
385 390 395 400
Pro Ser Leu Pro Glu Ser Gly Phe Val Met Ser Gly Ile Arg Arg Lys
405 410 415
Ser
<210> 24
<211> 407
<212> PRT
<213> 枯草芽孢杆菌(Bacillus subtilis)
<400> 24
Met Gln Met Glu Lys Leu Met Phe His Pro His Gly Lys Glu Phe His
1 5 10 15
His Asn Pro Phe Ser Val Leu Gly Arg Phe Arg Glu Glu Glu Pro Ile
20 25 30
His Arg Phe Glu Leu Lys Arg Phe Gly Ala Thr Tyr Pro Ala Trp Leu
35 40 45
Ile Thr Arg Tyr Asp Asp Cys Met Ala Phe Leu Lys Asp Asn Arg Ile
50 55 60
Thr Arg Asp Val Lys Asn Val Met Asn Gln Glu Gln Ile Lys Met Leu
65 70 75 80
Asn Val Ser Glu Asp Ile Asp Phe Val Ser Asp His Met Leu Ala Lys
85 90 95
Asp Thr Pro Asp His Thr Arg Leu Arg Ser Leu Val His Gln Ala Phe
100 105 110
Thr Pro Arg Thr Ile Glu Asn Leu Arg Gly Ser Ile Glu Gln Ile Ala
115 120 125
Glu Gln Leu Leu Asp Glu Met Glu Lys Glu Asn Lys Ala Asp Ile Met
130 135 140
Lys Ser Phe Ala Ser Pro Leu Pro Phe Ile Val Ile Ser Glu Leu Met
145 150 155 160
Gly Ile Pro Lys Glu Asp Arg Ser Gln Phe Gln Ile Trp Thr Asn Ala
165 170 175
Met Val Asp Thr Ser Glu Gly Asn Arg Glu Leu Thr Asn Gln Ala Leu
180 185 190
Arg Glu Phe Lys Asp Tyr Ile Ala Lys Leu Ile His Asp Arg Arg Ile
195 200 205
Lys Pro Lys Asp Asp Leu Ile Ser Lys Leu Val His Ala Glu Glu Asn
210 215 220
Gly Ser Lys Leu Ser Glu Lys Glu Leu Tyr Ser Met Leu Phe Leu Leu
225 230 235 240
Val Val Ala Gly Leu Glu Thr Thr Val Asn Leu Leu Gly Ser Gly Thr
245 250 255
Leu Ala Leu Leu Gln His Lys Lys Glu Cys Glu Lys Leu Lys Gln Gln
260 265 270
Pro Glu Met Ile Ala Thr Ala Val Glu Glu Leu Leu Arg Tyr Thr Ser
275 280 285
Pro Val Val Met Met Ala Asn Arg Trp Ala Ile Glu Asp Phe Thr Tyr
290 295 300
Lys Gly His Ser Ile Lys Arg Gly Asp Met Ile Phe Ile Gly Ile Gly
305 310 315 320
Ser Ala Asn Arg Asp Pro Asn Phe Phe Glu Asn Pro Glu Ile Leu Asn
325 330 335
Ile Asn Arg Ser Pro Asn Arg His Ile Ser Phe Gly Phe Gly Ile His
340 345 350
Phe Cys Leu Gly Ala Pro Leu Ala Arg Leu Glu Gly His Ile Ala Phe
355 360 365
Lys Ala Leu Leu Lys Arg Phe Pro Asp Ile Glu Leu Ala Val Ala Pro
370 375 380
Asp Asp Ile Gln Trp Arg Lys Asn Val Phe Leu Arg Gly Leu Glu Ser
385 390 395 400
Leu Pro Val Ser Leu Ser Lys
405
<210> 25
<211> 404
<212> PRT
<213> 蜡状芽孢杆菌(Bacillus cereus)
<400> 25
Met Ala Ser Pro Glu Asn Val Ile Leu Val His Glu Ile Ser Lys Leu
1 5 10 15
Lys Thr Lys Glu Glu Leu Trp Asn Pro Tyr Glu Trp Tyr Gln Phe Met
20 25 30
Arg Asp Asn His Pro Val His Tyr Asp Glu Glu Gln Asp Val Trp Asn
35 40 45
Val Phe Leu Tyr Glu Asp Val Asn Arg Val Leu Ser Asp Tyr Arg Leu
50 55 60
Phe Ser Ser Arg Arg Glu Arg Arg Gln Phe Ser Ile Pro Pro Leu Glu
65 70 75 80
Thr Arg Ile Asn Ile Asn Ser Thr Asp Pro Pro Glu His Arg Asn Val
85 90 95
Arg Ser Ile Val Ser Lys Ala Phe Thr Pro Arg Ser Leu Glu Gln Trp
100 105 110
Lys Pro Arg Ile Gln Ala Ile Ala Asn Glu Leu Val Gln His Ile Gly
115 120 125
Lys Tyr Ser Glu Val Asn Ile Val Glu Glu Phe Ala Ala Pro Leu Pro
130 135 140
Val Thr Val Ile Ser Asp Leu Leu Gly Val Pro Thr Thr Asp Arg Lys
145 150 155 160
Lys Ile Lys Ala Trp Ser Asp Ile Leu Phe Met Pro Tyr Ser Lys Glu
165 170 175
Lys Phe Asn Asp Leu Asp Val Glu Lys Gly Ile Ala Leu Asn Glu Phe
180 185 190
Lys Ala Tyr Leu Leu Pro Ile Val Gln Glu Lys Arg Tyr His Leu Thr
195 200 205
Asp Asp Ile Ile Ser Asp Leu Ile Arg Ala Glu Tyr Glu Gly Glu Arg
210 215 220
Leu Thr Asp Glu Glu Ile Val Thr Phe Ser Leu Gly Leu Leu Ala Ala
225 230 235 240
Gly Asn Glu Thr Thr Thr Asn Leu Ile Ile Asn Ser Phe Tyr Cys Phe
245 250 255
Leu Val Asp Ser Pro Gly Thr Tyr Lys Glu Leu Arg Glu Glu Pro Thr
260 265 270
Leu Ile Ser Lys Ala Ile Glu Glu Val Leu Arg Tyr Arg Phe Pro Ile
275 280 285
Thr Leu Ala Arg Arg Ile Thr Glu Asp Thr Asn Ile Phe Gly Pro Leu
290 295 300
Met Lys Lys Asp Gln Met Val Val Ala Trp Val Ser Ala Ala Asn Leu
305 310 315 320
Asp Glu Lys Lys Phe Ser Gln Ala Ser Lys Phe Asn Ile His Arg Ile
325 330 335
Gly Asn Glu Lys His Leu Thr Phe Gly Lys Gly Pro His Phe Cys Leu
340 345 350
Gly Ala Pro Leu Ala Arg Leu Glu Ala Glu Ile Ala Leu Thr Thr Phe
355 360 365
Ile Asn Ala Phe Glu Lys Ile Ala Leu Ser Pro Ser Phe Asn Leu Glu
370 375 380
Gln Cys Ile Leu Glu Asn Glu Gln Thr Leu Lys Phe Leu Pro Ile Cys
385 390 395 400
Leu Lys Thr Gln
<210> 26
<211> 410
<212> PRT
<213> 蜡状芽孢杆菌(Bacillus cereus)
<400> 26
Met Lys Lys Leu Thr Phe Asn Asp Leu Asn Ser Pro Glu Thr Met Arg
1 5 10 15
Asn Pro Ile Met Phe Tyr Lys Asn Leu Met Glu Gln Lys Glu Arg Phe
20 25 30
Phe His Ile Asp Asp Phe Tyr Gly Met Gly Gly Ala Trp Val Val Phe
35 40 45
His Tyr Asp Asp Val Val Ala Ile Leu Lys Asp Ser Arg Phe Ile Lys
50 55 60
Asp Leu Arg Lys Phe Thr Pro Pro His Tyr Lys Gln Asn Pro Ile Glu
65 70 75 80
Glu Asn Thr Ala Val Ser Lys Leu Phe Glu Trp Leu Met Asn Met Pro
85 90 95
Asn Met Leu Thr Val Asp Pro Pro Asp His Thr Arg Leu Arg Arg Leu
100 105 110
Val Ser Lys Ser Phe Thr Pro Arg Met Ile Glu Asp Leu Arg Pro Arg
115 120 125
Ile Gln Gln Ile Ala Asp Glu Leu Leu Asp Val Val Gln Glu Gln Arg
130 135 140
Lys Met Glu Ile Ile Ala Asp Phe Ala Tyr Pro Leu Pro Ile Ile Val
145 150 155 160
Ile Ser Glu Met Leu Gly Ile Pro Ala Thr Asp Arg Asn Gln Phe Arg
165 170 175
Ala Trp Thr Gln Glu Leu Met Lys Ala Ser Val Asp Pro Gly Gln Gly
180 185 190
Thr Thr Val Thr Ala Thr Leu Glu Lys Phe Ile Asn Tyr Ile Glu Ile
195 200 205
Leu Phe Asn Glu Lys His Leu Asn Pro Ser Asp Asp Leu Ile Ser Ala
210 215 220
Leu Val Gln Ala Lys Glu Gln Glu Asp Lys Leu Ser Lys Asn Glu Leu
225 230 235 240
Leu Ser Thr Ile Trp Leu Leu Ile Ile Ala Gly His Glu Thr Thr Val
245 250 255
Asn Leu Ile Ser Asn Gly Val Leu Ala Leu Leu Gln His Pro Glu Gln
260 265 270
Met Asn Leu Leu Arg Gln Asp Pro Ser Leu Leu Ala Ser Ala Val Asp
275 280 285
Glu Leu Leu Arg Tyr Ala Gly Pro Ile Met Phe Ser Ser Arg Phe Ala
290 295 300
Ser Glu Asp Val Thr Ile His Gly Asn Arg Ile Arg Lys Gly Glu Leu
305 310 315 320
Val Leu Leu Ser Leu Thr Ala Ala Asn Ile Asp Pro Asn Ile Phe Pro
325 330 335
Tyr Pro Glu Glu Leu Asn Ile Ser Arg Glu Glu Asn Asn His Leu Ala
340 345 350
Phe Gly Ala Gly Ile His Gln Cys Leu Gly Ala Pro Leu Ala Arg Leu
355 360 365
Glu Gly Gln Ile Ala Leu Asp Thr Leu Leu Lys Arg Leu Pro Asn Leu
370 375 380
Arg Leu Ala Ile Glu Ala Asp Gln Leu Ile Tyr Asn His Ser Lys Ile
385 390 395 400
Arg Ser Leu Ala Ser Leu Pro Val Ile Phe
405 410
<210> 27
<211> 711
<212> PRT
<213> 甜叶菊(Stevia rebaudiana)
<400> 27
Met Ala Gln Ser Asp Ser Val Lys Val Ser Pro Phe Asp Leu Val Ser
1 5 10 15
Ala Ala Met Asn Gly Lys Ala Met Glu Lys Leu Asn Ala Ser Glu Ser
20 25 30
Glu Asp Pro Thr Thr Leu Pro Ala Leu Lys Met Leu Val Glu Asn Arg
35 40 45
Glu Leu Leu Thr Leu Phe Thr Thr Ser Phe Ala Val Leu Ile Gly Cys
50 55 60
Leu Val Phe Leu Met Trp Arg Arg Ser Ser Ser Lys Lys Leu Val Gln
65 70 75 80
Asp Pro Val Pro Gln Val Ile Val Val Lys Lys Lys Glu Lys Glu Ser
85 90 95
Glu Val Asp Asp Gly Lys Lys Lys Val Ser Ile Phe Tyr Gly Thr Gln
100 105 110
Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys
115 120 125
Val Arg Tyr Glu Lys Thr Ser Phe Lys Val Ile Asp Leu Asp Asp Tyr
130 135 140
Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu
145 150 155 160
Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn
165 170 175
Ala Ala Asn Phe Tyr Lys Trp Phe Thr Glu Gly Asp Asp Lys Gly Glu
180 185 190
Trp Leu Lys Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln
195 200 205
Tyr Glu His Phe Asn Lys Ile Ala Ile Val Val Asp Asp Lys Leu Thr
210 215 220
Glu Met Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp
225 230 235 240
Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro
245 250 255
Glu Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Thr Thr
260 265 270
Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Val Tyr His Asp Lys
275 280 285
Pro Ala Asp Ser Tyr Ala Glu Asp Gln Thr His Thr Asn Gly His Val
290 295 300
Val His Asp Ala Gln His Pro Ser Arg Ser Asn Val Ala Phe Lys Lys
305 310 315 320
Glu Leu His Thr Ser Gln Ser Asp Arg Ser Cys Thr His Leu Glu Phe
325 330 335
Asp Ile Ser His Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly
340 345 350
Val Tyr Ser Glu Asn Leu Ser Glu Val Val Asp Glu Ala Leu Lys Leu
355 360 365
Leu Gly Leu Ser Pro Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu
370 375 380
Asp Gly Thr Pro Ile Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro
385 390 395 400
Cys Thr Leu Arg Asp Ala Leu Thr Arg Tyr Ala Asp Val Leu Ser Ser
405 410 415
Pro Lys Lys Val Ala Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro
420 425 430
Ser Glu Ala Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp
435 440 445
Glu Tyr Ala Gln Trp Ile Val Ala Asn Gln Arg Ser Leu Leu Glu Val
450 455 460
Met Gln Ser Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala
465 470 475 480
Ala Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser
485 490 495
Pro Lys Met Ser Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr
500 505 510
Glu Thr Thr Pro Ala Gly Arg Ile His Arg Gly Leu Cys Ser Thr Trp
515 520 525
Met Lys Asn Ala Val Pro Leu Thr Glu Ser Pro Asp Cys Ser Gln Ala
530 535 540
Ser Ile Phe Val Arg Thr Ser Asn Phe Arg Leu Pro Val Asp Pro Lys
545 550 555 560
Val Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg
565 570 575
Gly Phe Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Thr Glu Leu
580 585 590
Gly Ser Ser Ile Phe Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe
595 600 605
Ile Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser
610 615 620
Glu Leu Ile Val Ala Phe Ser Arg Glu Gly Thr Ala Lys Glu Tyr Val
625 630 635 640
Gln His Lys Met Ser Gln Lys Ala Ser Asp Ile Trp Lys Leu Leu Ser
645 650 655
Glu Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys
660 665 670
Asp Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu
675 680 685
Asp Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly
690 695 700
Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 28
<211> 693
<212> PRT
<213> 拟南芥(Arabidopsis thaliana)
<400> 28
Met Ala Thr Ser Ala Leu Tyr Ala Ser Asp Leu Phe Lys Gln Leu Lys
1 5 10 15
Ser Ile Met Gly Thr Asp Ser Leu Ser Asp Asp Val Val Leu Val Ile
20 25 30
Ala Thr Thr Ser Leu Ala Leu Val Ala Gly Phe Val Val Leu Leu Trp
35 40 45
Lys Lys Thr Thr Ala Asp Arg Ser Gly Glu Leu Lys Pro Leu Met Ile
50 55 60
Pro Lys Ser Leu Met Ala Lys Asp Glu Asp Asp Asp Leu Asp Leu Gly
65 70 75 80
Ser Gly Lys Thr Arg Val Ser Ile Phe Phe Gly Thr Gln Thr Gly Thr
85 90 95
Ala Glu Gly Phe Ala Lys Ala Leu Ser Glu Glu Ile Lys Ala Arg Tyr
100 105 110
Glu Lys Ala Ala Val Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp
115 120 125
Asp Asp Gln Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe
130 135 140
Cys Val Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg
145 150 155 160
Phe Tyr Lys Trp Phe Thr Glu Glu Asn Glu Arg Asp Ile Lys Leu Gln
165 170 175
Gln Leu Ala Tyr Gly Val Phe Ala Leu Gly Asn Arg Gln Tyr Glu His
180 185 190
Phe Asn Lys Ile Gly Ile Val Leu Asp Glu Glu Leu Cys Lys Lys Gly
195 200 205
Ala Lys Arg Leu Ile Glu Val Gly Leu Gly Asp Asp Asp Gln Ser Ile
210 215 220
Glu Asp Asp Phe Asn Ala Trp Lys Glu Ser Leu Trp Ser Glu Leu Asp
225 230 235 240
Lys Leu Leu Lys Asp Glu Asp Asp Lys Ser Val Ala Thr Pro Tyr Thr
245 250 255
Ala Val Ile Pro Glu Tyr Arg Val Val Thr His Asp Pro Arg Phe Thr
260 265 270
Thr Gln Lys Ser Met Glu Ser Asn Val Ala Asn Gly Asn Thr Thr Ile
275 280 285
Asp Ile His His Pro Cys Arg Val Asp Val Ala Val Gln Lys Glu Leu
290 295 300
His Thr His Glu Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile
305 310 315 320
Ser Arg Thr Gly Ile Thr Tyr Glu Thr Gly Asp His Val Gly Val Tyr
325 330 335
Ala Glu Asn His Val Glu Ile Val Glu Glu Ala Gly Lys Leu Leu Gly
340 345 350
His Ser Leu Asp Leu Val Phe Ser Ile His Ala Asp Lys Glu Asp Gly
355 360 365
Ser Pro Leu Glu Ser Ala Val Pro Pro Pro Phe Pro Gly Pro Cys Thr
370 375 380
Leu Gly Thr Gly Leu Ala Arg Tyr Ala Asp Leu Leu Asn Pro Pro Arg
385 390 395 400
Lys Ser Ala Leu Val Ala Leu Ala Ala Tyr Ala Thr Glu Pro Ser Glu
405 410 415
Ala Glu Lys Leu Lys His Leu Thr Ser Pro Asp Gly Lys Asp Glu Tyr
420 425 430
Ser Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala
435 440 445
Ala Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala Ile
450 455 460
Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg
465 470 475 480
Leu Ala Pro Ser Arg Val His Val Thr Ser Ala Leu Val Tyr Gly Pro
485 490 495
Thr Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys
500 505 510
Asn Ala Val Pro Ala Glu Lys Ser His Glu Cys Ser Gly Ala Pro Ile
515 520 525
Phe Ile Arg Ala Ser Asn Phe Lys Leu Pro Ser Asn Pro Ser Thr Pro
530 535 540
Ile Val Met Val Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe
545 550 555 560
Leu Gln Glu Arg Met Ala Leu Lys Glu Asp Gly Glu Glu Leu Gly Ser
565 570 575
Ser Leu Leu Phe Phe Gly Cys Arg Asn Arg Gln Met Asp Phe Ile Tyr
580 585 590
Glu Asp Glu Leu Asn Asn Phe Val Asp Gln Gly Val Ile Ser Glu Leu
595 600 605
Ile Met Ala Phe Ser Arg Glu Gly Ala Gln Lys Glu Tyr Val Gln His
610 615 620
Lys Met Met Glu Lys Ala Ala Gln Val Trp Asp Leu Ile Lys Glu Glu
625 630 635 640
Gly Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val
645 650 655
His Arg Thr Leu His Thr Ile Val Gln Glu Gln Glu Gly Val Ser Ser
660 665 670
Ser Glu Ala Glu Ala Ile Val Lys Lys Leu Gln Thr Glu Gly Arg Tyr
675 680 685
Leu Arg Asp Val Trp
690
<210> 29
<211> 712
<212> PRT
<213> 拟南芥(Arabidopsis thaliana)
<400> 29
Met Ala Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met
1 5 10 15
Ala Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn
20 25 30
Ala Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile
35 40 45
Glu Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu
50 55 60
Ile Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn
65 70 75 80
Ser Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu
85 90 95
Glu Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr
100 105 110
Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala
115 120 125
Lys Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp
130 135 140
Tyr Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp
145 150 155 160
Val Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp
165 170 175
Asn Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly
180 185 190
Glu Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg
195 200 205
Gln Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu
210 215 220
Val Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp
225 230 235 240
Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp
245 250 255
Pro Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala
260 265 270
Thr Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp
275 280 285
Ser Glu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly
290 295 300
Tyr Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val
305 310 315 320
Lys Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu
325 330 335
Glu Phe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His
340 345 350
Val Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu
355 360 365
Arg Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu
370 375 380
Lys Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro
385 390 395 400
Pro Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser
405 410 415
Ser Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp
420 425 430
Pro Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys
435 440 445
Asp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu
450 455 460
Val Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe
465 470 475 480
Ala Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser
485 490 495
Ser Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val
500 505 510
Tyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr
515 520 525
Trp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser Ser
530 535 540
Ala Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp Ser
545 550 555 560
Lys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe
565 570 575
Arg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val Glu
580 585 590
Leu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met Asp
595 600 605
Phe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala Leu
610 615 620
Ala Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr
625 630 635 640
Val Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met Ile
645 650 655
Ser Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala
660 665 670
Arg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly Ser
675 680 685
Met Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr Ser
690 695 700
Gly Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 30
<211> 713
<212> PRT
<213> 拟南芥(Arabidopsis thaliana)
<400> 30
Met Ala Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met
1 5 10 15
Ala Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn
20 25 30
Ala Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile
35 40 45
Glu Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu
50 55 60
Ile Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn
65 70 75 80
Ser Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu
85 90 95
Glu Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr
100 105 110
Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala
115 120 125
Lys Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp
130 135 140
Tyr Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp
145 150 155 160
Val Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp
165 170 175
Asn Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly
180 185 190
Glu Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg
195 200 205
Gln Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu
210 215 220
Val Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp
225 230 235 240
Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp
245 250 255
Pro Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala
260 265 270
Thr Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp
275 280 285
Ser Glu Asp Ala Lys Phe Asn Asp Ile Thr Leu Ala Asn Gly Asn Gly
290 295 300
Tyr Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val
305 310 315 320
Lys Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu
325 330 335
Glu Phe Asp Ile Ala Gly Ser Gly Leu Thr Met Lys Leu Gly Asp His
340 345 350
Val Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu
355 360 365
Arg Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu
370 375 380
Lys Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro
385 390 395 400
Pro Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser
405 410 415
Ser Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp
420 425 430
Pro Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys
435 440 445
Asp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu
450 455 460
Val Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe
465 470 475 480
Ala Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser
485 490 495
Ser Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val
500 505 510
Tyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr
515 520 525
Trp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Lys Leu Phe Leu
530 535 540
Gly Arg Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp
545 550 555 560
Ser Lys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro
565 570 575
Phe Arg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val
580 585 590
Glu Leu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met
595 600 605
Asp Phe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala
610 615 620
Leu Ala Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu
625 630 635 640
Tyr Val Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met
645 650 655
Ile Ser Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met
660 665 670
Ala Arg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly
675 680 685
Ser Met Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr
690 695 700
Ser Gly Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 31
<400> 31
000
<210> 32
<211> 708
<212> PRT
<213> 甜叶菊(Stevia rebaudiana)
<400> 32
Met Ala Gln Ser Asn Ser Val Lys Ile Ser Pro Leu Asp Leu Val Thr
1 5 10 15
Ala Leu Phe Ser Gly Lys Val Leu Asp Thr Ser Asn Ala Ser Glu Ser
20 25 30
Gly Glu Ser Ala Met Leu Pro Thr Ile Ala Met Ile Met Glu Asn Arg
35 40 45
Glu Leu Leu Met Ile Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys
50 55 60
Val Val Val Leu Val Trp Arg Arg Ser Ser Thr Lys Lys Ser Ala Leu
65 70 75 80
Glu Pro Pro Val Ile Val Val Pro Lys Arg Val Gln Glu Glu Glu Val
85 90 95
Asp Asp Gly Lys Lys Lys Val Thr Val Phe Phe Gly Thr Gln Thr Gly
100 105 110
Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Ala Arg
115 120 125
Tyr Glu Lys Ala Val Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala
130 135 140
Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala Phe
145 150 155 160
Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala
165 170 175
Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asp Ala Lys Gly Glu Trp Leu
180 185 190
Asn Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu
195 200 205
His Phe Asn Lys Ile Ala Lys Val Val Asp Asp Gly Leu Val Glu Gln
210 215 220
Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln Cys
225 230 235 240
Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu Leu
245 250 255
Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Thr Val Ala Thr Pro Tyr
260 265 270
Thr Ala Ala Val Ala Glu Tyr Arg Val Val Phe His Glu Lys Pro Asp
275 280 285
Ala Leu Ser Glu Asp Tyr Ser Tyr Thr Asn Gly His Ala Val His Asp
290 295 300
Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu Leu His
305 310 315 320
Ser Pro Glu Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ser
325 330 335
Asn Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val Tyr Cys
340 345 350
Glu Asn Leu Ser Glu Val Val Asn Asp Ala Glu Arg Leu Val Gly Leu
355 360 365
Pro Pro Asp Thr Tyr Phe Ser Ile His Thr Asp Ser Glu Asp Gly Ser
370 375 380
Pro Leu Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr Leu
385 390 395 400
Arg Lys Ala Leu Thr Cys Tyr Ala Asp Val Leu Ser Ser Pro Lys Lys
405 410 415
Ser Ala Leu Leu Ala Leu Ala Ala His Ala Thr Asp Pro Ser Glu Ala
420 425 430
Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ser
435 440 445
Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Glu Ala
450 455 460
Phe Pro Ser Ala Lys Pro Ser Leu Gly Val Phe Phe Ala Ser Val Ala
465 470 475 480
Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Lys Met
485 490 495
Ala Pro Asp Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Thr
500 505 510
Pro Ala Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn
515 520 525
Ala Val Pro Met Thr Glu Ser Gln Asp Cys Ser Trp Ala Pro Ile Tyr
530 535 540
Val Arg Thr Ser Asn Phe Arg Leu Pro Ser Asp Pro Lys Val Pro Val
545 550 555 560
Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu
565 570 575
Gln Glu Arg Leu Ala Leu Lys Glu Ala Gly Thr Asp Leu Gly Leu Ser
580 585 590
Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile Tyr Glu
595 600 605
Asn Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu Leu Ile
610 615 620
Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys
625 630 635 640
Met Ser Glu Lys Ala Ser Asp Ile Trp Asn Leu Leu Ser Glu Gly Ala
645 650 655
Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp Val His
660 665 670
Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp Ser Ser
675 680 685
Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly Arg Tyr Leu
690 695 700
Arg Asp Val Trp
705
<210> 33
<211> 705
<212> PRT
<213> 黄花蒿(Artemisia annua)
<400> 33
Met Ala Gln Ser Thr Thr Ser Val Lys Leu Ser Pro Phe Asp Leu Met
1 5 10 15
Thr Ala Leu Leu Asn Gly Lys Val Ser Phe Asp Thr Ser Asn Thr Ser
20 25 30
Asp Thr Asn Ile Pro Leu Ala Val Phe Met Glu Asn Arg Glu Leu Leu
35 40 45
Met Ile Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys Val Val Val
50 55 60
Leu Val Trp Arg Arg Ser Ser Ser Ala Ala Lys Lys Ala Ala Glu Ser
65 70 75 80
Pro Val Ile Val Val Pro Lys Lys Val Thr Glu Asp Glu Val Asp Asp
85 90 95
Gly Arg Lys Lys Val Thr Val Phe Phe Gly Thr Gln Thr Gly Thr Ala
100 105 110
Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Ala Arg Tyr Glu
115 120 125
Lys Ala Val Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Glu Asp
130 135 140
Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala Phe Phe Phe
145 150 155 160
Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe
165 170 175
Tyr Lys Trp Phe Thr Glu Gly Glu Glu Lys Gly Glu Trp Leu Asp Lys
180 185 190
Leu Gln Tyr Ala Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe
195 200 205
Asn Lys Ile Ala Lys Val Val Asp Glu Lys Leu Val Glu Gln Gly Ala
210 215 220
Lys Arg Leu Val Pro Val Gly Met Gly Asp Asp Asp Gln Cys Ile Glu
225 230 235 240
Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu Leu Asp Gln
245 250 255
Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Ala Thr Pro Tyr Thr Ala
260 265 270
Ala Val Ala Glu Tyr Arg Val Val Phe His Asp Lys Pro Glu Thr Tyr
275 280 285
Asp Gln Asp Gln Leu Thr Asn Gly His Ala Val His Asp Ala Gln His
290 295 300
Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu Leu His Ser Pro Leu
305 310 315 320
Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ser Asn Thr Gly
325 330 335
Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val Tyr Val Glu Asn Leu
340 345 350
Ser Glu Val Val Asp Glu Ala Glu Lys Leu Ile Gly Leu Pro Pro His
355 360 365
Thr Tyr Phe Ser Val His Ala Asp Asn Glu Asp Gly Thr Pro Leu Gly
370 375 380
Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr Leu Arg Lys Ala
385 390 395 400
Leu Ala Ser Tyr Ala Asp Val Leu Ser Ser Pro Lys Lys Ser Ala Leu
405 410 415
Leu Ala Leu Ala Ala His Ala Thr Asp Ser Thr Glu Ala Asp Arg Leu
420 425 430
Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ala Gln Trp Ile
435 440 445
Val Ala Ser His Arg Ser Leu Leu Glu Val Met Glu Ala Phe Pro Ser
450 455 460
Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ser Val Ala Pro Arg Leu
465 470 475 480
Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Phe Ala Pro Asn
485 490 495
Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Gln Thr Pro Ser Gly
500 505 510
Arg Val His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ala Val Pro
515 520 525
Met Thr Glu Ser Gln Asp Cys Ser Trp Ala Pro Ile Tyr Val Arg Thr
530 535 540
Ser Asn Phe Arg Leu Pro Ser Asp Pro Lys Val Pro Val Ile Met Ile
545 550 555 560
Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu Arg
565 570 575
Leu Ala Gln Lys Glu Ala Gly Thr Glu Leu Gly Thr Ala Ile Leu Phe
580 585 590
Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile Tyr Glu Asp Glu Leu
595 600 605
Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu Leu Val Thr Ala Phe
610 615 620
Ser Arg Glu Gly Ala Thr Lys Glu Tyr Val Gln His Lys Met Thr Gln
625 630 635 640
Lys Ala Ser Asp Ile Trp Asn Leu Leu Ser Glu Gly Ala Tyr Leu Tyr
645 650 655
Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp Val His Arg Thr Leu
660 665 670
His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp Ser Ser Lys Ala Glu
675 680 685
Leu Tyr Val Lys Asn Leu Gln Met Ala Gly Arg Tyr Leu Arg Asp Val
690 695 700
Trp
705
<210> 34
<211> 706
<212> PRT
<213> 香叶天竺葵(Pelargonium graveolens)
<400> 34
Met Ala Gln Ser Ser Ser Gly Ser Met Ser Pro Phe Asp Phe Met Thr
1 5 10 15
Ala Ile Ile Lys Gly Lys Met Glu Pro Ser Asn Ala Ser Leu Gly Ala
20 25 30
Ala Gly Glu Val Thr Ala Met Ile Leu Asp Asn Arg Glu Leu Val Met
35 40 45
Ile Leu Thr Thr Ser Ile Ala Val Leu Ile Gly Cys Val Val Val Phe
50 55 60
Ile Trp Arg Arg Ser Ser Ser Gln Thr Pro Thr Ala Val Gln Pro Leu
65 70 75 80
Lys Pro Leu Leu Ala Lys Glu Thr Glu Ser Glu Val Asp Asp Gly Lys
85 90 95
Gln Lys Val Thr Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu Gly
100 105 110
Phe Ala Lys Ala Leu Ala Asp Glu Ala Lys Ala Arg Tyr Asp Lys Val
115 120 125
Thr Phe Lys Val Val Asp Leu Asp Asp Tyr Ala Ala Asp Asp Glu Glu
130 135 140
Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe Phe Leu Ala
145 150 155 160
Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr Lys
165 170 175
Trp Phe Leu Glu Gly Lys Glu Arg Gly Glu Trp Leu Gln Asn Leu Lys
180 185 190
Phe Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn Lys
195 200 205
Ile Ala Ile Val Val Asp Glu Ile Leu Ala Glu Gln Gly Gly Lys Arg
210 215 220
Leu Ile Ser Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp Asp
225 230 235 240
Phe Thr Ala Trp Arg Glu Ser Leu Trp Pro Glu Leu Asp Gln Leu Leu
245 250 255
Arg Asp Glu Asp Asp Thr Thr Val Ser Thr Pro Tyr Thr Ala Ala Val
260 265 270
Leu Glu Tyr Arg Val Val Phe His Asp Pro Ala Asp Ala Pro Thr Leu
275 280 285
Glu Lys Ser Tyr Ser Asn Ala Asn Gly His Ser Val Val Asp Ala Gln
290 295 300
His Pro Leu Arg Ala Asn Val Ala Val Arg Arg Glu Leu His Thr Pro
305 310 315 320
Ala Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ser Gly Thr
325 330 335
Gly Ile Ala Tyr Glu Thr Gly Asp His Val Gly Val Tyr Cys Glu Asn
340 345 350
Leu Ala Glu Thr Val Glu Glu Ala Leu Glu Leu Leu Gly Leu Ser Pro
355 360 365
Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu Asp Gly Thr Pro Leu
370 375 380
Ser Gly Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr Leu Arg Thr
385 390 395 400
Ala Leu Thr Leu His Ala Asp Leu Leu Ser Ser Pro Lys Lys Ser Ala
405 410 415
Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala Asp Arg
420 425 430
Leu Arg His Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ala Gln Trp
435 440 445
Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala Glu Phe Pro
450 455 460
Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ser Val Ala Pro Arg
465 470 475 480
Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Ile Ala Pro
485 490 495
Ser Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Thr Pro Thr
500 505 510
Gly Arg Val His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ser Val
515 520 525
Pro Ser Glu Lys Ser Asp Glu Cys Ser Trp Ala Pro Ile Phe Val Arg
530 535 540
Gln Ser Asn Phe Lys Leu Pro Ala Asp Ala Lys Val Pro Ile Ile Met
545 550 555 560
Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu
565 570 575
Arg Leu Ala Leu Lys Glu Ala Gly Thr Glu Leu Gly Pro Ser Ile Leu
580 585 590
Phe Phe Gly Cys Arg Asn Ser Lys Met Asp Tyr Ile Tyr Glu Asp Glu
595 600 605
Leu Asp Asn Phe Val Gln Asn Gly Ala Leu Ser Glu Leu Val Leu Ala
610 615 620
Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys Met Met
625 630 635 640
Glu Lys Ala Ser Asp Ile Trp Asn Leu Ile Ser Gln Gly Ala Tyr Leu
645 650 655
Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His Arg Thr
660 665 670
Leu His Thr Ile Ala Gln Glu Gln Gly Ser Leu Asp Ser Ser Lys Ala
675 680 685
Glu Ser Met Val Lys Asn Leu Gln Met Ser Gly Arg Tyr Leu Arg Asp
690 695 700
Val Trp
705
<210> 35
<211> 282
<212> PRT
<213> 二穗短柄草(Brachypodium distachyon)
<400> 35
Met Ser Ala Ala Ala Ala Val Ser Ser Ser Ser Ser Pro Arg Leu Glu
1 5 10 15
Gly Lys Val Ala Leu Val Thr Gly Gly Ala Ser Gly Ile Gly Glu Ala
20 25 30
Ile Val Arg Leu Phe Arg Gln His Gly Ala Lys Val Cys Ile Ala Asp
35 40 45
Val Gln Asp Glu Ala Gly Gln Gln Val Arg Asp Ser Leu Gly Asp Asp
50 55 60
Ala Gly Thr Asp Val Leu Phe Val His Cys Asp Val Thr Val Glu Glu
65 70 75 80
Asp Val Ser Arg Ala Val Asp Ala Ala Ala Glu Lys Phe Gly Thr Leu
85 90 95
Asp Ile Met Val Asn Asn Ala Gly Ile Thr Gly Asp Lys Val Thr Asp
100 105 110
Ile Arg Asn Leu Asp Phe Ala Glu Val Arg Lys Val Phe Asp Ile Asn
115 120 125
Val His Gly Met Leu Leu Gly Met Lys His Ala Ala Arg Val Met Ile
130 135 140
Pro Gly Lys Lys Gly Ser Ile Val Ser Leu Ala Ser Val Ala Ser Val
145 150 155 160
Met Gly Gly Met Gly Pro His Ala Tyr Thr Ala Ser Lys His Ala Val
165 170 175
Val Gly Leu Thr Lys Ser Val Ala Leu Glu Leu Gly Lys His Gly Ile
180 185 190
Arg Val Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Ala Leu Ser Met
195 200 205
Pro His Leu Pro Gln Gly Glu His Lys Gly Asp Ala Val Arg Asp Phe
210 215 220
Leu Ala Phe Val Gly Gly Glu Ala Asn Leu Lys Gly Val Asp Leu Leu
225 230 235 240
Pro Lys Asp Val Ala Gln Ala Val Leu Tyr Leu Ala Ser Asp Glu Ala
245 250 255
Arg Tyr Ile Ser Ala Leu Asn Leu Val Val Asp Gly Gly Phe Thr Ser
260 265 270
Val Asn Pro Asn Leu Lys Ala Phe Glu Asp
275 280
<210> 36
<211> 280
<212> PRT
<213> 橙子(Citrus sinensis)
<400> 36
Met Ser Asn Ser Asn Ser Thr Asp Ser Ser Pro Ala Val Gln Arg Leu
1 5 10 15
Val Gly Arg Val Ala Leu Ile Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Thr Val Arg Leu Phe His Lys His Gly Ala Lys Val Cys Ile Ala
35 40 45
Asp Val Gln Asp Asn Leu Gly Gln Gln Val Cys Gln Ser Leu Gly Gly
50 55 60
Glu Pro Asp Thr Phe Phe Cys His Cys Asp Val Thr Lys Glu Glu Asp
65 70 75 80
Val Cys Ser Ala Val Asp Leu Thr Val Glu Lys Phe Gly Thr Leu Asp
85 90 95
Ile Met Val Asn Asn Ala Gly Ile Ser Gly Ala Pro Cys Pro Asp Ile
100 105 110
Arg Glu Ala Asp Leu Ser Glu Phe Glu Lys Val Phe Asp Ile Asn Val
115 120 125
Lys Gly Val Phe His Gly Met Lys His Ala Ala Arg Ile Met Ile Pro
130 135 140
Gln Thr Lys Gly Thr Ile Ile Ser Ile Cys Ser Val Ala Gly Ala Ile
145 150 155 160
Gly Gly Leu Gly Pro His Ala Tyr Thr Gly Ser Lys His Ala Val Leu
165 170 175
Gly Leu Asn Lys Asn Val Ala Ala Glu Leu Gly Lys Tyr Gly Ile Arg
180 185 190
Val Asn Cys Val Ser Pro Tyr Ala Val Ala Thr Gly Leu Ala Leu Ala
195 200 205
His Leu Pro Glu Glu Glu Arg Thr Glu Asp Ala Met Val Gly Phe Arg
210 215 220
Asn Phe Val Ala Arg Asn Ala Asn Met Gln Gly Thr Glu Leu Thr Ala
225 230 235 240
Asn Asp Val Ala Asn Ala Val Leu Phe Leu Ala Ser Asp Glu Ala Arg
245 250 255
Tyr Ile Ser Gly Thr Asn Leu Met Val Asp Gly Gly Phe Thr Ser Val
260 265 270
Asn His Ser Leu Arg Val Phe Arg
275 280
<210> 37
<211> 280
<212> PRT
<213> 橙子(Citrus sinensis)
<400> 37
Met Ala Thr Pro Pro Ile Ser Ser Leu Ile Ser Gln Arg Leu Leu Gly
1 5 10 15
Lys Val Ala Leu Val Thr Gly Gly Ala Ser Gly Ile Gly Glu Gly Ile
20 25 30
Val Arg Leu Phe His Arg His Gly Ala Lys Val Cys Phe Val Asp Val
35 40 45
Gln Asp Glu Leu Gly Tyr Arg Leu Gln Glu Ser Leu Val Gly Asp Lys
50 55 60
Asp Ser Asn Ile Phe Tyr Ser His Cys Asp Val Thr Val Glu Asp Asp
65 70 75 80
Val Arg Arg Ala Val Asp Leu Thr Val Thr Lys Phe Gly Thr Leu Asp
85 90 95
Ile Met Val Asn Asn Ala Gly Ile Ser Gly Thr Pro Ser Ser Asp Ile
100 105 110
Arg Asn Val Asp Val Ser Glu Phe Glu Lys Val Phe Asp Ile Asn Val
115 120 125
Lys Gly Val Phe Met Gly Met Lys Tyr Ala Ala Ser Val Met Ile Pro
130 135 140
Arg Lys Gln Gly Ser Ile Ile Ser Leu Gly Ser Val Gly Ser Val Ile
145 150 155 160
Gly Gly Ile Gly Pro His His Tyr Ile Ser Ser Lys His Ala Val Val
165 170 175
Gly Leu Thr Arg Ser Ile Ala Ala Glu Leu Gly Gln His Gly Ile Arg
180 185 190
Val Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Asn Leu Ala Val Ala
195 200 205
His Leu Pro Glu Asp Glu Arg Thr Glu Asp Met Phe Thr Gly Phe Arg
210 215 220
Glu Phe Ala Lys Lys Asn Ala Asn Leu Gln Gly Val Glu Leu Thr Val
225 230 235 240
Glu Asp Val Ala Asn Ala Val Leu Phe Leu Ala Ser Glu Asp Ala Arg
245 250 255
Tyr Ile Ser Gly Asp Asn Leu Ile Val Asp Gly Gly Phe Thr Arg Val
260 265 270
Asn His Ser Phe Arg Val Phe Arg
275 280
<210> 38
<211> 262
<212> PRT
<213> 橙子(Citrus sinensis)
<400> 38
Met Ser Lys Pro Arg Leu Gln Gly Lys Val Ala Ile Ile Met Gly Ala
1 5 10 15
Ala Ser Gly Ile Gly Glu Ala Thr Ala Lys Leu Phe Ala Glu His Gly
20 25 30
Ala Phe Val Ile Ile Ala Asp Ile Gln Asp Glu Leu Gly Asn Gln Val
35 40 45
Val Ser Ser Ile Gly Pro Glu Lys Ala Ser Tyr Arg His Cys Asp Val
50 55 60
Arg Asp Glu Lys Gln Val Glu Glu Thr Val Ala Tyr Ala Ile Glu Lys
65 70 75 80
Tyr Gly Ser Leu Asp Ile Met Tyr Ser Asn Ala Gly Val Ala Gly Pro
85 90 95
Val Gly Thr Ile Leu Asp Leu Asp Met Ala Gln Phe Asp Arg Thr Ile
100 105 110
Ala Thr Asn Leu Ala Gly Ser Val Met Ala Val Lys Tyr Ala Ala Arg
115 120 125
Val Met Val Ala Asn Lys Ile Arg Gly Ser Ile Ile Cys Thr Thr Ser
130 135 140
Thr Ala Ser Thr Val Gly Gly Ser Gly Pro His Ala Tyr Thr Ile Ser
145 150 155 160
Lys His Gly Leu Leu Gly Leu Val Arg Ser Ala Ala Ser Glu Leu Gly
165 170 175
Lys His Gly Ile Arg Val Asn Cys Val Ser Pro Phe Gly Val Ala Thr
180 185 190
Pro Phe Ser Ala Gly Thr Ile Asn Asp Val Glu Gly Phe Val Cys Lys
195 200 205
Val Ala Asn Leu Lys Gly Ile Val Leu Lys Ala Lys His Val Ala Glu
210 215 220
Ala Ala Leu Phe Leu Ala Ser Asp Glu Ser Ala Tyr Val Ser Gly His
225 230 235 240
Asp Leu Val Val Asp Gly Gly Phe Thr Ala Val Thr Asn Val Met Ser
245 250 255
Met Leu Glu Gly His Gly
260
<210> 39
<211> 263
<212> PRT
<213> 橙子(Citrus sinensis)
<400> 39
Met Ser Asn Pro Arg Met Glu Gly Lys Val Ala Leu Ile Thr Gly Ala
1 5 10 15
Ala Ser Gly Ile Gly Glu Ala Ala Val Arg Leu Phe Ala Glu His Gly
20 25 30
Ala Phe Val Val Ala Ala Asp Val Gln Asp Glu Leu Gly His Gln Val
35 40 45
Ala Ala Ser Val Gly Thr Asp Gln Val Cys Tyr His His Cys Asp Val
50 55 60
Arg Asp Glu Lys Gln Val Glu Glu Thr Val Arg Tyr Thr Leu Glu Lys
65 70 75 80
Tyr Gly Lys Leu Asp Val Leu Phe Ser Asn Ala Gly Ile Met Gly Pro
85 90 95
Leu Thr Gly Ile Leu Glu Leu Asp Leu Thr Gly Phe Gly Asn Thr Met
100 105 110
Ala Thr Asn Val Cys Gly Val Ala Ala Thr Ile Lys His Ala Ala Arg
115 120 125
Ala Met Val Asp Lys Asn Ile Arg Gly Ser Ile Ile Cys Thr Thr Ser
130 135 140
Val Ala Ser Ser Leu Gly Gly Thr Ala Pro His Ala Tyr Thr Thr Ser
145 150 155 160
Lys His Ala Leu Val Gly Leu Val Arg Thr Ala Cys Ser Glu Leu Gly
165 170 175
Ala Tyr Gly Ile Arg Val Asn Cys Ile Ser Pro Phe Gly Val Ala Thr
180 185 190
Pro Leu Ser Cys Thr Ala Tyr Asn Leu Arg Pro Asp Glu Val Glu Ala
195 200 205
Asn Ser Cys Ala Leu Ala Asn Leu Lys Gly Ile Val Leu Lys Ala Lys
210 215 220
His Ile Ala Glu Ala Ala Leu Phe Leu Ala Ser Asp Glu Ser Ala Tyr
225 230 235 240
Ile Ser Gly His Asn Leu Ala Val Asp Gly Gly Phe Thr Val Val Asn
245 250 255
His Ser Ser Ser Ser Ala Thr
260
<210> 40
<211> 278
<212> PRT
<213> 橙子(Citrus sinensis)
<400> 40
Met Thr Thr Ala Gly Ser Arg Asp Ser Pro Leu Val Ala Gln Arg Leu
1 5 10 15
Leu Gly Lys Val Ala Leu Val Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Ile Val Arg Leu Phe His Lys His Gly Ala Lys Val Cys Val Val
35 40 45
Asp Ile Asn Asp Asp Leu Gly Gln His Leu Cys Gln Thr Leu Gly Pro
50 55 60
Thr Thr Arg Phe Ile His Gly Asp Val Ala Ile Glu Asp Asp Val Ser
65 70 75 80
Arg Ala Val Asp Phe Thr Val Ala Asn Phe Gly Thr Leu Asp Ile Met
85 90 95
Val Asn Asn Ala Gly Met Gly Gly Pro Pro Cys Pro Asp Ile Arg Glu
100 105 110
Phe Pro Ile Ser Thr Phe Glu Lys Val Phe Asp Ile Asn Thr Lys Gly
115 120 125
Thr Phe Ile Gly Met Lys His Ala Ala Arg Val Met Ile Pro Ser Lys
130 135 140
Lys Gly Ser Ile Val Ser Ile Ser Ser Val Thr Ser Ala Ile Gly Gly
145 150 155 160
Ala Gly Pro His Ala Tyr Thr Ala Ser Lys His Ala Val Leu Gly Leu
165 170 175
Thr Lys Ser Val Ala Ala Glu Leu Gly Gln His Gly Ile Arg Val Asn
180 185 190
Cys Val Ser Pro Tyr Ala Ile Leu Thr Asn Leu Ala Leu Ala His Leu
195 200 205
His Glu Asp Glu Arg Thr Asp Asp Ala Arg Ala Gly Phe Arg Ala Phe
210 215 220
Ile Gly Lys Asn Ala Asn Leu Gln Gly Val Asp Leu Val Glu Asp Asp
225 230 235 240
Val Ala Asn Ala Val Leu Phe Leu Ala Ser Asp Asp Ala Arg Tyr Ile
245 250 255
Ser Gly Asp Asn Leu Phe Val Asp Gly Gly Phe Thr Cys Thr Asn His
260 265 270
Ser Leu Arg Val Phe Arg
275
<210> 41
<211> 277
<212> PRT
<213> 红串红球菌(Rhodococcus erythropolis)
<400> 41
Met Ala Arg Val Glu Gly Gln Val Ala Leu Ile Thr Gly Ala Ala Arg
1 5 10 15
Gly Gln Gly Arg Ser His Ala Ile Lys Leu Ala Glu Glu Gly Ala Asp
20 25 30
Val Ile Leu Val Asp Val Pro Asn Asp Val Val Asp Ile Gly Tyr Pro
35 40 45
Leu Gly Thr Ala Asp Glu Leu Asp Gln Thr Ala Lys Asp Val Glu Asn
50 55 60
Leu Gly Arg Lys Ala Ile Val Ile His Ala Asp Val Arg Asp Leu Glu
65 70 75 80
Ser Leu Thr Ala Glu Val Asp Arg Ala Val Ser Thr Leu Gly Arg Leu
85 90 95
Asp Ile Val Ser Ala Asn Ala Gly Ile Ala Ser Val Pro Phe Leu Ser
100 105 110
His Asp Ile Pro Asp Asn Thr Trp Arg Gln Met Ile Asp Ile Asn Leu
115 120 125
Thr Gly Val Trp His Thr Ala Lys Val Ala Val Pro His Ile Leu Ala
130 135 140
Gly Glu Arg Gly Gly Ser Ile Val Leu Thr Ser Ser Ala Ala Gly Leu
145 150 155 160
Lys Gly Tyr Ala Gln Ile Ser His Tyr Ser Ala Ala Lys His Gly Val
165 170 175
Val Gly Leu Met Arg Ser Leu Ala Leu Glu Leu Ala Pro His Arg Val
180 185 190
Arg Val Asn Ser Leu His Pro Thr Gln Val Asn Thr Pro Met Ile Gln
195 200 205
Asn Glu Gly Thr Tyr Arg Ile Phe Ser Pro Asp Leu Glu Asn Pro Thr
210 215 220
Arg Glu Asp Phe Glu Ile Ala Ser Thr Thr Thr Asn Ala Leu Pro Ile
225 230 235 240
Pro Trp Val Glu Ser Val Asp Val Ser Asn Ala Leu Leu Phe Leu Val
245 250 255
Ser Glu Asp Ala Arg Tyr Ile Thr Gly Ala Ala Ile Pro Val Asp Ala
260 265 270
Gly Thr Thr Leu Lys
275
<210> 42
<211> 279
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 合成序列
<400> 42
Met Ser Thr Ala Ser Ser Gly Asp Val Ser Leu Leu Ser Gln Arg Leu
1 5 10 15
Val Gly Lys Val Ala Leu Ile Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Ile Ala Arg Leu Phe Tyr Arg His Gly Ala Lys Val Cys Ile Val
35 40 45
Asp Ile Gln Asp Asn Pro Gly Gln Asn Leu Cys Arg Glu Leu Gly Thr
50 55 60
Asp Asp Ala Cys Phe Phe His Cys Asp Val Ser Ile Glu Ile Asp Val
65 70 75 80
Ile Arg Ala Val Asp Phe Val Val Asn Arg Phe Gly Lys Leu Asp Ile
85 90 95
Met Val Asn Asn Ala Gly Ile Ala Asp Pro Pro Cys Pro Asp Ile Arg
100 105 110
Asn Thr Asp Leu Ser Ile Phe Glu Lys Val Phe Asp Val Asn Val Lys
115 120 125
Gly Thr Phe Gln Cys Met Lys His Ala Ala Arg Val Met Val Pro Gln
130 135 140
Lys Lys Gly Ser Ile Ile Ser Leu Thr Ser Val Ala Ser Val Ile Gly
145 150 155 160
Gly Ala Gly Pro His Ala Tyr Thr Gly Ser Lys His Ala Val Leu Gly
165 170 175
Leu Thr Lys Ser Val Ala Ala Glu Leu Gly Leu His Gly Ile Arg Val
180 185 190
Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Gly Met Pro Leu Ala His
195 200 205
Leu Pro Glu Ser Glu Lys Thr Glu Asp Ala Met Met Gly Met Arg Ala
210 215 220
Phe Val Gly Arg Asn Ala Asn Leu Gln Gly Ile Glu Leu Thr Val Asp
225 230 235 240
Asp Val Ala Asn Ser Val Val Phe Leu Ala Ser Asp Glu Ala Arg Tyr
245 250 255
Val Ser Gly Leu Asn Leu Met Leu Asp Gly Gly Phe Ser Cys Val Asn
260 265 270
His Ser Leu Arg Val Phe Arg
275
<210> 43
<211> 280
<212> PRT
<213> 葡萄(Vitis vinifera)
<400> 43
Met Ala Ala Thr Ser Ile Asp Asn Ser Pro Leu Pro Ser Gln Arg Leu
1 5 10 15
Leu Gly Lys Val Ala Leu Val Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Ile Val Arg Leu Phe Leu Lys Gln Gly Ala Lys Val Cys Ile Val
35 40 45
Asp Val Gln Asp Asp Leu Gly Gln Lys Leu Cys Asp Thr Leu Gly Gly
50 55 60
Asp Pro Asn Val Ser Phe Phe His Cys Asp Val Thr Ile Glu Asp Asp
65 70 75 80
Val Cys His Ala Val Asp Phe Thr Val Thr Lys Phe Gly Thr Leu Asp
85 90 95
Ile Met Val Asn Asn Ala Gly Met Ala Gly Pro Pro Cys Ser Asp Ile
100 105 110
Arg Asn Val Glu Val Ser Met Phe Glu Lys Val Phe Asp Val Asn Val
115 120 125
Lys Gly Val Phe Leu Gly Met Lys His Ala Ala Arg Ile Met Ile Pro
130 135 140
Leu Lys Lys Gly Thr Ile Ile Ser Leu Cys Ser Val Ser Ser Ala Ile
145 150 155 160
Ala Gly Val Gly Pro His Ala Tyr Thr Gly Ser Lys Cys Ala Val Ala
165 170 175
Gly Leu Thr Gln Ser Val Ala Ala Glu Met Gly Gly His Gly Ile Arg
180 185 190
Val Asn Cys Ile Ser Pro Tyr Ala Ile Ala Thr Gly Leu Ala Leu Ala
195 200 205
His Leu Pro Glu Asp Glu Arg Thr Glu Asp Ala Met Ala Gly Phe Arg
210 215 220
Ala Phe Val Gly Lys Asn Ala Asn Leu Gln Gly Val Glu Leu Thr Val
225 230 235 240
Asp Asp Val Ala His Ala Ala Val Phe Leu Ala Ser Asp Glu Ala Arg
245 250 255
Tyr Ile Ser Gly Leu Asn Leu Met Leu Asp Gly Gly Phe Ser Cys Thr
260 265 270
Asn His Ser Leu Arg Val Phe Arg
275 280
<210> 44
<211> 267
<212> PRT
<213> 红球姜(Zingiber zerumbet)
<400> 44
Met Arg Leu Glu Gly Lys Val Ala Leu Val Thr Gly Gly Ala Ser Gly
1 5 10 15
Ile Gly Glu Ser Ile Ala Arg Leu Phe Ile Glu His Gly Ala Lys Ile
20 25 30
Cys Ile Val Asp Val Gln Asp Glu Leu Gly Gln Gln Val Ser Gln Arg
35 40 45
Leu Gly Gly Asp Pro His Ala Cys Tyr Phe His Cys Asp Val Thr Val
50 55 60
Glu Asp Asp Val Arg Arg Ala Val Asp Phe Thr Ala Glu Lys Tyr Gly
65 70 75 80
Thr Ile Asp Ile Met Val Asn Asn Ala Gly Ile Thr Gly Asp Lys Val
85 90 95
Ile Asp Ile Arg Asp Ala Asp Phe Asn Glu Phe Lys Lys Val Phe Asp
100 105 110
Ile Asn Val Asn Gly Val Phe Leu Gly Met Lys His Ala Ala Arg Ile
115 120 125
Met Ile Pro Lys Met Lys Gly Ser Ile Val Ser Leu Ala Ser Val Ser
130 135 140
Ser Val Ile Ala Gly Ala Gly Pro His Gly Tyr Thr Gly Ala Lys His
145 150 155 160
Ala Val Val Gly Leu Thr Lys Ser Val Ala Ala Glu Leu Gly Arg His
165 170 175
Gly Ile Arg Val Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Arg Leu
180 185 190
Ser Met Pro Tyr Leu Pro Glu Ser Glu Met Gln Glu Asp Ala Leu Arg
195 200 205
Gly Phe Leu Thr Phe Val Arg Ser Asn Ala Asn Leu Lys Gly Val Asp
210 215 220
Leu Met Pro Asn Asp Val Ala Glu Ala Val Leu Tyr Leu Ala Thr Glu
225 230 235 240
Glu Ser Lys Tyr Val Ser Gly Leu Asn Leu Val Ile Asp Gly Gly Phe
245 250 255
Ser Ile Ala Asn His Thr Leu Gln Val Phe Glu
260 265
<210> 45
<211> 514
<212> PRT
<213> 绒毛状烟草(Nicotiana tomentosiformis)
<400> 45
Met Asp Ala Ile Leu Asn Leu Gln Thr Val Pro Leu Gly Thr Ala Leu
1 5 10 15
Thr Ile Gly Gly Pro Ala Val Ala Leu Gly Gly Ile Ser Leu Trp Phe
20 25 30
Leu Lys Glu Tyr Val Asn Asp Gln Lys Arg Lys Ser Ser Asn Phe Leu
35 40 45
Pro Pro Leu Pro Glu Val Pro Gly Leu Pro Val Ile Gly Asn Leu Leu
50 55 60
Gln Leu Thr Glu Lys Lys Pro His Lys Thr Phe Thr Asn Trp Ala Glu
65 70 75 80
Thr Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Asn Thr Ile Val
85 90 95
Val Leu Asn Thr Asn Glu Leu Ala Lys Glu Ala Met Val Thr Arg Tyr
100 105 110
Ser Ala Ile Ser Thr Arg Lys Leu Thr Asn Ala Leu Lys Ile Leu Thr
115 120 125
Cys Asp Lys Ser Ile Val Ala Ile Ser Asp Tyr Asp Glu Phe His Lys
130 135 140
Thr Val Lys Arg His Val Leu Thr Ser Val Leu Gly Pro Asn Ala Gln
145 150 155 160
Lys Arg His Arg Ile His Arg Asp Thr Leu Ile Glu Asn Val Ser Lys
165 170 175
Gln Leu His Asp Leu Val Arg Lys Tyr Pro Asn Glu Ala Val Asn Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Leu Lys Gln Ala
195 200 205
Leu Gly Lys Asp Ile Glu Ser Ile Tyr Val Glu Gly Leu Asp Ala Thr
210 215 220
Leu Pro Arg Glu Asp Val Leu Lys Thr Leu Val Leu Asp Ile Met Glu
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Ser Phe Glu Asn Arg Ile Gln Arg Lys His Leu Arg
260 265 270
Arg Glu Ala Val Met Lys Ala Leu Ile Met Glu Gln Arg Lys Arg Ile
275 280 285
Asn Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Ser Ser Glu
290 295 300
Ala Asn Thr Leu Thr Glu Lys Gln Ile Leu Met Leu Leu Trp Glu Ala
305 310 315 320
Ile Ile Glu Thr Ser Asp Thr Thr Val Val Ser Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asp Pro Lys Arg Gln Glu Gln Leu Phe Leu Glu
340 345 350
Ile Gln Asn Val Cys Gly Ser Asn Lys Ile Thr Glu Glu Lys Leu Cys
355 360 365
Gln Leu Pro Tyr Leu Cys Ala Val Phe His Glu Thr Leu Arg Lys His
370 375 380
Ser Pro Val Pro Ile Val Pro Leu Arg Tyr Val His Glu Asp Thr Gln
385 390 395 400
Leu Gly Gly Tyr His Ile Pro Lys Gly Ala Glu Ile Ala Ile Asn Ile
405 410 415
Tyr Gly Cys Asn Arg Asp Lys Lys Val Trp Glu Ser Pro Glu Glu Trp
420 425 430
Lys Pro Glu Arg Phe Leu Asp Gly Lys Tyr Asp Pro Val Glu Leu Gln
435 440 445
Lys Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ala Leu
450 455 460
Gln Ala Met Thr Ile Thr Cys Thr Thr Ile Ala Arg Leu Ile Gln Glu
465 470 475 480
Phe Glu Trp Ser Leu Lys Asp Gly Glu Glu Glu Asn Val Ala Thr Met
485 490 495
Gly Leu Thr Thr His Lys Leu His Pro Met Gln Ala His Ile Lys Pro
500 505 510
Arg Lys
<210> 46
<211> 512
<212> PRT
<213> 莴苣(Lactuca sativa)
<400> 46
Met Asp Gly Val Ile Asp Met Gln Thr Ile Pro Leu Arg Thr Ala Ile
1 5 10 15
Ala Ile Gly Gly Thr Ala Val Ala Leu Val Val Ala Leu Tyr Phe Trp
20 25 30
Phe Leu Arg Ser Tyr Ala Ser Pro Ser His His Ser Asn His Leu Pro
35 40 45
Pro Val Pro Glu Val Pro Gly Val Pro Val Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Lys Trp Ala Glu Met
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Arg Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Val Val Val Thr Arg Phe Pro
100 105 110
Ser Ile Ser Thr Arg Lys Leu Ser Tyr Ala Leu Lys Val Leu Thr Glu
115 120 125
Asp Lys Ser Met Val Ala Met Ser Asp Tyr His Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys Phe Arg Ala His Arg Asp Thr Met Met Glu Asn Val Ser Asn Glu
165 170 175
Leu His Ala Phe Phe Glu Lys Asn Pro Asn Gln Glu Val Asn Leu Arg
180 185 190
Lys Ile Phe Gln Ser Gln Leu Phe Gly Leu Ala Met Lys Gln Ala Leu
195 200 205
Gly Lys Asp Val Glu Ser Ile Tyr Val Lys Asp Leu Glu Thr Thr Met
210 215 220
Lys Arg Glu Glu Ile Phe Glu Val Leu Val Val Asp Pro Met Met Gly
225 230 235 240
Ala Ile Glu Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Val
245 250 255
Pro Asn Lys Ser Phe Glu Asn Ile Ile His Arg Met Tyr Thr Arg Arg
260 265 270
Glu Ala Val Met Lys Ala Leu Ile Gln Glu His Lys Lys Arg Ile Ala
275 280 285
Ser Gly Glu Asn Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu Ala
290 295 300
Gln Thr Leu Thr Asp Lys Gln Leu Leu Met Ser Leu Trp Glu Pro Ile
305 310 315 320
Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met Tyr
325 330 335
Glu Leu Ala Lys Asn Pro Asn Met Gln Asp Arg Leu Tyr Glu Glu Ile
340 345 350
Gln Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu Asn Leu Ser Gln
355 360 365
Leu Pro Tyr Leu Tyr Ala Val Phe Gln Glu Thr Leu Arg Lys His Cys
370 375 380
Pro Val Pro Ile Met Pro Leu Arg Tyr Val His Glu Asn Thr Val Leu
385 390 395 400
Gly Gly Tyr His Val Pro Ala Gly Thr Glu Val Ala Ile Asn Ile Tyr
405 410 415
Gly Cys Asn Met Asp Lys Lys Val Trp Glu Asn Pro Glu Glu Trp Asn
420 425 430
Pro Glu Arg Phe Leu Ser Glu Lys Glu Ser Met Asp Leu Tyr Lys Thr
435 440 445
Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln Ala
450 455 460
Met Val Ile Ser Cys Ile Gly Ile Gly Arg Leu Val Gln Asp Phe Glu
465 470 475 480
Trp Lys Leu Lys Asp Asp Ala Glu Glu Asp Val Asn Thr Leu Gly Leu
485 490 495
Thr Thr Gln Lys Leu His Pro Leu Leu Ala Leu Ile Asn Pro Arg Lys
500 505 510
<210> 47
<211> 513
<212> PRT
<213> 洋蓟(Cynara cardunculus)
<220>
<221> misc_feature
<222> (307)..(307)
<223> Xaa可为任何天然存在的氨基酸
<220>
<221> misc_feature
<222> (356)..(356)
<223> Xaa可为任何天然存在的氨基酸
<400> 47
Met Asp Met Gln Ser Ile Pro Ala Ile Ala Ile Gly Ser Thr Ala Val
1 5 10 15
Ala Ile Ala Leu Gly Leu Phe Phe Trp Phe Phe Arg Arg His Val Pro
20 25 30
Asp His Ile Asp His Pro Asn His Leu Pro Ser Val Pro Glu Val Pro
35 40 45
Gly Ile Pro Val Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro
50 55 60
Tyr Met Thr Phe Thr Lys Trp Ala Glu Thr Tyr Gly Pro Ile Tyr Ser
65 70 75 80
Ile Arg Thr Gly Ala Ile Ser Met Val Val Val Ser Ser Asn Ala Ile
85 90 95
Ala Lys Glu Ala Leu Val Thr Arg Phe Pro Ser Ile Ser Thr Arg Lys
100 105 110
Leu Ser Lys Ala Leu Glu Val Leu Thr Ala Asp Lys Thr Met Val Ala
115 120 125
Met Ser Asp Tyr Asn Asp Tyr His Lys Thr Val Lys Arg His Ile Leu
130 135 140
Thr Ala Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Val His Arg
145 150 155 160
Asp Ile Met Met Gln Asn Leu Ser Asn Gln Leu His Thr Phe Val Gln
165 170 175
Asn Ser Pro Gln Glu Glu Val Asn Leu Arg Lys Val Phe Gln Ser Glu
180 185 190
Leu Phe Gly Leu Ala Met Arg Gln Thr Met Gly Lys Asp Val Glu Ser
195 200 205
Ile Tyr Val Glu Asp Leu Gly Thr Thr Met Asn Arg Asp Glu Ile Phe
210 215 220
Gln Val Leu Val Val Asp Pro Leu Met Gly Ala Ile Glu Val Asp Trp
225 230 235 240
Arg Asp Phe Phe Pro Tyr Leu Lys Trp Ile Pro Asn Arg Asn Phe Glu
245 250 255
Asn Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ala
260 265 270
Leu Ile Gln Glu His Arg Lys Arg Ile Ala Ser Gly Glu Asn Leu Asn
275 280 285
Ser Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Ser Glu Lys
290 295 300
Gln Leu Xaa Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr
305 310 315 320
Thr Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro
325 330 335
Lys Ile Gln Asp Arg Leu Tyr Arg Glu Ile Gln Gly Val Cys Gly Ser
340 345 350
Asp Lys Ile Xaa Glu Glu Asn Leu Gly Gln Leu Pro Tyr Leu Ser Ala
355 360 365
Ile Phe Asn Glu Thr Leu Arg Arg His Gly Pro Val Pro Ile Ile Pro
370 375 380
Leu Arg Tyr Val His Glu Asp Thr Glu Leu Gly Gly Tyr His Ile Pro
385 390 395 400
Ala Gly Thr Gln Ile Ala Val Asn Ile Tyr Gly Cys Asn Met Glu Lys
405 410 415
Ala Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Phe Glu
420 425 430
Val Glu Gly Asp Gln Lys Thr Met Ala Phe Gly Gly Gly Lys Arg Val
435 440 445
Cys Ala Gly Ser Leu Gln Ala Met Leu Ile Ala Cys Ile Gly Ile Gly
450 455 460
Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Glu Ala Ala Gln
465 470 475 480
Glu Asp Val Asn Thr Leu Gly Leu Thr Thr Gln Lys Leu Arg Pro Leu
485 490 495
His Ala Ile Ile Tyr Pro Arg Lys Glu Asn Asp Ala Lys Val Trp Lys
500 505 510
Cys
<210> 48
<400> 48
000
<210> 49
<211> 512
<212> PRT
<213> 黄花蒿(Artemisia annua)
<400> 49
Met Asp Ala Leu Thr Asp Met Leu Gln Ile Pro Pro Ala Thr Pro Ile
1 5 10 15
Thr Val Ala Ile Thr Thr Val Thr Ile Ala Val Ala Ile Phe Leu Tyr
20 25 30
Ile Lys Ser His Ala Ser Asn His Ser Arg Arg Ser Thr His Leu Pro
35 40 45
Pro Val Pro Glu Val Pro Gly Val Pro Val Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Leu Thr Phe Thr Arg Trp Ala Gln Thr
65 70 75 80
Tyr Gly Ala Ile Tyr Ser Ile Arg Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Ser Glu Ile Ala Lys Glu Ala Met Val Thr Arg Phe Ser
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Thr Ile Leu Thr Ala
115 120 125
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asn Asp Tyr His Arg Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Met Leu Gly Pro Asn Ala Gln Arg
145 150 155 160
Lys Gln Arg Val His Arg Asp Phe Met Ile Glu Asn Ile Ser Lys Gln
165 170 175
Leu His Ala Phe Val Glu Asn Ser Pro Lys Glu Glu Val Asp Leu Arg
180 185 190
Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Lys Gln Ala Val
195 200 205
Gly Lys Asp Val Glu Ser Leu Asn Val Glu Asp Leu Gly Val Thr Met
210 215 220
Lys Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met Gly
225 230 235 240
Ala Ile Glu Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Val
245 250 255
Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg Arg
260 265 270
Lys Ala Val Met Lys Ala Leu Ile Lys Glu His Lys Lys Arg Ile Ala
275 280 285
Ser Gly Glu Asn Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu Ala
290 295 300
Gln Thr Phe Thr Asp Glu Gln Leu Ile Met Ser Leu Trp Glu Pro Ile
305 310 315 320
Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met Tyr
325 330 335
Glu Leu Ala Lys Asn Pro Lys Met Gln Asp Arg Leu Tyr Arg Asp Ile
340 345 350
Gln Ser Val Cys Gly Ser Asp Lys Ile Thr Glu Glu Asn Leu Ser Gln
355 360 365
Leu Pro Tyr Leu Ser Ala Ile Phe His Glu Thr Leu Arg Arg His Ser
370 375 380
Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Val Leu
385 390 395 400
Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile Tyr
405 410 415
Gly Cys Asn Met Glu Lys Asn Val Trp Glu Asn Pro Glu Glu Tyr Asn
420 425 430
Pro Asp Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Met Gln Arg Thr
435 440 445
Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln Ala
450 455 460
Met Leu Ile Ser Cys Ile Gly Ile Gly Arg Met Val Gln Glu Phe Glu
465 470 475 480
Trp Arg Phe Lys Asp Lys Ala Glu Glu Asp Ile Asn Thr Leu Gly Leu
485 490 495
Thr Thr Gln Arg Leu Asn Pro Leu Arg Ala Ile Ile Lys Pro Arg Asn
500 505 510
<210> 50
<211> 511
<212> PRT
<213> 向日葵(Helianthus annuus)
<400> 50
Met Asp Ala Leu Thr Gly Met Leu Pro Ile Pro Pro Ala Thr Ala Leu
1 5 10 15
Ala Ile Gly Gly Thr Ala Ile Ala Leu Ala Val Ala Ile Ser Phe Trp
20 25 30
Phe Leu Arg Ser Tyr Thr Ser Gly Glu Ser Asn Arg Leu Pro Arg Val
35 40 45
Pro Glu Val Pro Gly Val Pro Val Leu Gly Asn Leu Leu Gln Leu Lys
50 55 60
Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Glu Thr Tyr Gly
65 70 75 80
Pro Ile Tyr Ser Ile Arg Thr Gly Ala Thr Ser Met Val Val Val Ser
85 90 95
Ser Asn Glu Ile Ala Lys Glu Ala Phe Val Thr Arg Phe Glu Ser Ile
100 105 110
Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Ile Leu Thr Asp Asp Lys
115 120 125
Thr Met Val Ala Met Ser Asp Tyr Asn Asp Tyr His Lys Thr Val Lys
130 135 140
Arg His Ile Leu Thr Ala Met Leu Gly Pro Asn Ala Gln Lys Lys His
145 150 155 160
Arg Ile Gln Arg Asp Ile Met Met Glu Asn Leu Ser Asn Arg Leu His
165 170 175
Ala Phe Val Lys Thr Ser Thr Glu Gln Glu Glu Val Asp Leu Arg Glu
180 185 190
Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Thr Met Gly
195 200 205
Lys Asp Val Glu Ser Ile Tyr Val Glu Asp Leu Lys Ile Thr Met Lys
210 215 220
Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met Gly Ala
225 230 235 240
Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro
245 250 255
Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu
260 265 270
Ala Val Met Lys Ala Leu Ile Lys Gln His Lys Glu Arg Ile Ala Ser
275 280 285
Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln
290 295 300
Ser Leu Thr Asp Arg Gln Leu Leu Met Ser Val Trp Glu Pro Ile Ile
305 310 315 320
Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Ile Tyr Glu
325 330 335
Leu Ala Lys Asn Pro His Ile Gln Asp Arg Leu Tyr Arg Asp Ile Gln
340 345 350
Ser Val Cys Gly Ser Asp Ile Ile Lys Glu Glu His Leu Ser Gln Leu
355 360 365
Pro Phe Ile Thr Ala Ile Phe His Glu Thr Leu Arg Arg His Ser Pro
370 375 380
Val Pro Ile Ile Pro Leu Arg Tyr Val His Glu Asp Thr Val Leu Gly
385 390 395 400
Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Ile Asn Ile Tyr Gly
405 410 415
Cys Asn Met Glu Lys Ser Val Trp Glu Asn Pro Glu Glu Trp Asn Pro
420 425 430
Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys Thr Met
435 440 445
Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln Ala Met
450 455 460
Leu Ile Ser Cys Val Gly Ile Gly Arg Met Val Gln Glu Phe Lys Trp
465 470 475 480
Glu Leu Lys Asn Lys Ala Gln Glu Glu Val Asn Thr Ile Gly Leu Thr
485 490 495
Thr Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg Asn
500 505 510
<210> 51
<211> 505
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 合成序列
<400> 51
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Lys Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Val Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Leu Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Gln Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu Gln Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 52
<211> 492
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 合成序列
<400> 52
Met Ala Lys Pro Pro Leu Phe Phe Ile Val Ile Ile Gly Leu Ile Val
1 5 10 15
Val Ala Ala Ser Phe Leu Tyr Lys Leu Leu Thr Arg Pro Thr Ser Ser
20 25 30
Lys Asn Arg Leu Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met
35 40 45
His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Asp Leu Ala
50 55 60
Arg Lys Tyr Gly Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Ala
65 70 75 80
Ile Val Val Ser Ser Pro Lys Trp Ala Lys Glu Ile Leu Thr Thr Tyr
85 90 95
Asp Ile Pro Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Ile
100 105 110
Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp
115 120 125
Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Val Lys Lys
130 135 140
Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val
145 150 155 160
Gln Glu Ile Lys Ala Ser Gly Ser Gly Thr Pro Phe Asn Leu Ser Glu
165 170 175
Gly Ile Phe Lys Val Ile Ala Thr Val Leu Ser Arg Ala Ala Phe Gly
180 185 190
Lys Gly Ile Lys Asp Gln Lys Gln Phe Thr Glu Ile Val Lys Glu Ile
195 200 205
Leu Arg Glu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys
210 215 220
Lys Phe Leu His His Leu Ser Gly Lys Arg Gly Arg Leu Thr Ser Ile
225 230 235 240
His Asn Lys Leu Asp Ser Leu Ile Asn Asn Leu Val Ala Glu His Thr
245 250 255
Val Ser Lys Ser Ser Lys Val Asn Glu Thr Leu Leu Asp Val Leu Leu
260 265 270
Arg Leu Lys Asn Ser Glu Glu Phe Pro Leu Thr Ala Asp Asn Val Lys
275 280 285
Ala Ile Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala
290 295 300
Thr Val Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met
305 310 315 320
Glu Lys Val Gln Ala Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg
325 330 335
Ile Lys Glu Glu Glu Ile Gln Asp Leu Pro Tyr Leu Asn Leu Val Ile
340 345 350
Arg Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg
355 360 365
Glu Cys Arg Gln Ala Met Asn Leu Ala Gly Tyr Asp Val Ala Asn Lys
370 375 380
Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr
385 390 395 400
Trp Lys Asp Ala Glu Ser Phe Asn Pro Glu Arg Phe Glu Asn Ser Asn
405 410 415
Thr Thr Ile Met Gly Ala Asp Tyr Glu Tyr Leu Pro Phe Gly Ala Gly
420 425 430
Arg Arg Met Cys Pro Gly Ser Ala Leu Gly Leu Ala Asn Val Gln Leu
435 440 445
Pro Leu Ala Asn Ile Leu Tyr Tyr Phe Lys Trp Lys Leu Pro Asn Gly
450 455 460
Ala Ser His Asp Gln Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val
465 470 475 480
Gln Arg Lys Thr Glu Leu Met Leu Val Pro Ser Phe
485 490
<210> 53
<211> 709
<212> PRT
<213> 喜树(Camptotheca acuminata)
<400> 53
Met Ala Gln Ser Ser Ser Val Lys Val Ser Thr Phe Asp Leu Met Ser
1 5 10 15
Ala Ile Leu Arg Gly Arg Ser Met Asp Gln Thr Asn Val Ser Phe Glu
20 25 30
Ser Gly Glu Ser Pro Ala Leu Ala Met Leu Ile Glu Asn Arg Glu Leu
35 40 45
Val Met Ile Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys Phe Val
50 55 60
Val Leu Leu Trp Arg Arg Ser Ser Gly Lys Ser Gly Lys Val Thr Glu
65 70 75 80
Pro Pro Lys Pro Leu Met Val Lys Thr Glu Pro Glu Pro Glu Val Asp
85 90 95
Asp Gly Lys Lys Lys Val Ser Ile Phe Tyr Gly Thr Gln Thr Gly Thr
100 105 110
Ala Glu Gly Phe Ala Lys Ala Leu Ala Glu Glu Ala Lys Val Arg Tyr
115 120 125
Glu Lys Ala Ser Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp
130 135 140
Asp Glu Glu Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Thr Phe Phe
145 150 155 160
Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg
165 170 175
Phe Tyr Lys Trp Phe Met Glu Gly Lys Glu Arg Gly Asp Trp Leu Lys
180 185 190
Asn Leu His Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His
195 200 205
Phe Asn Arg Ile Ala Lys Val Val Asp Asp Thr Ile Ala Glu Gln Gly
210 215 220
Gly Lys Arg Leu Ile Pro Val Gly Leu Gly Asp Asp Asp Gln Cys Ile
225 230 235 240
Glu Asp Asp Phe Ala Ala Trp Arg Glu Leu Leu Trp Pro Glu Leu Asp
245 250 255
Gln Leu Leu Gln Asp Glu Asp Gly Thr Thr Val Ala Thr Pro Tyr Thr
260 265 270
Ala Ala Val Leu Glu Tyr Arg Val Val Phe His Asp Ser Pro Asp Ala
275 280 285
Ser Leu Leu Asp Lys Ser Phe Ser Lys Ser Asn Gly His Ala Val His
290 295 300
Asp Ala Gln His Pro Cys Arg Ala Asn Val Ala Val Arg Arg Glu Leu
305 310 315 320
His Thr Pro Ala Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile
325 330 335
Ser Gly Thr Gly Leu Val Tyr Glu Thr Gly Asp His Val Gly Val Tyr
340 345 350
Cys Glu Asn Leu Ile Glu Val Val Glu Glu Ala Glu Met Leu Leu Gly
355 360 365
Leu Ser Pro Asp Thr Phe Phe Ser Ile His Thr Asp Lys Glu Asp Gly
370 375 380
Thr Pro Leu Ser Gly Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr
385 390 395 400
Leu Arg Arg Ala Leu Thr Gln Tyr Ala Asp Leu Leu Ser Ser Pro Lys
405 410 415
Lys Ser Ser Leu Leu Ala Leu Ala Ala His Cys Ser Asp Pro Ser Glu
420 425 430
Ala Asp Arg Leu Arg His Leu Ala Ser Pro Ser Gly Lys Asp Glu Tyr
435 440 445
Ala Gln Trp Val Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala
450 455 460
Glu Phe Pro Ser Ala Lys Pro Pro Ile Gly Ala Phe Phe Ala Gly Val
465 470 475 480
Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg
485 490 495
Met Ala Pro Ser Arg Ile His Val Thr Cys Ala Leu Val Phe Glu Lys
500 505 510
Thr Pro Val Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys
515 520 525
Asn Ala Val Pro Leu Asp Glu Ser Arg Asp Cys Ser Trp Ala Pro Ile
530 535 540
Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ala Asp Thr Lys Val Pro
545 550 555 560
Val Leu Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe
565 570 575
Leu Gln Glu Arg Leu Ala Leu Lys Glu Ala Gly Ala Glu Leu Gly Pro
580 585 590
Ala Ile Leu Phe Phe Gly Cys Arg Asn Arg Gln Met Asp Tyr Ile Tyr
595 600 605
Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu Leu
610 615 620
Ile Val Ala Phe Ser Arg Glu Gly Pro Lys Lys Glu Tyr Val Gln His
625 630 635 640
Lys Met Met Glu Lys Ala Ser Asp Ile Trp Asn Met Ile Ser Gln Glu
645 650 655
Gly Tyr Ile Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val
660 665 670
His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp Ser
675 680 685
Ser Lys Thr Glu Ser Met Val Lys Asn Leu Gln Met Asn Gly Arg Tyr
690 695 700
Leu Arg Asp Val Trp
705
<210> 54
<211> 516
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 合成序列
<400> 54
Met Ala Gln Asp Leu Arg Leu Ile Leu Ile Ile Val Gly Ala Ile Ala
1 5 10 15
Ile Ile Ala Leu Leu Val His Gly Phe Leu Leu Ile Lys Arg Ser Ser
20 25 30
Arg Ser Ser Val His Lys Gln Gln Val Leu Leu Ala Ser Leu Pro Pro
35 40 45
Ser Pro Pro Arg Leu Pro Leu Ile Gly Asn Ile His Gln Leu Val Gly
50 55 60
Gly Asn Pro His Arg Ile Leu Leu Gln Leu Ala Arg Thr His Gly Pro
65 70 75 80
Leu Ile Cys Leu Arg Leu Gly Gln Val Asp Gln Val Val Ala Ser Ser
85 90 95
Val Glu Ala Val Glu Glu Ile Ile Lys Arg His Asp Leu Lys Phe Ala
100 105 110
Asp Arg Pro Arg Asp Leu Thr Phe Ser Arg Ile Phe Phe Tyr Asp Gly
115 120 125
Asn Ala Val Val Met Thr Pro Tyr Gly Gly Glu Trp Lys Gln Met Arg
130 135 140
Lys Ile Tyr Ala Met Glu Leu Leu Asn Ser Arg Arg Val Lys Ser Phe
145 150 155 160
Ala Ala Ile Arg Glu Asp Val Ala Arg Lys Leu Thr Gly Glu Ile Ala
165 170 175
His Lys Ala Phe Ala Gln Thr Pro Val Ile Asn Leu Ser Glu Met Val
180 185 190
Met Ser Met Ile Asn Ala Ile Val Ile Arg Val Ala Phe Gly Asp Lys
195 200 205
Cys Lys Gln Gln Ala Tyr Phe Leu His Leu Val Lys Glu Ala Met Ser
210 215 220
Tyr Val Ser Ser Phe Ser Val Ala Asp Met Tyr Pro Ser Leu Lys Phe
225 230 235 240
Leu Asp Thr Leu Thr Gly Leu Lys Ser Lys Leu Glu Gly Val His Gly
245 250 255
Lys Leu Asp Lys Val Phe Asp Glu Ile Ile Ala Gln Arg Gln Ala Ala
260 265 270
Leu Ala Ala Glu Gln Ala Glu Glu Asp Leu Ile Ile Asp Val Leu Leu
275 280 285
Lys Leu Lys Asp Glu Gly Asn Gln Glu Phe Pro Ile Thr Tyr Thr Ser
290 295 300
Val Lys Ala Ile Val Met Glu Ile Phe Leu Ala Gly Thr Glu Thr Ser
305 310 315 320
Ser Ser Val Ile Asp Trp Val Met Ser Glu Leu Ile Lys Asn Pro Lys
325 330 335
Ala Met Glu Lys Val Gln Lys Glu Met Arg Glu Ala Met Gln Gly Lys
340 345 350
Thr Lys Leu Glu Glu Ser Asp Ile Pro Lys Phe Ser Tyr Leu Asn Leu
355 360 365
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Gly Pro Leu Leu Phe
370 375 380
Pro Arg Glu Cys Arg Glu Thr Cys Glu Val Met Gly Tyr Arg Val Pro
385 390 395 400
Ala Gly Ala Arg Leu Leu Ile Asn Ala Phe Ala Leu Ser Arg Asp Glu
405 410 415
Lys Tyr Trp Gly Ser Asp Ala Glu Ser Phe Lys Pro Glu Arg Phe Glu
420 425 430
Gly Ile Ser Val Asp Phe Lys Gly Ser Asn Phe Glu Phe Met Pro Phe
435 440 445
Gly Ala Gly Arg Arg Ile Cys Pro Gly Met Thr Phe Gly Ile Ser Ser
450 455 460
Val Glu Val Ala Leu Ala His Leu Leu Phe His Phe Asp Trp Gln Leu
465 470 475 480
Pro Gln Gly Met Lys Ile Glu Asp Leu Asp Met Met Glu Val Ser Gly
485 490 495
Met Ser Ala Thr Arg Arg Ser Pro Leu Leu Val Leu Ala Lys Leu Ile
500 505 510
Ile Pro Leu Pro
515
<210> 55
<211> 510
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 合成序列
<400> 55
Met Ala Gln Asp Leu Arg Leu Ile Leu Ile Ile Val Gly Ala Ile Ala
1 5 10 15
Ile Ile Ala Leu Leu Val His Gly Phe Leu Lys Ser Ala Val Thr Lys
20 25 30
Pro Lys Leu Asn Leu Pro Pro Gly Pro Trp Thr Leu Pro Leu Ile Gly
35 40 45
Ser Ile His His Ile Val Ser Asn Pro Leu Pro Tyr Arg Ala Met Arg
50 55 60
Glu Leu Ala His Lys His Gly Pro Leu Met Met Leu Trp Leu Gly Glu
65 70 75 80
Val Pro Thr Leu Val Val Ser Ser Pro Glu Ala Ala Gln Ala Ile Thr
85 90 95
Lys Thr His Asp Val Ser Phe Ala Asp Arg His Ile Asn Ser Thr Val
100 105 110
Asp Ile Leu Thr Phe Asn Gly Met Asp Met Val Phe Gly Ser Tyr Gly
115 120 125
Glu Gln Trp Arg Gln Leu Arg Lys Leu Ser Val Leu Glu Leu Leu Ser
130 135 140
Ala Ala Arg Val Gln Ser Phe Gln Arg Ile Arg Glu Glu Glu Val Ala
145 150 155 160
Arg Phe Met Arg Ser Leu Ala Ala Ser Ala Ser Ala Gly Ala Thr Val
165 170 175
Asp Leu Ser Lys Met Ile Ser Ser Phe Ile Asn Asp Thr Phe Val Arg
180 185 190
Glu Ser Ile Gly Ser Arg Cys Lys Tyr Gln Asp Glu Tyr Leu Ala Ala
195 200 205
Leu Asp Thr Ala Ile Arg Val Ala Ala Glu Leu Ser Val Gly Asn Ile
210 215 220
Phe Pro Ser Ser Arg Val Leu Gln Ser Leu Ser Thr Ala Arg Arg Lys
225 230 235 240
Ala Ile Ala Ser Arg Asp Glu Met Ala Arg Ile Leu Gly Gln Ile Ile
245 250 255
Arg Glu Thr Lys Glu Ser Met Asp Gln Gly Asp Lys Thr Ser Asn Glu
260 265 270
Ser Met Ile Ser Val Leu Leu Arg Leu Gln Lys Asp Ala Gly Leu Pro
275 280 285
Ile Glu Leu Thr Asp Asn Val Val Met Ala Leu Met Phe Asp Leu Phe
290 295 300
Gly Ala Gly Ser Asp Thr Ser Ser Thr Thr Leu Thr Trp Cys Met Thr
305 310 315 320
Glu Leu Val Arg Tyr Pro Ala Thr Met Ala Lys Ala Gln Ala Glu Val
325 330 335
Arg Glu Ala Phe Lys Gly Lys Thr Thr Ile Thr Glu Asp Asp Leu Ser
340 345 350
Thr Ala Asn Leu Arg Tyr Leu Lys Leu Val Val Lys Glu Ala Leu Arg
355 360 365
Leu His Cys Pro Val Pro Leu Leu Leu Pro Arg Lys Cys Arg Glu Ala
370 375 380
Cys Gln Val Met Gly Tyr Asp Ile Pro Lys Gly Thr Cys Val Phe Val
385 390 395 400
Asn Val Trp Ala Ile Cys Arg Asp Pro Arg Tyr Trp Glu Asp Ala Glu
405 410 415
Glu Phe Lys Pro Glu Arg Phe Glu Asn Ser Asn Leu Asp Tyr Lys Gly
420 425 430
Thr Tyr Tyr Glu Tyr Leu Pro Phe Gly Ser Gly Arg Arg Met Cys Pro
435 440 445
Gly Ala Asn Leu Gly Val Ala Asn Leu Glu Leu Ala Leu Ala Ser Leu
450 455 460
Leu Tyr His Phe Asp Trp Lys Leu Pro Ser Gly Gln Glu Pro Lys Asp
465 470 475 480
Val Asp Val Trp Glu Ala Ala Gly Leu Val Ala Lys Lys Asn Ile Gly
485 490 495
Leu Val Leu His Pro Val Ser His Ile Ala Pro Val Asn Ala
500 505 510
<210> 56
<211> 511
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 合成序列
<400> 56
Met Ala Gln Asp Leu Arg Leu Ile Leu Ile Ile Val Gly Ala Ile Ala
1 5 10 15
Ile Ile Ala Leu Leu Val His Gly Phe Phe Leu Leu Arg Lys Trp Lys
20 25 30
Asn Ser Asn Ser Gln Ser Lys Lys Leu Pro Pro Gly Pro Trp Lys Leu
35 40 45
Pro Leu Leu Gly Ser Met Leu His Met Val Gly Gly Leu Pro His His
50 55 60
Val Leu Arg Asp Leu Ala Lys Lys Tyr Gly Pro Leu Met His Leu Gln
65 70 75 80
Leu Gly Glu Val Ser Ala Val Val Val Thr Ser Pro Asp Met Ala Lys
85 90 95
Glu Val Leu Lys Thr His Asp Ile Ala Phe Ala Ser Arg Pro Lys Leu
100 105 110
Leu Ala Pro Glu Ile Val Cys Tyr Asn Arg Ser Asp Ile Ala Phe Cys
115 120 125
Pro Tyr Gly Asp Tyr Trp Arg Gln Met Arg Lys Ile Cys Val Leu Glu
130 135 140
Val Leu Ser Ala Lys Asn Val Arg Ser Phe Ser Ser Ile Arg Arg Asp
145 150 155 160
Glu Val Leu Arg Leu Val Asn Phe Val Arg Ser Ser Thr Ser Glu Pro
165 170 175
Val Asn Phe Thr Glu Arg Leu Phe Leu Phe Thr Ser Ser Met Thr Cys
180 185 190
Arg Ser Ala Phe Gly Lys Val Phe Lys Glu Gln Glu Thr Phe Ile Gln
195 200 205
Leu Ile Lys Glu Val Ile Gly Leu Ala Gly Gly Phe Asp Val Ala Asp
210 215 220
Ile Phe Pro Ser Leu Lys Phe Leu His Val Leu Thr Gly Met Glu Gly
225 230 235 240
Lys Ile Met Lys Ala His His Lys Val Asp Ala Ile Val Glu Asp Val
245 250 255
Ile Asn Glu His Lys Lys Asn Leu Ala Met Gly Lys Thr Asn Gly Ala
260 265 270
Leu Gly Gly Glu Asp Leu Ile Asp Val Leu Leu Arg Leu Met Asn Asp
275 280 285
Gly Gly Leu Gln Phe Pro Ile Thr Asn Asp Asn Ile Lys Ala Ile Ile
290 295 300
Phe Asp Met Phe Ala Ala Gly Thr Glu Thr Ser Ser Ser Thr Leu Val
305 310 315 320
Trp Ala Met Val Gln Met Met Arg Asn Pro Thr Ile Leu Ala Lys Ala
325 330 335
Gln Ala Glu Val Arg Glu Ala Phe Lys Gly Lys Glu Thr Phe Asp Glu
340 345 350
Asn Asp Val Glu Glu Leu Lys Tyr Leu Lys Leu Val Ile Lys Glu Thr
355 360 365
Leu Arg Leu His Pro Pro Val Pro Leu Leu Val Pro Arg Glu Cys Arg
370 375 380
Glu Glu Thr Glu Ile Asn Gly Tyr Thr Ile Pro Val Lys Thr Lys Val
385 390 395 400
Met Val Asn Val Trp Ala Leu Gly Arg Asp Pro Lys Tyr Trp Asp Asp
405 410 415
Ala Asp Asn Phe Lys Pro Glu Arg Phe Glu Gln Cys Ser Val Asp Phe
420 425 430
Ile Gly Asn Asn Phe Glu Tyr Leu Pro Phe Gly Gly Gly Arg Arg Ile
435 440 445
Cys Pro Gly Ile Ser Phe Gly Leu Ala Asn Val Tyr Leu Pro Leu Ala
450 455 460
Gln Leu Leu Tyr His Phe Asp Trp Lys Leu Pro Thr Gly Met Glu Pro
465 470 475 480
Lys Asp Leu Asp Leu Thr Glu Leu Val Gly Ile Thr Ile Ala Arg Lys
485 490 495
Ser Asp Leu Met Leu Val Ala Thr Pro Tyr Gln Pro Ser Arg Glu
500 505 510
Claims (37)
1.一种用于产生莎草奥酮的微生物宿主细胞,所述微生物细胞表达异源α-愈创木烯合酶(αGTPS)和异源α-愈创木烯氧化酶(αGOX)。
2.如权利要求1所述的微生物细胞,所述微生物细胞进一步表达法呢基二磷酸合酶。
3.如权利要求2所述的微生物细胞,其中所述αGTPS酶包含SEQ ID NO:1至21中的任一个的氨基酸序列或其变体。
4.如权利要求3所述的微生物细胞,其中α-愈创木烯合酶包含与SEQ ID NO:1至21中的任一个具有50%或更高序列同一性的氨基酸序列。
5.如权利要求3所述的微生物细胞,其中所述αGTPS酶包含与SEQ ID NO:8具有50%或更高序列同一性的氨基酸序列。
6.如权利要求5所述的微生物细胞,其中所述αGTPS酶包含选自相对于SEQ ID NO:8的以下的位置处的一个或多个氨基酸取代:72、273、290、368、371、374、377、381、382、399、406、419、433、442、443、454、512和522。
7.如权利要求6所述的微生物细胞,其中所述αGTPS酶包含选自相对于SEQ ID NO:8的以下的一个或多个氨基酸取代:T72I、M273L、R290K、F368M、I371L、S374A、R377V、Y381W、F382L、I399V、F406L、L419T、V433I、Y442L、I443M、E454K、F512L和K522D。
8.如权利要求4所述的微生物细胞,其中所述α-愈创木烯合酶主要从FPP底物产生α-愈创木烯作为产物。
9.如权利要求1至8中任一项所述的微生物细胞,其中所述αGOX酶是细胞色素P450(CYP450)酶。
10.如权利要求9所述的微生物细胞,其中所述CYP450包含SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26的氨基酸序列或其变体。
11.如权利要求10所述的微生物细胞,其中所述CYP450包含与SEQ ID NO:51、52或22至26中的任一个具有50%或更高序列同一性的氨基酸序列。
12.如权利要求9所述的微生物细胞,其中所述CYP450包含与SEQ ID NO:51具有50%或更高序列同一性的氨基酸序列。
13.如权利要求9所述的微生物细胞,其中所述CYP450包含与SEQ ID NO:52具有50%或更高序列同一性的氨基酸序列。
14.如权利要求1至13中任一项所述的微生物细胞,其中所述微生物宿主细胞表达细胞色素P450还原酶。
15.如权利要求1至14中任一项所述的微生物细胞,其中所述αGTPS和αGOX一起在操纵子中表达。
16.如权利要求1至15中任一项所述的微生物细胞,其中所述微生物宿主细胞进一步表达一种或多种醇脱氢酶(ADH)。
17.如权利要求16所述的微生物细胞,其中所述ADH包含SEQ ID NO:35至44中的任一个的氨基酸序列或其变体。
18.如权利要求11所述的微生物细胞,其中所述ADH包含与SEQ ID NO:43具有50%或更高序列同一性的氨基酸序列。
19.如权利要求1至18中任一项所述的微生物细胞,其中一种或多种酶是从染色体外元件表达的。
20.如权利要求1至18中任一项所述的微生物细胞,其中一种或多种酶是从在染色体上整合的基因表达的。
21.如权利要求1至20中任一项所述的微生物细胞,其中所述微生物宿主细胞过表达甲基赤藓糖醇磷酸(MEP)或甲羟戊酸(MVA)途径中的一种或多种酶。
22.如权利要求1至21中任一项所述的微生物细胞,其中所述微生物细胞是任选地选自以下的细菌:埃希氏菌属、芽孢杆菌属、棒状杆菌属、红细菌属、发酵单胞菌属、弧菌属和假单胞菌属。
23.如权利要求22所述的微生物细胞,其中所述细菌宿主细胞选自大肠杆菌、枯草芽孢杆菌、谷氨酸棒杆菌、荚膜红细菌、类球红细菌、运动发酵单胞菌、需钠弧菌或恶臭假单胞菌。
24.如权利要求1至21中任一项所述的微生物细胞,其中所述微生物宿主细胞是酵母,所述酵母任选地选自酵母属、毕赤酵母属或耶氏酵母属。
25.如权利要求24所述的微生物细胞,其中所述微生物细胞是酿酒酵母、巴斯德毕赤酵母和解脂耶氏酵母。
26.一种用于制备莎草奥酮的方法,所述方法包括:培养如权利要求1至25中任一项所述的微生物细胞,以及回收所述莎草奥酮。
27.如权利要求26所述的方法,其中用C1、C2、C3、C4、C5和/或C6碳底物培养所述微生物细胞。
28.如权利要求27所述的方法,其中所述碳源是葡萄糖、蔗糖、果糖、木糖和/或甘油。
29.如权利要求26至28中任一项所述的方法,其中培养条件选自需氧、微需氧和厌氧条件。
30.如权利要求29所述的方法,其中所述微生物细胞在22℃与37℃之间的温度下培养。
31.一种用于产生莎草奥酮的方法,所述方法包括将α-愈创木烯送入表达α-愈创木烯氧化酶(αGOX)的微生物细胞或所述细胞的提取物或包含重组αGOX的反应容器中,其中所述αGOX任选地包含SEQ ID NO:51、SEQ ID NO:52、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26的氨基酸序列或其变体。
32.如权利要求31所述的方法,其中所述αGOX是包含与SEQ ID NO:51、SEQ ID NO:52或SEQ ID NO:22至26中的任一个具有50%或更高序列同一性的氨基酸序列的CYP450。
33.如权利要求31所述的方法,其中所述αGOX是非血红素铁加氧酶(NHIO)或漆酶。
34.如权利要求31至33中任一项所述的方法,其中所述微生物细胞表达一种或多种醇脱氢酶。
35.如权利要求31至33中任一项所述的方法,其中所述提取物或反应容器还包含一种或多种醇脱氢酶。
36.如权利要求33或34所述的方法,其中所述醇脱氢酶包含选自SEQ ID NO:35-44的氨基酸序列或其变体。
37.如权利要求31至36中任一项所述的方法,所述方法还包括回收莎草奥酮。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862727815P | 2018-09-06 | 2018-09-06 | |
US62/727,815 | 2018-09-06 | ||
PCT/US2019/050004 WO2020051488A1 (en) | 2018-09-06 | 2019-09-06 | Microbial production of rotundone |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113195726A true CN113195726A (zh) | 2021-07-30 |
Family
ID=69722778
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201980060571.5A Pending CN113195726A (zh) | 2018-09-06 | 2019-09-06 | 莎草奥酮的微生物生产 |
Country Status (5)
Country | Link |
---|---|
US (2) | US11618908B2 (zh) |
EP (1) | EP3847268A4 (zh) |
JP (1) | JP2022500018A (zh) |
CN (1) | CN113195726A (zh) |
WO (1) | WO2020051488A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117925700A (zh) * | 2024-03-22 | 2024-04-26 | 三亚中国农业科学院国家南繁研究院 | GhTPS6基因在调控棉花黄萎病抗性中的应用 |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3183353A2 (en) * | 2014-08-21 | 2017-06-28 | Givaudan S.A. | Process for producing oxygenated products of valencene |
CA3130763A1 (en) | 2019-02-25 | 2020-09-03 | Ginkgo Bioworks, Inc. | Biosynthesis of cannabinoids and cannabinoid precursors |
CN112921024B (zh) * | 2021-04-22 | 2022-10-14 | 杭州师范大学 | 一种α-愈创木烯合成酶、基因及应用 |
EP4337768A1 (en) * | 2021-05-11 | 2024-03-20 | Manus Bio Inc. | Enzymes, host cells, and methods for production of rotundone and other terpenoids |
WO2023097167A1 (en) * | 2021-11-24 | 2023-06-01 | Ginkgo Bioworks, Inc. | Engineered sesquiterpene synthases |
NL2031120B1 (en) * | 2022-02-16 | 2023-08-22 | Sestina Bio Llc | Engineered alpha-guaiene synthases |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2016154502A (ja) * | 2015-02-25 | 2016-09-01 | 神戸天然物化学株式会社 | 酸化セスキテルペンの生産およびその利用 |
CN107002109A (zh) * | 2014-08-21 | 2017-08-01 | 马努斯生物合成股份有限公司 | 含氧萜烯的生产方法 |
JP2017216974A (ja) * | 2016-06-10 | 2017-12-14 | 長谷川香料株式会社 | (−)−ロタンドンの製造方法 |
WO2018144996A1 (en) * | 2017-02-03 | 2018-08-09 | Manus Bio, Inc. | Metabolic engineering for microbial production of terpenoid products |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8927241B2 (en) | 2009-11-10 | 2015-01-06 | Massachusetts Institute Of Technology | Microbial engineering for the production of chemical and pharmaceutical products from the isoprenoid pathway |
EP2499257A1 (en) | 2009-11-10 | 2012-09-19 | Massachusetts Institute of Technology | Microbial engineering for the production of chemical and pharmaceutical products from the isoprenoid pathway |
ES2625861T3 (es) * | 2011-11-01 | 2017-07-20 | Firmenich Sa | Citocromo p450 y su uso para la oxidación enzimática de terpenos |
EP2970934B1 (en) * | 2013-03-14 | 2017-08-16 | Evolva, Inc. | Valencene synthase polypeptides, encoding nucleic acid molecules and uses thereof |
CN107109453A (zh) | 2014-11-05 | 2017-08-29 | 马努斯生物合成股份有限公司 | 甜菊醇糖苷的微生物产生 |
ES2959560T3 (es) | 2015-08-21 | 2024-02-27 | Manus Bio Inc | Aumento de la productividad de células hospedadoras de E. coli que expresan funcionalmente enzimas P450 |
US20210161092A1 (en) | 2016-02-16 | 2021-06-03 | Ajikumar Parayil KUMARAN | Secondary metabolite screening system |
WO2018140778A1 (en) | 2017-01-26 | 2018-08-02 | Manus Bio, Inc. | Metabolic engineering for microbial production of terpenoid products |
EP3495489A1 (en) * | 2017-12-05 | 2019-06-12 | Givaudan SA | Production of guaiene and rotundone |
-
2019
- 2019-09-06 WO PCT/US2019/050004 patent/WO2020051488A1/en unknown
- 2019-09-06 US US17/273,567 patent/US11618908B2/en active Active
- 2019-09-06 JP JP2021512555A patent/JP2022500018A/ja active Pending
- 2019-09-06 CN CN201980060571.5A patent/CN113195726A/zh active Pending
- 2019-09-06 EP EP19856874.3A patent/EP3847268A4/en active Pending
-
2023
- 2023-02-22 US US18/172,594 patent/US20230183760A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107002109A (zh) * | 2014-08-21 | 2017-08-01 | 马努斯生物合成股份有限公司 | 含氧萜烯的生产方法 |
JP2016154502A (ja) * | 2015-02-25 | 2016-09-01 | 神戸天然物化学株式会社 | 酸化セスキテルペンの生産およびその利用 |
JP2017216974A (ja) * | 2016-06-10 | 2017-12-14 | 長谷川香料株式会社 | (−)−ロタンドンの製造方法 |
WO2018144996A1 (en) * | 2017-02-03 | 2018-08-09 | Manus Bio, Inc. | Metabolic engineering for microbial production of terpenoid products |
Non-Patent Citations (1)
Title |
---|
DAMIAN PAUL DREW等: "Two key polymorphisms in a newly discovered allele of the Vitis vinifera TPS24 gene are responsible for the production of the rotundone precursor α-guaiene", 《JOURNAL OF EXPERIMENTAL BOTANY》, vol. 67, no. 3, pages 799 - 808, XP055473823, DOI: 10.1093/jxb/erv491 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117925700A (zh) * | 2024-03-22 | 2024-04-26 | 三亚中国农业科学院国家南繁研究院 | GhTPS6基因在调控棉花黄萎病抗性中的应用 |
CN117925700B (zh) * | 2024-03-22 | 2024-05-28 | 三亚中国农业科学院国家南繁研究院 | GhTPS6基因在调控棉花黄萎病抗性中的应用 |
Also Published As
Publication number | Publication date |
---|---|
US11618908B2 (en) | 2023-04-04 |
US20230183760A1 (en) | 2023-06-15 |
US20210254107A1 (en) | 2021-08-19 |
JP2022500018A (ja) | 2022-01-04 |
EP3847268A1 (en) | 2021-07-14 |
EP3847268A4 (en) | 2022-06-08 |
WO2020051488A1 (en) | 2020-03-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11952608B2 (en) | Methods for production of oxygenated terpenes | |
US20230183760A1 (en) | Microbial production of rotundone | |
US20210147895A1 (en) | Method of producing terpenes or terpenoids | |
US9969999B2 (en) | Method for producing alpha-santalene | |
US9714440B2 (en) | Method for producing patchoulol and 7-epi-α-selinene | |
WO2019092388A1 (en) | Synthesis of monoterpenoid ester compounds | |
Yang et al. | A homomeric geranyl diphosphate synthase-encoding gene from Camptotheca acuminata and its combinatorial optimization for production of geraniol in Escherichia coli | |
WO2022240995A1 (en) | Enzymes, host cells, and methods for production of rotundone and other terpenoids | |
Mikš-Krajnik et al. | Microbial production of flavors and fragrances | |
WO2022046994A1 (en) | Microbial production of artemisinic acid and derivatives | |
CN115449527A (zh) | 氧化还原酶作为圆柚醇脱氢酶在生物合成圆柚酮中的应用 | |
Sun et al. | Mevalonate/2-Methylerythritol 4-Phosphate Pathways and Their Metabolic Engineering Applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |