CN116783289A - Methods and cells for producing volatile compounds - Google Patents
Methods and cells for producing volatile compounds
- Publication number
- CN116783289A (application CN202180071428.3A)
- Authority
- CN
- China
- Prior art keywords
- seq
- coa
- identity
- similarity
- acyl
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 82
- 150000001875 compounds Chemical class 0.000 title claims abstract description 56
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 claims abstract description 459
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims abstract description 222
- ZWEHNKRNPOVVGH-UHFFFAOYSA-N 2-Butanone Chemical compound CCC(C)=O ZWEHNKRNPOVVGH-UHFFFAOYSA-N 0.000 claims abstract description 176
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 85
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 77
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 77
- 239000013598 vector Substances 0.000 claims abstract description 26
- 108090000790 Enzymes Proteins 0.000 claims description 324
- 102000004190 Enzymes Human genes 0.000 claims description 323
- 108030002957 Acetate CoA-transferases Proteins 0.000 claims description 190
- 108010006229 Acetyl-CoA C-acetyltransferase Proteins 0.000 claims description 185
- 230000000694 effects Effects 0.000 claims description 155
- 102000005460 3-oxoacid CoA-transferase Human genes 0.000 claims description 154
- 108020002872 3-oxoacid CoA-transferase Proteins 0.000 claims description 154
- 108091033319 polynucleotide Proteins 0.000 claims description 148
- 102000040430 polynucleotide Human genes 0.000 claims description 148
- 239000002157 polynucleotide Substances 0.000 claims description 148
- 108091022873 acetoacetate decarboxylase Proteins 0.000 claims description 95
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 claims description 77
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 claims description 70
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 claims description 66
- 101000782838 Arabidopsis thaliana Acyl-CoA hydrolase 2 Proteins 0.000 claims description 61
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 56
- 102000005345 Acetyl-CoA C-acetyltransferase Human genes 0.000 claims description 54
- 241000193446 Thermoanaerobacterium thermosaccharolyticum Species 0.000 claims description 54
- 241000626621 Geobacillus Species 0.000 claims description 50
- 241000894006 Bacteria Species 0.000 claims description 45
- 101710088194 Dehydrogenase Proteins 0.000 claims description 38
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 claims description 37
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 claims description 34
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 34
- 239000008103 glucose Substances 0.000 claims description 34
- 241000186339 Thermoanaerobacter Species 0.000 claims description 33
- 230000001580 bacterial effect Effects 0.000 claims description 30
- 239000000203 mixture Substances 0.000 claims description 30
- QAQREVBBADEHPA-IEXPHMLFSA-N propionyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 QAQREVBBADEHPA-IEXPHMLFSA-N 0.000 claims description 30
- 241000193403 Clostridium Species 0.000 claims description 28
- 241000193385 Geobacillus stearothermophilus Species 0.000 claims description 28
- 108010069175 acyl-CoA transferase Proteins 0.000 claims description 26
- 239000001913 cellulose Substances 0.000 claims description 24
- 229920002678 cellulose Polymers 0.000 claims description 24
- 125000002252 acyl group Chemical group 0.000 claims description 20
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 claims description 19
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 claims description 19
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 claims description 19
- 239000000758 substrate Substances 0.000 claims description 19
- 102000004357 Transferases Human genes 0.000 claims description 18
- 108090000992 Transferases Proteins 0.000 claims description 18
- 241000205101 Sulfolobus Species 0.000 claims description 17
- 235000019260 propionic acid Nutrition 0.000 claims description 16
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 claims description 16
- 244000063299 Bacillus subtilis Species 0.000 claims description 15
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 15
- 241000193448 Ruminiclostridium thermocellum Species 0.000 claims description 13
- 241000204652 Thermotoga Species 0.000 claims description 13
- 241000205188 Thermococcus Species 0.000 claims description 12
- 241000589596 Thermus Species 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 12
- 241000203069 Archaea Species 0.000 claims description 10
- 241000193744 Bacillus amyloliquefaciens Species 0.000 claims description 10
- 239000002028 Biomass Substances 0.000 claims description 10
- 230000001461 cytolytic effect Effects 0.000 claims description 10
- 230000012010 growth Effects 0.000 claims description 10
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 9
- 241000282326 Felis catus Species 0.000 claims description 9
- 241000762460 Pseudothermotoga lettingae Species 0.000 claims description 9
- 241000193749 Bacillus coagulans Species 0.000 claims description 8
- 241000194108 Bacillus licheniformis Species 0.000 claims description 8
- 108010056771 Glucosidases Proteins 0.000 claims description 8
- 102000004366 Glucosidases Human genes 0.000 claims description 8
- 241001148569 Rhodothermus Species 0.000 claims description 8
- 241000204666 Thermotoga maritima Species 0.000 claims description 8
- 229940054340 bacillus coagulans Drugs 0.000 claims description 8
- 238000012258 culturing Methods 0.000 claims description 8
- 241000193398 Bacillus methanolicus Species 0.000 claims description 7
- 229910052799 carbon Inorganic materials 0.000 claims description 7
- 230000002194 synthesizing effect Effects 0.000 claims description 7
- 241000588771 Morganella <proteobacterium> Species 0.000 claims description 6
- 241000205156 Pyrococcus furiosus Species 0.000 claims description 6
- 241001137870 Thermoanaerobacterium Species 0.000 claims description 6
- 241000194103 Bacillus pumilus Species 0.000 claims description 5
- 241000589516 Pseudomonas Species 0.000 claims description 5
- 241000589500 Thermus aquaticus Species 0.000 claims description 5
- 241001468175 Geobacillus thermodenitrificans Species 0.000 claims description 4
- 241000193459 Moorella thermoacetica Species 0.000 claims description 4
- 241000186544 Moorella thermoautotrophica Species 0.000 claims description 4
- OSWRVYBYIGOAEZ-UHFFFAOYSA-N acetic acid;2-hydroxypropanoic acid Chemical compound CC(O)=O.CC(O)C(O)=O OSWRVYBYIGOAEZ-UHFFFAOYSA-N 0.000 claims description 4
- 239000000413 hydrolysate Substances 0.000 claims description 4
- 239000000052 vinegar Substances 0.000 claims description 4
- 235000021419 vinegar Nutrition 0.000 claims description 4
- 235000001674 Agaricus brunnescens Nutrition 0.000 claims description 3
- 241001626813 Anoxybacillus Species 0.000 claims description 3
- 241000193399 Bacillus smithii Species 0.000 claims description 3
- 241001058118 Caldanaerobacter Species 0.000 claims description 3
- 241001429558 Caldicellulosiruptor bescii Species 0.000 claims description 3
- 241000511679 Caldicellulosiruptor lactoaceticus Species 0.000 claims description 3
- 241000556413 Caldicellulosiruptor owensensis Species 0.000 claims description 3
- 241000192731 Chloroflexus aurantiacus Species 0.000 claims description 3
- 241000193468 Clostridium perfringens Species 0.000 claims description 3
- 241000193419 Geobacillus kaustophilus Species 0.000 claims description 3
- 241001468249 Geobacillus thermocatenulatus Species 0.000 claims description 3
- 241001468176 Geobacillus thermoleovorans Species 0.000 claims description 3
- 241000178985 Moorella Species 0.000 claims description 3
- 241000193390 Parageobacillus thermoglucosidasius Species 0.000 claims description 3
- 241000866625 Polymorphus Species 0.000 claims description 3
- 241000252565 Pseudothermotoga thermarum Species 0.000 claims description 3
- 241000205160 Pyrococcus Species 0.000 claims description 3
- 241001148023 Pyrococcus abyssi Species 0.000 claims description 3
- 241001148570 Rhodothermus marinus Species 0.000 claims description 3
- 241000205098 Sulfolobus acidocaldarius Species 0.000 claims description 3
- 241000167564 Sulfolobus islandicus Species 0.000 claims description 3
- 241000205091 Sulfolobus solfataricus Species 0.000 claims description 3
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 claims description 3
- 241001147773 Thermoanaerobacterium xylanolyticum Species 0.000 claims description 3
- 241000545779 Thermococcus barophilus Species 0.000 claims description 3
- 241001235254 Thermococcus kodakarensis Species 0.000 claims description 3
- 241000589499 Thermus thermophilus Species 0.000 claims description 3
- 230000001651 autotrophic effect Effects 0.000 claims description 3
- 150000001720 carbohydrates Chemical class 0.000 claims description 3
- 229910052717 sulfur Inorganic materials 0.000 claims description 3
- 239000011593 sulfur Substances 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 abstract description 41
- 230000000813 microbial effect Effects 0.000 abstract description 5
- 210000004027 cell Anatomy 0.000 description 396
- 229940088598 enzyme Drugs 0.000 description 311
- 102100026105 3-ketoacyl-CoA thiolase, mitochondrial Human genes 0.000 description 131
- 239000002609 medium Substances 0.000 description 40
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 33
- 238000006243 chemical reaction Methods 0.000 description 33
- 108010050848 glycylleucine Proteins 0.000 description 25
- 108060008225 Thiolase Proteins 0.000 description 22
- 102000002932 Thiolase Human genes 0.000 description 21
- 210000005266 circulating tumour cell Anatomy 0.000 description 21
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 21
- 108010047495 alanylglycine Proteins 0.000 description 20
- 238000000855 fermentation Methods 0.000 description 20
- 230000004151 fermentation Effects 0.000 description 19
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 18
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 17
- 241000193401 Clostridium acetobutylicum Species 0.000 description 16
- 230000037361 pathway Effects 0.000 description 15
- 229940095574 propionic acid Drugs 0.000 description 15
- 108010053725 prolylvaline Proteins 0.000 description 14
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- 108010008355 arginyl-glutamine Proteins 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 108090000623 proteins and genes Proteins 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 238000002835 absorbance Methods 0.000 description 12
- OJFDKHTZOUZBOS-CITAKDKDSA-N acetoacetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 OJFDKHTZOUZBOS-CITAKDKDSA-N 0.000 description 12
- 108010038633 aspartylglutamate Proteins 0.000 description 12
- 239000000194 fatty acid Substances 0.000 description 12
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 11
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 11
- 108010005233 alanylglutamic acid Proteins 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 11
- 108010054155 lysyllysine Proteins 0.000 description 11
- 108010005942 methionylglycine Proteins 0.000 description 11
- 241000223257 Thermomyces Species 0.000 description 10
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 10
- 235000014113 dietary fatty acids Nutrition 0.000 description 10
- 229930195729 fatty acid Natural products 0.000 description 10
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 9
- 241000588724 Escherichia coli Species 0.000 description 9
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 150000004665 fatty acids Chemical class 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 9
- 108010049041 glutamylalanine Proteins 0.000 description 9
- 108010009298 lysylglutamic acid Proteins 0.000 description 9
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 8
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 8
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 8
- 229920002488 Hemicellulose Polymers 0.000 description 8
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 229940041514 candida albicans extract Drugs 0.000 description 8
- 108010016616 cysteinylglycine Proteins 0.000 description 8
- 239000007789 gas Substances 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 244000005700 microbiome Species 0.000 description 8
- 241000894007 species Species 0.000 description 8
- 235000000346 sugar Nutrition 0.000 description 8
- 108010073969 valyllysine Proteins 0.000 description 8
- 239000012138 yeast extract Substances 0.000 description 8
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 7
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 7
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 7
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 7
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 7
- 238000006460 hydrolysis reaction Methods 0.000 description 7
- 230000001965 increasing effect Effects 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- FHSUFDYFOHSYHI-UHFFFAOYSA-N 3-oxopentanoic acid Chemical compound CCC(=O)CC(O)=O FHSUFDYFOHSYHI-UHFFFAOYSA-N 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 6
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 6
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 6
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 6
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 6
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 6
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- 230000001939 inductive effect Effects 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- WIOQNWTZBOQTEU-ZMHDXICWSA-N s-[2-[3-[[(2r)-4-[[[(2r,3s,4r,5r)-5-(6-aminopurin-9-yl)-4-hydroxy-3-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxy-2-hydroxy-3,3-dimethylbutanoyl]amino]propanoylamino]ethyl] 3-oxopentanethioate Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(=O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 WIOQNWTZBOQTEU-ZMHDXICWSA-N 0.000 description 6
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 5
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 5
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 5
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 5
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 5
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 5
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 5
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 5
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 5
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 5
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 5
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 5
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 5
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 5
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 5
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 5
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 5
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 5
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 5
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 5
- XBDQKXXYIPTUBI-UHFFFAOYSA-M Propionate Chemical compound CCC([O-])=O XBDQKXXYIPTUBI-UHFFFAOYSA-M 0.000 description 5
- 241000235527 Rhizopus Species 0.000 description 5
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 5
- 230000000789 acetogenic effect Effects 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 150000001413 amino acids Chemical class 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 5
- 239000012531 culture fluid Substances 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- 108010079547 glutamylmethionine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010020688 glycylhistidine Proteins 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 230000007062 hydrolysis Effects 0.000 description 5
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 4
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 4
- 102000002226 Alkyl and Aryl Transferases Human genes 0.000 description 4
- 108010014722 Alkyl and Aryl Transferases Proteins 0.000 description 4
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 4
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 4
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 4
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 4
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 4
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 4
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 4
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 4
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 4
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 4
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 4
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 4
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 4
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 4
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 4
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 4
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 4
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 4
- 241000135044 Thermobifida fusca YX Species 0.000 description 4
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 4
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 4
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 4
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 4
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 4
- WDJHALXBUFZDSR-UHFFFAOYSA-M acetoacetate Chemical compound CC(=O)CC([O-])=O WDJHALXBUFZDSR-UHFFFAOYSA-M 0.000 description 4
- WDJHALXBUFZDSR-UHFFFAOYSA-N acetoacetic acid Chemical compound CC(=O)CC(O)=O WDJHALXBUFZDSR-UHFFFAOYSA-N 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 238000009835 boiling Methods 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 239000001569 carbon dioxide Substances 0.000 description 4
- 229910002092 carbon dioxide Inorganic materials 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 238000006114 decarboxylation reaction Methods 0.000 description 4
- -1 fatty acids 3-oxovalerate Chemical class 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 239000012978 lignocellulosic material Substances 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 4
- 239000003039 volatile agent Substances 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- KIUMMUBSPKGMOY-UHFFFAOYSA-N 3,3'-Dithiobis(6-nitrobenzoic acid) Chemical compound C1=C([N+]([O-])=O)C(C(=O)O)=CC(SSC=2C=C(C(=CC=2)[N+]([O-])=O)C(O)=O)=C1 KIUMMUBSPKGMOY-UHFFFAOYSA-N 0.000 description 3
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 3
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 3
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 3
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 3
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 3
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 3
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 3
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 3
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 description 3
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 3
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 3
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 3
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 3
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 3
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 3
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 3
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 3
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 3
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 3
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 3
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 3
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- BCUVPZLLSRMPJL-XIRDDKMYSA-N Leu-Trp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N BCUVPZLLSRMPJL-XIRDDKMYSA-N 0.000 description 3
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 3
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 3
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 3
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 3
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 3
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 3
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 3
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 244000188014 Spathodea campanulata Species 0.000 description 3
- 235000017899 Spathodea campanulata Nutrition 0.000 description 3
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 3
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 3
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 3
- 150000004729 acetoacetic acid derivatives Chemical class 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 108010009297 diglycyl-histidine Proteins 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000007071 enzymatic hydrolysis Effects 0.000 description 3
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 3
- 238000004880 explosion Methods 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 3
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 3
- 150000002402 hexoses Chemical class 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 229920005610 lignin Polymers 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000006384 oligomerization reaction Methods 0.000 description 3
- 150000002972 pentoses Chemical class 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 239000007320 rich medium Substances 0.000 description 3
- 125000003396 thiol group Chemical group [H]S* 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 229940035893 uracil Drugs 0.000 description 3
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- 101710161460 3-oxoacyl-[acyl-carrier-protein] synthase Proteins 0.000 description 2
- 241000501828 Acidocella Species 0.000 description 2
- 101710120269 Acyl-CoA thioester hydrolase YbgC Proteins 0.000 description 2
- 108030003177 Acyl-CoA:acyl-CoA alkyltransferases Proteins 0.000 description 2
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 2
- 101710144623 Acyl-[acyl-carrier-protein] hydrolase Proteins 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- BGDILZXXDJCKPF-CIUDSAMLSA-N Arg-Gln-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O BGDILZXXDJCKPF-CIUDSAMLSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 2
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 2
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- UGFAIRIUMAVXCW-UHFFFAOYSA-N Carbon monoxide Chemical compound [O+]#[C-] UGFAIRIUMAVXCW-UHFFFAOYSA-N 0.000 description 2
- 108010059892 Cellulase Proteins 0.000 description 2
- 108010084185 Cellulases Proteins 0.000 description 2
- 102000005575 Cellulases Human genes 0.000 description 2
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 2
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 2
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 2
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 2
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 2
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 2
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 2
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 2
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 2
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 2
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- 240000001046 Lactobacillus acidophilus Species 0.000 description 2
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 2
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 2
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 2
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 2
- UDOYVQQKQHZYMB-DCAQKATOSA-N Met-Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDOYVQQKQHZYMB-DCAQKATOSA-N 0.000 description 2
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 2
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 2
- 241000588621 Moraxella Species 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 241001017226 Pseudothermotoga lettingae TMO Species 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 2
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 2
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 2
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 241001327213 [Bacillus] clarkii Species 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 238000005903 acid hydrolysis reaction Methods 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 229910002091 carbon monoxide Inorganic materials 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000012824 chemical production Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 238000009833 condensation Methods 0.000 description 2
- 230000005494 condensation Effects 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- HYBBIBNJHNGZAN-UHFFFAOYSA-N furfural Chemical compound O=CC1=CC=CO1 HYBBIBNJHNGZAN-UHFFFAOYSA-N 0.000 description 2
- GAEKPEKOJKCEMS-UHFFFAOYSA-N gamma-valerolactone Chemical compound CC1CCC(=O)O1 GAEKPEKOJKCEMS-UHFFFAOYSA-N 0.000 description 2
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010002430 hemicellulase Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 150000004715 keto acids Chemical class 0.000 description 2
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- PCBAHNJKZYRMBM-UHFFFAOYSA-M lithium 3-oxopentanoate Chemical compound O=C(CC(=O)[O-])CC.[Li+] PCBAHNJKZYRMBM-UHFFFAOYSA-M 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 238000012269 metabolic engineering Methods 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 231100000419 toxicity Toxicity 0.000 description 2
- 230000001988 toxicity Effects 0.000 description 2
- 239000011573 trace mineral Substances 0.000 description 2
- 235000013619 trace mineral Nutrition 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 241001148471 unidentified anaerobic bacterium Species 0.000 description 2
- 238000009279 wet oxidation reaction Methods 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- KJTLQQUUPVSXIM-ZCFIWIBFSA-M (R)-mevalonate Chemical compound OCC[C@](O)(C)CC([O-])=O KJTLQQUUPVSXIM-ZCFIWIBFSA-M 0.000 description 1
- UZDMJOILBYFRMP-UHFFFAOYSA-N 2-[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]propanoylamino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NC(C)C(=O)NC(C(O)=O)C(C)CC UZDMJOILBYFRMP-UHFFFAOYSA-N 0.000 description 1
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 1
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 1
- MKRXAIMALGQSHI-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylpentanoyl)amino]-3-methylpentanoyl]amino]-3-methylbutanoyl]amino]-3-methylbutanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C(C)CC)C(=O)NC(C(C)C)C(=O)NC(C(C)C)C(O)=O MKRXAIMALGQSHI-UHFFFAOYSA-N 0.000 description 1
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- LOTVQXNRIAEYCG-UHFFFAOYSA-N 3-hydroxy-2-(hydroxymethyl)-2-[hydroxymethyl(methyl)amino]propanoic acid Chemical compound OCN(C)C(CO)(CO)C(O)=O LOTVQXNRIAEYCG-UHFFFAOYSA-N 0.000 description 1
- 241001468163 Acetobacterium woodii Species 0.000 description 1
- 108010043467 Acetone carboxylase Proteins 0.000 description 1
- 108020003549 Acetyl-CoA hydrolase/transferase Proteins 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- QEKBCDODJBBWHV-GUBZILKMSA-N Arg-Arg-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O QEKBCDODJBBWHV-GUBZILKMSA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- RYAOESLKINBXFH-CMOCDZPBSA-N Arg-Phe-Phe-Cys Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 RYAOESLKINBXFH-CMOCDZPBSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- LUJQEUOZJUWRRX-BPUTZDHNSA-N Asn-Trp-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O LUJQEUOZJUWRRX-BPUTZDHNSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- IAMNNSSEBXDJMN-CIUDSAMLSA-N Asp-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N IAMNNSSEBXDJMN-CIUDSAMLSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 102000011802 Beta-ketoacyl synthases Human genes 0.000 description 1
- 108050002233 Beta-ketoacyl synthases Proteins 0.000 description 1
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 1
- 229920003043 Cellulose fiber Polymers 0.000 description 1
- 238000003512 Claisen condensation reaction Methods 0.000 description 1
- 241001112695 Clostridiales Species 0.000 description 1
- 241001478240 Coccus Species 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 1
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 1
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 1
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 1
- HJGUQJJJXQGXGJ-FXQIFTODSA-N Cys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HJGUQJJJXQGXGJ-FXQIFTODSA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- YFKWIIRWHGKSQQ-WFBYXXMGSA-N Cys-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N YFKWIIRWHGKSQQ-WFBYXXMGSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- KJTLQQUUPVSXIM-UHFFFAOYSA-N DL-mevalonic acid Natural products OCCC(O)(C)CC(O)=O KJTLQQUUPVSXIM-UHFFFAOYSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108020005199 Dehydrogenases Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- QDXMSSWCEVYOLZ-SZMVWBNQSA-N Gln-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QDXMSSWCEVYOLZ-SZMVWBNQSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- KFHASAPTUOASQN-JYJNAYRXSA-N Gln-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KFHASAPTUOASQN-JYJNAYRXSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- YDJOULGWHQRPEV-SRVKXCTJSA-N Glu-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N YDJOULGWHQRPEV-SRVKXCTJSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- ZSIDREAPEPAPKL-XIRDDKMYSA-N Glu-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N ZSIDREAPEPAPKL-XIRDDKMYSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- VILQXLMSDPJBFR-IUCAKERBSA-N Gly-Gly-Cys-His Natural products NCC(=O)NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)O VILQXLMSDPJBFR-IUCAKERBSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 241000168525 Haematococcus Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 1
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 1
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 1
- WCNXUTNLSRWWQN-DCAQKATOSA-N His-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WCNXUTNLSRWWQN-DCAQKATOSA-N 0.000 description 1
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- UQTKYYNHMVAOAA-HJPIBITLSA-N His-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N UQTKYYNHMVAOAA-HJPIBITLSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- MIHTTYXBXIRRGV-AVGNSLFASA-N His-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MIHTTYXBXIRRGV-AVGNSLFASA-N 0.000 description 1
- YXASFUBDSDAXQD-UWVGGRQHSA-N His-Met-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O YXASFUBDSDAXQD-UWVGGRQHSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- DMAPKBANYNZHNR-ULQDDVLXSA-N His-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DMAPKBANYNZHNR-ULQDDVLXSA-N 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- ZXJFURYTPZMUNY-VKOGCVSHSA-N Ile-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 ZXJFURYTPZMUNY-VKOGCVSHSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 102000003855 L-lactate dehydrogenase Human genes 0.000 description 1
- 108700023483 L-lactate dehydrogenases Proteins 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- DZTDEZSHBVRUCQ-FXQIFTODSA-N Met-Asp-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DZTDEZSHBVRUCQ-FXQIFTODSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- CNUPMMXDISGXMU-CIUDSAMLSA-N Met-Cys-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O CNUPMMXDISGXMU-CIUDSAMLSA-N 0.000 description 1
- GTRWUQSSISWRTL-NAKRPEOUSA-N Met-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N GTRWUQSSISWRTL-NAKRPEOUSA-N 0.000 description 1
- PTYVBBNIAQWUFV-DCAQKATOSA-N Met-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N PTYVBBNIAQWUFV-DCAQKATOSA-N 0.000 description 1
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 1
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- GVIVXNFKJQFTCE-YUMQZZPRSA-N Met-Gly-Gln Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O GVIVXNFKJQFTCE-YUMQZZPRSA-N 0.000 description 1
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 1
- MHQXIBRPDKXDGZ-ZFWWWQNUSA-N Met-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MHQXIBRPDKXDGZ-ZFWWWQNUSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 1
- FTQOFRPGLYXRFM-CYDGBPFRSA-N Met-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCSC)N FTQOFRPGLYXRFM-CYDGBPFRSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 1
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 240000005373 Panax quinquefolius Species 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 108010019160 Pancreatin Proteins 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- NWVMQNAELALJFW-RNXOBYDBSA-N Phe-Trp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NWVMQNAELALJFW-RNXOBYDBSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- 241000287502 Phoenicopteriformes Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 101100142275 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) RPL1A gene Proteins 0.000 description 1
- 101100476983 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SDT1 gene Proteins 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 238000010793 Steam injection (oil industry) Methods 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 241000724227 Sulfurifustis variabilis Species 0.000 description 1
- 241000204649 Thermoanaerobacter kivui Species 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- SEQKRHFRPICQDD-UHFFFAOYSA-N Tricine Natural products OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 1
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 1
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 1
- WCTYCXZYBNKEIV-SXNHZJKMSA-N Trp-Glu-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 WCTYCXZYBNKEIV-SXNHZJKMSA-N 0.000 description 1
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 1
- WNGMGTMSUBARLB-RXVVDRJESA-N Trp-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(=O)NCC(O)=O)=CNC2=C1 WNGMGTMSUBARLB-RXVVDRJESA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- XBWKCYFGRXKWGO-SRVKXCTJSA-N Tyr-Cys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XBWKCYFGRXKWGO-SRVKXCTJSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 1
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- HLBHFAWNMAQGNO-AVGNSLFASA-N Val-His-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N HLBHFAWNMAQGNO-AVGNSLFASA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010058033 alcohol dehydrogenase (NADP+) Proteins 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 150000004718 beta keto acids Chemical class 0.000 description 1
- 238000010364 biochemical engineering Methods 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 101150068947 cac gene Proteins 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000007073 chemical hydrolysis Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 101150082482 ctc gene Proteins 0.000 description 1
- 238000010543 cumene process Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000006477 desulfuration reaction Methods 0.000 description 1
- 230000023556 desulfurization Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 230000000763 evoking effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000012262 fermentative production Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000010921 garden waste Substances 0.000 description 1
- 239000007792 gaseous phase Substances 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000034659 glycolysis Effects 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 239000004463 hay Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 229940059442 hemicellulase Drugs 0.000 description 1
- 229920000140 heteropolymer Polymers 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 150000002431 hydrogen Chemical class 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000000752 ionisation method Methods 0.000 description 1
- 235000015141 kefir Nutrition 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000002029 lignocellulosic biomass Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 230000001320 lysogenic effect Effects 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- LQNUZADURLCDLV-UHFFFAOYSA-N nitrobenzene Substances [O-][N+](=O)C1=CC=CC=C1 LQNUZADURLCDLV-UHFFFAOYSA-N 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 229940055695 pancreatin Drugs 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000003348 petrochemical agent Substances 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 238000011027 product recovery Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000012047 saturated solution Substances 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 229940054269 sodium pyruvate Drugs 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000008174 sterile solution Substances 0.000 description 1
- 239000010902 straw Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-L succinate(2-) Chemical compound [O-]C(=O)CCC([O-])=O KDYFGRWQOYBRFD-UHFFFAOYSA-L 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000010555 transalkylation reaction Methods 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 239000002912 waste gas Substances 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 229910052984 zinc sulfide Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/13—Transferases (2.) transferring sulfur containing groups (2.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
- C12P7/28—Acetone-containing products
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y101/00—Oxidoreductases acting on the CH-OH group of donors (1.1)
- C12Y101/01—Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
- C12Y101/0108—Isopropanol dehydrogenase (NADP+) (1.1.1.80)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/01009—Acetyl-CoA C-acetyltransferase (2.3.1.9)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y208/00—Transferases transferring sulfur-containing groups (2.8)
- C12Y208/03—CoA-transferases (2.8.3)
- C12Y208/03005—3-Oxoacid CoA-transferase (2.8.3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y208/00—Transferases transferring sulfur-containing groups (2.8)
- C12Y208/03—CoA-transferases (2.8.3)
- C12Y208/03008—Acetate CoA-transferase (2.8.3.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y208/00—Transferases transferring sulfur-containing groups (2.8)
- C12Y208/03—CoA-transferases (2.8.3)
- C12Y208/03009—Butyrate--acetoacetate CoA-transferase (2.8.3.9)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/02—Thioester hydrolases (3.1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/01—Carboxy-lyases (4.1.1)
- C12Y401/01004—Acetoacetate decarboxylase (4.1.1.4)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Abstract
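The present invention relates to thermophilic cells and methods for the microbial production of volatile compounds, including acetone, butanone and isopropanol. Nucleic acid constructs, vectors and host cells useful in such methods are also provided.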
Description
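Technical field
The present invention relates to thermophilic cells and methods for the microbial production of volatile compounds, wherein the volatile compounds include acetone, butanone and isopropanol. Nucleic acid constructs, vectors and host cells useful in such methods are also provided.
Background
During the First World War, production of acetone by fermentation with Clostridium acetobutylicum began on an industrial scale to meet military needs. The technology spread rapidly and, in the first half of the twentieth century, was second in importance only to ethanol fermentation. With the development of the petrochemical industry, acetone fermentation gradually declined in the West, although some countries continued to use it into the 1980s and 1990s.
Current concerns about the environmental impact of petroleum use and about its depletion have driven the search for alternative methods of chemical production and have renewed interest in the biological production of acetone. Acetone consumption was 5.9 million tonnes in 2014 and was projected to grow at about 3% per year to 7.2 million tonnes by 2020 ("Acetone market: global industry analysis and opportunity assessment, 2014-2020," 2015). Today, the vast majority of acetone is produced chemically by the cumene process, which carries a large environmental cost.
The native clostridial acetone pathway consists of three enzymatic steps starting from acetyl-CoA and acetate (Figure 7). While earlier studies addressed different aspects of acetone biosynthesis in the native host C. acetobutylicum (Jones et al., 1986), more recent studies have moved toward more advanced metabolic engineering in other organisms. In 1998, Bermejo et al. cloned the native acetone pathway from C. acetobutylicum into Escherichia coli and achieved yields comparable to, and even higher than, those of the native producer (Bermejo et al., 1998). Others modified the native pathway by introducing a hydrolysis reaction at the second step, which produces acetoacetate and CoA-SH (May et al., 2013). Others have modified the metabolic network of E. coli by establishing synthetic pathways, such as the recently invented non-oxidative glycolysis (Bogorad et al., 2013; Yang et al., 2016) and part of the mevalonate pathway (Baer et al., 2016). Acetone is a relatively cheap commodity chemical. For its biological production to compete with the petrochemical product, it is worth considering alternative production hosts, for example those that use feedstocks cheaper than refined sugars. For example, the native pathway from C. acetobutylicum has been expressed in cyanobacteria capable of producing acetone from CO2 and water (Zhou et al., 2012). Other hosts used for heterologous acetone production are C. ljungdahlii (Banerjee et al., 2014) and Acetobacterium woodii (Hoffmeister et al., 2016), both of which are acetogens able to metabolize CO or a mixture of CO2 and H2 (syngas). However, there is still interest in using other crude and low-cost carbon sources.
Representatives of the genus Geobacillus are also increasingly used as hosts for chemical production (Bosma et al., 2013). The advantages of thermophilic production with Geobacillus include: 1) a reduced risk of contamination by mesophiles; 2) higher reaction rates at elevated temperature; 3) a reduced energy input for cooling thermally pretreated biomass; 4) the ability of Geobacillus to use a wide range of carbon sources, including C6 and C5 sugars and acetate. In addition, acetone and other volatile compounds evaporate at elevated temperature and can be collected downstream, which facilitates their purification while reducing problems of product toxicity and product inhibition.
To date, most metabolic engineering efforts have focused on improving production of Geobacillus' own fermentation by-products, especially ethanol. This was achieved by knocking out genes of competing pathways and upregulating pathways so as to increase the flux toward ethanol (Cripps et al., 2009; Zhou et al., 2016).
One strategy for achieving higher acetone yields is to construct alternative biosynthetic pathways.
Accordingly, there remains a need for cells and methods that allow the biological production of acetone, butanone and isopropanol in an efficient, cost-effective and sustainable manner.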
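Summary of the invention
Provided herein is a method of producing one or more compounds selected from acetone, butanone and isopropanol, the method comprising the steps of:
a) providing a thermophilic cell, preferably a thermophilic bacterial or thermophilic archaeal cell, which expresses:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18);
an enzyme of EC number 2.3.3.20, said enzyme of EC number 2.3.3.20 being selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13); and functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II, wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25), or of functional variants thereof having at least 70% homology, similarity or identity thereto, and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr (SEQ ID NO: 29), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
b) culturing said bacterial cell in a bioreactor comprising a culture broth at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C, thereby producing said one or more compounds;
c) recovering said one or more compounds produced in step b).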
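Also provided herein is a thermophilic cell capable of producing acetone and/or butanone and optionally isopropanol, said cell being a bacterial cell or an archaeal cell and expressing:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18); or
an enzyme of EC number 2.3.3.20, said enzyme of EC number 2.3.3.20 being selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13); and
functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25), and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
whereby said cell is capable of converting acetyl-CoA to acetone, thereby producing acetone at a titer of at least 0.8 g/L;
and/or whereby said cell is capable of converting acetyl-CoA and propionyl-CoA to butanone, thereby producing butanone;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr (SEQ ID NO: 29), or a functional variant thereof having at least 70% homology, similarity or identity thereto,
whereby said cell is capable of further converting acetone to isopropanol, thereby producing isopropanol.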
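Also provided herein is a nucleic acid construct for modifying a thermophilic cell selected from a thermophilic bacterial cell and a thermophilic archaeal cell, said nucleic acid construct comprising:
i) a polynucleotide encoding an acetyl-CoA acetyltransferase (EC 2.3.1.9) or a functional variant thereof having at least 70% homology, similarity or identity thereto, wherein said acetyl-CoA acetyltransferase is selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18), and/or an enzyme of EC number 2.3.3.20 selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13);
ii) a polynucleotide encoding a second enzyme, said second enzyme being selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II;
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25), or of functional variants thereof having at least 70% homology, similarity or identity thereto, and
iii) a polynucleotide encoding an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70% homology, similarity or identity thereto.
Also provided herein is a vector comprising a nucleic acid construct as disclosed herein.
Also provided herein is a thermophilic cell comprising a nucleic acid construct and/or a vector as disclosed herein, wherein said thermophilic cell is a thermophilic bacterial cell or a thermophilic archaeal cell.
Also provided herein is a kit comprising a nucleic acid construct, a vector or a thermophilic cell as described herein.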
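Description of the drawings
Figure 1: Geobacillus thermoglucosidasius (G. thermoglucosidasius) as a host for acetone production. (A) Acetone tolerance. Cells were grown at different concentrations of acetone and the final density was measured. (B) Acetone biosynthesis using three enzyme combinations under different conditions. Each letter represents one enzyme, and its position corresponds to the enzymatic step in the pathway. Enzyme sources: D: Deferribacter desulfuricans (D. desulfuricans); G: Geobacillus sp. GHH01; C: C. acetobutylicum.
Figure 2: Production of butanone and acetone in G. thermoglucosidasius expressing the indicated thiolase and otherwise expressing the acetyl-CoA transferase Tle2 from Pseudothermotoga lettingae (UniProt ID A8F7H7, A8F7H6) and the acetoacetate decarboxylase Cac from C. acetobutylicum (P23670).
Figure 3: Correlation between the acetone yield of the Dde1-Dde2-Cac operon and the strength of the promoter driving its expression in G. thermoglucosidasius.
Figure 4: Acetone production by G. thermoglucosidasius strain CTC in TMM medium containing 1% glucose and different concentrations of acetate (data from Table 4).
Figure 5: Acetone production by G. thermoglucosidasius strain CTC in TMM medium containing 0.2% acetic acid and different concentrations of glucose and xylose.
Figure 6: Butanone production by G. thermoglucosidasius strain CTC in semi-defined TMM medium containing 1% glucose and different concentrations of propionic acid (data from Table 6).
Figure 7: The native acetone pathway in C. acetobutylicum.
Figure 8: Butanone production by G. thermoglucosidasius strain CTC in semi-defined TMM medium containing 2% glucose, 0.2% acetic acid and 1% yeast extract; 30 L fed-batch fermentation: 2 g/L/h glucose, 1 g/L/h acetic acid, 1 g/L/h yeast extract. X-axis: time, in hours; left Y-axis: acetone, g/L; right Y-axis: CO2, g/L.
Figure 9: Acetone production in G. thermoglucosidasius expressing the indicated thiolase and otherwise expressing the acetyl-CoA transferase Tle2 from Pseudothermotoga lettingae (UniProt ID A8F7H7, A8F7H6) and the acetoacetate decarboxylase Cac from C. acetobutylicum (P23670). Y-axis: acetone, mg/L.
Figure 10: Acetone production in the STC strain (Slip_0880-Tle2-Cac) and in the CTC strain (Caur_1461-Tle2-Cac) in a 1 L constant-feed fed-batch fermentation. The strains were grown in TMM medium supplemented with 2% glucose, 0.2% acetic acid and 1% yeast extract. Y-axis: acetone, g/L. X-axis: time, hours.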
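Detailed description
The present disclosure relates to methods for the microbial production of volatile compounds, in particular acetone, butanone and isopropanol. By using thermophilic cells for production, these compounds can be removed easily and continuously from the fermentation, such as from the fermentation broth, which, in addition to making the process labor- and cost-effective, also addresses the problems of product inhibition and of the negative effects on growth associated with product toxicity. Other advantages include a reduced risk of contamination owing to the relatively high production temperature.
Definitions
The term "thermophile" herein denotes a microorganism, in particular a bacterium or an archaeon, that thrives best, or is at least able to grow, at temperatures above 42°C.
Functional variant: the term is applied herein to functional variants of enzymes, i.e. modified forms of an enzyme, or homologous enzymes derived from a different species, that retain some or all of the catalytic activity of the original enzyme. Functional variants may have been modified by introducing mutations (which confer, for example, increased activity, a change in intracellular localization, increased thermostability, a prolonged half-life and so on) but retain the ability to perform the same enzymatic reaction as the enzyme from which they are derived, although possibly to a different extent. Preferably, the mutation introduced in a functional variant is a mutation in the gene encoding the corresponding enzyme, for example in the promoter of the gene or in the coding sequence encoding the enzyme.
"Identity", "similarity" and "homology" with respect to a polynucleotide (or polypeptide) are defined herein as the percentage of nucleic acids (or amino acids) in the candidate sequence that are identical to the residues of the corresponding native nucleic acid (or amino acid) sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent identity/similarity/homology, and considering any conservative substitutions according to the NC-IUB rules (http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html; NC-IUB, Eur J Biochem (1985) 150:1-5) as part of the sequence identity. Neither 5' or 3' extensions or insertions (for nucleic acids) nor N' or C' extensions or insertions (for polypeptides) shall be construed as reducing identity, similarity or homology. Methods and computer programs for alignment are well known in the art. In general, a given homology between two sequences means that the identity between these sequences is at least equal to the homology; for example, if two sequences are 70% homologous to one another, they cannot be less than 70% identical to one another, but could share 80% identity. Throughout this disclosure, a sequence (amino acid sequence or nucleic acid sequence) that shares at least 70% identity, homology or similarity with another sequence means a sequence that shares with said sequence at least 70% identity, homology or similarity, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, such as 100% identity, homology or similarity.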
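As an illustration of the 70% threshold defined above, the following is a minimal sketch of a percent-identity calculation on two sequences that have already been aligned. It is not the method used in this disclosure: the function names, the gap character and the choice of denominator (only columns where both sequences carry a residue, so that terminal extensions and insertions do not lower the score) are assumptions made for the example, and conservative substitutions are not scored.

```python
def percent_identity(aligned_candidate: str, aligned_reference: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where both sequences have a residue.

    Assumption: gap-containing columns are skipped, so extensions and
    insertions do not reduce the score.
    """
    if len(aligned_candidate) != len(aligned_reference):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for cand, ref in zip(aligned_candidate, aligned_reference):
        if cand == gap or ref == gap:
            continue  # skip columns that contain a gap
        compared += 1
        if cand.upper() == ref.upper():
            identical += 1
    if compared == 0:
        raise ValueError("no residue pairs to compare")
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_candidate: str, aligned_reference: str,
                             threshold: float = 70.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_candidate, aligned_reference) >= threshold
```

Sequence identity alone does not establish that a variant is functional; the activity assays described for each enzyme class would still be needed.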
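The term "acetyl-CoA acetyltransferase" or "thiolase" herein denotes an enzyme that catalyzes the conversion of two molecules of acetyl-CoA to acetoacetyl-CoA and coenzyme A (CoA), or the conversion of one acetyl-CoA and one propionyl-CoA (yielding 3-ketovaleryl-CoA). In particular, the term denotes an acetyl-CoA acetyltransferase of EC number 2.3.1.9. These particular enzymes have a substrate preference for acetyl-CoA or propionyl-CoA and therefore preferentially catalyze the reaction in the forward direction. The skilled person will know how to determine whether a mutant enzyme has thiolase activity. For example, the putative thiolase can be incubated with acetoacetyl-CoA and coenzyme A, and the absorbance at 303 nm can be monitored. A decrease in absorbance at 303 nm indicates that the putative thiolase can perform the reaction and has thiolase activity.
The terms "3-oxoacyl-ACP synthase" (3-oxoacyl-[acyl-carrier-protein] synthase) and "acyl-CoA:acyl-CoA alkyltransferase" denote the same enzyme of EC number 2.3.3.20. These enzymes catalyze, in the presence of water, the conversion of two molecules of acyl-CoA to one molecule of (2R)-2-alkyl-3-oxoalkanoate, releasing two molecules of coenzyme A (CoA). The reaction is a head-to-head non-decarboxylative Claisen condensation. The skilled person will know how to determine whether a mutant enzyme has 3-oxoacyl-ACP synthase activity. For example, the putative 3-oxoacyl-ACP synthase can be incubated with acetyl-CoA (or with acetyl-CoA and propionyl-CoA), followed by addition of 5,5'-dithio-bis-(2-nitrobenzoic acid), which reacts with the free thiol group of the released CoASH. The absorbance of the product can be monitored at 412 nm. Formation of the product indicates that the putative 3-oxoacyl-ACP synthase retains 3-oxoacyl-ACP synthase activity.
Acetoacetate decarboxylase: the term herein denotes an enzyme of EC number 4.1.1.4. Acetoacetate decarboxylases are involved in the ketone body production pathway in humans and other mammals and in solventogenesis in bacteria. They catalyze the decarboxylation of acetoacetate, yielding acetone and carbon dioxide. The skilled person will know how to determine whether a mutant enzyme has acetoacetate decarboxylase activity. For example, the putative acetoacetate decarboxylase can be incubated with lithium acetoacetate, and the concomitant release of CO2 can be monitored, for example by manometry. Release of CO2 indicates that the tested enzyme has acetoacetate decarboxylase activity.
Acetate CoA-transferase (EC 2.8.3.8) is an enzyme that catalyzes the following chemical reaction: acyl-CoA + acetate ⇌ fatty acid + acetyl-CoA. The activity of acetate CoA-transferase variants can be measured by methods known in the art, for example by incubating the enzyme with acetyl-CoA and lithium acetoacetate and following acetoacetyl-CoA formation by measuring the absorbance at 313 nm.
3-Oxoacid CoA-transferase (EC 2.8.3.5) is an enzyme that catalyzes the following chemical reaction: 3-ketovaleryl-CoA + fatty acid ⇌ 3-oxovalerate + acyl-CoA. The activity of acetate CoA-transferase variants can be tested by methods known in the art, for example by incubating the enzyme with acetyl-CoA and lithium 3-oxovalerate and following 3-ketovaleryl-CoA formation by measuring the absorbance at 304 nm.
Acyl-CoA:acetate/3-ketoacid CoA-transferase (EC 2.8.3.1) is an enzyme that catalyzes the following chemical reaction: 3-ketoacyl-CoA + fatty acid ⇌ 3-keto acid + acyl-CoA. Other names include propionate CoA-transferase, acetyl-CoA:propionate CoA-transferase, propionate coenzyme A-transferase, propionate-CoA:lactoyl-CoA transferase, propionyl-CoA:acetate CoA-transferase and propionyl-CoA transferase. The activity of acetate CoA-transferase variants can be measured by methods known in the art, for example by incubating the enzyme with acetyl-CoA and lithium 3-oxovalerate and following 3-ketovaleryl-CoA formation by measuring the absorbance at 304 nm.
Acyl-CoA thioesterase II (EC 3.1.2.-) is an enzyme that catalyzes the following hydrolysis reaction: acyl-CoA + H2O → fatty acid + coenzyme A. The skilled person will know how to determine whether a mutant enzyme has acyl-CoA thioesterase II activity. For example, the putative acyl-CoA thioesterase II can be incubated with acetoacetyl-CoA in the presence of 5,5'-dithiobis(2-nitrobenzoic acid). Release of the free thiol group of coenzyme A leads to formation of 5-thio-2-nitrobenzoate, which can be quantified by measuring the absorbance at 412 nm.
Isopropanol dehydrogenase: isopropanol dehydrogenase (NADP+) (EC 1.1.1.80) is an enzyme that catalyzes the conversion of propan-2-ol to acetone and of acetone to propan-2-ol. The activity of (mutant) variants of isopropanol dehydrogenase can be measured by incubating the enzyme with acetone and NAD(P)H and following NAD(P)H oxidation by measuring the absorbance at 340 nm.
Titer: the titer of a compound herein refers to the produced concentration of the compound. When the compound is produced by a cell, the term refers to the total concentration produced by the cell, i.e. the total amount of the compound divided by the volume of the culture medium. This means that, in particular for volatile compounds, the titer includes the fraction of the compound that may have evaporated from the culture medium, and it is therefore determined by collecting the produced compound both from the fermentation broth and from any off-gas of the fermenter.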
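The titer definition above can be made concrete with a small calculation. The sketch below is only illustrative: the function name, its parameters and the example numbers are assumptions made for the example and are not measurements from this disclosure. It adds the mass of compound remaining in the broth to the mass recovered from the off-gas and divides the sum by the culture volume.

```python
def total_titer_g_per_l(broth_conc_g_per_l: float,
                        culture_volume_l: float,
                        offgas_mass_g: float) -> float:
    """Total titer = (compound in broth + compound recovered from off-gas) / culture volume."""
    if culture_volume_l <= 0:
        raise ValueError("culture volume must be positive")
    mass_in_broth_g = broth_conc_g_per_l * culture_volume_l
    return (mass_in_broth_g + offgas_mass_g) / culture_volume_l


# Hypothetical example: 0.5 g/L measured in 2 L of broth plus 0.6 g
# condensed from the off-gas gives a total titer of about 0.8 g/L.
example_titer = total_titer_g_per_l(0.5, 2.0, 0.6)
```

Measuring the broth alone would report 0.5 g/L in this hypothetical case, which is why the definition counts the evaporated fraction for volatile products.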
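Provided herein is a method of producing one or more compounds selected from acetone, butanone and isopropanol, the method comprising the steps of:
a) providing a thermophilic cell, preferably a thermophilic bacterial or thermophilic archaeal cell, which expresses:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18); or
an enzyme of EC number 2.3.3.20, said enzyme of EC number 2.3.3.20 being selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13); and
functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25),
and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr (SEQ ID NO: 29), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
b) culturing said bacterial cell in a bioreactor comprising a culture broth at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C, thereby producing said one or more compounds;
c) recovering said one or more compounds produced in step b).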
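Thermophilic cells
The cells employed in the context of the present disclosure are thermophilic cells, more specifically bacterial or archaeal cells. In particular, bacterial or archaeal cells having an optimal growth temperature of 42°C or higher are of interest. Unless otherwise indicated, the term "cell" herein is generally to be construed as denoting a thermophilic cell, more specifically a thermophilic bacterial cell or a thermophilic archaeal cell, i.e. a cell capable of growing at a temperature of 42°C or higher.
Provided herein is a thermophilic cell capable of producing acetone and/or butanone and optionally isopropanol, said cell being a bacterial cell or an archaeal cell and expressing:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18); or
an enzyme of EC number 2.3.3.20, said enzyme of EC number 2.3.3.20 being selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13); and
functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25); and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
whereby said cell is capable of converting acetyl-CoA to acetone, thereby producing acetone at a titer of at least 0.8 g/L;
and/or whereby said cell is capable of converting acetyl-CoA and propionyl-CoA to butanone, thereby producing butanone;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr (SEQ ID NO: 29), or a functional variant thereof having at least 70% homology, similarity or identity thereto,
whereby said cell is capable of further converting acetone to isopropanol, thereby producing isopropanol.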
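The present invention uses thermophilic cells for the microbial production of volatile compounds, in particular acetone, butanone and isopropanol. Because such cells thrive at higher temperatures than conventional, i.e. non-thermophilic, cells, recovery of the volatile products can be facilitated, since these volatile products are typically present in the off-gas produced during cultivation of the thermophilic cells. This not only reduces production costs but is generally also expected to benefit the longevity of the producer, because the end products (acetone, butanone and isopropanol) are usually toxic to the producing cells.
The thermophilic cells described herein have been engineered to produce volatile compounds, namely acetone, butanone and/or isopropanol. The thermophilic cells described herein are preferably not naturally occurring. In certain embodiments, the thermophilic cell is a non-natural or engineered cell that has been modified to express a heterologous pathway (i.e. a pathway not present in the parent cell) or to express a modified native pathway.
In certain embodiments, the thermophilic cell capable of producing acetone and/or butanone and optionally isopropanol is a bacterial cell or an archaeal cell and expresses:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Slip_0880 as set forth in SEQ ID NO: 7 and Dde1 as set forth in SEQ ID NO: 59, or functional variants thereof having at least 70% identity or similarity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2 and Dde2 (EC 2.8.3.5) as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20; and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), wherein said acetoacetate decarboxylase is Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% identity or similarity thereto;
whereby said cell is capable of converting acetyl-CoA to acetone, thereby producing acetone at a titer of at least 0.8 g/L;
and/or whereby said cell is capable of converting acetyl-CoA and propionyl-CoA to butanone, thereby producing butanone;
and
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), wherein said isopropanol dehydrogenase is Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% identity or similarity thereto,
whereby said cell is capable of further converting acetone to isopropanol, thereby producing isopropanol.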
In certain embodiments, the thermophilic cell is a bacterial cell, i.e. a thermophilic bacterial cell. In other embodiments, the thermophilic cell is an archaeal cell, i.e. a thermophilic archaeal cell.
In certain embodiments, the thermophilic cell belongs to a genus selected from: Geobacillus, Thermoanaerobacterium, Thermoanaerobacter, Caldanaerobacter, Bacillus, Thermoclostridium, Anoxybacillus, Caldicellulosiruptor, Moorella, Thermus, Thermotoga, Pseudothermotoga, Chloroflexus, Anaerocellum, Rhodothermus, Sulfolobus, Thermococcus, Pyrococcus and Clostridium. In particular embodiments, the thermophilic cell is a Geobacillus cell, a Bacillus cell or a Clostridium cell.
In certain embodiments, the thermophilic cell belongs to a species selected from: Geobacillus thermoglucosidasius, Geobacillus toebii, Geobacillus stearothermophilus, Geobacillus thermodenitrificans, Geobacillus kaustophilus, Geobacillus thermoleovorans, Geobacillus thermocatenulatus, Thermoanaerobacterium xylanolyticum, Thermoanaerobacterium saccharolyticum, Thermoanaerobacterium thermosaccharolyticum, Thermoanaerobacter mathranii, Thermoanaerobacter pseudoethanolicus, Thermoanaerobacter brockii, Thermoanaerobacter kivui, Caldanaerobacter subterraneus, Clostridium thermocellum, Clostridium thermosuccinogenes, Thermoclostridium stercorarium, Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus smithii, Bacillus methanolicus, Bacillus flavothermus, Anoxybacillus kamchatkensis, Anoxybacillus gonensis, Caldicellulosiruptor bescii, Caldicellulosiruptor saccharolyticus, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor owensensis, Caldicellulosiruptor lactoaceticus, Moorella thermoacetica, Moorella thermoautotrophica, Thermus thermophilus, Thermus aquaticus, Thermotoga maritima, Pseudothermotoga lettingae, Pseudothermotoga thermarum, Chloroflexus aurantiacus, Anaerocellum thermophilum, Rhodothermus marinus, Sulfolobus acidocaldarius, Sulfolobus islandicus, Sulfolobus solfataricus, Thermococcus barophilus, Thermococcus kodakarensis, Pyrococcus abyssi and Pyrococcus furiosus. In particular embodiments, the cell is a Geobacillus thermoglucosidasius cell. In other embodiments, the cell is a Bacillus subtilis cell. In other embodiments, the cell is a Clostridium thermocellum cell.
In certain embodiments, the thermophilic cell has an optimal growth temperature between 42°C and 80°C, or is capable of growing at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C. For example, the thermophilic cell has an optimal growth temperature of 42°C or higher, such as 43°C or higher, 44°C or higher, 45°C or higher, 46°C or higher, 47°C or higher, 48°C or higher, 49°C or higher, 50°C or higher, 51°C or higher, 52°C or higher, 53°C or higher, 54°C or higher, 55°C or higher, 56°C or higher, 57°C or higher, 58°C or higher, 59°C or higher, for example 60°C or higher. In certain embodiments, the thermophilic cell is capable of growing at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C. For example, the thermophilic cell is capable of growing at a temperature of 42°C or higher, such as 43°C or higher, 44°C or higher, 45°C or higher, 46°C or higher, 47°C or higher, 48°C or higher, 49°C or higher, 50°C or higher, 51°C or higher, 52°C or higher, 53°C or higher, 54°C or higher, 55°C or higher, 56°C or higher, 57°C or higher, 58°C or higher, 59°C or higher, for example 60°C or higher.
In particular, the thermophilic cell is preferably capable of growing at a temperature at which at least part of the acetone, butanone and/or isopropanol it produces evaporates, thereby facilitating recovery of the produced acetone, butanone and/or isopropanol. Thus, in certain embodiments, the thermophilic cell is capable of growing at a temperature equal to or greater than the boiling point of acetone, butanone and/or isopropanol. In certain embodiments, the thermophilic cell is capable of growing at a temperature of 56°C (the boiling point of acetone) or higher.
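The relation between cultivation temperature and product boiling point can be checked with a short lookup. This is a rough sketch only: the 56°C value for acetone is the one stated above, whereas the boiling points used for butanone (about 79.6°C) and isopropanol (about 82.5°C) are approximate literature values at atmospheric pressure that are not given in this disclosure, and the function name is hypothetical.

```python
# Approximate boiling points at atmospheric pressure, in degrees Celsius.
# Acetone: value stated in the text. Butanone and isopropanol: assumed
# literature values added for illustration.
BOILING_POINT_C = {
    "acetone": 56.0,
    "butanone": 79.6,
    "isopropanol": 82.5,
}


def products_at_or_above_boiling(cultivation_temp_c: float) -> list:
    """Return the products whose boiling point is at or below the cultivation temperature."""
    return [name for name, bp in BOILING_POINT_C.items() if bp <= cultivation_temp_c]
```

At 60°C, for example, only acetone is at or above its boiling point. Compounds below their boiling point still evaporate partially, which is consistent with the text speaking of "at least part" of the product evaporating.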
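Production of volatile compounds
Disclosed herein are methods and cells useful for producing volatile compounds, in particular one or more compounds selected from acetone, butanone and isopropanol.
The thermophilic cells disclosed herein express the enzymes necessary to achieve production of said compounds.
Accordingly, provided herein is a method of producing one or more compounds selected from acetone, butanone and isopropanol, the method comprising the steps of:
a) providing a thermophilic cell, preferably a thermophilic bacterial or thermophilic archaeal cell, which expresses:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18);
an enzyme of EC number 2.3.3.20, said enzyme of EC number 2.3.3.20 being selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13); and
functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25), or of functional variants thereof having at least 70% homology, similarity or identity thereto,
and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr (SEQ ID NO: 29), or a functional variant thereof having at least 70% homology, similarity or identity thereto;
b) culturing said bacterial cell in a bioreactor comprising a culture broth at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C, thereby producing said one or more compounds;
c) recovering said one or more compounds produced in step b).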
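The thermophilic cells of the present disclosure are capable of producing one or more volatile compounds, preferably acetone, butanone and/or isopropanol. The skilled person will know how to adjust the conditions under which the thermophilic cells are incubated in order to obtain one particular compound. For example, the thermophilic cell is capable of producing acetone from acetyl-CoA, where the acetyl-CoA can be synthesized by the cell and/or can be provided to the cell. In other cases, the thermophilic cell is capable of producing butanone from propionyl-CoA and acetyl-CoA, where the propionyl-CoA and acetyl-CoA can be synthesized by the cell and/or can be provided to the cell. If the cell expresses an isopropanol dehydrogenase, it can convert the acetone produced to isopropanol. The thermophilic cell is therefore versatile: by varying the incubation conditions, all three compounds (acetone, butanone and isopropanol) can be obtained.
The thermophilic cell may be capable of synthesizing acetyl-CoA, for example when acetic acid is provided in the medium, or acetyl-CoA may be synthesized by the cell from other substrates, or may be provided to the cell, for example if the cell has been engineered to be able to use extracellular acetyl-CoA provided in the fermentation broth. The thermophilic cell may be capable of synthesizing propionyl-CoA, for example when propionic acid is provided in the medium, or propionyl-CoA may be synthesized by the cell from other substrates, or may be provided to the cell, for example if the cell has been engineered to be able to use extracellular propionyl-CoA provided in the fermentation broth.
In embodiments where the volatile compound to be produced is acetone, the cell is capable of converting acetyl-CoA to acetone by the following steps:
1) converting acetyl-CoA to acetoacetyl-CoA;
2) converting acetoacetyl-CoA to acetoacetate;
3) converting acetoacetate to acetone.
In embodiments where the volatile compound to be produced is butanone, the cell is capable of converting propionyl-CoA and acetyl-CoA to butanone by the following steps:
1) converting propionyl-CoA and acetyl-CoA to 3-ketovaleryl-CoA;
2) converting 3-ketovaleryl-CoA to 3-oxovalerate;
3) converting 3-oxovalerate to butanone.
In embodiments where the volatile compound to be produced is isopropanol, the cell is capable of producing acetone as described herein and is further capable of converting acetone to isopropanol. This involves the following steps:
1) converting acetyl-CoA to acetoacetyl-CoA;
2) converting acetoacetyl-CoA to acetoacetate;
3) converting acetoacetate to acetone;
4) converting acetone to isopropanol.
Steps 1) to 3) above can be performed by the same enzymes, irrespective of the volatile compound to be produced. Producing isopropanol according to the present method requires that the thermophilic cell express a further enzyme that is not necessary for producing acetone or butanone, as detailed below.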
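The three routes listed above share their first three enzymatic roles, which can be made explicit with a small data structure. The sketch below is only a way of organizing the steps: the names `PATHWAYS`, `steps_for` and `enzymes_required` are hypothetical, and the assignment of each step to the "first enzyme", "second enzyme", acetoacetate decarboxylase and isopropanol dehydrogenase follows the order in which these enzymes are introduced in this disclosure.

```python
# Each step is (substrate, product, enzyme role).
ACETONE_STEPS = [
    ("acetyl-CoA", "acetoacetyl-CoA", "first enzyme"),
    ("acetoacetyl-CoA", "acetoacetate", "second enzyme"),
    ("acetoacetate", "acetone", "acetoacetate decarboxylase"),
]
BUTANONE_STEPS = [
    ("propionyl-CoA + acetyl-CoA", "3-ketovaleryl-CoA", "first enzyme"),
    ("3-ketovaleryl-CoA", "3-oxovalerate", "second enzyme"),
    ("3-oxovalerate", "butanone", "acetoacetate decarboxylase"),
]
PATHWAYS = {
    "acetone": ACETONE_STEPS,
    "butanone": BUTANONE_STEPS,
    # Isopropanol reuses the acetone route and adds a fourth step.
    "isopropanol": ACETONE_STEPS + [("acetone", "isopropanol", "isopropanol dehydrogenase")],
}


def steps_for(product: str) -> list:
    """Return the ordered conversion steps for the given product."""
    return list(PATHWAYS[product])


def enzymes_required(product: str) -> list:
    """Return the enzyme roles needed for the product, in pathway order."""
    return list(dict.fromkeys(role for _, _, role in PATHWAYS[product]))
```

With this layout, `enzymes_required("acetone")` and `enzymes_required("butanone")` return the same three roles, and only `"isopropanol"` adds the dehydrogenase.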
Accordingly, provided herein is a method of producing one or more compounds selected from acetone, butanone and isopropanol, the method comprising the steps of:
a) providing a thermophilic cell, preferably a thermophilic bacterial or thermophilic archaeal cell, which expresses:
i) a first enzyme selected from:
- an acetyl-CoA acetyltransferase (EC 2.3.1.9), said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60) and CHY_1355 (SEQ ID NO: 18),
- an enzyme of EC number 2.3.3.20, said enzyme of EC number 2.3.3.20 being selected from the 3-oxoacyl-ACP synthase SVA_3859 (SEQ ID NO: 12) and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 (SEQ ID NO: 13); and
- functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto;
ii) a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II, wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26) and Rma (EC 3.1.2.-) (SEQ ID NO: 27), or functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23), and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25), or of functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto,
and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac (SEQ ID NO: 28), or a functional variant thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr (SEQ ID NO: 29), or a functional variant thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto;
b) culturing said bacterial cell in a bioreactor comprising a culture broth at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C, thereby producing said one or more compounds;
c) recovering said one or more compounds produced in step b).
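The thermophilic cell may be as described herein; in particular, the cell may be a bacterial cell or an archaeal cell.
First enzyme
The first enzyme may be an acetyl-CoA acetyltransferase (also called a thiolase) or a 3-oxoacyl-ACP synthase.
Thiolases catalyze the following conversions:
i. two molecules of acetyl-CoA to acetoacetyl-CoA and coenzyme A (CoA), or
ii. one acetyl-CoA and one propionyl-CoA to 3-ketovaleryl-CoA and coenzyme A (CoA).
Which reaction actually takes place in the thermophilic cell will depend on which substrates are present in the culture broth, or on the metabolism of the particular cell used, as is well known to the skilled person. If acetyl-CoA is present, reaction i will take place. If both acetyl-CoA and propionyl-CoA are present, reaction ii will take place, or both reactions will take place. A culture broth supplemented with acetic acid may increase the titer. Propionyl-CoA can be provided in the fermentation. The cell may also have been engineered to be able to synthesize propionyl-CoA and/or acetyl-CoA, or to synthesize propionyl-CoA and/or acetyl-CoA in greater amounts than a corresponding non-engineered cell.
Producing acetone according to the present method requires reaction i. Producing butanone according to the present method requires reaction ii.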
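The dependence of the thiolase reaction on the available substrates can be summarized as a simple rule. The following sketch is illustrative only: the function names are hypothetical, and the rule is a simplification that lists the reactions that may occur for a given set of acyl-CoA substrates, without predicting their relative rates.

```python
def possible_thiolase_reactions(available: set) -> list:
    """List which of reactions i and ii may occur for the given substrates.

    Reaction i needs acetyl-CoA; reaction ii needs acetyl-CoA and propionyl-CoA.
    """
    reactions = []
    if "acetyl-CoA" in available:
        reactions.append("i")
        if "propionyl-CoA" in available:
            reactions.append("ii")
    return reactions


# Reaction required for each ketone product, as stated in the text.
REQUIRED_REACTION = {"acetone": "i", "butanone": "ii"}


def product_is_possible(product: str, available: set) -> bool:
    """True if the reaction required for the product may occur."""
    return REQUIRED_REACTION[product] in possible_thiolase_reactions(available)
```

For instance, with only acetyl-CoA available the rule allows acetone but not butanone, whereas with both acetyl-CoA and propionyl-CoA it allows both products.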
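In certain embodiments, the first enzyme is an acetyl-CoA acetyltransferase of EC number 2.3.1.9. These particular enzymes have a substrate preference for acetyl-CoA or propionyl-CoA and therefore preferentially catalyze the reaction in the forward direction.
In certain embodiments, the first enzyme is an enzyme of EC number 2.3.3.20, i.e. a 3-oxoacyl-ACP synthase (3-oxoacyl-[acyl-carrier-protein] synthase) or acyl-CoA:acyl-CoA alkyltransferase. These catalyze, in the presence of water, the conversion of two molecules of acyl-CoA to one molecule of (2R)-2-alkyl-3-oxoalkanoate, thereby releasing two molecules of coenzyme A (CoA).
The first enzyme is selected from the group consisting of: GHH_c20420 (SEQ ID NO: 1), Slip_0499 (SEQ ID NO: 2), Caur_1461 (SEQ ID NO: 3), Slip_0479 (SEQ ID NO: 4), Tfu_1520 (SEQ ID NO: 5), Tfu_0436 (SEQ ID NO: 6), Slip_0880 (SEQ ID NO: 7), Tfu_2394 (SEQ ID NO: 8), Slip_1236 (SEQ ID NO: 9), Caur_1540 (SEQ ID NO: 10), Tfu_0253 (SEQ ID NO: 11), CHY_1604 (SEQ ID NO: 14), CHY_1288 (SEQ ID NO: 15), Slip_2085 (SEQ ID NO: 16), Slip_0465 (SEQ ID NO: 17), Dde1 (SEQ ID NO: 59), Rxy2 (SEQ ID NO: 60), CHY_1355 (SEQ ID NO: 18), SVA_3859 (SEQ ID NO: 12), Despr_2661 (SEQ ID NO: 13) and functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto. Preferably, the first enzyme is an acetyl-CoA acetyltransferase (EC 2.3.1.9) selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Slip_0880 as set forth in SEQ ID NO: 7 and Dde1 as set forth in SEQ ID NO: 59, and functional variants thereof having at least 70% identity, homology or similarity thereto.
In one embodiment, the first enzyme is GHH_c20420 (SEQ ID NO: 1) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Slip_0499 (SEQ ID NO: 2) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Caur_1461 (SEQ ID NO: 3) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Slip_0479 (SEQ ID NO: 4) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Tfu_1520 (SEQ ID NO: 5) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Tfu_0436 (SEQ ID NO: 6) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Slip_0880 (SEQ ID NO: 7) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Tfu_2394 (SEQ ID NO: 8) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Slip_1236 (SEQ ID NO: 9) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Caur_1540 (SEQ ID NO: 10) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Tfu_0253 (SEQ ID NO: 11) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is CHY_1604 (SEQ ID NO: 14) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is CHY_1288 (SEQ ID NO: 15) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Slip_2085 (SEQ ID NO: 16) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Slip_0465 (SEQ ID NO: 17) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Dde1 (SEQ ID NO: 59) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Rxy2 (SEQ ID NO: 60) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is CHY_1355 (SEQ ID NO: 18) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is SVA_3859 (SEQ ID NO: 12) or a functional variant thereof having at least 70% homology, similarity or identity thereto. In another embodiment, the first enzyme is Despr_2661 (SEQ ID NO: 13) or a functional variant thereof having at least 70% homology, similarity or identity thereto.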
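Functional variants of the above enzymes are modified forms of said enzymes that still retain at least some of the activity of the original enzyme. In the case of thermostable enzymes, the functional variant is preferably also thermostable. In certain embodiments, the functional variant has, compared with the original enzyme, mutations that are preferably not located within the active site of the enzyme. The skilled person knows how to determine whether a variant of the first enzyme is functional. For example, the putative thiolase can be incubated with acetoacetyl-CoA and coenzyme A, and the absorbance at 303 nm can be monitored. A decrease in absorbance at 303 nm indicates that the putative thiolase can perform the reaction and has thiolase activity; it can therefore be considered a functional variant. The putative 3-oxoacyl-ACP synthase can be incubated with acetyl-CoA (or with acetyl-CoA and propionyl-CoA), followed by addition of 5,5'-dithio-bis-(2-nitrobenzoic acid), which reacts with the free thiol group of the released CoASH. The absorbance of the product can be monitored at 412 nm. Formation of the product indicates that the putative 3-oxoacyl-ACP synthase retains 3-oxoacyl-ACP synthase activity.
Enzymes of this class contain a thiolase N-terminal domain (Pfam accession number PF00108), a thiolase C-terminal domain (PF02803) and a β-ketoacyl synthase domain (PF00109), which contain the active center and are involved in oligomerization of the functional enzyme (Mathieu et al., 1997). Functional variants of such enzymes therefore preferably comprise said domains. Functional variants may otherwise be engineered as known in the art.
Second enzyme
The thermophilic cell employed in the present method further expresses a second enzyme selected from acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase and acyl-CoA thioesterase II. This can be achieved by further engineering the cell.
Acetate CoA-transferase (EC 2.8.3.8) catalyzes the conversion of acetate and acyl-CoA to acetyl-CoA and a fatty acid. 3-Oxoacid CoA-transferase (EC 2.8.3.5) catalyzes the conversion of 3-oxoacyl-CoA and succinate to a 3-oxo acid and succinyl-CoA, or the conversion of 3-ketovaleryl-CoA + acetate to 3-oxovalerate + acetyl-CoA. Acyl-CoA:acetate/3-ketoacid CoA-transferase (EC 2.8.3.1) catalyzes the conversion of acetyl-CoA and propionate to acetate and propionyl-CoA. Acyl-CoA thioesterase II (EC 3.1.2.-) catalyzes the hydrolysis of acyl-CoA to a fatty acid and CoASH.
Some of the above enzymes can catalyze different reactions. The type of reaction that actually takes place in the thermophilic cell will depend on which substrates are present in the culture broth or on how the cell has been engineered, as is well known to the skilled person.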
More specifically, the second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) (SEQ ID NO: 21), Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) (SEQ ID NO: 26), Rma (EC 3.1.2.-) (SEQ ID NO: 27) and functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto; wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20) or of functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto; wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO: 22) and Ghh2 subunit B (SEQ ID NO: 23) or of functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto; and wherein Tme consists of Tme subunit A (SEQ ID NO: 24) and Tme subunit B (SEQ ID NO: 25) or of functional variants thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto.
Preferably, the second enzyme is Tle2 or Dde2 (SEQ ID NO: 21) or a functional variant thereof having at least 70% homology, similarity or identity thereto, wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO: 19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO: 20) or of functional variants thereof having at least 70% homology, similarity or identity thereto.
In certain embodiments, the second enzyme is Tle2 or a functional variant thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto. The enzyme consists of two subunits: subunit A as set forth in SEQ ID NO: 19, and subunit B as set forth in SEQ ID NO: 20. Subunit A has EC number 2.8.3.8 and subunit B has EC number 2.8.3.9. In certain embodiments, the second enzyme is Tle2, which consists of Tle2 subunit A and Tle2 subunit B as set forth in SEQ ID NO: 19 and SEQ ID NO: 20, respectively. In certain embodiments, the second enzyme is Tle2 or a functional variant thereof having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity thereto. In certain embodiments, the second enzyme is a functional variant of Tle2 consisting of Tle2 subunit A and a functional variant of Tle2 subunit B having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity to SEQ ID NO: 20. In other embodiments, the second enzyme is a functional variant of Tle2 consisting of Tle2 subunit B and a functional variant of Tle2 subunit A having at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity to SEQ ID NO: 19. In certain embodiments, the second enzyme is a functional variant of Tle2 consisting of a functional variant of Tle2 subunit A and a functional variant of Tle2 subunit B, wherein the functional variant of Tle2 subunit A has at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity to SEQ ID NO: 19, and the functional variant of Tle2 subunit B has at least 70%, such as at least 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, homology, similarity or identity to SEQ ID NO: 20.
在某些实施方案中,所述第二种酶是具有EC编号2.8.3.5的酶。在某些实施方案中,所述第二种酶是Dde2(SEQ ID NO:21)或其功能性变体,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
在某些实施方案中,所述第二种酶是Ghh2或其功能性变体,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。该酶由两个亚基组成:如在SEQ ID NO:22中所示的亚基A,和如在SEQ ID NO:23中所示的亚基B。两个亚基具有EC编号2.8.3.5。在某些实施方案中,所述第二种酶是Ghh2,其由在SEQ ID NO:22和SEQ ID NO:23中分别所示的Ghh2亚基A和Ghh2亚基B组成。在某些实施方案中,所述第二种酶是Ghh2的功能性变体,其由Ghh2亚基A和与SEQ ID NO:23具有至少70%同源性、相似性或同一性的Ghh2亚基B的功能性变体组成,该Ghh2亚基B的功能性变体与SEQ ID NO:23具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。在其它实施方案中,所述第二种酶是Ghh2的功能性变体,其由Ghh2亚基B和与SEQ ID NO:22具有至少70%同源性、相似性或同一性的Ghh2亚基A的功能性变体组成,该Ghh2亚基A的功能性变体与SEQ ID NO:22具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。在某些实施方案中,所述第二种酶是Ghh2的功能性变体,其由与SEQ ID NO:22具有至少70%同源性、相似性或同一性的Ghh2亚基A的功能性变体和与SEQ ID NO:23具有至少70%同源性、相似性或同一性的Ghh2亚基B的功能性变体组成,其中该Ghh2亚基A的功能性变体与SEQ ID NO:22具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性,该Ghh2亚基B的功能性变体与SEQ ID NO:23具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
在某些实施方案中,所述第二种酶是Tme或其功能性变体,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。EC编号2.8.3.8的该酶由两个亚基组成:如在SEQ ID NO:24中所示的亚基A,和如在SEQ ID NO:25中所示的亚基B。在某些实施方案中,所述第二种酶是Tme,其由在SEQ ID NO:24和SEQ ID NO:25中分别所示的Tme亚基A和Tme亚基B组成。在某些实施方案中,所述第二种酶是Tme的功能性变体,其由Tme亚基A和与SEQ ID NO:25具有至少70%同源性、相似性或同一性的Tme亚基B的功能性变体组成,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。在其它实施方案中,所述第二种酶是Tme的功能性变体,其由Tme亚基B和与SEQ ID NO:24具有至少70%同源性、相似性或同一性的Tme亚基A的功能性变体组成,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。在某些实施方案中,所述第二种酶是Tme的功能性变体,其由与SEQ ID NO:24具有至少70%同源性、相似性或同一性的Tme亚基A的功能性变体和与SEQ ID NO:25具有至少70%同源性、相似性或同一性的Tme亚基B的功能性变体组成,上述功能性变体分别与SEQ ID NO:24和SEQ ID NO:25具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
在某些实施方案中,所述第二种酶是具有EC编号2.8.3.1的酶。在某些实施方案中,所述第二种酶是Pth(SEQ ID NO:26)或其功能性变体,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
在某些实施方案中,所述第二种酶是具有EC编号3.1.2.-的酶。在某些实施方案中,所述第二种酶是Rma(SEQ ID NO:27)或其功能性变体,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
上述酶的功能性变体是所述酶的修饰形式,其仍然至少保留原始酶的一些活性。在热稳定酶的情况下,所述功能性变体优选也是热稳定的。在某些实施方案中,与原始酶相比,所述功能性变体具有优选不位于酶的活性位点内的突变。技术人员知道如何确定第二种酶的变体是否具有功能。例如,可将EC编号2.8.3.8的潜在乙酸辅酶A-转移酶与乙酰辅酶A和乙酰乙酸锂一起孵育,并且可以监测在313nm的吸光度以跟踪乙酰乙酰辅酶A的形成。可如上文所述测试EC编号2.8.3.5的潜在3-氧代酸辅酶A转移酶。可如上文所述测试潜在的酰基辅酶A:乙酸/3-酮酸辅酶A转移酶(EC 2.8.3.1)。可以如上文所述测试潜在的酰基辅酶A硫酯酶II(EC 3.1.2.-)。
该类酶含有辅酶A转移酶结构域(Pfam登录号PF01144)和乙酰辅酶A水解酶/转移酶C-端结构域(PF13336),其含有活性中心,并参与功能性酶的寡聚化。因此,这样的酶的功能性变体优选包含所述结构域。功能性变体可以如本领域已知的那样在其它方面经过工程化。
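上文反复使用的“至少70%同一性”阈值可以用如下草图说明。这只是一个最小示例,并非本文规定的计算方法:它假设两条序列已经由外部比对工具完成了比对(用“-”表示空位),并假设同一性以比对列数为分母计算;函数名 `percent_identity` 和 `meets_threshold` 均为说明用的假设名称。

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns (assumption: denominator is the
    number of alignment columns, skipping columns where both are gaps)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the candidate variant reaches the identity threshold (hypothetical helper)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

序列同一性只是筛选条件之一;如上文所述,候选变体是否具有功能仍需通过活性测定确认。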
乙酰乙酸脱羧酶
在本方法中采用的嗜热细胞进一步表达乙酰乙酸脱羧酶。该酶具有EC编号4.1.1.4,并催化乙酰乙酸盐的脱羧,从而生成丙酮和二氧化碳;该酶还能够催化所述3-氧代戊酸盐脱羧为丁酮;它还能够参与乙酰乙酰辅酶A向丙酮的转化或3-酮戊酰基辅酶A向丁酮的转化。该酶在本文公开的嗜热细胞中的表达因此允许乙酸盐转化为丙酮,其中乙酸盐被提供给细胞(例如在培养基中)或由细胞产生。当在产生3-氧代戊酸盐的条件下孵育嗜热细胞时,例如如果培养液包含丙酸,则该酶能够催化所述3-氧代戊酸盐脱羧成丁酮。
所述乙酰乙酸脱羧酶优选是热稳定的乙酰乙酸脱羧酶。在优选的实施方案中,所述乙酰乙酸脱羧酶对于嗜热微生物不是天然的,具体地所述乙酰乙酸脱羧酶可对于梭菌属种,诸如丙酮丁醇梭菌是天然的。如在SEQ ID NO:28中所示的乙酰乙酸脱羧酶Cac对本方法可尤其有利。
因此,在某些实施方案中,所述乙酰乙酸脱羧酶是如在SEQ ID NO:28中所示的Cac,或与其具有至少70%同源性、相似性或同一性的功能性变体,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
上述酶的功能性变体是所述酶的修饰形式,其仍然至少保留原始酶的一些活性。在热稳定酶的情况下,所述功能性变体优选也是热稳定的。在某些实施方案中,所述功能性变体与原始酶相比具有优选不位于酶的活性位点内的突变。技术人员知道如何确定乙酰乙酸脱羧酶的变体是否具有功能。例如,可将潜在的乙酰乙酸脱羧酶与乙酰乙酸锂一起孵育,并例如通过测压法监测伴随的CO2释放。CO2的释放表明被测酶具有乙酰乙酸脱羧酶活性。
该类酶含有乙酰乙酸脱羧酶结构域(Pfam登录号PF06314),其含有活性中心,并参与功能性酶的寡聚化。在活性位点中的氨基酸残基Lys 115、Lys 116、Arg 29、Glu 61和Glu 76是酶活性所必需的(Ho等人,2009)。因此,该酶的功能性变体优选包含所述结构域和/或残基。功能性变体可以如本领域已知的那样在其它方面经过工程化。
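上文提到的测压法可以用理想气体定律粗略换算:由密闭反应瓶顶空的压力升高估算释放的CO2量。以下仅为示意性草图,假设CO2为理想气体、顶空体积恒定,并忽略溶解在液相中的CO2;函数名 `co2_released_umol` 及其参数均为说明用的假设。

```python
R = 8.314462618  # molar gas constant, J/(mol*K)


def co2_released_umol(delta_p_pa: float, headspace_ml: float, temp_c: float) -> float:
    """Estimate CO2 released (micromoles) from a headspace pressure rise.

    Assumptions: ideal gas, constant headspace volume, dissolved CO2 ignored.
    """
    volume_m3 = headspace_ml * 1e-6      # mL -> m^3
    temp_k = temp_c + 273.15             # Celsius -> Kelvin
    moles = delta_p_pa * volume_m3 / (R * temp_k)   # n = dP * V / (R * T)
    return moles * 1e6                   # mol -> umol


# Example: 1 kPa pressure rise in a 10 mL headspace at 60 degrees C
print(round(co2_released_umol(1000.0, 10.0, 60.0), 2))
```

由于乙酰乙酸脱羧每生成1分子丙酮即释放1分子CO2,该估算值在上述假设下也近似对应已脱羧的乙酰乙酸的量。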
异丙醇脱氢酶
本文还提供了在嗜热细胞中生产异丙醇的细胞和方法。为此,除了上述酶之外,即除了第一种酶、第二种酶和乙酰乙酸脱羧酶之外,所述嗜热细胞可以进一步表达异丙醇脱氢酶。该酶(EC 1.1.1.80)催化丙酮向丙烷-2-醇的转化。因此,如本文描述的能够生产丙酮的嗜热细胞可以被进一步修饰成表达异丙醇脱氢酶,该酶随后可以将生产的丙酮转化为异丙醇或至少部分的丙酮转化为异丙醇。
在某些实施方案中,所述异丙醇脱氢酶是Tbr(SEQ ID NO:29)或其功能性变体,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
嗜热细胞的培养
在反应器中,例如本领域已知的生物反应器或发酵罐中培养所述嗜热细胞。在本公开的上下文中,在“高”温,即高于通常用于细菌培养的常规37℃的温度下培养所述细胞。在较高温度进行培养的优点在于,这可以促进产生的挥发性化合物的回收。优选地,在42℃和80℃之间,诸如50℃和75℃之间,例如60℃的温度下培养所述嗜热细胞。在某些实施方案中,在42℃或更高、诸如43℃或更高、诸如44℃或更高、诸如45℃或更高、诸如46℃或更高、诸如47℃或更高、诸如48℃或更高、诸如49℃或更高、诸如50℃或更高、诸如51℃或更高、52℃或更高、53℃或更高、54℃或更高、55℃或更高、56℃或更高、57℃或更高、58℃或更高、59℃或更高、例如60℃或更高的温度下进行步骤b)。在某些实施方案中,在42℃和80℃之间,诸如50℃和75℃之间,例如60℃的温度下进行所述方法。例如,在42℃或更高、诸如43℃或更高、诸如44℃或更高、诸如45℃或更高、诸如46℃或更高、诸如47℃或更高、诸如48℃或更高、诸如49℃或更高、诸如50℃或更高、诸如51℃或更高、52℃或更高、53℃或更高、54℃或更高、55℃或更高、56℃或更高、57℃或更高、58℃或更高、59℃或更高、例如60℃或更高的温度下进行所述方法。
培养基或培养液包含本领域已知的可发酵碳源。在某些实施方案中,所述培养基包含含有碳水化合物的底物。具体地,戊糖或己糖,诸如葡萄糖、木糖或其混合物可以用作底物,或者所述培养基可以包含或由生物质水解物例如木质纤维素水解物组成。在本发明的上下文中,术语“木质纤维素水解物”意于表示木质纤维素生物质,其优选地已经进行预处理步骤,由此木质纤维素材料已经至少部分地分离成纤维素、半纤维素和木质素。木质纤维素材料通常可以源自植物材料,诸如稻草、干草、花园废弃物、粉碎的木材、果壳和种子壳。
最常用的预处理方法是酸水解,其中使木质纤维素材料经受酸(诸如硫酸)处理,由此糖聚合物纤维素和半纤维素部分地或完全地水解成它们的组分糖单体。另一种类型的木质纤维素水解是蒸汽爆炸(steam explosion),该工艺包括通过蒸汽注入将木质纤维素材料加热到190℃至230℃的温度。第三种方法是湿法氧化,其中在150℃至185℃下用氧气处理所述材料。预处理之后可以进行酶促水解以完成糖单体的释放。该预处理步骤使纤维素水解为葡萄糖或纤维二糖,而半纤维素被转化为戊糖木糖和阿拉伯糖以及己糖葡萄糖、半乳糖和甘露糖。在某些实施方案中,可以给预处理步骤补充使纤维素和半纤维素进一步水解的处理。这种额外的水解处理的目的是水解在纤维素和/或半纤维素来源的酸水解、湿法氧化或蒸汽爆炸过程中产生的寡糖和可能的多糖物质,以形成可发酵的糖(例如葡萄糖、木糖和可能的其它单糖)。这样的进一步处理可以是化学的或酶促的。化学水解通常通过在约100℃至150℃范围内的温度用酸处理(诸如用硫酸水溶液处理)来实现。酶促水解通常通过用一种或多种适当的酶(诸如纤维素酶、葡萄糖苷酶和半纤维素酶,包括木聚糖酶)处理来进行。
处理生物质以提取可发酵的糖可以通过物理、化学或生物学方法进行。木质纤维素由构成复杂微观结构的纤维素、半纤维素(木糖、阿拉伯糖、甘露糖等的均聚物和杂聚物的混合物)、果胶和木质素组成,其进化成抵抗微生物和昆虫的侵袭。因此它可以相对抵抗酶促分解,并且往往和不同的解构方法结合起来。通常进行预处理以使纤维素纤维更容易接触相应的酶(纤维素酶)、水解半纤维素和/或除去木质素。典型的工艺包括在100℃和220℃之间的温度下用稀酸或碱处理。由于其无定形结构,半纤维素在此步骤中更容易水解,并且其高达90%的糖可被回收。但是,这种方法也会产生糠醛和其它可能抑制微生物生长的产物。因此,有时使用统称为半纤维素酶的混合酶。另一方面,纤维素构造在微晶纤维中,并且不易水解,但在预处理过程中细胞壁基质的降解使其更容易被酶接触。纤维素酶包括:1)作用于纤维素分子中间的内切葡聚糖酶;2)从纤维素末端释放纤维二糖的纤维二糖水解酶;3)将纤维二糖水解成葡萄糖的β-D-葡萄糖苷酶。如上所述,酶促水解可以作为单独的步骤进行(独立的水解和发酵,SHF)或与发酵同时进行(SSF)。最近,已经提出了一种使用生物质-衍生的γ-戊内酯完全溶解木质纤维素的补充方法。
另一种有吸引力的方法是联合生物加工(CBP),它将酶生产、糖化和发酵组合在一个步骤中。这可以通过设计本公开的生产细胞表达异源代谢途径以降解和利用生物质来完成。
可替换地,特别是在如上所述的较高温度下,本公开的细胞可以与能够降解和利用生物质的另一种细胞一起培养。使用此设置,一种微生物(例如热纤梭菌)会降解生物质并为其它微生物提供必要的底物,这可以产生如上所述的挥发性化合物。
在某些实施方案中,所述培养基包含葡萄糖、木糖或其混合物。例如,所述培养基可包含0.1%至20%(w/vol)葡萄糖、木糖或其混合物。例如,所述培养基包含0.1%至15%(w/vol)葡萄糖、木糖或其混合物,诸如0.5%至15%(w/vol)葡萄糖、木糖或其混合物,诸如1%至10%(w/vol)葡萄糖、木糖或其混合物,诸如2%至10%(w/vol)葡萄糖、木糖或其混合物,诸如5%至10%(w/vol)葡萄糖、木糖或其混合物,诸如5%至7.5%(w/vol)葡萄糖、木糖或其混合物。在某些实施方案中,所述培养基包含至少0.1%(w/vol)葡萄糖、木糖或其混合物,诸如至少0.25%(w/vol)、诸如至少0.5%(w/vol)、诸如至少0.75%(w/vol)、诸如至少1%(w/vol)、诸如至少2.5%(w/vol)、诸如至少5%(w/vol)、诸如至少10%(w/vol)、诸如至少15%(w/vol)、诸如20%(w/vol)葡萄糖、木糖或其混合物。
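上述以%(w/vol)表示的底物浓度可以按1%(w/vol)=10 g/L换算为质量浓度和摩尔浓度。下面是一个简短的换算草图;摩尔质量取常用近似值(葡萄糖约180.16 g/mol,木糖约150.13 g/mol),函数名为说明用的假设。

```python
# Approximate molar masses in g/mol (standard textbook values)
MOLAR_MASS = {"glucose": 180.16, "xylose": 150.13}


def percent_wv_to_g_per_l(percent_wv: float) -> float:
    """1% (w/vol) is 1 g per 100 mL, i.e. 10 g/L."""
    return percent_wv * 10.0


def percent_wv_to_mmol_per_l(percent_wv: float, compound: str) -> float:
    """Convert % (w/vol) of a single sugar to mmol/L."""
    grams_per_l = percent_wv_to_g_per_l(percent_wv)
    return grams_per_l / MOLAR_MASS[compound] * 1000.0


print(percent_wv_to_g_per_l(5.0))                          # 5% (w/vol) in g/L
print(round(percent_wv_to_mmol_per_l(1.0, "glucose"), 1))  # 1% glucose in mmol/L
```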
在某些实施方案中,所述嗜热细胞是产乙酸嗜热细胞,特别是产乙酸细菌细胞,其已经被工程化以生产丙酮、丁酮或异丙醇。这样的细胞能够将一氧化碳、二氧化碳、氢或其混合物转化为乙酰辅酶A,乙酰辅酶A是上述化合物的底物或协同底物。例如,产乙酸物种包括热醋穆尔氏菌(Moorella thermoacetica)、热自养穆尔氏菌(Moorella thermoautotrophica)和凯伍热厌氧杆菌(Thermoanaerobacter kivui)。
在希望产生丙酮的实施方案中,所述培养基可有利地进一步包含乙酸或乙酸盐。在某些实施方案中,所述培养基包含0.05%至5%(w/vol)乙酸或乙酸盐。例如,所述培养基包含0.05%至5%(w/vol)乙酸或乙酸盐或其混合物,诸如0.1%至5%(w/vol)、诸如0.5%至5%(w/vol)、诸如1%至5%(w/vol)、诸如2%至4%(w/vol)、诸如3%(w/vol)乙酸或乙酸盐或其混合物。在某些实施方案中,所述培养基包含至少0.05%(w/vol)乙酸或乙酸盐或其混合物,诸如至少0.1%(w/vol)、诸如至少0.5%(w/vol)、诸如至少1%(w/vol)、诸如至少2%(w/vol)、诸如至少3%(w/vol)、诸如至少4%(w/vol)、诸如5%(w/vol)乙酸或乙酸盐或其混合物。
如上文描述的,所述细胞也可以已经被工程化成更高效地合成乙酰辅酶A以用作底物,或者它可以与能够由可发酵的碳源生产乙酰辅酶A的微生物一起培养。
在希望产生丁酮的实施方案中,所述培养基可有利地进一步包含丙酸或丙酸盐。在某些实施方案中,所述培养基包含0.05%至2%(w/vol)丙酸或丙酸盐。例如,所述培养基包含0.05%至2%(w/vol)丙酸、丙酸盐或其混合物,诸如0.1%至2%(w/vol)、诸如0.5%至2%(w/vol)、诸如1%至2%(w/vol)丙酸、丙酸盐或其混合物。在某些实施方案中,所述培养基包含至少0.05%(w/vol)丙酸、丙酸盐或其混合物,诸如至少0.1%(w/vol)、诸如至少0.5%(w/vol)、诸如至少1%(w/vol)、诸如2%(w/vol)丙酸、丙酸盐或其混合物。
如上文描述的,所述细胞也可以已经被工程化成合成丙酰辅酶A以用作底物,或者它可以与能够由可发酵的碳源生产丙酰辅酶A的微生物一起培养。
可以在本领域已知的连续发酵装置中培养本发明的嗜热细胞。这可以是特别有利的,因为它还允许连续的产物回收,从而防止反馈抑制和产物毒性。因为所述细胞是嗜热的并且培养在比通常用于发酵嗜温细胞的温度更高的温度下进行,所以挥发性化合物将至少部分地蒸发并且能够容易地从嗜热细胞产生的废气中回收。因此,在某些实施方案中,本文公开的方法进一步包含从在发酵过程中产生的废气中回收一种或多种挥发性化合物。在某些实施方案中,这通过冷凝完成。
在此装置中,废气连续地从生物反应器排出并被冷却至低于感兴趣化合物沸点的温度。这使得该化合物从气相变为液相,并在此时被收集。可替换地,可以用温度低于感兴趣化合物沸点的溶剂(诸如水)洗涤废气。该过程产生该化学品的饱和溶液。可替换地,可以使废气穿过过滤器,例如活性炭,其结合产物。
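上述冷凝回收的判断条件(冷却温度低于目标化合物的沸点)可以用如下草图表示。沸点取常压下的近似文献值(丙酮约56 °C,丁酮约80 °C,异丙醇约82 °C),属于本示例的假设;实际冷凝效率还取决于废气中该化合物的分压,此处未作建模。

```python
# Approximate normal boiling points in degrees Celsius (literature values, rounded)
BOILING_POINT_C = {
    "acetone": 56.1,
    "butanone": 79.6,
    "isopropanol": 82.5,
}


def condenser_is_below_boiling_point(compound: str, condenser_temp_c: float) -> bool:
    """True if the condenser is colder than the compound's normal boiling point."""
    return condenser_temp_c < BOILING_POINT_C[compound]


def compounds_recoverable(condenser_temp_c: float) -> list[str]:
    """List the compounds whose boiling point lies above the condenser temperature."""
    return [name for name in BOILING_POINT_C
            if condenser_is_below_boiling_point(name, condenser_temp_c)]


print(compounds_recoverable(4.0))    # cold condenser
print(compounds_recoverable(60.0))   # condenser held at culture temperature
```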
用于生产丙酮、丁酮和/或异丙醇的方法
本方法优选地允许以至少0.8g/L,诸如至少0.9g/L,诸如至少1.0g/L,诸如至少1.1g/L,诸如至少1.2g/L,诸如至少1.3g/L,诸如至少1.4g/L,诸如至少1.5g/L,诸如至少1.6g/L,诸如至少1.7g/L,诸如至少1.8g/L,诸如至少1.9g/L,诸如至少2.0g/L,诸如至少5g/L,诸如至少7.5g/L,诸如至少10g/L,诸如至少12.5g/L,诸如至少15g/L,诸如至少20g/L,诸如至少25g/L,诸如至少50g/L,诸如至少75g/L,诸如至少100g/L,诸如至少150g/L,诸如至少250g/L或更高滴度生产丙酮。
在某些实施方案中,所述方法用于生产至少丙酮,且所述第一种酶选自由以下组成的组:CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)和Slip_0880(SEQ ID NO:7),或与其具有至少70%同源性、相似性或同一性的其功能性变体,优选Caur_1461(SEQ ID NO:3)、Rxy2(SEQ ID NO:60)、Slip_0880(SEQ ID NO:7)和Dde1(SEQ ID NO:59)。
在某些实施方案中,所述嗜热细胞用于生产至少丙酮,所述细胞表达:
a)第一种酶,所述第一种酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Tle2,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:CHY_1288(SEQ ID NO:15)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1540(SEQ ID NO:10)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0880(SEQ ID NO:7)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Tle2和Cac,或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞用于生产至少丙酮,所述细胞表达:
a)第一种酶,所述第一种酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Rxy2(SEQ ID NO:60)、Slip_0880(SEQ ID NO:7)、Dde1(SEQ ID NO:59)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Dde2(EC 2.8.3.5)(SEQ ID NO:21),或与其具有至少70%同源性、相似性或同一性的其功能性变体,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:CHY_1288(SEQ ID NO:15)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1540(SEQ ID NO:10)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0880(SEQ ID NO:7)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在某些实施方案中,所述嗜热细胞用于生产至少丙酮,所述细胞表达:
a)第一种酶,所述第一种酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Rxy2(SEQ ID NO:60)、Slip_0880(SEQ ID NO:7)和Dde1(SEQ ID NO:59)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Ghh2(EC 2.8.3.5),其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:CHY_1288(SEQ ID NO:15)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1540(SEQ ID NO:10)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0880(SEQ ID NO:7)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞用于生产至少丙酮,所述细胞表达:
a)第一种酶,所述第一种酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Rxy2(SEQ ID NO:60)、Slip_0880(SEQ ID NO:7)和Dde1(SEQ ID NO:59)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Tme(EC 2.8.3.8),其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的功能性变体。
在一个实施方案中,所述嗜热细胞表达:CHY_1288(SEQ ID NO:15)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体,其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1540(SEQ ID NO:10)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0880(SEQ ID NO:7)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞用于生产至少丙酮,所述细胞表达:
a)第一种酶,所述第一种酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Pth(EC 2.8.3.1)(SEQ ID NO:26)或与其具有至少70%同源性、相似性或同一性的功能性变体,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的功能性变体。
在一个实施方案中,所述嗜热细胞表达:CHY_1288(SEQ ID NO:15)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1540(SEQ ID NO:10)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0880(SEQ ID NO:7)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在某些实施方案中,所述嗜热细胞用于生产至少丙酮,所述细胞表达:
a)第一种酶,所述第一种酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Rma(EC 3.1.2.-)(SEQ ID NO:27)或与其具有至少70%同源性、相似性或同一性的功能性变体,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的功能性变体。
在一个实施方案中,所述嗜热细胞表达:CHY_1288(SEQ ID NO:15)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1540(SEQ ID NO:10)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0880(SEQ ID NO:7)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
优选地,所述嗜热细胞能够生产至少丙酮,并且表达如在SEQ ID NO:28中所示的Cac或与其具有至少70%同一性或相似性的其功能性变体以及第一种和第二种酶的下述组合之一:
i)如在SEQ ID NO:59中所示的Dde1和如在SEQ ID NO:21中所示的Dde2;或与其具有至少70%同一性或相似性的其功能性变体;或
ii)如在SEQ ID NO:3中所示的Caur_1461和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
iii)如在SEQ ID NO:7中所示的Slip_0880和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;
其中Tle2由如在SEQ ID NO:19中所示的Tle2亚基A(EC 2.8.3.8)和如在SEQ ID NO:20中所示的Tle2亚基B(EC 2.8.3.9),或与其具有至少70%同一性或相似性的其功能性变体组成。
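上文针对丙酮生产逐一列举的实施方案,本质上是第一种酶与第二种酶的组合,再加上乙酰乙酸脱羧酶Cac。下面的草图把a)项所列的第一种酶和各第二种酶连同其SEQ ID NO整理为数据结构并枚举组合,仅用于帮助阅读;变量名和函数名均为说明用的假设,并非本文方法的组成部分。

```python
from itertools import product

# First enzymes listed under a) for acetone production, with their SEQ ID NOs
FIRST_ENZYMES = {
    "CHY_1288": 15, "CHY_1355": 18, "Caur_1540": 10, "GHH_c20420": 1,
    "Caur_1461": 3, "Dde1": 59, "Rxy2": 60, "Slip_0880": 7,
}

# Second enzymes; two-subunit enzymes carry (subunit A, subunit B) SEQ ID NOs
SECOND_ENZYMES = {
    "Tle2": (19, 20), "Dde2": (21,), "Ghh2": (22, 23),
    "Tme": (24, 25), "Pth": (26,), "Rma": (27,),
}

DECARBOXYLASE = ("Cac", 28)

# Preferred first/second enzyme pairs named in the text for acetone
PREFERRED_PAIRS = {("Dde1", "Dde2"), ("Caur_1461", "Tle2"), ("Slip_0880", "Tle2")}


def acetone_combinations() -> list[tuple[str, str, str]]:
    """Enumerate every (first enzyme, second enzyme, decarboxylase) combination."""
    return [(first, second, DECARBOXYLASE[0])
            for first, second in product(FIRST_ENZYMES, SECOND_ENZYMES)]


combos = acetone_combinations()
print(len(combos))
print([c for c in combos if (c[0], c[1]) in PREFERRED_PAIRS])
```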
本方法能够以至少0.05g/L,诸如至少0.075g/L,诸如至少0.1g/L,诸如至少0.2g/L,诸如至少0.3g/L,诸如至少0.4g/L,诸如至少0.5g/L,诸如至少0.75g/L,诸如至少1.0g/L,诸如至少2.0g/L,诸如至少3.0g/L,诸如至少4.0g/L,诸如至少5.0g/L,诸如至少7.5g/L,诸如至少10.0g/L,诸如至少25g/L,诸如至少50g/L,诸如至少75g/L,诸如至少100g/L,诸如至少150g/L,诸如至少250g/L或更高的滴度生产丁酮。
在某些实施方案中,所述方法用于生产至少丁酮,且所述第一种酶选自由以下组成的组:GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6),或与其具有至少70%同源性、相似性或同一性的其功能性变体,优选地所述第一种酶是GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)或Slip_0479(SEQ ID NO:4)。
在某些实施方案中,所述嗜热细胞用于生产至少丁酮,所述细胞表达:
a)第一种酶,所述第一种酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Tle2,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的功能性变体。
在一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0499(SEQ ID NO:2)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0479(SEQ ID NO:4)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Tfu_1520(SEQ ID NO:5)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Tfu_0436(SEQ ID NO:6)、Tle2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞用于生产至少丁酮,所述细胞表达:
a)第一种酶,所述第一种酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Dde2(EC 2.8.3.5)(SEQ ID NO:21)或与其具有至少70%同源性、相似性或同一性的其功能性变体,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0499(SEQ ID NO:2)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0479(SEQ ID NO:4)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Tfu_1520(SEQ ID NO:5)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Tfu_0436(SEQ ID NO:6)、Dde2(SEQ ID NO:21)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在某些实施方案中,所述嗜热细胞用于生产至少丁酮,所述细胞表达:
a)第一种酶,所述第一种酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Ghh2(EC 2.8.3.5),其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0499(SEQ ID NO:2)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0479(SEQ ID NO:4)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Tfu_1520(SEQ ID NO:5)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Tfu_0436(SEQ ID NO:6)、Ghh2和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞用于生产至少丁酮,所述细胞表达:
a)第一种酶,所述第一种酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6)以及与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Tme(EC 2.8.3.8),其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0499(SEQ ID NO:2)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Slip_0479(SEQ ID NO:4)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Tfu_1520(SEQ ID NO:5)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在另一个实施方案中,所述嗜热细胞表达:Tfu_0436(SEQ ID NO:6)、Tme和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体;其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞用于生产至少丁酮,所述细胞表达:
a)第一种酶,所述第一种酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Pth(EC 2.8.3.1)(SEQ ID NO:26)或与其具有至少70%同源性、相似性或同一性的其功能性变体,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0499(SEQ ID NO:2)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0479(SEQ ID NO:4)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Tfu_1520(SEQ ID NO:5)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Tfu_0436(SEQ ID NO:6)、Pth(SEQ ID NO:26)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在某些实施方案中,所述嗜热细胞用于生产至少丁酮,所述细胞表达:
a)第一种酶,所述第一种酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)和Tfu_0436(SEQ ID NO:6)和与其具有至少70%同源性、相似性或同一性的其功能性变体;
b)第二种酶,所述第二种酶为Rma(EC 3.1.2.-)(SEQ ID NO:27)或与其具有至少70%同源性、相似性或同一性的其功能性变体,和
c)乙酰乙酸脱羧酶Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在一个实施方案中,所述嗜热细胞表达:GHH_c20420(SEQ ID NO:1)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0499(SEQ ID NO:2)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Caur_1461(SEQ ID NO:3)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Slip_0479(SEQ ID NO:4)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Dde1(SEQ ID NO:59)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Rxy2(SEQ ID NO:60)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Tfu_1520(SEQ ID NO:5)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
在另一个实施方案中,所述嗜热细胞表达:Tfu_0436(SEQ ID NO:6)、Rma(SEQ ID NO:27)和Cac(SEQ ID NO:28),或与其具有至少70%同源性、相似性或同一性的其功能性变体。
优选地,所述嗜热细胞能够生产至少丁酮,并且表达如在SEQ ID NO:28中所示的Cac或与其具有至少70%同一性或相似性的其功能性变体以及第一种和第二种酶的下述组合之一:
i)如在SEQ ID NO:3中所示的Caur_1461和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
ii)如在SEQ ID NO:1中所示的GHH_c20420和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
iii)如在SEQ ID NO:2中所示的Slip_0499和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
iv)如在SEQ ID NO:4中所示的Slip_0479和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;
其中Tle2由如在SEQ ID NO:19中所示的Tle2亚基A(EC 2.8.3.8)和如在SEQ ID NO:20中所示的Tle2亚基B(EC 2.8.3.9),或与其具有至少70%同一性或相似性的其功能性变体组成。
上述嗜热细胞例如在合适的底物(诸如乙酸,其可以由细胞合成)的存在下,能够由乙酰辅酶A生产丙酮,和/或例如在合适的底物(诸如丙酸)的存在下,能够由丙酰辅酶A生产丁酮。给培养液补充乙酸可增加滴度。可以在发酵中提供丙酰辅酶A。所述细胞也可能已经被工程化成能够合成丙酰辅酶A和/或乙酰辅酶A,或者与相应的非工程化细胞相比以更大的量合成丙酰辅酶A和/或乙酰辅酶A。任何上述嗜热细胞除上述方面之外还可以表达异丙醇脱氢酶,特别是Tbr(SEQ ID NO:29)或与其具有至少70%同源性、相似性或同一性的功能性变体。这使得由嗜热细胞产生的(或提供给细胞的)丙酮中的至少部分转化为异丙醇。
在某些实施方案中,至少产生异丙醇。所述异丙醇滴度优选地是至少0.05g/L,诸如至少0.075g/L,诸如至少0.1g/L,诸如至少0.2g/L,诸如至少0.3g/L,诸如至少0.4g/L,诸如至少0.5g/L,诸如至少0.75g/L,诸如至少1.0g/L,诸如至少2.0g/L,诸如至少3.0g/L,诸如至少4.0g/L,诸如至少5.0g/L,诸如至少7.5g/L,诸如至少10.0g/L,诸如至少12.5g/L,诸如至少15g/L,诸如至少20g/L,诸如至少25g/L,诸如至少50g/L,诸如至少75g/L,诸如至少100g/L,诸如至少150g/L,诸如至少250g/L或更高。
优选地,所述嗜热细胞能够生产至少丙酮和异丙醇,并且表达如在SEQ ID NO:28中所示的Cac和Tbr(SEQ ID NO:29)或与其具有至少70%同一性或相似性的其功能性变体以及第一种和第二种酶的下述组合之一:
i)如在SEQ ID NO:59中所示的Dde1和如在SEQ ID NO:21中所示的Dde2;或与其具有至少70%同一性或相似性的其功能性变体;或
ii)如在SEQ ID NO:3中所示的Caur_1461和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
iii)如在SEQ ID NO:7中所示的Slip_0880和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;
其中Tle2由如在SEQ ID NO:19中所示的Tle2亚基A(EC 2.8.3.8)和如在SEQ ID NO:20中所示的Tle2亚基B(EC 2.8.3.9),或与其具有至少70%同一性或相似性的其功能性变体组成。
在某些实施方案中,所述嗜热细胞能够生产至少丁酮和异丙醇,并且表达如在SEQ ID NO:28中所示的Cac和Tbr(SEQ ID NO:29)或与其具有至少70%同一性或相似性的其功能性变体以及第一种和第二种酶的下述组合之一:
i)如在SEQ ID NO:3中所示的Caur_1461和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
ii)如在SEQ ID NO:1中所示的GHH_c20420和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
iii)如在SEQ ID NO:2中所示的Slip_0499和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;或
iv)如在SEQ ID NO:4中所示的Slip_0479和Tle2;或与其具有至少70%同一性或相似性的其功能性变体;
其中Tle2由如在SEQ ID NO:19中所示的Tle2亚基A(EC 2.8.3.8)和如在SEQ IDNO:20中所示的Tle2亚基B(EC 2.8.3.9),或与其具有至少70%同一性或相似性的其功能性变体组成。
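Thermophilic cells and nucleic acid constructs
Useful thermophilic cells have been described in detail above. Once the person skilled in the art has determined which enzymes are to be expressed in the thermophilic cell of the present disclosure, he or she will have no difficulty doing so.
The enzymes can be expressed by introducing a nucleic acid sequence encoding each of them into the cell, for example on a plasmid, or by genomic integration. For example, the genes can be inserted into a replicating plasmid, which is then transformed into the cell by electroporation. A gene encoding an antibiotic resistance marker on the same plasmid ensures that only transformed cells survive in medium containing the corresponding antibiotic, although other selection systems can also be used. Genomic integration can be achieved, for example, by using a plasmid carrying the gene of interest, such as a temperature-sensitive plasmid. Under suitable conditions at the non-permissive temperature, the plasmid undergoes a double crossover by homologous recombination, resulting in seamless, marker-free integration into the genomic DNA. Gene expression can be controlled as known in the art, for example by using suitable vectors, plasmids, promoters or codon optimization. For example, in embodiments where the thermophilic cell is a Geobacillus cell, in particular a Geobacillus thermoglucosidasius cell, the methods described in Pogrebnyakov et al., 2017 can be employed.
Accordingly, the thermophilic cells disclosed herein may comprise one or more polynucleotides encoding the first enzyme, the second enzyme and the acetoacetate decarboxylase as described above, and optionally encoding the isopropanol dehydrogenase as described above. Each of the polynucleotides may encode a single enzyme, or it may encode several enzymes which are then expressed simultaneously.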
因此,本文中还提供了一种用于修饰选自嗜热细菌细胞和嗜热古细菌细胞的嗜热细胞的核酸构建体,其包含:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)、Tfu_0436(SEQ ID NO:6)、Slip_0880(SEQ IDNO:7)、Tfu_2394(SEQ ID NO:8)、Slip_1236(SEQ ID NO:9)、Caur_1540(SEQ ID NO:10)、Tfu_0253(SEQ ID NO:11)、CHY_1604(SEQ ID NO:14)、CHY_1288(SEQ ID NO:15)、Slip_2085(SEQ ID NO:16)、Slip_0465(SEQ ID NO:17)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和CHY_1355(SEQ ID NO:18)和/或EC编号2.3.3.20的酶,该EC编号2.3.3.20的酶选自3-氧代酰基-ACP合酶SVA_3859(SEQ ID NO:12)和酰基辅酶A:酰基辅酶A烷基转移酶Despr_2661(SEQ ID NO:13);
ii)编码第二种酶的多核苷酸,所述第二种酶选自乙酸辅酶A转移酶、3-氧代酸辅酶A转移酶、酰基辅酶A:乙酸/3-酮酸辅酶A转移酶和酰基辅酶A硫酯酶II,
其中所述第二种酶选自:Tle2、Dde2(EC 2.8.3.5)(SEQ ID NO:21)、Ghh2(EC2.8.3.5)、Tme(EC 2.8.3.8)、Pth(EC 2.8.3.1)(SEQ ID NO:26)和Rma(EC 3.1.2.-)(SEQID NO:27),或与其具有至少70%同源性、相似性或同一性的其功能性变体,
其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)组成,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)组成,且其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成;和
iii)编码乙酰乙酸脱羧酶(EC 4.1.1.4)或与其具有至少70%同源性、相似性或同一性的其功能性变体,优选Cac(SEQ ID NO:28)的多核苷酸。
具体地,本文提供了一种用于修饰选自嗜热细菌细胞和嗜热古细菌细胞的嗜热细胞的核酸构建体,其包含:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同一性或相似性的具有乙酰辅酶A乙酰基转移酶活性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自如在SEQ ID NO:1中所示的GHH_c20420、如在SEQ ID NO:2中所示的Slip_0499、如在SEQ ID NO:3中所示的Caur_1461、如在SEQ ID NO:4中所示的Slip_0479、如在SEQ ID NO:7中所示的Slip_0880和如在SEQ ID NO:59中所示的Dde1;
ii)编码第二种酶的多核苷酸,所述第二种酶选自乙酸辅酶A转移酶、3-氧代酸辅酶A转移酶、酰基辅酶A:乙酸/3-酮酸辅酶A转移酶和酰基辅酶A硫酯酶II,
其中所述第二种酶选自:Tle2和如在SEQ ID NO:21中所示的Dde2(EC 2.8.3.5),或与其具有至少70%同一性或相似性的具有乙酸辅酶A转移酶、3-氧代酸辅酶A转移酶、酰基辅酶A:乙酸/3-酮酸辅酶A转移酶或酰基辅酶A硫酯酶II活性的其功能性变体,
其中Tle2由如在SEQ ID NO:19中所示的Tle2亚基A(EC 2.8.3.8)和如在SEQ IDNO:20中所示的Tle2亚基B(EC 2.8.3.9),或与其具有至少70%同一性或相似性的具有乙酸辅酶A转移酶、3-氧代酸辅酶A转移酶、酰基辅酶A:乙酸/3-酮酸辅酶A转移酶或酰基辅酶A硫酯酶II活性的其功能性变体组成,和
iii)编码乙酰乙酸脱羧酶(EC 4.1.1.4)或与其具有至少70%同一性或相似性的具有乙酰乙酸脱羧酶活性的其功能性变体的多核苷酸,其中所述乙酰乙酸脱羧酶是如在SEQ ID NO:28中所示的Cac,和
iv)任选的编码异丙醇脱氢酶(EC 1.1.1.80)的多核苷酸,其中所述异丙醇脱氢酶是如在SEQ ID NO:29中所示的Tbr或与其具有至少70%同一性或相似性的具有异丙醇脱氢酶活性的其功能性变体。
每种多核苷酸的表达可以是在诱导型启动子的控制下或在组成型启动子的控制下。
因此,本文中还公开了用于修饰嗜热细胞、尤其是嗜热细菌细胞或嗜热古细菌细胞的核酸构建体,其可以用于构建本公开的嗜热细胞,即能够生产丙酮、丁酮和/或异丙醇的细胞。
所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或其功能性变体的多核苷酸,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性,其中所述乙酰辅酶A乙酰基转移酶选自GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)、Tfu_0436(SEQ ID NO:6)、Slip_0880(SEQ ID NO:7)、Tfu_2394(SEQ ID NO:8)、Slip_1236(SEQ IDNO:9)、Caur_1540(SEQ ID NO:10)、Tfu_0253(SEQ ID NO:11)、CHY_1604(SEQ ID NO:14)、CHY_1288(SEQ ID NO:15)、Slip_2085(SEQ ID NO:16)、Slip_0465(SEQ ID NO:17)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和CHY_1355(SEQ ID NO:18)和/或EC编号2.3.3.20的酶,该EC编号2.3.3.20的酶选自3-氧代酰基-ACP合酶SVA_3859(SEQ ID NO:12)和酰基辅酶A:酰基辅酶A烷基转移酶Despr_2661(SEQ ID NO:13);
ii)编码第二种酶的多核苷酸,所述第二种酶选自乙酸辅酶A转移酶、3-氧代酸辅酶A转移酶、酰基辅酶A:乙酸/3-酮酸辅酶A转移酶和酰基辅酶A硫酯酶II,
其中所述第二种酶选自:Tle2、Dde2(EC 2.8.3.5)(SEQ ID NO:21)、Ghh2(EC2.8.3.5)、Tme(EC 2.8.3.8)、Pth(EC 2.8.3.1)(SEQ ID NO:26)和Rma(EC 3.1.2.-)(SEQID NO:27),或其功能性变体;所述其功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性,
其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20)组成,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23)组成,且其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或其功能性变体组成,所述功能性变体与其具有至少70%同源性、相似性或同一性,与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性,和
iii)编码乙酰乙酸脱羧酶(EC 4.1.1.4)或与其具有至少70%同源性、相似性或同一性的其功能性变体,优选Cac(SEQ ID NO:28)的多核苷酸,所述功能性变体与其具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性。
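The nucleic acid construct may comprise or consist of one or more polynucleotides. It is to be understood that the term "nucleic acid construct" may denote one nucleic acid molecule or a plurality of nucleic acid molecules comprising the relevant nucleic acid sequences. The nucleic acid construct may thus be one nucleic acid molecule, which may encode several enzymes, or it may be several nucleic acid molecules, each comprising a sequence encoding one enzyme. The relevant nucleic acid sequences may thus be comprised on one vector or on several vectors. They may also be integrated in the genome, on one chromosome or even together at one location, or they may be integrated on different chromosomes. It is also possible to have some sequences on one or more vectors and some sequences integrated in the genome.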
所述核酸构建体包含编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)的多核苷酸,其如本文别处所述。在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码GHH_c20420(诸如在SEQ ID NO:1中所示)或与其具有至少70%同源性、相似性或同一性的功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:30或与其具有至少70%同源性、相似性或同一性的其同源物或由SEQ ID NO:30或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Slip_0499(诸如在SEQ ID NO:2中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:31或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:31或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Caur_1461(诸如在SEQ ID NO:3中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:32或SEQ ID NO:63或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:32或SEQ ID NO:63或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Slip_0479(诸如在SEQ ID NO:4中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:33或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:33或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Dde1(诸如在SEQ ID NO:59中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:61或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:61或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Rxy2(诸如在SEQ ID NO:60中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:62或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:62或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Tfu_1520(诸如在SEQ ID NO:5中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:34或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:34或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Tfu_0436(诸如在SEQ ID NO:6中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:35或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:35或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Slip_0880(诸如在SEQ ID NO:7中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:36或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:36或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Tfu_2394(诸如在SEQ ID NO:8中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:37或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:37或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Slip_1236(诸如在SEQ ID NO:9中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:38或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:38或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Caur_1540(诸如在SEQ ID NO:10中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:39或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:39或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Tfu_0253(诸如在SEQ ID NO:11中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:40或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:40或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码CHY_1604(诸如在SEQ ID NO:14中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:43或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:43或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码CHY_1288(诸如在SEQ ID NO:15中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:44或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:44或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Slip_2085(诸如在SEQ ID NO:16中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:45或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:45或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Slip_0465(诸如在SEQ ID NO:17中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:46或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:46或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码CHY_1355(诸如在SEQ ID NO:18中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:47或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:47或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在优选的实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQ ID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)或Slip_0880(SEQ ID NO:7),或与其具有至少70%同源性、相似性或同一性的其功能性变体。因此,在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:44、SEQ ID NO:47、SEQ ID NO:39、SEQ ID NO:30、SEQ ID NO:32、SEQ ID NO:63、SEQ ID NO:61、SEQ ID NO:62或SEQ ID NO:36,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。在特定实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Caur_1461(SEQ IDNO:3)、Dde1(SEQ ID NO:59)或Slip_0880(SEQ ID NO:7),或与其具有至少70%同源性、相似性或同一性的其功能性变体。因此,在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸包含以下序列或由其组成:SEQ ID NO:32、SEQ ID NO:61或SEQ ID NO:36,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
在其它优选的实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ ID NO:4)、Tfu_1520(SEQ ID NO:5)或Tfu_0436(SEQ ID NO:6),或与其具有至少70%同源性、相似性或同一性的其功能性变体。因此,在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:32、SEQ ID NO:63、SEQ IDNO:33、SEQ ID NO:34、SEQ ID NO:61、SEQ ID NO:62或SEQ ID NO:35,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。在特定实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸编码Caur_1461(SEQID NO:3)或Dde1(SEQ ID NO:59),或与其具有至少70%同源性、相似性或同一性的其功能性变体。因此,在某些实施方案中,所述编码乙酰辅酶A乙酰基转移酶的多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:32、SEQ ID NO:63、SEQ ID NO:33或SEQ ID NO:61,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码保留乙酰辅酶A乙酰基转移酶活性的酶。
所述核酸构建体进一步包含编码选自乙酸辅酶A转移酶、3-氧代酸辅酶A转移酶、酰基辅酶A:乙酸/3-酮酸辅酶A转移酶和酰基辅酶A硫酯酶II的第二种酶的多核苷酸。在某些实施方案中,所述第二种酶是Tle2。该酶由两个亚基组成;编码Tle2的多核苷酸因此优选地编码如在SEQ ID NO:19和SEQ ID NO:20中分别所示的Tle2的亚基A和亚基B。在某些实施方案中,所述多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:48和SEQ ID NO:49,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码在一起保留乙酰辅酶A转移酶活性的亚基。
在某些实施方案中,所述编码第二种酶的多核苷酸编码Dde2(诸如在SEQ ID NO:21中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:50或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:50或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留3-氧代酸转移酶活性的酶。
在某些实施方案中,所述第二种酶是Ghh2。该酶由两个亚基组成;编码Ghh2的多核苷酸因此优选地编码如在SEQ ID NO:22和SEQ ID NO:23中分别所示的Ghh2的亚基A和亚基B。在某些实施方案中,所述多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:51和SEQID NO:52,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码在一起保留3-氧代酸转移酶活性的亚基。
在某些实施方案中,所述第二种酶是Tme。该酶由两个亚基组成;编码Tme的多核苷酸因此优选地编码如在SEQ ID NO:24和SEQ ID NO:25中分别所示的Tme的亚基A和亚基B。在某些实施方案中,所述多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:53和SEQID NO:54,或与其具有至少70%同源性、相似性或同一性的其同源物,该同源物编码在一起保留3-氧代酸转移酶活性的亚基。
在某些实施方案中,所述编码第二种酶的多核苷酸编码Pth(诸如在SEQ ID NO:26中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:26或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:26或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留酰基辅酶A:乙酸/3-酮酸辅酶A转移酶活性的酶。
在某些实施方案中,所述编码第二种酶的多核苷酸编码Rma(诸如在SEQ ID NO:27中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:56或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:56或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留酰基辅酶A硫酯酶活性的酶。
在优选的实施方案中,所述多核苷酸编码Tle2或与其具有至少70%同源性、相似性或同一性的其功能性变体。因此,在优选的实施方案中,所述编码第二种酶的多核苷酸包含以下序列或由以下序列组成:SEQ ID NO:48和SEQ ID NO:49,或与其具有至少70%同源性、相似性或同一性的其同源物。
所述核酸构建体进一步包含编码乙酰乙酸脱羧酶的多核苷酸。优选地,所述乙酰乙酸脱羧酶是Cac(诸如在SEQ ID NO:28中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:57或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:57或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留乙酰乙酸脱羧酶活性的酶。
所述核酸构建体可以进一步包含编码异丙醇脱氢酶(EC 1.1.1.80)的多核苷酸。优选地,所述异丙醇脱氢酶是Tbr(诸如在SEQ ID NO:29中所示)或与其具有至少70%同源性、相似性或同一性的其功能性变体。在某些实施方案中,所述多核苷酸包含SEQ ID NO:58或与其具有至少70%同源性、相似性或同一性的其同源物或者由SEQ ID NO:58或与其具有至少70%同源性、相似性或同一性的其同源物组成,该同源物编码保留异丙醇脱氢酶活性的酶。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7);优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Tle2的多核苷酸,其中Tle2由Tle2亚基A(EC 2.8.3.8)(SEQ ID NO:19)和Tle2亚基B(EC 2.8.3.9)(SEQ ID NO:20),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7);优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Dde2(EC 2.8.3.5)(SEQ ID NO:21)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7);优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Ghh2(EC 2.8.3.5)的多核苷酸,其中Ghh2由Ghh2亚基A(SEQ ID NO:22)和Ghh2亚基B(SEQ ID NO:23),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7);优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Tme(EC 2.8.3.8)的多核苷酸,其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25)或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7);优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Pth(EC 2.8.3.1)(SEQ ID NO:26)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码乙酰辅酶A乙酰基转移酶(EC 2.3.1.9)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,其中所述乙酰辅酶A乙酰基转移酶选自CHY_1288(SEQ ID NO:15)、CHY_1355(SEQ ID NO:18)、Caur_1540(SEQ ID NO:10)、GHH_c20420(SEQID NO:1)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)和Slip_0880(SEQ ID NO:7);优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Rma(EC 3.1.2.-)(SEQ ID NO:27)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
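In certain embodiments, the nucleic acid construct comprises or consists of:
i) a polynucleotide encoding GHH_c20420 (SEQ ID NO:1), Slip_0499 (SEQ ID NO:2), Caur_1461 (SEQ ID NO:3), Dde1 (SEQ ID NO:59), Rxy2 (SEQ ID NO:60), Slip_0479 (SEQ ID NO:4), Tfu_1520 (SEQ ID NO:5) or Tfu_0436 (SEQ ID NO:6), or a functional variant thereof having at least 70% homology, similarity or identity thereto; preferably the acetyl-CoA acetyltransferase is selected from Caur_1461 (SEQ ID NO:3), Dde1 (SEQ ID NO:59) and Slip_0880 (SEQ ID NO:7);
ii) a polynucleotide encoding Tle2, wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) (SEQ ID NO:19) and Tle2 subunit B (EC 2.8.3.9) (SEQ ID NO:20), or functional variants thereof having at least 70% homology, similarity or identity thereto, and
iii) a polynucleotide encoding Cac (SEQ ID NO:28) or a functional variant thereof having at least 70% homology, similarity or identity thereto.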
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码以下的多核苷酸:GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ IDNO:4)、Tfu_1520(SEQ ID NO:5)或Tfu_0436(SEQ ID NO:6),或与其具有至少70%同源性、相似性或同一性的其功能性变体;优选地乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQ IDNO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Dde2(EC 2.8.3.5)(SEQ ID NO:21)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
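In certain embodiments, the nucleic acid construct comprises or consists of:
i) a polynucleotide encoding GHH_c20420 (SEQ ID NO:1), Slip_0499 (SEQ ID NO:2), Caur_1461 (SEQ ID NO:3), Dde1 (SEQ ID NO:59), Rxy2 (SEQ ID NO:60), Slip_0479 (SEQ ID NO:4), Tfu_1520 (SEQ ID NO:5) or Tfu_0436 (SEQ ID NO:6), or a functional variant thereof having at least 70% homology, similarity or identity thereto; preferably the acetyl-CoA acetyltransferase is selected from Caur_1461 (SEQ ID NO:3), Dde1 (SEQ ID NO:59) and Slip_0880 (SEQ ID NO:7);
ii) a polynucleotide encoding Ghh2 (EC 2.8.3.5), wherein Ghh2 consists of Ghh2 subunit A (SEQ ID NO:22) and Ghh2 subunit B (SEQ ID NO:23), or functional variants thereof having at least 70% homology, similarity or identity thereto, and
iii) a polynucleotide encoding Cac (SEQ ID NO:28) or a functional variant thereof having at least 70% homology, similarity or identity thereto.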
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码以下的多核苷酸:GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ IDNO:4)、Tfu_1520(SEQ ID NO:5)或Tfu_0436(SEQ ID NO:6),或与其具有至少70%同源性、相似性或同一性的其功能性变体;优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Tme(EC 2.8.3.8)的多核苷酸,其中Tme由Tme亚基A(SEQ ID NO:24)和Tme亚基B(SEQ ID NO:25),或与其具有至少70%同源性、相似性或同一性的其功能性变体组成,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码以下的多核苷酸:GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ IDNO:4)、Tfu_1520(SEQ ID NO:5)或Tfu_0436(SEQ ID NO:6),或与其具有至少70%同源性、相似性或同一性的其功能性变体;优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Pth(EC 2.8.3.1)(SEQ ID NO:26),或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)编码以下的多核苷酸:GHH_c20420(SEQ ID NO:1)、Slip_0499(SEQ ID NO:2)、Caur_1461(SEQ ID NO:3)、Dde1(SEQ ID NO:59)、Rxy2(SEQ ID NO:60)、Slip_0479(SEQ IDNO:4)、Tfu_1520(SEQ ID NO:5)或Tfu_0436(SEQ ID NO:6),或与其具有至少70%同源性、相似性或同一性的其功能性变体;优选地所述乙酰辅酶A乙酰基转移酶选自Caur_1461(SEQID NO:3)、Dde1(SEQ ID NO:59)和Slip_0880(SEQ ID NO:7);
ii)编码Rma(EC 3.1.2.-)(SEQ ID NO:27)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸,和
iii)编码Cac(SEQ ID NO:28)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
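In certain embodiments, the nucleic acid construct comprises or consists of:
i) SEQ ID NO:61, SEQ ID NO:50 and SEQ ID NO:57; or
ii) SEQ ID NO:32, SEQ ID NO:48 and SEQ ID NO:49, and SEQ ID NO:57; or
iii) SEQ ID NO:36, SEQ ID NO:48 and SEQ ID NO:49, and SEQ ID NO:57;
and optionally SEQ ID NO:58,
or homologs thereof having at least 70% identity or similarity thereto.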
在某些实施方案中,所述核酸构建体包含以下或由以下组成:
i)SEQ ID NO:32、SEQ ID NO:48和SEQ ID NO:49,和SEQ ID NO:57;或
ii)SEQ ID NO:30、SEQ ID NO:48和SEQ ID NO:49,和SEQ ID NO:57;或
iii)SEQ ID NO:31、SEQ ID NO:48和SEQ ID NO:49,和SEQ ID NO:57;或
iv)SEQ ID NO:33、SEQ ID NO:48和SEQ ID NO:49,和SEQ ID NO:57;
和任选的SEQ ID NO:58,
或与其具有至少70%同一性或相似性的其同源物。
上述核酸构建体中的任一种可以进一步包含编码异丙醇脱氢酶,优选Tbr(SEQ IDNO:29)或与其具有至少70%同源性、相似性或同一性的其功能性变体的多核苷酸。
与核酸序列相关的术语“至少70%同源性、相似性或同一性”在本文中应理解为表示与给定的核酸序列具有至少70%同源性、相似性或同一性,具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性的同源物(homologues),其仍然编码保留由所述给定核酸序列编码的酶的活性的酶。上文已经描述了如何测试相关活性。
与蛋白或酶相关的术语“至少70%同源性、相似性或同一性”在本文中应理解为表示与给定的蛋白或酶具有至少70%同源性、相似性或同一性,具有诸如至少71%、诸如至少72%、诸如至少73%、诸如至少74%、诸如至少75%、诸如至少76%、诸如至少77%、诸如至少78%、诸如至少79%、诸如至少80%、诸如至少81%、诸如至少82%、诸如至少83%、诸如至少84%、诸如至少85%、诸如至少86%、诸如至少87%、诸如至少88%、诸如至少89%、诸如至少90%、诸如至少91%、诸如至少92%、诸如至少93%、诸如至少94%、诸如至少95%、诸如至少96%、诸如至少97%、诸如至少98%、诸如至少99%同源性、相似性或同一性的同源物,其优选地保留原始蛋白或酶的至少一些活性。上文已经描述了如何测试相关活性。
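The "at least 70% identity" criterion above can be illustrated with a minimal sketch. The text does not prescribe an alignment algorithm or an identity convention, so everything here is an assumption: a plain Needleman-Wunsch global alignment with unit scores (match +1, mismatch -1, gap -1), identity counted as identical columns divided by alignment length, and toy sequences rather than any SEQ ID NO. Dedicated alignment tools with substitution matrices would normally be used in practice, and identity alone does not establish that a variant retains activity.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns both sequences with '-' gaps."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of the alignment length."""
    if not a or not b:
        raise ValueError("both sequences must be non-empty")
    al_a, al_b = global_align(a, b)
    identical = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * identical / len(al_a)


def meets_identity_threshold(a, b, threshold=70.0):
    """True if the pair reaches the identity threshold (70% by default)."""
    return percent_identity(a, b) >= threshold
```

With these conventions, two toy 10-residue sequences that differ at two positions score 80% and pass the 70% threshold, while a pair differing at four positions scores 60% and does not.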
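As is known in the art, all nucleic acid sequences may be codon-optimized for expression in the microorganism.
It may be of interest to use inducible promoters. Thus, in certain embodiments, the nucleic acid construct comprises one or more of the above nucleic acid sequences under the control of an inducible promoter. This allows better control of when the enzymes encoded by the sequences are actually expressed and may be advantageous, for example, when production of one of the volatile compounds adversely affects cell growth. The skilled person will have no difficulty identifying suitable inducible promoters. In other embodiments, the nucleic acid construct is under the control of a constitutive promoter. Such a constitutive promoter may be a strong promoter.
In certain embodiments, the nucleic acid construct is one or more vectors, for example integrative or replicative vectors, such as a plurality of vectors which together form the nucleic acid construct. Suitable vectors are known in the art and readily available to the skilled person. Accordingly, also provided herein is a vector comprising any of the above nucleic acid constructs.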
上述核酸构建体可用于修饰嗜热细胞,特别是选自以下的属的细胞:地芽孢杆菌属、高温厌氧杆菌属、热厌氧杆菌属、嗜热厌氧菌属、芽孢杆菌属、热梭菌属、无氧芽孢杆菌属、热解纤维素菌属、穆尔氏菌属、栖热菌属、栖热袍菌属、假栖热袍菌属、绿屈挠菌属、厌氧解纤维素菌属、红嗜热菌属、硫化叶菌属、热球菌属、火球菌属和梭菌属。在某些实施方案中,所述嗜热细胞选自热葡糖苷酶地芽孢杆菌、就地堆肥地芽胞杆菌、嗜热脂肪地芽孢杆菌、热反硝化地芽孢杆菌、嗜热地芽孢杆菌、喜热噬油地芽孢杆菌、热小链地芽孢杆菌、解木聚糖高温厌氧杆菌、解糖高温厌氧杆菌、热解糖高温厌氧杆菌、马瑞氏热厌氧杆菌、假乙醇热厌氧杆菌、布氏热厌氧杆菌、凯伍热厌氧杆菌、布氏热厌氧杆菌、地下嗜热厌氧菌、热纤梭菌、琥珀酸嗜热梭菌、嗜粪热梭菌、枯草芽孢杆菌、地衣芽孢杆菌、凝结芽孢杆菌、史氏芽孢杆菌、甲醇芽孢杆菌、黄热芽孢杆菌、堪察加无氧芽孢杆菌、冈尼西氏厌氧杆菌、热解纤维素菌、解糖热解纤维素菌、克里斯托热解纤维素菌、欧文湖热解纤维素菌、产乳酸乙酸热解纤维素菌、热醋穆尔氏菌、热自养穆尔氏菌、嗜热栖热菌、水生栖热菌、海栖热袍菌、Pseudothermotoga lettingae、温泉假栖热袍菌、橙色绿屈挠菌、嗜热厌氧解纤维素菌、海洋红嗜热菌、酸热硫化叶菌、冰岛硫化叶菌、硫矿硫化叶菌、极端嗜热嗜压古菌、海洋异养古细菌、深海火球菌、激烈火球菌,优选地所述细胞是热葡糖苷酶地芽孢杆菌细胞、枯草芽孢杆菌细胞或热纤梭菌细胞。
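Also provided herein is a thermophilic cell comprising the nucleic acid construct described above.
Also provided herein is a vector or vector system comprising the nucleic acid construct described above.
Also provided herein is a host cell comprising the nucleic acid construct or vector described above. The host cell may be a prokaryote or a eukaryote. In preferred embodiments, the cell is a prokaryote, such as a bacterial cell, for example Escherichia coli. The host cell may be a thermophilic cell capable of producing acetone, butanone and/or isopropanol as described herein.
Kit of parts
Also provided herein is a kit comprising the nucleic acid construct, vector or thermophilic cell described above, and optionally instructions for use.
In certain embodiments, the kit comprises the nucleic acid construct and/or vector described herein, and may further comprise the thermophilic cell to be modified. The thermophilic cell may be any of the cells described above. The kit may further comprise reagents useful for modifying yeast cells.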
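Examples
Example 1 - Materials and methods
Strains, plasmids and media
The bacterial strains and plasmids used in this study are listed in Table 1.
Table 1. Strains and plasmids used in this study.
Escherichia coli cells were grown in lysogeny broth (LB), supplemented with 100 μg/mL ampicillin or 6.25 μg/mL kanamycin when required. Geobacillus strains were grown in any of several media.
mTGP medium (modified from Taylor et al., 2008) contained per liter: 17 g tryptone, 3 g soy peptone, 5 g NaCl, 2.5 g K2HPO4. After autoclaving, sterile solutions were added to final concentrations of 4 mL/L glycerol, 4 g/L sodium pyruvate, 0.59 mM MgSO4, 0.91 mM CaCl2 and 0.04 mM FeSO4. Trypticase soy agar (TSA) contained per liter: 15 g pancreatic digest of casein, 5 g papaic digest of soybean, 5 g NaCl, 15 g agar. SPY medium consisted of 16 g/L soy peptone, 10 g/L yeast extract and 5 g/L NaCl; its pH was adjusted to 7.0 by adding 5 M NaOH. Where indicated, glycerol was added to a final concentration of 10 g/L.
Thermophile minimal medium (TMM) was adapted from Fong et al., 2006, with some modifications. It contained (per liter): six salts solution (SSS), 930 mL; 1 M MOPS (pH 8.2), 40 mL; 1 mM FeSO4 in 0.4 M tricine, 10 mL; 0.132 M K2HPO4, 10 mL; 0.953 M NH4Cl, 10 mL; 1 M CaCl2, 0.5 mL; trace element solution, 0.5 mL; Wolfe's vitamin solution, 10 mL. SSS contained per 930 mL: 4.6 g NaCl, 1.35 g Na2SO4, 0.23 g KCl, 0.037 g KBr, 1.72 g MgCl2·6H2O, 0.83 g NaNO3. The trace element solution contained per liter: 1 g FeCl3·6H2O, 0.18 g ZnSO4·7H2O, 0.12 g CuCl2·2H2O, 0.12 g MnSO4·H2O, 0.18 g CoCl2·6H2O. Where indicated, yeast extract was added to a final concentration of 0.05% (w/v). For selection of Geobacillus spp., 12.5 μg/mL kanamycin was used.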
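The TMM recipe is given as stock solutions and the volumes added per liter, so the final concentrations follow from simple dilution arithmetic. The sketch below works them out for the stocks whose molarity is stated. It assumes a final volume of exactly 1000 mL (the listed volumes add up to slightly more than that), and the dictionary and function names are illustrative only.

```python
# (stock concentration in mM, volume added per liter in mL), from the TMM recipe.
TMM_STOCKS = {
    "MOPS": (1000.0, 40.0),
    "FeSO4": (1.0, 10.0),
    "tricine": (400.0, 10.0),
    "K2HPO4": (132.0, 10.0),
    "NH4Cl": (953.0, 10.0),
    "CaCl2": (1000.0, 0.5),
}


def final_concentration_mM(stock_mM, volume_mL, total_mL=1000.0):
    """Dilution rule C1*V1 = C2*V2, solved for the final concentration C2."""
    return stock_mM * volume_mL / total_mL


# Final concentration of each component in the finished medium (assumed 1 L).
FINAL_MM = {name: final_concentration_mM(c, v) for name, (c, v) in TMM_STOCKS.items()}

if __name__ == "__main__":
    for name, conc in FINAL_MM.items():
        print(f"{name}: {conc:g} mM")
```

Under the 1 L assumption this gives 40 mM MOPS, 1.32 mM K2HPO4, 9.53 mM NH4Cl, 0.5 mM CaCl2, 4 mM tricine and 0.01 mM FeSO4.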
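DNA manipulation
Genomic DNA was extracted using a genomic DNA purification kit (Promega) according to the manufacturer's instructions. Plasmids were extracted using the Plasmid EasyPure kit (Macherey-Nagel).
PCR and cloning
The primers used in this study are described in Table 2.
Table 2. Oligonucleotides used in this study
PCR of DNA fragments for USER cloning was performed with uracil-containing primers using Phusion U Hot Start DNA Polymerase (Thermo Fisher Scientific). Colony PCR to detect positive colonies was performed using Taq 2x Master Mix (New England Biolabs). Reactions were carried out according to the manufacturers' recommendations, with extension times and annealing temperatures adjusted for the specific targets and primers. DNA cloning was performed using the USER (uracil-specific excision reagent) technique (Cavaleiro et al., 2015).
PCR-amplified DNA fragments containing primer-incorporated uracils (near both of their 5' ends) were mixed (purification after PCR was not necessary) and treated with DpnI enzyme (Thermo Fisher Scientific) at 37 °C for 30 minutes to digest the template DNA. USER™ enzyme (New England Biolabs) was then added, and the mixture was incubated in three steps: 1) 15 min at 37 °C; 2) 15 min at 12 °C; 3) 10 min at 10 °C. The mixture was then transferred to ice and mixed with chemically competent Escherichia coli cells.
Transformation of Escherichia coli
Chemically competent Escherichia coli NEB5-α cells (New England Biolabs) were transformed according to the manufacturer's recommendations.
Transformation of Geobacillus thermoglucosidasius
The procedure was based on the protocol described by Taylor et al., 2008, with some steps adjusted. Geobacillus thermoglucosidasius was grown overnight at 60 °C on TSA agar plates. A loopful of cells was inoculated into 50 mL of pre-warmed liquid SPY medium in a 250 mL flask and incubated at 60 °C and 250 rpm until the culture reached an OD600 of about 2.0. The cells were cooled on ice for 10 min and harvested by centrifugation at 2600 g for 10 minutes. They were washed four times (2600 g, 10 min) with freshly prepared ice-cold electroporation buffer. The buffer contained (per 100 mL) 9.1 g mannitol, 9.1 g sorbitol and 10 mL glycerol, and was sterilized by filtration. Buffer was added in volumes of 25 mL, 15 mL, 15 mL and 10 mL for the successive wash steps. After the last wash step, the cell pellet was resuspended in 2 mL of electroporation buffer, divided into 60 μL aliquots and stored at -80 °C until further use.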
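The electroporation buffer is specified by mass per 100 mL. As a quick check, the following sketch converts the recipe to molar and percentage terms. The only value not taken from the text is the molar mass of mannitol and sorbitol (both C6H14O6, about 182.17 g/mol); the function names are illustrative.

```python
MOLAR_MASS_C6H14O6 = 182.17  # g/mol; mannitol and sorbitol are isomers


def molarity(grams, volume_mL, molar_mass):
    """Moles per liter for a solute mass dissolved to the given volume."""
    return grams / molar_mass / (volume_mL / 1000.0)


def percent_v_v(solute_mL, total_mL):
    """Volume percent of a liquid component."""
    return 100.0 * solute_mL / total_mL


mannitol_M = molarity(9.1, 100.0, MOLAR_MASS_C6H14O6)
sorbitol_M = molarity(9.1, 100.0, MOLAR_MASS_C6H14O6)
glycerol_pct = percent_v_v(10.0, 100.0)
```

The recipe therefore corresponds to roughly 0.5 M mannitol, 0.5 M sorbitol and 10% (v/v) glycerol.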
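For transformation, an aliquot was thawed on ice and mixed with DNA. It was transferred into an electroporation cuvette with a 1 mm gap between the electrodes (Bio-Rad) and pulsed using a Gene Pulser Xcell™ (Bio-Rad) under the following conditions: 2.5 kV, 600 Ω, 10 μF. The time constant was typically 4 ms to 5 ms. Immediately after electroporation, the cells were resuspended in 1 mL of pre-warmed SPY medium supplemented with glycerol and recovered at 52 °C and 200 rpm for 4 hours. They were then centrifuged and plated on selective agar medium.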
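The pulse settings can be related to the reported time constant with a little arithmetic. The sketch below assumes an idealized exponential-decay pulse in which the 600 Ω setting is a resistor in parallel with the sample, so that τ = R·C gives an upper bound; this circuit model and the function names are assumptions and are not stated in the text.

```python
def time_constant_ms(resistance_ohm, capacitance_uF):
    """RC time constant of an exponential-decay pulse, in milliseconds."""
    return resistance_ohm * capacitance_uF * 1e-6 * 1e3


def field_strength_kV_per_cm(voltage_kV, gap_mm):
    """Nominal field strength across the cuvette gap."""
    return voltage_kV / (gap_mm / 10.0)


def implied_sample_resistance_ohm(tau_ms, parallel_ohm, capacitance_uF):
    """Sample resistance that, in parallel with the set resistor, yields tau."""
    r_effective = (tau_ms * 1e-3) / (capacitance_uF * 1e-6)
    return 1.0 / (1.0 / r_effective - 1.0 / parallel_ohm)


upper_bound_ms = time_constant_ms(600.0, 10.0)    # 600 ohm x 10 uF
field = field_strength_kV_per_cm(2.5, 1.0)        # 2.5 kV over a 1 mm gap
sample_ohm = implied_sample_resistance_ohm(4.5, 600.0, 10.0)
```

Under these assumptions the settings give a nominal 25 kV/cm and a maximum time constant of 6 ms; an observed 4 ms to 5 ms is consistent with a sample resistance on the order of a few kΩ (about 1.8 kΩ for 4.5 ms).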
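DNA design and analysis
Codon optimization was done using the online service of Integrated DNA Technologies, Inc. (https://eu.idtdna.com/CodonOpt) or Gene Designer from DNA 2.0 (Villalobos et al., 2006). DNA sequencing was performed by Eurofins Scientific (Luxembourg).
GC-MS analysis
Strains expressing the acetone pathway were grown in 2 mL of TMM with the corresponding supplements in sterile 20 mL headspace vials. To prevent loss of acetone, the vials were kept closed for the entire duration of culture growth until sampling for chromatography. After 20 hours of incubation, the cultures were frozen at -20 °C to stop growth and metabolic activity and transferred for measurement.
Acetone concentrations were measured on an analytical GC-MS (Bruker Scion 436 GC TQ) using a BP20 capillary column (30 m, inner diameter 0.25 mm, film thickness 0.25 μm). Helium was used as the carrier gas. The inlet temperature was set to 250 °C and the oven temperature was programmed as follows: 37 °C for 5 minutes, then a ramp of 5 °C/min to 100 °C, followed by a ramp of 15 °C/min to 250 °C, with a final hold of 3 minutes. Mass spectrometry was performed using electron ionization in full-scan mode with a scan range of 35 amu to 400 amu. The injection volume was 1 μL to 5 μL in splitless mode for low acetone concentrations; split mode (1:1) was used for high concentrations.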
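The oven program implies a fixed run time per sample, which is useful when planning a measurement series. The sketch below adds up the hold and ramp segments as described; the function name and the (rate, target) representation of ramps are illustrative choices, and instrument equilibration and cool-down times are not included.

```python
def oven_program_minutes(start_temp_c, initial_hold_min, ramps, final_hold_min):
    """Total oven time; ramps is a list of (rate in C/min, target temperature in C)."""
    total = initial_hold_min
    temp = start_temp_c
    for rate_c_per_min, target_c in ramps:
        total += (target_c - temp) / rate_c_per_min
        temp = target_c
    return total + final_hold_min


# 37 C for 5 min, 5 C/min to 100 C, 15 C/min to 250 C, 3 min final hold.
RUN_TIME_MIN = oven_program_minutes(37.0, 5.0, [(5.0, 100.0), (15.0, 250.0)], 3.0)
```

The segments are 5 min + 12.6 min + 10 min + 3 min, that is 30.6 minutes per injection.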
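Example 2 - Acetone production in Geobacillus thermoglucosidasius
To produce acetone in Geobacillus thermoglucosidasius, we first tested the tolerance of the strain to acetone. When grown in tightly sealed vessels with a medium-to-headspace volume ratio of 1:10, the strain was found to tolerate at least 25 g/L acetone (Fig. 1). We therefore next attempted to express a functional acetone pathway in Geobacillus. The acetone operons were introduced using the recently developed vector backbone pMTL61110 (Sheng et al., 2016).
Combinations of four thiolase variants and three oxoacid CoA-transferase variants were constructed, together with the acetoacetate decarboxylase from a Clostridium acetobutylicum strain, as operons under the control of the thiolase promoter from Clostridium acetobutylicum. Geobacillus thermoglucosidasius strains carrying these operons were grown in: i) semi-defined medium containing 1% glucose, and ii) rich medium supplemented with 0.2% glucose. Of all combinations, Dde1-Dde2-Cac gave the highest titer in semi-defined medium, whereas Cau-Tle2-Cac performed best in rich medium. The results are shown in Table 3.
Table 3. Acetone titers (mg/L) of Geobacillus thermoglucosidasius carrying different combinations of thiolase variants (rows) and oxoacid CoA-transferase variants (columns). In each combination, the two enzyme genes were expressed from a plasmid in one operon together with the acetoacetate decarboxylase from Clostridium acetobutylicum (UniProt ID P23670), under the control of the promoter of the thiolase gene (AE001437:3007142..3007364). Strains were cultivated in minimal medium (upper value in each table cell) or rich medium (lower value).
The CTC strain was also tested in a 30 L fed-batch fermentation with a feed of 2 g/L/h glucose, 1 g/L/h acetic acid and 1 g/L/h yeast extract. The CTC strain was grown in TMM medium supplemented with 2% glucose, 0.2% acetic acid and 1% yeast extract. The CTC genes were integrated into the genome under the control of the strong promoter P3 (Pogrebnyakov et al., 2017). Up to 2.9 g/L acetone was achieved (Fig. 8).
The STC strain (Slip_0880-Tle2-Cac) and the CTC strain (Caur_1461-Tle2-Cac) were also tested in 1 L constant fed-batch fermentations. The strains were grown in TMM medium supplemented with 2% glucose, 0.2% acetic acid and 1% yeast extract. The CTC and STC genes were integrated into the genome under the control of the strong promoter P3. The STC strain reached a final acetone titer of 1.6 g/L, whereas the CTC strain reached 1.1 g/L (Fig. 10). The STC strain reached the higher titer even though its growth rate and maximum cell density were lower than those of the CTC strain (1.5 h⁻¹ and an OD600 of 6.9 for STC, versus 2.1 h⁻¹ and an OD600 of 16.5 for CTC). The STC strain also consumed more glucose and acetate than the CTC strain, indicating that it is even more efficient at converting substrate into the desired product (acetone) rather than into cell biomass.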
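The comparison between the STC and CTC strains can be made more concrete by normalizing titer to cell density and converting growth rates to doubling times. The sketch below uses only the figures reported above; the per-OD normalization is an illustrative metric chosen here, not one used in the text, and the variable names are assumptions.

```python
import math

# Reported values: final acetone titer (g/L), maximum OD600, growth rate (1/h).
STRAINS = {
    "STC": {"titer": 1.6, "od600": 6.9, "mu": 1.5},
    "CTC": {"titer": 1.1, "od600": 16.5, "mu": 2.1},
}


def titer_per_od(strain):
    """Acetone titer normalized to maximum cell density (g/L per OD600 unit)."""
    return strain["titer"] / strain["od600"]


def doubling_time_min(mu_per_h):
    """Doubling time in minutes from a specific growth rate, t_d = ln(2)/mu."""
    return 60.0 * math.log(2.0) / mu_per_h


stc_vs_ctc = titer_per_od(STRAINS["STC"]) / titer_per_od(STRAINS["CTC"])
stc_td = doubling_time_min(STRAINS["STC"]["mu"])
ctc_td = doubling_time_min(STRAINS["CTC"]["mu"])
```

On this basis the STC strain produces about 3.5 times more acetone per unit of OD600 than the CTC strain, with doubling times of roughly 28 min (STC) and 20 min (CTC).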
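Example 3 - Effect of promoter strength
The two best-performing enzyme combinations, Dde1-Dde2-Cac and Cau-Tle2-Cac, where Cau is a codon-optimized version of Caur_1461, were overexpressed in Geobacillus thermoglucosidasius. Each operon was integrated into the chromosome of Geobacillus thermoglucosidasius, replacing a putative acetone carboxylase (AOT13_RS09545, AOT13_RS09550 and AOT13_RS09555) in the case of Dde1-Dde2-Cac, or the lactate dehydrogenase (AOT13_RS05985) in the case of Cau-Tle2-Cac. A series of constitutive promoters with activity levels ranging from low to high (Pogrebnyakov et al., 2017) was integrated upstream of these operons to drive their expression, yielding Geobacillus thermoglucosidasius strains G31-G38 and DDC. Acetone titers from Dde1-Dde2-Cac increased with increasing promoter strength (Table 3).
Overexpression of the Cau-Tle2-Cac operon in Geobacillus thermoglucosidasius strain CTC also led to an increase in acetone titer. Addition of sodium acetate, and in particular of acetic acid, further increased the acetone titer up to twofold (Fig. 4, Table 4). In rich SPY medium supplemented with 0.2% acetic acid, strain CTC produced 1.61 g/L acetone.
Table 4. Acetone production by Geobacillus thermoglucosidasius strain CTC in semi-defined medium containing 1% glucose and different concentrations of acetate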
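Example 4 - Effect of sugar composition
The sugar composition of the substrate also affects acetone production by Geobacillus thermoglucosidasius. This species can utilize many pentoses and hexoses, in particular glucose and xylose. Geobacillus thermoglucosidasius CTC was grown in the presence of mixtures of these monosaccharides and converted them to acetone in high yield (Fig. 5).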
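Example 5 - Production of acetone and butanone in Geobacillus thermoglucosidasius
Multiple variants of thiolases and acyl-CoA:acyl-CoA alkyltransferases from thermophiles were screened for production of butanone and acetone. They were expressed under the control of the thiolase promoter from Clostridium acetobutylicum in a Geobacillus thermoglucosidasius strain carrying the Tle2 and Cac genes in the chromosome under the control of the previously created medium-strength promoter P7 (Pogrebnyakov et al., 2017). The resulting strains were named G51-G82. They were grown in semi-defined TMM medium supplemented with 1% glucose and 0.2% propionic acid. Under these conditions, most strains produced mixtures of butanone and acetone in different ratios and titers (Table 5). The thiolase variants giving the highest butanone production were Caur_1461, GHH_c20420, Slip_0499 and Slip_0479. Expression of variant Slip_0880 resulted in the highest acetone titer and a relatively low amount of butanone. The results are shown in Fig. 2 and Fig. 9; the data in Fig. 9 show the same experimental results as Fig. 2, combined with additional independent measurements made under the same conditions.
Table 5. Production of butanone and acetone in Geobacillus thermoglucosidasius expressing the indicated thiolase and additionally expressing the acetyl-CoA transferase Tle2 from Pseudothermotoga lettingae (UniProt ID A8F7H7, A8F7H6) and the acetoacetate decarboxylase Cac from Clostridium acetobutylicum (P23670)
Geobacillus thermoglucosidasius strain CTC from the previous examples overexpresses the Cau-Tle2-Cac operon, where Cau is a codon-optimized version of Caur_1461, one of the best butanone producers in this example. Geobacillus thermoglucosidasius CTC was grown in TMM supplemented with 1% glucose and 0.1% to 0.3% propionic acid and produced up to 0.43 g/L butanone (Fig. 6, Table 6).
Table 6. Butanone production by Geobacillus thermoglucosidasius strain CTC in semi-defined medium containing 1% glucose and different concentrations of propionic acid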
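Example 6 - Isopropanol production in Geobacillus thermoglucosidasius
Conversion of acetone to isopropanol requires one enzymatic step involving an alcohol dehydrogenase. A specific isopropanol dehydrogenase from Thermoanaerobacter brockii has been identified previously (Hanai et al., 2007). A codon-optimized version of this gene was integrated into the genome of Geobacillus thermoglucosidasius CTC downstream of the Cac gene, yielding strain CTCI. Geobacillus thermoglucosidasius CTCI grown in TMM supplemented with 1% glucose produced 0.11 g/L isopropanol.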
References
"Acetone market: global industry analysis and opportunity assessment, 2014-2020," 2015.
Z.C. Baer, S. Bormann, S. Sreekumar, A. Grippo, F.D. Toste, H.W. Blanch, and D.S. Clark, "Co-production of acetone and ethanol with molar ratio control enables production of improved gasoline or jet fuel blends," Biotechnol Bioeng, vol. 9999, no. 10, pp. 1-9, 2016.
A. Banerjee, C. Leang, T. Ueki, K.P. Nevin, and D.R. Lovley, "Lactose-inducible system for metabolic engineering of Clostridium ljungdahlii," Appl Env. Microb, vol. 80, no. 8, pp. 2410-2416, 2014.
L.L. Bermejo, N.E. Welker, and E.T. Papoutsakis, "Expression of Clostridium acetobutylicum ATCC 824 genes in Escherichia coli for acetone production and acetate detoxification," Appl Env. Microbiol, vol. 64, no. 3, pp. 1079-1085, 1998.
I.W. Bogorad, T. Lin, and J.C. Liao, "Synthetic non-oxidative glycolysis enables complete carbon conservation," Nature, vol. 502, no. 7473, pp. 693-697, 2013.
E.F. Bosma, J. Van Der Oost, W.M. De Vos, and R. Van Kranenburg, "Sustainable production of bio-based chemicals by extremophiles," Curr Biotechnol, vol. 2, pp. 360-379, 2013.
A.M. Cavaleiro, S.H. Kim, S.M.T. Nielsen, and M.H.H., "Accurate DNA Assembly and Genome Engineering with Optimized Uracil Excision Cloning," ACS Synth. Biol., vol. 4, no. 9, pp. 1042-1046, 2015.
R.E. Cripps, K. Eley, D.J. Leak, B. Rudd, M. Taylor, M. Todd, S. Boakes, S. Martin, and T. Atkinson, "Metabolic engineering of Geobacillus thermoglucosidasius for high yield ethanol production," Metab. Eng., vol. 11, no. 6, pp. 398-408, 2009.
J.C.N. Fong, C.J. Svenson, K. Nakasugi, C.T.C. Leong, J.P. Bowman, B. Chen, D.R. Glenn, B.A. Neilan, and P.L. Rogers, "Isolation and characterization of two novel ethanol-tolerant facultative-anaerobic thermophilic bacteria strains from waste compost," Extremophiles, vol. 10, no. 5, pp. 363-372, 2006.
T. Hanai, S. Atsumi, and J.C. Liao, "Engineered synthetic pathway for isopropanol production in Escherichia coli," Appl. Environ. Microbiol., vol. 73, no. 24, pp. 7814-7818, 2007.
M.C. Ho, J.F. Ménétret, H. Tsuruta, and K.N. Allen, "The origin of the electrostatic perturbation in acetoacetate decarboxylase," Nature, vol. 459, pp. 393-397, 2009.
S. Hoffmeister, M. Gerdom, F.R. Bengelsdorf, S. Linder, S. Flüchter, H. Öztürk, W. Blümke, A. May, R. Fischer, H. Bahl, and P. Dürre, "Acetone production with metabolically engineered strains of Acetobacterium woodii," Metab Eng, vol. 36, pp. 37-47, 2016.
D.T. Jones and D.R. Woods, "Acetone-butanol fermentation revisited," Microbiol Rev, vol. 50, no. 4, pp. 484-524, 1986.
M. Mathieu et al., "The structure of the dimeric peroxisomal 3-ketoacyl-CoA thiolase of Saccharomyces cerevisiae: Implications for substrate binding and reaction mechanism," J. Mol. Biol., vol. 273, no. 3, pp. 714-728, 1997.
A. May, R.-J. Fischer, M.S. Thum, S. Schaffer, S. Verseck, P. Durre, and H. Bahl, "A modified pathway for the production of acetone in Escherichia coli," Metab Eng, vol. 15, pp. 218-225, 2013.
I. Pogrebnyakov, C.B. Jendresen, and A.T. Nielsen, "Genetic toolbox for controlled expression of functional proteins in Geobacillus spp.," PLoS One, vol. 12, no. 2, pp. 1-15, 2017.
L. Sheng and N.P. Minton, "Development of the shuttle vector pMTL for transformation in Geobacillus spp. and Escherichia coli," in press, 2016.
M.P. Taylor, C.D. Esteban, and D.J. Leak, "Development of a versatile shuttle vector for gene expression in Geobacillus spp.," Plasmid, vol. 60, no. 1, pp. 45-52, 2008.
A. Villalobos, J.E. Ness, C. Gustafsson, J. Minshull, and S. Govindarajan, "Gene Designer: A synthetic biology tool for constructing artificial DNA segments," BMC Bioinformatics, vol. 7, pp. 285-292, 2006.
X. Yang, Q. Yuan, Y. Zheng, H. Ma, T. Chen, and X. Zhao, "An engineered non-oxidative glycolysis pathway for acetone production in Escherichia coli," Biotechnol Lett, vol. 38, no. 8, pp. 1359-1365, 2016.
J. Zhou, H. Zhang, Y. Zhang, Y. Li, and Y. Ma, "Designing and creating a modularized synthetic pathway in cyanobacterium Synechocystis enables production of acetone from carbon dioxide," Metab Eng, vol. 14, no. 4, pp. 394-400, 2012.
J. Zhou, K. Wu, and C.V. Rao, "Evolutionary engineering of Geobacillus thermoglucosidasius for improved ethanol production," Biotechnol Bioeng, vol. 9999, pp. 1-12, 2016.
Sequence overview
Items
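1. A method for producing one or more compounds selected from acetone, butanone and isopropanol, said method comprising the steps of:
a) providing a thermophilic cell, preferably a thermophilic bacterial or thermophilic archaeal cell, expressing:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9) selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Tfu_1520 as set forth in SEQ ID NO: 5, Tfu_0436 as set forth in SEQ ID NO: 6, Slip_0880 as set forth in SEQ ID NO: 7, Tfu_2394 as set forth in SEQ ID NO: 8, Slip_1236 as set forth in SEQ ID NO: 9, Caur_1540 as set forth in SEQ ID NO: 10, Tfu_0253 as set forth in SEQ ID NO: 11, CHY_1604 as set forth in SEQ ID NO: 14, CHY_1288 as set forth in SEQ ID NO: 15, Slip_2085 as set forth in SEQ ID NO: 16, Slip_0465 as set forth in SEQ ID NO: 17, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 and CHY_1355 as set forth in SEQ ID NO: 18,
an enzyme of EC number 2.3.3.20 selected from the 3-oxoacyl-ACP synthase SVA_3859 as set forth in SEQ ID NO: 12 and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 as set forth in SEQ ID NO: 13;
and
functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from an acetate CoA-transferase, a 3-oxoacid CoA-transferase, an acyl-CoA:acetate/3-ketoacid CoA-transferase and an acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) as set forth in SEQ ID NO: 21, Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) as set forth in SEQ ID NO: 26 and Rma (EC 3.1.2.-) as set forth in SEQ ID NO: 27, or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, wherein Ghh2 consists of Ghh2 subunit A as set forth in SEQ ID NO: 22 and Ghh2 subunit B as set forth in SEQ ID NO: 23, and wherein Tme consists of Tme subunit A as set forth in SEQ ID NO: 24 and Tme subunit B as set forth in SEQ ID NO: 25, or functional variants thereof having at least 70% homology, similarity or identity thereto;
and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac as set forth in SEQ ID NO: 28, or a functional variant thereof having at least 70% homology, similarity or identity thereto;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr as set forth in SEQ ID NO: 29, or a functional variant thereof having at least 70% homology, similarity or identity thereto;
b) culturing said bacterial cell in a bioreactor comprising a culture broth at a temperature between 42°C and 80°C, such as between 50°C and 75°C, for example at 60°C, thereby producing said one or more compounds;
c) recovering said one or more compounds produced in step b).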
2. The method according to item 1, wherein said thermophilic cell has an optimal growth temperature between 42°C and 80°C, such as between 50°C and 75°C, for example 60°C.
3. The method according to any one of the preceding items, wherein said thermophilic cell belongs to a genus selected from: Geobacillus, Thermoanaerobacterium, Thermoanaerobacter, Caldanaerobacter, Bacillus, Thermoclostridium, Anoxybacillus, Caldicellulosiruptor, Moorella, Thermus, Thermotoga, Pseudothermotoga, Chloroflexus, Anaerocellum, Rhodothermus, Sulfolobus, Thermococcus, Pyrococcus and Clostridium.
4. The method according to any one of the preceding items, wherein said thermophilic cell belongs to a species selected from: Geobacillus thermoglucosidasius, Geobacillus toebii, Geobacillus stearothermophilus, Geobacillus thermodenitrificans, Geobacillus kaustophilus, Geobacillus thermoleovorans, Geobacillus thermocatenulatus, Thermoanaerobacterium xylanolyticum, Thermoanaerobacterium saccharolyticum, Thermoanaerobacterium thermosaccharolyticum, Thermoanaerobacter mathranii, Thermoanaerobacter pseudoethanolicus, Thermoanaerobacter brockii, Thermoanaerobacter kivui, Caldanaerobacter subterraneus, Clostridium thermocellum, Clostridium thermosuccinogenes, Thermoclostridium stercorarium, Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus smithii, Bacillus methanolicus, Bacillus flavothermus, Anoxybacillus kamchatkensis, Anoxybacillus gonensis, Caldicellulosiruptor bescii, Caldicellulosiruptor saccharolyticus, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor owensensis, Caldicellulosiruptor lactoaceticus, Moorella thermoacetica, Moorella thermoautotrophica, Thermus thermophilus, Thermus aquaticus, Thermotoga maritima, Pseudothermotoga lettingae, Pseudothermotoga thermarum, Chloroflexus aurantiacus, Anaerocellum thermophilum, Rhodothermus marinus, Sulfolobus acidocaldarius, Sulfolobus islandicus, Sulfolobus solfataricus, Thermococcus barophilus, Thermococcus kodakarensis, Pyrococcus abyssi and Pyrococcus furiosus, preferably said cell is a Geobacillus thermoglucosidasius cell, a Bacillus subtilis cell or a Clostridium thermocellum cell.
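5. The method according to any one of the preceding items, wherein said culture broth comprises a fermentable substrate comprising a carbon source, such as a carbohydrate, for example glucose, xylose or a mixture thereof, or comprising, for example, a biomass hydrolysate.
6. The method according to any one of the preceding items, wherein said thermophilic cell is an acetogenic cell, and wherein carbon monoxide, carbon dioxide, hydrogen or a mixture thereof is provided to said cell.
7. The method according to any one of the preceding items, wherein said one or more compounds comprise acetone and optionally isopropanol, wherein said cell is capable of synthesising acetyl-CoA, and/or wherein said culture broth comprises acetic acid or acetate.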
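8. The method according to any one of the preceding items, wherein acetone is produced at a titre of at least 0.8 g/L, such as at least 0.9 g/L, such as at least 1.0 g/L, such as at least 1.1 g/L, such as at least 1.2 g/L, such as at least 1.3 g/L, such as at least 1.4 g/L, such as at least 1.5 g/L, such as at least 1.6 g/L, such as at least 1.7 g/L, such as at least 1.8 g/L, such as at least 1.9 g/L, such as at least 2.0 g/L, such as at least 5 g/L, such as at least 7.5 g/L, such as at least 10 g/L, such as at least 12.5 g/L, such as at least 15 g/L, such as at least 20 g/L, such as at least 25 g/L, such as at least 50 g/L, such as at least 75 g/L, such as at least 100 g/L, such as at least 150 g/L, such as at least 250 g/L or more.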
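9. The method according to any one of the preceding items, wherein at least acetone is produced and wherein said first enzyme is CHY_1288 as set forth in SEQ ID NO: 15, CHY_1355 as set forth in SEQ ID NO: 18, Caur_1540 as set forth in SEQ ID NO: 10, GHH_c20420 as set forth in SEQ ID NO: 1, Caur_1461 as set forth in SEQ ID NO: 3, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 or Slip_0880 as set forth in SEQ ID NO: 7, or a functional variant thereof having at least 70% homology, similarity or identity thereto, preferably Caur_1461 as set forth in SEQ ID NO: 3, Rxy2 as set forth in SEQ ID NO: 60, Slip_0880 as set forth in SEQ ID NO: 7 or Dde1 as set forth in SEQ ID NO: 59.
10. The method according to any one of the preceding items, wherein said thermophilic cell expresses Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% homology, similarity or identity thereto, whereby at least part of the acetone produced is converted to isopropanol.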
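11. The method according to any one of the preceding items, wherein at least isopropanol is produced at a titre of at least 0.05 g/L, such as at least 0.075 g/L, such as at least 0.1 g/L, such as at least 0.2 g/L, such as at least 0.3 g/L, such as at least 0.4 g/L, such as at least 0.5 g/L, such as at least 0.75 g/L, such as at least 1.0 g/L, such as at least 2.0 g/L, such as at least 3.0 g/L, such as at least 4.0 g/L, such as at least 5.0 g/L, such as at least 7.5 g/L, such as at least 10.0 g/L or more, such as at least 12.5 g/L, such as at least 15 g/L, such as at least 20 g/L, such as at least 25 g/L, such as at least 50 g/L, such as at least 75 g/L, such as at least 100 g/L, such as at least 150 g/L, such as at least 250 g/L or more.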
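12. The method according to any one of the preceding items, wherein said one or more compounds comprise butanone, and wherein said culture broth comprises propionic acid or propionate.
13. The method according to any one of the preceding items, wherein at least butanone is produced, and wherein said first enzyme is GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60, Tfu_1520 as set forth in SEQ ID NO: 5, or Tfu_0436 as set forth in SEQ ID NO: 6, or a functional variant thereof having at least 70% homology, similarity or identity thereto, preferably GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 or Slip_0479 as set forth in SEQ ID NO: 4.
14. The method according to any one of the preceding items, wherein butanone is produced at a titre of at least 0.05 g/L, such as at least 0.075 g/L, such as at least 0.1 g/L, such as at least 0.2 g/L, such as at least 0.3 g/L, such as at least 0.4 g/L, such as at least 0.5 g/L, such as at least 0.75 g/L, such as at least 1.0 g/L, such as at least 2.0 g/L, such as at least 3.0 g/L, such as at least 4.0 g/L, such as at least 5.0 g/L, such as at least 7.5 g/L, such as at least 10.0 g/L or more, such as at least 12.5 g/L, such as at least 15 g/L, such as at least 20 g/L, such as at least 25 g/L, such as at least 50 g/L, such as at least 75 g/L, such as at least 100 g/L, such as at least 150 g/L, such as at least 250 g/L or more.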
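15. The method according to any one of the preceding items, wherein said second enzyme is:
i) Tle2 or a functional variant of Tle2, wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, and wherein said functional variant of Tle2 consists of a subunit having at least 70% homology, similarity or identity to Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and another subunit having at least 70% homology, similarity or identity to Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20;
ii) Dde2 as set forth in SEQ ID NO: 21 or a functional variant thereof having at least 70% homology, similarity or identity thereto; or
iii) Ghh2 or a functional variant of Ghh2, wherein Ghh2 consists of Ghh2 subunit A as set forth in SEQ ID NO: 22 and Ghh2 subunit B as set forth in SEQ ID NO: 23, and wherein said functional variant of Ghh2 consists of a subunit having at least 70% homology, similarity or identity to Ghh2 subunit A as set forth in SEQ ID NO: 22 and another subunit having at least 70% homology, similarity or identity to Ghh2 subunit B as set forth in SEQ ID NO: 23.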
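16. The method according to any one of the preceding items, wherein said acetoacetate decarboxylase is Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% homology, similarity or identity thereto.
17. The method according to any one of the preceding items, wherein the culturing in step b) is a continuous fermentation.
18. The method according to any one of the preceding items, wherein step c) comprises recovering said one or more volatile compounds from the off-gas produced in step b), such as by condensation.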
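19. The method according to any one of the preceding items, wherein said first enzyme consists of an acetyl-CoA acetyltransferase (EC 2.3.1.9), or a functional variant having at least 70% identity or similarity to said acetyl-CoA acetyltransferase (EC 2.3.1.9) and having acetyl-CoA acetyltransferase activity, said acetyl-CoA acetyltransferase (EC 2.3.1.9) being selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Slip_0880 as set forth in SEQ ID NO: 7 and Dde1 as set forth in SEQ ID NO: 59.
20. The method according to any one of the preceding items, wherein said second enzyme is selected from: Tle2 and Dde2 (EC 2.8.3.5) as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity.
21. The method according to any one of the preceding items, wherein at least acetone is produced and wherein said first enzyme is Caur_1461 (SEQ ID NO: 3), Slip_0880 (SEQ ID NO: 7) or Dde1 (SEQ ID NO: 59), or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity.
22. The method according to any one of the preceding items, wherein said one or more compounds comprise butanone, wherein said culture broth comprises propionic acid or propionate, and/or wherein said first enzyme is GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Dde1 as set forth in SEQ ID NO: 59 or Slip_0479 as set forth in SEQ ID NO: 4, or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity.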
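23. The method according to any one of the preceding items, wherein said thermophilic cell expresses Cac as set forth in SEQ ID NO: 28, or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, wherein said thermophilic cell further expresses:
i) Dde1 as set forth in SEQ ID NO: 59 and Dde2 as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone is produced; or
ii) Caur_1461 as set forth in SEQ ID NO: 3 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone and/or butanone is produced; or
iii) Slip_0880 as set forth in SEQ ID NO: 7 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone is produced; or
iv) GHH_c20420 as set forth in SEQ ID NO: 1 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced; or
v) Slip_0499 as set forth in SEQ ID NO: 2 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced; or
vi) Slip_0479 as set forth in SEQ ID NO: 4 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced;
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity.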
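24. A thermophilic cell capable of producing acetone and/or butanone and optionally isopropanol, said cell being a bacterial cell or an archaeal cell and expressing:
i) a first enzyme selected from:
an acetyl-CoA acetyltransferase (EC 2.3.1.9) selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Tfu_1520 as set forth in SEQ ID NO: 5, Tfu_0436 as set forth in SEQ ID NO: 6, Slip_0880 as set forth in SEQ ID NO: 7, Tfu_2394 as set forth in SEQ ID NO: 8, Slip_1236 as set forth in SEQ ID NO: 9, Caur_1540 as set forth in SEQ ID NO: 10, Tfu_0253 as set forth in SEQ ID NO: 11, CHY_1604 as set forth in SEQ ID NO: 14, CHY_1288 as set forth in SEQ ID NO: 15, Slip_2085 as set forth in SEQ ID NO: 16, Slip_0465 as set forth in SEQ ID NO: 17, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 and CHY_1355 as set forth in SEQ ID NO: 18, or
an enzyme of EC number 2.3.3.20 selected from the 3-oxoacyl-ACP synthase SVA_3859 as set forth in SEQ ID NO: 12 and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 as set forth in SEQ ID NO: 13; and
functional variants thereof having at least 70% homology, similarity or identity thereto;
ii) a second enzyme selected from an acetate CoA-transferase, a 3-oxoacid CoA-transferase, an acyl-CoA:acetate/3-ketoacid CoA-transferase and an acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) as set forth in SEQ ID NO: 21, Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) as set forth in SEQ ID NO: 26 and Rma (EC 3.1.2.-) as set forth in SEQ ID NO: 27, or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, wherein Ghh2 consists of Ghh2 subunit A as set forth in SEQ ID NO: 22 and Ghh2 subunit B as set forth in SEQ ID NO: 23, and wherein Tme consists of Tme subunit A as set forth in SEQ ID NO: 24 and Tme subunit B as set forth in SEQ ID NO: 25, and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), preferably Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% homology, similarity or identity thereto;
whereby said cell is capable of converting acetyl-CoA to acetone, thereby producing acetone at a titre of at least 0.8 g/L;
and/or whereby said cell is capable of converting acetyl-CoA and propionyl-CoA to butanone, thereby producing butanone;
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% homology, similarity or identity thereto,
whereby said cell is further capable of converting acetone to isopropanol, thereby producing isopropanol.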
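25. The thermophilic cell according to item 24, wherein said cell belongs to a genus selected from: Geobacillus, Thermoanaerobacterium, Thermoanaerobacter, Caldanaerobacter, Bacillus, Thermoclostridium, Anoxybacillus, Caldicellulosiruptor, Moorella, Thermus, Thermotoga, Pseudothermotoga, Chloroflexus, Anaerocellum, Rhodothermus, Sulfolobus, Thermococcus, Pyrococcus and Clostridium.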
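26. The thermophilic cell according to item 25, wherein said cell belongs to a species selected from: Geobacillus thermoglucosidasius, Geobacillus toebii, Geobacillus stearothermophilus, Geobacillus thermodenitrificans, Geobacillus kaustophilus, Geobacillus thermoleovorans, Geobacillus thermocatenulatus, Thermoanaerobacterium xylanolyticum, Thermoanaerobacterium saccharolyticum, Thermoanaerobacterium thermosaccharolyticum, Thermoanaerobacter mathranii, Thermoanaerobacter pseudoethanolicus, Thermoanaerobacter brockii, Thermoanaerobacter kivui, Caldanaerobacter subterraneus, Clostridium thermocellum, Clostridium thermosuccinogenes, Thermoclostridium stercorarium, Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus smithii, Bacillus methanolicus, Bacillus flavothermus, Anoxybacillus kamchatkensis, Anoxybacillus gonensis, Caldicellulosiruptor bescii, Caldicellulosiruptor saccharolyticus, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor owensensis, Caldicellulosiruptor lactoaceticus, Moorella thermoacetica, Moorella thermoautotrophica, Thermus thermophilus, Thermus aquaticus, Thermotoga maritima, Pseudothermotoga lettingae, Pseudothermotoga thermarum, Chloroflexus aurantiacus, Anaerocellum thermophilum, Rhodothermus marinus, Sulfolobus acidocaldarius, Sulfolobus islandicus, Sulfolobus solfataricus, Thermococcus barophilus, Thermococcus kodakarensis, Pyrococcus abyssi and Pyrococcus furiosus, preferably said cell is a Geobacillus thermoglucosidasius cell, a Bacillus subtilis cell or a Clostridium thermocellum cell.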
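27. The thermophilic cell according to any one of items 24 to 26, wherein said cell is capable of synthesising acetyl-CoA.
28. The thermophilic cell according to any one of items 24 to 27, wherein said cell comprises one or more polynucleotides, said one or more polynucleotides encoding said first enzyme, said second enzyme, said acetoacetate decarboxylase and optionally said isopropanol dehydrogenase.
29. The thermophilic cell according to any one of items 24 to 28, wherein said one or more polynucleotides are codon-optimised for expression in said cell.
30. The thermophilic cell according to any one of items 24 to 29, wherein said one or more polynucleotides are comprised within a vector or are integrated in the genome of said cell.
31. The thermophilic cell according to any one of items 24 to 30, wherein said one or more polynucleotides are under the control of an inducible promoter or under the control of a constitutive promoter.
32. The thermophilic cell according to any one of items 24 to 31, wherein said cell is a non-natural cell.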
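33. The thermophilic cell according to any one of items 24 to 32, wherein said first enzyme consists of an acetyl-CoA acetyltransferase (EC 2.3.1.9) selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Slip_0880 as set forth in SEQ ID NO: 7 and Dde1 as set forth in SEQ ID NO: 59, or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity.
34. The thermophilic cell according to any one of items 24 to 33, wherein said second enzyme is selected from: Tle2 and Dde2 (EC 2.8.3.5) as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity.
35. The thermophilic cell according to any one of items 24 to 34, wherein said first enzyme is Caur_1461 (SEQ ID NO: 3), Slip_0880 (SEQ ID NO: 7) or Dde1 (SEQ ID NO: 59), or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity.
36. The thermophilic cell according to any one of items 24 to 35, wherein said first enzyme is GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Dde1 as set forth in SEQ ID NO: 59, or Slip_0479 as set forth in SEQ ID NO: 4, or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity.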
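37. The thermophilic cell according to any one of items 24 to 36, wherein said thermophilic cell expresses Cac as set forth in SEQ ID NO: 28, or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, wherein said thermophilic cell further expresses:
i) Dde1 as set forth in SEQ ID NO: 59 and Dde2 as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone is produced; or
ii) Caur_1461 as set forth in SEQ ID NO: 3 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone and/or butanone is produced; or
iii) Slip_0880 as set forth in SEQ ID NO: 7 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone is produced; or
iv) GHH_c20420 as set forth in SEQ ID NO: 1 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced; or
v) Slip_0499 as set forth in SEQ ID NO: 2 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced; or
vi) Slip_0479 as set forth in SEQ ID NO: 4 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced;
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity.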
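38. A nucleic acid construct for modifying a thermophilic cell selected from a thermophilic bacterial cell and a thermophilic archaeal cell, comprising:
i) a polynucleotide encoding an acetyl-CoA acetyltransferase (EC 2.3.1.9) or a functional variant thereof having at least 70% homology, similarity or identity thereto, wherein said acetyl-CoA acetyltransferase is selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Tfu_1520 as set forth in SEQ ID NO: 5, Tfu_0436 as set forth in SEQ ID NO: 6, Slip_0880 as set forth in SEQ ID NO: 7, Tfu_2394 as set forth in SEQ ID NO: 8, Slip_1236 as set forth in SEQ ID NO: 9, Caur_1540 as set forth in SEQ ID NO: 10, Tfu_0253 as set forth in SEQ ID NO: 11, CHY_1604 as set forth in SEQ ID NO: 14, CHY_1288 as set forth in SEQ ID NO: 15, Slip_2085 as set forth in SEQ ID NO: 16, Slip_0465 as set forth in SEQ ID NO: 17, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 and CHY_1355 as set forth in SEQ ID NO: 18, and/or an enzyme of EC number 2.3.3.20 selected from the 3-oxoacyl-ACP synthase SVA_3859 as set forth in SEQ ID NO: 12 and the acyl-CoA:acyl-CoA alkyltransferase Despr_2661 as set forth in SEQ ID NO: 13;
ii) a polynucleotide encoding a second enzyme selected from an acetate CoA-transferase, a 3-oxoacid CoA-transferase, an acyl-CoA:acetate/3-ketoacid CoA-transferase and an acyl-CoA thioesterase II,
wherein said second enzyme is selected from: Tle2, Dde2 (EC 2.8.3.5) as set forth in SEQ ID NO: 21, Ghh2 (EC 2.8.3.5), Tme (EC 2.8.3.8), Pth (EC 2.8.3.1) as set forth in SEQ ID NO: 26 and Rma (EC 3.1.2.-) as set forth in SEQ ID NO: 27, or functional variants thereof having at least 70% homology, similarity or identity thereto,
wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, wherein Ghh2 consists of Ghh2 subunit A as set forth in SEQ ID NO: 22 and Ghh2 subunit B as set forth in SEQ ID NO: 23, and wherein Tme consists of Tme subunit A as set forth in SEQ ID NO: 24 and Tme subunit B as set forth in SEQ ID NO: 25, or functional variants thereof having at least 70% homology, similarity or identity thereto, and
iii) a polynucleotide encoding an acetoacetate decarboxylase (EC 4.1.1.4) or a functional variant thereof having at least 70% homology, similarity or identity thereto, preferably Cac as set forth in SEQ ID NO: 28.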
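39. The nucleic acid construct according to item 38, further comprising a polynucleotide encoding an isopropanol dehydrogenase (EC 1.1.1.80), preferably Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% homology, similarity or identity thereto, preferably wherein said polynucleotide encoding the isopropanol dehydrogenase, or the functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity, is SEQ ID NO: 58 or a homologue thereof having at least 70% identity thereto.
40. The nucleic acid construct according to any one of items 38 to 39, wherein one or more of said polynucleotides are codon-optimised for expression in said thermophilic cell.
41. The nucleic acid construct according to any one of items 38 to 40, wherein one or more of said polynucleotides are under the control of an inducible promoter or a constitutive promoter.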
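42. The nucleic acid construct according to any one of items 38 to 41, wherein said thermophilic cell belongs to a genus selected from: Geobacillus, Thermoanaerobacterium, Thermoanaerobacter, Caldanaerobacter, Bacillus, Thermoclostridium, Anoxybacillus, Caldicellulosiruptor, Moorella, Thermus, Thermotoga, Pseudothermotoga, Chloroflexus, Anaerocellum, Rhodothermus, Sulfolobus, Thermococcus, Pyrococcus and Clostridium.
43. The nucleic acid construct according to any one of items 38 to 42, wherein said thermophilic cell belongs to a species selected from: Geobacillus thermoglucosidasius, Geobacillus toebii, Geobacillus stearothermophilus, Geobacillus thermodenitrificans, Geobacillus kaustophilus, Geobacillus thermoleovorans, Geobacillus thermocatenulatus, Thermoanaerobacterium xylanolyticum, Thermoanaerobacterium saccharolyticum, Thermoanaerobacterium thermosaccharolyticum, Thermoanaerobacter mathranii, Thermoanaerobacter pseudoethanolicus, Thermoanaerobacter brockii, Thermoanaerobacter kivui, Caldanaerobacter subterraneus, Clostridium thermocellum, Clostridium thermosuccinogenes, Thermoclostridium stercorarium, Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus smithii, Bacillus methanolicus, Bacillus flavothermus, Anoxybacillus kamchatkensis, Anoxybacillus gonensis, Caldicellulosiruptor bescii, Caldicellulosiruptor saccharolyticus, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor owensensis, Caldicellulosiruptor lactoaceticus, Moorella thermoacetica, Moorella thermoautotrophica, Thermus thermophilus, Thermus aquaticus, Thermotoga maritima, Pseudothermotoga lettingae, Pseudothermotoga thermarum, Chloroflexus aurantiacus, Anaerocellum thermophilum, Rhodothermus marinus, Sulfolobus acidocaldarius, Sulfolobus islandicus, Sulfolobus solfataricus, Thermococcus barophilus, Thermococcus kodakarensis, Pyrococcus abyssi and Pyrococcus furiosus, preferably said cell is a Geobacillus thermoglucosidasius cell, a Bacillus subtilis cell or a Clostridium thermocellum cell.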
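44. The nucleic acid construct according to any one of items 38 to 43, wherein said acetyl-CoA acetyltransferase is selected from CHY_1288 as set forth in SEQ ID NO: 15, CHY_1355 as set forth in SEQ ID NO: 18, Caur_1540 as set forth in SEQ ID NO: 10, GHH_c20420 as set forth in SEQ ID NO: 1, Caur_1461 as set forth in SEQ ID NO: 3, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 and Slip_0880 as set forth in SEQ ID NO: 7, or a functional variant thereof having at least 70% homology, similarity or identity thereto.
45. The nucleic acid construct according to any one of items 38 to 44, wherein said isopropanol dehydrogenase (EC 1.1.1.80) is Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% homology, similarity or identity thereto.
46. The nucleic acid construct according to any one of items 38 to 45, wherein said acetyl-CoA acetyltransferase is GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60, Tfu_1520 as set forth in SEQ ID NO: 5, or Tfu_0436 as set forth in SEQ ID NO: 6, or a functional variant thereof having at least 70% homology, similarity or identity thereto, preferably GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Dde1 as set forth in SEQ ID NO: 59, Rxy2 as set forth in SEQ ID NO: 60 or Slip_0479 as set forth in SEQ ID NO: 4.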
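47. The nucleic acid construct according to any one of items 38 to 46, wherein said second enzyme is:
i) Tle2 or a functional variant of Tle2, wherein Tle2 consists of Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20, and wherein said functional variant of Tle2 consists of a subunit having at least 70% homology, similarity or identity to Tle2 subunit A (EC 2.8.3.8) as set forth in SEQ ID NO: 19 and another subunit having at least 70% homology, similarity or identity to Tle2 subunit B (EC 2.8.3.9) as set forth in SEQ ID NO: 20;
ii) Dde2 as set forth in SEQ ID NO: 21 or a functional variant thereof having at least 70% homology, similarity or identity thereto; or
iii) Ghh2 or a functional variant of Ghh2, wherein Ghh2 consists of Ghh2 subunit A as set forth in SEQ ID NO: 22 and Ghh2 subunit B as set forth in SEQ ID NO: 23, and wherein said functional variant of Ghh2 consists of a subunit having at least 70% homology, similarity or identity to Ghh2 subunit A as set forth in SEQ ID NO: 22 and another subunit having at least 70% homology, similarity or identity to Ghh2 subunit B as set forth in SEQ ID NO: 23.
48. The nucleic acid construct according to any one of items 38 to 47, wherein said acetoacetate decarboxylase is Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% homology, similarity or identity thereto.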
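49. The nucleic acid construct according to any one of items 38 to 48, wherein:
i) said acetyl-CoA acetyltransferase is selected from GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Caur_1461 as set forth in SEQ ID NO: 3, Slip_0479 as set forth in SEQ ID NO: 4, Slip_0880 as set forth in SEQ ID NO: 7 and Dde1 as set forth in SEQ ID NO: 59, or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity; and
ii) said second enzyme is selected from: Tle2 and Dde2 as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate activity.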
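50. The nucleic acid construct according to any one of items 38 to 49, wherein:
i) said polynucleotide encoding the acetyl-CoA acetyltransferase (EC 2.3.1.9), or the functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity, is selected from SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 47, or homologues thereof having at least 70% identity thereto; and
ii) said polynucleotide encoding the second enzyme is selected from: SEQ ID NO: 48 and SEQ ID NO: 49; SEQ ID NO: 50; SEQ ID NO: 51 and SEQ ID NO: 52; SEQ ID NO: 53 and SEQ ID NO: 54; and SEQ ID NO: 56, or homologues thereof having at least 70% identity thereto; and
iii) said polynucleotide encoding the acetoacetate decarboxylase, or the functional variant thereof having at least 70% homology, similarity or identity thereto, is SEQ ID NO: 57 or a homologue thereof having at least 70% identity thereto.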
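51. The nucleic acid construct according to any one of items 38 to 50, wherein said polynucleotide encoding the acetyl-CoA acetyltransferase (EC 2.3.1.9), or the functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity, is selected from SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 36 and SEQ ID NO: 61, or homologues thereof having at least 70% identity thereto;
and/or
wherein said polynucleotide encoding the second enzyme is selected from:
i) SEQ ID NO: 48 and SEQ ID NO: 49, or homologues thereof having at least 70% identity thereto; and
ii) SEQ ID NO: 50 or a homologue thereof having at least 70% identity thereto;
and/or
wherein said polynucleotide encoding the acetoacetate decarboxylase, or the functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, is SEQ ID NO: 57 or a homologue thereof having at least 70% identity thereto.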
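52. The nucleic acid construct according to any one of items 38 to 51, comprising SEQ ID NO: 57 or a homologue thereof having at least 70% identity thereto, and further comprising:
i) SEQ ID NO: 61 and SEQ ID NO: 50; or
ii) SEQ ID NO: 32; and SEQ ID NO: 48 and SEQ ID NO: 49; or
iii) SEQ ID NO: 36; and SEQ ID NO: 48 and SEQ ID NO: 49; or
iv) SEQ ID NO: 30; and SEQ ID NO: 48 and SEQ ID NO: 49; or
v) SEQ ID NO: 31; and SEQ ID NO: 48 and SEQ ID NO: 49; or
vi) SEQ ID NO: 33; and SEQ ID NO: 48 and SEQ ID NO: 49;
or homologues thereof having at least 70% identity thereto;
preferably wherein said nucleic acid construct comprises SEQ ID NO: 57 and ii) or iii).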
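53. A vector comprising the nucleic acid construct according to any one of items 38 to 52.
54. A thermophilic cell comprising the nucleic acid construct according to any one of items 38 to 52 and/or the vector according to item 53, wherein said thermophilic cell is a thermophilic bacterial cell or a thermophilic archaeal cell.
55. A kit comprising the nucleic acid construct according to any one of items 38 to 52, the vector according to item 53 or the thermophilic cell according to item 54.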
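The items above repeatedly refer to functional variants having at least 70% identity to a listed sequence. As a rough illustration only, the Python sketch below converts three-letter residues (as printed in the sequence listing) to one-letter code and computes a percent identity from a simple global alignment. The function names, the scoring values (match +1, mismatch -1, gap -2) and the identity definition (identical columns divided by alignment length) are assumptions made for this example; the items do not specify an alignment method, and the applicable definition should be taken from the description.

```python
# Illustrative sketch only. Scoring values and the identity definition
# (identical columns / alignment length) are assumptions, not taken from the items.

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing_text):
    """Convert three-letter residues to a one-letter string.

    Tokens that are not residue codes (e.g. position numbers) are skipped.
    """
    return "".join(THREE_TO_ONE[t] for t in listing_text.split() if t in THREE_TO_ONE)


def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Percent of alignment columns holding identical residues."""
    al_a, al_b = global_align(a, b)
    if not al_a:
        return 0.0
    matches = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * matches / len(al_a)


def meets_threshold(a, b, threshold=70.0):
    """True if the (assumed) percent identity is at least the threshold."""
    return percent_identity(a, b) >= threshold


if __name__ == "__main__":
    # First 16 residues of SEQ ID NO: 1 and SEQ ID NO: 7, copied from the listing.
    frag1 = to_one_letter("Met Arg Glu Val Val Ile Thr Ala Ala Val Arg Thr Pro Ile Gly Thr")
    frag7 = to_one_letter("Met Ser Arg Glu Val Val Leu Val Gly Ala Cys Arg Thr Pro Ile Gly")
    print(frag1, frag7, round(percent_identity(frag1, frag7), 1))
```

In practice, established alignment tools with substitution matrices would normally be used for full-length sequences. The quadratic-memory table here is adequate only for short illustrative inputs.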
Sequence listing
<110> Technical University of Denmark
<120> Methods and cells for production of volatile compounds
<130> P5616PC00
<160> 63
<170> PatentIn version 3.5
<210> 1
<211> 391
<212> PRT
<213> Geobacillus sp. GHH01
<400> 1
Met Arg Glu Val Val Ile Thr Ala Ala Val Arg Thr Pro Ile Gly Thr
1 5 10 15
Phe Gly Gly Val Phe Lys Asp Leu Leu Pro Thr Asp Leu Ile Val Pro
20 25 30
Val Leu Glu Glu Ala Val Lys Arg Ser Gln Ile Glu Lys Asp Glu Val
35 40 45
Asn Glu Val Ile Leu Gly His Cys Ile Gln Arg Thr Asp Ile Pro Asn
50 55 60
Thr Ala Arg Thr Ala Ala Leu Leu Ala Gly Phe Pro His Thr Thr Thr
65 70 75 80
Gly Phe Thr Ile Gln Arg Gln Cys Ala Ser Gly Met Gln Ala Val Ile
85 90 95
Ser Ala Ala Met Gln Ile Gln Val Gly Leu Ser Asp Val Val Ile Ala
100 105 110
Gly Gly Val Glu Ser Met Ser Ser Ser Pro Tyr Ile Leu Lys Gln His
115 120 125
Arg Trp Gly Ala Arg Leu Gln His Gln Gln Val Arg Asp Ser Val Trp
130 135 140
Glu Val Leu Glu Asp Pro Ile His His Val Met Met Gly Glu Thr Ala
145 150 155 160
Glu Asn Leu Ala Glu Arg Tyr Gly Ile Thr Arg Glu Glu Gln Asp Glu
165 170 175
Leu Ala Leu Leu Ser His Arg Arg Ala Ile Leu Ala Met Glu Ser Gly
180 185 190
Tyr Phe Asp Ser Gln Ile Val Pro Ile Thr Val Lys Thr Arg Lys Glu
195 200 205
Glu Ile Val Val Thr Lys Asp Glu His Pro Arg Ala Asp Val Thr Lys
210 215 220
Glu Lys Leu Ala Ser Leu Arg Pro Val Phe Arg Lys Asn Gly Thr Val
225 230 235 240
Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ala Ala Ala Leu Val
245 250 255
Leu Met Ser Ala Glu Tyr Ala Gln Gln Arg Gly Ile Glu Pro Leu Ala
260 265 270
Lys Val Val Gly Tyr Ser Val Ala Gly Val Asp Pro Leu Val Met Gly
275 280 285
Arg Gly Pro Val Pro Ala Val Gln Lys Gly Leu Glu Arg Val Asn Trp
290 295 300
Thr Leu Ala Glu Ala Asp Leu Ile Glu Ile Asn Glu Ala Phe Ala Ala
305 310 315 320
Gln Tyr Leu Ala Val Glu Arg Glu Leu Arg Leu Asp Arg Asp Lys Val
325 330 335
Asn Val Asn Gly Ser Gly Ile Ser Leu Gly His Pro Ile Gly Cys Thr
340 345 350
Gly Ala Arg Ile Val Val Ser Leu Ile His Glu Leu Gln Arg Arg Gln
355 360 365
Leu Glu Lys Gly Ile Ala Ser Leu Cys Val Gly Gly Gly Met Gly Thr
370 375 380
Ala Val Phe Ile Glu Ala Leu
385 390
<210> 2
<211> 400
<212> PRT
<213> Syntrophothermus lipocalidus DSM 12680
<400> 2
Met Ile Asn Glu Val Val Met Val Ser Ala Cys Arg Thr Ala Ile Gly
1 5 10 15
Asp Phe Met Gly Ser Leu Lys Asp Leu Lys Ala Asn Asp Leu Ser Ala
20 25 30
Ile Thr Ala Thr Glu Ala Leu Lys Arg Ala Gly Ile Gln Pro Glu Met
35 40 45
Val Asp Ser Leu Val Leu Gly Met Cys Leu His His Gly Asn Asp Ser
50 55 60
Gly Pro Ala Arg Gln Val Ala Met Ala Ile Gly Met Arg His Ser Ser
65 70 75 80
Trp Ala Cys Met Val Asn Gln Asn Cys Ala Ser Ala Met Arg Ala Leu
85 90 95
Glu Ile Ala Ala Asn Glu Leu Met Leu Gly Lys Ser Glu Ile Ser Leu
100 105 110
Val Val Gly Thr Glu Ser Met Thr Asn Val Pro Tyr Ile Leu Arg Lys
115 120 125
Ala Arg Phe Gly Tyr Arg Leu Phe Asp Gly Asp Lys Ala Glu Asp Ala
130 135 140
Met Ile Cys Asp Gly Leu Phe Asp Lys Met Val Pro Gly His Met Ala
145 150 155 160
Ile Thr Ala Glu Asn Val Ala Glu Lys Tyr Gly Ile Thr Arg Glu Glu
165 170 175
Cys Asp Glu Leu Ala Leu Leu Ser His Thr Arg Ala Leu Lys Ala Asn
180 185 190
Ala Glu Gly Ile Phe Ala Arg Glu Ile Val Pro Val Glu Ile Lys Thr
195 200 205
Lys Lys Gly Val Lys Val Val Asp Lys Asp Glu His Pro Met Asp Thr
210 215 220
Ser Leu Glu Lys Leu Ala Gln Leu Pro Pro Val Phe Lys Lys Gly Gly
225 230 235 240
Val Val Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ser Ala Ala
245 250 255
Ala Val Leu Met Thr Lys Lys Lys Ala Glu Glu Leu Gly Ile Lys Pro
260 265 270
Leu Met Lys Leu Leu Tyr Val Cys Ser Glu Gly Val Asp Pro Lys Phe
275 280 285
Met Gly Leu Gly Pro Ala Val Ala Ile Pro Lys Val Leu Asn Lys Ala
290 295 300
Gly Leu Lys Phe Glu Asp Val Glu Tyr Trp Glu Ile Asn Glu Ala Phe
305 310 315 320
Ala Ala Gln Trp Leu Gly Val Gly Arg Met Leu Lys Glu Asp Phe Gly
325 330 335
Ile Glu Leu Asp Leu Asp Lys Val Asn His Asn Gly Ser Gly Ile Gly
340 345 350
Leu Gly His Pro Val Gly Cys Thr Gly Leu Arg Ile Gln Val Ser Met
355 360 365
Tyr Tyr Glu Met Glu Arg Leu Gly Leu Thr Ile Gly Gly Ala Ser Leu
370 375 380
Cys Val Gly Gly Gly Pro Ala Met Ala Ala Leu Trp Thr Arg Asp Ile
385 390 395 400
<210> 3
<211> 395
<212> PRT
<213> Chloroflexus aurantiacus J-10-fl
<400> 3
Met Ser Glu Lys Arg Glu Val Val Val Leu Ser Gly Val Arg Thr Ala
1 5 10 15
Ile Gly Thr Phe Gly Gly Ser Leu Lys Asp Ile Pro Pro Thr Glu Leu
20 25 30
Ala Ala Leu Val Thr Arg Glu Ala Val Ala Arg Ser Gly Leu Gln Pro
35 40 45
Asn Glu Ile Gly His Val Val Phe Gly His Val Ile Asn Thr Glu Pro
50 55 60
His Asp Met Tyr Leu Ala Arg Tyr Ala Ala Val Arg Gly Gly Leu Ser
65 70 75 80
Val Glu Thr Pro Ala Leu Thr Leu Asn Arg Leu Cys Gly Ser Gly Leu
85 90 95
Gln Ala Ile Val Ser Ala Ala Gln Tyr Ile Leu Gln Gly Asp Ala Glu
100 105 110
Ala Ala Val Ala Gly Gly Ala Glu Cys Met Ser Arg Gly Pro Tyr Ser
115 120 125
Leu Pro Ala Met Arg Phe Gly Ala Arg Met Asn Asp Ser Lys Val Val
130 135 140
Asp Met Met Val Gly Ala Leu Thr Asp Pro Phe Asp Asp Cys His Met
145 150 155 160
Gly Val Thr Ala Glu Asn Val Ala Ala Lys Trp Gly Ile Ser Arg Glu
165 170 175
Asp Gln Asp Gln Leu Ala Tyr Glu Ser His Met Arg Ala Ala Arg Ala
180 185 190
Ile Asp Glu Gly Arg Phe Ala Asn Gln Ile Val Pro Val Glu Ile Lys
195 200 205
Val Lys Gly Gly Thr Ala Gln Phe Met Val Asp Glu Gly Val Arg Arg
210 215 220
Asp Thr Thr Ile Asp Lys Leu Ala Lys Leu Arg Pro Val Phe Leu Lys
225 230 235 240
Asp Gly Ser Val Thr Ala Gly Asn Ala Ser Ser Ile Asn Asp Ala Ala
245 250 255
Ala Ala Val Val Leu Met Asp Arg Ala Thr Ala Glu Arg Arg Gly Tyr
260 265 270
Lys Pro Leu Ala Arg Leu Val Gly Tyr Ser His Ala Ala Val Glu Pro
275 280 285
Lys Tyr Met Gly Ile Gly Pro Val Pro Ala Val Arg Arg Leu Leu Glu
290 295 300
Arg Thr Gly Leu Arg Ile Ser Asp Ile Asp Leu Phe Glu Val Asn Glu
305 310 315 320
Ala Phe Ala Ala Gln Ala Leu Ala Val Ile Arg Asp Leu Glu Leu Pro
325 330 335
Pro Asp Arg Thr Asn Pro Asn Gly Ser Gly Ile Ser Leu Gly His Pro
340 345 350
Ile Gly Ala Thr Gly Cys Ile Leu Thr Val Lys Ala Ile His Glu Leu
355 360 365
His Arg Thr Gly Gly Arg Tyr Ala Leu Val Thr Met Cys Ile Gly Gly
370 375 380
Gly Gln Gly Ile Ala Ala Ile Phe Glu Arg Met
385 390 395
<210> 4
<211> 392
<212> PRT
<213> Syntrophothermus lipocalidus DSM 12680
<400> 4
Met Arg Asp Val Val Ile Val Ser Gly Lys Arg Thr Ala Ile Gly Asn
1 5 10 15
Phe Leu Gly Ala Leu Lys Asp Phe Ser Ala Val Asp Leu Gly Thr Ile
20 25 30
Ala Leu Lys Ala Ala Ile Asn Ser Ala Gly Ile Ser Pro Asp Thr Val
35 40 45
Glu Glu Val Ala Ala Gly His Val Tyr Gln Ala Gly Cys Lys Gly Asn
50 55 60
Pro Ala Arg Gln Ile Thr Ile Gly Ala Gly Cys Pro Val Glu Thr Val
65 70 75 80
Ser Val Thr Val Asn Gln Gln Cys Pro Ser Ala Met Arg Ala Leu Glu
85 90 95
Ile Ile Ser Gln Glu Ile Met Leu Gly Lys Ile Asp Ala Gly Ala Ala
100 105 110
Val Gly Ile Glu Ser Met Ser Asn Val Pro Tyr Leu Leu Leu Lys Ala
115 120 125
Arg Thr Gly Tyr Arg Met Gly Asn Gly Glu Leu Val Asp Gly Met Leu
130 135 140
Tyr Asp Ala Leu Ile Asp Ala Phe Gly Asn Gly His Gln Gly Ile Thr
145 150 155 160
Ala Glu Asn Leu Ala Glu Met Tyr Asn Ile Ser Arg Glu Glu Gln Asp
165 170 175
Glu Trp Ala Phe Ile Ser His Gln Arg Ala Cys Gln Ala Ile Lys Glu
180 185 190
Gly Lys Phe Lys Asp Glu Ile Val Pro Val Glu Val Lys Thr Lys Lys
195 200 205
Glu Thr Phe Leu Phe Asp Thr Asp Glu His Pro Asn Pro Asp Thr Thr
210 215 220
Leu Glu Ser Leu Ala Lys Leu Lys Pro Ala Phe Lys Lys Asp Gly Thr
225 230 235 240
Val Thr Ala Gly Asn Ala Ser Ser Ile Asn Asp Ala Ala Cys Ala Ala
245 250 255
Val Val Met Ala His Asp Lys Ala Val Glu Leu Gly Ile Lys Pro Leu
260 265 270
Ala Arg Ile Val Ala Thr Ala Ser Ala Ala Val Glu Pro Arg Ile Met
275 280 285
Gly Ile Gly Val Val Pro Ala Val Lys Arg Ala Leu Lys Phe Ala Gly
290 295 300
Met Ser Leu Asp Asp Val Gln Leu Trp Glu Ile Asn Glu Ala Phe Ala
305 310 315 320
Ala Gln Phe Leu Ala Cys Asn Arg Glu Leu Lys Leu Asp Thr Glu Lys
325 330 335
Ile Asn Val Asn Gly Ser Gly Ile Ser Leu Gly His Pro Val Gly Cys
340 345 350
Thr Gly Leu Arg Leu Val Ile Thr Leu Ile Asn Glu Met Lys Arg Arg
355 360 365
Asn Leu Arg Tyr Gly Cys Ala Ala Leu Cys Ala Gly Gly Gly Pro Ala
370 375 380
Met Ala Thr Ile Ile Glu Val Leu
385 390
<210> 5
<211> 393
<212> PRT
<213> Thermobifida fusca YX
<400> 5
Met Ser Ser Pro Glu Arg Ile Ile Val Val Asp Gly Ala Arg Thr Pro
1 5 10 15
Val Gly Ser Phe Gly Gly Ala Phe Lys Asp Val Pro Ala His Glu Leu
20 25 30
Gly Ala Val Ala Ala Arg Ala Ala Leu Gln Arg Ser Gly Ile Ala Ala
35 40 45
Ser Asp Ile Asp Glu Val Val Met Gly Cys Ile Gly Gln Val Gly Pro
50 55 60
Asp Ala Tyr Asn Ala Arg Arg Val Ala Ile Ala Ala Gly Leu Pro Glu
65 70 75 80
Ser Val Pro Ala Tyr Thr Val Asn Arg Leu Cys Gly Ser Gly Leu Gln
85 90 95
Ala Val Trp Ser Gly Ala Met Gln Ile Arg Trp Gly Ala Ala Asp Ile
100 105 110
Val Leu Ala Gly Gly Asp Glu Asn Met Ser Arg Met Pro Phe Tyr Asp
115 120 125
Phe Gly Ala Arg Ser Gly Tyr Arg Leu Gly Asp Arg Thr Leu Val Asp
130 135 140
Gly Thr Val Ala Met Leu Thr Asp Pro Phe Ser Asn Val His Met Gly
145 150 155 160
Cys Thr Ala Glu Ala Val Ala Arg Lys Tyr Gly Val Ser Arg Ala Glu
165 170 175
Gln Asp Glu Phe Ala Leu Glu Ser Gln Arg Arg Ala Ala Ala Asp Ala
180 185 190
Ala Arg Ala Ala Phe Ala Glu Glu Ile Thr Pro Val Glu Val Gly Gly
195 200 205
Arg Lys Pro Val Val Val Glu Val Asp Glu His Pro Arg Pro Asp Thr
210 215 220
Thr Leu Glu Gly Leu Ala Arg Leu Arg Pro Val Phe Glu Lys Asp Gly
225 230 235 240
Thr Val Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ala Ala Ala
245 250 255
Leu Val Leu Ala Arg Glu Ser Val Val Arg Glu Arg Gly Leu Lys Gly
260 265 270
Leu Ala Val Val Glu Ser Val Ala Thr Ala Ala Met Asp Pro Gln Leu
275 280 285
Met Gly Tyr Ala Pro Val Leu Ala Leu Arg Lys Leu Phe Glu Gln Thr
290 295 300
Gly Thr Ser Pro Ala Val Val Asp Val Val Glu Leu Asn Glu Ala Phe
305 310 315 320
Ala Ala Gln Ala Val Ala Val Ile Arg Asp Ala Gly Leu Asp Pro Glu
325 330 335
Lys Thr Asn Pro Tyr Gly Gly Ala Ile Ala Leu Gly His Pro Val Gly
340 345 350
Ala Thr Gly Ala Ile Leu Thr Leu Arg Val Ala Arg Asp Leu Val Arg
355 360 365
Arg Asp Leu Glu Leu Gly Val Val Thr Met Cys Ile Gly Gly Gly Gln
370 375 380
Ala Leu Ala Ala Leu Leu Arg Arg Val
385 390
<210> 6
<211> 407
<212> PRT
<213> Thermobifida fusca YX
<400> 6
Met Pro Glu Ala Val Ile Val Ala Thr Ala Arg Ser Pro Ile Gly Arg
1 5 10 15
Ala Phe Lys Gly Ser Leu Lys Asp Ile Arg Pro Asp Asp Leu Thr Ala
20 25 30
Gln Ile Ile Ser Ala Ala Leu Ala Lys Val Pro Gln Leu Asp Pro Ala
35 40 45
Thr Ile Asp Asp Leu Leu Leu Gly Cys Gly Leu Pro Gly Gly Glu Gln
50 55 60
Gly Phe Asn Met Ala Arg Val Val Ala Val Gln Leu Gly Leu Asp Ser
65 70 75 80
Val Pro Gly Thr Thr Ile Thr Arg Tyr Cys Ser Ser Ser Leu Gln Thr
85 90 95
Thr Arg Met Ala Phe His Ala Ile Lys Ala Gly Glu Gly Asp Val Phe
100 105 110
Ile Ser Ala Gly Val Glu Met Val Ser Arg Phe Thr Lys Gly Asn Ser
115 120 125
Asp Thr Leu Pro Asp Thr Lys Asn Pro Leu Phe Ala Glu Ala Glu Ala
130 135 140
Arg Thr Ala Arg Arg Ala Glu Gly Gly Ala Glu Pro Trp Arg Asp Pro
145 150 155 160
Arg Glu Glu Gly Lys Leu Pro Asp Ile Tyr Ile Ala Met Gly Gln Thr
165 170 175
Ala Glu Asn Val Ala Gln Leu Arg Gly Val Ser Arg Gln Arg Gln Asp
180 185 190
Glu Phe Ala Val Arg Ser Gln Asn Leu Ala Glu Lys Ala Leu Asp Asn
195 200 205
Gly Phe Trp Glu Arg Glu Ile Thr Pro Val Thr Leu Pro Asp Gly Thr
210 215 220
Val Val Ser Thr Asp Asp Gly Pro Arg Arg Gly Thr Thr Tyr Glu Lys
225 230 235 240
Val Ala Ala Leu Asp Pro Val Phe Arg Pro Asp Gly Thr Val Thr Ala
245 250 255
Gly Asn Cys Cys Pro Leu Asn Asp Gly Ala Ala Ala Leu Ile Ile Met
260 265 270
Ser Asp Arg Lys Ala Ala Glu Leu Gly Ile Thr Pro Leu Ala Arg Ile
275 280 285
Val Ser Thr Gly Val Ser Ala Leu Ser Pro Glu Ile Met Gly Leu Gly
290 295 300
Pro Val Glu Ala Ser Arg Gln Ala Leu Ala Arg Ala Asn Met Ser Ile
305 310 315 320
Arg Asp Ile Asp Leu Val Glu Ile Asn Glu Ala Phe Ala Ala Gln Val
325 330 335
Leu Pro Ser Ala Asp Asp Leu Gly Ile Asp Ile Asp Ser Gln Leu Asn
340 345 350
Val Asn Gly Gly Ala Ile Ala Ile Gly His Pro Phe Gly Met Thr Gly
355 360 365
Ala Arg Ile Thr Thr Thr Leu Ile Asn Ala Leu Gln Phe His Asp Lys
370 375 380
Thr Phe Gly Leu Glu Thr Met Cys Val Gly Gly Gly Gln Gly Met Ala
385 390 395 400
Ala Ile Phe Glu Arg Leu Ser
405
<210> 7
<211> 394
<212> PRT
<213> Syntrophothermus lipocalidus DSM 12680
<400> 7
Met Ser Arg Glu Val Val Leu Val Gly Ala Cys Arg Thr Pro Ile Gly
1 5 10 15
Thr Phe Gly Gly Thr Leu Lys Asp Met Thr Ala Val Gln Leu Gly Thr
20 25 30
Ile Val Met Lys Glu Ala Leu Lys Arg Ala Gly Ile Ser Gly Asp Gln
35 40 45
Val Asp Glu Val Ile Phe Gly Cys Val Leu Gln Ala Gly Gln Gly Gln
50 55 60
Asn Val Ala Arg Gln Cys Ala Ile His Ala Gly Ile Pro Glu Thr Val
65 70 75 80
Thr Ser Phe Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Arg Ala Val
85 90 95
Ser Leu Ala Ala Gln Ile Ile Lys Ala Gly Asp Ala Asp Ile Val Leu
100 105 110
Ala Gly Gly Thr Glu Ser Met Thr Asn Ala Pro Tyr Leu Val Pro Lys
115 120 125
Ala Arg Tyr Gly Tyr Arg Met Gly Asp Gly Lys Leu Val Asp Glu Met
130 135 140
Val Phe Gly Gly Leu Thr Asp Ile Phe Asn Gly Tyr His Met Gly Ile
145 150 155 160
Thr Ala Glu Asn Val Asn Glu Met Tyr Gly Ile Thr Arg Glu Glu Gln
165 170 175
Asp Glu Phe Gly Leu Arg Ser Gln Glu Arg Ala Phe Ala Ala Ile Glu
180 185 190
Ser Gly Arg Phe Lys Asp Glu Ile Val Pro Val Val Ile Lys Thr Lys
195 200 205
Lys Gly Glu Val Val Phe Asp Thr Asp Glu His Pro Arg Arg Thr Thr
210 215 220
Met Glu Ala Leu Ala Lys Leu Lys Pro Ala Phe Lys Lys Asp Gly Ser
225 230 235 240
Val Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ala Ala Ala Val
245 250 255
Val Val Met Ser Lys Glu Lys Ala Asp Glu Leu Gly Ile Lys Pro Met
260 265 270
Ala Arg Val Val Ser Tyr Ala Ser Gly Gly Val Asp Pro Lys Ile Met
275 280 285
Gly Val Gly Pro Val Pro Ala Thr Lys Lys Ala Leu Ala Lys Ala Gly
290 295 300
Leu Thr Leu Asp Asp Ile Asp Leu Ile Glu Ala Asn Glu Ala Phe Ala
305 310 315 320
Ala Gln Ser Ile Ala Val Ala Arg Asp Met Gly Trp Asp Lys Met Met
325 330 335
Asp Lys Val Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile
340 345 350
Gly Ala Ser Gly Cys Arg Ile Leu Val Thr Leu Leu Tyr Glu Met Gln
355 360 365
Lys Arg Asn Ala Lys Arg Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly
370 375 380
Gln Gly Thr Thr Leu Ile Val Glu Ser Leu
385 390
<210> 8
<211> 395
<212> PRT
<213> Thermobifida fusca YX
<400> 8
Met Pro Gly Ser Val Ile Val Gly Gly Ala Arg Thr Pro Ile Gly Lys
1 5 10 15
Leu Leu Gly Ala Leu Ser Gly Phe Ala Ala Val Asp Leu Gly Ala Ile
20 25 30
Ala Ile Lys Ala Ala Leu Gln Arg Ala Gly Ile Ser Gly Asp Gln Val
35 40 45
Asp Tyr Val Ile Met Gly Gln Val Leu Gln Ala Gly Gln Gly Gln Ile
50 55 60
Pro Ser Arg Gln Ala Ser Val Lys Ala Gly Ile Pro Met Ser Val Pro
65 70 75 80
Ser Leu Thr Ile Asn Lys Val Cys Leu Ser Gly Leu Asp Ala Ile Ala
85 90 95
Leu Ala Asp Gln Leu Ile Thr Ala Gly Glu Phe Asp Val Val Val Ala
100 105 110
Gly Gly Met Glu Ser Met Thr Asn Ala Pro His Val Leu Pro Lys Ala
115 120 125
Arg His Gly Tyr Lys Tyr Gly Ser Ile Glu Val Leu Asp Ala Thr Ala
130 135 140
His Asp Ala Leu Thr Asp Ala Phe Asp His Val Ser Met Gly Leu Ser
145 150 155 160
Thr Glu Arg Tyr Asn Ala Arg His Gly Met Thr Arg Glu Glu Gln Asp
165 170 175
Ala Phe Ala Ala Arg Ser His Gln Arg Ala Ala Ala Ala Ile Glu Ala
180 185 190
Gly Leu Phe Lys Asp Glu Ile Val Pro Val Glu Val Pro Arg Arg Lys
195 200 205
Gly Asp Pro Thr Ile Val Asp Thr Asp Glu Gly Val Arg Pro Asp Thr
210 215 220
Thr Val Glu Ala Leu Ala Arg Leu Arg Pro Ala Phe Asp Pro Asp Gly
225 230 235 240
Thr Ile Thr Ala Gly Ser Ser Ser Gln Ile Ser Asp Gly Ala Cys Ala
245 250 255
Val Val Val Met Ser Arg Thr Lys Ala Glu Glu Leu Gly Cys Glu Ile
260 265 270
Leu Ala Glu Ile Gln Ala His Gly Asn Val Ala Gly Pro Asp Asn Ser
275 280 285
Leu His Cys Gln Pro Ala Asn Ala Ile Lys His Ala Leu Ala Lys Ala
290 295 300
Gly Arg Asp Val Ala Asp Leu Asp Leu Val Glu Ile Asn Glu Ala Phe
305 310 315 320
Ala Ser Val Ala Ile Gln Ser Met Arg Glu Leu Gly Val Ser Glu Asp
325 330 335
Ile Val Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Val Gly
340 345 350
Met Ser Gly Ala Arg Ile Val Leu His Leu Val His Glu Leu Arg Arg
355 360 365
Arg Gly Gly Gly Leu Gly Ala Ala Gly Leu Cys Gly Gly Gly Gly Gln
370 375 380
Gly Asp Ala Leu Leu Leu Ser Val Pro Ala Ser
385 390 395
<210> 9
<211> 393
<212> PRT
<213> Syntrophothermus lipocalidus DSM 12680
<400> 9
Met Ala Ala Gly Ile Lys Asp Lys Ala Ala Val Ile Gly Met Gly Cys
1 5 10 15
Thr Lys Phe Gly Glu Arg Phe Asp Cys Asn Leu Glu Asp Leu Met Leu
20 25 30
Glu Ala Ile Glu Glu Ala Leu Ala Asp Ser Gly Leu Glu Phe Asn Asp
35 40 45
Ile Asp Ala Phe Trp Phe Gly Thr Phe Thr Ser Gly Met Ala Gly Leu
50 55 60
Ala Phe Ser Asn Arg Met Lys Ser Gln Tyr Lys Pro Val Thr Arg Ile
65 70 75 80
Glu Asn Met Cys Cys Thr Gly Leu Asp Ala Phe Arg Asn Ala Cys Tyr
85 90 95
Ala Val Val Ser Gly Ala Tyr Asp Val Val Met Ala Ile Gly Ala Glu
100 105 110
Lys Leu Lys Asp Gly Gly Tyr Ser Gly Leu Glu Val Pro Ala Glu Asp
115 120 125
Ser Asp Arg Thr Met Pro Asp Leu Thr Ala Pro Ala Arg Phe Ala Val
130 135 140
Ile Ala Pro Ala Tyr Ala His Lys Tyr Gly Leu Ser Met Gln Gln Met
145 150 155 160
Lys Glu Val Met Ala Arg Ile Ala Trp Lys Asn His Lys Asn Gly Ser
165 170 175
Leu Asn Pro Lys Ala Gln Phe Gln Ala Glu Val Pro Ile Glu Asn Ile
180 185 190
Leu Lys Ser Pro Met Ile Cys Ser Pro Leu Gly Ile Met Asp Cys Ser
195 200 205
Gly Val Ser Asp Gly Ala Ala Cys Ala Ile Ile Val Arg Ser Glu Asp
210 215 220
Ala Lys Lys Tyr Arg Pro Asp Pro Met Tyr Val Lys Gly Ile Gln Ile
225 230 235 240
Ala Ala Gly Pro Gly His Ser Glu Lys His Gln Ser Tyr Asp Phe Thr
245 250 255
Thr Ala Trp Glu Thr Tyr Tyr Ala Gly Gln Ala Ala Tyr Arg Glu Ala
260 265 270
Gly Ile Thr Asn Pro Arg Glu Gln Ile Asp Leu Ala Glu Val His Asp
275 280 285
Cys Phe Thr Pro Thr Glu Leu Ile Ile Tyr Glu Asp Leu Gln Phe Ser
290 295 300
Ala Arg Gly Gln Gly Trp Arg Asp Ala Leu Asp Gly Phe Phe Asp Leu
305 310 315 320
Asp Gly Lys Leu Pro Val Asn Pro Asp Gly Gly Leu Lys Ser Phe Gly
325 330 335
His Pro Ile Gly Ala Ser Gly Ile Arg Met Leu Tyr Glu Ser Trp Leu
340 345 350
Gln Phe His Gly Lys Ala Gly Lys Arg Gln Leu Glu Asn Pro Lys Ile
355 360 365
Gly Leu Ala His Asn Leu Gly Gly Gln Pro Tyr Gln Cys Val Val Gly
370 375 380
Val Ala Val Val Gly Lys Glu Leu Gly
385 390
<210> 10
<211> 393
<212> PRT
<213> Chloroflexus aurantiacus J-10-fl
<400> 10
Met Asp Asp Val Val Ile Val Gly Ala Ala Arg Thr Pro Ile Gly Arg
1 5 10 15
Phe Asn Ser Ala Tyr Ser Gly Leu Ser Ala Ile Asp Leu Gly Ala Ala
20 25 30
Ala Val Gln Ala Ala Val Gln Arg Ala Gly Ile Glu Ala Asp Ser Ile
35 40 45
Asp Glu Cys Ile Met Gly Cys Val Val Thr Ala Gly Leu Gly Gln Ser
50 55 60
Pro Ala Arg Gln Ala Ala Leu Arg Ala Gly Leu Pro His Thr Ile Gly
65 70 75 80
Gly Leu Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Lys Ala Val Met
85 90 95
Ile Gly Thr Ala Leu Ile Lys Ala Gly Glu Ala Asp Val Ile Val Ala
100 105 110
Gly Gly Met Glu His Met Ser Gly Ala Pro Tyr Leu Leu Pro Gln Ala
115 120 125
Arg His Gly Tyr Arg Leu Gly His Gly Gln Ile Ile Asp Ala Val Val
130 135 140
His Asp Gly Leu Trp Cys Ala Phe Glu His His His Met Gly Val Ala
145 150 155 160
Ala Glu Trp Ile Ala Arg Thr Phe Asn Val Thr Arg Glu Gln Gln Asp
165 170 175
Ala Tyr Ala Leu Gln Ser His Gln Arg Ala Val Ala Ala Gln Asp Ser
180 185 190
Gly Ala Phe Gln Ala Glu Ile Ala Pro Val Thr Val Pro Gly Pro Lys
195 200 205
Gly Gln Val Asn Leu Val Thr Thr Asp Glu Gly Pro Arg Arg Asp Thr
210 215 220
Ser Leu Ala Ala Leu Ala Lys Leu Lys Pro Ala Phe Val Thr Asp Gly
225 230 235 240
Thr Val Thr Ala Gly Asn Ala Pro Gly Ile Thr Asp Gly Ala Ala Ala
245 250 255
Val Val Leu Met Arg Ala Ser Arg Ala Ala Gln Leu Gly Val Gln Pro
260 265 270
Leu Ala Arg Ile Gly Thr Ala Ala Gln Ala Ala Val Lys Pro Leu Glu
275 280 285
Leu Phe Thr Ala Pro Ala Phe Ala Ile Glu Arg Leu Met Lys Arg Ala
290 295 300
Gly Arg Thr Leu Asp Asp Tyr Asp Leu Phe Glu Ile Asn Glu Ala Phe
305 310 315 320
Ala Ala Gln Val Ile Ala Asn Leu Arg Ala Leu Ala Leu Asp Ala Asp
325 330 335
Arg Val Asn Val His Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly
340 345 350
Ala Ser Gly Ala Arg Val Leu Val Thr Leu Ile Ser Ala Leu Arg Gln
355 360 365
Arg Gly Gly Gln Arg Gly Ile Ala Ala Leu Cys Leu Gly Gly Gly Glu
370 375 380
Ala Val Ala Leu Glu Val Glu Val Val
385 390
<210> 11
<211> 380
<212> PRT
<213> Thermobifida fusca YX
<400> 11
Met Ala Glu Ala Tyr Ile Val Gly Ala Val Arg Thr Pro Ile Gly Thr
1 5 10 15
Arg Lys Gly Ala Leu Ala Ala Val His Pro Ala Asp Leu Gly Ala His
20 25 30
Val Leu Lys Glu Leu Val Asn Arg Thr Gly Ile Asp Pro Ala Ala Val
35 40 45
Glu Asp Val Ile Met Gly Cys Val Thr Gln Val Gly Pro Gln Ala Leu
50 55 60
Asp Leu Ala Arg Thr Ala Trp Leu Ser Ala Gly Leu Pro Glu Ser Thr
65 70 75 80
Pro Gly Val Thr Ile Asp Arg Gln Cys Gly Ser Ser Gln Gln Ala Val
85 90 95
His Phe Ala Ala Gln Gly Val Met Ser Gly Thr Gln Asp Leu Val Ile
100 105 110
Ala Ala Gly Val Glu Asn Met Gly Met Val Pro Met Gly Ala Asn Val
115 120 125
Gln Phe Ala Val Asp Asn Gly Leu Ser Val Tyr Gly Gln Gly Trp Val
130 135 140
Glu Arg Tyr Gly Thr Gln Glu Ile Ser Gln Phe Arg Gly Ala Gln Leu
145 150 155 160
Met Cys Glu Lys Trp Gly Tyr Thr Arg Glu Asp Leu Glu Lys Tyr Ala
165 170 175
Leu Glu Ser His Arg Arg Ala Ala Ala Ala Ile Glu Ala Gly Tyr Phe
180 185 190
Asp Ala Glu Thr Ala Pro Leu Ala Gly Val Thr His Asp Glu Gly Val
195 200 205
Arg Pro Asp Thr Ser Leu Glu Lys Met Ala Glu Leu Ala Pro Leu Arg
210 215 220
Glu Gly Trp Ala Leu Thr Ala Ala Val Ser Ser Gln Ile Ser Val Gly
225 230 235 240
Ala Ser Ala Leu Leu Ile Ala Ser Glu Arg Ala Val Ala Glu His Gly
245 250 255
Leu Thr Pro Leu Ala Arg Ile Val Gln Leu Ala Leu Ala Gly Asp Asp
260 265 270
Pro Val Tyr Met Leu Thr Ala Pro Ile Pro Ala Thr Arg Ile Ala Leu
275 280 285
Arg Lys Ala Gly Leu Asp Ile Asp Asp Ile Asp Val Val Glu Ile Asn
290 295 300
Glu Ala Phe Ala Pro Val Pro Met Ala Trp Ile Asp Glu Ile Gly Ala
305 310 315 320
Asp Pro Ala Lys Val Asn Pro Asn Gly Gly Ala Ile Ala Leu Gly His
325 330 335
Pro Leu Gly Ala Thr Gly Ala Val Leu Met Thr Lys Leu Val His Glu
340 345 350
Leu Arg Arg Thr Gly Gly Arg Tyr Gly Leu Gln Thr Met Cys Glu Gly
355 360 365
Gly Gly Gln Ala Asn Val Thr Ile Ile Glu Arg Val
370 375 380
<210> 12
<211> 342
<212> PRT
<213> Sulfurifustis variabilis
<400> 12
Met His Ser Val Gly His Ser Arg Ile Ile Ser Thr Gly Met Tyr Leu
1 5 10 15
Pro Pro Glu Arg Leu Ser Ser Arg Glu Leu Met Glu Met Phe Arg Ser
20 25 30
Arg Glu Arg Phe Gly Leu Pro Tyr Glu Trp Leu Glu Arg Thr Thr Gly
35 40 45
Ile Arg Glu Arg Arg Phe Ala Pro Pro Asp Phe Lys Ser Ser Glu Met
50 55 60
Ala Val Ala Ala Ala Arg Glu Ala Leu Glu Leu Gly Glu Val Ser Pro
65 70 75 80
Ser Gln Ile Asp Ala Ile Ile Tyr Cys Gly Val Leu Arg Asp His Val
85 90 95
Glu Pro Ala Thr Ala His Val Val Gln Asp Lys Ile Gly Ala Arg Asn
100 105 110
Ala Ile Ala Phe Asp Val Ser Asn Ala Cys Leu Gly Phe Met Asn Gly
115 120 125
Met His Leu Met Asp Ala Leu Ile Ala Thr Gly Gln Ala Arg Arg Gly
130 135 140
Leu Val Val Thr Gly Glu Arg Gly Asn His Tyr Ile Arg Lys Ala Leu
145 150 155 160
Arg Val Leu Ala Glu Leu Pro Asp Asn Gly Asp Phe Ser Asp Leu Ala
165 170 175
Ala Ala Leu Thr Leu Gly Asp Ala Gly Ala Ala Ala Val Met Gly Pro
180 185 190
Lys Leu Asp Pro Glu Thr Gly Ile Lys Gly Phe Val Val Gln Ser Gln
195 200 205
Gly Gln His Asn Gly Leu Cys Val Cys Gly Asp Asn Gly Glu Asp Thr
210 215 220
His Leu Val Thr Lys Ile Thr Glu Ile Val Arg Glu Thr Thr Arg Leu
225 230 235 240
Val Gly Pro Leu Tyr Gln Ala Leu Met His Glu His Leu Gly Trp Gln
245 250 255
Pro Ser Glu Leu Ser Arg Tyr Ile Pro His Gln Val Gly Leu Arg Ser
260 265 270
Val Arg Lys His Ala Glu Val Ala Gln Val Pro Leu Glu Ile Ile Pro
275 280 285
Ile Thr Val Asp Tyr Leu Gly Asn Ile Ile Ser Ala Thr Ile Pro Val
290 295 300
Asn Ile Ser Leu Leu Met Lys Asp Lys Lys Leu Thr Asn Gly Glu Arg
305 310 315 320
Ile Tyr Leu Ser Gly Thr Gly Ser Gly Ile Ser Ile Ala Gln Ala Ala
325 330 335
Met Val Trp Asp Ala Ala
340
<210> 13
<211> 346
<212> PRT
<213> Desulfobulbus propionicus DSM 2032
<400> 13
Met Thr Leu Arg Tyr Thr Gln Val Cys Leu His Asp Phe Gly Tyr Gln
1 5 10 15
Leu Pro Pro Val Glu Leu Ser Ser Ala Ala Ile Glu Glu Arg Leu Gln
20 25 30
Pro Leu Tyr Glu Arg Leu Lys Leu Pro Ala Gly Arg Leu Glu Leu Met
35 40 45
Thr Gly Ile Asn Thr Arg Arg Leu Trp Gln Pro Gly Thr Arg Pro Ser
50 55 60
Ala Gly Ala Ala Ala Ala Gly Ala Asp Ala Met Ala Lys Ala Gly Val
65 70 75 80
Asp Val Ala Asp Leu Gly Cys Leu Leu Phe Thr Ser Val Ser Arg Asp
85 90 95
Met Met Glu Pro Ala Thr Ala Ala Phe Val His Arg Ser Leu Gly Leu
100 105 110
Pro Ser Ser Cys Leu Leu Phe Asp Ile Ser Asn Ala Cys Leu Gly Phe
115 120 125
Leu Asp Gly Met Ile Met Leu Ala Asn Met Leu Glu Leu Gly Gln Val
130 135 140
Lys Ala Gly Leu Val Val Ala Gly Glu Thr Ala Glu Gly Leu Val Glu
145 150 155 160
Ser Thr Leu Ala His Leu Leu Ala Glu Thr Gly Leu Thr Arg Lys Ser
165 170 175
Ile Lys Pro Leu Phe Ala Ser Leu Thr Ile Gly Ser Gly Ala Val Ala
180 185 190
Leu Val Met Thr Arg Arg Asp Tyr Arg Asp Thr Gly His Tyr Leu His
195 200 205
Gly Gly Ala Cys Trp Ala Gln Thr Val His Asn Asp Leu Cys Gln Gly
210 215 220
Gly Gln Asn Ala Glu Gln Gly Thr Leu Met Ser Thr Asp Ser Glu Gln
225 230 235 240
Leu Leu Glu Lys Gly Ile Glu Thr Ala Ala Ala Cys Trp Gln Gln Phe
245 250 255
His Ala Thr Leu Gly Trp Asp Lys Gly Ser Ile Asp Arg Phe Phe Cys
260 265 270
His Gln Val Gly Lys Ala His Ala Gln Leu Leu Phe Glu Thr Leu Glu
275 280 285
Leu Asp Pro Ala Lys Asn Phe Glu Thr Leu Pro Leu Leu Gly Asn Val
290 295 300
Gly Ser Val Ser Ala Pro Ile Thr Met Ala Leu Gly Ile Glu Gln Gly
305 310 315 320
Ala Leu Gly Ala Gly Gln Arg Ala Ala Ile Leu Gly Ile Gly Ser Gly
325 330 335
Ile Asn Ser Leu Met Leu Gly Ile Asp Trp
340 345
<210> 14
<211> 393
<212> PRT
<213> Carboxydothermus hydrogenoformans Z-2901
<400> 14
Met Arg Glu Val Val Ile Val Ser Ala Ala Arg Thr Pro Phe Gly Lys
1 5 10 15
Phe Gly Gly Gly Leu Ser Ala Leu Lys Ala Val Asp Leu Gly Ala Ile
20 25 30
Ala Ile Lys Ala Ala Val Glu Arg Ser Gly Val Ser Pro Glu Glu Phe
35 40 45
Asp Tyr Val Tyr Met Gly Gln Val Leu Gln Gly Gly Ala Gly Gln Ile
50 55 60
Pro Ser Arg Gln Ala Ala Arg Lys Ala Gly Leu Pro Trp Glu Val Pro
65 70 75 80
Ser Val Thr Val Asn Lys Val Cys Ala Ser Gly Leu Ile Ala Val Ala
85 90 95
Met Ala Ala Lys Met Ile Ala Leu Gly Glu Ile Asp Val Ala Val Ala
100 105 110
Gly Gly Met Glu Ser Met Ser Asn Ala Pro Tyr Ile Leu Pro Ser Ala
115 120 125
Arg Trp Gly Gln Arg Met Phe Asn Phe Glu Ala Ile Asp Leu Met Val
130 135 140
His Asp Gly Leu Trp Cys Ala Phe Tyr Asp Arg His Met Ala Val His
145 150 155 160
Gly Ser Glu Val Ala Lys Glu Tyr Gly Ile Ser Arg Gln Ala Gln Asp
165 170 175
Glu Trp Ala Tyr Ile Ser Gln Met Arg Ala Lys Glu Ala Met Glu Lys
180 185 190
Gly Arg Leu Asn Asp Glu Ile Val Lys Val Glu Val Pro Gly Lys Lys
195 200 205
Gly Glu Val Val Val Ile Glu Lys Asp Glu Gln Pro Arg Pro Asn Thr
210 215 220
Thr Ile Glu Ala Leu Ser Lys Leu Pro Pro Val Phe Asp Ala Asn Gly
225 230 235 240
Thr Val Thr Ala Gly Asn Ala Pro Gly Val Asn Asp Gly Ala Gly Ala
245 250 255
Leu Val Leu Met Ser Arg Glu Lys Ala Arg Glu Leu Gly Ile Lys Pro
260 265 270
Leu Ala Thr Tyr Leu Asn His Ala Glu Val Ala Leu Asp Ala Lys Tyr
275 280 285
Ile Ala Thr Ala Pro Gly Gln Ala Ile Asn Lys Leu Leu Ala Lys Lys
290 295 300
Gly Met Lys Ile Glu Gln Ile Asp Leu Leu Glu Val Asn Glu Ala Phe
305 310 315 320
Ala Ala Val Val Leu Val Ser Gln Lys Ile Ala Gly Tyr Asn Leu Glu
325 330 335
Lys Val Asn Val Asn Gly Gly Ala Val Ala Phe Gly His Pro Ile Gly
340 345 350
Ala Ser Gly Ala Arg Ile Leu Met Thr Leu Ile Tyr Glu Leu Arg Arg
355 360 365
Arg Gly Gly Gly Thr Gly Ile Ala Ala Ile Cys Ser Gly Ala Ala Gln
370 375 380
Gly Asp Ala Met Leu Ile Lys Val Glu
385 390
<210> 15
<211> 393
<212> PRT
<213> Carboxydothermus hydrogenoformans Z-2901
<400> 15
Met Gln Glu Val Val Ile Leu Ser Ala Val Arg Thr Ala Ile Gly Lys
1 5 10 15
Phe Gly Gly Ser Leu Lys Asp Ile Pro Ala Ala Glu Leu Gly Ala Ile
20 25 30
Val Ile Lys Glu Ala Leu Val Arg Ala Gln Ile Pro Pro Ala Glu Val
35 40 45
Asp Glu Val Ile Phe Gly Asn Val Leu Gln Ala Gly Gln Gly Gln Asn
50 55 60
Pro Ala Arg Gln Ala Ala Ile Lys Ala Gly Ile Pro Val Asp Ile Pro
65 70 75 80
Ala Met Thr Val Asn Met Val Cys Gly Ser Gly Leu Arg Ser Val Ser
85 90 95
Leu Ala Ala Thr Leu Ile Ala Ala Gly Glu Ala Asp Leu Ile Val Ala
100 105 110
Gly Gly Met Glu Asn Met Ser Ala Ala Pro Tyr Ala Ile Pro Gly Ala
115 120 125
Arg Trp Gly Thr Arg Met Gly Asp Gly Lys Ile Val Asp Leu Met Ile
130 135 140
Lys Asp Gly Leu Trp Asp Ala Phe Tyr Asp Tyr His Met Gly Ile Thr
145 150 155 160
Ala Glu Asn Leu Ala Glu Arg Tyr Asn Ile Ser Arg Glu Glu Gln Asp
165 170 175
Arg Phe Ala Leu Glu Ser Gln Arg Arg Ala Glu Lys Ala Ile Lys Glu
180 185 190
Gly Arg Phe Arg Asp Glu Ile Val Pro Val Lys Leu Pro Gln Arg Lys
195 200 205
Gly Glu Pro Leu Glu Phe Val Gln Asp Glu Asn Pro Arg Phe Asp Thr
210 215 220
Thr Leu Glu Ala Leu Ala Lys Leu Lys Pro Ala Phe Lys Glu Gly Gly
225 230 235 240
Thr Val Thr Ala Gly Asn Ala Ser Ser Ile Asn Asp Gly Ala Ala Ala
245 250 255
Leu Val Ile Ala Ser Ser Lys Lys Ala Glu Ser Leu Gly Ile Lys Pro
260 265 270
Met Ala Val Ile Arg Ser Trp Gly Ala Thr Gly Val Asp Pro Ser Ile
275 280 285
Met Gly Ile Gly Pro Val Gly Ala Thr Arg Lys Ala Leu Lys Arg Ala
290 295 300
Gly Leu Thr Ile Ala Asp Ile Asp Leu Val Glu Ala Asn Glu Ala Phe
305 310 315 320
Ala Ala Gln Ala Leu Ala Val Ala Lys Glu Leu Glu Leu Asp Leu Ser
325 330 335
Lys Thr Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly
340 345 350
Ala Ser Gly Ala Arg Ile Leu Val Thr Leu Leu His Glu Met Lys Lys
355 360 365
Ser Asn Ser Arg Tyr Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Met
370 375 380
Gly Val Ala Ala Ile Val Glu Lys Ala
385 390
<210> 16
<211> 435
<212> PRT
<213> Syntrophothermus lipocalidus DSM 12680
<400> 16
Met Ser Phe Lys Lys Ser Lys Asp Asp Leu Val Cys Val Ser Ala Val
1 5 10 15
Arg Thr Pro Phe Gly Arg Phe Gly Gly Ser Met Arg Asp Ile Asp Ile
20 25 30
Tyr Asp Leu Gly Ala Ile Ala Met Lys Asn Ala Leu Glu Arg Ile Lys
35 40 45
Met Asp Pro Glu Leu Ile Asp Glu Val Trp Trp Gly Cys Gly Asp Thr
50 55 60
Thr Asn Cys Lys Asp Pro Tyr Thr Pro Val Val Ala Arg Gln Ser Met
65 70 75 80
Leu Lys Ala Gly Ile Pro Pro Glu Lys Pro Ser Val Thr Phe Asp Gln
85 90 95
Ala Cys Ile Ser Gly Met Asp Ala Val Lys Tyr Gly Gly Arg Ser Ile
100 105 110
Gln Leu Gly Glu Ala Glu Ile Val Met Thr Gly Gly Ala Thr Ser Phe
115 120 125
Ser Thr Val Pro Phe Leu Leu Arg Gly Ile Arg Trp Glu Gly Lys Arg
130 135 140
His Thr Ser Phe Leu Val Glu Asp Pro Ile Ile Pro Leu Gly Tyr Lys
145 150 155 160
Asp Tyr Ala Pro Val Ala Val Asp Ser Gly Asp Val Ala Val Glu Tyr
165 170 175
Gly Val Ser Arg Glu Glu Gln Asp Glu Phe Ala Val Ala Ser His Val
180 185 190
Lys Tyr Gly Lys Ala Tyr Glu Arg Gly Phe Phe Lys Gln Glu Met Val
195 200 205
Pro Leu Glu Leu Val Lys Lys Asp Lys Lys Gly Asn Val Val Ser Lys
210 215 220
Lys Val Leu Glu Ile Asp Glu Gln Tyr Arg Pro Asp Val Lys Ile Glu
225 230 235 240
Glu Leu Ala Arg Leu Lys Pro Ile Phe Gly Asn Pro Thr Val Thr Ala
245 250 255
Gly Asn Ala Pro Gly Met Asn Asp Gly Ala Cys Ala Gln Ile Phe Met
260 265 270
Lys Arg Glu Lys Ala Glu Gln Leu Gly Leu Asp Val Leu Tyr Thr Val
275 280 285
Val Ala Met Ser Ser Ile Ala Leu Gln Pro Arg Ile Met Pro Val Ser
290 295 300
Pro Ala Phe Ala Ile Lys Lys Cys Leu Asp Val Thr Gly Leu Thr Ile
305 310 315 320
Asp Asp Met Lys Phe Ile Glu Ile Asn Glu Ala Phe Ala Cys Val Pro
325 330 335
Leu Val Ala Thr Lys Leu Leu Ser Asn Gln Arg Phe Leu Thr Ser Asp
340 345 350
Tyr Asn Glu Met Val Lys Glu Ala Ser Thr Lys Pro Ile Leu Asp Asn
355 360 365
Asp Asp Ser Lys Tyr Gln Glu Leu Lys Ser Lys Leu Asn Val Asn Gly
370 375 380
Ser Ala Ile Ala Val Gly His Ala Asn Thr Ala Ser Gly Ser Arg Ile
385 390 395 400
Met Met Thr Ala Ala Tyr Asn Leu Lys Glu Asn Gly Gly Gly Tyr Ala
405 410 415
Ala Cys Ala Ile Cys Gly Gly Leu Thr Gln Gly Ala Gly Cys Ile Ile
420 425 430
Trp Val Glu
435
<210> 17
<211> 417
<212> PRT
<213> Syntrophothermus lipocalidus DSM 12680
<400> 17
Met Lys Asp Val Val Ile Val Ser Ala Cys Arg Thr Ala Ile Gly Thr
1 5 10 15
Phe Gly Gly Ser Leu Lys Asp Leu Asn Ala Pro Thr Leu Ala Lys Val
20 25 30
Ala Met Arg Gly Ala Ile Glu Arg Ala Gly Ile Asp Pro Gly Leu Ile
35 40 45
Asn Asp Val Arg Phe Gly Cys Ala Phe Glu His Pro Asp Ser Asn Asn
50 55 60
Val Ala Arg Val Ala Ala Leu Leu Ala Gly Val Pro Ala Glu Thr Ser
65 70 75 80
Thr Ala Val Thr Ile Asn Arg Val Cys Val Ser Gly Met Glu Ala Val
85 90 95
Val Ser Gly Met Ala Met Ile Gln Ala Gly Leu Val Asp Val Val Leu
100 105 110
Ala Gly Gly Val Glu His Met Ser Gly Val Pro Phe Ser Val Leu Asn
115 120 125
Ala Arg Trp Gly Cys Arg Leu Gln Asp Ser Val Phe Val Asp Asn Leu
130 135 140
Ile His Gly Leu Tyr Gly Gly Ser Lys Phe Leu Pro Gly Pro Glu Asn
145 150 155 160
Gly Pro Val Lys Glu Gly Pro Ile Leu Glu Ala Gly Arg Gly Lys Pro
165 170 175
Tyr Ile Met Gly Tyr Thr Ala Glu Leu Leu Ala Gln Tyr Cys Asn Ile
180 185 190
Ser Arg Glu Ala Met Asp Glu Val Ala Leu Arg Ser His Asn Asn Ala
195 200 205
Glu Arg Ala Thr Lys Asp Gly Ser Phe Arg Glu Glu Ile Val Pro Val
210 215 220
Glu Ile Pro Gln Lys Lys Gly Lys Ala Pro Leu Val Phe Asp Lys Asp
225 230 235 240
Glu His Phe Arg Pro Gly Val Thr Met Glu Gln Leu Ala Ala Leu Pro
245 250 255
Pro Ala Phe Val Pro Lys Ile Gly Lys Val Thr Ala Gly Asn Ala Ser
260 265 270
Gly Met Asn Asp Gly Ala Ala Ala Met Val Ile Met Ser Ala Asp Lys
275 280 285
Ala Arg Glu Leu Gly Met Lys Pro Ile Ala Arg Ile Lys Ala Val Gly
290 295 300
Tyr Gly Gly Cys His Pro Ser Ile Met Gly Leu Ser Pro Val Pro Ala
305 310 315 320
Ile Lys Asn Leu Leu Ser Lys Ser Gly Leu Lys Leu Glu Asp Phe Glu
325 330 335
Leu Ile Glu Ile Asn Glu Ala Phe Ala Ala Gln Tyr Leu Ala Val Glu
340 345 350
Gln Glu Leu Gly Leu Asn Arg Glu Ile Thr Asn Val Asn Gly Ser Gly
355 360 365
Ile Gly Leu Gly His Pro Val Gly Ala Thr Gly Cys Arg Ile Met Val
370 375 380
Thr Leu Leu Tyr Ala Met Lys Lys Arg Gly Lys Thr Leu Gly Leu Ala
385 390 395 400
Ser Leu Cys Gly Gly Gly Gly Val Ser Met Ala Val Ala Leu Glu Met
405 410 415
Val
<210> 18
<211> 393
<212> PRT
<213> Carboxydothermus hydrogenoformans Z-2901
<400> 18
Met Glu Glu Val Val Ile Val Ser Ala Val Arg Thr Pro Ile Gly Ser
1 5 10 15
Phe Leu Gly Ser Leu Ala Gln Thr Pro Ala Val Asp Leu Gly Ala Leu
20 25 30
Val Ile Lys Glu Ser Leu Asn Arg Ile Asn Leu Ala Pro Arg Phe Val
35 40 45
Asp Glu Val Ile Met Gly Asn Val Leu Gln Ala Gly Leu Gly Gln Asn
50 55 60
Pro Ala Arg Gln Ala Ala Ile Lys Ala Gly Ile Pro Gln Glu Val Pro
65 70 75 80
Ala Phe Thr Val Asn Lys Val Cys Gly Ser Gly Leu Lys Ser Val Gly
85 90 95
Leu Ala Tyr Gln Ala Ile Ala Thr Gly Asp Ala Asp Ile Val Val Ala
100 105 110
Gly Gly Met Glu Asn Met Ser Leu Ala Pro Tyr Val Leu Pro Lys Ala
115 120 125
Arg Thr Gly Tyr Arg Met Gly His Asp Thr Leu Ile Asp Ser Met Ile
130 135 140
Lys Asp Gly Leu Trp Cys Ala Phe Thr Asp Val His Met Gly Ile Thr
145 150 155 160
Ala Glu Asn Ile Ala Glu Lys Tyr Asn Ile Thr Arg Glu Glu Gln Asp
165 170 175
Lys Phe Ala Leu Gln Ser Gln Glu Arg Ala Ile Lys Ala Ile Asp Glu
180 185 190
Gly Lys Phe Lys Glu Glu Ile Val Pro Val Ile Ile Pro Gln Lys Lys
195 200 205
Gly Glu Pro Leu Val Phe Ser Thr Asp Glu Phe Pro Lys Arg Gly Thr
210 215 220
Ser Leu Glu Lys Leu Ala Ala Leu Lys Pro Ala Phe Lys Lys Asp Gly
225 230 235 240
Thr Val Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ala Ala Ala
245 250 255
Val Val Val Met Ser Ala Lys Lys Ala Gln Glu Leu Asn Ile Lys Pro
260 265 270
Leu Ala Val Ile Arg Gly Tyr Ala Ala Ala Gly Val Asp Pro Ala Tyr
275 280 285
Met Gly Leu Gly Pro Ile Pro Ala Thr Arg Lys Ala Leu Lys Lys Ala
290 295 300
Asn Leu Thr Val Ser Asp Leu Gly Leu Ile Glu Ala Asn Glu Ala Phe
305 310 315 320
Ala Ala Gln Ala Leu Ala Val Ile Lys Glu Leu Glu Leu Asn Pro Glu
325 330 335
Ile Thr Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly
340 345 350
Ala Ser Gly Ala Arg Ile Leu Val Thr Leu Leu His Glu Met Gln Lys
355 360 365
Arg Asn Thr Lys Tyr Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Gln
370 375 380
Gly Phe Ala Leu Val Val Glu Lys Val
385 390
<210> 19
<211> 219
<212> PRT
<213> Pseudothermotoga lettingae TMO
<400> 19
Met Lys Asn Lys Ala Ile Thr Val Glu Gln Ala Ile Glu Met Ile Pro
1 5 10 15
Asp Gly Ala Val Leu Met Ile Gly Gly Phe Leu Gly Asp Gly Thr Pro
20 25 30
Glu Leu Leu Ile Asp Ala Leu Val Lys Ser Gly Lys Arg Asn Phe Thr
35 40 45
Ile Ile Ala Asn Asp Thr Ala Phe Pro Asp Lys Gly Ile Gly Lys Met
50 55 60
Ile Val Asn Lys Met Ala Lys Lys Val Ile Val Ser His Ile Gly Thr
65 70 75 80
Asn Pro Glu Thr Gln Lys Gln Met Ile Glu Gly Thr Leu Glu Val Glu
85 90 95
Leu Val Pro Gln Gly Thr Leu Ala Glu Lys Val Arg Ala Gly Gly Phe
100 105 110
Gly Leu Gly Gly Ile Leu Thr Pro Thr Gly Val Gly Thr Val Val Glu
115 120 125
Asn Gly Lys Gln Lys Ile Val Ile Asp Asp Lys Glu Tyr Leu Val Glu
130 135 140
Pro Ala Leu Arg Ala Asp Phe Ala Leu Ile Lys Ala Gln Lys Ala Asp
145 150 155 160
Phe Tyr Gly Asn Leu Phe Phe Asn Leu Thr Ser Arg Asn Phe Asn Pro
165 170 175
Leu Met Ala Phe Ala Gly Lys Ile Thr Ile Val Glu Val Glu Glu Phe
180 185 190
Val Pro Val Gly Gly Leu Ser Pro Asn Glu Ile His Thr Pro His Ala
195 200 205
Val Val Asp Tyr Ile Val Arg Gly Asn Ala Arg
210 215
<210> 20
<211> 224
<212> PRT
<213> Pseudothermotoga lettingae TMO
<400> 20
Met Ile Gln Asp Gln Asn Leu Ala Lys Ala Val Ile Ala Lys Arg Val
1 5 10 15
Ala Leu Glu Leu Lys Asp Gly Asp Ile Val Asn Leu Gly Ile Gly Ile
20 25 30
Pro Thr Leu Val Ala Asn Tyr Leu Pro Pro Lys Val Glu Ile Phe Leu
35 40 45
Gln Ser Glu Asn Gly Ile Leu Gly Met Gly Pro Ala Pro Met Ser Gly
50 55 60
Tyr Glu His Pro Asn Leu Thr Asn Ala Gly Gly Ser Pro Ile Thr Phe
65 70 75 80
Leu Pro Gly Ala Cys Ala Phe Asp Ser Ala Val Ser Phe Gly Leu Ile
85 90 95
Arg Gly Gly His Val Asp Ala Thr Val Leu Gly Ala Leu Gln Val Asp
100 105 110
Glu Glu Gly His Leu Ala Asn Trp Met Ile Pro Gly Lys Met Val Pro
115 120 125
Gly Met Gly Gly Ala Met Asp Leu Val Thr Gly Ala Lys Lys Val Ile
130 135 140
Val Ala Met Gln His Val Ala Lys Gly Asn Ala Pro Lys Ile Val Lys
145 150 155 160
Lys Cys Thr Leu Pro Leu Thr Ser Ile Arg Arg Val Asp Leu Ile Val
165 170 175
Thr Asp Met Ala Val Ile Glu Val Thr Gly Asn Gly Leu Ile Leu Lys
180 185 190
Glu Leu Ala Pro Gln Thr Thr Val Asp Glu Val Val Lys Phe Thr Glu
195 200 205
Ala Lys Leu Ile Val Pro Glu Asp Val Pro Val Met Pro Val Ser Leu
210 215 220
<210> 21
<211> 446
<212> PRT
<213> Deferribacter desulfuricans SSM1
<400> 21
Met Ala Glu Ile Leu Lys Ser Ser Ile Glu Ala Ile Lys Asp Val Ile
1 5 10 15
Lys Asp Gly Met Val Val Ala Ala Gly Gly Phe Gly Leu Cys Gly Ile
20 25 30
Pro Glu Asn Leu Ile Asn Ala Ile Lys Glu Leu Lys Val Lys Asp Leu
35 40 45
Thr Phe Val Ser Asn Asn Ala Gly Val Asp Asp Phe Gly Leu Gly Ile
50 55 60
Leu Leu Gln Thr Arg Gln Ile Lys Lys Met Ile Ser Ser Tyr Val Gly
65 70 75 80
Glu Asn Lys Ile Phe Glu Gln Gln Tyr Leu Asn Gly Glu Leu Glu Leu
85 90 95
Glu Leu Val Pro Gln Gly Thr Leu Ala Glu Lys Leu Arg Ala Gly Gly
100 105 110
Ala Gly Ile Pro Ala Phe Tyr Thr Met Thr Gly Tyr Gly Thr Ile Leu
115 120 125
Thr Glu Gly Lys Glu Ile Lys Val Phe Asp Gly Lys Glu Tyr Val Leu
130 135 140
Glu Glu Ser Ile Arg Pro Asp Leu Ala Ile Val Lys Gly Trp Lys Ala
145 150 155 160
Asp Lys Lys Gly Asn Val Ile Phe Arg Tyr Thr Ala Asn Asn Phe Asn
165 170 175
Glu Val Cys Ala Lys Ala Ala Lys Phe Thr Ile Val Glu Val Glu Glu
180 185 190
Ile Val Asp Glu Ile Asp Pro His Tyr Ile His Leu Pro Ser Ile Tyr
195 200 205
Val Asp Arg Ile Val Leu Gly Glu Arg Tyr Glu Lys Arg Ile Glu Gln
210 215 220
Leu Thr Thr Leu Glu Asn Met Thr Glu Ala Lys Met Asn Glu Lys Arg
225 230 235 240
Glu Trp Met Ala Lys Arg Val Ala Lys Glu Leu Lys Lys Gly Met Tyr
245 250 255
Val Asn Leu Gly Ile Gly Met Pro Thr Leu Val Ala Asn Phe Ile Thr
260 265 270
Asp Asp Met Asp Ile Thr Leu His Ser Glu Asn Gly Leu Leu Gly Ile
275 280 285
Gly Pro Phe Pro Lys Thr Glu Lys Asp Ala Asp Pro Asp Leu Ile Asn
290 295 300
Ala Gly Lys Gln Thr Ile Thr Tyr Lys Lys Gly Ala Ala Phe Phe Asp
305 310 315 320
Ser Ser Glu Ser Phe Ala Met Val Arg Gly Gly His Ile Asp Leu Ser
325 330 335
Val Leu Gly Gly Met Gln Val Ser Glu Lys Gly Asp Leu Ala Asn Trp
340 345 350
Met Ile Pro Gly Lys Met Val Lys Gly Pro Gly Gly Ala Met Asp Leu
355 360 365
Val Ser Gly Val Lys Lys Val Ile Val Met Met Glu His Val Ala Lys
370 375 380
Asp Gly Lys Pro Lys Ile Leu Lys Glu Cys Thr Leu Pro Ile Thr Gly
385 390 395 400
Lys Gly Val Val Asp Met Leu Val Thr Asp Lys Gly Val Phe Glu Ile
405 410 415
Asn Ser Glu Gly Leu Tyr Leu Leu Glu Ile Ser Pro Phe Ser Asp Leu
420 425 430
Glu Asp Ile Lys Lys Ser Thr Gly Cys Glu Val Lys Val Lys
435 440 445
<210> 22
<211> 228
<212> PRT
<213> Geobacillus sp. GHH01
<400> 22
Met Lys Gln Ile His Ser Ser Phe Ile Glu Ala Val Lys Asp Ile Pro
1 5 10 15
Asp Gly Ala Thr Ile Met Val Gly Gly Phe Gly Leu Val Gly Ile Pro
20 25 30
Glu Asn Leu Ile Leu Ala Leu Val Glu Thr Gly Val Lys Glu Leu Thr
35 40 45
Val Ile Ser Asn Asn Cys Gly Val Asp Asp Trp Gly Leu Gly Leu Leu
50 55 60
Leu Lys Asn Lys Gln Ile Lys Lys Met Ile Ala Ser Tyr Val Gly Glu
65 70 75 80
Asn Lys Glu Phe Glu Arg Gln Val Leu Asn Gln Glu Ile Glu Val Glu
85 90 95
Leu Ile Pro Gln Gly Thr Leu Ala Glu Arg Ile Arg Ala Gly Gly Ala
100 105 110
Gly Ile Pro Ala Phe Tyr Thr Pro Ala Gly Val Gly Thr Pro Ile Ala
115 120 125
Glu Gly Lys Glu Val Arg Val Phe Asn Gly Lys Glu Tyr Ile Leu Glu
130 135 140
Thr Ala Leu Val Ala Asp Phe Ser Leu Val Arg Ala Trp Lys Gly Asp
145 150 155 160
Lys Met Gly Asn Leu Ile Tyr Asn Lys Thr Ala Arg Asn Phe Asn Pro
165 170 175
Met Met Ala Ala Ala Gly Lys Val Thr Ile Ala Glu Val Glu Glu Leu
180 185 190
Val Glu Ile Gly Glu Leu Asp Pro Asp His Ile His Thr Pro Ser Ile
195 200 205
Tyr Val Gln Arg Leu Val Val Gly Lys Gln Glu Lys Arg Ile Glu Arg
210 215 220
Leu Val Val Arg
225
<210> 23
<211> 222
<212> PRT
<213> Geobacillus sp. GHH01
<400> 23
Met Lys Thr Met Asn Lys Gln Ser Ile Arg Glu Arg Ile Ala Lys Arg
1 5 10 15
Ala Glu Gln Glu Ile Glu Asn Gly Phe Tyr Val Asn Leu Gly Ile Gly
20 25 30
Ile Pro Thr Leu Val Ala Asn Phe Ile Gln Ser His Lys Lys Val Val
35 40 45
Leu Gln Ser Glu Asn Gly Leu Leu Gly Ile Gly Pro Tyr Pro Leu Lys
50 55 60
Asp Glu Val Asp Pro Asp Leu Ile Asn Ala Gly Lys Glu Thr Ile Thr
65 70 75 80
Ala Ile Pro Gly Ala Cys Tyr Phe Ser Ser Ala Glu Ser Phe Ala Met
85 90 95
Ile Arg Gly Gly His Ile Asp Val Ala Ile Leu Gly Gly Met Glu Val
100 105 110
Ser Glu Glu Gly Asp Leu Ala Asn Trp Met Ile Pro Gly Lys Met Ile
115 120 125
Lys Gly Met Gly Gly Ala Met Asp Leu Val His Gly Ala Lys Lys Ile
130 135 140
Ile Val Val Met Glu His Val Ser Lys Asp Gly Lys Pro Lys Ile Val
145 150 155 160
Lys Lys Cys Ser Leu Pro Leu Thr Gly Arg Lys Val Val Asn Arg Ile
165 170 175
Ile Thr Glu Lys Ala Val Ile Asp Val Thr Glu Asn Gly Leu Lys Leu
180 185 190
Val Glu Ile Leu Asp Gly Ser Ser Val Glu Glu Ile Gln Ser Leu Thr
195 200 205
Glu Pro Thr Leu Met Ile Asp Glu Thr Leu Leu Ile Gln Ala
210 215 220
<210> 24
<211> 217
<212> PRT
<213> Thermosipho melanesiensis BI429
<400> 24
Met Lys Val Val Asp Ile Ser Lys Ile Asn Glu Leu Val Lys Glu Gly
1 5 10 15
Ala Thr Leu Met Ile Gly Gly Phe Leu Gly Val Gly Thr Pro Glu Asn
20 25 30
Ile Ile Asp Glu Ile Ile Arg His Asn Ile Ser Asn Leu Thr Val Ile
35 40 45
Ala Asn Asp Thr Ala Phe Glu Asp Arg Gly Ile Gly Lys Leu Val Lys
50 55 60
Asn Lys Leu Cys Lys Lys Val Ile Val Ser His Ile Gly Thr Asn Pro
65 70 75 80
Glu Thr Gln Arg Gln Met Ile Glu Gly Thr Leu Glu Val Glu Leu Val
85 90 95
Pro Gln Gly Thr Leu Ala Glu Arg Ile Arg Ala Ala Gly Val Gly Leu
100 105 110
Gly Gly Ile Leu Thr Pro Thr Gly Val Gly Thr Val Val Glu Lys Asp
115 120 125
Lys Lys Val Ile Glu Val Glu Gly Lys Lys Tyr Leu Leu Glu Leu Pro
130 135 140
Ile His Ala Asp Val Ala Leu Ile Lys Ala Lys Lys Ala Asp Tyr Leu
145 150 155 160
Gly Asn Leu Val Tyr Asn Leu Thr Ala Glu Asn Phe Asn Pro Ile Met
165 170 175
Ala Leu Ala Ala Lys Thr Val Ile Ala Glu Val Glu Glu Ile Val Pro
180 185 190
Thr Gly Thr Leu Ser Pro Asn Glu Ile Lys Thr Pro Gly Ile Ile Val
195 200 205
Asp Tyr Ile Val Thr Gly Val Thr Arg
210 215
<210> 25
<211> 214
<212> PRT
<213> Thermosipho melanesiensis BI429
<400> 25
Met Asn Pro Lys Glu Lys Ile Ala Ile Arg Val Ala Gln Glu Leu Lys
1 5 10 15
Lys Gly Gln Leu Val Asn Leu Gly Ile Gly Leu Pro Thr Leu Val Ala
20 25 30
Asn Tyr Ile Pro Lys Asp Ile His Val Phe Phe Gln Ser Glu Asn Gly
35 40 45
Ile Ile Gly Met Gly Pro Ala Pro Lys Glu Gly Tyr Glu Asn Ser Asp
50 55 60
Leu Thr Asn Ala Gly Ala Ser Tyr Ile Thr Ala Leu Pro Gly Ala Met
65 70 75 80
Thr Phe Asp Ser Ala Phe Ser Phe Gly Ile Ile Arg Gly Gly His Leu
85 90 95
Asp Val Thr Val Leu Gly Gly Leu Gln Val Asp Glu Glu Gly His Leu
100 105 110
Ala Asn Trp Met Ile Pro Gly Lys Met Ile Pro Gly Met Gly Gly Ala
115 120 125
Met Asp Leu Val Thr Gly Ala Lys Lys Val Ile Val Ala Met Thr His
130 135 140
Thr Ala Lys Gly Thr Pro Lys Ile Val Lys Lys Cys Thr Leu Pro Leu
145 150 155 160
Thr Ser Ile Arg Lys Val Asp Leu Ile Val Thr Glu Leu Ala Val Ile
165 170 175
Glu Pro Thr Asp Glu Gly Leu Leu Leu Lys Glu Ile Ser Lys Glu Thr
180 185 190
Thr Leu Asp Glu Val Leu Lys Leu Thr Glu Ala Lys Leu Ile Ile Ala
195 200 205
Asp Asp Leu Lys Ile Phe
210
<210> 26
<211> 517
<212> PRT
<213> Pelotomaculum thermopropionicum SI
<400> 26
Met Ala Pro Arg Phe Leu Thr Ala Glu Glu Ala Val Asn Leu Ile Lys
1 5 10 15
Asp Gly Asp Thr Val Ala Ser Val Gly Phe Leu Gly Asn Val Phe Pro
20 25 30
Glu Glu Leu Ala Val Ala Leu Glu Glu Arg Phe Leu Lys Thr Ala Lys
35 40 45
Pro Glu Arg Leu Thr Leu Ile Tyr Ala Ala Ala Gln Gly Asp Gly Lys
50 55 60
Glu Arg Gly Leu Asn His Leu Ala Tyr Glu Gly Leu Val Lys Arg Val
65 70 75 80
Ile Gly Gly His Trp Asn Leu Gln Pro Lys Met Ala Lys Leu Ala Ile
85 90 95
Glu Asn Lys Ile Glu Ala Tyr Asn Leu Pro Gln Gly Thr Ile Ser Gln
100 105 110
Leu Phe Arg Glu Ile Ala Ala Lys Arg Pro Gly Val Ile Thr His Val
115 120 125
Gly Leu Lys Thr Phe Val Asp Pro Arg Ile Glu Gly Gly Lys Leu Asn
130 135 140
Ala Val Thr Lys Glu Asp Ile Val Glu Val Ile Thr Ile Asp Gly Lys
145 150 155 160
Glu Lys Leu Phe Tyr Arg Ser Ile Pro Leu Asn Val Gly Leu Ile Arg
165 170 175
Gly Thr Ser Ala Asp Gln Leu Gly Asn Ile Ser Leu Glu Lys Glu Ala
180 185 190
Asn Thr Leu Glu Val Leu Ser Ile Ala Gln Ala Val Arg Asn Cys Gly
195 200 205
Gly Ile Val Ile Ala Gln Val Glu Arg Val Val Ala Ala Gly Ser Leu
210 215 220
Asp Pro Arg Leu Val Lys Val Pro Gly Ile Leu Val Asp Val Val Val
225 230 235 240
Val Ser Arg Pro Glu Asn His His Gln Thr Phe Ala Glu Val Tyr Asn
245 250 255
Pro Ala Tyr Ser Gly Glu Val Val Ile Pro Leu Thr Glu Leu Pro Pro
260 265 270
Ala Lys Leu Asp Glu Arg Lys Val Ile Ser Arg Arg Ala Ala Phe Glu
275 280 285
Leu Arg Pro Gly Ser Val Val Asn Leu Gly Ile Gly Ile Pro Glu Gly
290 295 300
Ile Ala Ser Val Ala Ala Glu Glu Gly Ile Ser Asp Phe Met Thr Leu
305 310 315 320
Thr Val Glu Ala Gly Pro Val Gly Gly Val Pro Ala Gly Gly Leu Ser
325 330 335
Phe Gly Ala Ser Thr Asn Pro Tyr Cys Val Leu Asp Gln Ala Tyr Gln
340 345 350
Phe Asp Phe Tyr Asp Gly Gly Gly Val Asp Ile Ala Phe Leu Gly Leu
355 360 365
Ala Gln Met Asp Ser Asn Gly Asn Ile Asn Val Ser Lys Phe Gly Pro
370 375 380
Arg Ile Ala Gly Cys Gly Gly Phe Ile Asn Ile Thr Gln Asn Ala Lys
385 390 395 400
Lys Val Val Phe Cys Gly Thr Phe Lys Ala Gly Gly Leu Lys Val Asn
405 410 415
Val Gly Asp Gly Lys Leu Thr Ile Val Asn Glu Gly Lys Ser Val Lys
420 425 430
Leu Val Pro Lys Val Glu Gln Ile Thr Phe Ser Gly Glu Tyr Ala Arg
435 440 445
Gln Gln Gly Gln Lys Val Leu Tyr Ile Thr Glu Arg Ala Val Phe Glu
450 455 460
Met Thr Ala Glu Gly Val Met Leu Thr Glu Ile Ala Pro Gly Val Asp
465 470 475 480
Leu Glu Arg Asp Val Leu Gln Gln Met Asp Phe Lys Pro Leu Ile Ser
485 490 495
Pro Ser Leu Lys Thr Met Asp Lys Arg Ile Phe Ile Asp Ala Pro Met
500 505 510
Gly Ile Lys Asn Ser
515
<210> 27
<211> 288
<212> PRT
<213> Rhodothermus marinus DSM 4252
<400> 27
Met Ser Glu Pro Val Asp His Leu Leu His Leu Leu Asn Leu Glu Arg
1 5 10 15
Ile Glu Glu Asn Ile Phe Arg Gly Pro Ser Arg Asp Ile Gly Ser Pro
20 25 30
Thr Val Phe Gly Gly Gln Val Leu Gly Gln Ala Leu Arg Ala Ala Ala
35 40 45
Tyr Thr Val Pro Pro Glu Arg Arg Ala His Ser Leu His Ala Tyr Phe
50 55 60
Ile Leu Pro Gly Asp Pro Asn Ala Pro Ile Val Tyr Leu Val Glu Arg
65 70 75 80
Leu Arg Asp Gly Arg Ser Phe Thr Thr Arg Arg Val Thr Ala Ile Gln
85 90 95
His Gly Arg Pro Ile Phe Asn Leu Ser Ala Ser Phe Gln Ile Glu Glu
100 105 110
Pro Gly Val Glu His Gln Asp Pro Met Pro Glu Val Pro Pro Pro Glu
115 120 125
Glu Leu Ile Ser Glu Ala Glu Leu Arg Arg Gln Leu Ala Glu Gln Val
130 135 140
Pro Glu Val Leu Arg Pro Phe Leu Leu His Glu Arg Pro Ile Glu Ile
145 150 155 160
Arg Pro Val Glu Pro Val Asn Leu Leu Phe Pro Glu Lys Arg Pro Pro
165 170 175
Arg Arg His Ala Trp Ile Arg Ala Ala Gly Thr Leu Pro Asp Asp Asp
180 185 190
Leu Ala Leu His Gln Ser Val Leu Ala Tyr Ala Ser Asp Phe Gly Phe
195 200 205
Met Gly Thr Ala Met Leu Pro His Gly Leu Ser Phe Leu Gln Pro His
210 215 220
Val Gln Ala Ala Ser Leu Asp His Ala Met Trp Phe Tyr Arg Pro Phe
225 230 235 240
Arg Ala Asp Glu Trp Leu Leu Phe Ala Met Glu Ser Pro Val Ala Ala
245 250 255
His Ala Arg Gly Leu Asn Arg Gly Leu Phe Phe Arg Arg Asp Gly Thr
260 265 270
Leu Val Ala Ala Val Val Gln Glu Gly Leu Met Arg Ile Arg Ser Asp
275 280 285
<210> 28
<211> 244
<212> PRT
<213> Clostridium acetobutylicum ATCC 824
<400> 28
Met Leu Lys Asp Glu Val Ile Lys Gln Ile Ser Thr Pro Leu Thr Ser
1 5 10 15
Pro Ala Phe Pro Arg Gly Pro Tyr Lys Phe His Asn Arg Glu Tyr Phe
20 25 30
Asn Ile Val Tyr Arg Thr Asp Met Asp Ala Leu Arg Lys Val Val Pro
35 40 45
Glu Pro Leu Glu Ile Asp Glu Pro Leu Val Arg Phe Glu Ile Met Ala
50 55 60
Met His Asp Thr Ser Gly Leu Gly Cys Tyr Thr Glu Ser Gly Gln Ala
65 70 75 80
Ile Pro Val Ser Phe Asn Gly Val Lys Gly Asp Tyr Leu His Met Met
85 90 95
Tyr Leu Asp Asn Glu Pro Ala Ile Ala Val Gly Arg Glu Leu Ser Ala
100 105 110
Tyr Pro Lys Lys Leu Gly Tyr Pro Lys Leu Phe Val Asp Ser Asp Thr
115 120 125
Leu Val Gly Thr Leu Asp Tyr Gly Lys Leu Arg Val Ala Thr Ala Thr
130 135 140
Met Gly Tyr Lys His Lys Ala Leu Asp Ala Asn Glu Ala Lys Asp Gln
145 150 155 160
Ile Cys Arg Pro Asn Tyr Met Leu Lys Ile Ile Pro Asn Tyr Asp Gly
165 170 175
Ser Pro Arg Ile Cys Glu Leu Ile Asn Ala Lys Ile Thr Asp Val Thr
180 185 190
Val His Glu Ala Trp Thr Gly Pro Thr Arg Leu Gln Leu Phe Asp His
195 200 205
Ala Met Ala Pro Leu Asn Asp Leu Pro Val Lys Glu Ile Val Ser Ser
210 215 220
Ser His Ile Leu Ala Asp Ile Ile Leu Pro Arg Ala Glu Val Ile Tyr
225 230 235 240
Asp Tyr Leu Lys
<210> 29
<211> 353
<212> PRT
<213> Thermoanaerobacter brockii Ako-1
<400> 29
Met Met Lys Gly Phe Ala Met Leu Ser Ile Gly Lys Val Gly Trp Ile
1 5 10 15
Glu Lys Glu Lys Pro Ala Pro Gly Pro Phe Asp Ala Ile Val Arg Pro
20 25 30
Leu Ala Val Ala Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly
35 40 45
Ala Ile Gly Glu Arg His Asn Met Ile Leu Gly His Glu Ala Val Gly
50 55 60
Glu Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp
65 70 75 80
Arg Val Val Val Pro Ala Ile Thr Pro Asp Trp Arg Thr Ser Glu Val
85 90 95
Gln Arg Gly Tyr His Gln His Ser Gly Gly Met Leu Ala Gly Trp Lys
100 105 110
Phe Ser Asn Val Lys Asp Gly Val Phe Gly Glu Phe Phe His Val Asn
115 120 125
Asp Ala Asp Met Asn Leu Ala His Leu Pro Lys Glu Ile Pro Leu Glu
130 135 140
Ala Ala Val Met Ile Pro Asp Met Met Thr Thr Gly Phe His Gly Ala
145 150 155 160
Glu Leu Ala Asp Ile Glu Leu Gly Ala Thr Val Ala Val Leu Gly Ile
165 170 175
Gly Pro Val Gly Leu Met Ala Val Ala Gly Ala Lys Leu Arg Gly Ala
180 185 190
Gly Arg Ile Ile Ala Val Gly Ser Arg Pro Val Cys Val Asp Ala Ala
195 200 205
Lys Tyr Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asp Gly Pro Ile
210 215 220
Glu Ser Gln Ile Met Asn Leu Thr Glu Gly Lys Gly Val Asp Ala Ala
225 230 235 240
Ile Ile Ala Gly Gly Asn Ala Asp Ile Met Ala Thr Ala Val Lys Ile
245 250 255
Val Lys Pro Gly Gly Thr Ile Ala Asn Val Asn Tyr Phe Gly Glu Gly
260 265 270
Glu Val Leu Pro Val Pro Arg Leu Glu Trp Gly Cys Gly Met Ala His
275 280 285
Lys Thr Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu
290 295 300
Arg Leu Ile Asp Leu Val Phe Tyr Lys Arg Val Asp Pro Ser Lys Leu
305 310 315 320
Val Thr His Val Phe Arg Gly Phe Asp Asn Ile Glu Lys Ala Phe Met
325 330 335
Leu Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Pro Val Val Ile Leu
340 345 350
Ala
<210> 30
<211> 1176
<212> DNA
<213> Geobacillus sp. GHH01
<400> 30
atgagagaag tggtcattac agcggccgtg cgtacgccga ttggaacatt tggcggcgtc 60
tttaaagacc tgttgccgac ggatttaatt gttcctgttt tagaggaagc agtaaaacgc 120
agccaaattg agaaagacga agtaaacgaa gtgattttag gtcattgcat tcagagaacg 180
gatataccca atacagcaag aacggctgcc ttgctagcag gattccctca tacaacaacc 240
ggttttacga ttcagcgcca gtgcgcttct ggaatgcaag cagttatttc ggcggctatg 300
caaattcaag tcgggctgag cgatgtggtc attgccggcg gtgttgaatc catgagttct 360
agtccgtata tattaaagca gcatcgttgg ggagcgcgtt tacagcacca gcaagtccgt 420
gatagcgttt gggaagttct tgaggatccg attcaccatg tgatgatggg agaaacagcc 480
gagaatcttg cggaacggta tgggatcaca agggaggagc aggatgaact ggcgttgtta 540
agccatcggc gagctatttt ggcgatggaa tcgggatact ttgattctca aattgttccg 600
atcacggtaa aaacacgaaa ggaggagata gtcgtaacaa aagatgagca tccacgagcc 660
gatgtgacga aagaaaaatt ggcttcctta agacctgtat tccgaaaaaa tgggacggta 720
acagcaggga atgcatcggg aattaatgac ggtgctgctg cgcttgtgct catgtctgcc 780
gagtatgcac agcaacgagg gatcgagccg cttgcaaaag tagttggtta ttctgttgcc 840
ggagtggatc ctctagtgat gggacgcggc ccggttccag cggtacaaaa aggattagaa 900
agggtaaatt ggacgttagc ggaagccgat ttaattgaaa tcaatgaagc atttgctgct 960
cagtatttag ctgtagaaag ggaactgcgt ttagatagag ataaagtgaa cgtaaacgga 1020
agcggcatca gcttgggaca tccgattgga tgcacaggag cgcgtattgt cgtcagtctt 1080
attcatgagt tgcagcgccg tcagcttgaa aaaggaattg cctctttatg cgtgggcggt 1140
ggaatgggaa cggcggtgtt tattgaggct ttgtaa 1176
<210> 31
<211> 1203
<212> DNA
<213> Syntrophothermus lipocalidus DSM 12680
<400> 31
atgattaacg aagtcgtaat ggtcagtgca tgccgcacgg ccataggaga ttttatggga 60
agcctgaaag atttgaaagc caatgacctg tcagcaataa ccgcgaccga agcactgaaa 120
agagccggaa tccagccgga aatggttgat agtcttgttt taggcatgtg cctccaccac 180
ggtaacgatt ccgggccagc gcgccaggta gctatggcga ttggcatgag acatagcagc 240
tgggcctgca tggtcaatca gaattgcgcc tccgccatgc gcgcccttga aatcgcagcc 300
aacgagctca tgctgggcaa gagcgagata agcctggttg tgggaacgga aagcatgacc 360
aacgtgccgt acattctgcg taaagccaga ttcggctatc gcttgtttga cggtgacaag 420
gccgaggacg ctatgatctg cgatggcctg tttgacaaaa tggtacccgg acacatggcg 480
atcacggccg aaaatgttgc cgaaaagtac ggaataacta gggaagaatg cgatgagcta 540
gcgctgttga gccatacccg tgcccttaag gccaacgccg agggtatctt cgcccgggag 600
attgtgccgg tggagatcaa gaccaagaaa ggagtcaaag tagttgacaa agacgaacat 660
cctatggata caagtctaga aaaattagcc cagctacctc cggtcttcaa gaaaggcggc 720
gtagtcacag ccggtaatgc ctccggtatc aacgacggtt ctgcggcggc ggtcctcatg 780
accaagaaga aggccgaaga actcggcatc aaacccttaa tgaagcttct atatgtatgc 840
agtgaaggag ttgaccccaa gtttatgggc ttaggaccgg cagtagctat tcctaaggtt 900
ctgaataaag cggggttgaa gttcgaggat gtggaatact gggaaatcaa cgaagctttt 960
gccgctcagt ggctgggagt cggccggatg cttaaggagg acttcggaat cgagctcgac 1020
ctcgacaagg tcaaccataa cggctccggc atcggtctcg gccatcccgt gggctgtacc 1080
ggccttcgta ttcaagtatc catgtactac gaaatggaaa ggctcggttt gaccatcggc 1140
ggagcttcac tctgcgtggg tggtggaccg gcaatggctg ccctctggac ccgggacata 1200
taa 1203
<210> 32
<211> 1191
<212> DNA
<213> Chloroflexus aurantiacus J-10-fl
<220>
<221> misc_feature
<222> (1)..(1191)
<223> Codon optimized
<400> 32
atgtccgaaa aacgcgaagt ggtggtcctc tcaggagtcc gtacggccat cggcacgttt 60
ggcgggagcc ttaaggatat tcctccgacg gaattggccg cgttggtgac acgcgaagcc 120
gtggcgcggt cgggtctgca accgaatgaa attggccatg tagtctttgg ccacgtaatt 180
aacacggagc cgcatgatat gtacctggct cgctacgctg cagtccgggg cggacttagc 240
gtggagacgc cagccctgac gctcaaccgt ttatgtggat cgggcctcca agcgatcgtc 300
tcggcggccc agtatatcct tcaaggagat gctgaagcgg ccgtcgcggg tggagcggag 360
tgcatgtcgc gcggaccgta tagcttaccg gccatgcgct tcggagcgcg catgaatgac 420
agcaaagtcg ttgacatgat ggtcggtgcc ttgacggatc cattcgatga ttgtcacatg 480
ggcgtcacgg ccgagaatgt ggccgcgaaa tggggcattt cacgtgagga tcaggaccaa 540
ttagcctatg aaagccatat gcgcgcggct cgcgcgatcg acgaagggcg ctttgcgaat 600
caaatcgtgc ctgtcgaaat caaagtcaaa gggggcacgg cccaattcat ggtcgatgag 660
ggagtgcgcc gggataccac gatcgataaa cttgccaaat tacggccagt atttctcaag 720
gatgggtcgg tcaccgccgg aaacgcctcg tcgattaacg atgcggcagc cgcggtcgta 780
ctgatggatc gtgccaccgc tgaacgccgc ggttataaac ctctggcccg cctggtcggg 840
tattcccatg ctgcggtgga accgaagtat atgggcattg gcccggttcc ggctgttcgg 900
cggttgttgg aacggacggg gttacgcatc tcagatatcg acttatttga ggtgaatgaa 960
gcctttgctg cccaggctct cgcggtgatt cgtgatctcg aactgccacc ggatcgcacg 1020
aatccaaacg ggtcgggcat ttctttgggc cacccaattg gcgccacggg atgcatcctc 1080
acggttaagg caatccacga gttacaccgg accggcggtc gttatgctct ggtcacgatg 1140
tgcatcggag gtggccaggg tatcgctgcg atttttgaac gcatgtaatg a 1191
<210> 33
<211> 1179
<212> DNA
<213> Syntrophothermus lipocalidus DSM 12680
<400> 33
atgcgcgatg tcgtaatagt aagtgggaag aggaccgcaa tcggcaattt tttaggagcg 60
cttaaagatt tttctgcagt tgatctagga acaattgcgc tcaaagctgc cattaatagt 120
gctggcatta gcccggatac cgtagaagaa gtcgccgccg ggcatgtata ccaggccggt 180
tgtaagggaa atccagcaag gcaaatcacc atcggcgctg gctgcccggt ggaaacggtt 240
tcggttactg tcaatcagca atgcccttcg gccatgcggg ccctcgaaat tattagccag 300
gaaatcatgc taggcaagat cgacgcgggt gctgccgtag gcatagaaag catgagtaat 360
gtcccttatc tcttgctcaa agctcgcact ggttaccgca tgggaaatgg agagcttgtt 420
gacggcatgc tctacgatgc gttgatagac gcgttcggaa acggtcatca aggaattacc 480
gctgagaacc tggcagaaat gtacaatatc agccgagaag aacaggacga gtgggccttc 540
ataagccacc agcgtgcctg ccaggctatc aaggagggca agtttaaaga cgaaatagtg 600
cccgtcgaag taaagactaa gaaagagact ttcctgtttg acaccgacga acatcctaat 660
ccggacacca cgctagaaag cttggctaaa ctcaagcctg cttttaagaa ggacggaacc 720
gtcactgccg gaaacgcgtc gtcaattaat gatgcggcgt gtgcagccgt ggttatggct 780
cacgacaagg cagtggaatt aggaatcaaa ccgctcgccc gcatagtggc cacggcttcg 840
gcggctgttg aaccgcgcat tatgggcatc ggcgtggtcc cggcggtcaa acgggccctc 900
aaatttgcag gaatgagctt agatgatgtt cagctttggg agatcaatga ggcgtttgct 960
gcccagttcc ttgcgtgcaa ccgtgaattg aagctcgata cggaaaagat taacgtaaac 1020
gggtctggga tctccctggg gcatcccgta gggtgtaccg gacttcgttt agtaattacc 1080
cttataaacg agatgaaacg ccggaacctg agatacgggt gtgcagccct ctgcgcgggt 1140
ggaggccctg ccatggcaac cattatcgaa gttctctag 1179
<210> 34
<211> 1182
<212> DNA
<213> Thermobifida fusca YX
<400> 34
atgagctccc ccgaacggat cattgtggtt gacggtgcgc ggacgccagt cggcagtttt 60
ggcggcgcgt tcaaggatgt gcccgcccac gaactcggtg cggtggcggc ccgggcagcg 120
ctccagcggt ccgggatcgc ggcgtccgac atcgacgagg tggtcatggg ctgcattggc 180
caggttggcc cggacgctta caacgcgcgg cgggtcgcta tcgccgctgg gcttccggag 240
agtgtccccg cctataccgt caaccggttg tgcggtagcg gtctgcaggc ggtgtggtct 300
ggggcgatgc agatccgctg gggtgcggcc gacattgtcc tggccggcgg tgacgagaac 360
atgagccgga tgccgttcta cgatttcggg gcgcgttccg gttatcggct gggggaccgc 420
acgctggtgg acggcacggt ggcgatgctg acggacccgt tctccaacgt gcacatgggg 480
tgcacggctg aggcggtggc ccgaaagtac ggggtgagcc gtgctgagca ggatgagttt 540
gcgttggagt cgcagcgtcg cgcggctgct gatgcggcgc gtgccgcgtt cgctgaggag 600
atcaccccgg tggaggtggg gggccgtaag ccggtggtgg ttgaggtgga tgagcatcct 660
cggccggaca ccacgttgga ggggttggcg cggctccgtc cggtttttga gaaggacggt 720
acggtgacgg cggggaacgc gtcggggatc aatgatggtg cggccgcgtt ggtcctggct 780
cgtgagtcgg tggtgcgtga gcggggcctg aagggtctgg ctgtggtgga gtcggtggcg 840
accgcggcga tggatccgca gctgatgggg tatgcgccgg tgcttgcgtt gcgcaagctg 900
tttgagcaga cggggacgag cccggctgtg gttgatgtgg tggagttgaa tgaggcgttt 960
gcggcgcagg cggttgcggt gatccgggac gctggtctgg atccggagaa gaccaacccc 1020
tatggtgggg cgattgcgtt gggtcatccg gtgggggcga ccggggcgat tcttacgttg 1080
cgggtggccc gggatttggt acggcgtgat cttgagcttg gtgtcgtcac gatgtgcatt 1140
ggtggcggac aggctttggc cgctttgttg cgtcgggtgt ga 1182
<210> 35
<211> 1224
<212> DNA
<213> Thermobifida fusca YX
<400> 35
atgcctgaag ccgtcatcgt cgctacggca cgctctccca tcggacgggc tttcaagggg 60
tccctcaagg acatccgccc ggacgacctg accgcgcaga tcatctccgc ggcgctcgcc 120
aaggtcccgc aactggaccc cgccaccatc gacgacctcc tcctgggctg cgggctcccc 180
ggtggcgaac agggcttcaa catggcccgc gtcgtcgcgg tgcagctcgg tctggactcg 240
gtcccgggca ccaccatcac ccgctactgc tcttcatccc tgcagaccac ccggatggcg 300
ttccacgcca tcaaggccgg ggaaggcgac gtcttcatct ccgccggcgt ggaaatggtc 360
agccgcttca ccaagggcaa cagtgacacg ctgcccgaca cgaagaaccc gctgttcgcc 420
gaggctgagg cacgcaccgc ccgccgcgcc gagggcggtg cagaaccctg gcgcgacccg 480
cgggaagagg gcaagctccc cgacatctac atcgctatgg ggcagaccgc ggagaacgtg 540
gcgcagctgc gcggcgtctc ccgccagcgc caggacgaat tcgcggtgcg ttcgcagaac 600
ctggcggaaa aggcgctcga caacggtttc tgggagcggg agatcacccc ggtgaccctg 660
cctgacggca cggtggtctc caccgacgac ggtccgcggc gcggcaccac ctacgagaaa 720
gtcgccgccc tggacccggt gttccgcccc gacggcacgg tgaccgcggg gaactgctgc 780
ccgctcaacg acggcgcggc cgcactgatc atcatgagcg accggaaggc cgctgaactg 840
ggcatcaccc cgctggcccg gatcgtgtcg accggggtga gcgcgctgtc acccgagatc 900
atggggctgg gaccggtcga ggcctcccgg caggcgctgg cccgcgccaa catgtcgatc 960
cgcgacatcg acctcgtgga gatcaacgag gcgttcgccg cgcaggtgct tccgtccgcg 1020
gacgacctgg gcatcgacat cgactcccag ctcaacgtca acggcggcgc catcgctatc 1080
ggccacccgt tcggtatgac gggggcgcgg atcaccacca cgctgatcaa cgccctccag 1140
ttccacgaca agaccttcgg cttggagacc atgtgcgtgg gcggcgggca gggaatggcc 1200
gccatcttcg aacggctgag ctga 1224
<210> 36
<211> 1185
<212> DNA
<213> Syntrophothermus lipocalidus DSM 12680
<400> 36
atgagcagag aggtcgtttt ggtaggggca tgtcgcactc cgatagggac ttttggaggg 60
actcttaagg acatgacggc ggtacagttg ggtacaatcg tcatgaagga agcattgaag 120
agagccggga tctcagggga ccaggtagat gaggtaatat tcggatgtgt gttgcaggca 180
ggacagggac agaacgttgc ccgccagtgt gctattcatg cggggatacc ggaaacggtc 240
acgtcgttca ccattaacaa ggtatgcggt tctggtctaa gagcagtcag ccttgcggcg 300
cagatcatca aagcagggga cgctgacatt gtattggctg gcggcaccga gagcatgacc 360
aacgctcctt atctggttcc taaagcccgt tacggatatc gcatgggcga cggcaagctg 420
gtggacgaga tggtgttcgg cgggttgacc gacatcttca acgggtatca catggggatt 480
accgcggaga atgtaaacga aatgtatggg ataaccaggg aggaacagga cgaatttgga 540
ttaaggagcc aggagagggc tttcgcagct atagaatctg gcagatttaa ggacgagatc 600
gtgccggtag tcatcaagac caagaagggc gaggtagttt tcgatacaga tgaacatccc 660
cgccgtacta cgatggaggc cttggccaag ctaaaaccgg cgtttaagaa agacggcagt 720
gtaactgcag gtaacgcttc ggggataaac gacggggcag cggcggtagt agtcatgtcg 780
aaagaaaagg cggacgaact gggaatcaag ccgatggcca gagtagtaag ctatgcctcg 840
ggcggggtgg atcccaagat tatgggtgta ggacctgtac ctgctactaa gaaagcatta 900
gccaaagccg gattaacctt agacgatata gatctgattg aagccaacga agcatttgct 960
gcccaatcca tagcggtggc acgcgatatg ggctgggaca agatgatgga caaggttaac 1020
gtaaacggag gggcaatagc cctgggtcat ccgatcggag cttctggttg tcgtatactc 1080
gtaactctgc tctatgaaat gcaaaagagg aacgcgaagc ggggactggc aaccttgtgt 1140
atcggtggtg gacaagggac cacgcttatt gtcgagagtt tatag 1185
<210> 37
<211> 1188
<212> DNA
<213> Thermobifida fusca YX
<400> 37
atgcctggat cggtcattgt cggcggcgca cgcaccccca tcggaaagct cctcggagca 60
ctctccggtt tcgccgccgt ggacctgggg gcgatagcga tcaaagctgc cctgcagcgg 120
gcagggatct ccggtgacca ggtggactac gtcatcatgg gccaggtcct gcaggccggc 180
caggggcaga tcccctcacg gcaggcctcc gtcaaagccg gaatccccat gagcgtgcct 240
tcgctcacca tcaacaaagt ctgcctctcc ggcctcgacg cgatcgcctt ggctgatcag 300
ctcattaccg ccggggaatt cgacgtggtc gtggccggcg gaatggaatc catgacgaat 360
gccccgcacg tcctccccaa agcccgccac ggctacaagt acggctccat cgaggtcctc 420
gacgccaccg cccacgacgc cctgaccgac gctttcgacc acgtgtccat gggcctgtcc 480
acggagcggt acaacgcccg ccacggcatg acccgggaag agcaggacgc gttcgcggcg 540
cgctcccacc agcgggccgc cgccgctatc gaagcgggac tgttcaaaga cgagatcgtc 600
cccgtcgaag taccgcggcg gaaaggcgac cccacgatcg tcgacaccga cgaaggggtg 660
cgccccgaca ccactgtgga agccctggcc cggctgcgcc ccgcgttcga cccggacggc 720
acgatcaccg ctggatcctc ctcccagatc tccgatggcg cgtgcgcggt cgtggtgatg 780
agccggacga aagcagaaga gctgggatgc gagatcctcg cggagatcca ggcgcacggc 840
aacgtcgcag gcccggacaa ctccctgcac tgccagccgg cgaacgcgat caagcacgcg 900
ctggccaaag cgggacggga tgtcgctgac ctcgacctcg tcgagatcaa cgaggcgttc 960
gccagcgtgg ccatccagtc catgcgcgag ctgggcgtca gcgaggacat cgtcaacgtc 1020
aacggcggag cgatcgcgct aggccacccg gtcggcatgt cgggggcacg gatcgtgctg 1080
cacctcgtcc acgagctgcg ccgccgcggc ggtggactgg gtgccgcagg cctgtgcggc 1140
ggcggcgggc agggcgacgc cctcctgctg tcggtgcccg cctcctga 1188
<210> 38
<211> 1182
<212> DNA
<213> Syntrophothermus lipocalidus DSM 12680
<400> 38
atggctgcgg gaatcaagga taaggctgca gttataggga tgggatgcac caagtttggc 60
gaaagattcg actgtaacct ggaggacttg atgttggagg caatagaaga agccctggcc 120
gattcgggac ttgagttcaa cgatatcgac gctttctggt ttggaacttt tacctcggga 180
atggccgggt tagccttttc caatcgcatg aaatcacagt acaagccagt gacgcgcatc 240
gaaaacatgt gctgtacggg gctggacgct tttcgcaacg cgtgttacgc ggtggtgtcc 300
ggagcctacg acgtagtcat ggccattggt gcggaaaaac tgaaagacgg cggttacagc 360
ggtctggaag ttcctgccga ggactcagac cgtaccatgc ctgacctcac tgcgccggca 420
cgttttgcgg tgattgcgcc cgcctacgct cacaaatatg gcctgtccat gcagcagatg 480
aaggaagtta tggcccgtat tgcctggaag aatcacaaga acggatcctt aaacccgaag 540
gcgcaattcc aagcagaagt tccgattgag aatatactta agtcccccat gatctgtagt 600
ccgttgggga taatggactg ttctggggta tcggatgggg ctgcttgcgc catcatagtc 660
cgcagtgagg atgctaaaaa gtatcgcccg gatccgatgt acgtcaaagg tattcagatt 720
gcagccgggc cggggcacag cgagaagcac cagagttacg acttcaccac tgcttgggaa 780
acgtactacg ccggacaggc agcgtaccgc gaggccggta taaccaaccc tcgggagcag 840
attgacttgg ctgaggttca cgactgcttc actcccacgg agttgattat ctatgaggat 900
ctccagttca gcgctagagg acaggggtgg agagacgcgt tggatggctt ttttgactta 960
gatggcaagt tgccggtcaa cccggacggt ggcttgaaat cattcggtca tcccatagga 1020
gctagcggga ttcgcatgtt gtacgagtcg tggctgcagt ttcacggtaa ggccggaaag 1080
cgccaactgg aaaacccgaa gataggactg gctcataacc tgggagggca gccttaccag 1140
tgcgtggtgg gagtggctgt ggtcggcaag gaactgggat ag 1182
<210> 39
<211> 1182
<212> DNA
<213> Chloroflexus aurantiacus J-10-fl
<400> 39
atggacgatg tcgtcattgt tggtgcagcg cgtaccccaa tcgggcgctt caacagcgcc 60
tacagcggat tgagtgccat cgatttaggt gcagccgccg tgcaagcggc tgtccaacgg 120
gccggaattg aggcagactc catcgatgaa tgcattatgg gctgcgtagt caccgccggc 180
ctgggacaat caccggcacg ccaggcagct ctgcgtgcgg gcttgccgca tacaattggc 240
ggcctgacca tcaacaaggt ctgtggcagt ggcctcaagg cagtcatgat cggcaccgcc 300
ctgatcaaag ccggcgaagc tgatgtcatt gtcgccggcg gtatggagca catgagcggt 360
gcgccatacc tgcttcccca ggcccgccac ggctaccggc tcggccacgg ccagatcatc 420
gacgctgtcg ttcacgatgg tctgtggtgc gcttttgagc atcatcacat gggagtggcc 480
gctgaatgga ttgcgcgcac cttcaatgtc actcgcgaac agcaagatgc ttacgcattg 540
caatcacacc aacgcgcagt agccgctcag gacagcggcg ccttccaggc cgaaattgca 600
ccggtaaccg tcccagggcc gaaaggccag gtcaatctgg tgacgactga cgaaggcccc 660
agacgcgaca cctcgctggc tgcactggca aagctcaaac cggcatttgt caccgacggc 720
accgtcactg ccggcaatgc ccccggcatt accgacggcg cggcagcagt cgtactaatg 780
cgagccagcc gggcagccca attgggggtg caacccttag cccgcatcgg cacagccgcc 840
caggccgccg tcaagccgct tgaactcttc accgcaccgg cgtttgccat cgaacggctg 900
atgaagcggg caggccgtac cctcgacgac tacgacctgt tcgagatcaa cgaagccttt 960
gccgcccagg tcattgcgaa cctgcgtgcc ctggccctcg atgcagaccg ggtcaatgtc 1020
cacggtggcg cgattgctct cggccatccg ataggagcca gcggagcacg tgtcctggtg 1080
acactcatct cagcgttacg ccagcgcggc ggccagcgcg ggattgccgc actgtgtctg 1140
ggaggtggtg aagcggtcgc cctcgaagtc gaggtcgtct aa 1182
<210> 40
<211> 1143
<212> DNA
<213> Thermobifida fusca YX
<220>
<221> misc_feature
<222> (1)..(1143)
<223> Codon optimized
<400> 40
atggccgagg catacatcgt cggagcggtg cgcaccccga tcgggaccag gaagggggcg 60
ctcgctgcgg tgcacccggc cgacctgggc gcccacgtgc tcaaagaact ggtgaaccgg 120
accgggatcg acccggccgc ggtcgaggac gtgatcatgg gctgcgtcac ccaggtgggg 180
ccgcaggcac tcgacctggc ccgcaccgca tggctttctg ccggactccc ggagagtacg 240
ccgggggtca ccatcgaccg ccagtgcggt tcctcccagc aggccgtgca ctttgcggcg 300
caaggggtca tgtccggcac ccaagacctg gtgatcgcgg cgggtgtgga gaacatgggc 360
atggtcccca tgggcgccaa cgtgcagttc gccgtggaca acgggttgtc cgtctacggg 420
cagggctggg tcgaacggta cggcacccag gagatctccc agttccgcgg ggcccaactg 480
atgtgtgaaa agtgggggta cacccgcgag gacttggaga agtacgcgct ggaaagccac 540
cgtcgggctg ccgcggcgat cgaagcgggc tacttcgacg cggagactgc cccgctggcc 600
ggggtcaccc acgacgaggg ggtgcgcccc gacacgtcgc tggagaagat ggccgagctc 660
gcaccgctcc gcgaaggctg ggcgttgacc gcggccgtct ccagccagat ttcggtgggg 720
gcgagcgccc tgctcatcgc gtcggagcgg gccgtggccg agcacgggct cacccccttg 780
gcgcggatcg tgcaactggc tttggccggg gacgacccgg tgtacatgct caccgcgccg 840
atccccgcca ctcggatcgc gctgcgcaag gccgggctgg acatcgacga catcgacgtg 900
gtcgagatca acgaggcgtt cgccccggtc cccatggcgt ggatcgacga aatcggcgcc 960
gacccggcga aggtcaaccc caacggcggt gcgatcgccc tgggccaccc gctgggggcc 1020
accggcgccg tgctcatgac caagctcgtc cacgaactgc gccgcacggg cggccgctac 1080
gggctgcaga ccatgtgcga gggcggggga caggccaacg tcaccatcat cgaacgggtg 1140
tga 1143
<210> 41
<211> 1029
<212> DNA
<213> Sulfurifustis variabilis
<400> 41
atgcacagcg ttggccactc gcggatcatc agcacgggga tgtatctccc gccggagcgg 60
ctctcttcga gagagctcat ggagatgttc cgatcgcggg agcgatttgg actcccctac 120
gagtggctcg agcgcaccac cggcatccgc gagcggcgct tcgcgccgcc cgatttcaaa 180
tcctcggaga tggccgtcgc ggcggcccgc gaggcgctgg aactcggcga ggtctcgccg 240
tcccagatcg acgcgatcat ttactgcggg gtgctccgcg accacgtcga gcccgccacg 300
gcacacgtgg tccaggacaa gatcggcgcg cgcaatgcca tcgccttcga cgtctcgaac 360
gcctgcctcg ggttcatgaa cggcatgcat ttgatggacg cgctgatcgc caccggccag 420
gccaggcggg gtctcgtcgt aacgggcgag cgaggcaacc actacatccg caaggcgctc 480
cgggtgcttg cggaactgcc ggacaacggc gatttcagcg acctggccgc cgccctcacc 540
ctgggtgacg caggggccgc ggccgtcatg ggtcccaagc tcgacccgga gaccggcatc 600
aagggcttcg ttgtacagtc gcagggacag cacaacgggc tgtgcgtgtg cggggacaac 660
ggtgaggaca cgcatctggt caccaaaatc acggagatcg tgagggagac cacgaggctg 720
gtgggcccgt tgtaccaggc cctcatgcat gagcacctcg ggtggcaacc ctcggagctg 780
agtcgctata tcccccatca ggtcggattg cgctccgtgc gcaagcacgc cgaggtggcc 840
caagtcccgc tggaaatcat cccgattacg gtcgattacc tcggaaacat catttcggcc 900
accatacctg taaatatctc gttgttaatg aaggataaaa agctaaccaa cggggaaagg 960
atctatcttt ccgggacggg cagcgggatc agcatcgccc aggccgccat ggtatgggac 1020
gccgcctga 1029
<210> 42
<211> 1041
<212> DNA
<213> Desulfobulbus propionicus DSM 2032
<400> 42
atgactttgc gttacaccca ggtctgtttg cacgacttcg gctatcaact gccgccggtg 60
gagttgtctt cggcggcgat cgaggagcgg cttcagcccc tctatgagcg gctgaagctg 120
ccggccggtc gactggagct gatgaccggg atcaacaccc ggcgtctgtg gcaacccggc 180
acccggccaa gcgcaggggc ggcagccgct ggagcagatg ccatggccaa ggccggggtg 240
gacgtggccg atctcggctg tctgctcttt acctcggtga gccgcgacat gatggagccg 300
gccaccgccg cctttgtcca tcgcagcctg gggctgccct cgtcctgttt gctgtttgac 360
atttccaacg cctgtctcgg ctttcttgac ggcatgatca tgctggccaa catgctggaa 420
ttgggacagg tcaaggccgg gttggtggtg gcgggcgaga ccgccgaggg tctggtcgaa 480
tccaccctgg cccatctgct cgccgaaacc ggactgaccc gcaaatcgat caagcctctc 540
tttgcctccc tgaccatcgg ctcgggggcc gtggccctgg tgatgacccg gcgtgactac 600
cgggataccg gccattatct gcacggcggc gcctgctggg cccagaccgt ccacaacgat 660
ctttgccagg gcgggcagaa tgccgaacag ggcacgctca tgtccaccga ttccgagcag 720
ctgctggaaa agggcatcga gaccgcggcc gcctgctggc agcagtttca cgccaccttg 780
ggctgggaca agggttccat cgaccgcttc ttttgtcatc aggtcggcaa ggcccacgcc 840
caactgctgt tcgagaccct ggaactcgat ccggccaaga atttcgagac cctgcccctg 900
ttgggcaacg ttggttcggt gtccgcgccc attaccatgg ccttgggcat cgagcagggc 960
gccttgggtg ccggacagcg ggccgccatc ctgggcatcg gctcgggcat caattcgctc 1020
atgctgggca tcgactggta a 1041
<210> 43
<211> 1182
<212> DNA
<213> Carboxydothermus hydrogenoformans Z-2901
<400> 43
atgagagaag tagttattgt aagtgcggcc cgaacaccct ttgggaagtt tggtggagga 60
ctttcggctt taaaagcggt tgacctgggg gcaatagcta tcaaggcagc ggtagagaga 120
agcggagtaa gtccggaaga gtttgactat gtttacatgg gtcaggtttt acagggagga 180
gcgggtcaaa taccttcccg gcaggcggca agaaaagcgg gtctaccctg ggaagttccg 240
tcagtaacgg taaataaagt atgtgccagc ggtttaatcg cggtagctat ggcggcaaag 300
atgattgctt taggcgaaat tgacgtggca gttgcaggcg gaatggaaag catgagtaat 360
gccccatata tattgcccag tgcccgctgg ggacagagaa tgtttaattt tgaagctata 420
gatttaatgg tgcatgatgg tctctggtgt gctttttatg accggcacat ggcggttcac 480
ggctcggaag ttgccaagga atatggtatt tcccggcaag ctcaggatga atgggcatat 540
attagtcaaa tgagggctaa agaagcaatg gaaaaaggac ggctgaacga tgaaattgtc 600
aaagttgagg tacccgggaa aaaaggagag gttgttgtca ttgaaaaaga tgaacagccg 660
cgtcccaata caacgattga agctctttct aaacttccgc cggtttttga tgccaacggg 720
accgttactg ccggaaatgc tcccggtgta aatgatggag caggagcttt ggtcctaatg 780
agtagagaaa aagcccggga acttggaatt aaacctctgg cgacttacct taaccatgcc 840
gaagtagctt tagatgccaa atatattgca actgcaccgg gacaggcgat taacaagctt 900
ttagcgaaga aaggaatgaa aattgaacaa atagatcttt tagaagttaa tgaagctttt 960
gcggcggtag ttttggtcag tcaaaaaatt gccgggtata accttgaaaa agttaatgtt 1020
aacggtggag ctgtagcttt tggtcatccc atcggtgcaa gcggggcccg tattttaatg 1080
accctgattt atgagttaag acgtcggggt gggggaacag gaatagctgc catttgcagc 1140
ggagcggccc aaggggatgc catgttgatt aaagtggaat ag 1182
<210> 44
<211> 1182
<212> DNA
<213> Carboxydothermus hydrogenoformans Z-2901
<400> 44
atgcaggagg tagtaatttt aagtgcggtg aggactgcta taggcaaatt cggaggtagc 60
ttaaaagaca ttcctgctgc agaattgggg gctatcgtta taaaagaagc tttggtgaga 120
gctcaaatac cacctgcaga ggtagatgaa gttatttttg ggaatgtctt acaagctgga 180
caagggcaaa atccagcgcg tcaggcggct attaaggcag gcattccggt agatattccg 240
gcaatgactg taaatatggt ttgtggctcc ggtttacggt ctgtcagttt agcagctact 300
cttattgctg caggggaagc tgatcttatt gtcgcaggcg ggatggaaaa tatgtctgct 360
gctccctatg ccatacccgg agcacgttgg ggtacacgca tgggggatgg aaagattgtt 420
gatttgatga ttaaggatgg tctatgggat gctttttatg actaccatat ggggattact 480
gctgaaaatt tggcagaacg ctataatata agccgggaag aacaagacag atttgcttta 540
gaaagtcaac ggcgagctga aaaagcaatt aaagaaggac gtttccgtga tgagattgtt 600
ccggtgaagt tacctcagcg gaaaggagaa cctcttgaat ttgttcaaga tgagaacccg 660
cgttttgata ctactcttga agcgttagca aagttaaagc cagcatttaa agaaggtgga 720
acagtaactg ccggcaatgc atcaagcata aatgacggtg ctgcggcatt ggtaatagct 780
tccagcaaaa aagctgagag tttgggaatt aaaccaatgg ctgttattcg gagttgggga 840
gctaccgggg tagatccaag tattatgggg atcggtcctg ttggagctac tcgtaaagcg 900
ttaaagagag caggcttaac aattgctgat atagatttag tagaagctaa tgaagctttt 960
gcagctcaag cccttgcagt agctaaggag ttagaacttg acttaagtaa aactaatgta 1020
aatggtggtg ctattgcttt aggtcacccg attggtgcaa gcggggcaag aattcttgta 1080
actttgcttc atgagatgaa aaaatcaaat agccgctatg gcctggctac gttgtgcatt 1140
ggtggcggta tgggagtagc agctatagta gaaaaagctt ag 1182
<210> 45
<211> 1308
<212> DNA
<213> Syntrophothermus lipocalidus DSM 12680
<400> 45
atgagtttta agaagagcaa ggacgaccta gtatgtgtat cagcggtaag gactccgttc 60
ggtcgttttg gcgggtcaat gcgagacatc gatatctatg acctaggcgc cattgccatg 120
aagaacgccc tagagcggat aaaaatggac cccgagttga tcgacgaagt ctggtggggg 180
tgtggcgata ccactaactg taaagacccc tacaccccgg ttgttgctcg gcaaagcatg 240
ttgaaagccg gtattccacc ggaaaagccg tctgtaacgt ttgaccaagc ctgcatctct 300
gggatggatg cggtgaaata tggaggccgg agcatacaac tgggtgaagc cgagattgtc 360
atgactggtg gtgctactag ctttagtacg gttccattcc ttctccgcgg catacggtgg 420
gaaggaaaac gtcacacgtc atttctagtg gaggatccca taattcctct aggatacaag 480
gattacgctc cggtggcagt tgactctggg gacgtagctg ttgaatacgg agtatccaga 540
gaggaacagg atgagttcgc agtggccagt cacgttaagt acggcaaggc ctacgaacgg 600
ggtttcttca agcaggaaat ggttccgctg gaattggtca agaaggataa gaaagggaat 660
gtagtgtcta aaaaggtttt ggagattgac gagcaatatc gccctgacgt caaaattgaa 720
gaactagcca ggttaaagcc tatatttggc aatcctacgg taacggcggg taacgccccg 780
ggcatgaacg acggggcttg cgcacaaatc ttcatgaagc gggaaaaggc cgaacagtta 840
gggctggacg tcctgtatac agtggtagcc atgtcgtcaa tagcgcttca accccggatt 900
atgcccgtat ccccggcatt tgcgatcaaa aagtgcttgg acgtgaccgg gttaaccatc 960
gacgatatga agttcatcga gattaacgag gcctttgctt gcgttccact ggtggcaaca 1020
aaactcctgt ccaaccagag gttcctgact agcgactaca acgagatggt taaggaagcg 1080
tcgaccaagc ccatcctaga taacgacgat agcaagtacc aggaattgaa gagcaagcta 1140
aatgtcaacg gcagtgccat tgcggtgggg catgctaata cggccagtgg ttcacgcatc 1200
atgatgactg ctgcctataa cttgaaggaa aacgggggcg gctacgctgc gtgtgcaata 1260
tgcggcgggc tgactcaagg agcaggttgc ataatctggg ttgaataa 1308
<210> 46
<211> 1254
<212> DNA
<213> Syntrophothermus lipocalidus DSM 12680
<400> 46
atgaaagacg tggtcatagt gtccgcctgc cggacagcta taggaacctt tgggggttcg 60
ttgaaggacc taaatgctcc gaccctggcg aaggtagcca tgcggggagc aatcgagagg 120
gcagggatag atccggggct aattaatgat gtgcgctttg gttgtgcgtt tgaacatcct 180
gacagcaata acgtagctcg tgtcgcagcg ctgttagcag gagtacctgc tgagacatcg 240
actgcggtga cgataaacag ggtttgtgtg tcgggaatgg aagcggttgt atcgggtatg 300
gccatgatcc aggcaggcct tgtcgatgtg gttttggccg gtggggtgga gcacatgtca 360
ggtgtcccgt ttagcgttct aaatgccagg tggggctgtc gccttcagga ttcggttttt 420
gtcgataacc tgattcacgg gttatacggc gggtcaaagt ttttgccagg gccggaaaac 480
ggtccggtca aggaaggtcc cattctggag gcgggtcgag gtaagcctta tattatgggt 540
tacacggcgg agttattagc ccaatactgc aatatcagtc gcgaagcgat ggatgaagtt 600
gccctgcgga gccataacaa cgcagagcgc gccacaaagg atggatcgtt tcgagaggag 660
attgttccag ttgaaatccc acagaaaaag gggaaggctc ccttagtgtt tgacaaggat 720
gagcatttta gaccaggcgt tactatggaa caacttgctg ctttaccgcc ggcttttgtc 780
cccaagatcg gaaaagtaac ggcaggcaat gcttccggga tgaatgacgg agccgcagct 840
atggtaataa tgtcggctga taaggctaga gaattaggga tgaaaccaat tgccagaatt 900
aaggcggtcg gctacggagg atgccatcca tctatcatgg gattgagccc ggttccggct 960
ataaagaatc tgctgtcaaa atcggggctt aaattagaag attttgaact catagaaatt 1020
aatgaagcat ttgcggctca gtatctcgcc gttgagcagg aattgggctt aaatcgcgag 1080
attaccaacg tcaatggatc tgggatcggt ttggggcacc cggttggagc taccggatgc 1140
cggattatgg tgacgctgct gtatgcgatg aaaaagagag gcaagacact ggggttagca 1200
agcctgtgcg gcggcggcgg agtatcgatg gcggtcgctt tggagatggt ttaa 1254
<210> 47
<211> 1182
<212> DNA
<213> Carboxydothermus hydrogenoformans Z-2901
<400> 47
atggaagaag ttgtaatagt tagtgccgtt agaactccca ttggcagttt tttgggtagc 60
cttgcccaaa ctccggcagt ggatttggga gcgcttgtta ttaaagaaag tctaaaccgc 120
attaaccttg cgccccggtt tgtcgatgag gttatcatgg gaaacgtttt gcaggcagga 180
ttaggccaga acccagcccg gcaggcagct ataaaagcag gaatacccca ggaagtacct 240
gcttttacag taaacaaagt ttgcggttca ggattaaaat ccgtcggact ggcttatcag 300
gcaatagcaa caggtgatgc cgatatcgtt gtcgccgggg gaatggaaaa catgtcttta 360
gcaccttacg tcttgcccaa agccaggaca ggttaccgca tggggcatga taccctgata 420
gattccatga ttaaagatgg cttatggtgt gcttttaccg atgtgcatat gggtattacc 480
gcggaaaata tagccgaaaa atacaatata acccgcgaag aacaggataa atttgccctg 540
caaagtcagg aaagagctat aaaagcaatt gatgaaggaa agtttaaaga agaaatcgtt 600
cccgtaatca tcccccagaa aaaaggagaa cccctggtat tttccaccga tgaatttccc 660
aaacgcggta catccctgga aaaacttgcc gctctaaaac cggccttcaa aaaagatggt 720
accgtaactg ccggaaatgc ctcgggaatt aacgatggag ctgctgcggt tgtagttatg 780
tcagcaaaaa aagctcaaga gttaaatatt aaacccttgg ctgttatccg cggttatgca 840
gctgcgggag tagatcctgc ctatatgggt ttaggcccaa tacctgccac ccgcaaagcc 900
cttaaaaaag ccaatttaac cgtttcggac ttggggctta ttgaagcaaa cgaagctttt 960
gccgcccagg ctttagcggt aattaaagag ctcgaattaa atccggaaat aactaatgtc 1020
aacggtggtg ccatagcgtt aggtcacccc ataggagcct cgggagctcg aatattggta 1080
accttattac atgagatgca aaaacgtaat acaaaatacg gtttagcaac cttgtgtatc 1140
ggtggcggcc aaggatttgc tttagtagtt gaaaaagttt aa 1182
<210> 48
<211> 663
<212> DNA
<213> Pseudothermotoga lettingae TMO
<220>
<221> misc_feature
<222> (1)..(663)
<223> Codon optimized
<400> 48
atgaaaaaca aggcgattac cgtcgaacaa gcgattgaga tgatcccgga tggcgcggtc 60
cttatgattg ggggatttct tggtgacggc acgccggagt tgctaattga tgcgttggtc 120
aaatcaggga agcggaattt cacgattatt gccaacgata cggcctttcc ggacaagggc 180
attggcaaga tgatcgttaa caaaatggcg aaaaaggtca tcgtgtcgca tattgggacc 240
aatcctgaaa cacagaaaca aatgatcgag ggcacccttg aggttgagtt ggtgccgcaa 300
ggaacattag cggaaaaagt tagagccggc ggctttggcc ttggcggcat tttgacgccg 360
acaggcgtgg ggacggtcgt cgaaaatgga aaacagaaaa tcgtgatcga cgataaggaa 420
tatcttgttg aaccggcttt acgagcagac tttgcactaa ttaaagcaca aaaagcggac 480
ttctacggaa atctgttctt taatttgaca tcccgtaact ttaacccgct catggcgttt 540
gctggcaaaa taacaattgt cgaggtggaa gagtttgtac ctgttggggg actttctcct 600
aatgaaattc acacgccaca tgcggtggtg gattatattg ttcgggggaa cgctcggtaa 660
tga 663
<210> 49
<211> 678
<212> DNA
<213> Pseudothermotoga lettingae TMO
<220>
<221> misc_feature
<222> (1)..(678)
<223> Codon optimized
<400> 49
atgatccaag accaaaacct ggccaaagct gtcatcgcga aacgtgtggc attggaatta 60
aaagacgggg atattgtcaa tttaggaatt gggatcccta cgctcgtagc gaattatctt 120
ccccctaaag tagaaatctt cctccaatca gaaaatggca tcctgggtat gggtcctgct 180
ccaatgtcgg gctatgagca tccaaatttg acgaatgccg gcgggtcgcc gatcacgttt 240
ttgccaggtg cctgcgcatt tgacagcgcc gtctcctttg ggttaatccg cggggggcac 300
gtcgatgcga cggttctcgg cgccctccaa gtggatgaag aagggcactt ggccaactgg 360
atgattccgg gcaaaatggt gccgggcatg ggtggcgcta tggacttggt gacaggcgcg 420
aagaaagtca tcgtcgccat gcaacacgtc gccaaaggca atgctccgaa aatcgtgaaa 480
aagtgcacgc tgccgctcac gagcattagg cgcgtcgact tgattgttac ggatatggcg 540
gtgattgaag tcactgggaa tggtttaatt cttaaagaac ttgctccgca aacaacagtc 600
gatgaggtcg ttaaatttac ggaagcgaaa ctcattgtcc cagaggatgt gccagtgatg 660
ccggtaagcc tctaatga 678
<210> 50
<211> 1344
<212> DNA
<213> Deferribacter desulfuricans SSM1
<220>
<221> misc_feature
<222> (1)..(1344)
<223> Codon optimized
<400> 50
atggccgaaa ttttgaaatc gtctatcgag gccattaaag acgtgattaa agatggaatg 60
gtggtggccg ccgggggctt cgggctctgc ggtatcccag aaaacttgat caatgcaata 120
aaggagctca aagtcaagga tctgacattt gtgagcaata atgcgggtgt agatgatttt 180
ggccttggaa ttctcctcca aacgagacaa atcaaaaaga tgatttcgtc gtacgtgggc 240
gaaaataaga tttttgaaca gcaatacctt aacggggaac tggaattgga gttggtcccg 300
caaggcaccc ttgccgagaa attacgtgcg ggaggtgcag ggattcccgc gttctacacg 360
atgacgggct acgggacaat ccttactgaa gggaaagaaa tcaaagtatt cgatggcaaa 420
gaatatgtgc tggaagaaag cattcgccct gatttagcca tcgtgaaagg ctggaaagcc 480
gacaaaaaag gcaatgtaat cttccggtac actgccaata attttaatga agtgtgtgcg 540
aaagccgcga agtttacgat tgtcgaagtt gaagaaatcg ttgatgaaat tgacccgcac 600
tacatccacc ttccgtcgat ctacgtcgat cgaattgttc tcggcgaacg ctacgagaaa 660
agaatcgaac aacttacgac cttagaaaat atgacagaag cgaaaatgaa cgagaaacgt 720
gaatggatgg ccaaacgcgt ggcgaaagag ttgaaaaagg gtatgtatgt taacctcggc 780
attggcatgc cgacgttagt cgcgaacttt atcaccgacg atatggatat aacgctgcac 840
tcggaaaacg ggttacttgg cataggtccg tttccaaaaa cggagaaaga tgctgacccg 900
gaccttatca acgccggtaa acaaacaatc acgtataaaa aaggcgcggc tttttttgat 960
tcaagcgaat cgtttgctat ggtccggggc ggacatattg atctctccgt cctgggcgga 1020
atgcaggtca gcgaaaaggg cgaccttgca aactggatga ttccgggtaa aatggtgaaa 1080
ggaccggggg gagccatgga cttagtctcc ggcgttaaaa aagttatcgt tatgatggaa 1140
catgtggcta aagatgggaa gccgaagatt ctaaaagaat gcacgttgcc gattaccggg 1200
aaaggtgttg tggatatgtt ggtgactgac aagggtgtgt tcgagatcaa ttctgagggg 1260
ttgtaccttt tagaaatctc tccatttagt gaccttgaag acattaagaa gagcaccggg 1320
tgtgaagtga aagtcaaata ataa 1344
<210> 51
<211> 687
<212> DNA
<213> Geobacillus sp. GHH01
<400> 51
atgaagcaga tacattcttc ttttattgag gcggtgaaag acattccgga cggagcaacg 60
attatggttg gcggcttcgg gcttgtcggc attccagaaa acttaattct cgcgctagtg 120
gaaaccgggg tcaaggaatt aacagtcatc tctaataatt gtggcgtgga tgactgggga 180
cttggattgc tcctgaaaaa taaacaaatt aagaaaatga tagcttccta tgttggagaa 240
aataaggagt ttgaacgcca agttctcaat caggaaatag aagttgaatt aattccccaa 300
ggaacgttgg cagaacgcat tcgcgccggc ggggccggaa taccggcatt ttatactcct 360
gctggagttg gcaccccgat tgcggaaggg aaagaagtac gagtatttaa cggcaaagag 420
tatattcttg aaacggcgtt agttgctgac tttagtttag tgcgcgcgtg gaaaggagat 480
aaaatgggga atttgattta caacaaaaca gcgcgtaact ttaacccgat gatggcggca 540
gcagggaaag ttacgattgc agaagtggag gaacttgtgg aaattggaga attggatccg 600
gatcacattc atacgccaag catttatgta caacgattag tagttggaaa acaagaaaaa 660
cggattgaac gtctagttgt tcgctag 687
<210> 52
<211> 660
<212> DNA
<213> Geobacillus sp. GHH01
<400> 52
atgaataaac aatccattcg tgaaagaatt gccaagcgtg ctgaacagga gattgaaaac 60
ggtttctacg tcaatttagg gattggaata ccaactcttg tcgccaattt tattcaatcg 120
cataaaaagg tggtgctgca gtccgaaaac ggattgttag ggattggacc ttaccctctc 180
aaggatgagg tagaccccga tttaatcaat gccgggaaag aaacgataac ggctattccc 240
ggagcttgct attttagcag tgccgaatca tttgccatga tccgtggcgg tcatatcgat 300
gtagctattt taggaggaat ggaagtttcg gaagagggtg atcttgctaa ttggatgatc 360
cctggaaaaa tgattaaagg catgggagga gcgatggatc tagtgcatgg agcgaaaaag 420
attattgttg ttatggagca tgttagcaag gatggaaaac cgaagattgt gaaaaagtgt 480
agtcttccgt tgacaggaag gaaagtggtc aaccgcatta ttaccgaaaa agcggttatc 540
gatgtgaccg agaatggctt gaagttagta gaaattttgg atggaagtag cgtcgaagag 600
attcaatctc tgacagaacc aacattgatg atcgatgaaa cgcttcttat tcaggcataa 660
<210> 53
<211> 654
<212> DNA
<213> Thermosipho melanesiensis BI429
<220>
<221> misc_feature
<222> (1)..(654)
<223> Codon-optimized
<400> 53
atgaaagtgg tagacatctc taagatcaac gagctggtaa aagagggtgc aacattgatg 60
atcggtggtt ttctcggtgt tggcacaccg gaaaatatca ttgatgagat catccggcat 120
aacatttcta accttacagt gattgctaac gatacagctt ttgaagaccg gggtattggt 180
aaattagtaa agaataaact ctgcaagaag gtaattgtgt cccatatcgg aacaaacccg 240
gaaacacaac gccagatgat tgagggcaca ctggaggtgg agcttgtacc gcagggaacc 300
cttgccgaac gtatccgcgc cgctggggta gggcttgggg gtatccttac gcctacaggt 360
gtaggcacgg tggtggagaa agacaagaag gtgatcgaag tggaaggcaa aaagtactta 420
cttgaacttc cgatccatgc ggacgtcgcc cttatcaaag cgaaaaaggc agactatctc 480
ggtaaccttg tctataacct cacggctgaa aattttaacc ctattatggc ccttgcggca 540
aagacagtta tcgcagaggt cgaggaaatc gtgccaacgg gcacattatc tcctaatgag 600
atcaaaacgc ctgggattat cgttgattac atcgtaacag gggtcacacg ttag 654
<210> 54
<211> 645
<212> DNA
<213> Thermosipho melanesiensis BI429
<220>
<221> misc_feature
<222> (1)..(645)
<223> Codon-optimized
<400> 54
atgaacccta aagaaaaaat cgctattcgc gttgcacaag aactcaaaaa gggacagtta 60
gtaaacctcg gaatcggatt accaacgctt gtagcgaact acatcccgaa agatattcat 120
gtcttcttcc agtccgagaa tggtatcatt ggaatgggcc ctgcgccgaa ggagggatac 180
gagaactcgg atttaacgaa tgccggtgcg agctacatta cggcccttcc aggtgcgatg 240
accttcgatt ctgcgttctc gtttggaatt atccggggtg ggcaccttga cgttacagtt 300
cttggaggtt tacaagttga cgaggagggg caccttgcga attggatgat ccctgggaag 360
atgattcctg gaatgggggg cgctatggat ctggtaacag gggctaagaa ggtcattgta 420
gccatgaccc acaccgcaaa gggtacccca aaaatcgtca agaagtgtac attaccactt 480
acatccatcc gcaaagtaga tcttattgta acggagttag cagttattga accgacagac 540
gagggcctct tgctgaagga gatctctaag gaaacgacac tggatgaagt tctcaaattg 600
acagaagcta agttaattat tgccgatgac ctgaaaatct tctaa 645
<210> 55
<211> 1557
<212> DNA
<213> Pelotomaculum thermopropionicum SI
<220>
<221> misc_feature
<222> (1)..(1557)
<223> Codon-optimized
<400> 55
atggcgccac ggtttttaac tgccgaagaa gccgtcaacc tgattaaaga tggcgacacg 60
gtcgcgtccg tggggttcct cggaaatgtg ttccctgagg agttagctgt ggcattggag 120
gaacgctttc tgaaaacggc caaaccggag cgcctgactc ttatctatgc agcggcacaa 180
ggtgatggca aggaacgcgg ccttaatcat ctggcctatg aaggtttggt gaagcgggtc 240
attggtggtc attggaactt gcaaccaaag atggccaaat tagccatcga gaataaaatc 300
gaagcttata acttaccgca aggcaccatc agccaacttt tcagggagat tgcggccaaa 360
cgtccagggg tcatcacgca tgtaggactt aaaacgtttg tcgaccctcg cattgagggg 420
ggtaaactga atgccgtgac taaagaagac attgtcgaag tcattacaat tgatggaaaa 480
gagaaattgt tttaccgttc tattccgctc aatgtagggt tgattcgcgg cacatccgca 540
gatcagttgg gcaatatttc gctcgaaaaa gaggcgaata ccctcgaagt ccttagcatc 600
gcccaagcgg ttcgcaactg cggtggcatc gtgattgccc aggtagaacg cgtagtagca 660
gctggctcgt tagacccacg gttagtaaaa gttccgggta ttttggtaga cgtggtcgtc 720
gtctcacggc ctgaaaacca tcatcaaact tttgctgaag tatacaaccc tgcttacagt 780
ggagaggtag tcattcctct gaccgaactt ccgccggcta aattggatga acgtaaggtg 840
atcagccgtc gcgccgcttt cgaacttcgg ccgggcagcg tggttaactt ggggatcggg 900
atccctgaag gtattgcgtc tgtcgcggca gaggaaggga tcagcgactt catgacactg 960
acggtcgaag ccggaccggt tggcggcgtt ccagcgggcg gcttgtcatt tggggcttcg 1020
acgaatccgt attgcgtgct cgatcaagcc tatcaattcg atttctacga cggaggcgga 1080
gtagatattg cctttttggg tttggctcaa atggatagca acggaaacat taacgttagt 1140
aaatttgggc ctcgtattgc aggatgtggc gggtttatca acattacgca aaatgcgaaa 1200
aaagtggtat tctgcggcac gtttaaggcc gggggcttga aagtgaacgt gggagatgga 1260
aaattaacta ttgtgaacga gggaaagtcc gtgaaattgg tgccgaaagt cgagcagatc 1320
acctttagtg gtgaatatgc tagacaacaa ggccagaaag tgctgtatat caccgaacgc 1380
gctgtctttg aaatgacggc cgaaggtgtg atgttgactg aaatcgctcc tggcgtcgac 1440
ttggaacgcg atgtcttgca acagatggac tttaagccac tgatctcacc gtcgttaaaa 1500
acaatggaca aacgcatctt tatagacgcg ccgatgggaa ttaaaaattc ctgatga 1557
<210> 56
<211> 870
<212> DNA
<213> Rhodothermus marinus DSM 4252
<220>
<221> misc_feature
<222> (1)..(870)
<223> Codon-optimized
<400> 56
atgtccgaac cggtcgacca tctacttcat ctgcttaatt tggaacgcat cgaagagaat 60
atttttcgag gaccgtctcg tgatattggc tcgccaacgg tgtttggtgg gcaagtgctt 120
ggccaggcgt tacgggccgc cgcctacact gtgccgccag agcgtagagc ccatagcttg 180
catgcctatt ttattcttcc aggtgatccg aacgcgccga ttgtatatct agtggagcgg 240
ttacgcgatg gccggtcgtt tacgactcgc agagtaacgg caatccaaca tggccggccg 300
atctttaacc tctcggcgag ctttcaaatt gaagaaccag gagttgaaca tcaggatccg 360
atgcccgagg tgcctccgcc ggaggaactt atttccgaag cagagctacg ccggcagctt 420
gctgaacagg tgccggaagt tttaaggcca ttcttgctgc acgaacgtcc gattgaaata 480
cggccggtcg agccggtcaa tttattgttt ccggagaaac gtccgccacg caggcatgcg 540
tggattcgag cagcagggac gcttccggat gacgacttgg ccctccatca gtcagtttta 600
gcctatgctt cagattttgg tttcatgggt acggcgatgt taccgcacgg cttgtcattt 660
ctgcaaccgc atgttcaagc cgcatcattg gatcacgcca tgtggtttta tcgtccgttt 720
cgggcagacg aatggctgtt gttcgccatg gaatcaccgg tcgcggccca cgcacggggc 780
ttaaataggg gcttgttttt taggcgtgat gggacgctgg tagcagcggt cgtccaagaa 840
ggacttatgc ggattcgctc ggattaatga 870
<210> 57
<211> 735
<212> DNA
<213> Clostridium acetobutylicum ATCC 824
<400> 57
atgttaaagg atgaagtaat taaacaaatt agcacgccat taacttcgcc tgcatttcct 60
agaggaccct ataaatttca taatcgtgag tattttaaca ttgtatatcg tacagatatg 120
gatgcacttc gtaaagttgt gccagagcct ttagaaattg atgagccctt agtcaggttt 180
gaaattatgg caatgcatga tacgagtgga cttggttgtt atacagaaag cggacaggct 240
attcccgtaa gctttaatgg agttaaggga gattatcttc atatgatgta tttagataat 300
gagcctgcaa ttgcagtagg aagggaatta agtgcatatc ctaaaaagct cgggtatcca 360
aagctttttg tggattcaga tactttagta ggaactttag actatggaaa acttagagtt 420
gcgacagcta caatggggta caaacataaa gccttagatg ctaatgaagc aaaggatcaa 480
atttgtcgcc ctaattatat gttgaaaata atacccaatt atgatggaag ccctagaata 540
tgtgagctta taaatgcgaa aatcacagat gttaccgtac atgaagcttg gacaggacca 600
actcgactgc agttatttga tcacgctatg gcgccactta atgatttgcc agtaaaagag 660
attgtttcta gctctcacat tcttgcagat ataatattgc ctagagctga agttatatat 720
gattatctta agtaa 735
<210> 58
<211> 1062
<212> DNA
<213> Thermoanaerobacter brockii Ako-1
<220>
<221> misc_feature
<222> (1)..(1062)
<223> Codon-optimized
<400> 58
atgaaaggat ttgcaatgtt atccattgga aaggtaggtt ggatcgaaaa agagaagcca 60
gcgcctggac catttgatgc tatcgtccgc ccgttggcag ttgcaccttg cacctccgac 120
attcacaccg ttttcgaggg cgcgatcgga gagcgccata acatgatttt gggccatgag 180
gctgttggtg aagtagtaga ggtgggttcc gaggttaaag attttaagcc tggtgaccgg 240
gtagtggtcc ctgccattac cccagattgg cgtacgtctg aagtacagcg tggttaccac 300
cagcactcgg ggggaatgtt ggcaggatgg aagttttcga atgtaaaaga tggcgtattc 360
ggtgaatttt ttcatgtaaa cgacgcggat atgaacttgg ctcaccttcc gaaggaaatc 420
ccgcttgaag cggcagttat gatcccggac atgatgacca ccggctttca cggtgcggag 480
ctggccgaca tcgagttagg cgctacggta gccgtacttg gtatcggtcc tgtaggactc 540
atggccgttg ctggcgcgaa actgcgtgga gcgggccgca tcatcgccgt cggatcccgg 600
ccagtctgtg tggacgcggc aaaatattat ggagcaacag acatcgtcaa ttataaggat 660
gggcctatcg agtcgcagat catgaactta accgagggca aaggcgtaga tgccgccatc 720
attgcaggtg ggaatgctga cattatggct acagccgtga aaatcgttaa accgggtggg 780
accatcgcaa atgtgaatta ctttggggag ggggaagtcc tgccggttcc tcggcttgag 840
tggggatgcg gaatggctca taagacgatt aagggaggct tatgtcctgg tggccgcttg 900
cgtatggagc gtttgattga tctggtcttt tacaagcgcg tcgatccgtc gaagcttgtt 960
acccatgtct ttcggggatt tgataacatt gagaaggcct tcatgctcat gaaagataag 1020
ccaaaggatc ttatcaagcc ggtcgtcatc ttagcgtagt ga 1062
<210> 59
<211> 391
<212> PRT
<213> Deferribacter desulfuricans
<400> 59
Met Arg Asp Val Phe Val Val Glu Gly Leu Arg Thr Pro Phe Gly Ser
1 5 10 15
Phe Gly Gly Ser Leu Ser Asp Val His Pro Ala Val Leu Ala Ala Asp
20 25 30
Val Ile Lys Lys Leu Leu Glu Lys Thr Glu Val Lys Pro Asp Asp Ile
35 40 45
Asp Glu Val Ile Leu Gly Gln Val Leu Thr Gly Gly Phe Gly Gln Ala
50 55 60
Pro Ala Arg Gln Ala Met Arg Tyr Ala Gly Leu Leu Asp Lys Val His
65 70 75 80
Ala Met Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Lys Ala Leu Met
85 90 95
Leu Gly Ala Gln Ser Ile Met Leu Gly Asp Ser Asp Leu Ala Ile Val
100 105 110
Gly Gly Met Glu Asn Met Ser Met Ala Pro Tyr Ala Leu Leu Gln Ala
115 120 125
Arg Tyr Gly Tyr Arg Met Gly Asn Asn Glu Val Val Asp Leu Met Ile
130 135 140
Tyr Asp Ala Leu Leu Asp Pro Tyr Thr Lys Arg His Met Gly Glu Leu
145 150 155 160
Thr Glu Glu Thr Ile Lys Lys Val Gly Val Thr Arg Glu Glu Gln Asp
165 170 175
Asp Tyr Ala Glu Arg Ser Tyr Lys Leu Ser Gln Lys Ala Val Glu Ser
180 185 190
Gly Ile Phe Asp Glu Glu Val Val Pro Val Val Lys Lys Thr Lys Lys
195 200 205
Gly Asp Ile Val Val Asp Lys Asp Glu Glu Pro Phe Arg Val Asn Phe
210 215 220
Glu Lys Leu Arg Gln Leu Arg Pro Val Phe Val Lys Asp Gly Thr Ile
225 230 235 240
Thr Ala Gly Asn Ala Ser Thr Ile Asn Asp Gly Ala Ala Cys Leu Leu
245 250 255
Leu Ala Ser Glu Asp Ala Val Lys Lys Tyr Asn Leu Lys Pro Ile Gly
260 265 270
Arg Leu Val Ala Tyr Ala Thr Asn Ser Ile His Pro Asp Glu Phe Ser
275 280 285
Leu Ala Pro Val Gly Ala Ile Glu Lys Val Cys Glu Lys Ala Gly Leu
290 295 300
Lys Leu Asp Asp Ile Asp Leu Phe Glu Ile Asn Glu Ala Phe Ala Ala
305 310 315 320
Val Val Leu Phe Ala Val Lys Lys Leu Asn Leu Pro Leu Asp Lys Val
325 330 335
Asn Val Asn Gly Gly Ala Val Ser Ile Gly His Pro Val Gly Ala Ser
340 345 350
Gly Gly Arg Leu Ala Val Thr Leu Leu Lys Glu Met Gln Arg Arg Asn
355 360 365
Ala Lys Tyr Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Glu Ala Val
370 375 380
Ser Ala Ile Phe Glu Arg Val
385 390
<210> 60
<211> 402
<212> PRT
<213> Rubrobacter xylanophilus
<400> 60
Met Ser Phe Gly Asn Gly Asn Gly Arg Glu Val Val Ile Ser Thr Pro
1 5 10 15
Leu Arg Thr Ala Ile Gly Thr Phe Gly Gly Ser Leu Arg Asp Val Pro
20 25 30
Ala Thr Glu Leu Gly Ala Thr Val Gly Arg Glu Val Ile Ser Arg Ser
35 40 45
Gly Val Asp Pro Glu Arg Val Asp Gln Val Val Val Gly Asn Ile Leu
50 55 60
Ser Ala Gly Gln Gly Met Asn Pro Ala Arg Gln Val Gly Ile Lys Ser
65 70 75 80
Gly Leu Pro Val Glu Ala Pro Ala Met Thr Leu Asn Arg Met Cys Gly
85 90 95
Ser Gly Leu Gln Ala Ile Val Ser Ala Ala Gln Glu Ile Ala Leu Gly
100 105 110
Asp Ala Glu Val Val Met Ala Gly Gly Ile Glu Asn Met Asp Gln Ala
115 120 125
Pro Phe Leu Leu Pro Lys Gly Arg Tyr Gly Tyr Arg Met Gly Met Pro
130 135 140
Lys Ala Asp Leu Leu Asp His Met Val Tyr Asp Gly Leu Trp Asp Ile
145 150 155 160
Phe Asn Asp Tyr His Met Gly Met Thr Ala Glu Asn Val Ala Glu Arg
165 170 175
Tyr Gly Val Ser Arg Glu Asp Ser Asp Ala Tyr Ala Val Arg Ser His
180 185 190
Gln Arg Ala Ala Arg Ala Ile Ala Glu Gly Tyr Phe Asp Glu Gln Ile
195 200 205
Val Pro Val Glu Val Arg Gln Lys Lys Glu Thr Val Lys Phe Thr Arg
210 215 220
Asp Glu His Val Arg Glu Asn Ala Thr Leu Glu Gly Leu Ala Arg Leu
225 230 235 240
Lys Pro Val Phe Lys Arg Asp Gly Gly Thr Val Thr Ala Gly Asn Ala
245 250 255
Ser Gly Ile Asn Asp Gly Ala Ala Met Met Leu Val Ser Ser Ala Arg
260 265 270
Lys Ala Glu Glu Leu Gly Leu Pro Val Ala Gly Arg Leu Val Ser Ala
275 280 285
Ala Val Ala Gly Val Asp Pro Ala Ile Met Gly Val Gly Met Val Pro
290 295 300
Ala Ser Arg Ala Ala Leu Lys Lys Ala Gly Leu Ser Ile Glu Asp Met
305 310 315 320
Asp Val Val Glu Ala Asn Glu Ala Phe Ala Ser Ile Ala Val Thr Val
325 330 335
Gly Arg Glu Leu Lys Val Pro Glu Glu Lys Leu Asn Pro Leu Gly Gly
340 345 350
Ala Val Ala Leu Gly His Pro Ile Gly Ala Thr Gly Ala Ile Leu Thr
355 360 365
Val Lys Ile Leu His Glu Leu Ala Arg Thr Gly Gly Arg Tyr Gly Leu
370 375 380
Val Thr Leu Cys Ile Gly Gly Gly Met Gly Ile Ala Ala Ile Phe Glu
385 390 395 400
Arg Val
<210> 61
<211> 1179
<212> DNA
<213> Deferribacter desulfuricans
<400> 61
atgcgggatg tctttgtagt cgaggggctg cgtacgccat tcggaagctt tggcggctca 60
ctgtcggatg tccatccggc cgttttagct gcggacgtga tcaaaaagct tttagaaaaa 120
acagaagtga aaccggatga catcgatgaa gtcatcttgg gacaggtgct cacaggcgga 180
tttggccaag ccccagcccg tcaagctatg cgttacgcgg gcctgttaga caaggttcat 240
gctatgacga tcaataaagt ttgcggctct ggcttgaagg ctttaatgct cggggcccaa 300
agcattatgt tgggcgattc agatctcgcc atcgtcggag gcatggagaa catgagcatg 360
gcgccgtatg ctttgctgca ggctcgttac ggctatcgca tgggcaacaa cgaggtggtg 420
gatttaatga tctatgatgc actgctcgac ccgtatacca aacgccatat gggtgagttg 480
acggaagaaa cgatcaaaaa agtcggcgtg acccgcgaag aacaggatga ctatgcggaa 540
cgcagctaca aactcagcca aaaagcagtg gaatcaggca tctttgacga ggaagtggtt 600
cctgtggtca aaaaaacaaa aaagggcgat attgtcgtcg ataaagatga agaaccgttc 660
cgggtcaact ttgagaaact ccggcagtta cgcccggtct tcgtaaaaga cggcaccatt 720
acagcgggta atgcctcgac cattaacgat ggtgccgcct gccttctctt ggcctcagaa 780
gacgccgtca agaaatataa ccttaaacca attggccgct tggtagccta tgccacgaat 840
tcgattcatc cagacgagtt cagcctcgcg ccggtgggcg cgattgaaaa ggtgtgtgaa 900
aaagcgggct taaaattaga tgacattgac ttatttgaaa tcaatgaggc ctttgctgcg 960
gtcgtcttgt ttgctgtaaa aaaactcaat ttgccgttag ataaagtcaa cgttaatggt 1020
ggagcagtca gcattggcca cccggtgggc gcgtcgggtg gccggttagc cgtcacgctg 1080
ttgaaagaaa tgcagcgccg taacgcaaag tatggcttag ccaccttgtg catcggaggt 1140
ggtgaagccg tgagcgcgat tttcgagcgc gtctaatga 1179
<210> 62
<211> 1212
<212> DNA
<213> Rubrobacter xylanophilus
<400> 62
atgtcgtttg gaaacggaaa tggacgcgaa gtcgtgattt cgacgccgtt acgcaccgcg 60
attggcacgt tcggtgggtc gctccgcgac gttcctgcga cggagttggg tgcaacagtc 120
gggcgtgagg tcatctcacg ctccggcgtt gatccggaac gcgtagacca agtcgtggta 180
ggaaacatct tgtcggcagg ccaaggtatg aacccagcgc gccaagtcgg gatcaagagc 240
ggcttgccag tcgaggcccc agctatgacc cttaaccgca tgtgtggatc gggcctccaa 300
gctattgtgt ccgcggcgca ggagatcgcg ctgggggatg cagaagttgt gatggctggc 360
ggaattgaaa atatggacca agctccattt ctgttgccga aaggtcggta tggataccgt 420
atgggtatgc caaaagctga cttattggat catatggtct acgacggact gtgggatatc 480
ttcaacgatt accacatggg catgacggcc gagaatgttg ccgaacgtta tggtgtctca 540
cgtgaagata gcgacgcata cgcagtgcgc agccatcagc gcgccgcgcg tgctattgcg 600
gaagggtatt ttgacgagca aattgtccct gtggaggtgc gccagaagaa agaaacggta 660
aaattcacac gggatgagca cgttcgtgaa aacgcgacgt tggaaggtct tgcccggtta 720
aaaccagtct ttaaacgcga cggaggcacg gtcaccgccg gaaacgcctc gggcattaac 780
gatggcgcag ccatgatgtt ggtctcaagc gctcgcaagg ccgaagagtt gggcttgccg 840
gtcgcgggtc gcctggtgtc tgcggccgtt gccggggtgg atccggctat tatgggggtc 900
ggcatggtgc cggcatcacg cgctgcttta aaaaaagcag gcctttctat tgaagacatg 960
gacgtcgtcg aggccaatga agcctttgct agcatcgctg tcacggtagg gcgtgaattg 1020
aaagttccgg aagagaaact taacccgctg ggcggagcgg tcgcgttggg tcatccgatc 1080
ggcgccacgg gagctatcct cacggtgaaa atcttgcatg agcttgcacg cacaggcggc 1140
cgttatgggc tggtaacgtt gtgcatcgga ggtggcatgg gtatcgctgc aatcttcgaa 1200
cgcgtgtaat ga 1212
<210> 63
<211> 1188
<212> DNA
<213> Chloroflexus aurantiacus J-10-fl
<400> 63
atgagcgaga agcgagaggt cgtggtgctc agcggcgtgc gcacggccat tggcacattt 60
ggtggtagtt tgaaggatat tccgccaacc gaattggcgg cactggttac ccgtgaagca 120
gttgcccgct ctggcctgca accaaacgaa atcggtcatg tggtcttcgg gcacgtgatc 180
aataccgaac cgcatgatat gtatctggct cgctatgcgg cggtacgcgg cggtctgtcg 240
gtagagactc cagccctaac gcttaaccgg ctctgcggta gtgggttgca ggctatcgtg 300
tcggcggccc aatacatttt gcaaggtgat gctgaagcag ccgttgccgg tggtgccgag 360
tgtatgagcc gtggcccgta cagcttgccg gccatgcgtt tcggtgcccg tatgaatgat 420
tcaaaggtcg tcgatatgat ggtcggtgcc ctaaccgacc cgtttgacga ttgccatatg 480
ggggtaactg ccgaaaacgt ggcggcaaag tggggaatca gccgcgaaga tcaggatcaa 540
ctggcttacg agagccatat gcgcgcagcg cgggcgattg acgaaggacg tttcgccaat 600
cagatcgtgc cggttgagat taaggtcaag ggtggtaccg cccaattcat ggttgatgaa 660
ggggtacgcc gcgatacgac catcgacaag ctggccaagc tccgcccggt gtttctgaag 720
gatggttcgg tgacggccgg caatgcttcg agcatcaatg atgcagcagc ggcggtagtg 780
ctgatggatc gggccaccgc tgagcggcgt ggctacaagc cgctggcgcg tctggtcggt 840
tacagccacg cagccgttga gccaaagtat atggggattg gcccggtacc ggctgtacga 900
cgcctgcttg agcgcaccgg cttgcgcatc agtgatattg atcttttcga ggtcaacgaa 960
gcttttgcag cgcaggcgct ggctgtgatc cgcgatctgg agttgccgcc tgatcgcacg 1020
aatcccaatg gtagtggtat ctccctcggt cacccgattg gcgccaccgg ctgtattttg 1080
accgtcaaag caattcacga actacaccgc accggtggcc gttatgccct ggtcacgatg 1140
tgtatcggtg gtgggcaggg gattgctgcg atcttcgagc gaatgtag 1188
Claims (18)
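1. A method for producing one or more compounds selected from acetone, butanone and isopropanol, the method comprising the steps of:
a) providing a thermophilic cell, preferably a thermophilic bacterial or thermophilic archaeal cell, said cell expressing:
i) a first enzyme consisting of an acetyl-CoA acetyltransferase (EC 2.3.1.9) selected from: Slip_0880 as set forth in SEQ ID NO: 7, Caur_1461 as set forth in SEQ ID NO: 3, GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Slip_0479 as set forth in SEQ ID NO: 4 and Dde1 as set forth in SEQ ID NO: 59,
or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity;
ii) a second enzyme selected from an acetate CoA-transferase, a 3-oxoacid CoA-transferase, an acyl-CoA:acetate/3-ketoacid CoA-transferase and an acyl-CoA thioesterase II,
wherein the second enzyme is selected from: Tle2 and Dde2 as set forth in SEQ ID NO: 21 (EC 2.8.3.5), or a functional variant thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity,
wherein Tle2 consists of Tle2 subunit A as set forth in SEQ ID NO: 19 (EC 2.8.3.8) and Tle2 subunit B as set forth in SEQ ID NO: 20 (EC 2.8.3.9), or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity,
and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), wherein the acetoacetate decarboxylase is Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity; and
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), wherein the isopropanol dehydrogenase is Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity;
b) culturing the thermophilic cell in a bioreactor comprising a culture broth at a temperature between 42 °C and 80 °C, such as between 50 °C and 75 °C, for example 60 °C, thereby producing the one or more compounds;
c) recovering the one or more compounds produced in step b).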
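2. The method according to claim 1, wherein the thermophilic cell has an optimal growth temperature between 42 °C and 80 °C, such as between 50 °C and 75 °C, for example 60 °C.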
3. The method according to any one of the preceding claims, wherein the thermophilic cell belongs to a genus selected from: Geobacillus, Thermoanaerobacterium, Thermoanaerobacter, Caldanaerobacter, Bacillus, Thermoclostridium, Anoxybacillus, Caldicellulosiruptor, Moorella, Thermus, Thermotoga, Pseudothermotoga, Chloroflexus, Anaerocellum, Rhodothermus, Sulfolobus, Thermococcus, Pyrococcus and Clostridium, preferably wherein the thermophilic cell belongs to a species selected from: Geobacillus thermoglucosidasius, Geobacillus toebii, Geobacillus stearothermophilus, Geobacillus thermodenitrificans, Geobacillus kaustophilus, Geobacillus thermoleovorans, Geobacillus thermocatenulatus, Thermoanaerobacterium xylanolyticum, Thermoanaerobacterium saccharolyticum, Thermoanaerobacterium thermosaccharolyticum, Thermoanaerobacter mathranii, Thermoanaerobacter pseudoethanolicus, Thermoanaerobacter brockii, Thermoanaerobacter kivui, Caldanaerobacter subterraneus, Clostridium thermocellum, Clostridium thermosuccinogenes, Thermoclostridium stercorarium, Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus smithii, Bacillus methanolicus, Bacillus flavothermus, Anoxybacillus kamchatkensis, Anoxybacillus gonensis, Caldicellulosiruptor bescii, Caldicellulosiruptor saccharolyticus, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor owensensis, Caldicellulosiruptor lactoaceticus, Moorella thermoacetica, Moorella thermoautotrophica, Thermus thermophilus, Thermus aquaticus, Thermotoga maritima, Pseudothermotoga lettingae, Pseudothermotoga thermarum, Chloroflexus aurantiacus, Anaerocellum thermophilum, Rhodothermus marinus, Sulfolobus acidocaldarius, Sulfolobus islandicus, Sulfolobus solfataricus, Thermococcus barophilus, Thermococcus kodakarensis, Pyrococcus abyssi and Pyrococcus furiosus, preferably wherein the cell is a Geobacillus thermoglucosidasius cell, a Bacillus subtilis cell or a Clostridium thermocellum cell.
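4. The method according to any one of the preceding claims, wherein the culture broth comprises a fermentable substrate comprising a carbon source, such as a carbohydrate, for example glucose, xylose or a mixture thereof, or such as a biomass hydrolysate.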
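5. The method according to any one of the preceding claims, wherein the one or more compounds comprise acetone and optionally isopropanol, wherein the cell is capable of synthesizing acetyl-CoA and/or wherein the culture broth comprises acetic acid or acetate, and/or wherein acetone is produced at a titer of at least 0.8 g/L, such as at least 0.9 g/L, such as at least 1.0 g/L, such as at least 1.1 g/L, such as at least 1.2 g/L, such as at least 1.3 g/L, such as at least 1.4 g/L, such as at least 1.5 g/L, such as at least 1.6 g/L, such as at least 1.7 g/L, such as at least 1.8 g/L, such as at least 1.9 g/L, such as at least 2.0 g/L, such as at least 5 g/L, such as at least 7.5 g/L, such as at least 10 g/L, such as at least 12.5 g/L, such as at least 15 g/L, such as at least 20 g/L, such as at least 25 g/L, such as at least 50 g/L, such as at least 75 g/L, such as at least 100 g/L, such as at least 150 g/L, such as at least 250 g/L or more.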
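6. The method according to any one of the preceding claims, wherein at least acetone is produced and wherein the first enzyme is Caur_1461 (SEQ ID NO: 3), Slip_0880 (SEQ ID NO: 7) or Dde1 (SEQ ID NO: 59), or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity.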
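7. The method according to any one of the preceding claims, wherein the thermophilic cell expresses Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity, whereby at least part of the acetone produced is converted to isopropanol, preferably wherein at least isopropanol is produced at a titer of at least 0.05 g/L, such as at least 0.075 g/L, such as at least 0.1 g/L, such as at least 0.2 g/L, such as at least 0.3 g/L, such as at least 0.4 g/L, such as at least 0.5 g/L, such as at least 0.75 g/L, such as at least 1.0 g/L, such as at least 2.0 g/L, such as at least 3.0 g/L, such as at least 4.0 g/L, such as at least 5.0 g/L, such as at least 7.5 g/L, such as at least 10.0 g/L or more, such as at least 12.5 g/L, such as at least 15 g/L, such as at least 20 g/L, such as at least 25 g/L, such as at least 50 g/L, such as at least 75 g/L, such as at least 100 g/L, such as at least 150 g/L, such as at least 250 g/L or more.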
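8. The method according to any one of the preceding claims, wherein the one or more compounds comprise butanone, wherein the culture broth comprises propionic acid or propionate, and/or
wherein the first enzyme is Caur_1461 as set forth in SEQ ID NO: 3, GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Dde1 as set forth in SEQ ID NO: 59, or Slip_0479 as set forth in SEQ ID NO: 4, or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity, and/or
wherein butanone is produced at a titer of at least 0.05 g/L, such as at least 0.075 g/L, such as at least 0.1 g/L, such as at least 0.2 g/L, such as at least 0.3 g/L, such as at least 0.4 g/L, such as at least 0.5 g/L, such as at least 0.75 g/L, such as at least 1.0 g/L, such as at least 2.0 g/L, such as at least 3.0 g/L, such as at least 4.0 g/L, such as at least 5.0 g/L, such as at least 7.5 g/L, such as at least 10.0 g/L or more, such as at least 12.5 g/L, such as at least 15 g/L, such as at least 20 g/L, such as at least 25 g/L, such as at least 50 g/L, such as at least 75 g/L, such as at least 100 g/L, such as at least 150 g/L, such as at least 250 g/L or more.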
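9. The method according to any one of the preceding claims, wherein the thermophilic cell expresses Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, wherein the thermophilic cell further expresses:
i) Slip_0880 as set forth in SEQ ID NO: 7 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone is produced; or
ii) Caur_1461 as set forth in SEQ ID NO: 3 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone and/or butanone is produced; or
iii) GHH_c20420 as set forth in SEQ ID NO: 1 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced; or
iv) Dde1 as set forth in SEQ ID NO: 59 and Dde2 as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least acetone is produced; or
v) Slip_0499 as set forth in SEQ ID NO: 2 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced; or
vi) Slip_0479 as set forth in SEQ ID NO: 4 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; whereby at least butanone is produced;
wherein Tle2 consists of Tle2 subunit A as set forth in SEQ ID NO: 19 (EC 2.8.3.8) and Tle2 subunit B as set forth in SEQ ID NO: 20 (EC 2.8.3.9), or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity,
preferably wherein the thermophilic cell expresses Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, and further expresses i) or ii).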
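10. A thermophilic cell capable of producing acetone and/or butanone and optionally isopropanol, wherein the cell is a bacterial cell or an archaeal cell and expresses:
i) a first enzyme consisting of an acetyl-CoA acetyltransferase (EC 2.3.1.9) selected from: Slip_0880 as set forth in SEQ ID NO: 7, Caur_1461 as set forth in SEQ ID NO: 3, GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Slip_0479 as set forth in SEQ ID NO: 4 and Dde1 as set forth in SEQ ID NO: 59, or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity;
ii) a second enzyme selected from an acetate CoA-transferase, a 3-oxoacid CoA-transferase, an acyl-CoA:acetate/3-ketoacid CoA-transferase and an acyl-CoA thioesterase II,
wherein the second enzyme is selected from: Tle2 and Dde2 as set forth in SEQ ID NO: 21 (EC 2.8.3.5), or a functional variant thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity,
wherein Tle2 consists of Tle2 subunit A as set forth in SEQ ID NO: 19 (EC 2.8.3.8) and Tle2 subunit B as set forth in SEQ ID NO: 20 (EC 2.8.3.9), or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity, and
iii) an acetoacetate decarboxylase (EC 4.1.1.4), wherein the acetoacetate decarboxylase is Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity;
whereby the cell is capable of converting acetyl-CoA to acetone, thereby producing acetone at a titer of at least 0.8 g/L;
and/or whereby the cell is capable of converting acetyl-CoA and propionyl-CoA to butanone, thereby producing butanone;
and
iv) optionally an isopropanol dehydrogenase (EC 1.1.1.80), wherein the isopropanol dehydrogenase is Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity,
whereby the cell is capable of further converting acetone to isopropanol, thereby producing isopropanol,
preferably wherein the thermophilic cell is as defined in any one of the preceding claims.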
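11. The thermophilic cell according to claim 10, wherein the thermophilic cell expresses Cac as set forth in SEQ ID NO: 28 or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, wherein the thermophilic cell further expresses:
i) Slip_0880 as set forth in SEQ ID NO: 7 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; or
ii) Caur_1461 as set forth in SEQ ID NO: 3 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; or
iii) GHH_c20420 as set forth in SEQ ID NO: 1 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; or
iv) Dde1 as set forth in SEQ ID NO: 59 and Dde2 as set forth in SEQ ID NO: 21, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; or
v) Slip_0499 as set forth in SEQ ID NO: 2 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity; or
vi) Slip_0479 as set forth in SEQ ID NO: 4 and Tle2, or functional variants thereof having at least 70% identity or similarity thereto and having, respectively, acetyl-CoA acetyltransferase activity or acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity;
wherein Tle2 consists of Tle2 subunit A as set forth in SEQ ID NO: 19 (EC 2.8.3.8) and Tle2 subunit B as set forth in SEQ ID NO: 20 (EC 2.8.3.9), or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity.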
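12. The thermophilic cell according to claim 11, wherein the thermophilic cell expresses i) or ii).
13. The thermophilic cell according to any one of claims 10 to 12, wherein the thermophilic cell further expresses Tbr as set forth in SEQ ID NO: 29 or a functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity.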
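14. A nucleic acid construct for modifying a thermophilic cell selected from a thermophilic bacterial cell and a thermophilic archaeal cell, the nucleic acid construct comprising:
i) a polynucleotide encoding an acetyl-CoA acetyltransferase (EC 2.3.1.9) or a functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity, wherein the acetyl-CoA acetyltransferase is selected from Slip_0880 as set forth in SEQ ID NO: 7, Caur_1461 as set forth in SEQ ID NO: 3, GHH_c20420 as set forth in SEQ ID NO: 1, Slip_0499 as set forth in SEQ ID NO: 2, Slip_0479 as set forth in SEQ ID NO: 4 and Dde1 as set forth in SEQ ID NO: 59;
ii) a polynucleotide encoding a second enzyme selected from an acetate CoA-transferase, a 3-oxoacid CoA-transferase, an acyl-CoA:acetate/3-ketoacid CoA-transferase and an acyl-CoA thioesterase II,
wherein the second enzyme is selected from: Tle2 and Dde2 as set forth in SEQ ID NO: 21 (EC 2.8.3.5), or a functional variant thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity,
wherein Tle2 consists of Tle2 subunit A as set forth in SEQ ID NO: 19 (EC 2.8.3.8) and Tle2 subunit B as set forth in SEQ ID NO: 20 (EC 2.8.3.9), or functional variants thereof having at least 70% identity or similarity thereto and having acetate CoA-transferase, 3-oxoacid CoA-transferase, acyl-CoA:acetate/3-ketoacid CoA-transferase or acyl-CoA thioesterase II activity, and
iii) a polynucleotide encoding an acetoacetate decarboxylase (EC 4.1.1.4), or a functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity, wherein the acetoacetate decarboxylase is Cac as set forth in SEQ ID NO: 28, and
iv) optionally a polynucleotide encoding an isopropanol dehydrogenase (EC 1.1.1.80), wherein the isopropanol dehydrogenase is Tbr as set forth in SEQ ID NO: 29, or a functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity.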
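15. The nucleic acid construct according to claim 14,
wherein the polynucleotide encoding the acetyl-CoA acetyltransferase (EC 2.3.1.9) or the functional variant thereof having at least 70% identity or similarity thereto and having acetyl-CoA acetyltransferase activity is selected from SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 36 and SEQ ID NO: 61, or a homologue thereof having at least 70% identity thereto;
and/or
wherein the polynucleotide encoding the second enzyme is selected from:
iii) SEQ ID NO: 48 and SEQ ID NO: 49, or homologues thereof having at least 70% identity thereto; and
iv) SEQ ID NO: 50 or a homologue thereof having at least 70% identity thereto;
and/or
wherein the polynucleotide encoding the acetoacetate decarboxylase or the functional variant thereof having at least 70% identity or similarity thereto and having acetoacetate decarboxylase activity is SEQ ID NO: 57 or a homologue thereof having at least 70% identity thereto.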
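16. The nucleic acid construct according to any one of claims 14 to 15, wherein the polynucleotide encoding the isopropanol dehydrogenase or the functional variant thereof having at least 70% identity or similarity thereto and having isopropanol dehydrogenase activity is SEQ ID NO: 58 or a homologue thereof having at least 70% identity thereto.
17. A vector comprising the nucleic acid construct according to any one of claims 14 to 16.
18. A thermophilic cell comprising the nucleic acid construct according to any one of claims 14 to 16 and/or the vector according to claim 17, wherein the thermophilic cell is a thermophilic bacterial cell or a thermophilic archaeal cell.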
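The claims above repeatedly refer to functional variants having at least 70% identity to a reference sequence. As a rough illustration only, the Python sketch below computes percent identity for two sequences that have already been aligned (gaps written as `-`) and compares the result with a threshold. The function names, the toy sequences and the choice of denominator (all alignment columns that are not gap-only) are assumptions made for this example; the claims as reproduced here do not specify the alignment method or the identity definition, and other conventions give different values.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns holding the same residue in both sequences.

    Assumes the inputs are pre-aligned and of equal length; columns that are
    gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a, aligned_b, threshold=70.0):
    """True if the aligned pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy, hypothetical aligned fragments (not taken from the sequence listing).
reference = "MRDVFVVEGL"
variant = "MRDVFVIEGL"
print(percent_identity(reference, variant), meets_threshold(reference, variant))
```

In practice the alignment itself would come from a dedicated tool, and the resulting identity depends on the scoring scheme and gap penalties used there.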
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP20193767 | 2020-09-01 | ||
EP20193767.9 | 2020-09-01 | ||
PCT/EP2021/074134 WO2022049125A1 (en) | 2020-09-01 | 2021-09-01 | Methods and cells for production of volatile compounds |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116783289A (zh) | 2023-09-19 |
Family
ID=72322317
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202180071428.3A Pending CN116783289A (zh) | 2020-09-01 | 2021-09-01 | Methods and cells for production of volatile compounds |
Country Status (6)
Country | Link |
---|---|
US (1) | US20240026391A1 (zh) |
EP (1) | EP4208541A1 (zh) |
CN (1) | CN116783289A (zh) |
AU (1) | AU2021335406A1 (zh) |
CA (1) | CA3191268A1 (zh) |
WO (1) | WO2022049125A1 (zh) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2798452C (en) * | 2010-05-05 | 2019-09-03 | Mascoma Corporation | Detoxification of biomass derived acetate via metabolic conversion to ethanol, acetone, isopropanol, or ethyl acetate |
-
2021
- 2021-09-01 US US18/043,177 patent/US20240026391A1/en active Pending
- 2021-09-01 WO PCT/EP2021/074134 patent/WO2022049125A1/en active Application Filing
- 2021-09-01 AU AU2021335406A patent/AU2021335406A1/en active Pending
- 2021-09-01 CA CA3191268A patent/CA3191268A1/en active Pending
- 2021-09-01 EP EP21769462.9A patent/EP4208541A1/en active Pending
- 2021-09-01 CN CN202180071428.3A patent/CN116783289A/zh active Pending
Also Published As
Publication number | Publication date |
---|---|
AU2021335406A1 (en) | 2023-05-11 |
AU2021335406A9 (en) | 2023-07-06 |
CA3191268A1 (en) | 2022-03-10 |
WO2022049125A1 (en) | 2022-03-10 |
US20240026391A1 (en) | 2024-01-25 |
EP4208541A1 (en) | 2023-07-12 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||