CN114774477A - 使用了重组酵母的乙醇的制造方法 - Google Patents
使用了重组酵母的乙醇的制造方法 Download PDFInfo
- Publication number
- CN114774477A CN114774477A CN202111253587.3A CN202111253587A CN114774477A CN 114774477 A CN114774477 A CN 114774477A CN 202111253587 A CN202111253587 A CN 202111253587A CN 114774477 A CN114774477 A CN 114774477A
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- gly
- val
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 title claims abstract description 357
- 240000004808 Saccharomyces cerevisiae Species 0.000 title claims abstract description 247
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 50
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 claims abstract description 122
- 108010081577 aldehyde dehydrogenase (NAD(P)+) Proteins 0.000 claims abstract description 78
- 238000000855 fermentation Methods 0.000 claims abstract description 73
- 230000004151 fermentation Effects 0.000 claims abstract description 73
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 claims abstract description 63
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 claims abstract description 61
- 108700040099 Xylose isomerases Proteins 0.000 claims abstract description 49
- 108090000623 proteins and genes Proteins 0.000 claims description 237
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 85
- 102000004169 proteins and genes Human genes 0.000 claims description 61
- 230000000694 effects Effects 0.000 claims description 56
- 235000018102 proteins Nutrition 0.000 claims description 56
- 238000000034 method Methods 0.000 claims description 37
- 241000588724 Escherichia coli Species 0.000 claims description 33
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims description 28
- IKHGUXGNUITLKF-UHFFFAOYSA-N Acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 claims description 24
- 241000195597 Chlamydomonas reinhardtii Species 0.000 claims description 24
- 241000193454 Clostridium beijerinckii Species 0.000 claims description 24
- 102000004190 Enzymes Human genes 0.000 claims description 20
- 108090000790 Enzymes Proteins 0.000 claims description 20
- 108091022915 xylulokinase Proteins 0.000 claims description 17
- 239000001913 cellulose Substances 0.000 claims description 16
- 229920002678 cellulose Polymers 0.000 claims description 16
- 230000014509 gene expression Effects 0.000 claims description 12
- 230000002829 reductive effect Effects 0.000 claims description 12
- 230000037361 pathway Effects 0.000 claims description 11
- 230000004108 pentose phosphate pathway Effects 0.000 claims description 10
- 102100028601 Transaldolase Human genes 0.000 claims description 8
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 7
- 229960001230 asparagine Drugs 0.000 claims description 7
- 235000009582 asparagine Nutrition 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 7
- 108010043652 Transketolase Proteins 0.000 claims description 6
- 235000018417 cysteine Nutrition 0.000 claims description 6
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 6
- 108020004530 Transaldolase Proteins 0.000 claims description 5
- 102000014701 Transketolase Human genes 0.000 claims description 5
- 230000003647 oxidation Effects 0.000 claims description 5
- 238000007254 oxidation reaction Methods 0.000 claims description 5
- 102000007382 Ribose-5-phosphate isomerase Human genes 0.000 claims description 3
- 108060007030 Ribulose-phosphate 3-epimerase Proteins 0.000 claims description 3
- 102100039270 Ribulose-phosphate 3-epimerase Human genes 0.000 claims description 3
- 108020005610 ribose 5-phosphate isomerase Proteins 0.000 claims description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 2
- IKHGUXGNUITLKF-XPULMUKRSA-N acetaldehyde Chemical compound [14CH]([14CH3])=O IKHGUXGNUITLKF-XPULMUKRSA-N 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 abstract description 135
- 239000002609 medium Substances 0.000 abstract description 52
- 239000001963 growth medium Substances 0.000 abstract description 5
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 239
- 108020004414 DNA Proteins 0.000 description 124
- 241000282326 Felis catus Species 0.000 description 79
- 239000013612 plasmid Substances 0.000 description 41
- 239000013598 vector Substances 0.000 description 40
- 101150078509 ADH2 gene Proteins 0.000 description 39
- 210000004027 cell Anatomy 0.000 description 33
- 101150024975 mhpF gene Proteins 0.000 description 33
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 31
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 29
- 239000003550 marker Substances 0.000 description 29
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 28
- 230000006801 homologous recombination Effects 0.000 description 28
- 238000002744 homologous recombination Methods 0.000 description 28
- 239000012634 fragment Substances 0.000 description 26
- 108010050848 glycylleucine Proteins 0.000 description 26
- 108091028043 Nucleic acid sequence Proteins 0.000 description 25
- 125000000539 amino acid group Chemical group 0.000 description 25
- 108010057821 leucylproline Proteins 0.000 description 24
- 239000002773 nucleotide Substances 0.000 description 23
- 125000003729 nucleotide group Chemical group 0.000 description 23
- 108010047495 alanylglycine Proteins 0.000 description 20
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 20
- 108010037850 glycylvaline Proteins 0.000 description 20
- 239000002028 Biomass Substances 0.000 description 19
- 101150021974 Adh1 gene Proteins 0.000 description 18
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 18
- 108010034529 leucyl-lysine Proteins 0.000 description 18
- 238000011144 upstream manufacturing Methods 0.000 description 18
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 17
- 101150050575 URA3 gene Proteins 0.000 description 17
- 229940088598 enzyme Drugs 0.000 description 17
- 239000008103 glucose Substances 0.000 description 17
- 108010059892 Cellulase Proteins 0.000 description 16
- 101100017604 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) HOR7 gene Proteins 0.000 description 16
- 108010026333 seryl-proline Proteins 0.000 description 16
- 238000012360 testing method Methods 0.000 description 16
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 15
- 108010013835 arginine glutamate Proteins 0.000 description 14
- 108010077245 asparaginyl-proline Proteins 0.000 description 14
- 108010047754 beta-Glucosidase Proteins 0.000 description 14
- 108010049041 glutamylalanine Proteins 0.000 description 14
- 108010003700 lysyl aspartic acid Proteins 0.000 description 14
- 108010017391 lysylvaline Proteins 0.000 description 14
- 239000000243 solution Substances 0.000 description 14
- 235000000346 sugar Nutrition 0.000 description 14
- 108010073969 valyllysine Proteins 0.000 description 14
- 229940106157 cellulase Drugs 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- 101150052008 TKL-1 gene Proteins 0.000 description 12
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 12
- 108010038633 aspartylglutamate Proteins 0.000 description 12
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 11
- 101150014383 adhE gene Proteins 0.000 description 11
- 238000002360 preparation method Methods 0.000 description 11
- 101150005492 rpe1 gene Proteins 0.000 description 11
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 10
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 10
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 10
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 10
- 241000235060 Scheffersomyces stipitis Species 0.000 description 10
- 108010087924 alanylproline Proteins 0.000 description 10
- 238000010586 diagram Methods 0.000 description 10
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 10
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 10
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 10
- 108010078274 isoleucylvaline Proteins 0.000 description 10
- 108010064235 lysylglycine Proteins 0.000 description 10
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 10
- 108010053725 prolylvaline Proteins 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- 238000012546 transfer Methods 0.000 description 10
- 229920001817 Agar Polymers 0.000 description 9
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 9
- 108700010070 Codon Usage Proteins 0.000 description 9
- 239000008272 agar Substances 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 101150115276 tal1 gene Proteins 0.000 description 9
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 8
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 8
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 8
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 8
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 8
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 8
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 8
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 8
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 8
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 238000002105 Southern blotting Methods 0.000 description 8
- 102100029089 Xylulose kinase Human genes 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 108010062796 arginyllysine Proteins 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 108010016616 cysteinylglycine Proteins 0.000 description 8
- 108010077515 glycylproline Proteins 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 101150003679 hsaG gene Proteins 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 230000000968 intestinal effect Effects 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 108010038320 lysylphenylalanine Proteins 0.000 description 8
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 108010029020 prolylglycine Proteins 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 150000003839 salts Chemical class 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- 239000002344 surface layer Substances 0.000 description 8
- 241000894006 Bacteria Species 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 7
- 108020004705 Codon Proteins 0.000 description 7
- 101150118141 GRE3 gene Proteins 0.000 description 7
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 description 7
- 101150012255 RKI1 gene Proteins 0.000 description 7
- 150000001299 aldehydes Chemical class 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 229940024606 amino acid Drugs 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 6
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 6
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 6
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 6
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 6
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 6
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 6
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 6
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 6
- 108010053754 Aldehyde reductase Proteins 0.000 description 6
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 6
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 6
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 6
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 6
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 6
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 6
- 241000222178 Candida tropicalis Species 0.000 description 6
- ZAQJHHRNXZUBTE-WUJLRWPWSA-N D-xylulose Chemical compound OC[C@@H](O)[C@H](O)C(=O)CO ZAQJHHRNXZUBTE-WUJLRWPWSA-N 0.000 description 6
- 101150025279 DIT1 gene Proteins 0.000 description 6
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 6
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 6
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 6
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 6
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- 101100246753 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) pyrF gene Proteins 0.000 description 6
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 6
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 6
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 6
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 6
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 6
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 6
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 6
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 6
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 6
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 6
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 6
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 6
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 6
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 6
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 6
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 6
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 6
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 6
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 6
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 6
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 6
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- 150000001413 amino acids Chemical class 0.000 description 6
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 229940041514 candida albicans extract Drugs 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010040030 histidinoalanine Proteins 0.000 description 6
- 108010027338 isoleucylcysteine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 244000005700 microbiome Species 0.000 description 6
- 239000002245 particle Substances 0.000 description 6
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 6
- 229940035893 uracil Drugs 0.000 description 6
- 239000012138 yeast extract Substances 0.000 description 6
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 5
- 101710187578 Alcohol dehydrogenase 1 Proteins 0.000 description 5
- 108010058076 D-xylulose reductase Proteins 0.000 description 5
- 229920002488 Hemicellulose Polymers 0.000 description 5
- 241000866500 Reticulitermes speratus Species 0.000 description 5
- 101100069420 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GRE3 gene Proteins 0.000 description 5
- 101100103120 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) XKS1 gene Proteins 0.000 description 5
- 108700005078 Synthetic Genes Proteins 0.000 description 5
- 101150100773 XKS1 gene Proteins 0.000 description 5
- 101150032677 Xk gene Proteins 0.000 description 5
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 4
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 4
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 4
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 4
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 4
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 4
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 4
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 4
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 4
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 4
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 4
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 4
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 4
- 101100389688 Arabidopsis thaliana AERO1 gene Proteins 0.000 description 4
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 4
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 4
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 4
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 4
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 4
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 4
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 4
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 4
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 4
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 4
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 4
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 4
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 4
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 4
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 4
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 4
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 4
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 4
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 4
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 4
- 101150047030 ERO1 gene Proteins 0.000 description 4
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 4
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 4
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 4
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 4
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 4
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 4
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 4
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 4
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 4
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 4
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 4
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 4
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 4
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 4
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 4
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 4
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 4
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 4
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 4
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 4
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 4
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 4
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 4
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 4
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 4
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 4
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 4
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 4
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 4
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 4
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- GMIWMPUGTFQFHK-KCTSRDHCSA-N His-Ala-Trp Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O GMIWMPUGTFQFHK-KCTSRDHCSA-N 0.000 description 4
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 4
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 4
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 4
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 4
- 101000891113 Homo sapiens T-cell acute lymphocytic leukemia protein 1 Proteins 0.000 description 4
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 4
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 4
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 4
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 4
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 4
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 4
- VLCMCYDZJCWPQT-VKOGCVSHSA-N Ile-Met-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N VLCMCYDZJCWPQT-VKOGCVSHSA-N 0.000 description 4
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 4
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 4
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 4
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 4
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 4
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 4
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 4
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 4
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 4
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 4
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 4
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 4
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 4
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 4
- 102000003960 Ligases Human genes 0.000 description 4
- 108090000364 Ligases Proteins 0.000 description 4
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 4
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 4
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 4
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 4
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 4
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 4
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 4
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 4
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 4
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 4
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- 239000001888 Peptone Substances 0.000 description 4
- 108010080698 Peptones Proteins 0.000 description 4
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 4
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 4
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 4
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 4
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 4
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 4
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 4
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 4
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 4
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 4
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 4
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 4
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 4
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 4
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 4
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 4
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 4
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 4
- 101000702553 Schistosoma mansoni Antigen Sm21.7 Proteins 0.000 description 4
- 101000714192 Schistosoma mansoni Tegument antigen Proteins 0.000 description 4
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 4
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 4
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 4
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 description 4
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 4
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 4
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 4
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 4
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 4
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 4
- 102100040365 T-cell acute lymphocytic leukemia protein 1 Human genes 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 4
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 4
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 4
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 4
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 4
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 4
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 4
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 4
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 4
- MOCXXGZHHSPNEJ-AVGNSLFASA-N Tyr-Cys-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O MOCXXGZHHSPNEJ-AVGNSLFASA-N 0.000 description 4
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 4
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 4
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 4
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 4
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 4
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 4
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 4
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 4
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 4
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 4
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 4
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 4
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 4
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 4
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 4
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 4
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 4
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 4
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 4
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 4
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 4
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 4
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 4
- 101150101877 XI gene Proteins 0.000 description 4
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 4
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010011559 alanylphenylalanine Proteins 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 102000006995 beta-Glucosidase Human genes 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 108010009297 diglycyl-histidine Proteins 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 230000004060 metabolic process Effects 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 235000019319 peptone Nutrition 0.000 description 4
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 229910052708 sodium Inorganic materials 0.000 description 4
- 239000011734 sodium Substances 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 239000001509 sodium citrate Substances 0.000 description 4
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000001131 transforming effect Effects 0.000 description 4
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 4
- 108010029384 tryptophyl-histidine Proteins 0.000 description 4
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- 108010009962 valyltyrosine Proteins 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- 230000004127 xylose metabolism Effects 0.000 description 4
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 3
- 101100517196 Arabidopsis thaliana NRPE1 gene Proteins 0.000 description 3
- 101100190825 Bos taurus PMEL gene Proteins 0.000 description 3
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 3
- 241000222173 Candida parapsilosis Species 0.000 description 3
- FNZLKVNUWIIPSJ-UHNVWZDZSA-N D-ribulose 5-phosphate Chemical compound OCC(=O)[C@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-UHNVWZDZSA-N 0.000 description 3
- FNZLKVNUWIIPSJ-RFZPGFLSSA-N D-xylulose 5-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-RFZPGFLSSA-N 0.000 description 3
- 101100073341 Oryza sativa subsp. japonica KAO gene Proteins 0.000 description 3
- 108090001066 Racemases and epimerases Proteins 0.000 description 3
- 102000004879 Racemases and epimerases Human genes 0.000 description 3
- 101710094436 Transaldolase 1 Proteins 0.000 description 3
- 101710094544 Transketolase 1 Proteins 0.000 description 3
- LIPOUNRJVLNBCD-UHFFFAOYSA-N acetyl dihydrogen phosphate Chemical compound CC(=O)OP(O)(O)=O LIPOUNRJVLNBCD-UHFFFAOYSA-N 0.000 description 3
- 230000001476 alcoholic effect Effects 0.000 description 3
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 229940055022 candida parapsilosis Drugs 0.000 description 3
- 239000005515 coenzyme Substances 0.000 description 3
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 3
- 239000010410 layer Substances 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 2
- 108010036211 5-HT-moduline Proteins 0.000 description 2
- SEHFUALWMUWDKS-UHFFFAOYSA-N 5-fluoroorotic acid Chemical compound OC(=O)C=1NC(=O)NC(=O)C=1F SEHFUALWMUWDKS-UHFFFAOYSA-N 0.000 description 2
- 108010049926 Acetate-CoA ligase Proteins 0.000 description 2
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 2
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 2
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 2
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 2
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 2
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- XIWKVCDQMCNKOZ-UVBJJODRSA-N Ala-Met-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XIWKVCDQMCNKOZ-UVBJJODRSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 102000016912 Aldehyde Reductase Human genes 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 2
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 2
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 2
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 2
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 2
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 2
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 2
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 2
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 2
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 2
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 2
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 2
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 2
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 2
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 2
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 2
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 2
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 2
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 2
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 2
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 2
- 240000006439 Aspergillus oryzae Species 0.000 description 2
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 102000005575 Cellulases Human genes 0.000 description 2
- 108010084185 Cellulases Proteins 0.000 description 2
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 2
- 241000193169 Clostridium cellulovorans Species 0.000 description 2
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 2
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 2
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 2
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 2
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 2
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 2
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 2
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 2
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 2
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 2
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 2
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 2
- KCPOQGRVVXYLAC-KKUMJFAQSA-N Cys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KCPOQGRVVXYLAC-KKUMJFAQSA-N 0.000 description 2
- XXDATQFUGMAJRV-XIRDDKMYSA-N Cys-Leu-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XXDATQFUGMAJRV-XIRDDKMYSA-N 0.000 description 2
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 2
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 2
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 2
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 101150002721 GPD2 gene Proteins 0.000 description 2
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 2
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 2
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 2
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 2
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 2
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 2
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 2
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 2
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 2
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 2
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 2
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 2
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 2
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 2
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 2
- OFIHURVSQXAZIR-SZMVWBNQSA-N Glu-Lys-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OFIHURVSQXAZIR-SZMVWBNQSA-N 0.000 description 2
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 2
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 2
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 2
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 2
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 2
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- 102100036669 Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Human genes 0.000 description 2
- 101150043522 HO gene Proteins 0.000 description 2
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 2
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 2
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 2
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 2
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 2
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 2
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 2
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 2
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 2
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 2
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 2
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 2
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 2
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 2
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 2
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 2
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 2
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 2
- YXASFUBDSDAXQD-UWVGGRQHSA-N His-Met-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O YXASFUBDSDAXQD-UWVGGRQHSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 2
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 2
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 2
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 2
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 2
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 2
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 2
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 2
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 2
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 2
- 101001072574 Homo sapiens Glycerol-3-phosphate dehydrogenase [NAD(+)], cytoplasmic Proteins 0.000 description 2
- 101001009678 Homo sapiens Glycerol-3-phosphate dehydrogenase, mitochondrial Proteins 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 2
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 2
- VCYVLFAWCJRXFT-HJPIBITLSA-N Ile-Cys-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N VCYVLFAWCJRXFT-HJPIBITLSA-N 0.000 description 2
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 2
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 2
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 2
- JNDYZNJRRNFYIR-VGDYDELISA-N Ile-His-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N JNDYZNJRRNFYIR-VGDYDELISA-N 0.000 description 2
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 2
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 2
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 2
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 2
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- 241000256602 Isoptera Species 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 2
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 2
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 2
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- CFVQPNSCQMKDPB-CIUDSAMLSA-N Lys-Cys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N CFVQPNSCQMKDPB-CIUDSAMLSA-N 0.000 description 2
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 2
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 2
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 2
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 2
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 2
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 2
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 2
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 2
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 2
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 2
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 2
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 2
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 2
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 2
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 2
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 2
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 2
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 2
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 2
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 2
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 2
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 2
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 2
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 2
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 2
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 2
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 2
- MFDDVIJCQYOOES-GUBZILKMSA-N Met-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N MFDDVIJCQYOOES-GUBZILKMSA-N 0.000 description 2
- 101000689035 Mus musculus Ribulose-phosphate 3-epimerase Proteins 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- 101100490537 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) adh-1 gene Proteins 0.000 description 2
- 101000729343 Oryza sativa subsp. japonica Ribulose-phosphate 3-epimerase, cytoplasmic isoform Proteins 0.000 description 2
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 2
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 2
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 2
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 2
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 2
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 2
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- DZVXMMSUWWUIQE-ACRUOGEOSA-N Phe-His-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N DZVXMMSUWWUIQE-ACRUOGEOSA-N 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 2
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- OAOLATANIHTNCZ-IHRRRGAJSA-N Phe-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N OAOLATANIHTNCZ-IHRRRGAJSA-N 0.000 description 2
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 2
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 2
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 2
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- 241001265066 Piromyces sp. E2 Species 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 2
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 2
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 2
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 2
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 2
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 2
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 2
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 2
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 2
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 2
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 2
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- RJTUIDFUUHPJMP-FHWLQOOXSA-N Pro-Trp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O RJTUIDFUUHPJMP-FHWLQOOXSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 2
- 241001509970 Reticulitermes <genus> Species 0.000 description 2
- 241001509967 Reticulitermes flavipes Species 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 241000192263 Scheffersomyces shehatae Species 0.000 description 2
- 101000928111 Scheffersomyces stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545) Alcohol dehydrogenase 1 Proteins 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- 108091058545 Secretory proteins Proteins 0.000 description 2
- 102000040739 Secretory proteins Human genes 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 2
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 2
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 2
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 2
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- 241001215623 Talaromyces cellulolyticus Species 0.000 description 2
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 2
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 2
- VLIUBAATANYCOY-GBALPHGKSA-N Thr-Cys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VLIUBAATANYCOY-GBALPHGKSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 2
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 2
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 241000499912 Trichoderma reesei Species 0.000 description 2
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 2
- RSUXQZNWAOTBQF-XIRDDKMYSA-N Trp-Arg-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RSUXQZNWAOTBQF-XIRDDKMYSA-N 0.000 description 2
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 2
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 2
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 2
- AWEGFIJXYWGBCA-XIRDDKMYSA-N Trp-His-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AWEGFIJXYWGBCA-XIRDDKMYSA-N 0.000 description 2
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 2
- SLOYNOMYOAOUCX-BVSLBCMMSA-N Trp-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SLOYNOMYOAOUCX-BVSLBCMMSA-N 0.000 description 2
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 2
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 2
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 2
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 2
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- RIFVTNDKUMSSMN-ULQDDVLXSA-N Tyr-His-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O RIFVTNDKUMSSMN-ULQDDVLXSA-N 0.000 description 2
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 2
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 2
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 2
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 2
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 2
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 2
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 2
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 2
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 2
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 2
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 2
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- PPQRONHOSHZGFQ-LMVFSUKVSA-N aldehydo-D-ribose 5-phosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-LMVFSUKVSA-N 0.000 description 2
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010066119 arginyl-leucyl-aspartyl-serine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 2
- 108010078007 aspartyl-aspartyl-asparagyl-lysyl-isoleucine Proteins 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 2
- 239000005516 coenzyme A Substances 0.000 description 2
- 229940093530 coenzyme a Drugs 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- HYBBIBNJHNGZAN-UHFFFAOYSA-N furfural Chemical compound O=CC1=CC=CO1 HYBBIBNJHNGZAN-UHFFFAOYSA-N 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 101150087371 gpd1 gene Proteins 0.000 description 2
- 229940059442 hemicellulase Drugs 0.000 description 2
- 108010002430 hemicellulase Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 2
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 description 2
- 230000037353 metabolic pathway Effects 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 108091070501 miRNA Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 238000007747 plating Methods 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 238000010298 pulverizing process Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 239000004055 small Interfering RNA Substances 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108700026220 vif Genes Proteins 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- 239000000811 xylitol Substances 0.000 description 2
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 2
- 229960002675 xylitol Drugs 0.000 description 2
- 235000010447 xylitol Nutrition 0.000 description 2
- WBQJTPDOGLYTBE-VIFPVBQESA-N 1-nitroso-L-tryptophan Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CN(N=O)C2=C1 WBQJTPDOGLYTBE-VIFPVBQESA-N 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- 101150090724 3 gene Proteins 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- 101150101112 7 gene Proteins 0.000 description 1
- 101150105694 ACS1 gene Proteins 0.000 description 1
- 101150026650 ALD4 gene Proteins 0.000 description 1
- 101150076082 ALD5 gene Proteins 0.000 description 1
- 101150021180 ALD6 gene Proteins 0.000 description 1
- 101150078335 ALDH1 gene Proteins 0.000 description 1
- 102100035709 Acetyl-coenzyme A synthetase, cytoplasmic Human genes 0.000 description 1
- 101710187573 Alcohol dehydrogenase 2 Proteins 0.000 description 1
- 101710133776 Alcohol dehydrogenase class-3 Proteins 0.000 description 1
- 241000228215 Aspergillus aculeatus Species 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 101100434663 Bacillus subtilis (strain 168) fbaA gene Proteins 0.000 description 1
- 241000606123 Bacteroides thetaiotaomicron Species 0.000 description 1
- 241000186018 Bifidobacterium adolescentis Species 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 241001509964 Coptotermes Species 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 241001665144 Cyllamyces aberensis Species 0.000 description 1
- 101150038170 ENA1 gene Proteins 0.000 description 1
- 108010001817 Endo-1,4-beta Xylanases Proteins 0.000 description 1
- 241000224432 Entamoeba histolytica Species 0.000 description 1
- 241000588914 Enterobacter Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 101150095274 FBA1 gene Proteins 0.000 description 1
- 101150051414 FPS1 gene Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 1
- 108010018080 L-arabinose isomerase Proteins 0.000 description 1
- 108090000416 L-ribulose-5-phosphate 4-epimerases Proteins 0.000 description 1
- 241000933069 Lachnoclostridium phytofermentans Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102100037214 Orotidine 5'-phosphate decarboxylase Human genes 0.000 description 1
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241000235647 Pachysolen tannophilus Species 0.000 description 1
- 108700023175 Phosphate acetyltransferases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 101100010928 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) tuf gene Proteins 0.000 description 1
- 101100099697 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TKL2 gene Proteins 0.000 description 1
- 101000832889 Scheffersomyces stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545) Alcohol dehydrogenase 2 Proteins 0.000 description 1
- 102100026974 Sorbitol dehydrogenase Human genes 0.000 description 1
- 241001468239 Streptomyces murinus Species 0.000 description 1
- 101150001810 TEAD1 gene Proteins 0.000 description 1
- 101150074253 TEF1 gene Proteins 0.000 description 1
- 102100029898 Transcriptional enhancer factor TEF-1 Human genes 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 102000014384 Type C Phospholipases Human genes 0.000 description 1
- 108010079194 Type C Phospholipases Proteins 0.000 description 1
- 229940100228 acetyl coenzyme a Drugs 0.000 description 1
- 230000002546 agglutinic effect Effects 0.000 description 1
- PYMYPHUHKUWMLA-WISUUJSJSA-N aldehydo-L-xylose Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WISUUJSJSA-N 0.000 description 1
- 229930195726 aldehydo-L-xylose Natural products 0.000 description 1
- 239000012670 alkaline solution Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 101150052795 cbh-1 gene Proteins 0.000 description 1
- 101150114858 cbh2 gene Proteins 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- FYGDTMLNYKFZSV-ZWSAEMDYSA-N cellotriose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)[C@H](O)[C@H]1O FYGDTMLNYKFZSV-ZWSAEMDYSA-N 0.000 description 1
- 239000012295 chemical reaction liquid Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000006356 dehydrogenation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 229940007078 entamoeba histolytica Drugs 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 239000000413 hydrolysate Substances 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 238000010335 hydrothermal treatment Methods 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000002608 ionic liquid Substances 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 229920005610 lignin Polymers 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 101150082324 lpp1 gene Proteins 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 150000003905 phosphatidylinositols Chemical class 0.000 description 1
- 230000000865 phosphorylative effect Effects 0.000 description 1
- 230000032361 posttranscriptional gene silencing Effects 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108700022487 rRNA Genes Proteins 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 108020002667 ribulokinase Proteins 0.000 description 1
- 101150018078 rpe gene Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 101150106193 tal gene Proteins 0.000 description 1
- 239000013076 target substance Substances 0.000 description 1
- 101150003389 tdh2 gene Proteins 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 150000004823 xylans Chemical class 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/06—Ethanol, i.e. non-beverage
- C12P7/08—Ethanol, i.e. non-beverage produced as by-product or from waste or cellulosic material substrate
- C12P7/10—Ethanol, i.e. non-beverage produced as by-product or from waste or cellulosic material substrate substrate containing cellulosic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/14—Fungi; Culture media therefor
- C12N1/16—Yeasts; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0008—Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Mycology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Botany (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Virology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
在具有木糖代谢能力的酵母的木糖同化和乙醇发酵时代谢培养基中的乙酸,降低乙酸浓度。乙醇的制造方法,具有以下工序:将导入了木糖异构酶基因和乙醛脱氢酶基因的重组酵母,用含有木糖的培养基进行培养,从而进行乙醇发酵木。
Description
本申请是申请日为2014年2月27日、申请号为201480010549.7、发明名称为“使用了重组酵母的乙醇的制造方法”的专利申请的分案申请。
技术领域
本发明涉及使用了具有木糖代谢能力的重组酵母的乙醇的制造方法。
背景技术
纤维素系生物质作为乙醇等有用醇和/或有机酸的原料而被有效利用。为了在利用了纤维素系生物质的乙醇制造中提高乙醇生产量,开发了作为底物能够利用5碳单糖木糖的酵母。例如,专利文献1公开了染色体中插入了来源于树干毕赤酵母(Pichiastipitis)的木糖还原酶基因和木糖醇脱氢酶基因以及来源于酿酒酵母(S.cerevisiae)的木酮糖激酶基因的酵母。
另外,已知乙酸在纤维素系生物质的水解物中大量含有,抑制酵母的乙醇发酵。特别地,已知在导入了木糖同化基因的酵母中,显著地抑制以木糖为糖源的乙醇发酵(非专利文献1和2)
另一方面,将用纤维素酶糖化纤维素系生物质而得的物质进行发酵后的醪中,主要包含未发酵残渣、难发酵残渣、酶和发酵用微生物。通过将包含醪的反应液在接着进行的发酵中利用,可以实现发酵用微生物的再利用,降低发酵用微生物的新加入量,实现低成本化。但是,此时,醪所含的乙酸也同时被带入,其结果是有时发酵培养基所含的乙酸的浓度上升,抑制乙醇发酵。另外,即使在将发酵槽内的醪送入保持减压的闪蒸罐,在闪蒸罐内除去乙醇后,将醪送回发酵槽这样的连续发酵法中,也难以从醪中去除乙酸,因而由乙酸产生的发酵抑制成为重要的问题。
为了避免由乙酸产生的发酵抑制,有通过乙醇发酵中一般使用的菌株酿酒酵母的LPP1基因和/或ENA1基因的过表达(非专利文献3)、FPS1基因的破坏(非专利文献4)来改善在乙酸存在下乙醇的发酵能力的报告。然而,这些报告是以葡萄糖作为底物时的乙醇发酵的结果,对由于乙酸而发酵被显著抑制的以木糖作为底物时的乙醇发酵的效果不明。另外,即使使用这些报告的突变酵母,也不能实现减少在如上所述的发酵菌体再利用和/或连续发酵时成为问题的乙酸的带入量。
作为避免由乙酸产生的发酵抑制的其他方法,考虑使培养基中的乙酸在乙醇发酵的同时代谢。但是,乙酸的代谢是需氧的反应,与乙醇的代谢途径重复。因此,虽然如果在需氧条件下进行发酵则可能使乙酸代谢,但同时目的物质乙醇也被代谢。
为了在乙醇不被代谢的厌氧条件下使乙酸代谢,有通过在破坏了酿酒酵母的甘油生产途径的GPD1·GPD2基因的菌株中导入编码乙醛脱氢酶(EC 1.2.1.10)的mhpF基因,从而同化乙酸的报告(非专利文献5、专利文献2)。此外,乙醛脱氢酶催化以下的可逆反应。
此外,利用GPD1·GPD2基因的甘油生产途径如下述式所示,是将代谢中产生的剩余的辅酶NADH氧化成NAD+的途径。
0.5葡萄糖+NADH+H++ATP→甘油+NAD++ADP+Pi
于是通过破坏GPD1·GPD2基因来破坏该反应途径,通过导入mhpF来供给剩余的辅酶NADH,进行下述的反应。
乙酰辅酶A+NADH+H+→乙醛+NAD++辅酶A
另外,乙酰辅酶A由乙酸通过乙酰辅酶A合成酶而合成,乙醛被转换成乙醇,因而最终形成下述反应式,在剩余的辅酶NADH被氧化的同时,乙酸也被代谢。
乙酸+2NADH+2H++ATP→乙醇+NAD++AMP+Pi
如上所述,对酵母赋予乙酸代谢能力的机理,需要破坏甘油途径。然而,已知GPD1基因和GPD2基因的重复破坏株发酵能力大幅降低,产业水平上的实用性低。另外,非专利文献5和专利文献2不是关于木糖同化酵母的报告,在木糖同化时是否有效果是不明的。
此外,还有对GPD1基因和GPD2基因的非破坏株导入mhpF基因的报告(非专利文献6)。然而,非专利文献6中虽然通过mhpF基因的导入而乙酸的生产量减少,但没有培养基中的乙酸减少这样的报告。进而,该非专利文献6也还不是关于木糖同化酵母的报告。
此外,有关于导入了木糖异构酶(XI)基因(来源于白蚁的肠内原生生物)的木糖同化酵母的报告(专利文献3)、以及对导入有XI基因(来源于Piromyces sp.E2)的木糖同化酵母进一步导入了乙醛脱氢酶基因(来源于青春双歧杆菌(Bifidobacteriumadolescentis0)的报告(专利文献4)。但是,这些文献中关于木糖同化时的乙酸同化几乎没有报告。
如上,现有技术中并未报告在同化木糖的同时进行乙醇发酵的条件下,使乙酸高效地代谢·分解的技术。
现有技术文献
专利文献
专利文献1:日本特开2009-195220号公报
专利文献2:WO 2011/010923
专利文献3:日本特开2011-147445号公报
专利文献4:日本特开2010-239925号公报
非专利文献
非专利文献1:FEMS Yeast Research,vol.9,2009,358-364
非专利文献2:Enzyme and Microbial Technology 33,2003,786-792
非专利文献3:Biotechnol.Bioeng.2009 103(3):500-512
非专利文献4:Biotechnol.Lett.2011 33:277-284
非专利文献5:Appl.Environ.Microbiol.2010 76:190-195
非专利文献6:Biotechnol.Lett.2011 33:1375-1380
发明内容
发明要解决的课题
因此,本发明鉴于上述事实,目的是特别地提供乙醇的制造方法,其中,使用了具有木糖代谢能力的酵母中的在木糖同化和乙醇发酵时能够代谢培养基中的乙酸而降低乙酸浓度的重组酵母。
用于解决课题的方法
为了实现上述目的,本发明者们进行了深入研究,结果发现,对具有木糖代谢能力的酵母导入特定的乙醛脱氢酶基因而得的重组酵母,在包含木糖的培养基中进行乙醇发酵时,能够代谢培养基中的乙酸,从而完成了本发明。
本发明包含以下内容。
(1)乙醇的制造方法,具有以下工序:将导入了木糖异构酶基因和乙醛脱氢酶基因的重组酵母,用含有木糖的培养基进行培养,从而进行乙醇发酵。
(2)根据(1)所述的乙醇的制造方法,其特征在于,所述木糖异构酶基因是编码以下的(a)或(b)的蛋白质的基因,
(a)具有序列号4所示的氨基酸序列的蛋白质,
(b)具有相对于序列号4所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有将木糖变为木酮糖的酶活性的蛋白质。
(3)根据(1)所述的乙醇的制造方法,其特征在于,所述乙醛脱氢酶基因编码来源于大肠杆菌的乙醛脱氢酶。
(4)根据(3)所述的乙醇的制造方法,其特征在于,所述来源于大肠杆菌的乙醛脱氢酶是以下的(a)或(b)的蛋白质,
(a)具有序列号2或20所示的氨基酸序列的蛋白质,
(b)具有相对于序列号2或20所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质。
(5)根据(1)所述的乙醇的制造方法,其特征在于,所述乙醛脱氢酶基因编码来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶。
(6)根据(5)所述的乙醇的制造方法,其特征在于,所述来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶是以下的(a)或(b)的蛋白质,
(a)具有序列号22所示的氨基酸序列的蛋白质,
(b)具有相对于序列号22所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质。
(7)根据(1)所述的乙醇的制造方法,其特征在于,所述乙醛脱氢酶基因编码来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶。
(8)根据(5)所述的乙醇的制造方法,其特征在于,所述来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶是以下的(a)或(b)的蛋白质,
(a)具有序列号24所示的氨基酸序列的蛋白质,
(b)具有相对于序列号24所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质。
(9)根据(1)所述的乙醇的制造方法,其特征在于,所述重组酵母是进一步导入了木酮糖激酶基因的重组酵母。
(10)根据(1)所述的乙醇的制造方法,其特征在于,所述重组酵母是导入了下述基因的重组酵母,所述基因编码选自构成戊糖磷酸途径中的非氧化过程的途径的酶群中的酶。
(11)根据(10)所述的乙醇的制造方法,其特征在于,构成戊糖磷酸途径中的非氧化过程的途径的酶群是核糖-5-磷酸异构酶、核酮糖-5-磷酸-3-差向异构酶、转酮醇酶和转醛醇酶。
(12)根据(1)所述的乙醇的制造方法,其特征在于,所述培养基含有纤维素,在所述乙醇发酵中,至少同时进行所述纤维素的糖化。
(13)根据(1)所述的乙醇的制造方法,其特征在于,所述重组酵母高表达具有将乙醛转换为乙醇的活性的醇脱氢酶基因。
(14)根据(1)所述的乙醇的制造方法,其特征在于,所述重组酵母是具有将乙醇转换为乙醛的活性的醇脱氢酶基因的表达量降低了的重组酵母。
本说明书包含作为本申请的优先权基础的日本专利申请2013-037501号和日本特愿2014-36652号的说明书和/或附图所记载的内容。。
发明的效果
在本发明的乙醇的制造方法中,可以降低培养基中的乙酸浓度,从而可以有效地避免由乙酸引起的发酵抑制。其结果是,本发明的乙醇的制造方法可以较高地维持以木糖作为糖源的乙醇发酵的效率,可以实现优异的乙醇收率。因此,本发明的乙醇的制造方法,例如,在再利用重组酵母时和/或在连续培养中使用时可以降低乙酸的带入量,维持优异的乙醇收率。
附图说明
图1是显示pUC-HIS3U-P_HOR7-XKS1-T_TDH3-P_TDH2-hph-T_CYC1-HIS3D的构成示意图。
图2是显示pUC-R67-HOR7p-RsXI-T_TDH3-TRP1d-R45的构成示意图。
图3是显示pUC-LEU2U-P_HOR7-TAL1-T_TDH3-P_HOR7-TKL1-T_TDH3-HIS3-LEU2D的构成示意图。
图4是显示pUC-GRE3U-P_HOR7-RPE1-T_TDH3-P_HOR7-RKI1-T_TDH3-LEU2-GRE3D的构成示意图。
图5是显示pCR-ADH2U-URA3-ADH2D的构成示意图。
图6是显示pCR-ADH2part-T_CYC1-P_TDH3-ADH1-T_ADH1-URA3-ADH2D的构成示意图。
图7pCR-ADH2part-T_CYC1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D的构成示意图。
图8是显示pCR-ADH2part-T_CYC1-P_TDH3-ADH1-T_ADH1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D的构成示意图。
图9是显示pCR-ADH2U-ERO1_T-mhpF-HOR7_P-URA3-ADH2D的构成示意图。
图10是显示pCR-ADH2U-P_TDH3-ADH1-T_ADH1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D的构成示意图。
图11是显示pCR-ADH2part-T_CYC1-URA3-ADH2D的构成示意图。
具体实施方式
以下,使用附图和实施例更详细地说明本发明。
本发明的乙醇的制造方法是,使用具有木糖代谢能力、且导入了乙醛脱氢酶基因的重组酵母,由培养基所含的糖源合成乙醇的方法。在本发明的乙醇的制造方法中,具有能够通过上述重组酵母来代谢培养基所含的乙酸,随着乙醇发酵而培养基中的乙酸浓度降低这样的特征。
<重组酵母>
本发明的乙醇的制造方法中使用的重组酵母,是导入了木糖异构酶基因和乙醛脱氢酶基因的酵母,并且是具有木糖代谢能力的酵母。这里,具有木糖代谢能力的酵母,包含通过对本来不具有木糖代谢能力的酵母导入木糖异构酶基因从而被赋予了木糖代谢能力的酵母、通过对本来不具有木糖代谢能力的酵母导入木糖异构酶基因和其他木糖代谢相关基因从而被赋予了木糖代谢能力的酵母酵母、本来具有木糖代谢能力的酵母的任一者的含义。
具有木糖代谢能力的酵母可以同化培养基中所含的木糖而生产乙醇。此外,培养基中所含的木糖,既可以是通过将以木糖作为构成糖的木聚糖和/或半纤维素等进行糖化的工艺而得的,也可以是培养基所含的木聚糖和/或半纤维素等被糖化酶糖化而供给至培养基中的。后者的情况下,是指所谓同时糖化发酵的体系。
作为木糖异构酶基因(XI基因),不特别限定,可以使用来源于任何生物种的基因。例如,可以不特别限制地使用日本特开2011-147445号公报所公开的来源于白蚁的肠内原生生物的多种木糖异构酶基因。另外,作为木糖异构酶基因,还可以利用来源于作为厌氧性的霉的Piromyces sp.E2种(特表2005-514951号公报)、来源于作为厌氧性的霉的Cyllamyces aberensi、来源于作为细菌的多形拟杆菌(Bacteroides thetaiotaomicron)、来源于作为细菌的Clostridium phytofermentans、来源于鼠灰链霉菌(Streptomycesmurinus)簇的基因。
具体地,作为木糖异构酶基因,优选使用来源于黄胸散白蚁(Reticulitermessperatus)肠内原生生物的木糖异构酶基因。该来源于黄胸散白蚁(Reticulitermessperatus)肠内原生生物的木糖异构酶基因的编码区的碱基序列和该基因所编码的蛋白质的氨基酸序列分别示于序列号3和4。
但是,作为木糖异构酶基因,不限于序列号3和4所特定的,也可以是碱基序列和/或氨基酸序列不同但处于旁系同源物(paralog)的关系或狭义的同源物的关系的基因。
另外,木糖异构酶基因不限于这些序列号3和4所特定的,例如,还可以是编码具有相对于序列号4的氨基酸序列有70%以上、优选80%以上、更优选90%以上、最优选95%以上的序列类似性或同一性的氨基酸序列,且具有木糖异构酶活性的蛋白质的基因。序列类似性和同一性的值,可以通过安装了BLAST算法的BLASTN和/或BLASTX程序来算出(默认的设定)。此外,序列类似性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基、和物理化学上功能类似的氨基酸残基的合计,作为比较的全氨基酸残基中的上述合计数的比例而算出。此外,同一性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基,作为比较的全氨基酸残基中的上述氨基酸残基数的比例算出。
进而,木糖异构酶基因不限于这些序列号3和4所特定的,还可以是例如,编码具有相对于序列号4的氨基酸序列置换、缺失、插入或添加了1或数个氨基酸的氨基酸序列、且具有木糖异构酶活性的蛋白质的基因。这里,数个例如为2~30个,优选为2~20个,更优选为2~10个,最优选为2~5个。
进而另外,木糖异构酶基因不限于这些序列号3和4所特定的,也可以是例如,与包含序列号3的碱基序列的DNA的互补链的全部或一部分在严格条件下杂交、且编码具有木糖异构酶活性的蛋白质的基因。这里所说的“严格条件”是指形成所谓特异性杂种、不形成非特异性杂种的条件,可以参照例如Molecular Cloning:A Laboratory Manual(ThirdEdition)来适宜确定。具体地,可以根据DNA杂交(southern hybridization)时的温度和/或溶液所含的盐浓度、和DNA杂交(southern hybridization)的洗涤工序时的温度和/或溶液所含的盐浓度来设定严格度。更详细地,作为严格条件,例如钠浓度为25~500mM、优选为25~300mM,温度为42~68℃、优选为42~65℃。更具体地为5×SSC(83mM NaCl、83mM柠檬酸钠)、温度42℃。
如上所述,关于包含与序列号3不同的碱基序列的基因、或编码与序列号4不同的氨基酸序列的基因是否作为木糖异构酶基因发挥功能,只要制作将该基因插入适当的启动子和终止子等之间而得的表达载体,使用该表达载体转化例如大肠杆菌等宿主,测定表达的蛋白质的木糖异构酶活性即可。木糖异构酶活性是指将木糖异构化为木酮糖的活性。因此,木糖异构酶活性可以如下评价:作为底物准备包含木糖的溶液,使检查对象蛋白质在适当温度下作用,测定木糖的减少量和/或木酮糖的生成量。
特别是,作为木糖异构酶基因,优选使用编码包含对序列号4所示的氨基酸序列中的特定的氨基酸残基导入了特定的突变的氨基酸序列、且木糖异构酶活性提高了的突变型木糖异构酶的基因。具体地,作为编码突变型木糖异构酶的基因,可以列举编码序列号4所示的氨基酸序列中的第337位的天冬酰胺被替换为半胱氨酸的氨基酸序列的基因。包含序列号4所示的氨基酸序列中的第337位的天冬酰胺被替换为半胱氨酸的氨基酸序列的木糖异构酶,与野生型的木糖异构酶相比具有优异的木糖异构酶活性。此外,突变型木糖异构酶不限于将上述第337位的天冬酰胺替换为半胱氨酸的突变体,既可以是将上述第337位的天冬酰胺替换成了除了半胱氨酸以外的氨基酸的突变体,也可以是除了上述第337位的天冬酰胺之外进而将不同的氨基酸残基替换成了其他氨基酸的突变体,还可以是替换了除了所述第337位的天冬酰胺以外的其他的氨基酸残基的突变体。
另一方面,除了木糖异构酶基因以外的木糖代谢相关基因,是包含编码将木糖转换为木糖醇的木糖还原酶的木糖还原酶基因、编码将木糖醇转换为木酮糖的木糖醇脱氢酶的木糖醇脱氢酶基因和编码将木酮糖磷酸化而生成木酮糖5-磷酸的木酮糖激酶的木酮糖激酶基因的含义。此外,通过木酮糖激酶而生成的木酮糖5-磷酸进入戊糖磷酸途径而被代谢。
作为木糖代谢相关基因,不特别限定,可列举来源于树干毕赤酵母(Pichiastipitis)的木糖还原酶基因和木糖醇脱氢酶基因、来源于酿酒酵母(Saccharomycescerevisiae)的木酮糖激酶基因(参照Eliasson A.et al.,Appl.Environ.Microbiol,66:3381-3386和Toivari MN et al.,Metab.Eng.3:236-249)。此外,作为木糖还原酶基因,可利用来源于热带假丝酵母(Candida tropicalis)和/或近平滑假丝酵母(Candidaprapsilosis)的木糖还原酶基因。作为木糖醇脱氢酶基因,可利用来源于热带假丝酵母(Candida tropicalis)和/或近平滑假丝酵母(Candida prapsilosis)的木糖醇脱氢酶基因。作为木酮糖激酶基因,可利用来源于树干毕赤酵母(Pichia stipitis)的木酮糖激酶基因。
另外,作为本来具有木糖代谢能力的酵母,不特别限定,可列举树干毕赤酵母(Pichia stipitis)、热带假丝酵母(Candida tropicalis)和近平滑假丝酵母(Candidaprapsilosis)等。
另一方面,作为导入具有木糖代谢能力的酵母中的乙醛脱氢酶基因,不特别限定,可以使用来源于任何生物的基因。另外,在乙醛脱氢酶基因使用来源于除了酵母等真菌以外的生物,例如,细菌和/或动物、植物、昆虫、藻类的基因的情况下,优选使用根据导入的酵母中的密码子使用频率而改变了碱基序列的基因。
更具体地,作为乙醛脱氢酶基因,可以使用大肠杆菌中的mhpF基因,和/或如Applied and Environmental Microbiology,May 2004,p.2892-2897,Vol.70,No.5中所公开的那样使用溶组织内阿米巴(Entamoeba histolytica)中的ALDH1基因。这里,大肠杆菌中的mhpF基因的碱基序列和由mhpF基因所编码的蛋白质的氨基酸序列分别示于序列号1和2。
但是,作为乙醛脱氢酶基因,不限于序列号1和2所特定的,只要是EC编号1.2.1.10所定义的酶,就也可以是碱基序列和/或氨基酸序列不同但处于旁系同源物(paralog)的关系或狭义的同源物的关系的基因。作为乙醛脱氢酶基因,可列举例如,大肠杆菌中的adhE基因、来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因和来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶基因。这里,大肠杆菌中的adhE基因的碱基序列和由adhE基因所编码的蛋白质的氨基酸序列分别示于序列号19和20。另外,来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因的碱基序列和该基因所编码的蛋白质的氨基酸序列分别示于序列号21和22。进而,来源于莱茵衣藻(Chlamydomonasreinhardtii)的乙醛脱氢酶基因乙醛脱氢酶基因的碱基序列和该基因所编码的蛋白质的氨基酸序列分别示于序列号23和24。
另外,乙醛脱氢酶基因不限于这些序列号1和2所特定的,只要是由EC编号1.2.1.10所定义的酶,也可以是碱基序列、氨基酸序列不同但处于旁系同源物(paralog)的关系或狭义的同源物的关系的基因。作为乙醛脱氢酶基因,可列举例如,大肠杆菌中的adhE基因、来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因和来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶基因。这里,大肠杆菌中的adhE基因的碱基序列和adhE基因所编码的蛋白质的氨基酸序列分别示于序列号19和20。另外,来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因的碱基序列和该基因所编码的蛋白质的氨基酸序列分别示于序列号21和22。进而,来源于莱茵衣藻(Chlamydomonasreinhardtii)的乙醛脱氢酶基因的碱基序列和该基因所编码的蛋白质的氨基酸序列分别示于序列号23和24。
另外,乙醛脱氢酶基因不限于这些序列号1和2、19和20、21和22以及23和24所特定的,也可以是例如,编码具有相对于序列号2、20、22或24的氨基酸序列有70%以上、优选为80%以上、更优选为90%以上、最优选为95%以上的序列类似性或同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质的基因。序列类似性和同一性的值,可以通过安装了BLAST算法的BLASTN和/或BLASTX程序来算出(默认的设定)。此外,序列类似性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基、和物理化学上功能类似的氨基酸残基的合计,作为比较的全氨基酸残基中的上述合计数的比例而算出。此外,同一性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基,作为比较的全氨基酸残基中的上述氨基酸残基数的比例算出。
进而,乙醛脱氢酶基因不限于这些序列号1和2、19和20、21和22以及23和24所特定的,还可以是例如,编码具有相对于序列号2、20、22或24的氨基酸序列置换、缺失、插入或添加了1或数个氨基酸的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质的基因。这里,数个例如为2~30个,优选为2~20个,更优选为2~10个,最优选为2~5个。
进而另外,乙醛脱氢酶基因不限于这些序列号1和2、19和20、21和22以及23和24所特定的,也可以是例如,与包含序列号1、19、21或23的碱基序列的DNA的互补链的全部或一部分在严格条件下杂交、且编码具有乙醛脱氢酶活性的蛋白质的基因。这里所说的“严格条件”是指形成所谓特异性杂种、不形成非特异性杂种的条件,可以参照例如MolecularCloning:A Laboratory Manual(Third Edition)来适宜确定。具体地,可以根据DNA杂交(southern hybridization)时的温度和/或溶液所含的盐浓度、和DNA杂交(southernhybridization)的洗涤工序时的温度和/或溶液所含的盐浓度来设定严格度。更详细地,作为严格条件,例如钠浓度为25~500mM、优选为25~300mM,温度为42~68℃、优选为42~65℃。更具体地为5×SSC(83mM NaCl、83mM柠檬酸钠)、温度42℃。
如上所述,关于包含与序列号1、19、21或23不同的碱基序列的基因、或编码与序列号2、20、22或24不同的氨基酸序列的基因是否作为乙醛脱氢酶基因发挥功能,只要制作将该基因插入适当的启动子和终止子等之间而得的表达载体,使用该表达载体转化例如大肠杆菌等宿主,测定表达的蛋白质的乙醛脱氢酶活性即可。乙醛脱氢酶活性可以如下测定:作为底物准备包含乙醛、辅酶A和NAD+的溶液,在适当的温度下与检查对象蛋白质作用,将生成的乙酰磷酸通过磷酸乙酰转移酶的作用而转换为乙酰磷酸,从而测定,或者将生成的NADH进行分光学测量而测定。
此外,本发明的乙醇的制造方法中使用的重组酵母也可以是具有木糖代谢能力、且至少导入了乙醛脱氢酶基因、进而导入了其他基因的酵母。作为其他基因不特别限定,例如,可以是导入了参与葡萄糖等的糖代谢的基因的酵母。作为一例,重组酵母可以通过导入β-葡糖苷酶基因而制成具有β-葡糖苷酶活性的酵母。
这里β-葡糖苷酶活性是指催化糖的β-糖苷键水解的反应的活性。即,β-葡糖苷酶可以将纤维二糖等纤维寡糖分解成葡萄糖。β-葡糖苷酶基因也可以作为细胞表层呈递型基因而导入。这里,细胞表层呈递型基因,是指进行了改变而使得由该基因所编码的蛋白质以在细胞的表层展示的方式表达的基因。例如,细胞表层呈递型β葡糖苷酶基因,是将β葡糖苷酶基因与细胞表层定位蛋白质基因融合而得的基因。细胞表层定位蛋白质是指固定于酵母的细胞表层、存在于细胞表层的蛋白质。可列举例如作为凝集性蛋白质的α-或a-凝集素、FLO蛋白质等。通常细胞表层定位蛋白质在N-末端侧具有分泌信序列号并在C-末端侧具有GPI锚附着识别信号。虽然在具有分泌信号方面与分泌性蛋白质相同,但细胞表层定位蛋白质在介由GPI锚而被固定输送到细胞膜上这方面与分泌性蛋白质不同。细胞表层定位蛋白质在通过细胞膜时,GPI锚附着识别序列信号被选择性地切断,在新突出的C末端部分与GPI锚结合从而被固定在细胞膜上。之后GPI锚的根部被磷脂酰肌醇依赖性磷脂酶C(PI-PLC)切断。接着,从细胞膜被切下的蛋白质插入细胞壁并被固定在细胞表层,从而定位于细胞表层(例如,参照日本特开2006-174767号公报)。
作为β-葡糖苷酶基因,不特别限定,可列举例如来自棘孢曲霉(Aspergillusaculeatus)的β-葡糖苷酶基因(Murai et al.,Appl.Environ.Microbiol.64:4857-4861)。此外,作为β-葡糖苷酶基因,还可利用来自米曲霉(Aspergillus oryzae)的β-葡糖苷酶基因、来自噬纤维梭菌(Clostridiumcellulovorans)的β-葡糖苷酶基因和来自扣囊复膜酵母(Saccharomycopsis fibligera)的β-葡糖苷酶基因等。
另外,本发明的乙醇的制造方法中使用的重组酵母,还可以是在β葡糖苷酶基因的基础之上,或者除了β葡糖苷酶基因以外,还导入了编码构成纤维素酶的其他酶的基因的重组酵母。作为除了β葡糖苷酶以外的构成纤维素酶的酶,可以列举从结晶纤维素的末端使纤维二糖游离的外切型的纤维二糖水解酶(CBH1和CBH2)、不能分解结晶纤维素但随机切断非结晶纤维素(无定形纤维素)链的内切型的内切葡聚糖酶(EG)。
另外,作为导入重组酵母的其他基因,可列举具有将乙醛转换为乙醇的活性的醇脱氢酶基因(ADH1基因)、具有将乙酸转换为乙酰辅酶A的活性的乙酰辅酶A合成酶基因(ACS1基因)、具有将乙醛转换为乙酸的活性的基因(ALD4基因、ALD5基因和ALD6基因)。此外,还可以破坏具有将乙醇转换为乙醛的活性醇脱氢酶基因(ADH2基因)。
进而,在本发明的乙醇的制造方法中使用的重组酵母,优选为具有以下特征的重组酵母,所述特征是高表达具有将乙醛转换为乙醇的活性的醇脱氢酶基因(ADH1基因)。为了高表达该基因,可列举将内在的该基因的启动子置换为高表达用启动子、将能表达地具有该基因的表达载体导入酵母这样的方法。
酿酒酵母(Saccharomyces cerevisiae)的ADH1基因的碱基序列和由该基因所编码的蛋白质的氨基酸序列分别示于序列号5和6。但是,作为高表达对象的醇脱氢酶基因,不限于由序列号5和6所特定的基因,也可以是碱基序列和/或氨基酸序列不同但处于旁系同源物(paralog)的关系或狭义的同源物的关系的基因。
另外,醇脱氢酶基因不限于由这些序列号5和6所特定的,也可以是例如,编码具有相对于序列号6的氨基酸序列有70%以上、优选为80%以上、更优选为90%以上、最优选为95%以上的序列类似性或同一性的氨基酸序列、且具有醇脱氢酶活性的蛋白质的基因。序列类似性和同一性的值,可以通过安装了BLAST算法的BLASTN和/或BLASTX程序来算出(默认的设定)。此外,序列类似性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基、和物理化学上功能类似的氨基酸残基的合计,作为比较的全氨基酸残基中的上述合计数的比例而算出。此外,同一性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基,作为比较的全氨基酸残基中的上述氨基酸残基数的比例算出。
进而,醇脱氢酶基因不限于由这些序列号5和6所特定的,也还可以是例如,编码具有相对于序列号6的氨基酸序列的氨基酸序列置换、缺失、插入或添加了1或数个氨基酸的氨基酸序列、且具有醇脱氢酶的蛋白质的基因。这里,数个例如为2~30个,优选为2~20个,更优选为2~10个,最优选为2~5个。
进而另外,醇脱氢酶基因不限于由这些序列号5和6所特定的,也可以是例如,与包含序列号5的碱基序列的DNA的互补链的全部或一部分在严格条件下杂交、且编码具有醇脱氢酶活性的蛋白质的基因。这里所说的“严格条件”是指形成所谓特异性杂种、不形成非特异性杂种的条件,可以参照例如Molecular Cloning:A Laboratory Manual(ThirdEdition)来适宜确定。具体地,可以根据DNA杂交(southern hybridization)时的温度和/或溶液所含的盐浓度、和DNA杂交(southern hybridization)的洗涤工序时的温度和/或溶液所含的盐浓度来设定严格度。更详细地,作为严格条件,例如钠浓度为25~500mM、优选为25~300mM,温度为42~68℃、优选为42~65℃。更具体地为5×SSC(83mM NaCl、83mM柠檬酸钠)、温度42℃。
如上所述,关于包含与序列号5不同的碱基序列的基因、或编码与序列号6不同的氨基酸序列的基因是否作为具有将乙醛转换为乙醇的活性的醇脱氢酶基因发挥功能,只要制作将该基因插入适当的启动子和终止子等之间而得的表达载体,使用该表达载体转化例如酵母等宿主,测定表达的蛋白质的醇脱氢酶活性即可。将乙醛转换为乙醇的醇脱氢酶活性可以如下测定:作为底物准备包含醛和NADH或NADPH的溶液,在适当的温度下与检查对象蛋白质作用,测量生成的醇,或者分光测量NAD+或NADP+,从而测定。
进而,本发明的乙醇的制造方法中使用的重组酵母优选具有如下特征,所述特征是具有将乙醇转换为醛的活性的醇脱氢酶基因(ADH2基因)的表达量下降了。为了使该基因的表达量下降,可列举改变内在的该基因的启动子、使该基因缺失这样的方法。使该基因缺失时,既可以使二倍体的重组酵母中存在的一对ADH2基因中的一方缺失,也可以使两方缺失。作为抑制基因表达的方法,可列举所谓转座子法、转基因法、转录后基因沉默法、RNAi法、无义介导的衰减(Nonsense mediated decay,NMD)法、核酶法、反义法、miRNA(micro-RNA,微RNA)法、siRNA(small interfering RNA,小干扰RNA)法等。
酿酒酵母(Saccharomyces cerevisiae)的ADH2基因的碱基序列和由该基因所编码的蛋白质的氨基酸序列分别示于序列号7和8。但是,作为对象的醇脱氢酶基因,不限于序列号7和8所特定的,也可以是碱基序列和/或氨基酸序列不同但处于旁系同源物(paralog)的关系或狭义的同源物的关系的基因。
另外,醇脱氢酶基因不限于由这些序列号7和8所特定的,也可以是例如,编码具有相对于序列号8的氨基酸序列有70%以上、优选为80%以上、更优选为90%以上、最优选为95%以上的序列类似性或同一性的氨基酸序列、且具有醇脱氢酶活性的蛋白质的基因。序列类似性和同一性的值,可以通过安装了BLAST算法的BLASTN和/或BLASTX程序来算出(默认的设定)。此外,序列类似性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基、和物理化学上功能类似的氨基酸残基的合计,作为比较的全氨基酸残基中的上述合计数的比例而算出。此外,同一性的值,可以算出在将一对氨基酸序列两两比对分析时完全一致的氨基酸残基,作为比较的全氨基酸残基中的上述氨基酸残基数的比例算出。
进而,醇脱氢酶基因不限于由这些序列号7和8所特定的,也还可以是例如,编码具有相对于序列号8的氨基酸序列置换、缺失、插入或添加了1或数个氨基酸的氨基酸序列、且具有醇脱氢酶活性的蛋白质的基因。这里,数个例如为2~30个,优选为2~20个,更优选为2~10个,最优选为2~5个。
进而另外,醇脱氢酶基因不限于这些序列号7和8所特定的,也可以是例如,与包含序列号7的碱基序列的DNA的互补链的全部或一部分在严格条件下杂交、且编码具有醇脱氢酶活性的蛋白质的基因。这里所说的“严格条件”是指形成所谓特异性杂种、不形成非特异性杂种的条件,可以参照例如Molecular Cloning:A Laboratory Manual(Third Edition)来适宜确定。具体地,可以根据DNA杂交(southern hybridization)时的温度和/或溶液所含的盐浓度、和DNA杂交(southern hybridization)的洗涤工序时的温度和/或溶液所含的盐浓度来设定严格度。更详细地,作为严格条件,例如钠浓度为25~500mM、优选为25~300mM,温度为42~68℃、优选为42~65℃。更具体地为5×SSC(83mM NaCl、83mM柠檬酸钠)、温度42℃。
如上所述,包含与序列号7不同的碱基序列的基因、或编码与序列号8不同的氨基酸序列的基因是否作为具有将乙醇转换为醛的活性的醇脱氢酶基因发挥功能,只要制作将该基因插入适当的启动子和终止子等之间而得的表达载体,使用该表达载体转化例如酵母等宿主,测定表达的蛋白质的醇脱氢酶活性即可。将乙醇转换为醛的醇脱氢酶活性可以如下测定:作为底物准备包含醇和NAD+或NADP+的溶液,在适当的温度下与检查对象蛋白质作用,测量生成的醛,或者分光测量NADH或NADPH,从而测定。
进而,作为导入重组酵母中的其他基因,可列举参与构成生物质的半纤维素所含的5碳糖L-阿拉伯糖的代谢途径的基因。作为这样的基因,可列举例如,来源于原核生物的L-阿拉伯糖异构酶基因、L-核酮糖激酶基因、L-核酮糖-5-磷酸-4-差向异构酶基因、和/或来源于真核生物的L-阿糖醇-4-脱氢酶基因、L-木糖还原酶基因。
特别是作为导入重组酵母中的其他基因,可列举能够促进培养基中的木糖的利用的基因。具体地,可列举编码具有以木酮糖作为底物生成木酮糖-5-磷酸的活性的木酮糖激酶的基因。通过导入木酮糖激酶基因,能够提高戊糖磷酸途径的代谢通量。
进而重组酵母中可以导入编码选自构成戊糖磷酸途径中的非氧化过程的途径的酶群中的酶的基因。作为构成戊糖磷酸途径中的非氧化过程的途径的酶,可列举核糖-5-磷酸异构酶、核酮糖-5-磷酸-3-差向异构酶、转酮醇酶和转醛醇酶。优选导入1种以上编码这些酶的基因。另外,更优选将这些基因中的2种以上组合导入,进一步优选3种以上组合导入,最优选导入全种类的基因。
更具体地作为木酮糖激酶(XK)基因,可以不特别限制来源生物地使用。此外XK基因由同化木酮糖的细菌、酵母等大量微生物保持。关于XK基因的信息可以通过NCBI的HP等的检索来适宜获得。优选列举来源于酵母、乳酸菌、大肠杆菌、植物等的XK基因。作为XK基因,可列举例如,来源于酿酒酵母(S.cerevisiae)S288C株的XK基因XKS1(GenBank:Z72979)(CDS的编码区的碱基序列和氨基酸序列)。
另外,更具体地,转醛醇酶(TAL)基因、转酮醇酶(TKL)基因、核酮糖-5-磷酸差向异构酶(RPE)基因、核糖-5-磷酸酮异构酶(RKI)基因可以不特别限定来源生物地使用。这些基因在具有戊糖磷酸途径的大量生物中保持。例如,酿酒酵母(S.cerevisiae)等通用酵母也保持这些基因。关于这些基因的信息可以通过登录NCBI等的HP来适宜获得。优选列举来源于真核细胞或酵母等与宿主真核细胞同一属、更优选与宿主真核细胞同一种的各基因。作为TAL基因优选使用TAL1基因,作为TKL基因优选使用TKL1基因和TKL2基因,作为RPE基因优选使用RPE1基因,作为RKI基因优选使用RKI1基因。例如,作为这些基因,可列举作为来源于酿酒酵母(S.cerevisiae)S288株的TAL1基因的TAL1基因(GenBank:U19102)、来源于酿酒酵母(S.cerevisiae)S288株的TKL1基因(GenBank:X73224)、来源于酿酒酵母(S.cerevisiae)S288株的RPE1基因(Genbank:X83571)、来源于酿酒酵母(S.cerevisiae)S288株的RKI1基因(GenBank:Z75003)。
<重组酵母的制作>
可以通过将上述的木糖异构酶基因和乙醛脱氢酶基因导入成为宿主的酵母基因组中,来制作能够在本发明中使用的重组酵母。这里,木糖异构酶基因和乙醛脱氢酶基因既可以导入不具有木糖代谢能力的酵母中,也可以导入本来具有木糖代谢能力的酵母中,还可以与木糖代谢相关基因一起导入不具有木糖代谢能力的酵母中。另外,将木糖异构酶基因、乙醛脱氢酶基因和上述的基因导入酵母时,既可以将全部基因同时导入,也可以利用不同的表达载体逐次导入。
作为能用作宿主的酵母,不特别限定,可列举休哈塔假丝酵母(CandidaShehatae)、树干毕赤酵母(Pichia stipitis)、嗜鞣管囊酵母(Pachysolen tannophilus)、酿酒酵母(Saccharomyces cerevisiae)和粟酒裂殖酵母(Schizosaccaromyces pombe)等酵母,特别优选为酿酒酵母(Saccharomyces cerevisiae)。此外,作为酵母,既可以是为了实验方面的方便性而使用的实验株,也可以是为了实用方面的有用性而使用的工业株(实用株)。作为工业株,可列举例如用于葡萄酒、清酒和/或烧酎制作的酵母株。
此外,作为成为宿主的酵母,优选使用具有同宗配合性的酵母。根据日本特开2009-34036号公报所公开的方法,通过利用具有同宗配合性的酵母,可以简便地向基因组中导入多拷贝基因。具有同宗配合性的酵母与同宗配合酵母为同义。作为具有同宗配合性的酵母,不特别限定,也可以使用任何酵母。作为具有同宗配合性的酵母,可列举酿酒酵母OC-2株(NBRC2260),但不限于此。此外,作为具有同宗配合性的酵母,可列举醇酵母(台研396号、NBRC0216)(出处:“アルコール酵母的諸特性(醇酵母的诸特性)”酒研会报、No37、p18-22(1998.8))、在巴西和冲绳分离出的乙醇生产酵母(出处:“ブラジルと沖縄で分離したSaccharomyces cerevisiae野生株的遺伝学的性質(在巴西和冲绳分离的酿酒酵母野生株的遗传学性质)”日本农艺化学会志、Vol.65、No.4、p759-762(1991.4))和180(出处:“アルコール発酵力の強い酵母的スクリーニング(醇发酵能力强的酵母的筛选)”日本酿造协会志、Vol.82、No.6、p439-443(1987.6))等。另外,即使是显示异宗配合的表型的酵母,也可通过以可表达的方式导入HO基因而制成具有同宗配合性的酵母来使用。因此,在本发明中,具有同宗配合性的酵母是也包含以可表达的方式导入了HO基因的酵母的含义。
其中,酿酒酵母(Saccharomyces cerevisiae)OC-2株是一直以来在葡萄酒酿酒的现场被利用的被确认了安全性的菌株,因而优选。此外酿酒酵母(Saccharomycescerevisiae)OC-2株如后述实施例所述的那样,是在高糖浓度的条件下启动子活性优异的菌株,因而优选。特别是酿酒酵母(Saccharomyces cerevisiae)OC-2株在高糖浓度条件下丙酮酸脱羧酶基因(PDC1)的启动子活性优异,因而优选。
另外,作为导入的基因的启动子,不特别限定,可利用例如,甘油醛-3-磷酸脱氢酶基因(TDH3)的启动子、3-磷酸甘油酸激酶基因(PGK1)的启动子、高渗透压应答7基因(HOR7)的启动子等。其中,丙酮酸脱羧酶基因(PDC1)的启动子高表达下游的目的基因的能力高,因而优选。
即,上述基因可以与调控表达的启动子和/或其他表达调控区一起导入酵母的基因组中。或者,上述基因也可以以通过成为宿主的酵母的基因组中本来存在基因的启动子和/或其他表达调控区而进行表达调控的方式导入。
作为导入上述基因的方法,可以应用已知作为酵母的转化方法的现有公知的任何方法。具体来说,例如,可以以例如电穿孔法“Meth.Enzym.,194,p182(1990)”、原生质球法“Proc.Natl.Acad.Sci.USA,75p1929(1978)”、乙酸锂法“J.Bacteriology,153,p163(1983)”、Proc.Natl.Acad.Sci.USA,75p1929(1978)、Methods in yeast genetics,2000Edition:A Cold Spring Harbor Laboratory Course Manual等记载的方法实施,但不限于此。
<乙醇制造>
在使用以上所说明的重组酵母制造乙醇时,用至少含有木糖的培养基进行乙醇发酵培养。即,进行乙醇发酵的培养基,作为碳源至少含有木糖。此外,培养基中也可以预先含有葡萄糖等其他碳源。
另外,在乙醇发酵中利用的培养基所含的木糖可以来源于生物质。换句话说,在乙醇发酵中利用的培养基也可以是包含纤维素系生物质、糖化纤维素系生物质所含的半纤维素而生成木糖的半纤维素酶的组成。这里,作为纤维素系生物质,可以是实施了现有公知的前处理的物质。作为前处理,不特别限定,可列举例如,将木质素用微生物分解的处理、和/或纤维素系生物质的粉碎处理等。另外,作为前处理,也可以应用例如,将粉碎了的纤维素系生物质浸渍于稀硫酸溶液和/或碱溶液、离子液体中的处理、水热处理、微粉碎处理这样的处理。通过这些前处理,可以提高生物质的糖化率。
此外,使用以上所说明的重组酵母制造乙醇时,上述培养基还可以是进一步包含纤维素和纤维素酶的组成。此时,上述培养基中含有纤维素酶作用于纤维素而生成的葡萄糖。在乙醇发酵中利用的培养基含有纤维素的情况下,该纤维素可以来源于生物质。换句话说,在乙醇发酵中利用的培养基还可以是包含能够糖化纤维素系生物质所含的纤维素的纤维素酶的组成。
另外,在乙醇发酵中利用的培养基,还可以添加将纤维素系生物质糖化处理后的糖化液。此时,糖化液包含残存的纤维素和/或纤维素酶、以及来源于纤维素系生物质所含的半纤维素的木糖。
如上,本发明的乙醇的制造方法包含至少以木糖作为糖源的乙醇发酵的工序。本发明的乙醇的制造方法能够通过以木糖作为糖源的乙醇发酵来制造乙醇。在本发明的利用了重组酵母的乙醇的制造方法中,在乙醇发酵之后从培养基中回收乙醇。对乙醇的回收方法没有特别限制,可采用现有公知的任何方法。例如,在上述乙醇发酵结束后,通过固液分离操作将含有乙醇的液层、与含有重组酵母和/或固体成分的固层分离。然后将液层中所含的乙醇通过蒸馏法进行分离·纯化,从而可以回收纯度高的乙醇。此外,乙醇的纯化度可以根据乙醇的使用目的而适宜调整。
一般地,在利用来源于生物质的糖来制造乙醇的情况下,在上述的前处理和/或糖化处理中有时产生乙酸和/或糠醛这样的发酵抑制物质。特别地关于乙酸,已知其抑制酵母的生长·增殖,使以木糖作为糖源的乙醇发酵的效率下降。
然而,在本发明中,由于使用导入了木糖异构酶基因和乙醛脱氢酶基因的重组酵母,因而可以代谢培养基所含的乙酸,从而可以将培养基所含的乙酸浓度抑制得较低。因此,本发明的乙醇的制造方法,与使用了未导入木糖异构酶基因和乙醛脱氢酶基因的酵母时相比,可以实现优异的乙醇收率。
另外,根据本发明的乙醇的制造方法,由于即使在将重组酵母培养规定时间之后培养基中的乙酸浓度也低,因而,即使在利用培养规定时间之后的培养基的一部分重新开始培养的连续培养体系中使用,也能够降低乙酸的带入量。另外,根据本发明的乙醇的制造方法,即使在乙醇发酵工序结束之后回收菌体而再利用的情况下,也由于同样的理由而能够降低乙酸的带入量。
另外,本发明的乙醇的制造方法,可以成为将培养基所含的纤维素用纤维素酶进行糖化的工序、与以木糖和糖化所生成的葡萄糖作为糖源的乙醇发酵的工序同时进行的所谓同时糖化发酵处理。这里,同时糖化发酵处理是指将糖化纤维素系生物质的工序与乙醇发酵工序不区分地同时实施的处理。
此外,作为糖化方法没有特别限制,可列举利用纤维素酶和/或半纤维素酶等纤维素酶制剂的酶法等。纤维素酶制剂含有参与纤维素链和半纤维素链的分解的多种酶,显示内切葡聚糖酶活性、内切木聚糖酶活性、纤维二糖水解酶活性、葡糖苷酶活性和木糖苷酶活性等多种活性。作为纤维素酶制剂,没有特别限制,可列举例如,由里氏木霉(Trichodermareesei)、和/或解纤维顶孢霉(Acremonium cellulolyticus)等所生产的纤维素酶。作为纤维素酶制剂,也可以使用市售品。
在同时糖化发酵处理中,向包含纤维素系生物质(也可以在前处理后)的培养基中加入纤维素酶制剂和上述重组微生物,在规定的温度范围内培养该重组酵母。作为培养温度没有特别限制,考虑乙醇发酵的效率可以为25~45℃,优选为30~40℃。此外培养液的pH值优选为4~6。此外,培养时也可以进行搅拌和/或振荡。进而,也可以是先在酶的最适温度(40~70℃)下进行糖化,然后,使温度下降至规定的温度(30~40℃)再添加酵母这样的不规则的同时糖化发酵。
实施例
以下使用实施例更详细地说明本发明,但本发明的技术范围并不限制于以下实施例。
〔实施例1〕
在本实施例中,制作导入了木糖异构酶基因和大肠杆菌的乙醛脱氢酶基因(mhpF基因)的重组酵母,评价该重组酵母的乙酸代谢能力。
<导入载体的制作>
(1)XKS1基因导入用载体
作为来源于酿酒酵母(S.cerevisiae)的木酮糖激酶(XK)基因的酵母导入用载体,制作图1所示的pUC-HIS3U-P_HOR7-XKS1-T_TDH3-P_TDH2-hph-T_CYC1-HIS3D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了TDH3终止子的作为来源于酿酒酵母(S.cerevisiae)NBRC304株的XK基因的XKS1基因(genebank:X61377)、成为向酵母基因组上的同源重组区域的组氨酸合成酶(HIS3)基因的上游约500bp的区域(HIS3U)以及该基因内的约500bp的区域(HIS3D)、以及5’侧添加了TDH2启动子和3’侧添加了CYC1终止子的潮霉素磷酸转移酶(hph)基因(标志物基因)。此外,同源重组区域的外侧导入了限制性酶Sse8387I的位点。另外,来源于酿酒酵母(S.cerevisiae)NBRC304株的XKS1基因的编码区的碱基序列和该基因所编码的木酮糖激酶的氨基酸序列分别示于序列号9和10。
(2)XI基因导入用载体
作为来源于黄胸散白蚁(Reticulitermes speratus)肠内原生生物的木糖异构酶(RsXI-C1、参照日本特开2011-147445)基因的酵母导入用载体,制作图2所示的pUC-R67-HOR7p-RsXI-T_TDH3-TRP1d-R45。该载体包含:5’侧添加了HOR7启动子和3’侧添加了TDH3终止子的RsXI-C1基因、成为向酵母基因组上的同源重组区域的rRNA基因(rDNA)的同源序列R45和R67、以及缺失启动子部分而降低了表达量的TRP1d标志物基因。此外,同源重组区域的外侧导入了限制性酶Sse8387I的位点。通过R45和R67,包含RsXI-C1的基因被多拷贝导入到第12号染色体上的rDNA座。进而TRP1d标志物在以多拷贝被导入染色体上的情况下才作为标志物发挥功能。因此,通过利用本载体,多拷贝导入成为可能。此外,在本实施例中RsXI-C1基因,使用将全区域设计成根据酵母的密码子使用频率而转换了使用的密码子的碱基序列,基于该碱基序列全合成而得的基因。本实施例中设计的RsXI-C1基因的碱基序列和该基因所编码的木糖异构酶的氨基酸序列分别示于序列号3和4。
(3)TAL1·TKL1基因导入用载体
作为来源于酿酒酵母(S.cerevisiae)的转醛醇酶1(TAL1)基因和转酮醇酶1(TKL1)基因的酵母导入用载体,制作图3所示的pUC-LEU2U-P_HOR7-TAL1-T_TDH3-P_HOR7-TKL1-T_TDH3-HIS3-LEU2D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了TDH3终止子的作为来源于酿酒酵母(S.cerevisiae)S288株的TAL1基因的TAL1基因(genebank:U19102);5’侧添加了HOR7启动子和3’侧添加了TDH3终止子的作为来源于酿酒酵母(S.cerevisiae)S288株的TKL1基因的TKL1基因(genebank:X73224);成为向酵母基因组上的同源重组区域的亮氨酸合成酶(LEU2)基因从3’侧末端到上游约500bp的区域(LEU2U)以及该基因5’侧末端上游的约450bp的区域(LEU2D);以及组氨酸合成酶(HIS3)基因(标志物基因)。此外,同源重组区域的外侧导入了限制性酶Sse8387I的位点。另外,来源于酿酒酵母(S.cerevisiae)S288株的TAL1基因的编码区的碱基序列和该基因所编码的转醛醇酶1的氨基酸序列分别示于序列号11和12。进而来源于酿酒酵母(S.cerevisiae)S288株的TKL1基因的编码区的碱基序列和该基因所编码的转酮醇酶1的氨基酸序列分别示于序列号13和14。
(4)RPE1·RKI1基因导入、GRE3基因破坏用载体
作为来源于酿酒酵母(S.cerevisiae)的核酮糖磷酸差向异构酶1(RPE1)基因和核糖磷酸酮异构酶(RKI1)基因的酵母导入用载体,制作了图4所示的pUC-GRE3U-P_HOR7-RPE1-T_TDH3-P_HOR7-RKI1-T_TDH3-LEU2-GRE3D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了TDH3终止子的作为来源于酿酒酵母(S.cerevisiae)S288株的RPE1基因的RPE1基因(genebank:X83571);5’侧添加了HOR7启动子和3’侧添加了TDH3终止子的来源于酿酒酵母(S.cerevisiae)S288株的RKI1基因(genebank:Z75003);成为向酵母基因组上的同源重组和用于破坏醛糖还原酶3(GRE3)基因的区域的包含GRE3基因的3’末端区域约500bp的约800bp的区域(GRE3U)和GRE3基因的上游约1000bp的区域(GRE3D);以及亮氨酸合成酶(LEU2)基因(标志物基因)。此外,同源重组区域的外侧导入了限制性酶Sse8387I的位点。另外,来源于酿酒酵母(S.cerevisiae)S288株的RPE1基因的编码区的碱基序列和该基因所编码的核酮糖磷酸差向异构酶1的氨基酸序列分别示于序列号15和16。进而,来源于酿酒酵母(S.cerevisiae)S288株的RKI1基因的编码区的碱基序列和该基因所编码的核糖磷酸酮异构酶的氨基酸序列分别示于序列号17和18。
(5)ADH2基因破坏用载体
作为宿主中内在的ADH2基因的破坏用载体,制作了图5所示的pCR-ADH2U-URA3-ADH2D。该载体中,作为向酵母基因组上的同源重组和用于破坏醇脱氢酶2(ADH2)基因的区域,包含ADH2基因的上游约700bp的区域(ADH2U)、ADH2基因的下流约800bp的区域(ADH2D)、以及乳清苷-5’-磷酸脱羧酶(URA3)基因(标志物基因)。
(6)ADH1基因导入用载体
作为醇脱氢酶1(ADH1)基因的酵母导入用载体,制作了图6所示的pCR-ADH2part-T_CYC1-P_TDH3-ADH1-T_ADH1-URA3-ADH2D。该载体包含:5’侧添加了TDH3启动子和3’侧添加了ADH1终止子的来源于酿酒酵母(S.cerevisiae)S288株的ADH1基因(genebank:Z74828.1)、成为向酵母基因组上的同源重组区域的ADH2基因的从3’侧末端到上游约450bp的区域(ADH2part)和从3’侧末端到下流的约700bp的区域(ADH2D)、作为ADH2的终止子的CYC1终止子、以及URA3基因(标志物基因)。
(7)mhpF基因导入用载体
作为来源于大肠杆菌(E.coli)的乙醛脱氢酶(mhpF)基因的酵母导入用载体,制作了图7所示的pCR-ADH2part-T_CYC1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了ERO1终止子的来源于大肠杆菌(E.coli)的乙醛脱氢酶基因(mhpF基因)、成为向酵母基因组上的同源重组区域的ADH2基因的从3’侧末端到上游约450bp的区域(ADH2part)和从3’侧末端到下流的约700bp的区域(ADH2D)、作为ADH2的终止子的CYC1终止子、以及包含URA3基因的基因(标志物基因)。此外,本实施例中,mhpF基因使用将全区域设计成根据酵母的密码子使用频率而转换了使用的密码子的碱基序列,基于该碱基序列全合成而得的基因。本实施例中设计的mhpF基因的碱基序列和该基因所编码的乙醛脱氢酶的氨基酸序列分别示于序列号1和2。
(8)mhpF·ADH1基因导入用载体
作为mhpF基因和ADH1基因的酵母导入用载体,制作了图8所示的pCR-ADH2part-T_CYC1-P_TDH3-ADH1-T_ADH1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了ERO1终止子的mhpF基因(与上述(7)相同)、5’侧添加了TDH3启动子和3’侧添加了ADH1终止子的来源于酿酒酵母(S.cerevisiae)S288株的ADH1基因(与上述(6)相同)、成为向酵母基因组上的同源重组区域的ADH2基因的从3’侧末端到上游约450bp的区域(ADH2part)、和从3’侧末端到下流的约700bp的区域(ADH2D)、作为ADH2的终止子的CYC1终止子、以及URA3基因(标志物基因)。
(9)mhpF基因导入、ADH2破坏用载体
作为mhpF基因的酵母导入和ADH2基因破坏用载体,制作了图9所示的pCR-ADH2U-ERO1_T-mhpF-HOR7_P-URA3-ADH2D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了ERO1终止子的mhpF基因(与上述(7)相同)、成为向酵母基因组上的同源重组和用于破坏ADH2基因的区域的ADH2基因的上游约700bp的区域(ADH2U)和ADH2基因的上游约800bp的区域(ADH2D)、以及URA3基因(标志物基因)。
(10)mhpF基因和ADH1基因导入、ADH2破坏用载体
作为mhpF基因和ADH1基因的酵母导入以及ADH2基因破坏用载体,制作了图10所示的pCR-ADH2U-P_TDH3-ADH1-T_ADH1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D。该载体包含:5’侧添加了HOR7启动子和3’侧添加了ERO1终止子的mhpF基因(与上述(7)相同)、5’侧添加了TDH3启动子和3’侧添加了ADH1终止子的来源于酿酒酵母(S.cerevisiae)S288株的ADH1基因(与上述(6)相同)、成为向酵母基因组上的同源重组和用于破坏ADH2基因的区域的ADH2基因的上游约700bp的区域(ADH2U)和ADH2基因的上游约800bp的区域(ADH2D)、以及URA3基因(标志物基因)。
(11)对照载体(仅标志物基因)
作为仅导入标志物基因的对照载体,制作了图11所示的pCR-ADH2part-T_CYC1-URA3-ADH2D。该载体包含:成为向酵母基因组上的同源重组区域的ADH2基因的从3’侧末端到上游约450bp的区域(ADH2part)和从3’侧末端到下流的约700bp的区域(ADH2D)、作为ADH2的终止子的CYC1终止子、以及URA3基因(标志物基因)。
<载体导入酵母株的制作>
用添加了5-氟乳清酸的培养基筛选作为2倍体酵母的酿酒酵母(Saccharomycescerevisiae)OC2-T株(Saitoh,S.等,J.Ferment.Bioeng.1996年、81卷98-103)(Boeke,J.D.,et al.1987Methods Enzymol.;154:164-75.),以成为尿嘧啶营养缺陷型的菌株作为宿主。
酵母的转化使用Frozen-EZ Yeast Transformation II(ZYMO RESEARCH),按照附带的实验步骤进行。首先,使用将pUC-HIS3U-P_HOR7-XKS1-T_TDH3-P_TDH2-hph-T_CYC1-HIS3D载体用限制性酶Sse8387I消化而得的片段,进行OC2-T株的转化,涂布于YPD+HYG琼脂培养基,将生长的集落纯化。将纯化后的选择株命名为OC100株。接下来,使用将pUC-LEU2U-P_HOR7-TAL1-T_TDH3-P_HOR7-TKL1-T_TDH3-HIS3-LEU2D载体用限制性酶Sse8387I消化而得的片段,进行OC100株的转化,涂布于不含组氨酸的SD琼脂培养基(Mthods in YeastGenetics,Cold Spring Harbor Laboratory Press),将生长的集落纯化。将纯化后的选择株命名为OC300株。接下来,使用将pUC-GRE3U-P_HOR7-RPE1-T_TDH3-P_HOR7-RKI1-T_TDH3-LEU2-GRE3D载体用限制性酶Sse8387I消化而得的片段,进行OC300株的转化,涂布于不含亮氨酸的SD琼脂培养基,将生长的集落纯化。将纯化后的选择株命名为OC600株。接下来,使用将pUC-R67-HOR7p-RsXI-T_TDH3-TRP1d-R45载体用限制性酶Sse8387I消化而得的片段,进行OC600株的转化,涂布于不含色氨酸的SD琼脂培养基,将生长的集落纯化。将纯化后的选择株命名为OC700株。如上制作的OC700株中导入有RsXI-C1基因、XK基因、TAL1基因、TKL1基因、RPE1基因和RKI1基因。
接下来,使用将pCR-ADH2U-URA3-ADH2D、pCR-ADH2part-T_CYC1-P_TDH3-ADH1-T_ADH1-URA3-ADH2D、pCR-ADH2part-T_CYC1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D、pCR-ADH2part-T_CYC1-P_TDH3-ADH1-T_ADH1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D、pCR-ADH2U-ERO1_T-mhpF-HOR7_P-URA3-ADH2D、pCR-ADH2U-P_TDH3-ADH1-T_ADH1-ERO1_T-mhpF-HOR7_P-URA3-ADH2D、pCR-ADH2part-T_CYC1-URA3-ADH2D的各载体的同源重组部位之间用PCR扩增而得的片段,进行OC700株的转化,涂布于不含尿嘧啶的SD琼脂培养基,将生长的集落纯化。将纯化后的选择株分别命名为Uz1048、Uz1047、Uz928、Uz1012、Uz926、Uz736和Uz1049株。
<发酵试验>
从如上所得的Uz1048、Uz1047、Uz928、Uz1012、Uz926、Uz736和Uz1049系统的菌株中分别挑选发酵能力高的菌株,如下实施烧瓶发酵试验。首先,在分装有20ml葡萄糖浓度20g/L的YPD液体培养基(酵母提取物10g/L、胨20g/L、葡萄糖20g/L)的100ml容量的带挡板的烧瓶中接种供试株,以30℃120rpm进行24小时培养。集菌后,接种于分装有10mlD20X60YAc6培养基(葡萄糖20g/L、木糖60g/L、酵母提取物10g/L、乙酸6g/L)的20ml容量的烧瓶中(菌浓度0.3g干燥菌体/L),以振荡培养(80rpm、振幅35mm、30℃)进行发酵试验。此外,盖在烧瓶上的塞子是通有内径1.5mm的针的橡胶制,通过在针的最前部安装止回阀而使得烧瓶保持厌氧环境。
发酵开始65小时后进行取样,对于发酵液中的葡萄糖、木糖、乙酸、乙醇,使用HPLC(LC-10A;岛津制作所),在下述条件测定。
柱:AminexHPX-87H
移动相:0.01N H2SO4
流量:0.6ml/min
温度:30℃
检测器:差示折射率检测器RID-10A
<发酵试验结果>
上述发酵试验的结果示于表1。
表1
如由表1判断的那样,与mhpF过表达株相比,在过表达mhpF基因和ADH1基因的同时破坏了ADH2基因的Uz736株中,木糖的同化速度大幅提高,其结果乙醇的生产性提高。由于仅ADH2破坏的菌株、仅ADH1过表达的菌株的木糖的同化速度不改善,因而考虑存在协同效果。另外,在Uz736株中,培养基中的乙酸的浓度有意义地减少,明确了乙酸同化能力也提高。
〔实施例2〕
本实施例中,制作导入了木糖异构酶基因和大肠杆菌的mhpF基因、adhE基因、来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因或来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶基因的重组酵母。本实施例中制作的重组酵母中,破坏了内在的一对ADH2基因中的一方或两方。
<导入载体的制作>
(1)XI·XKS1·TKL1·TAL1·RKI1·RPE1基因导入和GRE3基因破坏用质粒
制作在GRE3基因座破坏GRE3基因、同时包含将来源于黄胸散白蚁(Reticulitermes speratus)肠内原生生物的木糖异构酶基因的第377位的氨基酸从天冬酰胺替换为半胱氨酸、木糖的同化速度提高了的突变基因(XI_N337C)、来源于酵母的木酮糖激酶(XKS1)基因、戊糖磷酸途径的转酮醇酶1(TKL1)基因、转醛醇酶1(TAL1)基因、核酮糖磷酸差向异构酶1(RPE1)基因和核糖磷酸酮异构酶(RKI1)基因导入酵母所需的序列的质粒pUC-5U_GRE3-P_HOR7-TKL1-TAL1-FBA1_P-P_ADH1-RPE1-RKI1-TE F1_P-P_TDH1-XI_N337C-T_DIT1-P_TDH3-XKS1-T_HIS3-LoxP-G418-LoxP-3U_GRE3。
该质粒以包含5’侧添加了HOR7启动子的来源于酿酒酵母(Saccharomycescerevisiae)BY4742株的TKL1基因、添加了FBA1启动子的TAL1基因、添加了ADH1启动子的RKI1基因、添加了TEF1启动子的RPE1基因、添加了TDH1启动子和DIT1终止子的XI_N337C(全合成将全长根据酵母的密码子使用频率而转换了密码子的序列而得的基因)、添加了TDH3启动子和HIS3终止子的XKS1基因、作为向酵母基因组上的同源重组区域的GRE3基因的从5’侧末端到上游约700bp的区域的基因序列(GRE3U)和GRE3基因从3’侧末端到下流的约800bp的区域的DNA序列(GRE3D)、以及作为标志物的包含G418基因的基因序列(G418标志物)的方式构建。此外,标志物基因是能够通过在两侧导入LoxP序列而除去标志物的序列。
此外,本质粒所含的各DNA序列可以使用表2的引物来扩增。为了使各DNA片段结合,使用对表2的引物以与邻接DNA序列重复约15bp的方式添加了DNA序列的引物,以酿酒酵母(Saccharomyces cerevisiae)BY4742基因组、XI_N337C合成基因DNA、LoxP序列的合成DNA为模板扩增目的DNA片段,使用In-Fusion HD Cloning Kit(タカラバイオ)等依次使DNA片段结合,克隆到质粒pUC19中而制成最终目的质粒。
表2
(2)mhpF·ADH1基因导入和ADH2基因破坏用质粒
制作在ADH2基因座破坏ADH2基因、同时包含将来源于大肠杆菌(E.coli)的乙醛脱氢酶基因(mhpF)和来源于酵母的醇脱氢酶1(ADH1)基因导入酵母所需的序列的质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2。
该质粒以包含5’侧添加了TDH3启动子的来源于酿酒酵母(Saccharomycescerevisiae)BY4742株的ADH1基因、添加了HOR7启动子和DIT1终止子的mhpF基因(全合成将全长根据酵母的密码子使用频率而转换了密码子的序列而得的基因)、作为向酵母基因组上的同源重组区域的ADH2基因的从5’侧末端到上游约700bp的区域的基因序列(ADH2U)和ADH2基因从3’侧末端到下流的约800bp的区域的DNA序列(ADH2D)、以及作为标志物的包含URA3基因的基因序列(URA3标志物)的方式构建。
此外,本质粒所含的各DNA序列可以使用表3的引物来扩增。为了使各DNA片段结合,使用对表3的引物以与邻接DNA序列重复约15bp的方式添加了DNA序列的引物,以酿酒酵母(Saccharomyces cerevisiae)BY4742基因组或mhpF合成基因DNA为模板扩增目的DNA片段,使用In-Fusion HD Cloning Kit等依次使DNA片段结合,克隆到质粒pUC19中而制成最终目的质粒。
表3
(3)adhE·ADH1基因导入和ADH2基因破坏用质粒
制作在ADH2基因座破坏ADH2基因、同时包含将来源于大肠杆菌(E.coli)的乙醛脱氢酶基因(adhE)和来源于酵母的醇脱氢酶1(ADH1)基因导入酵母所需的序列的质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-adhE-HOR7_P-URA3-3U_ADH2。
该质粒以包含5’侧添加了TDH3启动子的来源于酿酒酵母(Saccharomycescerevisiae)BY4742株的ADH1基因、添加了HOR7启动子和DIT1终止子的adhE基因(NCBI登录号NP_415757.1、全合成将全长根据酵母的密码子使用频率而转换了密码子的序列而得的基因)、作为向酵母基因组上的同源重组区域的ADH2基因的从5’侧末端到上游约700bp的区域的基因序列(ADH2U)和ADH2基因从3’侧末端到下流的约800bp的区域的DNA序列(ADH2D)、以及作为标志物的包含URA3基因的基因序列(URA3标志物)的方式构建。
此外,本质粒所含的各DNA序列可以使用表4的引物来扩增。为了使各DNA片段结合,使用对表4的引物以与邻接DNA序列重复约15bp的方式添加了DNA序列的引物,以质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2或adhE合成基因DNA为模板扩增目的DNA片段,使用In-Fusion HD Cloning Kit等依次使DNA片段结合,克隆到质粒pUC19中而制成最终目的质粒。
表4
(4)来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因·ADH1基因导入和ADH2基因破坏用质粒
制作在ADH2基因座破坏ADH2基因、同时包含将来源于拜氏梭菌(Clostridiumbeijerinckii)的乙醛脱氢酶基因和来源于酵母的醇脱氢酶1(ADH1)基因导入酵母所需的序列的质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-CloADH-HOR7_P-URA3-3U_ADH2。
该质粒以包含5’侧添加了TDH3启动子的来源于酿酒酵母(Saccharomycescerevisiae)BY4742株的ADH1基因、添加了HOR7启动子和DIT1终止子的来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢酶基因(NCBI登录号YP_001310903.1、全合成将全长根据酵母的密码子使用频率而转换了密码子的序列而得的基因)、作为向酵母基因组上的同源重组区域的ADH2基因的从5’侧末端到上游约700bp的区域的基因序列(ADH2U)和ADH2基因从3’侧末端到下流的约800bp的区域的DNA序列(ADH2D)、以及作为标志物的包含URA3基因的基因序列(URA3标志物)的方式构建。
此外,本质粒所含的各DNA序列可以使用表5的引物来扩增。为了使各DNA片段结合,使用对表5的引物以与邻接DNA序列重复约15bp的方式添加了DNA序列的引物,以质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2或来源于拜氏梭菌(Clostridium beijerinckii)的乙醛脱氢的合成基因DNA为模板扩增目的DNA片段,使用In-Fusion HD Cloning Kit等依次使DNA片段结合,克隆到质粒pUC19中而制成最终目的质粒。
表5
(5)来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶基因·ADH1基因导入和ADH2基因破坏用质粒
制作在ADH2基因座破坏ADH2基因、同时包含将来源于莱茵衣藻(Chlamydomonasreinhardtii)的乙醛脱氢酶基因和来源于酵母的醇脱氢酶1(ADH1)基因导入酵母所需的序列的质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-ChlaADH1-HOR7_P-URA3-3U_ADH2。
该质粒以包含5’侧添加了TDH3启动子的来源于酿酒酵母(Saccharomycescerevisiae)BY4742株的ADH1基因、添加了HOR7启动子并添加了DIT1终止子的来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢酶基因(NCBI登录号5729132,全合成将全长根据酵母的密码子使用频率而转换了密码子的序列而得的基因)、作为向酵母基因组上的同源重组区域的ADH2基因的从5’侧末端到上游约700bp的区域的基因序列(ADH2U)和ADH2基因从3’侧末端到下流的约800bp的区域的DNA序列(ADH2D)、以及作为标志物的包含URA3基因的基因序列(URA3标志物)的方式构建。
此外,本质粒所含的各DNA序列可以使用表6的引物来扩增。为了使各DNA片段结合,使用对表6的引物以与邻接DNA序列重复约15bp的方式添加了DNA序列的引物,以质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2或来源于莱茵衣藻(Chlamydomonas reinhardtii)的乙醛脱氢的合成基因DNA为模板扩增目的DNA片段,使用In-Fusion HD Cloning Kit等依次使DNA片段结合,克隆到质粒pUC19中而制成最终目的质粒。
表6
(6)mhpF基因导入用质粒
制作不在ADH2基因座破坏ADH2基因、并包含在ADH2基因座附近将来源于大肠杆菌(E.coli)的乙醛脱氢酶基因(mhpF)导入酵母所需的序列的质粒pUC-ADH2-T_CYC1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2。
该质粒以包含5’侧添加了HOR7启动子并添加了DIT1终止子的来源于酿酒酵母(Saccharomyces cerevisiae)BY4742株的mhpF基因(全合成将全长根据酵母的密码子使用频率而转换了密码子的序列而得的基因)、作为向酵母基因组上的同源重组区域的ADH2基因和ADH2基因从3’侧末端到下流的约800bp的区域的DNA序列(ADH2D)、以及作为标志物的包含URA3基因的基因序列(URA3标志物)的方式构建。
此外,本质粒所含的各DNA序列可以使用表7的引物来扩增。为了使各DNA片段结合,使用对表7的引物以与邻接DNA序列重复约15bp的方式添加了DNA序列的引物,以质粒pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2或酿酒酵母(Saccharomyces cerevisiae)BY4742基因组为模板扩增目的DNA片段,使用In-Fusion HDCloning Kit等依次使DNA片段结合,克隆到质粒pUC19中而制成最终目的质粒。
表7
<载体导入酵母株的制作>
用添加了5-氟乳清酸的培养基筛选作为2倍体酵母的酿酒酵母(Saccharomycescerevisiae)OC2株(NBRC2260)(Boeke,J.D.,et al.1987Methods Enzymol.;154:164-75.),以成为尿嘧啶营养缺陷型的菌株(OC2U)作为宿主。酵母的转化使用Frozen-EZ YeastTransformation II(ZYMO RESEARCH),按照附带的实验步骤进行。
使用将上述(1)中制作的质粒:pUC-5U_GRE3-P_HOR7-TKL1-TAL1-FBA1_P-P_ADH1-RPE1-RKI1-TEF1_P-P_TDH1-XI_N337C-T_DIT1-P_TDH3-XKS1-T_HIS3-LoxP-G418-LoxP-3U_GRE3的同源重组部位用PCR扩增而得的片段,进行OC2U株的转化,涂布于包含G418的YPD琼脂培养基,将生长的集落纯化。将纯化后的选择株命名为Uz1252株。使本菌株在胞子形成培养基(1%磷酸钾、0.1%酵母提取物、0.05%葡萄糖、2%琼脂)中形成胞子,利用同宗配合性进行2倍化。获得在作为2倍体的染色体的GRE3基因座区域插入了突变型XI基因、TKL1基因、TAL1基因、RPE1基因、RKI1基因和XKS1基因、且GRE3基因被破坏的菌株。将此作为Uz1252-3株。
并且,使用将上述(2)中制作的质粒:pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2、上述(3)中制作的质粒:pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-adhE-HOR7_P-URA3-3U_ADH2、上述(4)中制作的质粒:pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-CloADH-HOR7_P-URA3-3U_ADH2、上述(5)中制作的质粒:pUC-5U_ADH2-P_TDH3-ADH1-T_ADH1-DIT1_T-ChlaADH1-HOR7_P-URA3-3U_ADH2、和上述(6)中制作的质粒:pUC-ADH2-T_CYC1-DIT1_T-mhpF-HOR7_P-URA3-3U_ADH2的各质粒的同源重组部位之间用PCR扩增而得的片段,进行上述Uz1252-3株的转化,涂布于不含尿嘧啶的SD琼脂培养基,将生长的集落纯化。将纯化后的选择株分别命名为Uz1317株、Uz1298株、Uz1296株、Uz1330株、和Uz1320株。
此外,确认了这些全部菌株发生了杂合(1拷贝)重组。对于所得的Uz1317株、Uz1298株和Uz1296株,使其在胞子形成培养基中形成胞子,利用同宗配合性进行2倍化,将所得的菌株分别命名为Uz1319株、Uz1318株和Uz1311株。
另外,作为对照,使用以OC2基因组为模板用PCR扩增而得的尿嘧啶基因,进行OC2U株的转化,涂布于不含尿嘧啶的SD琼脂培养基,将生长的集落纯化,命名为Uz1313株。使Uz1313株在胞子形成培养基中形成胞子,利用同宗配合性进行2倍化,命名为Uz1323株。
此外,本实施例中制作的株的基因型归纳于表8。
表8
<发酵试验>
从如上制成的菌株中分别选择发酵能力高的2株,如下实施烧瓶发酵试验。首先,在分装有20ml葡萄糖浓度20g/L的YPD液体培养基(酵母提取物10g/L、胨20g/L、葡萄糖20g/L)的100ml容量的带挡板的烧瓶中接种供试株,以30℃120rpm进行24小时培养。集菌后,接种于分装有8ml D60X80YPAc4培养基(葡萄糖60g/L、木糖80g/L、酵母提取物10g/L、胨20g/L、乙酸4g/L)或D40X80YPAc2培养基(葡萄糖40g/L、木糖80g/L、酵母提取物10g/L、胨20g/L、乙酸2g/L)的10ml容量的烧瓶中,通过振荡培养(80rpm、振幅35mm、30℃)进行发酵试验。此外,盖在烧瓶上的塞子是通有内径1.5mm针的橡胶制,通过在针的最前部安装止回阀来使烧瓶保持厌氧环境。
对于发酵液中的葡萄糖、木糖、乙醇,使用HPLC(LC-10A;岛津制作所),在下述条件测定。
柱:AminexHPX-87H
移动相:0.01N H2SO4
流量:0.6ml/min
温度:30℃
检测器:差示折射率检测器RID-10A
<发酵试验结果>
将使用D60X80YPAc4培养基、发酵时间为66小时的发酵试验(加入菌浓度0.3g干燥菌体/L)的结果示于表9和10。此外,表9和10所示的数据是独立获得的重组株3株的数据平均值。
表9
表10
将使用D40X80YPAc2培养基、发酵时间为42小时的发酵试验(加入菌浓度0.24g干燥菌体/L)的结果示于表11和12,另外将对于杂合导入株使用D40X80YPAc2培养基、发酵时间为42小时的发酵试验(加入菌浓度0.3g干燥菌体/L)的结果示于表13。此外,表11~13所示的数据是独立获得的重组株3株的数据平均值。
表11
表12
表13
如由表9~13可知的那样,与对照相比,杂合或纯合地破坏了ADH2、同时过表达ADH1和3种乙醛脱氢酶的任一种的菌株,木糖的同化速度大幅提高,乙酸的减少量也多,其结果乙醇的生产性提高。此外,关于乙酸的减少量,与ADH2的杂合导入株相比,ADH2的纯合导入株更多。另一方面,仅表达乙醛脱氢酶mhpF的菌株,木糖的同化速度降低,乙酸基本不减少,乙醇的生产性不提高。
本说明书中引用的全部出版物、专利和专利申请均直接作为参考而纳入本说明书中。
序列表
<110> 丰田自动车株式会社
<120> 使用了重组酵母的乙醇的制造方法
<130> PH-5535-PCT
<150> JP 2013-037501
<151> 2013-2-27
<150> JP 2014-036652
<151> 2014-2-27
<160> 100
<170> PatentIn 版本 3.5
<210> 1
<211> 951
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<220>
<221> CDS
<222> (1)..(951)
<400> 1
atg tca aag aga aaa gtt gct att atc ggt agt gga aac atc ggt act 48
Met Ser Lys Arg Lys Val Ala Ile Ile Gly Ser Gly Asn Ile Gly Thr
1 5 10 15
gat ttg atg atc aag atc ctt aga cat gga caa cac ttg gaa atg gct 96
Asp Leu Met Ile Lys Ile Leu Arg His Gly Gln His Leu Glu Met Ala
20 25 30
gtt atg gtc ggt atc gat cct cag tca gac gga ctt gct aga gcc aga 144
Val Met Val Gly Ile Asp Pro Gln Ser Asp Gly Leu Ala Arg Ala Arg
35 40 45
aga atg ggt gtt gct act aca cat gaa ggt gtt att gga ttg atg aac 192
Arg Met Gly Val Ala Thr Thr His Glu Gly Val Ile Gly Leu Met Asn
50 55 60
atg cca gag ttt gca gat att gac atc gtt ttc gat gct aca agt gca 240
Met Pro Glu Phe Ala Asp Ile Asp Ile Val Phe Asp Ala Thr Ser Ala
65 70 75 80
ggt gct cac gtt aag aat gac gct gcc ttg aga gaa gct aaa cct gat 288
Gly Ala His Val Lys Asn Asp Ala Ala Leu Arg Glu Ala Lys Pro Asp
85 90 95
att aga ttg atc gac ctt acc cca gca gct att gga cca tac tgt gtt 336
Ile Arg Leu Ile Asp Leu Thr Pro Ala Ala Ile Gly Pro Tyr Cys Val
100 105 110
cct gtt gtc aac ttg gag gcc aat gtt gat caa ctt aac gtt aat atg 384
Pro Val Val Asn Leu Glu Ala Asn Val Asp Gln Leu Asn Val Asn Met
115 120 125
gtc aca tgc ggt gga cag gct acc att cct atg gtt gcc gca gtc tct 432
Val Thr Cys Gly Gly Gln Ala Thr Ile Pro Met Val Ala Ala Val Ser
130 135 140
aga gtt gct aga gtc cat tat gcc gaa att atc gca tcc atc gct tca 480
Arg Val Ala Arg Val His Tyr Ala Glu Ile Ile Ala Ser Ile Ala Ser
145 150 155 160
aag agt gca ggt cca gga acc aga gct aac att gac gaa ttc act gag 528
Lys Ser Ala Gly Pro Gly Thr Arg Ala Asn Ile Asp Glu Phe Thr Glu
165 170 175
acc act tct aga gct atc gaa gtt gtc ggt gga gct gcc aag ggt aaa 576
Thr Thr Ser Arg Ala Ile Glu Val Val Gly Gly Ala Ala Lys Gly Lys
180 185 190
gcc att atc gtt ttg aat cct gca gag cca cct ctt atg atg aga gat 624
Ala Ile Ile Val Leu Asn Pro Ala Glu Pro Pro Leu Met Met Arg Asp
195 200 205
act gtt tac gtc ttg tct gac gaa gct tcc caa gat gac att gag gcc 672
Thr Val Tyr Val Leu Ser Asp Glu Ala Ser Gln Asp Asp Ile Glu Ala
210 215 220
tct atc aac gaa atg gcc gag gca gtt cag gct tac gtc cca ggt tat 720
Ser Ile Asn Glu Met Ala Glu Ala Val Gln Ala Tyr Val Pro Gly Tyr
225 230 235 240
aga ttg aag caa aga gtt cag ttt gag gtc att cca caa gat aaa cct 768
Arg Leu Lys Gln Arg Val Gln Phe Glu Val Ile Pro Gln Asp Lys Pro
245 250 255
gtt aat ttg cca ggt gtc gga cag ttc tcc gga ttg aaa aca gct gtt 816
Val Asn Leu Pro Gly Val Gly Gln Phe Ser Gly Leu Lys Thr Ala Val
260 265 270
tgg ctt gaa gtc gag ggt gca gct cac tac ttg cca gct tat gcc gga 864
Trp Leu Glu Val Glu Gly Ala Ala His Tyr Leu Pro Ala Tyr Ala Gly
275 280 285
aac ctt gac att atg act tct tcc gca ttg gct aca gcc gaa aag atg 912
Asn Leu Asp Ile Met Thr Ser Ser Ala Leu Ala Thr Ala Glu Lys Met
290 295 300
gct caa tct ctt gcc aga aaa gca gga gag gcc gca taa 951
Ala Gln Ser Leu Ala Arg Lys Ala Gly Glu Ala Ala
305 310 315
<210> 2
<211> 316
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 2
Met Ser Lys Arg Lys Val Ala Ile Ile Gly Ser Gly Asn Ile Gly Thr
1 5 10 15
Asp Leu Met Ile Lys Ile Leu Arg His Gly Gln His Leu Glu Met Ala
20 25 30
Val Met Val Gly Ile Asp Pro Gln Ser Asp Gly Leu Ala Arg Ala Arg
35 40 45
Arg Met Gly Val Ala Thr Thr His Glu Gly Val Ile Gly Leu Met Asn
50 55 60
Met Pro Glu Phe Ala Asp Ile Asp Ile Val Phe Asp Ala Thr Ser Ala
65 70 75 80
Gly Ala His Val Lys Asn Asp Ala Ala Leu Arg Glu Ala Lys Pro Asp
85 90 95
Ile Arg Leu Ile Asp Leu Thr Pro Ala Ala Ile Gly Pro Tyr Cys Val
100 105 110
Pro Val Val Asn Leu Glu Ala Asn Val Asp Gln Leu Asn Val Asn Met
115 120 125
Val Thr Cys Gly Gly Gln Ala Thr Ile Pro Met Val Ala Ala Val Ser
130 135 140
Arg Val Ala Arg Val His Tyr Ala Glu Ile Ile Ala Ser Ile Ala Ser
145 150 155 160
Lys Ser Ala Gly Pro Gly Thr Arg Ala Asn Ile Asp Glu Phe Thr Glu
165 170 175
Thr Thr Ser Arg Ala Ile Glu Val Val Gly Gly Ala Ala Lys Gly Lys
180 185 190
Ala Ile Ile Val Leu Asn Pro Ala Glu Pro Pro Leu Met Met Arg Asp
195 200 205
Thr Val Tyr Val Leu Ser Asp Glu Ala Ser Gln Asp Asp Ile Glu Ala
210 215 220
Ser Ile Asn Glu Met Ala Glu Ala Val Gln Ala Tyr Val Pro Gly Tyr
225 230 235 240
Arg Leu Lys Gln Arg Val Gln Phe Glu Val Ile Pro Gln Asp Lys Pro
245 250 255
Val Asn Leu Pro Gly Val Gly Gln Phe Ser Gly Leu Lys Thr Ala Val
260 265 270
Trp Leu Glu Val Glu Gly Ala Ala His Tyr Leu Pro Ala Tyr Ala Gly
275 280 285
Asn Leu Asp Ile Met Thr Ser Ser Ala Leu Ala Thr Ala Glu Lys Met
290 295 300
Ala Gln Ser Leu Ala Arg Lys Ala Gly Glu Ala Ala
305 310 315
<210> 3
<211> 1320
<212> DNA
<213> 黄胸散白蚁(Reticulitermes speratus)肠内原生生物
<220>
<221> CDS
<222> (1)..(1320)
<400> 3
atg tct caa att ttt aag gat atc cca gtt att aaa tat gaa ggt cca 48
Met Ser Gln Ile Phe Lys Asp Ile Pro Val Ile Lys Tyr Glu Gly Pro
1 5 10 15
gct tcc aag aat cct ttg agt ttc aaa tac tac gat gca aac aag gtt 96
Ala Ser Lys Asn Pro Leu Ser Phe Lys Tyr Tyr Asp Ala Asn Lys Val
20 25 30
att gat ggt aaa cca atg aag gaa cat ttg aga tac gca atg gct tgg 144
Ile Asp Gly Lys Pro Met Lys Glu His Leu Arg Tyr Ala Met Ala Trp
35 40 45
tgg cat aat ttg tgt gct acc ggt caa gat atg ttt ggt cct ggt act 192
Trp His Asn Leu Cys Ala Thr Gly Gln Asp Met Phe Gly Pro Gly Thr
50 55 60
gca gat aaa tcc ttc ggt agt aag aca gtt ggt acc atg gaa cat gca 240
Ala Asp Lys Ser Phe Gly Ser Lys Thr Val Gly Thr Met Glu His Ala
65 70 75 80
cat gct aaa gtt gat gct ggt ttt gaa ttc atg tcc aag ttg ggt gtt 288
His Ala Lys Val Asp Ala Gly Phe Glu Phe Met Ser Lys Leu Gly Val
85 90 95
gaa tac ttc tgt ttc cat gat gct gat ttg gtt cca gaa gca gat act 336
Glu Tyr Phe Cys Phe His Asp Ala Asp Leu Val Pro Glu Ala Asp Thr
100 105 110
ttg agt gaa aca aac aaa aga ttg gat gaa atc gct gaa cat atc gtt 384
Leu Ser Glu Thr Asn Lys Arg Leu Asp Glu Ile Ala Glu His Ile Val
115 120 125
gct aag caa aag gca act ggt att aaa tgt ttg tgg ggt aca gca aat 432
Ala Lys Gln Lys Ala Thr Gly Ile Lys Cys Leu Trp Gly Thr Ala Asn
130 135 140
ttg ttt tct aac cct aga ttc tta aat ggt tct ggt tct tca aac tca 480
Leu Phe Ser Asn Pro Arg Phe Leu Asn Gly Ser Gly Ser Ser Asn Ser
145 150 155 160
gct gat gtt tat gca tac gct gca gct caa att aaa aag gct ttg gat 528
Ala Asp Val Tyr Ala Tyr Ala Ala Ala Gln Ile Lys Lys Ala Leu Asp
165 170 175
ttg act gtt aaa ttt ggt ggt gtt ggt tat gtt ttc tgg ggt ggt aga 576
Leu Thr Val Lys Phe Gly Gly Val Gly Tyr Val Phe Trp Gly Gly Arg
180 185 190
gaa ggt tac gaa acc ttg ttg aac act gat gtt aag ttc gaa caa gaa 624
Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Val Lys Phe Glu Gln Glu
195 200 205
aac atc gct aac ttg atg cat ttg gca gtt act tac ggt aga tca atc 672
Asn Ile Ala Asn Leu Met His Leu Ala Val Thr Tyr Gly Arg Ser Ile
210 215 220
ggt ttt aaa ggt gac ttc tac att gaa cca aaa cct aag gaa cca aca 720
Gly Phe Lys Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr
225 230 235 240
aag cat caa tat gat ttt gat gca gct act aca att ggt ttc att aga 768
Lys His Gln Tyr Asp Phe Asp Ala Ala Thr Thr Ile Gly Phe Ile Arg
245 250 255
caa tac ggt ttg gaa aag gat ttc aag ttg aac atc gaa gca aac cat 816
Gln Tyr Gly Leu Glu Lys Asp Phe Lys Leu Asn Ile Glu Ala Asn His
260 265 270
gct aca tta gca ggt cat acc ttc caa cat gat ttg aga atc tct gct 864
Ala Thr Leu Ala Gly His Thr Phe Gln His Asp Leu Arg Ile Ser Ala
275 280 285
att aat ggc atg tta ggt tca gtt gat gca aac aca ggt gac cca ttg 912
Ile Asn Gly Met Leu Gly Ser Val Asp Ala Asn Thr Gly Asp Pro Leu
290 295 300
tta ggt tgg gat acc gat gaa ttt cct tat tcc gtt tac gat acc act 960
Leu Gly Trp Asp Thr Asp Glu Phe Pro Tyr Ser Val Tyr Asp Thr Thr
305 310 315 320
ttg gct atg tac gaa att att aag gca ggt ggt ttg acc ggt ggt ttg 1008
Leu Ala Met Tyr Glu Ile Ile Lys Ala Gly Gly Leu Thr Gly Gly Leu
325 330 335
aat ttt gat tcc aag gtt aga aga cca agt tac aca cat gaa gat ttg 1056
Asn Phe Asp Ser Lys Val Arg Arg Pro Ser Tyr Thr His Glu Asp Leu
340 345 350
ttt tac ggt ttc att ttg ggt atg gat tct ttc gct ttg ggt ttg att 1104
Phe Tyr Gly Phe Ile Leu Gly Met Asp Ser Phe Ala Leu Gly Leu Ile
355 360 365
aaa gca aag gct ttg att gca gat ggt aga ttg gat tca ttc gtt aag 1152
Lys Ala Lys Ala Leu Ile Ala Asp Gly Arg Leu Asp Ser Phe Val Lys
370 375 380
gat aga tac gct tct tac ggt tca ggt att ggt gct aag att aga gat 1200
Asp Arg Tyr Ala Ser Tyr Gly Ser Gly Ile Gly Ala Lys Ile Arg Asp
385 390 395 400
cat tct gca act ttg gaa gaa tta gca gct tat gca tta gct aaa gat 1248
His Ser Ala Thr Leu Glu Glu Leu Ala Ala Tyr Ala Leu Ala Lys Asp
405 410 415
aca gtt gct ttg cct ggt tcc ggt aga caa gaa tac tta gaa agt att 1296
Thr Val Ala Leu Pro Gly Ser Gly Arg Gln Glu Tyr Leu Glu Ser Ile
420 425 430
att aac caa att ttg ttt caa taa 1320
Ile Asn Gln Ile Leu Phe Gln
435
<210> 4
<211> 439
<212> PRT
<213> 黄胸散白蚁(Reticulitermes speratus)肠内原生生物
<400> 4
Met Ser Gln Ile Phe Lys Asp Ile Pro Val Ile Lys Tyr Glu Gly Pro
1 5 10 15
Ala Ser Lys Asn Pro Leu Ser Phe Lys Tyr Tyr Asp Ala Asn Lys Val
20 25 30
Ile Asp Gly Lys Pro Met Lys Glu His Leu Arg Tyr Ala Met Ala Trp
35 40 45
Trp His Asn Leu Cys Ala Thr Gly Gln Asp Met Phe Gly Pro Gly Thr
50 55 60
Ala Asp Lys Ser Phe Gly Ser Lys Thr Val Gly Thr Met Glu His Ala
65 70 75 80
His Ala Lys Val Asp Ala Gly Phe Glu Phe Met Ser Lys Leu Gly Val
85 90 95
Glu Tyr Phe Cys Phe His Asp Ala Asp Leu Val Pro Glu Ala Asp Thr
100 105 110
Leu Ser Glu Thr Asn Lys Arg Leu Asp Glu Ile Ala Glu His Ile Val
115 120 125
Ala Lys Gln Lys Ala Thr Gly Ile Lys Cys Leu Trp Gly Thr Ala Asn
130 135 140
Leu Phe Ser Asn Pro Arg Phe Leu Asn Gly Ser Gly Ser Ser Asn Ser
145 150 155 160
Ala Asp Val Tyr Ala Tyr Ala Ala Ala Gln Ile Lys Lys Ala Leu Asp
165 170 175
Leu Thr Val Lys Phe Gly Gly Val Gly Tyr Val Phe Trp Gly Gly Arg
180 185 190
Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Val Lys Phe Glu Gln Glu
195 200 205
Asn Ile Ala Asn Leu Met His Leu Ala Val Thr Tyr Gly Arg Ser Ile
210 215 220
Gly Phe Lys Gly Asp Phe Tyr Ile Glu Pro Lys Pro Lys Glu Pro Thr
225 230 235 240
Lys His Gln Tyr Asp Phe Asp Ala Ala Thr Thr Ile Gly Phe Ile Arg
245 250 255
Gln Tyr Gly Leu Glu Lys Asp Phe Lys Leu Asn Ile Glu Ala Asn His
260 265 270
Ala Thr Leu Ala Gly His Thr Phe Gln His Asp Leu Arg Ile Ser Ala
275 280 285
Ile Asn Gly Met Leu Gly Ser Val Asp Ala Asn Thr Gly Asp Pro Leu
290 295 300
Leu Gly Trp Asp Thr Asp Glu Phe Pro Tyr Ser Val Tyr Asp Thr Thr
305 310 315 320
Leu Ala Met Tyr Glu Ile Ile Lys Ala Gly Gly Leu Thr Gly Gly Leu
325 330 335
Asn Phe Asp Ser Lys Val Arg Arg Pro Ser Tyr Thr His Glu Asp Leu
340 345 350
Phe Tyr Gly Phe Ile Leu Gly Met Asp Ser Phe Ala Leu Gly Leu Ile
355 360 365
Lys Ala Lys Ala Leu Ile Ala Asp Gly Arg Leu Asp Ser Phe Val Lys
370 375 380
Asp Arg Tyr Ala Ser Tyr Gly Ser Gly Ile Gly Ala Lys Ile Arg Asp
385 390 395 400
His Ser Ala Thr Leu Glu Glu Leu Ala Ala Tyr Ala Leu Ala Lys Asp
405 410 415
Thr Val Ala Leu Pro Gly Ser Gly Arg Gln Glu Tyr Leu Glu Ser Ile
420 425 430
Ile Asn Gln Ile Leu Phe Gln
435
<210> 5
<211> 1047
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(1047)
<400> 5
atg tct atc cca gaa act caa aaa ggt gtt atc ttc tac gaa tcc cac 48
Met Ser Ile Pro Glu Thr Gln Lys Gly Val Ile Phe Tyr Glu Ser His
1 5 10 15
ggt aag ttg gaa tac aaa gat att cca gtt cca aag cca aag gcc aac 96
Gly Lys Leu Glu Tyr Lys Asp Ile Pro Val Pro Lys Pro Lys Ala Asn
20 25 30
gaa ttg ttg atc aac gtt aaa tac tct ggt gtc tgt cac act gac ttg 144
Glu Leu Leu Ile Asn Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu
35 40 45
cac gct tgg cac ggt gac tgg cca ttg cca gtt aag cta cca tta gtc 192
His Ala Trp His Gly Asp Trp Pro Leu Pro Val Lys Leu Pro Leu Val
50 55 60
ggt ggt cac gaa ggt gcc ggt gtc gtt gtc ggc atg ggt gaa aac gtt 240
Gly Gly His Glu Gly Ala Gly Val Val Val Gly Met Gly Glu Asn Val
65 70 75 80
aag ggc tgg aag atc ggt gac tac gcc ggt atc aaa tgg ttg aac ggt 288
Lys Gly Trp Lys Ile Gly Asp Tyr Ala Gly Ile Lys Trp Leu Asn Gly
85 90 95
tct tgt atg gcc tgt gaa tac tgt gaa ttg ggt aac gaa tcc aac tgt 336
Ser Cys Met Ala Cys Glu Tyr Cys Glu Leu Gly Asn Glu Ser Asn Cys
100 105 110
cct cac gct gac ttg tct ggt tac acc cac gac ggt tct ttc caa caa 384
Pro His Ala Asp Leu Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Gln
115 120 125
tac gct acc gct gac gct gtt caa gcc gct cac att cct caa ggt acc 432
Tyr Ala Thr Ala Asp Ala Val Gln Ala Ala His Ile Pro Gln Gly Thr
130 135 140
gac ttg gcc caa gtc gcc ccc atc ttg tgt gct ggt atc acc gtc tac 480
Asp Leu Ala Gln Val Ala Pro Ile Leu Cys Ala Gly Ile Thr Val Tyr
145 150 155 160
aag gct ttg aag tct gct aac ttg atg gcc ggt cac tgg gtt gct atc 528
Lys Ala Leu Lys Ser Ala Asn Leu Met Ala Gly His Trp Val Ala Ile
165 170 175
tcc ggt gct gct ggt ggt cta ggt tct ttg gct gtt caa tac gcc aag 576
Ser Gly Ala Ala Gly Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Lys
180 185 190
gct atg ggt tac aga gtc ttg ggt att gac ggt ggt gaa ggt aag gaa 624
Ala Met Gly Tyr Arg Val Leu Gly Ile Asp Gly Gly Glu Gly Lys Glu
195 200 205
gaa tta ttc aga tcc atc ggt ggt gaa gtc ttc att gac ttc act aag 672
Glu Leu Phe Arg Ser Ile Gly Gly Glu Val Phe Ile Asp Phe Thr Lys
210 215 220
gaa aag gac att gtc ggt gct gtt cta aag gcc act gac ggt ggt gct 720
Glu Lys Asp Ile Val Gly Ala Val Leu Lys Ala Thr Asp Gly Gly Ala
225 230 235 240
cac ggt gtc atc aac gtt tcc gtt tcc gaa gcc gct att gaa gct tct 768
His Gly Val Ile Asn Val Ser Val Ser Glu Ala Ala Ile Glu Ala Ser
245 250 255
acc aga tac gtt aga gct aac ggt acc acc gtt ttg gtc ggt atg cca 816
Thr Arg Tyr Val Arg Ala Asn Gly Thr Thr Val Leu Val Gly Met Pro
260 265 270
gct ggt gcc aag tgt tgt tct gat gtc ttc aac caa gtc gtc aag tcc 864
Ala Gly Ala Lys Cys Cys Ser Asp Val Phe Asn Gln Val Val Lys Ser
275 280 285
atc tct att gtt ggt tct tac gtc ggt aac aga gct gac acc aga gaa 912
Ile Ser Ile Val Gly Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu
290 295 300
gct ttg gac ttc ttc gcc aga ggt ttg gtc aag tct cca atc aag gtt 960
Ala Leu Asp Phe Phe Ala Arg Gly Leu Val Lys Ser Pro Ile Lys Val
305 310 315 320
gtc ggc ttg tct acc ttg cca gaa att tac gaa aag atg gaa aag ggt 1008
Val Gly Leu Ser Thr Leu Pro Glu Ile Tyr Glu Lys Met Glu Lys Gly
325 330 335
caa atc gtt ggt aga tac gtt gtt gac act tct aaa taa 1047
Gln Ile Val Gly Arg Tyr Val Val Asp Thr Ser Lys
340 345
<210> 6
<211> 348
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 6
Met Ser Ile Pro Glu Thr Gln Lys Gly Val Ile Phe Tyr Glu Ser His
1 5 10 15
Gly Lys Leu Glu Tyr Lys Asp Ile Pro Val Pro Lys Pro Lys Ala Asn
20 25 30
Glu Leu Leu Ile Asn Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu
35 40 45
His Ala Trp His Gly Asp Trp Pro Leu Pro Val Lys Leu Pro Leu Val
50 55 60
Gly Gly His Glu Gly Ala Gly Val Val Val Gly Met Gly Glu Asn Val
65 70 75 80
Lys Gly Trp Lys Ile Gly Asp Tyr Ala Gly Ile Lys Trp Leu Asn Gly
85 90 95
Ser Cys Met Ala Cys Glu Tyr Cys Glu Leu Gly Asn Glu Ser Asn Cys
100 105 110
Pro His Ala Asp Leu Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Gln
115 120 125
Tyr Ala Thr Ala Asp Ala Val Gln Ala Ala His Ile Pro Gln Gly Thr
130 135 140
Asp Leu Ala Gln Val Ala Pro Ile Leu Cys Ala Gly Ile Thr Val Tyr
145 150 155 160
Lys Ala Leu Lys Ser Ala Asn Leu Met Ala Gly His Trp Val Ala Ile
165 170 175
Ser Gly Ala Ala Gly Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Lys
180 185 190
Ala Met Gly Tyr Arg Val Leu Gly Ile Asp Gly Gly Glu Gly Lys Glu
195 200 205
Glu Leu Phe Arg Ser Ile Gly Gly Glu Val Phe Ile Asp Phe Thr Lys
210 215 220
Glu Lys Asp Ile Val Gly Ala Val Leu Lys Ala Thr Asp Gly Gly Ala
225 230 235 240
His Gly Val Ile Asn Val Ser Val Ser Glu Ala Ala Ile Glu Ala Ser
245 250 255
Thr Arg Tyr Val Arg Ala Asn Gly Thr Thr Val Leu Val Gly Met Pro
260 265 270
Ala Gly Ala Lys Cys Cys Ser Asp Val Phe Asn Gln Val Val Lys Ser
275 280 285
Ile Ser Ile Val Gly Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu
290 295 300
Ala Leu Asp Phe Phe Ala Arg Gly Leu Val Lys Ser Pro Ile Lys Val
305 310 315 320
Val Gly Leu Ser Thr Leu Pro Glu Ile Tyr Glu Lys Met Glu Lys Gly
325 330 335
Gln Ile Val Gly Arg Tyr Val Val Asp Thr Ser Lys
340 345
<210> 7
<211> 1047
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(1047)
<400> 7
atg tct att cca gaa act caa aaa gcc att atc ttc tac gaa tcc aac 48
Met Ser Ile Pro Glu Thr Gln Lys Ala Ile Ile Phe Tyr Glu Ser Asn
1 5 10 15
ggc aag ttg gag cat aag gat atc cca gtt cca aag cca aag ccc aac 96
Gly Lys Leu Glu His Lys Asp Ile Pro Val Pro Lys Pro Lys Pro Asn
20 25 30
gaa ttg tta atc aac gtc aag tac tct ggt gtc tgc cac acc gat ttg 144
Glu Leu Leu Ile Asn Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu
35 40 45
cac gct tgg cat ggt gac tgg cca ttg cca act aag tta cca tta gtt 192
His Ala Trp His Gly Asp Trp Pro Leu Pro Thr Lys Leu Pro Leu Val
50 55 60
ggt ggt cac gaa ggt gcc ggt gtc gtt gtc ggc atg ggt gaa aac gtt 240
Gly Gly His Glu Gly Ala Gly Val Val Val Gly Met Gly Glu Asn Val
65 70 75 80
aag ggc tgg aag atc ggt gac tac gcc ggt atc aaa tgg ttg aac ggt 288
Lys Gly Trp Lys Ile Gly Asp Tyr Ala Gly Ile Lys Trp Leu Asn Gly
85 90 95
tct tgt atg gcc tgt gaa tac tgt gaa ttg ggt aac gaa tcc aac tgt 336
Ser Cys Met Ala Cys Glu Tyr Cys Glu Leu Gly Asn Glu Ser Asn Cys
100 105 110
cct cac gct gac ttg tct ggt tac acc cac gac ggt tct ttc caa gaa 384
Pro His Ala Asp Leu Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Glu
115 120 125
tac gct acc gct gac gct gtt caa gcc gct cac att cct caa ggt act 432
Tyr Ala Thr Ala Asp Ala Val Gln Ala Ala His Ile Pro Gln Gly Thr
130 135 140
gac ttg gct gaa gtc gcg cca atc ttg tgt gct ggt atc acc gta tac 480
Asp Leu Ala Glu Val Ala Pro Ile Leu Cys Ala Gly Ile Thr Val Tyr
145 150 155 160
aag gct ttg aag tct gcc aac ttg aga gca ggc cac tgg gcg gcc att 528
Lys Ala Leu Lys Ser Ala Asn Leu Arg Ala Gly His Trp Ala Ala Ile
165 170 175
tct ggt gct gct ggt ggt cta ggt tct ttg gct gtt caa tat gct aag 576
Ser Gly Ala Ala Gly Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Lys
180 185 190
gcg atg ggt tac aga gtc tta ggt att gat ggt ggt cca gga aag gaa 624
Ala Met Gly Tyr Arg Val Leu Gly Ile Asp Gly Gly Pro Gly Lys Glu
195 200 205
gaa ttg ttt acc tcg ctc ggt ggt gaa gta ttc atc gac ttc acc aaa 672
Glu Leu Phe Thr Ser Leu Gly Gly Glu Val Phe Ile Asp Phe Thr Lys
210 215 220
gag aag gac att gtt agc gca gtc gtt aag gct acc aac ggc ggt gcc 720
Glu Lys Asp Ile Val Ser Ala Val Val Lys Ala Thr Asn Gly Gly Ala
225 230 235 240
cac ggt atc atc aat gtt tcc gtt tcc gaa gcc gct atc gaa gct tct 768
His Gly Ile Ile Asn Val Ser Val Ser Glu Ala Ala Ile Glu Ala Ser
245 250 255
acc aga tac tgt agg gcg aac ggt act gtt gtc ttg gtt ggt ttg cca 816
Thr Arg Tyr Cys Arg Ala Asn Gly Thr Val Val Leu Val Gly Leu Pro
260 265 270
gcc ggt gca aag tgc tcc tct gat gtc ttc aac cac gtt gtc aag tct 864
Ala Gly Ala Lys Cys Ser Ser Asp Val Phe Asn His Val Val Lys Ser
275 280 285
atc tcc att gtc ggc tct tac gtg ggg aac aga gct gat acc aga gaa 912
Ile Ser Ile Val Gly Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu
290 295 300
gcc tta gat ttc ttt gcc aga ggt cta gtc aag tct cca ata aag gta 960
Ala Leu Asp Phe Phe Ala Arg Gly Leu Val Lys Ser Pro Ile Lys Val
305 310 315 320
gtt ggc tta tcc agt tta cca gaa att tac gaa aag atg gag aag ggc 1008
Val Gly Leu Ser Ser Leu Pro Glu Ile Tyr Glu Lys Met Glu Lys Gly
325 330 335
caa att gct ggt aga tac gtt gtt gac act tct aaa taa 1047
Gln Ile Ala Gly Arg Tyr Val Val Asp Thr Ser Lys
340 345
<210> 8
<211> 348
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 8
Met Ser Ile Pro Glu Thr Gln Lys Ala Ile Ile Phe Tyr Glu Ser Asn
1 5 10 15
Gly Lys Leu Glu His Lys Asp Ile Pro Val Pro Lys Pro Lys Pro Asn
20 25 30
Glu Leu Leu Ile Asn Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu
35 40 45
His Ala Trp His Gly Asp Trp Pro Leu Pro Thr Lys Leu Pro Leu Val
50 55 60
Gly Gly His Glu Gly Ala Gly Val Val Val Gly Met Gly Glu Asn Val
65 70 75 80
Lys Gly Trp Lys Ile Gly Asp Tyr Ala Gly Ile Lys Trp Leu Asn Gly
85 90 95
Ser Cys Met Ala Cys Glu Tyr Cys Glu Leu Gly Asn Glu Ser Asn Cys
100 105 110
Pro His Ala Asp Leu Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Glu
115 120 125
Tyr Ala Thr Ala Asp Ala Val Gln Ala Ala His Ile Pro Gln Gly Thr
130 135 140
Asp Leu Ala Glu Val Ala Pro Ile Leu Cys Ala Gly Ile Thr Val Tyr
145 150 155 160
Lys Ala Leu Lys Ser Ala Asn Leu Arg Ala Gly His Trp Ala Ala Ile
165 170 175
Ser Gly Ala Ala Gly Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Lys
180 185 190
Ala Met Gly Tyr Arg Val Leu Gly Ile Asp Gly Gly Pro Gly Lys Glu
195 200 205
Glu Leu Phe Thr Ser Leu Gly Gly Glu Val Phe Ile Asp Phe Thr Lys
210 215 220
Glu Lys Asp Ile Val Ser Ala Val Val Lys Ala Thr Asn Gly Gly Ala
225 230 235 240
His Gly Ile Ile Asn Val Ser Val Ser Glu Ala Ala Ile Glu Ala Ser
245 250 255
Thr Arg Tyr Cys Arg Ala Asn Gly Thr Val Val Leu Val Gly Leu Pro
260 265 270
Ala Gly Ala Lys Cys Ser Ser Asp Val Phe Asn His Val Val Lys Ser
275 280 285
Ile Ser Ile Val Gly Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu
290 295 300
Ala Leu Asp Phe Phe Ala Arg Gly Leu Val Lys Ser Pro Ile Lys Val
305 310 315 320
Val Gly Leu Ser Ser Leu Pro Glu Ile Tyr Glu Lys Met Glu Lys Gly
325 330 335
Gln Ile Ala Gly Arg Tyr Val Val Asp Thr Ser Lys
340 345
<210> 9
<211> 1803
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(1803)
<400> 9
atg ttg tgt tca gta att cag aga cag aca aga gag gtt tcc aac aca 48
Met Leu Cys Ser Val Ile Gln Arg Gln Thr Arg Glu Val Ser Asn Thr
1 5 10 15
atg tct tta gac tca tac tat ctt ggg ttt gat ctt tcg acc caa caa 96
Met Ser Leu Asp Ser Tyr Tyr Leu Gly Phe Asp Leu Ser Thr Gln Gln
20 25 30
ctg aaa tgt ctc gcc att aac cag gac cta aaa att gtc cat tca gaa 144
Leu Lys Cys Leu Ala Ile Asn Gln Asp Leu Lys Ile Val His Ser Glu
35 40 45
aca gtg gaa ttt gaa aag gat ctt ccg cat tat cac aca aag aag ggt 192
Thr Val Glu Phe Glu Lys Asp Leu Pro His Tyr His Thr Lys Lys Gly
50 55 60
gtc tat ata cac ggc gac act atc gaa tgt ccc gta gcc atg tgg tta 240
Val Tyr Ile His Gly Asp Thr Ile Glu Cys Pro Val Ala Met Trp Leu
65 70 75 80
gag gct cta gat ctg gtt ctc tcg aaa tat cgc gag gct aaa ttt cca 288
Glu Ala Leu Asp Leu Val Leu Ser Lys Tyr Arg Glu Ala Lys Phe Pro
85 90 95
ttg aac aaa gtt atg gcc gtc tca ggg tcc tgc cag cag cac ggg tct 336
Leu Asn Lys Val Met Ala Val Ser Gly Ser Cys Gln Gln His Gly Ser
100 105 110
gtc tac tgg tcc tcc caa gcc gaa tct ctg tta gag caa ttg aat aag 384
Val Tyr Trp Ser Ser Gln Ala Glu Ser Leu Leu Glu Gln Leu Asn Lys
115 120 125
aaa ccg gaa aaa gat tta ttg cac tac gtg agc tct gta gca ttt gca 432
Lys Pro Glu Lys Asp Leu Leu His Tyr Val Ser Ser Val Ala Phe Ala
130 135 140
agg caa acc gcc ccc aat tgg caa gac cac agt act gca aag caa tgt 480
Arg Gln Thr Ala Pro Asn Trp Gln Asp His Ser Thr Ala Lys Gln Cys
145 150 155 160
caa gag ttt gaa gag tgc ata ggt ggg cct gaa aaa atg gct caa tta 528
Gln Glu Phe Glu Glu Cys Ile Gly Gly Pro Glu Lys Met Ala Gln Leu
165 170 175
aca ggg tcc aga gcc cat ttt aga ttt act ggt cct caa att ctg aaa 576
Thr Gly Ser Arg Ala His Phe Arg Phe Thr Gly Pro Gln Ile Leu Lys
180 185 190
att gca caa tta gaa cca gaa gct tac gaa aaa aca aag acc att tct 624
Ile Ala Gln Leu Glu Pro Glu Ala Tyr Glu Lys Thr Lys Thr Ile Ser
195 200 205
tta gtg tct aat ttt ttg act tct atc tta gtg ggc cat ctt gtt gaa 672
Leu Val Ser Asn Phe Leu Thr Ser Ile Leu Val Gly His Leu Val Glu
210 215 220
tta gag gag gca gat gcc tgt ggt atg aac ctt tat gat ata cgt gaa 720
Leu Glu Glu Ala Asp Ala Cys Gly Met Asn Leu Tyr Asp Ile Arg Glu
225 230 235 240
aga aaa ttc agt gat gag cta cta cat cta att gat agt tct tct aag 768
Arg Lys Phe Ser Asp Glu Leu Leu His Leu Ile Asp Ser Ser Ser Lys
245 250 255
gat aaa act atc aga caa aaa tta atg aga gca ccc atg aaa aat ttg 816
Asp Lys Thr Ile Arg Gln Lys Leu Met Arg Ala Pro Met Lys Asn Leu
260 265 270
ata gcg ggt acc atc tgt aaa tat ttt att gag aag tac ggt ttc aat 864
Ile Ala Gly Thr Ile Cys Lys Tyr Phe Ile Glu Lys Tyr Gly Phe Asn
275 280 285
aca aac tgc aag gtc tct ccc atg act ggg gat aat tta gcc act ata 912
Thr Asn Cys Lys Val Ser Pro Met Thr Gly Asp Asn Leu Ala Thr Ile
290 295 300
tgt tct tta ccc ctg cgg aag aat gac gtt ctc gtt tcc cta gga aca 960
Cys Ser Leu Pro Leu Arg Lys Asn Asp Val Leu Val Ser Leu Gly Thr
305 310 315 320
agt act aca gtt ctt ctg gtc acc gat aag tat cac ccc tct ccg aac 1008
Ser Thr Thr Val Leu Leu Val Thr Asp Lys Tyr His Pro Ser Pro Asn
325 330 335
tat cat ctt ttc att cat cca act ctg cca aac cat tat atg ggt atg 1056
Tyr His Leu Phe Ile His Pro Thr Leu Pro Asn His Tyr Met Gly Met
340 345 350
att tgt tat tgt aat ggt tct ttg gca agg gag agg ata aga gac gag 1104
Ile Cys Tyr Cys Asn Gly Ser Leu Ala Arg Glu Arg Ile Arg Asp Glu
355 360 365
tta aac aaa gaa cgg gaa aat aat tat gag aag act aac gat tgg act 1152
Leu Asn Lys Glu Arg Glu Asn Asn Tyr Glu Lys Thr Asn Asp Trp Thr
370 375 380
ctt ttt aat caa gct gtg cta gat gac tca gaa agt agt gaa aat gaa 1200
Leu Phe Asn Gln Ala Val Leu Asp Asp Ser Glu Ser Ser Glu Asn Glu
385 390 395 400
tta ggt gta tat ttt cct ctg ggg gag atc gtt cct agc gta aaa gcc 1248
Leu Gly Val Tyr Phe Pro Leu Gly Glu Ile Val Pro Ser Val Lys Ala
405 410 415
ata aac aaa agg gtt atc ttc aat cca aaa acg ggt atg att gaa aga 1296
Ile Asn Lys Arg Val Ile Phe Asn Pro Lys Thr Gly Met Ile Glu Arg
420 425 430
gag gtg gcc aag ttc aaa gac aag agg cac gat gcc aaa aat att gta 1344
Glu Val Ala Lys Phe Lys Asp Lys Arg His Asp Ala Lys Asn Ile Val
435 440 445
gaa tca cag gct tta agt tgc agg gta aga ata tct ccc ctg ctt tcg 1392
Glu Ser Gln Ala Leu Ser Cys Arg Val Arg Ile Ser Pro Leu Leu Ser
450 455 460
gat tca aac gca agc tca caa cag aga ctg aac gaa gat aca atc gtg 1440
Asp Ser Asn Ala Ser Ser Gln Gln Arg Leu Asn Glu Asp Thr Ile Val
465 470 475 480
aag ttt gat tac gat gaa tct ccg ctg cgg gac tac cta aat aaa agg 1488
Lys Phe Asp Tyr Asp Glu Ser Pro Leu Arg Asp Tyr Leu Asn Lys Arg
485 490 495
cca gaa agg act ttt ttt gta ggt ggg gct tct aaa aac gat gct att 1536
Pro Glu Arg Thr Phe Phe Val Gly Gly Ala Ser Lys Asn Asp Ala Ile
500 505 510
gtg aag aag ttt gct caa gtc att ggt gct aca aag ggt aat ttt agg 1584
Val Lys Lys Phe Ala Gln Val Ile Gly Ala Thr Lys Gly Asn Phe Arg
515 520 525
cta gaa aca cca aac tca tgt gcc ctt ggt ggt tgt tat aag gcc atg 1632
Leu Glu Thr Pro Asn Ser Cys Ala Leu Gly Gly Cys Tyr Lys Ala Met
530 535 540
tgg tca ttg tta tat gac tct aat aaa att gca gtt cct ttt gat aaa 1680
Trp Ser Leu Leu Tyr Asp Ser Asn Lys Ile Ala Val Pro Phe Asp Lys
545 550 555 560
ttt ctg aat gac aat ttt cca tgg cat gta atg gaa agc ata tcc gat 1728
Phe Leu Asn Asp Asn Phe Pro Trp His Val Met Glu Ser Ile Ser Asp
565 570 575
gtg gat aat gaa aat tgg gat cgc tat aat tcc aag att gtc ccc tta 1776
Val Asp Asn Glu Asn Trp Asp Arg Tyr Asn Ser Lys Ile Val Pro Leu
580 585 590
agc gaa ctg gaa aag act ctc atc taa 1803
Ser Glu Leu Glu Lys Thr Leu Ile
595 600
<210> 10
<211> 600
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 10
Met Leu Cys Ser Val Ile Gln Arg Gln Thr Arg Glu Val Ser Asn Thr
1 5 10 15
Met Ser Leu Asp Ser Tyr Tyr Leu Gly Phe Asp Leu Ser Thr Gln Gln
20 25 30
Leu Lys Cys Leu Ala Ile Asn Gln Asp Leu Lys Ile Val His Ser Glu
35 40 45
Thr Val Glu Phe Glu Lys Asp Leu Pro His Tyr His Thr Lys Lys Gly
50 55 60
Val Tyr Ile His Gly Asp Thr Ile Glu Cys Pro Val Ala Met Trp Leu
65 70 75 80
Glu Ala Leu Asp Leu Val Leu Ser Lys Tyr Arg Glu Ala Lys Phe Pro
85 90 95
Leu Asn Lys Val Met Ala Val Ser Gly Ser Cys Gln Gln His Gly Ser
100 105 110
Val Tyr Trp Ser Ser Gln Ala Glu Ser Leu Leu Glu Gln Leu Asn Lys
115 120 125
Lys Pro Glu Lys Asp Leu Leu His Tyr Val Ser Ser Val Ala Phe Ala
130 135 140
Arg Gln Thr Ala Pro Asn Trp Gln Asp His Ser Thr Ala Lys Gln Cys
145 150 155 160
Gln Glu Phe Glu Glu Cys Ile Gly Gly Pro Glu Lys Met Ala Gln Leu
165 170 175
Thr Gly Ser Arg Ala His Phe Arg Phe Thr Gly Pro Gln Ile Leu Lys
180 185 190
Ile Ala Gln Leu Glu Pro Glu Ala Tyr Glu Lys Thr Lys Thr Ile Ser
195 200 205
Leu Val Ser Asn Phe Leu Thr Ser Ile Leu Val Gly His Leu Val Glu
210 215 220
Leu Glu Glu Ala Asp Ala Cys Gly Met Asn Leu Tyr Asp Ile Arg Glu
225 230 235 240
Arg Lys Phe Ser Asp Glu Leu Leu His Leu Ile Asp Ser Ser Ser Lys
245 250 255
Asp Lys Thr Ile Arg Gln Lys Leu Met Arg Ala Pro Met Lys Asn Leu
260 265 270
Ile Ala Gly Thr Ile Cys Lys Tyr Phe Ile Glu Lys Tyr Gly Phe Asn
275 280 285
Thr Asn Cys Lys Val Ser Pro Met Thr Gly Asp Asn Leu Ala Thr Ile
290 295 300
Cys Ser Leu Pro Leu Arg Lys Asn Asp Val Leu Val Ser Leu Gly Thr
305 310 315 320
Ser Thr Thr Val Leu Leu Val Thr Asp Lys Tyr His Pro Ser Pro Asn
325 330 335
Tyr His Leu Phe Ile His Pro Thr Leu Pro Asn His Tyr Met Gly Met
340 345 350
Ile Cys Tyr Cys Asn Gly Ser Leu Ala Arg Glu Arg Ile Arg Asp Glu
355 360 365
Leu Asn Lys Glu Arg Glu Asn Asn Tyr Glu Lys Thr Asn Asp Trp Thr
370 375 380
Leu Phe Asn Gln Ala Val Leu Asp Asp Ser Glu Ser Ser Glu Asn Glu
385 390 395 400
Leu Gly Val Tyr Phe Pro Leu Gly Glu Ile Val Pro Ser Val Lys Ala
405 410 415
Ile Asn Lys Arg Val Ile Phe Asn Pro Lys Thr Gly Met Ile Glu Arg
420 425 430
Glu Val Ala Lys Phe Lys Asp Lys Arg His Asp Ala Lys Asn Ile Val
435 440 445
Glu Ser Gln Ala Leu Ser Cys Arg Val Arg Ile Ser Pro Leu Leu Ser
450 455 460
Asp Ser Asn Ala Ser Ser Gln Gln Arg Leu Asn Glu Asp Thr Ile Val
465 470 475 480
Lys Phe Asp Tyr Asp Glu Ser Pro Leu Arg Asp Tyr Leu Asn Lys Arg
485 490 495
Pro Glu Arg Thr Phe Phe Val Gly Gly Ala Ser Lys Asn Asp Ala Ile
500 505 510
Val Lys Lys Phe Ala Gln Val Ile Gly Ala Thr Lys Gly Asn Phe Arg
515 520 525
Leu Glu Thr Pro Asn Ser Cys Ala Leu Gly Gly Cys Tyr Lys Ala Met
530 535 540
Trp Ser Leu Leu Tyr Asp Ser Asn Lys Ile Ala Val Pro Phe Asp Lys
545 550 555 560
Phe Leu Asn Asp Asn Phe Pro Trp His Val Met Glu Ser Ile Ser Asp
565 570 575
Val Asp Asn Glu Asn Trp Asp Arg Tyr Asn Ser Lys Ile Val Pro Leu
580 585 590
Ser Glu Leu Glu Lys Thr Leu Ile
595 600
<210> 11
<211> 1008
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(1008)
<400> 11
atg tct gaa cca gct caa aag aaa caa aag gtt gct aac aac tct cta 48
Met Ser Glu Pro Ala Gln Lys Lys Gln Lys Val Ala Asn Asn Ser Leu
1 5 10 15
gaa caa ttg aaa gcc tcc ggc act gtc gtt gtt gcc gac act ggt gat 96
Glu Gln Leu Lys Ala Ser Gly Thr Val Val Val Ala Asp Thr Gly Asp
20 25 30
ttc ggc tct att gcc aag ttt caa cct caa gac tcc aca act aac cca 144
Phe Gly Ser Ile Ala Lys Phe Gln Pro Gln Asp Ser Thr Thr Asn Pro
35 40 45
tca ttg atc ttg gct gct gcc aag caa cca act tac gcc aag ttg atc 192
Ser Leu Ile Leu Ala Ala Ala Lys Gln Pro Thr Tyr Ala Lys Leu Ile
50 55 60
gat gtt gcc gtg gaa tac ggt aag aag cat ggt aag acc acc gaa gaa 240
Asp Val Ala Val Glu Tyr Gly Lys Lys His Gly Lys Thr Thr Glu Glu
65 70 75 80
caa gtc gaa aat gct gtg gac aga ttg tta gtc gaa ttc ggt aag gag 288
Gln Val Glu Asn Ala Val Asp Arg Leu Leu Val Glu Phe Gly Lys Glu
85 90 95
atc tta aag att gtt cca ggc aga gtc tcc acc gaa gtt gat gct aga 336
Ile Leu Lys Ile Val Pro Gly Arg Val Ser Thr Glu Val Asp Ala Arg
100 105 110
ttg tct ttt gac act caa gct acc att gaa aag gct aga cat atc att 384
Leu Ser Phe Asp Thr Gln Ala Thr Ile Glu Lys Ala Arg His Ile Ile
115 120 125
aaa ttg ttt gaa caa gaa ggt gtc tcc aag gaa aga gtc ctt att aaa 432
Lys Leu Phe Glu Gln Glu Gly Val Ser Lys Glu Arg Val Leu Ile Lys
130 135 140
att gct tcc act tgg gaa ggt att caa gct gcc aaa gaa ttg gaa gaa 480
Ile Ala Ser Thr Trp Glu Gly Ile Gln Ala Ala Lys Glu Leu Glu Glu
145 150 155 160
aag gac ggt atc cac tgt aat ttg act cta tta ttc tcc ttc gtt caa 528
Lys Asp Gly Ile His Cys Asn Leu Thr Leu Leu Phe Ser Phe Val Gln
165 170 175
gca gtt gcc tgt gcc gag gcc caa gtt act ttg att tcc cca ttt gtt 576
Ala Val Ala Cys Ala Glu Ala Gln Val Thr Leu Ile Ser Pro Phe Val
180 185 190
ggt aga att cta gac tgg tac aaa tcc agc act ggt aaa gat tac aag 624
Gly Arg Ile Leu Asp Trp Tyr Lys Ser Ser Thr Gly Lys Asp Tyr Lys
195 200 205
ggt gaa gcc gac cca ggt gtt att tcc gtc aag aaa atc tac aac tac 672
Gly Glu Ala Asp Pro Gly Val Ile Ser Val Lys Lys Ile Tyr Asn Tyr
210 215 220
tac aag aag tac ggt tac aag act att gtt atg ggt gct tct ttc aga 720
Tyr Lys Lys Tyr Gly Tyr Lys Thr Ile Val Met Gly Ala Ser Phe Arg
225 230 235 240
agc act gac gaa atc aaa aac ttg gct ggt gtt gac tat cta aca att 768
Ser Thr Asp Glu Ile Lys Asn Leu Ala Gly Val Asp Tyr Leu Thr Ile
245 250 255
tct cca gct tta ttg gac aag ttg atg aac agt act gaa cct ttc cca 816
Ser Pro Ala Leu Leu Asp Lys Leu Met Asn Ser Thr Glu Pro Phe Pro
260 265 270
aga gtt ttg gac cct gtc tcc gct aag aag gaa gcc ggc gac aag att 864
Arg Val Leu Asp Pro Val Ser Ala Lys Lys Glu Ala Gly Asp Lys Ile
275 280 285
tct tac atc agc gac gaa tct aaa ttc aga ttc gac ttg aat gaa gac 912
Ser Tyr Ile Ser Asp Glu Ser Lys Phe Arg Phe Asp Leu Asn Glu Asp
290 295 300
gct atg gcc act gaa aaa ttg tcc gaa ggt atc aga aaa ttc tct gcc 960
Ala Met Ala Thr Glu Lys Leu Ser Glu Gly Ile Arg Lys Phe Ser Ala
305 310 315 320
gat att gtt act cta ttc gac ttg att gaa aag aaa gtt acc gct taa 1008
Asp Ile Val Thr Leu Phe Asp Leu Ile Glu Lys Lys Val Thr Ala
325 330 335
<210> 12
<211> 335
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 12
Met Ser Glu Pro Ala Gln Lys Lys Gln Lys Val Ala Asn Asn Ser Leu
1 5 10 15
Glu Gln Leu Lys Ala Ser Gly Thr Val Val Val Ala Asp Thr Gly Asp
20 25 30
Phe Gly Ser Ile Ala Lys Phe Gln Pro Gln Asp Ser Thr Thr Asn Pro
35 40 45
Ser Leu Ile Leu Ala Ala Ala Lys Gln Pro Thr Tyr Ala Lys Leu Ile
50 55 60
Asp Val Ala Val Glu Tyr Gly Lys Lys His Gly Lys Thr Thr Glu Glu
65 70 75 80
Gln Val Glu Asn Ala Val Asp Arg Leu Leu Val Glu Phe Gly Lys Glu
85 90 95
Ile Leu Lys Ile Val Pro Gly Arg Val Ser Thr Glu Val Asp Ala Arg
100 105 110
Leu Ser Phe Asp Thr Gln Ala Thr Ile Glu Lys Ala Arg His Ile Ile
115 120 125
Lys Leu Phe Glu Gln Glu Gly Val Ser Lys Glu Arg Val Leu Ile Lys
130 135 140
Ile Ala Ser Thr Trp Glu Gly Ile Gln Ala Ala Lys Glu Leu Glu Glu
145 150 155 160
Lys Asp Gly Ile His Cys Asn Leu Thr Leu Leu Phe Ser Phe Val Gln
165 170 175
Ala Val Ala Cys Ala Glu Ala Gln Val Thr Leu Ile Ser Pro Phe Val
180 185 190
Gly Arg Ile Leu Asp Trp Tyr Lys Ser Ser Thr Gly Lys Asp Tyr Lys
195 200 205
Gly Glu Ala Asp Pro Gly Val Ile Ser Val Lys Lys Ile Tyr Asn Tyr
210 215 220
Tyr Lys Lys Tyr Gly Tyr Lys Thr Ile Val Met Gly Ala Ser Phe Arg
225 230 235 240
Ser Thr Asp Glu Ile Lys Asn Leu Ala Gly Val Asp Tyr Leu Thr Ile
245 250 255
Ser Pro Ala Leu Leu Asp Lys Leu Met Asn Ser Thr Glu Pro Phe Pro
260 265 270
Arg Val Leu Asp Pro Val Ser Ala Lys Lys Glu Ala Gly Asp Lys Ile
275 280 285
Ser Tyr Ile Ser Asp Glu Ser Lys Phe Arg Phe Asp Leu Asn Glu Asp
290 295 300
Ala Met Ala Thr Glu Lys Leu Ser Glu Gly Ile Arg Lys Phe Ser Ala
305 310 315 320
Asp Ile Val Thr Leu Phe Asp Leu Ile Glu Lys Lys Val Thr Ala
325 330 335
<210> 13
<211> 2043
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(2043)
<400> 13
atg act caa ttc act gac att gat aag cta gcc gtc tcc acc ata aga 48
Met Thr Gln Phe Thr Asp Ile Asp Lys Leu Ala Val Ser Thr Ile Arg
1 5 10 15
att ttg gct gtg gac acc gta tcc aag gcc aac tca ggt cac cca ggt 96
Ile Leu Ala Val Asp Thr Val Ser Lys Ala Asn Ser Gly His Pro Gly
20 25 30
gct cca ttg ggt atg gca cca gct gca cac gtt cta tgg agt caa atg 144
Ala Pro Leu Gly Met Ala Pro Ala Ala His Val Leu Trp Ser Gln Met
35 40 45
cgc atg aac cca acc aac cca gac tgg atc aac aga gat aga ttt gtc 192
Arg Met Asn Pro Thr Asn Pro Asp Trp Ile Asn Arg Asp Arg Phe Val
50 55 60
ttg tct aac ggt cac gcg gtc gct ttg ttg tat tct atg cta cat ttg 240
Leu Ser Asn Gly His Ala Val Ala Leu Leu Tyr Ser Met Leu His Leu
65 70 75 80
act ggt tac gat ctg tct att gaa gac ttg aaa cag ttc aga cag ttg 288
Thr Gly Tyr Asp Leu Ser Ile Glu Asp Leu Lys Gln Phe Arg Gln Leu
85 90 95
ggt tcc aga aca cca ggt cat cct gaa ttt gag ttg cca ggt gtt gaa 336
Gly Ser Arg Thr Pro Gly His Pro Glu Phe Glu Leu Pro Gly Val Glu
100 105 110
gtt act acc ggt cca tta ggt caa ggt atc tcc aac gct gtt ggt atg 384
Val Thr Thr Gly Pro Leu Gly Gln Gly Ile Ser Asn Ala Val Gly Met
115 120 125
gcc atg gct caa gct aac ctg gct gcc act tac aac aag ccg ggc ttt 432
Ala Met Ala Gln Ala Asn Leu Ala Ala Thr Tyr Asn Lys Pro Gly Phe
130 135 140
acc ttg tct gac aac tac acc tat gtt ttc ttg ggt gac ggt tgt ttg 480
Thr Leu Ser Asp Asn Tyr Thr Tyr Val Phe Leu Gly Asp Gly Cys Leu
145 150 155 160
caa gaa ggt att tct tca gaa gct tcc tcc ttg gct ggt cat ttg aaa 528
Gln Glu Gly Ile Ser Ser Glu Ala Ser Ser Leu Ala Gly His Leu Lys
165 170 175
ttg ggt aac ttg att gcc atc tac gat gac aac aag atc act atc gat 576
Leu Gly Asn Leu Ile Ala Ile Tyr Asp Asp Asn Lys Ile Thr Ile Asp
180 185 190
ggt gct acc agt atc tca ttc gat gaa gat gtt gct aag aga tac gaa 624
Gly Ala Thr Ser Ile Ser Phe Asp Glu Asp Val Ala Lys Arg Tyr Glu
195 200 205
gcc tac ggt tgg gaa gtt ttg tac gta gaa aat ggt aac gaa gat cta 672
Ala Tyr Gly Trp Glu Val Leu Tyr Val Glu Asn Gly Asn Glu Asp Leu
210 215 220
gcc ggt att gcc aag gct att gct caa gct aag tta tcc aag gac aaa 720
Ala Gly Ile Ala Lys Ala Ile Ala Gln Ala Lys Leu Ser Lys Asp Lys
225 230 235 240
cca act ttg atc aaa atg acc aca acc att ggt tac ggt tcc ttg cat 768
Pro Thr Leu Ile Lys Met Thr Thr Thr Ile Gly Tyr Gly Ser Leu His
245 250 255
gcc ggc tct cac tct gtg cac ggt gcc cca ttg aaa gca gat gat gtt 816
Ala Gly Ser His Ser Val His Gly Ala Pro Leu Lys Ala Asp Asp Val
260 265 270
aaa caa cta aag agc aaa ttc ggt ttc aac cca gac aag tcc ttt gtt 864
Lys Gln Leu Lys Ser Lys Phe Gly Phe Asn Pro Asp Lys Ser Phe Val
275 280 285
gtt cca caa gaa gtt tac gac cac tac caa aag aca att tta aag cca 912
Val Pro Gln Glu Val Tyr Asp His Tyr Gln Lys Thr Ile Leu Lys Pro
290 295 300
ggt gtc gaa gcc aac aac aag tgg aac aag ttg ttc agc gaa tac caa 960
Gly Val Glu Ala Asn Asn Lys Trp Asn Lys Leu Phe Ser Glu Tyr Gln
305 310 315 320
aag aaa ttc cca gaa tta ggt gct gaa ttg gct aga aga ttg agc ggc 1008
Lys Lys Phe Pro Glu Leu Gly Ala Glu Leu Ala Arg Arg Leu Ser Gly
325 330 335
caa cta ccc gca aat tgg gaa tct aag ttg cca act tac acc gcc aag 1056
Gln Leu Pro Ala Asn Trp Glu Ser Lys Leu Pro Thr Tyr Thr Ala Lys
340 345 350
gac tct gcc gtg gcc act aga aaa tta tca gaa act gtt ctt gag gat 1104
Asp Ser Ala Val Ala Thr Arg Lys Leu Ser Glu Thr Val Leu Glu Asp
355 360 365
gtt tac aat caa ttg cca gag ttg att ggt ggt tct gcc gat tta aca 1152
Val Tyr Asn Gln Leu Pro Glu Leu Ile Gly Gly Ser Ala Asp Leu Thr
370 375 380
cct tct aac ttg acc aga tgg aag gaa gcc ctt gac ttc caa cct cct 1200
Pro Ser Asn Leu Thr Arg Trp Lys Glu Ala Leu Asp Phe Gln Pro Pro
385 390 395 400
tct tcc ggt tca ggt aac tac tct ggt aga tac att agg tac ggt att 1248
Ser Ser Gly Ser Gly Asn Tyr Ser Gly Arg Tyr Ile Arg Tyr Gly Ile
405 410 415
aga gaa cac gct atg ggt gcc ata atg aac ggt att tca gct ttc ggt 1296
Arg Glu His Ala Met Gly Ala Ile Met Asn Gly Ile Ser Ala Phe Gly
420 425 430
gcc aac tac aaa cca tac ggt ggt act ttc ttg aac ttc gtt tct tat 1344
Ala Asn Tyr Lys Pro Tyr Gly Gly Thr Phe Leu Asn Phe Val Ser Tyr
435 440 445
gct gct ggt gcc gtt aga ttg tcc gct ttg tct ggc cac cca gtt att 1392
Ala Ala Gly Ala Val Arg Leu Ser Ala Leu Ser Gly His Pro Val Ile
450 455 460
tgg gtt gct aca cat gac tct atc ggt gtc ggt gaa gat ggt cca aca 1440
Trp Val Ala Thr His Asp Ser Ile Gly Val Gly Glu Asp Gly Pro Thr
465 470 475 480
cat caa cct att gaa act tta gca cac ttc aga tcc cta cca aac att 1488
His Gln Pro Ile Glu Thr Leu Ala His Phe Arg Ser Leu Pro Asn Ile
485 490 495
caa gtt tgg aga cca gct gat ggt aac gaa gtt tct gcc gcc tac aag 1536
Gln Val Trp Arg Pro Ala Asp Gly Asn Glu Val Ser Ala Ala Tyr Lys
500 505 510
aac tct tta gaa tcc aag cat act cca agt atc att gct ttg tcc aga 1584
Asn Ser Leu Glu Ser Lys His Thr Pro Ser Ile Ile Ala Leu Ser Arg
515 520 525
caa aac ttg cca caa ttg gaa ggt agc tct att gaa agc gct tct aag 1632
Gln Asn Leu Pro Gln Leu Glu Gly Ser Ser Ile Glu Ser Ala Ser Lys
530 535 540
ggt ggt tac gta cta caa gat gtt gct aac cca gat att att tta gtg 1680
Gly Gly Tyr Val Leu Gln Asp Val Ala Asn Pro Asp Ile Ile Leu Val
545 550 555 560
gct act ggt tcc gaa gtg tct ttg agt gtt gaa gct gct aag act ttg 1728
Ala Thr Gly Ser Glu Val Ser Leu Ser Val Glu Ala Ala Lys Thr Leu
565 570 575
gcc gca aag aac atc aag gct cgt gtt gtt tct cta cca gat ttc ttc 1776
Ala Ala Lys Asn Ile Lys Ala Arg Val Val Ser Leu Pro Asp Phe Phe
580 585 590
act ttt gac aaa caa ccc cta gaa tac aga cta tca gtc tta cca gac 1824
Thr Phe Asp Lys Gln Pro Leu Glu Tyr Arg Leu Ser Val Leu Pro Asp
595 600 605
aac gtt cca atc atg tct gtt gaa gtt ttg gct acc aca tgt tgg ggc 1872
Asn Val Pro Ile Met Ser Val Glu Val Leu Ala Thr Thr Cys Trp Gly
610 615 620
aaa tac gct cat caa tcc ttc ggt att gac aga ttt ggt gcc tcc ggt 1920
Lys Tyr Ala His Gln Ser Phe Gly Ile Asp Arg Phe Gly Ala Ser Gly
625 630 635 640
aag gca cca gaa gtc ttc aag ttc ttc ggt ttc acc cca gaa ggt gtt 1968
Lys Ala Pro Glu Val Phe Lys Phe Phe Gly Phe Thr Pro Glu Gly Val
645 650 655
gct gaa aga gct caa aag acc att gca ttc tat aag ggt gac aag cta 2016
Ala Glu Arg Ala Gln Lys Thr Ile Ala Phe Tyr Lys Gly Asp Lys Leu
660 665 670
att tct cct ttg aaa aaa gct ttc taa 2043
Ile Ser Pro Leu Lys Lys Ala Phe
675 680
<210> 14
<211> 680
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 14
Met Thr Gln Phe Thr Asp Ile Asp Lys Leu Ala Val Ser Thr Ile Arg
1 5 10 15
Ile Leu Ala Val Asp Thr Val Ser Lys Ala Asn Ser Gly His Pro Gly
20 25 30
Ala Pro Leu Gly Met Ala Pro Ala Ala His Val Leu Trp Ser Gln Met
35 40 45
Arg Met Asn Pro Thr Asn Pro Asp Trp Ile Asn Arg Asp Arg Phe Val
50 55 60
Leu Ser Asn Gly His Ala Val Ala Leu Leu Tyr Ser Met Leu His Leu
65 70 75 80
Thr Gly Tyr Asp Leu Ser Ile Glu Asp Leu Lys Gln Phe Arg Gln Leu
85 90 95
Gly Ser Arg Thr Pro Gly His Pro Glu Phe Glu Leu Pro Gly Val Glu
100 105 110
Val Thr Thr Gly Pro Leu Gly Gln Gly Ile Ser Asn Ala Val Gly Met
115 120 125
Ala Met Ala Gln Ala Asn Leu Ala Ala Thr Tyr Asn Lys Pro Gly Phe
130 135 140
Thr Leu Ser Asp Asn Tyr Thr Tyr Val Phe Leu Gly Asp Gly Cys Leu
145 150 155 160
Gln Glu Gly Ile Ser Ser Glu Ala Ser Ser Leu Ala Gly His Leu Lys
165 170 175
Leu Gly Asn Leu Ile Ala Ile Tyr Asp Asp Asn Lys Ile Thr Ile Asp
180 185 190
Gly Ala Thr Ser Ile Ser Phe Asp Glu Asp Val Ala Lys Arg Tyr Glu
195 200 205
Ala Tyr Gly Trp Glu Val Leu Tyr Val Glu Asn Gly Asn Glu Asp Leu
210 215 220
Ala Gly Ile Ala Lys Ala Ile Ala Gln Ala Lys Leu Ser Lys Asp Lys
225 230 235 240
Pro Thr Leu Ile Lys Met Thr Thr Thr Ile Gly Tyr Gly Ser Leu His
245 250 255
Ala Gly Ser His Ser Val His Gly Ala Pro Leu Lys Ala Asp Asp Val
260 265 270
Lys Gln Leu Lys Ser Lys Phe Gly Phe Asn Pro Asp Lys Ser Phe Val
275 280 285
Val Pro Gln Glu Val Tyr Asp His Tyr Gln Lys Thr Ile Leu Lys Pro
290 295 300
Gly Val Glu Ala Asn Asn Lys Trp Asn Lys Leu Phe Ser Glu Tyr Gln
305 310 315 320
Lys Lys Phe Pro Glu Leu Gly Ala Glu Leu Ala Arg Arg Leu Ser Gly
325 330 335
Gln Leu Pro Ala Asn Trp Glu Ser Lys Leu Pro Thr Tyr Thr Ala Lys
340 345 350
Asp Ser Ala Val Ala Thr Arg Lys Leu Ser Glu Thr Val Leu Glu Asp
355 360 365
Val Tyr Asn Gln Leu Pro Glu Leu Ile Gly Gly Ser Ala Asp Leu Thr
370 375 380
Pro Ser Asn Leu Thr Arg Trp Lys Glu Ala Leu Asp Phe Gln Pro Pro
385 390 395 400
Ser Ser Gly Ser Gly Asn Tyr Ser Gly Arg Tyr Ile Arg Tyr Gly Ile
405 410 415
Arg Glu His Ala Met Gly Ala Ile Met Asn Gly Ile Ser Ala Phe Gly
420 425 430
Ala Asn Tyr Lys Pro Tyr Gly Gly Thr Phe Leu Asn Phe Val Ser Tyr
435 440 445
Ala Ala Gly Ala Val Arg Leu Ser Ala Leu Ser Gly His Pro Val Ile
450 455 460
Trp Val Ala Thr His Asp Ser Ile Gly Val Gly Glu Asp Gly Pro Thr
465 470 475 480
His Gln Pro Ile Glu Thr Leu Ala His Phe Arg Ser Leu Pro Asn Ile
485 490 495
Gln Val Trp Arg Pro Ala Asp Gly Asn Glu Val Ser Ala Ala Tyr Lys
500 505 510
Asn Ser Leu Glu Ser Lys His Thr Pro Ser Ile Ile Ala Leu Ser Arg
515 520 525
Gln Asn Leu Pro Gln Leu Glu Gly Ser Ser Ile Glu Ser Ala Ser Lys
530 535 540
Gly Gly Tyr Val Leu Gln Asp Val Ala Asn Pro Asp Ile Ile Leu Val
545 550 555 560
Ala Thr Gly Ser Glu Val Ser Leu Ser Val Glu Ala Ala Lys Thr Leu
565 570 575
Ala Ala Lys Asn Ile Lys Ala Arg Val Val Ser Leu Pro Asp Phe Phe
580 585 590
Thr Phe Asp Lys Gln Pro Leu Glu Tyr Arg Leu Ser Val Leu Pro Asp
595 600 605
Asn Val Pro Ile Met Ser Val Glu Val Leu Ala Thr Thr Cys Trp Gly
610 615 620
Lys Tyr Ala His Gln Ser Phe Gly Ile Asp Arg Phe Gly Ala Ser Gly
625 630 635 640
Lys Ala Pro Glu Val Phe Lys Phe Phe Gly Phe Thr Pro Glu Gly Val
645 650 655
Ala Glu Arg Ala Gln Lys Thr Ile Ala Phe Tyr Lys Gly Asp Lys Leu
660 665 670
Ile Ser Pro Leu Lys Lys Ala Phe
675 680
<210> 15
<211> 717
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(717)
<400> 15
atg gtc aaa cca att ata gct ccc agt atc ctt gct tct gac ttc gcc 48
Met Val Lys Pro Ile Ile Ala Pro Ser Ile Leu Ala Ser Asp Phe Ala
1 5 10 15
aac ttg ggt tgc gaa tgt cat aag gtc atc aac gcc ggc gca gat tgg 96
Asn Leu Gly Cys Glu Cys His Lys Val Ile Asn Ala Gly Ala Asp Trp
20 25 30
tta cat atc gat gtc atg gac ggc cat ttt gtt cca aac att act ctg 144
Leu His Ile Asp Val Met Asp Gly His Phe Val Pro Asn Ile Thr Leu
35 40 45
ggc caa cca att gtt acc tcc cta cgt cgt tct gtg cca cgc cct ggc 192
Gly Gln Pro Ile Val Thr Ser Leu Arg Arg Ser Val Pro Arg Pro Gly
50 55 60
gat gct agc aac aca gaa aag aag ccc act gcg ttc ttc gat tgt cac 240
Asp Ala Ser Asn Thr Glu Lys Lys Pro Thr Ala Phe Phe Asp Cys His
65 70 75 80
atg atg gtt gaa aat cct gaa aaa tgg gtc gac gat ttt gct aaa tgt 288
Met Met Val Glu Asn Pro Glu Lys Trp Val Asp Asp Phe Ala Lys Cys
85 90 95
ggt gct gac caa ttt acg ttc cac tac gag gcc aca caa gac cct ttg 336
Gly Ala Asp Gln Phe Thr Phe His Tyr Glu Ala Thr Gln Asp Pro Leu
100 105 110
cat tta gtt aag ttg att aag tct aag ggc atc aaa gct gca tgc gcc 384
His Leu Val Lys Leu Ile Lys Ser Lys Gly Ile Lys Ala Ala Cys Ala
115 120 125
atc aaa cct ggt act tct gtt gac gtt tta ttt gaa cta gct cct cat 432
Ile Lys Pro Gly Thr Ser Val Asp Val Leu Phe Glu Leu Ala Pro His
130 135 140
ttg gat atg gct ctt gtt atg act gtg gaa cct ggg ttt gga ggc caa 480
Leu Asp Met Ala Leu Val Met Thr Val Glu Pro Gly Phe Gly Gly Gln
145 150 155 160
aaa ttc atg gaa gac atg atg cca aaa gtg gaa act ttg aga gcc aag 528
Lys Phe Met Glu Asp Met Met Pro Lys Val Glu Thr Leu Arg Ala Lys
165 170 175
ttc ccc cat ttg aat atc caa gtc gat ggt ggt ttg ggc aag gag acc 576
Phe Pro His Leu Asn Ile Gln Val Asp Gly Gly Leu Gly Lys Glu Thr
180 185 190
atc ccg aaa gcc gcc aaa gcc ggt gcc aac gtt att gtc gct ggt acc 624
Ile Pro Lys Ala Ala Lys Ala Gly Ala Asn Val Ile Val Ala Gly Thr
195 200 205
agt gtt ttc act gca gct gac ccg cac gat gtt atc tcc ttc atg aaa 672
Ser Val Phe Thr Ala Ala Asp Pro His Asp Val Ile Ser Phe Met Lys
210 215 220
gaa gaa gtc tcg aag gaa ttg cgt tct aga gat ttg cta gat tag 717
Glu Glu Val Ser Lys Glu Leu Arg Ser Arg Asp Leu Leu Asp
225 230 235
<210> 16
<211> 238
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 16
Met Val Lys Pro Ile Ile Ala Pro Ser Ile Leu Ala Ser Asp Phe Ala
1 5 10 15
Asn Leu Gly Cys Glu Cys His Lys Val Ile Asn Ala Gly Ala Asp Trp
20 25 30
Leu His Ile Asp Val Met Asp Gly His Phe Val Pro Asn Ile Thr Leu
35 40 45
Gly Gln Pro Ile Val Thr Ser Leu Arg Arg Ser Val Pro Arg Pro Gly
50 55 60
Asp Ala Ser Asn Thr Glu Lys Lys Pro Thr Ala Phe Phe Asp Cys His
65 70 75 80
Met Met Val Glu Asn Pro Glu Lys Trp Val Asp Asp Phe Ala Lys Cys
85 90 95
Gly Ala Asp Gln Phe Thr Phe His Tyr Glu Ala Thr Gln Asp Pro Leu
100 105 110
His Leu Val Lys Leu Ile Lys Ser Lys Gly Ile Lys Ala Ala Cys Ala
115 120 125
Ile Lys Pro Gly Thr Ser Val Asp Val Leu Phe Glu Leu Ala Pro His
130 135 140
Leu Asp Met Ala Leu Val Met Thr Val Glu Pro Gly Phe Gly Gly Gln
145 150 155 160
Lys Phe Met Glu Asp Met Met Pro Lys Val Glu Thr Leu Arg Ala Lys
165 170 175
Phe Pro His Leu Asn Ile Gln Val Asp Gly Gly Leu Gly Lys Glu Thr
180 185 190
Ile Pro Lys Ala Ala Lys Ala Gly Ala Asn Val Ile Val Ala Gly Thr
195 200 205
Ser Val Phe Thr Ala Ala Asp Pro His Asp Val Ile Ser Phe Met Lys
210 215 220
Glu Glu Val Ser Lys Glu Leu Arg Ser Arg Asp Leu Leu Asp
225 230 235
<210> 17
<211> 777
<212> DNA
<213> 酿酒酵母(Saccharomyces cerevisiae)
<220>
<221> CDS
<222> (1)..(777)
<400> 17
atg gct gcc ggt gtc cca aaa att gat gcg tta gaa tct ttg ggc aat 48
Met Ala Ala Gly Val Pro Lys Ile Asp Ala Leu Glu Ser Leu Gly Asn
1 5 10 15
cct ttg gag gat gcc aag aga gct gca gca tac aga gca gtt gat gaa 96
Pro Leu Glu Asp Ala Lys Arg Ala Ala Ala Tyr Arg Ala Val Asp Glu
20 25 30
aat tta aaa ttt gat gat cac aaa att att gga att ggt agt ggt agc 144
Asn Leu Lys Phe Asp Asp His Lys Ile Ile Gly Ile Gly Ser Gly Ser
35 40 45
aca gtg gtt tat gtt gcc gaa aga att gga caa tat ttg cat gac cct 192
Thr Val Val Tyr Val Ala Glu Arg Ile Gly Gln Tyr Leu His Asp Pro
50 55 60
aaa ttt tat gaa gta gcg tct aaa ttc att tgc att cca aca gga ttc 240
Lys Phe Tyr Glu Val Ala Ser Lys Phe Ile Cys Ile Pro Thr Gly Phe
65 70 75 80
caa tca aga aac ttg att ttg gat aac aag ttg caa tta ggc tcc att 288
Gln Ser Arg Asn Leu Ile Leu Asp Asn Lys Leu Gln Leu Gly Ser Ile
85 90 95
gaa cag tat cct cgc att gat ata gcg ttt gac ggt gct gat gaa gtg 336
Glu Gln Tyr Pro Arg Ile Asp Ile Ala Phe Asp Gly Ala Asp Glu Val
100 105 110
gat gag aat tta caa tta att aaa ggt ggt ggt gct tgt cta ttt caa 384
Asp Glu Asn Leu Gln Leu Ile Lys Gly Gly Gly Ala Cys Leu Phe Gln
115 120 125
gaa aaa ttg gtt agt act agt gct aaa acc ttc att gtc gtt gct gat 432
Glu Lys Leu Val Ser Thr Ser Ala Lys Thr Phe Ile Val Val Ala Asp
130 135 140
tca aga aaa aag tca cca aaa cat tta ggt aag aac tgg agg caa ggt 480
Ser Arg Lys Lys Ser Pro Lys His Leu Gly Lys Asn Trp Arg Gln Gly
145 150 155 160
gtt ccc att gaa att gta cct tcc tca tac gtg agg gtc aag aat gat 528
Val Pro Ile Glu Ile Val Pro Ser Ser Tyr Val Arg Val Lys Asn Asp
165 170 175
cta tta gaa caa ttg cat gct gaa aaa gtt gac atc aga caa gga ggt 576
Leu Leu Glu Gln Leu His Ala Glu Lys Val Asp Ile Arg Gln Gly Gly
180 185 190
tct gct aaa gca ggt cct gtt gta act gac aat aat aac ttc att atc 624
Ser Ala Lys Ala Gly Pro Val Val Thr Asp Asn Asn Asn Phe Ile Ile
195 200 205
gat gcg gat ttc ggt gaa att tcc gat cca aga aaa ttg cat aga gaa 672
Asp Ala Asp Phe Gly Glu Ile Ser Asp Pro Arg Lys Leu His Arg Glu
210 215 220
atc aaa ctg tta gtg ggc gtg gtg gaa aca ggt tta ttc atc gac aac 720
Ile Lys Leu Leu Val Gly Val Val Glu Thr Gly Leu Phe Ile Asp Asn
225 230 235 240
gct tca aaa gcc tac ttc ggt aat tct gac ggt agt gtt gaa gtt acc 768
Ala Ser Lys Ala Tyr Phe Gly Asn Ser Asp Gly Ser Val Glu Val Thr
245 250 255
gaa aag tga 777
Glu Lys
<210> 18
<211> 258
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 18
Met Ala Ala Gly Val Pro Lys Ile Asp Ala Leu Glu Ser Leu Gly Asn
1 5 10 15
Pro Leu Glu Asp Ala Lys Arg Ala Ala Ala Tyr Arg Ala Val Asp Glu
20 25 30
Asn Leu Lys Phe Asp Asp His Lys Ile Ile Gly Ile Gly Ser Gly Ser
35 40 45
Thr Val Val Tyr Val Ala Glu Arg Ile Gly Gln Tyr Leu His Asp Pro
50 55 60
Lys Phe Tyr Glu Val Ala Ser Lys Phe Ile Cys Ile Pro Thr Gly Phe
65 70 75 80
Gln Ser Arg Asn Leu Ile Leu Asp Asn Lys Leu Gln Leu Gly Ser Ile
85 90 95
Glu Gln Tyr Pro Arg Ile Asp Ile Ala Phe Asp Gly Ala Asp Glu Val
100 105 110
Asp Glu Asn Leu Gln Leu Ile Lys Gly Gly Gly Ala Cys Leu Phe Gln
115 120 125
Glu Lys Leu Val Ser Thr Ser Ala Lys Thr Phe Ile Val Val Ala Asp
130 135 140
Ser Arg Lys Lys Ser Pro Lys His Leu Gly Lys Asn Trp Arg Gln Gly
145 150 155 160
Val Pro Ile Glu Ile Val Pro Ser Ser Tyr Val Arg Val Lys Asn Asp
165 170 175
Leu Leu Glu Gln Leu His Ala Glu Lys Val Asp Ile Arg Gln Gly Gly
180 185 190
Ser Ala Lys Ala Gly Pro Val Val Thr Asp Asn Asn Asn Phe Ile Ile
195 200 205
Asp Ala Asp Phe Gly Glu Ile Ser Asp Pro Arg Lys Leu His Arg Glu
210 215 220
Ile Lys Leu Leu Val Gly Val Val Glu Thr Gly Leu Phe Ile Asp Asn
225 230 235 240
Ala Ser Lys Ala Tyr Phe Gly Asn Ser Asp Gly Ser Val Glu Val Thr
245 250 255
Glu Lys
<210> 19
<211> 2676
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<220>
<221> CDS
<222> (1)..(2676)
<400> 19
atg gca gtt acg aac gtt gca gag ctt aat gcc ctt gtg gag agg gtt 48
Met Ala Val Thr Asn Val Ala Glu Leu Asn Ala Leu Val Glu Arg Val
1 5 10 15
aag aaa gca caa agg gaa tac gcc agc ttc acg caa gaa caa gtt gac 96
Lys Lys Ala Gln Arg Glu Tyr Ala Ser Phe Thr Gln Glu Gln Val Asp
20 25 30
aag att ttc agg gca gcc gcg cta gcg gca gct gat gcg aga ata ccc 144
Lys Ile Phe Arg Ala Ala Ala Leu Ala Ala Ala Asp Ala Arg Ile Pro
35 40 45
ctt gct aag atg gca gtc gca gaa agc gga atg gga att gtg gaa gat 192
Leu Ala Lys Met Ala Val Ala Glu Ser Gly Met Gly Ile Val Glu Asp
50 55 60
aag gtc atc aag aat cat ttc gct tct gag tat atc tac aac gcg tat 240
Lys Val Ile Lys Asn His Phe Ala Ser Glu Tyr Ile Tyr Asn Ala Tyr
65 70 75 80
aag gat gag aag act tgc ggc gtc tta tct gag gac gat aca ttc ggc 288
Lys Asp Glu Lys Thr Cys Gly Val Leu Ser Glu Asp Asp Thr Phe Gly
85 90 95
aca ata act atc gct gaa cct ata ggg atc atc tgt ggt ata gtt cca 336
Thr Ile Thr Ile Ala Glu Pro Ile Gly Ile Ile Cys Gly Ile Val Pro
100 105 110
act acg aac cct acc tca act gct ata ttc aag agt ctt att tct ttg 384
Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ser Leu
115 120 125
aaa acc cgt aac gct atc atc ttt tct cca cat cca aga gcc aag gac 432
Lys Thr Arg Asn Ala Ile Ile Phe Ser Pro His Pro Arg Ala Lys Asp
130 135 140
gcc acc aac aag gcg gcg gac atc gtg ctg cag gcg gcc atc gcg gcg 480
Ala Thr Asn Lys Ala Ala Asp Ile Val Leu Gln Ala Ala Ile Ala Ala
145 150 155 160
ggc gcg cct aag gat cta att ggc tgg atc gac caa cct agc gtt gaa 528
Gly Ala Pro Lys Asp Leu Ile Gly Trp Ile Asp Gln Pro Ser Val Glu
165 170 175
ctt tcc aat gct ctt atg cac cat cct gat att aat ttg ata ctt gcg 576
Leu Ser Asn Ala Leu Met His His Pro Asp Ile Asn Leu Ile Leu Ala
180 185 190
act gga ggc cct ggg atg gta aag gcc gct tat agc agc ggt aag ccc 624
Thr Gly Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro
195 200 205
gcc atc ggt gtt ggc gct gga aat acg ccg gta gtc att gat gag acc 672
Ala Ile Gly Val Gly Ala Gly Asn Thr Pro Val Val Ile Asp Glu Thr
210 215 220
gcc gac att aag agg gca gtc gcg tca gtt ttg atg tct aag act ttt 720
Ala Asp Ile Lys Arg Ala Val Ala Ser Val Leu Met Ser Lys Thr Phe
225 230 235 240
gac aat gga gta atc tgc gcg tca gag cag tcc gtt gtc gtt gtg gat 768
Asp Asn Gly Val Ile Cys Ala Ser Glu Gln Ser Val Val Val Val Asp
245 250 255
tca gta tac gac gcg gtc aga gaa agg ttc gcg act cac gga ggg tac 816
Ser Val Tyr Asp Ala Val Arg Glu Arg Phe Ala Thr His Gly Gly Tyr
260 265 270
cta ctt caa ggg aag gag ttg aaa gcc gtc cag gac gtt atc ctt aag 864
Leu Leu Gln Gly Lys Glu Leu Lys Ala Val Gln Asp Val Ile Leu Lys
275 280 285
aat ggt gca ttg aac gcc gcc att gta ggc caa cca gcg tat aaa ata 912
Asn Gly Ala Leu Asn Ala Ala Ile Val Gly Gln Pro Ala Tyr Lys Ile
290 295 300
gcc gaa ctt gcg ggc ttt agc gta cca gaa aac act aaa ata ctt att 960
Ala Glu Leu Ala Gly Phe Ser Val Pro Glu Asn Thr Lys Ile Leu Ile
305 310 315 320
ggt gaa gtc acc gtg gtt gat gag tca gaa ccc ttc gcc cat gag aaa 1008
Gly Glu Val Thr Val Val Asp Glu Ser Glu Pro Phe Ala His Glu Lys
325 330 335
ctt tcc cca aca ctg gcg atg tac aga gcg aag gac ttt gaa gat gca 1056
Leu Ser Pro Thr Leu Ala Met Tyr Arg Ala Lys Asp Phe Glu Asp Ala
340 345 350
gtg gaa aaa gcc gaa aag ctt gtt gca atg ggc ggt atc ggc cac aca 1104
Val Glu Lys Ala Glu Lys Leu Val Ala Met Gly Gly Ile Gly His Thr
355 360 365
agc tgc ctt tac aca gat cag gat aac caa cca gca aga gtt agt tat 1152
Ser Cys Leu Tyr Thr Asp Gln Asp Asn Gln Pro Ala Arg Val Ser Tyr
370 375 380
ttc ggc cag aaa atg aag acg gct agg ata cta atc aac acc ccg gcc 1200
Phe Gly Gln Lys Met Lys Thr Ala Arg Ile Leu Ile Asn Thr Pro Ala
385 390 395 400
agc caa gga ggc att ggt gac ctg tat aac ttc aag ttg gct cct tct 1248
Ser Gln Gly Gly Ile Gly Asp Leu Tyr Asn Phe Lys Leu Ala Pro Ser
405 410 415
ctt aca ttg ggc tgt ggt tcc tgg ggc gga aac tct atc tct gag aac 1296
Leu Thr Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Ile Ser Glu Asn
420 425 430
gtt gga cct aag cat ctt atc aat aag aaa acc gtc gct aag cgt gct 1344
Val Gly Pro Lys His Leu Ile Asn Lys Lys Thr Val Ala Lys Arg Ala
435 440 445
gag aac atg ctt tgg cac aaa ctg ccg aag tca atc tat ttc cgt agg 1392
Glu Asn Met Leu Trp His Lys Leu Pro Lys Ser Ile Tyr Phe Arg Arg
450 455 460
ggc tcc ctg cct atc gcg tta gat gaa gtc atc acc gat ggg cat aag 1440
Gly Ser Leu Pro Ile Ala Leu Asp Glu Val Ile Thr Asp Gly His Lys
465 470 475 480
aga gca cta atc gtc aca gat agg ttc cta ttc aac aac ggt tac gca 1488
Arg Ala Leu Ile Val Thr Asp Arg Phe Leu Phe Asn Asn Gly Tyr Ala
485 490 495
gac caa atc acg tct gtt ctg aaa gcg gcc ggt gtc gaa aca gag gtg 1536
Asp Gln Ile Thr Ser Val Leu Lys Ala Ala Gly Val Glu Thr Glu Val
500 505 510
ttc ttc gaa gtg gaa gcc gac ccg aca tta agt att gtc agg aaa gga 1584
Phe Phe Glu Val Glu Ala Asp Pro Thr Leu Ser Ile Val Arg Lys Gly
515 520 525
gct gag ctt gcg aat agt ttt aag ccg gat gtc atc atc gct ttg gga 1632
Ala Glu Leu Ala Asn Ser Phe Lys Pro Asp Val Ile Ile Ala Leu Gly
530 535 540
gga gga tcc ccg atg gat gct gct aag atc atg tgg gtc atg tat gaa 1680
Gly Gly Ser Pro Met Asp Ala Ala Lys Ile Met Trp Val Met Tyr Glu
545 550 555 560
cac ccg gag aca cac ttt gaa gag ttg gcg ttg aga ttc atg gat att 1728
His Pro Glu Thr His Phe Glu Glu Leu Ala Leu Arg Phe Met Asp Ile
565 570 575
agg aaa agg atc tat aaa ttc cct aag atg gga gtg aaa gcc aag atg 1776
Arg Lys Arg Ile Tyr Lys Phe Pro Lys Met Gly Val Lys Ala Lys Met
580 585 590
ata gca gtg acg acc acc agt gga acc ggg agt gaa gtg act ccg ttt 1824
Ile Ala Val Thr Thr Thr Ser Gly Thr Gly Ser Glu Val Thr Pro Phe
595 600 605
gcg gtt gtt act gac gac gct acg ggc cag aag tat ccc cta gcc gac 1872
Ala Val Val Thr Asp Asp Ala Thr Gly Gln Lys Tyr Pro Leu Ala Asp
610 615 620
tat gca ttg acg ccg gat atg gcg att gtg gac gcg aac ctg gtt atg 1920
Tyr Ala Leu Thr Pro Asp Met Ala Ile Val Asp Ala Asn Leu Val Met
625 630 635 640
gat atg cca aag tca ctt tgc gca ttc ggt gga ctt gac gca gtt aca 1968
Asp Met Pro Lys Ser Leu Cys Ala Phe Gly Gly Leu Asp Ala Val Thr
645 650 655
cat gca atg gag gca tat gtg tcc gtg ctt gct tca gag ttt agc gat 2016
His Ala Met Glu Ala Tyr Val Ser Val Leu Ala Ser Glu Phe Ser Asp
660 665 670
gga cag gca ctt caa gcc ttg aag tta cta aag gaa tac ctt ccc gca 2064
Gly Gln Ala Leu Gln Ala Leu Lys Leu Leu Lys Glu Tyr Leu Pro Ala
675 680 685
tct tac cat gag ggc agc aag aac cct gtt gcc cgt gag aga gta cat 2112
Ser Tyr His Glu Gly Ser Lys Asn Pro Val Ala Arg Glu Arg Val His
690 695 700
agt gcg gct acc att gcc ggt atc gca ttc gct aat gct ttc ctg ggt 2160
Ser Ala Ala Thr Ile Ala Gly Ile Ala Phe Ala Asn Ala Phe Leu Gly
705 710 715 720
gtg tgt cat tca atg gcc cat aag ctt gga tcc cag ttt cac atc cca 2208
Val Cys His Ser Met Ala His Lys Leu Gly Ser Gln Phe His Ile Pro
725 730 735
cac gga tta gcc aac gct ctt ctt ata tgt aac gtc atc cgt tat aat 2256
His Gly Leu Ala Asn Ala Leu Leu Ile Cys Asn Val Ile Arg Tyr Asn
740 745 750
gcc aac gac aac cct aca aag caa acc gca ttc tca caa tat gac agg 2304
Ala Asn Asp Asn Pro Thr Lys Gln Thr Ala Phe Ser Gln Tyr Asp Arg
755 760 765
ccg cag gcg agg cgt cgt tat gca gaa ata gcc gat cac cta ggc tta 2352
Pro Gln Ala Arg Arg Arg Tyr Ala Glu Ile Ala Asp His Leu Gly Leu
770 775 780
tcc gcg cca gga gat cgt aca gct gct aag att gaa aag ttg ctt gcg 2400
Ser Ala Pro Gly Asp Arg Thr Ala Ala Lys Ile Glu Lys Leu Leu Ala
785 790 795 800
tgg tta gaa acg ctt aag gcc gaa cta ggc att cca aag tct att aga 2448
Trp Leu Glu Thr Leu Lys Ala Glu Leu Gly Ile Pro Lys Ser Ile Arg
805 810 815
gag gcg gga gtt caa gag gcg gat ttc ctt gct aac gtg gac aag ctt 2496
Glu Ala Gly Val Gln Glu Ala Asp Phe Leu Ala Asn Val Asp Lys Leu
820 825 830
agc gaa gat gca ttc gat gat cag tgc act gga gcc aac ccg aga tat 2544
Ser Glu Asp Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr
835 840 845
cct ctg att tca gaa ttg aag caa att tta ctt gat aca tac tat ggg 2592
Pro Leu Ile Ser Glu Leu Lys Gln Ile Leu Leu Asp Thr Tyr Tyr Gly
850 855 860
aga gac tac gtt gag gga gag acc gca gcg aag aag gag gcg gcg ccg 2640
Arg Asp Tyr Val Glu Gly Glu Thr Ala Ala Lys Lys Glu Ala Ala Pro
865 870 875 880
gct aag gcc gag aag aaa gca aag aaa tca gct taa 2676
Ala Lys Ala Glu Lys Lys Ala Lys Lys Ser Ala
885 890
<210> 20
<211> 891
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 20
Met Ala Val Thr Asn Val Ala Glu Leu Asn Ala Leu Val Glu Arg Val
1 5 10 15
Lys Lys Ala Gln Arg Glu Tyr Ala Ser Phe Thr Gln Glu Gln Val Asp
20 25 30
Lys Ile Phe Arg Ala Ala Ala Leu Ala Ala Ala Asp Ala Arg Ile Pro
35 40 45
Leu Ala Lys Met Ala Val Ala Glu Ser Gly Met Gly Ile Val Glu Asp
50 55 60
Lys Val Ile Lys Asn His Phe Ala Ser Glu Tyr Ile Tyr Asn Ala Tyr
65 70 75 80
Lys Asp Glu Lys Thr Cys Gly Val Leu Ser Glu Asp Asp Thr Phe Gly
85 90 95
Thr Ile Thr Ile Ala Glu Pro Ile Gly Ile Ile Cys Gly Ile Val Pro
100 105 110
Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ser Leu
115 120 125
Lys Thr Arg Asn Ala Ile Ile Phe Ser Pro His Pro Arg Ala Lys Asp
130 135 140
Ala Thr Asn Lys Ala Ala Asp Ile Val Leu Gln Ala Ala Ile Ala Ala
145 150 155 160
Gly Ala Pro Lys Asp Leu Ile Gly Trp Ile Asp Gln Pro Ser Val Glu
165 170 175
Leu Ser Asn Ala Leu Met His His Pro Asp Ile Asn Leu Ile Leu Ala
180 185 190
Thr Gly Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro
195 200 205
Ala Ile Gly Val Gly Ala Gly Asn Thr Pro Val Val Ile Asp Glu Thr
210 215 220
Ala Asp Ile Lys Arg Ala Val Ala Ser Val Leu Met Ser Lys Thr Phe
225 230 235 240
Asp Asn Gly Val Ile Cys Ala Ser Glu Gln Ser Val Val Val Val Asp
245 250 255
Ser Val Tyr Asp Ala Val Arg Glu Arg Phe Ala Thr His Gly Gly Tyr
260 265 270
Leu Leu Gln Gly Lys Glu Leu Lys Ala Val Gln Asp Val Ile Leu Lys
275 280 285
Asn Gly Ala Leu Asn Ala Ala Ile Val Gly Gln Pro Ala Tyr Lys Ile
290 295 300
Ala Glu Leu Ala Gly Phe Ser Val Pro Glu Asn Thr Lys Ile Leu Ile
305 310 315 320
Gly Glu Val Thr Val Val Asp Glu Ser Glu Pro Phe Ala His Glu Lys
325 330 335
Leu Ser Pro Thr Leu Ala Met Tyr Arg Ala Lys Asp Phe Glu Asp Ala
340 345 350
Val Glu Lys Ala Glu Lys Leu Val Ala Met Gly Gly Ile Gly His Thr
355 360 365
Ser Cys Leu Tyr Thr Asp Gln Asp Asn Gln Pro Ala Arg Val Ser Tyr
370 375 380
Phe Gly Gln Lys Met Lys Thr Ala Arg Ile Leu Ile Asn Thr Pro Ala
385 390 395 400
Ser Gln Gly Gly Ile Gly Asp Leu Tyr Asn Phe Lys Leu Ala Pro Ser
405 410 415
Leu Thr Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Ile Ser Glu Asn
420 425 430
Val Gly Pro Lys His Leu Ile Asn Lys Lys Thr Val Ala Lys Arg Ala
435 440 445
Glu Asn Met Leu Trp His Lys Leu Pro Lys Ser Ile Tyr Phe Arg Arg
450 455 460
Gly Ser Leu Pro Ile Ala Leu Asp Glu Val Ile Thr Asp Gly His Lys
465 470 475 480
Arg Ala Leu Ile Val Thr Asp Arg Phe Leu Phe Asn Asn Gly Tyr Ala
485 490 495
Asp Gln Ile Thr Ser Val Leu Lys Ala Ala Gly Val Glu Thr Glu Val
500 505 510
Phe Phe Glu Val Glu Ala Asp Pro Thr Leu Ser Ile Val Arg Lys Gly
515 520 525
Ala Glu Leu Ala Asn Ser Phe Lys Pro Asp Val Ile Ile Ala Leu Gly
530 535 540
Gly Gly Ser Pro Met Asp Ala Ala Lys Ile Met Trp Val Met Tyr Glu
545 550 555 560
His Pro Glu Thr His Phe Glu Glu Leu Ala Leu Arg Phe Met Asp Ile
565 570 575
Arg Lys Arg Ile Tyr Lys Phe Pro Lys Met Gly Val Lys Ala Lys Met
580 585 590
Ile Ala Val Thr Thr Thr Ser Gly Thr Gly Ser Glu Val Thr Pro Phe
595 600 605
Ala Val Val Thr Asp Asp Ala Thr Gly Gln Lys Tyr Pro Leu Ala Asp
610 615 620
Tyr Ala Leu Thr Pro Asp Met Ala Ile Val Asp Ala Asn Leu Val Met
625 630 635 640
Asp Met Pro Lys Ser Leu Cys Ala Phe Gly Gly Leu Asp Ala Val Thr
645 650 655
His Ala Met Glu Ala Tyr Val Ser Val Leu Ala Ser Glu Phe Ser Asp
660 665 670
Gly Gln Ala Leu Gln Ala Leu Lys Leu Leu Lys Glu Tyr Leu Pro Ala
675 680 685
Ser Tyr His Glu Gly Ser Lys Asn Pro Val Ala Arg Glu Arg Val His
690 695 700
Ser Ala Ala Thr Ile Ala Gly Ile Ala Phe Ala Asn Ala Phe Leu Gly
705 710 715 720
Val Cys His Ser Met Ala His Lys Leu Gly Ser Gln Phe His Ile Pro
725 730 735
His Gly Leu Ala Asn Ala Leu Leu Ile Cys Asn Val Ile Arg Tyr Asn
740 745 750
Ala Asn Asp Asn Pro Thr Lys Gln Thr Ala Phe Ser Gln Tyr Asp Arg
755 760 765
Pro Gln Ala Arg Arg Arg Tyr Ala Glu Ile Ala Asp His Leu Gly Leu
770 775 780
Ser Ala Pro Gly Asp Arg Thr Ala Ala Lys Ile Glu Lys Leu Leu Ala
785 790 795 800
Trp Leu Glu Thr Leu Lys Ala Glu Leu Gly Ile Pro Lys Ser Ile Arg
805 810 815
Glu Ala Gly Val Gln Glu Ala Asp Phe Leu Ala Asn Val Asp Lys Leu
820 825 830
Ser Glu Asp Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr
835 840 845
Pro Leu Ile Ser Glu Leu Lys Gln Ile Leu Leu Asp Thr Tyr Tyr Gly
850 855 860
Arg Asp Tyr Val Glu Gly Glu Thr Ala Ala Lys Lys Glu Ala Ala Pro
865 870 875 880
Ala Lys Ala Glu Lys Lys Ala Lys Lys Ser Ala
885 890
<210> 21
<211> 1407
<212> DNA
<213> 拜氏梭菌(Clostridium beijerinckii)
<220>
<221> CDS
<222> (1)..(1407)
<400> 21
atg aat aag gat acc ttg att cca act aca aag gat ttg aag gtt aag 48
Met Asn Lys Asp Thr Leu Ile Pro Thr Thr Lys Asp Leu Lys Val Lys
1 5 10 15
aca aac ggt gaa aac atc aac ttg aaa aat tac aag gat aac tct tca 96
Thr Asn Gly Glu Asn Ile Asn Leu Lys Asn Tyr Lys Asp Asn Ser Ser
20 25 30
tgt ttc ggt gtt ttc gaa aac gtt gaa aac gct att tcc agt gct gtt 144
Cys Phe Gly Val Phe Glu Asn Val Glu Asn Ala Ile Ser Ser Ala Val
35 40 45
cat gca caa aag att ttg tca ttg cat tac act aaa gaa caa aga gaa 192
His Ala Gln Lys Ile Leu Ser Leu His Tyr Thr Lys Glu Gln Arg Glu
50 55 60
aag att atc aca gaa atc aga aag gct gca ttg caa aat aag gaa gtt 240
Lys Ile Ile Thr Glu Ile Arg Lys Ala Ala Leu Gln Asn Lys Glu Val
65 70 75 80
ttg gca acc atg atc ttg gaa gaa act cat atg ggt aga tac gaa gat 288
Leu Ala Thr Met Ile Leu Glu Glu Thr His Met Gly Arg Tyr Glu Asp
85 90 95
aag atc ttg aag cat gaa tta gtt gct aaa tac aca cca ggt act gaa 336
Lys Ile Leu Lys His Glu Leu Val Ala Lys Tyr Thr Pro Gly Thr Glu
100 105 110
gat ttg acc act aca gca tgg tcc ggt gac aat ggt tta act gtt gtt 384
Asp Leu Thr Thr Thr Ala Trp Ser Gly Asp Asn Gly Leu Thr Val Val
115 120 125
gaa atg agt cct tat ggt gtt att ggt gct att act cca tct aca aat 432
Glu Met Ser Pro Tyr Gly Val Ile Gly Ala Ile Thr Pro Ser Thr Asn
130 135 140
cct acc gaa act gtt att tgt aac tca atc ggt atg att gct gct ggt 480
Pro Thr Glu Thr Val Ile Cys Asn Ser Ile Gly Met Ile Ala Ala Gly
145 150 155 160
aat gct gtt gtt ttt aac ggt cat cca tgt gct aag aaa tgt gtt gct 528
Asn Ala Val Val Phe Asn Gly His Pro Cys Ala Lys Lys Cys Val Ala
165 170 175
ttc gca gtt gaa atg atc aat aag gct atc atc tct tgt ggt ggt cca 576
Phe Ala Val Glu Met Ile Asn Lys Ala Ile Ile Ser Cys Gly Gly Pro
180 185 190
gaa aat ttg gtt acc act att aag aac cct act atg gaa tct ttg gat 624
Glu Asn Leu Val Thr Thr Ile Lys Asn Pro Thr Met Glu Ser Leu Asp
195 200 205
gca att att aag cat cca tca att aag ttg tta tgt ggt aca ggt ggt 672
Ala Ile Ile Lys His Pro Ser Ile Lys Leu Leu Cys Gly Thr Gly Gly
210 215 220
cct ggt atg gtt aag acc ttg ttg aac tct ggt aaa aag gct att ggt 720
Pro Gly Met Val Lys Thr Leu Leu Asn Ser Gly Lys Lys Ala Ile Gly
225 230 235 240
gct ggt gca ggt aac cca cct gtt att gtt gat gat aca gct gat atc 768
Ala Gly Ala Gly Asn Pro Pro Val Ile Val Asp Asp Thr Ala Asp Ile
245 250 255
gaa aag gca ggt aga tct atc atc gaa ggt tgt tca ttc gat aac aac 816
Glu Lys Ala Gly Arg Ser Ile Ile Glu Gly Cys Ser Phe Asp Asn Asn
260 265 270
ttg cct tgt atc gct gaa aag gaa gtt ttc gtt ttc gaa aac gtt gca 864
Leu Pro Cys Ile Ala Glu Lys Glu Val Phe Val Phe Glu Asn Val Ala
275 280 285
gat gat ttg atc tcc aac atg tta aag aat aac gct gtt att att aat 912
Asp Asp Leu Ile Ser Asn Met Leu Lys Asn Asn Ala Val Ile Ile Asn
290 295 300
gaa gat caa gtt tca aag tta att gat ttg gtt tta caa aag aat aac 960
Glu Asp Gln Val Ser Lys Leu Ile Asp Leu Val Leu Gln Lys Asn Asn
305 310 315 320
gaa act caa gaa tac ttc atc aat aag aaa tgg gtt ggt aaa gat gct 1008
Glu Thr Gln Glu Tyr Phe Ile Asn Lys Lys Trp Val Gly Lys Asp Ala
325 330 335
aag ttg ttc tta gat gaa atc gat gtt gaa tct cca tca aac gtt aag 1056
Lys Leu Phe Leu Asp Glu Ile Asp Val Glu Ser Pro Ser Asn Val Lys
340 345 350
tgt atc atc tgt gaa gtt aat gca aac cat cct ttt gtt atg aca gaa 1104
Cys Ile Ile Cys Glu Val Asn Ala Asn His Pro Phe Val Met Thr Glu
355 360 365
ttg atg atg cca atc ttg cct atc gtt aga gtt aag gat att gat gaa 1152
Leu Met Met Pro Ile Leu Pro Ile Val Arg Val Lys Asp Ile Asp Glu
370 375 380
gct att aaa tac gct aag atc gca gaa caa aac aga aag cat tcc gca 1200
Ala Ile Lys Tyr Ala Lys Ile Ala Glu Gln Asn Arg Lys His Ser Ala
385 390 395 400
tac atc tat agt aag aac atc gat aat ttg aac aga ttt gaa aga gaa 1248
Tyr Ile Tyr Ser Lys Asn Ile Asp Asn Leu Asn Arg Phe Glu Arg Glu
405 410 415
atc gat aca acc atc ttc gtt aaa aat gct aag tct ttc gca ggt gtt 1296
Ile Asp Thr Thr Ile Phe Val Lys Asn Ala Lys Ser Phe Ala Gly Val
420 425 430
ggt tac gaa gct gaa ggt ttt act aca ttc acc att gca ggt tcc act 1344
Gly Tyr Glu Ala Glu Gly Phe Thr Thr Phe Thr Ile Ala Gly Ser Thr
435 440 445
ggt gaa ggt att aca agt gct aga aac ttc act aga caa aga aga tgt 1392
Gly Glu Gly Ile Thr Ser Ala Arg Asn Phe Thr Arg Gln Arg Arg Cys
450 455 460
gtt tta gca ggt taa 1407
Val Leu Ala Gly
465
<210> 22
<211> 468
<212> PRT
<213> 拜氏梭菌(Clostridium beijerinckii)
<400> 22
Met Asn Lys Asp Thr Leu Ile Pro Thr Thr Lys Asp Leu Lys Val Lys
1 5 10 15
Thr Asn Gly Glu Asn Ile Asn Leu Lys Asn Tyr Lys Asp Asn Ser Ser
20 25 30
Cys Phe Gly Val Phe Glu Asn Val Glu Asn Ala Ile Ser Ser Ala Val
35 40 45
His Ala Gln Lys Ile Leu Ser Leu His Tyr Thr Lys Glu Gln Arg Glu
50 55 60
Lys Ile Ile Thr Glu Ile Arg Lys Ala Ala Leu Gln Asn Lys Glu Val
65 70 75 80
Leu Ala Thr Met Ile Leu Glu Glu Thr His Met Gly Arg Tyr Glu Asp
85 90 95
Lys Ile Leu Lys His Glu Leu Val Ala Lys Tyr Thr Pro Gly Thr Glu
100 105 110
Asp Leu Thr Thr Thr Ala Trp Ser Gly Asp Asn Gly Leu Thr Val Val
115 120 125
Glu Met Ser Pro Tyr Gly Val Ile Gly Ala Ile Thr Pro Ser Thr Asn
130 135 140
Pro Thr Glu Thr Val Ile Cys Asn Ser Ile Gly Met Ile Ala Ala Gly
145 150 155 160
Asn Ala Val Val Phe Asn Gly His Pro Cys Ala Lys Lys Cys Val Ala
165 170 175
Phe Ala Val Glu Met Ile Asn Lys Ala Ile Ile Ser Cys Gly Gly Pro
180 185 190
Glu Asn Leu Val Thr Thr Ile Lys Asn Pro Thr Met Glu Ser Leu Asp
195 200 205
Ala Ile Ile Lys His Pro Ser Ile Lys Leu Leu Cys Gly Thr Gly Gly
210 215 220
Pro Gly Met Val Lys Thr Leu Leu Asn Ser Gly Lys Lys Ala Ile Gly
225 230 235 240
Ala Gly Ala Gly Asn Pro Pro Val Ile Val Asp Asp Thr Ala Asp Ile
245 250 255
Glu Lys Ala Gly Arg Ser Ile Ile Glu Gly Cys Ser Phe Asp Asn Asn
260 265 270
Leu Pro Cys Ile Ala Glu Lys Glu Val Phe Val Phe Glu Asn Val Ala
275 280 285
Asp Asp Leu Ile Ser Asn Met Leu Lys Asn Asn Ala Val Ile Ile Asn
290 295 300
Glu Asp Gln Val Ser Lys Leu Ile Asp Leu Val Leu Gln Lys Asn Asn
305 310 315 320
Glu Thr Gln Glu Tyr Phe Ile Asn Lys Lys Trp Val Gly Lys Asp Ala
325 330 335
Lys Leu Phe Leu Asp Glu Ile Asp Val Glu Ser Pro Ser Asn Val Lys
340 345 350
Cys Ile Ile Cys Glu Val Asn Ala Asn His Pro Phe Val Met Thr Glu
355 360 365
Leu Met Met Pro Ile Leu Pro Ile Val Arg Val Lys Asp Ile Asp Glu
370 375 380
Ala Ile Lys Tyr Ala Lys Ile Ala Glu Gln Asn Arg Lys His Ser Ala
385 390 395 400
Tyr Ile Tyr Ser Lys Asn Ile Asp Asn Leu Asn Arg Phe Glu Arg Glu
405 410 415
Ile Asp Thr Thr Ile Phe Val Lys Asn Ala Lys Ser Phe Ala Gly Val
420 425 430
Gly Tyr Glu Ala Glu Gly Phe Thr Thr Phe Thr Ile Ala Gly Ser Thr
435 440 445
Gly Glu Gly Ile Thr Ser Ala Arg Asn Phe Thr Arg Gln Arg Arg Cys
450 455 460
Val Leu Ala Gly
465
<210> 23
<211> 2865
<212> DNA
<213> 莱茵衣藻(Chlamydomonas reinhardtii)
<220>
<221> CDS
<222> (1)..(2865)
<400> 23
atg atg agt tcc tct ctg gtt agt ggt aag aga gtt gcc gta cct tct 48
Met Met Ser Ser Ser Leu Val Ser Gly Lys Arg Val Ala Val Pro Ser
1 5 10 15
gcc gca aaa ccg tgc gct gca gtt cct cta cct agg gtt gct ggc aga 96
Ala Ala Lys Pro Cys Ala Ala Val Pro Leu Pro Arg Val Ala Gly Arg
20 25 30
aga acg gct gca aga gtt gtg tgc gaa gca gcg cct tca ggt gct gcc 144
Arg Thr Ala Ala Arg Val Val Cys Glu Ala Ala Pro Ser Gly Ala Ala
35 40 45
cct gcc tct cca aaa gca gag gca gct gct cct gtt gca gct gct ccc 192
Pro Ala Ser Pro Lys Ala Glu Ala Ala Ala Pro Val Ala Ala Ala Pro
50 55 60
gct aca cct cat gca gaa gtt aag aaa gaa aga gcg cca gca act gac 240
Ala Thr Pro His Ala Glu Val Lys Lys Glu Arg Ala Pro Ala Thr Asp
65 70 75 80
gaa gcg ttg aca gag tta aag gct ttg cta aag aga gca caa act gca 288
Glu Ala Leu Thr Glu Leu Lys Ala Leu Leu Lys Arg Ala Gln Thr Ala
85 90 95
caa gct cag tat tct aca tac act caa gaa cag gtc gac gaa atc ttt 336
Gln Ala Gln Tyr Ser Thr Tyr Thr Gln Glu Gln Val Asp Glu Ile Phe
100 105 110
cgt gca gcc gct gaa gca gct aac gca gcc aga ata cct ttg gcc aaa 384
Arg Ala Ala Ala Glu Ala Ala Asn Ala Ala Arg Ile Pro Leu Ala Lys
115 120 125
atg gca gtg gaa gaa act cgt atg gga gtg gct gaa gat aaa gtt gtt 432
Met Ala Val Glu Glu Thr Arg Met Gly Val Ala Glu Asp Lys Val Val
130 135 140
aag aat cac ttt gcg tca gaa ttc att tat aat aag tat aaa cac acg 480
Lys Asn His Phe Ala Ser Glu Phe Ile Tyr Asn Lys Tyr Lys His Thr
145 150 155 160
aaa act tgt ggt gtg atc gag cat gac cca gca gga ggg att cag aaa 528
Lys Thr Cys Gly Val Ile Glu His Asp Pro Ala Gly Gly Ile Gln Lys
165 170 175
gtg gct gag cct gta ggt gta ata gct ggt atc gtt cca act act aat 576
Val Ala Glu Pro Val Gly Val Ile Ala Gly Ile Val Pro Thr Thr Asn
180 185 190
ccc act tct acg gca ata ttt aag tcc cta cta agc tta aag acc aga 624
Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Leu Ser Leu Lys Thr Arg
195 200 205
aac gcc ttg gtt ttg tgt ccc cac cct agg gca gct aaa agt aca atc 672
Asn Ala Leu Val Leu Cys Pro His Pro Arg Ala Ala Lys Ser Thr Ile
210 215 220
gca gct gcc aga ata gtg aga gat gcg gct gtc gca gct gga gct cct 720
Ala Ala Ala Arg Ile Val Arg Asp Ala Ala Val Ala Ala Gly Ala Pro
225 230 235 240
ccc aat ata att tct tgg gtg gaa aca ccg tcg ttg cca gta agt caa 768
Pro Asn Ile Ile Ser Trp Val Glu Thr Pro Ser Leu Pro Val Ser Gln
245 250 255
gcc ttg atg caa gct act gaa att aac ttg att tta gct act ggt ggt 816
Ala Leu Met Gln Ala Thr Glu Ile Asn Leu Ile Leu Ala Thr Gly Gly
260 265 270
ccc gcc atg gta aga gct gca tac agt tca ggg aat cct tct ttg gga 864
Pro Ala Met Val Arg Ala Ala Tyr Ser Ser Gly Asn Pro Ser Leu Gly
275 280 285
gtt ggc gct ggg aat aca ccg gct ttg ata gat gaa acc gca gat gta 912
Val Gly Ala Gly Asn Thr Pro Ala Leu Ile Asp Glu Thr Ala Asp Val
290 295 300
gcc atg gca gtc tct tcc att cta ctg agc aaa acc ttc gac aat ggt 960
Ala Met Ala Val Ser Ser Ile Leu Leu Ser Lys Thr Phe Asp Asn Gly
305 310 315 320
gtg att tgt gct tca gaa caa agc gtt gtg gtc gtt gcc aag gct tat 1008
Val Ile Cys Ala Ser Glu Gln Ser Val Val Val Val Ala Lys Ala Tyr
325 330 335
gac gct gtc agg aca gaa ttc gtt agg aga ggt gcc tac ttt cta acg 1056
Asp Ala Val Arg Thr Glu Phe Val Arg Arg Gly Ala Tyr Phe Leu Thr
340 345 350
gaa gat gat aag gta aag gtt aga gcg gga gtc gta gta gat ggc aaa 1104
Glu Asp Asp Lys Val Lys Val Arg Ala Gly Val Val Val Asp Gly Lys
355 360 365
ttg aat cca aat ata gtt ggc cag tca att cca aaa ctt gcc gcc cta 1152
Leu Asn Pro Asn Ile Val Gly Gln Ser Ile Pro Lys Leu Ala Ala Leu
370 375 380
ttt ggc ata aag gtt cct caa ggc aca aaa gtt ttg att ggt gag gta 1200
Phe Gly Ile Lys Val Pro Gln Gly Thr Lys Val Leu Ile Gly Glu Val
385 390 395 400
gag aaa atc ggt cca gaa gaa gct tta tct caa gag aaa ctt tgc ccc 1248
Glu Lys Ile Gly Pro Glu Glu Ala Leu Ser Gln Glu Lys Leu Cys Pro
405 410 415
ata ctg gca atg tat aga gct cca gat tac gac cat ggt gtc aaa atg 1296
Ile Leu Ala Met Tyr Arg Ala Pro Asp Tyr Asp His Gly Val Lys Met
420 425 430
gct tgt gag tta ata atg tac ggt ggt gcc gga cat acg agc gtt ttg 1344
Ala Cys Glu Leu Ile Met Tyr Gly Gly Ala Gly His Thr Ser Val Leu
435 440 445
tac acc aat ccc ctt aat aat gcg cac att cag cag tat caa tca gct 1392
Tyr Thr Asn Pro Leu Asn Asn Ala His Ile Gln Gln Tyr Gln Ser Ala
450 455 460
gtc aaa aca gtg aga atc ttg att aac act cca gct agt caa ggt gca 1440
Val Lys Thr Val Arg Ile Leu Ile Asn Thr Pro Ala Ser Gln Gly Ala
465 470 475 480
atc ggt gat ctt tac aat ttt cat ctt gat ccc tct ctt aca ttg ggc 1488
Ile Gly Asp Leu Tyr Asn Phe His Leu Asp Pro Ser Leu Thr Leu Gly
485 490 495
tgt ggt aca tgg ggc tca act tct gta tca acg aat gtt ggt ccc caa 1536
Cys Gly Thr Trp Gly Ser Thr Ser Val Ser Thr Asn Val Gly Pro Gln
500 505 510
cat ctt ttg aac atc aaa acc gta aca gcc agg agg gaa aac atg tta 1584
His Leu Leu Asn Ile Lys Thr Val Thr Ala Arg Arg Glu Asn Met Leu
515 520 525
tgg ttt cgt gtt cca cct aaa atc tac ttt aaa ggt gga tgt tta gaa 1632
Trp Phe Arg Val Pro Pro Lys Ile Tyr Phe Lys Gly Gly Cys Leu Glu
530 535 540
gtt gcg tta acc gac ctg aga ggg aaa agc aga gct ttc att gta act 1680
Val Ala Leu Thr Asp Leu Arg Gly Lys Ser Arg Ala Phe Ile Val Thr
545 550 555 560
gat aag cca tta ttt gac atg ggt tat gct gat aaa gtc acg cac att 1728
Asp Lys Pro Leu Phe Asp Met Gly Tyr Ala Asp Lys Val Thr His Ile
565 570 575
tta gac tcc att aac gtg cat cat caa gtt ttc tac cac gtt aca ccg 1776
Leu Asp Ser Ile Asn Val His His Gln Val Phe Tyr His Val Thr Pro
580 585 590
gac cct aca ttg gct tgc att gaa gcg ggg tta aaa gag atc ttg gaa 1824
Asp Pro Thr Leu Ala Cys Ile Glu Ala Gly Leu Lys Glu Ile Leu Glu
595 600 605
ttt aaa cca gac gtt ata atc gct tta ggc gga ggg agt cca atg gat 1872
Phe Lys Pro Asp Val Ile Ile Ala Leu Gly Gly Gly Ser Pro Met Asp
610 615 620
gcc gct aag atc atg tgg cta atg tac gaa tgc cca gac acc aga ttt 1920
Ala Ala Lys Ile Met Trp Leu Met Tyr Glu Cys Pro Asp Thr Arg Phe
625 630 635 640
gat gga cta gcc atg agg ttc atg gat att aga aag aga gtt tat gag 1968
Asp Gly Leu Ala Met Arg Phe Met Asp Ile Arg Lys Arg Val Tyr Glu
645 650 655
gtt ccg gaa ttg ggc aag aaa gca acc atg gtg tgc att cct aca acc 2016
Val Pro Glu Leu Gly Lys Lys Ala Thr Met Val Cys Ile Pro Thr Thr
660 665 670
tct ggt aca ggt tcc gaa gtc act cca ttt tct gta gtt aca gat gaa 2064
Ser Gly Thr Gly Ser Glu Val Thr Pro Phe Ser Val Val Thr Asp Glu
675 680 685
cgt ttg ggc gca aaa tat cca ctg gca gat tat gca ttg act ccg tcg 2112
Arg Leu Gly Ala Lys Tyr Pro Leu Ala Asp Tyr Ala Leu Thr Pro Ser
690 695 700
atg gcc att gtc gat cca caa ttg gtg tta aac atg cct aag aaa tta 2160
Met Ala Ile Val Asp Pro Gln Leu Val Leu Asn Met Pro Lys Lys Leu
705 710 715 720
acc gcc tgg ggt ggt att gac gca ttg act cat gcg ctt gag tcg tat 2208
Thr Ala Trp Gly Gly Ile Asp Ala Leu Thr His Ala Leu Glu Ser Tyr
725 730 735
gtc tcc att tgt gcg act gat tat acg aag gga tta tca aga gaa gcc 2256
Val Ser Ile Cys Ala Thr Asp Tyr Thr Lys Gly Leu Ser Arg Glu Ala
740 745 750
ata tca cta ctg ttc aag tac ttg ccc agg gca tat gca aac ggg tcg 2304
Ile Ser Leu Leu Phe Lys Tyr Leu Pro Arg Ala Tyr Ala Asn Gly Ser
755 760 765
aat gat tac tta gct aga gaa aag gtt cac tat gcc gcg act att gct 2352
Asn Asp Tyr Leu Ala Arg Glu Lys Val His Tyr Ala Ala Thr Ile Ala
770 775 780
ggt atg gcc ttt gca aac gca ttc tta gga ata tgt cac tcc atg gcg 2400
Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Ile Cys His Ser Met Ala
785 790 795 800
cat aaa tta ggc gct gct tat cat gtg cct cat ggt cta gct aat gca 2448
His Lys Leu Gly Ala Ala Tyr His Val Pro His Gly Leu Ala Asn Ala
805 810 815
gcc tta att tcg cat gtc ata cgt tat aac gct acc gat atg cca gcg 2496
Ala Leu Ile Ser His Val Ile Arg Tyr Asn Ala Thr Asp Met Pro Ala
820 825 830
aaa caa gcc gct ttc cca caa tac gag tat cct act gca aag caa gac 2544
Lys Gln Ala Ala Phe Pro Gln Tyr Glu Tyr Pro Thr Ala Lys Gln Asp
835 840 845
tat gct gat ttg gcc aac atg ctg ggt ctt gga ggt aat acc gta gat 2592
Tyr Ala Asp Leu Ala Asn Met Leu Gly Leu Gly Gly Asn Thr Val Asp
850 855 860
gag aag gtc atc aag ttg att gaa gca gta gaa gaa ctg aaa gct aaa 2640
Glu Lys Val Ile Lys Leu Ile Glu Ala Val Glu Glu Leu Lys Ala Lys
865 870 875 880
gtt gac ata cca ccg act att aag gaa atc ttt aac gat cca aag gtg 2688
Val Asp Ile Pro Pro Thr Ile Lys Glu Ile Phe Asn Asp Pro Lys Val
885 890 895
gat gcg gat ttt ctg gct aat gtc gac gct ttg gcc gag gat gca ttt 2736
Asp Ala Asp Phe Leu Ala Asn Val Asp Ala Leu Ala Glu Asp Ala Phe
900 905 910
gat gat caa tgt aca gga gca aat cca aga tac cca ttg atg gct gat 2784
Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr Pro Leu Met Ala Asp
915 920 925
tta aaa cag tta tac ctt gat gcc cat gct gcc cca att tta cca gtc 2832
Leu Lys Gln Leu Tyr Leu Asp Ala His Ala Ala Pro Ile Leu Pro Val
930 935 940
aaa acc ctt gaa ttc ttc tcc aaa atc aac taa 2865
Lys Thr Leu Glu Phe Phe Ser Lys Ile Asn
945 950
<210> 24
<211> 954
<212> PRT
<213> 莱茵衣藻(Chlamydomonas reinhardtii)
<400> 24
Met Met Ser Ser Ser Leu Val Ser Gly Lys Arg Val Ala Val Pro Ser
1 5 10 15
Ala Ala Lys Pro Cys Ala Ala Val Pro Leu Pro Arg Val Ala Gly Arg
20 25 30
Arg Thr Ala Ala Arg Val Val Cys Glu Ala Ala Pro Ser Gly Ala Ala
35 40 45
Pro Ala Ser Pro Lys Ala Glu Ala Ala Ala Pro Val Ala Ala Ala Pro
50 55 60
Ala Thr Pro His Ala Glu Val Lys Lys Glu Arg Ala Pro Ala Thr Asp
65 70 75 80
Glu Ala Leu Thr Glu Leu Lys Ala Leu Leu Lys Arg Ala Gln Thr Ala
85 90 95
Gln Ala Gln Tyr Ser Thr Tyr Thr Gln Glu Gln Val Asp Glu Ile Phe
100 105 110
Arg Ala Ala Ala Glu Ala Ala Asn Ala Ala Arg Ile Pro Leu Ala Lys
115 120 125
Met Ala Val Glu Glu Thr Arg Met Gly Val Ala Glu Asp Lys Val Val
130 135 140
Lys Asn His Phe Ala Ser Glu Phe Ile Tyr Asn Lys Tyr Lys His Thr
145 150 155 160
Lys Thr Cys Gly Val Ile Glu His Asp Pro Ala Gly Gly Ile Gln Lys
165 170 175
Val Ala Glu Pro Val Gly Val Ile Ala Gly Ile Val Pro Thr Thr Asn
180 185 190
Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Leu Ser Leu Lys Thr Arg
195 200 205
Asn Ala Leu Val Leu Cys Pro His Pro Arg Ala Ala Lys Ser Thr Ile
210 215 220
Ala Ala Ala Arg Ile Val Arg Asp Ala Ala Val Ala Ala Gly Ala Pro
225 230 235 240
Pro Asn Ile Ile Ser Trp Val Glu Thr Pro Ser Leu Pro Val Ser Gln
245 250 255
Ala Leu Met Gln Ala Thr Glu Ile Asn Leu Ile Leu Ala Thr Gly Gly
260 265 270
Pro Ala Met Val Arg Ala Ala Tyr Ser Ser Gly Asn Pro Ser Leu Gly
275 280 285
Val Gly Ala Gly Asn Thr Pro Ala Leu Ile Asp Glu Thr Ala Asp Val
290 295 300
Ala Met Ala Val Ser Ser Ile Leu Leu Ser Lys Thr Phe Asp Asn Gly
305 310 315 320
Val Ile Cys Ala Ser Glu Gln Ser Val Val Val Val Ala Lys Ala Tyr
325 330 335
Asp Ala Val Arg Thr Glu Phe Val Arg Arg Gly Ala Tyr Phe Leu Thr
340 345 350
Glu Asp Asp Lys Val Lys Val Arg Ala Gly Val Val Val Asp Gly Lys
355 360 365
Leu Asn Pro Asn Ile Val Gly Gln Ser Ile Pro Lys Leu Ala Ala Leu
370 375 380
Phe Gly Ile Lys Val Pro Gln Gly Thr Lys Val Leu Ile Gly Glu Val
385 390 395 400
Glu Lys Ile Gly Pro Glu Glu Ala Leu Ser Gln Glu Lys Leu Cys Pro
405 410 415
Ile Leu Ala Met Tyr Arg Ala Pro Asp Tyr Asp His Gly Val Lys Met
420 425 430
Ala Cys Glu Leu Ile Met Tyr Gly Gly Ala Gly His Thr Ser Val Leu
435 440 445
Tyr Thr Asn Pro Leu Asn Asn Ala His Ile Gln Gln Tyr Gln Ser Ala
450 455 460
Val Lys Thr Val Arg Ile Leu Ile Asn Thr Pro Ala Ser Gln Gly Ala
465 470 475 480
Ile Gly Asp Leu Tyr Asn Phe His Leu Asp Pro Ser Leu Thr Leu Gly
485 490 495
Cys Gly Thr Trp Gly Ser Thr Ser Val Ser Thr Asn Val Gly Pro Gln
500 505 510
His Leu Leu Asn Ile Lys Thr Val Thr Ala Arg Arg Glu Asn Met Leu
515 520 525
Trp Phe Arg Val Pro Pro Lys Ile Tyr Phe Lys Gly Gly Cys Leu Glu
530 535 540
Val Ala Leu Thr Asp Leu Arg Gly Lys Ser Arg Ala Phe Ile Val Thr
545 550 555 560
Asp Lys Pro Leu Phe Asp Met Gly Tyr Ala Asp Lys Val Thr His Ile
565 570 575
Leu Asp Ser Ile Asn Val His His Gln Val Phe Tyr His Val Thr Pro
580 585 590
Asp Pro Thr Leu Ala Cys Ile Glu Ala Gly Leu Lys Glu Ile Leu Glu
595 600 605
Phe Lys Pro Asp Val Ile Ile Ala Leu Gly Gly Gly Ser Pro Met Asp
610 615 620
Ala Ala Lys Ile Met Trp Leu Met Tyr Glu Cys Pro Asp Thr Arg Phe
625 630 635 640
Asp Gly Leu Ala Met Arg Phe Met Asp Ile Arg Lys Arg Val Tyr Glu
645 650 655
Val Pro Glu Leu Gly Lys Lys Ala Thr Met Val Cys Ile Pro Thr Thr
660 665 670
Ser Gly Thr Gly Ser Glu Val Thr Pro Phe Ser Val Val Thr Asp Glu
675 680 685
Arg Leu Gly Ala Lys Tyr Pro Leu Ala Asp Tyr Ala Leu Thr Pro Ser
690 695 700
Met Ala Ile Val Asp Pro Gln Leu Val Leu Asn Met Pro Lys Lys Leu
705 710 715 720
Thr Ala Trp Gly Gly Ile Asp Ala Leu Thr His Ala Leu Glu Ser Tyr
725 730 735
Val Ser Ile Cys Ala Thr Asp Tyr Thr Lys Gly Leu Ser Arg Glu Ala
740 745 750
Ile Ser Leu Leu Phe Lys Tyr Leu Pro Arg Ala Tyr Ala Asn Gly Ser
755 760 765
Asn Asp Tyr Leu Ala Arg Glu Lys Val His Tyr Ala Ala Thr Ile Ala
770 775 780
Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Ile Cys His Ser Met Ala
785 790 795 800
His Lys Leu Gly Ala Ala Tyr His Val Pro His Gly Leu Ala Asn Ala
805 810 815
Ala Leu Ile Ser His Val Ile Arg Tyr Asn Ala Thr Asp Met Pro Ala
820 825 830
Lys Gln Ala Ala Phe Pro Gln Tyr Glu Tyr Pro Thr Ala Lys Gln Asp
835 840 845
Tyr Ala Asp Leu Ala Asn Met Leu Gly Leu Gly Gly Asn Thr Val Asp
850 855 860
Glu Lys Val Ile Lys Leu Ile Glu Ala Val Glu Glu Leu Lys Ala Lys
865 870 875 880
Val Asp Ile Pro Pro Thr Ile Lys Glu Ile Phe Asn Asp Pro Lys Val
885 890 895
Asp Ala Asp Phe Leu Ala Asn Val Asp Ala Leu Ala Glu Asp Ala Phe
900 905 910
Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr Pro Leu Met Ala Asp
915 920 925
Leu Lys Gln Leu Tyr Leu Asp Ala His Ala Ala Pro Ile Leu Pro Val
930 935 940
Lys Thr Leu Glu Phe Phe Ser Lys Ile Asn
945 950
<210> 25
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 25
tgggaatatt accgctcgaa g 21
<210> 26
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 26
ctttaaaaaa tttccaattt tcctttacg 29
<210> 27
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 27
tcgcagccac gggtcaac 18
<210> 28
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 28
ttttattatt agtctttttt ttttttgaca atatctg 37
<210> 29
<211> 31
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 29
atgactcaat tcactgacat tgataagcta g 31
<210> 30
<211> 32
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 30
atattcttta ttggctttat acttgaatgg tg 32
<210> 31
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 31
gacgttgatt taaggtggtt ccgg 24
<210> 32
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 32
atgtctgaac cagctcaaaa gaaac 25
<210> 33
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 33
tttgaatatg tattacttgg ttatggttat atatgac 37
<210> 34
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 34
actggtagag agcgactttg tatgc 25
<210> 35
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 35
gctttcaatt catttgggtg tg 22
<210> 36
<211> 30
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 36
tgtatatgag atagttgatt gtatgcttgg 30
<210> 37
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 37
atggtcaaac caattatagc tcccagta 28
<210> 38
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 38
aaatggatat tgatctagat ggcgg 25
<210> 39
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 39
cttggtgtgt catcggtagt aacg 24
<210> 40
<211> 17
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 40
atggctgccg gtgtccc 17
<210> 41
<211> 36
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 41
ttgtaattaa aacttagatt agattgctat gctttc 36
<210> 42
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 42
aggaacagcc gtcaaggg 18
<210> 43
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 43
cttccctttt acagtgcttc ggaaaagc 28
<210> 44
<211> 32
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 44
tttgttttgt gtgtaaattt agtgaagtac tg 32
<210> 45
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 45
atgtctcaaa tttttaagga tatcccag 28
<210> 46
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 46
ttattgaaac aaaatttggt taataatac 29
<210> 47
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 47
taaagtaaga gcgctacatt ggtctacc 28
<210> 48
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 48
ttactccgca acgcttttct gaac 24
<210> 49
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 49
tagcgttgaa tgttagcgtc aacaac 26
<210> 50
<211> 41
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 50
tttgtttgtt tatgtgtgtt tattcgaaac taagttcttg g 41
<210> 51
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 51
atgttgtgtt cagtaattca gagacag 27
<210> 52
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 52
ttagatgaga gtcttttcca gttcgc 26
<210> 53
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 53
tgacaccgat tatttaaagc tgcag 25
<210> 54
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 54
agagcgcgcc tcgttcag 18
<210> 55
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 55
aattccgctg tatagctcat atctttc 27
<210> 56
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 56
aacgaggcgc gctctaattc cgctgtatag ctcatatct 39
<210> 57
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 57
acgacatcgt cgaatatgat tcag 24
<210> 58
<211> 33
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 58
tattaattta gtgtgtgtat ttgtgtttgt gtg 33
<210> 59
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 59
atgagccata ttcaacggga aac 23
<210> 60
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 60
ttacaaccaa ttaaccaatt ctgattag 28
<210> 61
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 61
tgcatgtcta ctaaactcac aaattagagc ttcaatt 37
<210> 62
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 62
gggtaataac tgatataatt aaattgaagc tctaatttg 39
<210> 63
<211> 59
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 63
cccataactt cgtatagcat acattatacg aagttattga caccgattat ttaaagctg 59
<210> 64
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 64
gtatgctgca gctttaaata atcgg 25
<210> 65
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 65
tccagccagt aaaatccata ctcaac 26
<210> 66
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 66
aagggggaag gtgtggaatc 20
<210> 67
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 67
ctatgggact tccgggaaac 20
<210> 68
<211> 38
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 68
tgtgtattac gatatagtta atagttgata gttgattg 38
<210> 69
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 69
tagcgttgaa tgttagcgtc aacaac 26
<210> 70
<211> 41
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 70
tttgtttgtt tatgtgtgtt tattcgaaac taagttcttg g 41
<210> 71
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 71
atgtctatcc cagaaactca aaaaggtg 28
<210> 72
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 72
ttgtcctctg aggacataaa atacacac 28
<210> 73
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 73
ttactccgca acgcttttct gaac 24
<210> 74
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 74
taaagtaaga gcgctacatt ggtctacc 28
<210> 75
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 75
ttatgcggcc tctcctgc 18
<210> 76
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 76
atgtcaaaga gaaaagttgc tattatcg 28
<210> 77
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 77
ttttattatt agtctttttt ttttttgaca atatctg 37
<210> 78
<211> 18
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 78
tcgctcgcag ccacgggt 18
<210> 79
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 79
gattcggtaa tctccgagca g 21
<210> 80
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 80
gggtaataac tgatataatt aaattgaagc tctaatttg 39
<210> 81
<211> 35
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 81
gcggatctct tatgtcttta cgatttatag ttttc 35
<210> 82
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 82
gagggttggg cattcatcag 20
<210> 83
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 83
ttttattatt agtctttttt ttttttgaca atatctg 37
<210> 84
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 84
taaagtaaga gcgctacatt ggtctacc 28
<210> 85
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 85
ttaagctgat ttctttgctt tcttctcg 28
<210> 86
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 86
atggcagtta cgaacgttgc agag 24
<210> 87
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 87
ttttattatt agtctttttt ttttttgaca atatctg 37
<210> 88
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 88
taaagtaaga gcgctacatt ggtctacc 28
<210> 89
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 89
ttaacctgct aaaacacatc ttctttg 27
<210> 90
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 90
atgaataagg ataccttgat tccaactac 29
<210> 91
<211> 37
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 91
ttttattatt agtctttttt ttttttgaca atatctg 37
<210> 92
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 92
taaagtaaga gcgctacatt ggtctacc 28
<210> 93
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 93
ttagttgatt ttggagaaga attcaaggg 29
<210> 94
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 94
atgatgagtt cctctctggt tag 23
<210> 95
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 95
gtctgccaca ccgatttgc 19
<210> 96
<211> 31
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 96
cttatttaga agtgtcaaca acgtatctac c 31
<210> 97
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 97
cttaagacag gccccttttc ctttg 25
<210> 98
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 98
ctgcaggaat tcgatatcaa gcttatc 27
<210> 99
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 99
ttactccgca acgcttttct gaac 24
<210> 100
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 100
tccccgggta ccgagctcg 19
Claims (5)
1.乙醇的制造方法,具有以下工序:将导入了木糖异构酶基因和乙醛脱氢酶基因、高表达具有将乙醛转换为乙醇的活性的醇脱氢酶基因、且具有将乙醇转换为乙醛的活性的醇脱氢酶基因的表达量降低了的重组酵母,用含有木糖的培养基进行培养,从而进行乙醇发酵,其中所述重组酵母能够代谢所述含有木糖的培养基中的乙酸,
所述木糖异构酶基因编码序列号4所示的氨基酸序列中的第337位的天冬酰胺被替换成了半胱氨酸的氨基酸序列,
所述乙醛脱氢酶基因编码来源于大肠杆菌的乙醛脱氢酶、来源于拜氏梭菌的乙醛脱氢酶、或者来源于莱茵衣藻的乙醛脱氢酶,
所述来源于大肠杆菌的乙醛脱氢酶是以下的(c)或(d)的蛋白质,
(c)具有序列号2或20所示的氨基酸序列的蛋白质,
(d)具有相对于序列号2或20所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质,
所述来源于拜氏梭菌的乙醛脱氢酶是以下的(e)或(f)的蛋白质,
(e)具有序列号22所示的氨基酸序列的蛋白质,
(f)具有相对于序列号22所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质,
所述来源于莱茵衣藻的乙醛脱氢酶是以下的(g)或(h)的蛋白质,
(g)具有序列号24所示的氨基酸序列的蛋白质,
(h)具有相对于序列号24所示的氨基酸序列有70%以上的同一性的氨基酸序列、且具有乙醛脱氢酶活性的蛋白质。
2.根据权利要求1所述的乙醇的制造方法,其特征在于,所述重组酵母是进一步导入了木酮糖激酶基因的重组酵母。
3.根据权利要求1所述的乙醇的制造方法,其特征在于,所述重组酵母是导入了下述基因的重组酵母,所述基因编码选自构成戊糖磷酸途径中的非氧化过程的途径的酶群中的酶。
4.根据权利要求3所述的乙醇的制造方法,其特征在于,构成戊糖磷酸途径中的非氧化过程的途径的酶群是核糖-5-磷酸异构酶、核酮糖-5-磷酸-3-差向异构酶、转酮醇酶和转醛醇酶。
5.根据权利要求1所述的乙醇的制造方法,其特征在于,所述培养基含有纤维素,在所述乙醇发酵中,至少同时进行所述纤维素的糖化。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111253587.3A CN114774477B (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013037501 | 2013-02-27 | ||
JP2013-037501 | 2013-02-27 | ||
CN202111253587.3A CN114774477B (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
CN201480010549.7A CN105073990A (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
JP2014036652A JP6087854B2 (ja) | 2013-02-27 | 2014-02-27 | 組換え酵母を用いたエタノールの製造方法 |
PCT/JP2014/054915 WO2014133092A1 (ja) | 2013-02-27 | 2014-02-27 | 組換え酵母を用いたエタノールの製造方法 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201480010549.7A Division CN105073990A (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114774477A true CN114774477A (zh) | 2022-07-22 |
CN114774477B CN114774477B (zh) | 2024-04-09 |
Family
ID=51428351
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111253587.3A Active CN114774477B (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
CN201480010549.7A Pending CN105073990A (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201480010549.7A Pending CN105073990A (zh) | 2013-02-27 | 2014-02-27 | 使用了重组酵母的乙醇的制造方法 |
Country Status (6)
Country | Link |
---|---|
US (1) | US20160002674A1 (zh) |
JP (1) | JP6087854B2 (zh) |
CN (2) | CN114774477B (zh) |
BR (1) | BR112015019777B1 (zh) |
CA (1) | CA2901974C (zh) |
WO (1) | WO2014133092A1 (zh) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104024419A (zh) | 2011-10-24 | 2014-09-03 | 丰田自动车株式会社 | 使用了重组酵母的乙醇的制造方法 |
JP6027559B2 (ja) * | 2013-03-28 | 2016-11-16 | 株式会社豊田中央研究所 | キシロースイソメラーゼ活性を有するタンパク質及びその利用 |
JP6616311B2 (ja) * | 2014-10-15 | 2019-12-04 | Jxtgエネルギー株式会社 | キシロースからエタノールを生産する酵母 |
JP6303984B2 (ja) * | 2014-10-31 | 2018-04-04 | トヨタ自動車株式会社 | 連続培養によるエタノールの製造方法及び連続培養装置 |
JP6697250B2 (ja) * | 2015-12-02 | 2020-05-20 | 株式会社豊田中央研究所 | ゲノム再編方法及びその利用 |
JP6447583B2 (ja) * | 2016-06-24 | 2019-01-09 | トヨタ自動車株式会社 | 組換え酵母、及びそれを用いたエタノールの製造方法 |
JP6097869B1 (ja) * | 2016-08-01 | 2017-03-15 | 新日鉄住金エンジニアリング株式会社 | エタノールの製造方法 |
JP6879111B2 (ja) | 2017-08-02 | 2021-06-02 | トヨタ自動車株式会社 | 組換え酵母及びこれを用いたエタノールの製造方法 |
CA3091962A1 (en) | 2017-09-19 | 2019-03-28 | Lallemand Hungary Liquidity Management Llc | Acetate toxicity tolerance in recombinant microbial host cells |
JP2019068788A (ja) * | 2017-10-11 | 2019-05-09 | トヨタ自動車株式会社 | 組換え酵母及びこれを用いたエタノールの製造方法 |
JP7298673B2 (ja) * | 2017-11-06 | 2023-06-27 | トヨタ自動車株式会社 | エタノール発酵によるエタノール生産性の向上に関与する変異遺伝子及びこれを用いたエタノールの製造方法 |
JP7298674B2 (ja) * | 2017-11-06 | 2023-06-27 | トヨタ自動車株式会社 | エタノール発酵によるエタノール生産性の向上に関与する変異遺伝子及びこれを用いたエタノールの製造方法 |
JP7078887B2 (ja) * | 2017-11-06 | 2022-06-01 | トヨタ自動車株式会社 | エタノール発酵によるエタノール生産性の向上に関与する変異遺伝子及びこれを用いたエタノールの製造方法 |
US20210017526A1 (en) * | 2018-03-27 | 2021-01-21 | Basf Se | Xylose metabolizing yeast |
WO2019188839A1 (ja) | 2018-03-27 | 2019-10-03 | 積水化学工業株式会社 | エタノールの製造方法及びエタノール組成物 |
JP2020025493A (ja) * | 2018-08-10 | 2020-02-20 | トヨタ自動車株式会社 | 組換え酵母、及びそれを用いたエタノールの製造方法 |
CN113046252B (zh) * | 2021-03-23 | 2022-07-05 | 江南大学 | 一株乙醛脱氢酶高产菌株的分离与鉴定 |
CN114107368B (zh) * | 2021-11-29 | 2023-05-26 | 重庆大学 | 表达反式菊酸的联合表达载体及其在调控番茄vi型腺体腺毛合成反式菊酸中的应用 |
CN117384871B (zh) * | 2023-12-13 | 2024-03-01 | 北京科为博生物科技有限公司 | 一种热稳定性提高的乙醛脱氢酶及其基因和应用 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1989001520A1 (en) * | 1987-08-11 | 1989-02-23 | Cetus Corporation | Procaryotic xylose isomerase muteins and method to increase protein stability |
CN101939433A (zh) * | 2007-07-23 | 2011-01-05 | 帝斯曼知识产权资产管理有限公司 | 酵母中生产乙酰-CoA的酶 |
JP2011024500A (ja) * | 2009-07-27 | 2011-02-10 | Toyota Central R&D Labs Inc | 発酵能力が向上された酵母及びその利用 |
US20110165660A1 (en) * | 2009-07-09 | 2011-07-07 | Verdezyne, Inc. | Engineered microorganisms with enhanced fermentation activity |
WO2012067510A1 (en) * | 2010-11-18 | 2012-05-24 | C5 Yeast Company B.V. | Yeast strains engineered to produce ethanol from glycerol |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5091523B2 (ja) * | 2007-03-30 | 2012-12-05 | 三井造船株式会社 | アルコール生産方法 |
JP5608999B2 (ja) * | 2009-04-08 | 2014-10-22 | 株式会社豊田中央研究所 | キシロースを利用して有用物質を生産する方法 |
EP2277989A1 (en) * | 2009-07-24 | 2011-01-26 | Technische Universiteit Delft | Fermentative glycerol-free ethanol production |
WO2011078262A1 (ja) * | 2009-12-22 | 2011-06-30 | 株式会社豊田中央研究所 | キシロースイソメラーゼ及びその利用 |
BR112012028290B1 (pt) * | 2010-05-05 | 2021-02-02 | Lallemand Hungary Liquidity Management Llc. | levedura recombinante, processo para converter biomassa em etanol e meio de fermentação compreendendo dita levedura |
EP2633030A1 (en) * | 2010-10-29 | 2013-09-04 | Novozymes A/S | Recombinant n-propanol and isopropanol production |
CN104903441A (zh) * | 2012-11-09 | 2015-09-09 | 拉勒曼德匈牙利流动管理有限责任公司 | 在纤维素原料的乙醇发酵期间消耗乙酸盐的方法 |
-
2014
- 2014-02-27 CN CN202111253587.3A patent/CN114774477B/zh active Active
- 2014-02-27 US US14/767,821 patent/US20160002674A1/en not_active Abandoned
- 2014-02-27 BR BR112015019777-9A patent/BR112015019777B1/pt active IP Right Grant
- 2014-02-27 CA CA2901974A patent/CA2901974C/en active Active
- 2014-02-27 JP JP2014036652A patent/JP6087854B2/ja active Active
- 2014-02-27 WO PCT/JP2014/054915 patent/WO2014133092A1/ja active Application Filing
- 2014-02-27 CN CN201480010549.7A patent/CN105073990A/zh active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1989001520A1 (en) * | 1987-08-11 | 1989-02-23 | Cetus Corporation | Procaryotic xylose isomerase muteins and method to increase protein stability |
CN101939433A (zh) * | 2007-07-23 | 2011-01-05 | 帝斯曼知识产权资产管理有限公司 | 酵母中生产乙酰-CoA的酶 |
US20110165660A1 (en) * | 2009-07-09 | 2011-07-07 | Verdezyne, Inc. | Engineered microorganisms with enhanced fermentation activity |
JP2011024500A (ja) * | 2009-07-27 | 2011-02-10 | Toyota Central R&D Labs Inc | 発酵能力が向上された酵母及びその利用 |
WO2012067510A1 (en) * | 2010-11-18 | 2012-05-24 | C5 Yeast Company B.V. | Yeast strains engineered to produce ethanol from glycerol |
Also Published As
Publication number | Publication date |
---|---|
CN114774477B (zh) | 2024-04-09 |
CA2901974C (en) | 2020-01-14 |
CN105073990A (zh) | 2015-11-18 |
BR112015019777B1 (pt) | 2022-12-06 |
JP2014193152A (ja) | 2014-10-09 |
WO2014133092A1 (ja) | 2014-09-04 |
US20160002674A1 (en) | 2016-01-07 |
CA2901974A1 (en) | 2014-09-04 |
BR112015019777A2 (zh) | 2017-08-22 |
JP6087854B2 (ja) | 2017-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114774477B (zh) | 使用了重组酵母的乙醇的制造方法 | |
JP5817836B2 (ja) | 組換え酵母を用いたエタノールの製造方法 | |
JP5590140B2 (ja) | 組換え酵母を用いたエタノールの製造方法 | |
JP5845210B2 (ja) | 組換え酵母、及びそれを用いたエタノールの製造方法 | |
US10745684B2 (en) | Recombinant yeast and a method for producing ethanol using the same | |
CA3028926C (en) | A recombinant yeast and a method for producing ethanol using the same | |
CN113272438A (zh) | 重组酵母和使用其的乙醇的制造方法 | |
EP2643466B1 (en) | Yeast cells and process for converting acetaldehyde to ethanol | |
WO2020032233A1 (ja) | 組換え酵母、及びそれを用いたエタノールの製造方法 | |
WO2014021163A1 (ja) | 組換え酵母を用いたエタノールの製造方法 | |
CA3020447A1 (en) | Recombinant yeast and method for producing ethanol using the same | |
JP7078900B2 (ja) | 形質転換酵母及びこれを用いたエタノールの製造方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |