CN110072997A - 具有蛋白酶活性的多肽和编码其的多核苷酸 - Google Patents
具有蛋白酶活性的多肽和编码其的多核苷酸 Download PDFInfo
- Publication number
- CN110072997A CN110072997A CN201780060772.6A CN201780060772A CN110072997A CN 110072997 A CN110072997 A CN 110072997A CN 201780060772 A CN201780060772 A CN 201780060772A CN 110072997 A CN110072997 A CN 110072997A
- Authority
- CN
- China
- Prior art keywords
- ala
- ser
- gly
- thr
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 284
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 276
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 276
- 108091005804 Peptidases Proteins 0.000 title claims abstract description 128
- 102000035195 Peptidases Human genes 0.000 title claims abstract description 125
- 102000040430 polynucleotide Human genes 0.000 title claims abstract description 90
- 108091033319 polynucleotide Proteins 0.000 title claims abstract description 90
- 239000002157 polynucleotide Substances 0.000 title claims abstract description 89
- 230000000694 effects Effects 0.000 title claims abstract description 60
- 235000019833 protease Nutrition 0.000 title claims abstract description 41
- 238000000034 method Methods 0.000 claims abstract description 167
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 21
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 20
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 20
- 108090000637 alpha-Amylases Proteins 0.000 claims description 139
- 102000004139 alpha-Amylases Human genes 0.000 claims description 139
- 229940024171 alpha-amylase Drugs 0.000 claims description 137
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 124
- 238000000855 fermentation Methods 0.000 claims description 93
- 230000004151 fermentation Effects 0.000 claims description 93
- 239000004365 Protease Substances 0.000 claims description 87
- 235000019419 proteases Nutrition 0.000 claims description 87
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 claims description 86
- 102100022624 Glucoamylase Human genes 0.000 claims description 86
- 235000019441 ethanol Nutrition 0.000 claims description 71
- 108090000790 Enzymes Proteins 0.000 claims description 65
- 229920002472 Starch Polymers 0.000 claims description 64
- 239000008107 starch Substances 0.000 claims description 64
- 102000004190 Enzymes Human genes 0.000 claims description 63
- 235000019698 starch Nutrition 0.000 claims description 63
- 239000000203 mixture Substances 0.000 claims description 61
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 claims description 56
- 239000005864 Sulphur Substances 0.000 claims description 56
- 150000001413 amino acids Chemical class 0.000 claims description 55
- 239000000463 material Substances 0.000 claims description 55
- 229940088598 enzyme Drugs 0.000 claims description 51
- 241001478240 Coccus Species 0.000 claims description 48
- 238000004519 manufacturing process Methods 0.000 claims description 30
- 239000002002 slurry Substances 0.000 claims description 30
- 230000002255 enzymatic effect Effects 0.000 claims description 29
- 238000011084 recovery Methods 0.000 claims description 23
- 238000009396 hybridization Methods 0.000 claims description 15
- 108010045789 sfericase Proteins 0.000 claims description 10
- 229950010531 sfericase Drugs 0.000 claims description 10
- 238000003259 recombinant expression Methods 0.000 claims description 8
- 239000000446 fuel Substances 0.000 claims description 7
- 230000009286 beneficial effect Effects 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 5
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims 3
- 239000008186 active pharmaceutical agent Substances 0.000 claims 1
- 210000004027 cell Anatomy 0.000 description 109
- 235000001014 amino acid Nutrition 0.000 description 60
- 229940024606 amino acid Drugs 0.000 description 55
- 230000001580 bacterial effect Effects 0.000 description 53
- 108090000623 proteins and genes Proteins 0.000 description 48
- 108020004414 DNA Proteins 0.000 description 45
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 40
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 35
- 125000003275 alpha amino acid group Chemical group 0.000 description 30
- 239000000523 sample Substances 0.000 description 27
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 26
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 25
- 241000193830 Bacillus <bacterium> Species 0.000 description 24
- 241000894006 Bacteria Species 0.000 description 24
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 22
- 239000007788 liquid Substances 0.000 description 21
- 239000004202 carbamide Substances 0.000 description 20
- 239000002773 nucleotide Substances 0.000 description 19
- 125000003729 nucleotide group Chemical group 0.000 description 19
- 102200086056 rs41494349 Human genes 0.000 description 19
- 239000002609 medium Substances 0.000 description 18
- 239000002253 acid Substances 0.000 description 17
- 230000014509 gene expression Effects 0.000 description 17
- 240000008042 Zea mays Species 0.000 description 16
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 16
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 16
- 235000005822 corn Nutrition 0.000 description 16
- 230000013595 glycosylation Effects 0.000 description 16
- 238000006206 glycosylation reaction Methods 0.000 description 16
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 16
- 238000006467 substitution reaction Methods 0.000 description 16
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 15
- 235000015097 nutrients Nutrition 0.000 description 15
- 239000007787 solid Substances 0.000 description 15
- 230000001360 synchronised effect Effects 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 12
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 12
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 12
- 238000003780 insertion Methods 0.000 description 12
- 239000000126 substance Substances 0.000 description 12
- 241000985513 Penicillium oxalicum Species 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 11
- 230000037431 insertion Effects 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- 238000012360 testing method Methods 0.000 description 11
- LKDMKWNDBAVNQZ-WJNSRDFLSA-N 4-[[(2s)-1-[[(2s)-1-[(2s)-2-[[(2s)-1-(4-nitroanilino)-1-oxo-3-phenylpropan-2-yl]carbamoyl]pyrrolidin-1-yl]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-4-oxobutanoic acid Chemical compound OC(=O)CCC(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)NC=1C=CC(=CC=1)[N+]([O-])=O)CC1=CC=CC=C1 LKDMKWNDBAVNQZ-WJNSRDFLSA-N 0.000 description 10
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 10
- 244000063299 Bacillus subtilis Species 0.000 description 10
- 235000014469 Bacillus subtilis Nutrition 0.000 description 10
- 241000233866 Fungi Species 0.000 description 10
- 108010077245 asparaginyl-proline Proteins 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- 238000005259 measurement Methods 0.000 description 10
- 244000005700 microbiome Species 0.000 description 10
- 230000035772 mutation Effects 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 10
- 238000012545 processing Methods 0.000 description 10
- 108010082371 succinyl-alanyl-alanyl-prolyl-phenylalanine-4-nitroanilide Proteins 0.000 description 10
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 9
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 9
- 101710180319 Protease 1 Proteins 0.000 description 9
- 108010076504 Protein Sorting Signals Proteins 0.000 description 9
- 101710137710 Thioesterase 1/protease 1/lysophospholipase L1 Proteins 0.000 description 9
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 9
- 239000002585 base Substances 0.000 description 9
- 235000013405 beer Nutrition 0.000 description 9
- 239000001110 calcium chloride Substances 0.000 description 9
- 229910001628 calcium chloride Inorganic materials 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 238000004128 high performance liquid chromatography Methods 0.000 description 9
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- -1 variant Proteins 0.000 description 9
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 8
- 241000123332 Gloeophyllum Species 0.000 description 8
- 241001492300 Gloeophyllum trabeum Species 0.000 description 8
- 125000003412 L-alanyl group Chemical group [H]N([H])[C@@](C([H])([H])[H])(C(=O)[*])[H] 0.000 description 8
- 240000003183 Manihot esculenta Species 0.000 description 8
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 235000013339 cereals Nutrition 0.000 description 8
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 8
- 239000000843 powder Substances 0.000 description 8
- 235000018102 proteins Nutrition 0.000 description 8
- 102000004169 proteins and genes Human genes 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 7
- 229920000945 Amylopectin Polymers 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 7
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 7
- 241000209140 Triticum Species 0.000 description 7
- 235000021307 Triticum Nutrition 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 108010047495 alanylglycine Proteins 0.000 description 7
- 239000012876 carrier material Substances 0.000 description 7
- 238000001914 filtration Methods 0.000 description 7
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 7
- 239000001963 growth medium Substances 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 239000002853 nucleic acid probe Substances 0.000 description 7
- 150000007524 organic acids Chemical class 0.000 description 7
- 102200081484 rs1553259760 Human genes 0.000 description 7
- 230000028327 secretion Effects 0.000 description 7
- 239000013049 sediment Substances 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 108010051110 tyrosyl-lysine Proteins 0.000 description 7
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- 241000972773 Aulopiformes Species 0.000 description 6
- 241000194108 Bacillus licheniformis Species 0.000 description 6
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- 108010068370 Glutens Proteins 0.000 description 6
- 239000004471 Glycine Substances 0.000 description 6
- 102220466851 HLA class II histocompatibility antigen, DR beta 4 chain_V59A_mutation Human genes 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- 241000959173 Rasamsonia emersonii Species 0.000 description 6
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 6
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 6
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 239000006167 equilibration buffer Substances 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 235000013312 flour Nutrition 0.000 description 6
- 239000008103 glucose Substances 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 235000021312 gluten Nutrition 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 238000002703 mutagenesis Methods 0.000 description 6
- 231100000350 mutagenesis Toxicity 0.000 description 6
- 229910052757 nitrogen Inorganic materials 0.000 description 6
- 229910017464 nitrogen compound Inorganic materials 0.000 description 6
- 150000002830 nitrogen compounds Chemical class 0.000 description 6
- 238000003752 polymerase chain reaction Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000004064 recycling Methods 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 235000019515 salmon Nutrition 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 5
- 239000004382 Amylase Substances 0.000 description 5
- 241000228245 Aspergillus niger Species 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 241000726221 Gemma Species 0.000 description 5
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 5
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 5
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 5
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 5
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 5
- 239000007983 Tris buffer Substances 0.000 description 5
- 229920004890 Triton X-100 Polymers 0.000 description 5
- 239000013504 Triton X-100 Substances 0.000 description 5
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 5
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 5
- 235000011054 acetic acid Nutrition 0.000 description 5
- 229910021529 ammonia Inorganic materials 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- 239000012530 fluid Substances 0.000 description 5
- 239000000413 hydrolysate Substances 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 150000003839 salts Chemical class 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 5
- 239000004475 Arginine Substances 0.000 description 4
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 4
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 4
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 4
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 4
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 4
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 240000007594 Oryza sativa Species 0.000 description 4
- 235000007164 Oryza sativa Nutrition 0.000 description 4
- 229930182555 Penicillin Natural products 0.000 description 4
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 4
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 4
- 241000205156 Pyrococcus furiosus Species 0.000 description 4
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 4
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 4
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 4
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 4
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 4
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 4
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 4
- 241000218636 Thuja Species 0.000 description 4
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 4
- 238000000113 differential scanning calorimetry Methods 0.000 description 4
- 230000007717 exclusion Effects 0.000 description 4
- 230000002538 fungal effect Effects 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 235000013922 glutamic acid Nutrition 0.000 description 4
- 239000004220 glutamic acid Substances 0.000 description 4
- 235000011187 glycerol Nutrition 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 239000004310 lactic acid Substances 0.000 description 4
- 235000014655 lactic acid Nutrition 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 230000035800 maturation Effects 0.000 description 4
- 229940049954 penicillin Drugs 0.000 description 4
- 108010084572 phenylalanyl-valine Proteins 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 230000006798 recombination Effects 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 238000010992 reflux Methods 0.000 description 4
- 235000009566 rice Nutrition 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 235000000346 sugar Nutrition 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- NRDGOJQPHWAEES-RKDXNWHRSA-N (2r)-2,6-diamino-n-[(2r)-1-amino-5-[[amino(nitramido)methylidene]amino]-1-oxopentan-2-yl]hexanamide Chemical compound NCCCC[C@@H](N)C(=O)N[C@@H](C(N)=O)CCCN=C(N)N[N+]([O-])=O NRDGOJQPHWAEES-RKDXNWHRSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 3
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 3
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 3
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 3
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 3
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 3
- YWFLXGZHZXXINF-BPUTZDHNSA-N Asn-Pro-Trp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 YWFLXGZHZXXINF-BPUTZDHNSA-N 0.000 description 3
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 3
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 3
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 3
- 102000010911 Enzyme Precursors Human genes 0.000 description 3
- 108010062466 Enzyme Precursors Proteins 0.000 description 3
- 241001468125 Exiguobacterium Species 0.000 description 3
- 229930091371 Fructose Natural products 0.000 description 3
- 239000005715 Fructose Substances 0.000 description 3
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 3
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 3
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 3
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 3
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 3
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 3
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 3
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 3
- 235000007340 Hordeum vulgare Nutrition 0.000 description 3
- 240000005979 Hordeum vulgare Species 0.000 description 3
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 3
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- 125000000570 L-alpha-aspartyl group Chemical group [H]OC(=O)C([H])([H])[C@]([H])(N([H])[H])C(*)=O 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 3
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 3
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- GXCLVBGFBYZDAG-UHFFFAOYSA-N N-[2-(1H-indol-3-yl)ethyl]-N-methylprop-2-en-1-amine Chemical compound CN(CCC1=CNC2=C1C=CC=C2)CC=C GXCLVBGFBYZDAG-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 3
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 3
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 3
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 3
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 3
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 3
- 241000235525 Rhizomucor pusillus Species 0.000 description 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 3
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 3
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 3
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 3
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 3
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 3
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 3
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 3
- 240000006394 Sorghum bicolor Species 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- 244000046109 Sorghum vulgare var. nervosum Species 0.000 description 3
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 3
- 241000228341 Talaromyces Species 0.000 description 3
- 241000482676 Thermococcus thioreducens Species 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 3
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 3
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 3
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 3
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 3
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 3
- 241000222354 Trametes Species 0.000 description 3
- 241000223259 Trichoderma Species 0.000 description 3
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 3
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 3
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 3
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 3
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 3
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 3
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 3
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 3
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 3
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 3
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 3
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 3
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 239000012491 analyte Substances 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 150000001768 cations Chemical class 0.000 description 3
- 239000006143 cell culture medium Substances 0.000 description 3
- 230000002759 chromosomal effect Effects 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000001816 cooling Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000004821 distillation Methods 0.000 description 3
- 238000009837 dry grinding Methods 0.000 description 3
- 238000004043 dyeing Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 3
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 150000002500 ions Chemical class 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- ZLMJMSJWJFRBEC-OUBTZVSYSA-N potassium-40 Chemical compound [40K] ZLMJMSJWJFRBEC-OUBTZVSYSA-N 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 102220087235 rs864622622 Human genes 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 239000007858 starting material Substances 0.000 description 3
- 239000008399 tap water Substances 0.000 description 3
- 235000020679 tap water Nutrition 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 239000003643 water by type Substances 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- XNPKNHHFCKSMRV-UHFFFAOYSA-N 4-(cyclohexylamino)butane-1-sulfonic acid Chemical compound OS(=O)(=O)CCCCNC1CCCCC1 XNPKNHHFCKSMRV-UHFFFAOYSA-N 0.000 description 2
- 102220466243 Acyl-coenzyme A thioesterase MBLAC2_R170A_mutation Human genes 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 2
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- 102000013142 Amylases Human genes 0.000 description 2
- 108010065511 Amylases Proteins 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 2
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 2
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 2
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 2
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 2
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 2
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241001513093 Aspergillus awamori Species 0.000 description 2
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 2
- 240000006439 Aspergillus oryzae Species 0.000 description 2
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 2
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 2
- 101000775727 Bacillus amyloliquefaciens Alpha-amylase Proteins 0.000 description 2
- 241001328122 Bacillus clausii Species 0.000 description 2
- 101000695691 Bacillus licheniformis Beta-lactamase Proteins 0.000 description 2
- 108010029675 Bacillus licheniformis alpha-amylase Proteins 0.000 description 2
- 239000008000 CHES buffer Substances 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N D-Maltose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- 241000605909 Fusobacterium Species 0.000 description 2
- 101100369308 Geobacillus stearothermophilus nprS gene Proteins 0.000 description 2
- 101100080316 Geobacillus stearothermophilus nprT gene Proteins 0.000 description 2
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 2
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 2
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- 102220477021 Hexokinase-4_S411F_mutation Human genes 0.000 description 2
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 2
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 2
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 2
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- 102220468791 Inositol 1,4,5-trisphosphate receptor type 2_Y167A_mutation Human genes 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- 125000003440 L-leucyl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])C(C([H])([H])[H])([H])C([H])([H])[H] 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- 241000235395 Mucor Species 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- MKWKNSIESPFAQN-UHFFFAOYSA-N N-cyclohexyl-2-aminoethanesulfonic acid Chemical compound OS(=O)(=O)CCNC1CCCCC1 MKWKNSIESPFAQN-UHFFFAOYSA-N 0.000 description 2
- 229920002274 Nalgene Polymers 0.000 description 2
- 241001328040 Nigrofomes Species 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 244000082988 Secale cereale Species 0.000 description 2
- 235000007238 Secale cereale Nutrition 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 2
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 108010056079 Subtilisins Proteins 0.000 description 2
- 102000005158 Subtilisins Human genes 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 2
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 239000008351 acetate buffer Substances 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 235000015107 ale Nutrition 0.000 description 2
- 239000003513 alkali Substances 0.000 description 2
- 235000019418 amylase Nutrition 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 239000002361 compost Substances 0.000 description 2
- 238000010411 cooking Methods 0.000 description 2
- 235000013365 dairy product Nutrition 0.000 description 2
- 238000010908 decantation Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000009795 derivation Methods 0.000 description 2
- 238000002050 diffraction method Methods 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000003797 essential amino acid Substances 0.000 description 2
- 235000020776 essential amino acid Nutrition 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- FGKJLKRYENPLQH-UHFFFAOYSA-N isocaproic acid Chemical compound CC(C)CCC(O)=O FGKJLKRYENPLQH-UHFFFAOYSA-N 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 239000007791 liquid phase Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 230000002906 microbiologic effect Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 235000005985 organic acids Nutrition 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 239000012088 reference solution Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 102200053231 rs104894354 Human genes 0.000 description 2
- 102220052102 rs35524245 Human genes 0.000 description 2
- 102220026086 rs397518426 Human genes 0.000 description 2
- 102220239115 rs779234287 Human genes 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000002689 soil Substances 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000010025 steaming Methods 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 150000003722 vitamin derivatives Chemical class 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- DWNBOPVKNPVNQG-LURJTMIESA-N (2s)-4-hydroxy-2-(propylamino)butanoic acid Chemical compound CCCN[C@H](C(O)=O)CCO DWNBOPVKNPVNQG-LURJTMIESA-N 0.000 description 1
- NWXMGUDVXFXRIG-WESIUVDSSA-N (4s,4as,5as,6s,12ar)-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O NWXMGUDVXFXRIG-WESIUVDSSA-N 0.000 description 1
- CHHHXKFHOYLYRE-UHFFFAOYSA-M 2,4-Hexadienoic acid, potassium salt (1:1), (2E,4E)- Chemical compound [K+].CC=CC=CC([O-])=O CHHHXKFHOYLYRE-UHFFFAOYSA-M 0.000 description 1
- JAHNSTQSQJOJLO-UHFFFAOYSA-N 2-(3-fluorophenyl)-1h-imidazole Chemical compound FC1=CC=CC(C=2NC=CN=2)=C1 JAHNSTQSQJOJLO-UHFFFAOYSA-N 0.000 description 1
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 1
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 1
- KDZIGQIDPXKMBA-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound CC(C)C(N)C(=O)NCC(=O)NC(CO)C(=O)NC(C(O)=O)CCC(O)=O KDZIGQIDPXKMBA-UHFFFAOYSA-N 0.000 description 1
- WLJVXDMOQOGPHL-PPJXEINESA-N 2-phenylacetic acid Chemical compound O[14C](=O)CC1=CC=CC=C1 WLJVXDMOQOGPHL-PPJXEINESA-N 0.000 description 1
- KKAJSJJFBSOMGS-UHFFFAOYSA-N 3,6-diamino-10-methylacridinium chloride Chemical compound [Cl-].C1=C(N)C=C2[N+](C)=C(C=C(N)C=C3)C3=CC2=C1 KKAJSJJFBSOMGS-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IDLBLNBDLCTPGC-HERUPUMHSA-N Ala-Trp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N IDLBLNBDLCTPGC-HERUPUMHSA-N 0.000 description 1
- RIPMDCIXRYWXSH-KNXALSJPSA-N Ala-Trp-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N RIPMDCIXRYWXSH-KNXALSJPSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 241000534414 Anotopterus nikparini Species 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 102000035101 Aspartic proteases Human genes 0.000 description 1
- 108091005502 Aspartic proteases Proteins 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 108090000145 Bacillolysin Proteins 0.000 description 1
- 241000193752 Bacillus circulans Species 0.000 description 1
- 241000193749 Bacillus coagulans Species 0.000 description 1
- 241000680658 Bacillus deramificans Species 0.000 description 1
- 241000193747 Bacillus firmus Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 241000194107 Bacillus megaterium Species 0.000 description 1
- 241000194103 Bacillus pumilus Species 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 241000193764 Brevibacillus brevis Species 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 102220584229 Cellular tumor antigen p53_A76G_mutation Human genes 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- RGHNJXZEOKUKBD-UHFFFAOYSA-N D-gluconic acid Natural products OCC(O)C(O)C(O)C(O)C(O)=O RGHNJXZEOKUKBD-UHFFFAOYSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 101100342470 Dictyostelium discoideum pkbA gene Proteins 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 241000194033 Enterococcus Species 0.000 description 1
- 101100385973 Escherichia coli (strain K12) cycA gene Proteins 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 241000626621 Geobacillus Species 0.000 description 1
- 101100001650 Geobacillus stearothermophilus amyM gene Proteins 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- HGBHRZBXOOHRDH-JBACZVJFSA-N Gln-Tyr-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HGBHRZBXOOHRDH-JBACZVJFSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- CGWHAXBNGYQBBK-JBACZVJFSA-N Glu-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)C1=CC=C(O)C=C1 CGWHAXBNGYQBBK-JBACZVJFSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- RGHNJXZEOKUKBD-SQOUGZDYSA-N Gluconic acid Natural products OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 102220536644 Hemoglobin subunit epsilon_G20S_mutation Human genes 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- WCNXUTNLSRWWQN-DCAQKATOSA-N His-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WCNXUTNLSRWWQN-DCAQKATOSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- HZWWOGWOBQBETJ-CUJWVEQBSA-N His-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O HZWWOGWOBQBETJ-CUJWVEQBSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- 101000882901 Homo sapiens Claudin-2 Proteins 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 1
- YBGTWSFIGHUWQE-MXAVVETBSA-N Ile-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CN=CN1 YBGTWSFIGHUWQE-MXAVVETBSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 244000017020 Ipomoea batatas Species 0.000 description 1
- 235000002678 Ipomoea batatas Nutrition 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- 125000001176 L-lysyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C([H])([H])C(N([H])[H])([H])[H] 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 125000000769 L-threonyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])[C@](O[H])(C([H])([H])[H])[H] 0.000 description 1
- 125000003798 L-tyrosyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C1=C([H])C([H])=C(O[H])C([H])=C1[H] 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 125000003580 L-valyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(C([H])([H])[H])(C([H])([H])[H])[H] 0.000 description 1
- 108010029541 Laccase Proteins 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108010036940 Levansucrase Proteins 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- KQAREVUPVXMNNP-WDSOQIARSA-N Lys-Trp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O KQAREVUPVXMNNP-WDSOQIARSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- GVIVXNFKJQFTCE-YUMQZZPRSA-N Met-Gly-Gln Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O GVIVXNFKJQFTCE-YUMQZZPRSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 102000035092 Neutral proteases Human genes 0.000 description 1
- 108091005507 Neutral proteases Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 241001072230 Oceanobacillus Species 0.000 description 1
- 108010029182 Pectin lyase Proteins 0.000 description 1
- 241000228143 Penicillium Species 0.000 description 1
- 241000228150 Penicillium chrysogenum Species 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- WZEWCHQHNCMBEN-PMVMPFDFSA-N Phe-Lys-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N WZEWCHQHNCMBEN-PMVMPFDFSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 102100038946 Proprotein convertase subtilisin/kexin type 6 Human genes 0.000 description 1
- 241000222644 Pycnoporus <fungus> Species 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000235402 Rhizomucor Species 0.000 description 1
- 241000235403 Rhizomucor miehei Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- FZNNGIHSIPKFRE-QEJZJMRPSA-N Ser-Trp-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZNNGIHSIPKFRE-QEJZJMRPSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 244000188014 Spathodea campanulata Species 0.000 description 1
- 235000017899 Spathodea campanulata Nutrition 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101100309436 Streptococcus mutans serotype c (strain ATCC 700610 / UA159) ftf gene Proteins 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 101710135785 Subtilisin-like protease Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 241000205188 Thermococcus Species 0.000 description 1
- 241000205180 Thermococcus litoralis Species 0.000 description 1
- 241001313536 Thermothelomyces thermophila Species 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- UXUAZXWKIGPUCH-RCWTZXSCSA-N Thr-Met-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O UXUAZXWKIGPUCH-RCWTZXSCSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- 241001230654 Trametes cingulata Species 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 241000499912 Trichoderma reesei Species 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- AFSYEUHJBVCPEL-JBACZVJFSA-N Trp-Gln-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AFSYEUHJBVCPEL-JBACZVJFSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 1
- BGWSLEYVITZIQP-DCPHZVHLSA-N Trp-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O BGWSLEYVITZIQP-DCPHZVHLSA-N 0.000 description 1
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 1
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 1
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 1
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- XQMGDVVKFRLQKH-BBRMVZONSA-N Trp-Val-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O)=CNC2=C1 XQMGDVVKFRLQKH-BBRMVZONSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- CCEVJBJLPRNAFH-BVSLBCMMSA-N Tyr-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N CCEVJBJLPRNAFH-BVSLBCMMSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 241000202898 Ureaplasma Species 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010045649 agarase Proteins 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 125000003158 alcohol group Chemical group 0.000 description 1
- OENHQHLEOONYIE-UKMVMLAPSA-N all-trans beta-carotene Natural products CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C OENHQHLEOONYIE-UKMVMLAPSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000000845 anti-microbial effect Effects 0.000 description 1
- 101150009206 aprE gene Proteins 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000005844 autocatalytic reaction Methods 0.000 description 1
- 229940054340 bacillus coagulans Drugs 0.000 description 1
- 229940005348 bacillus firmus Drugs 0.000 description 1
- 229940097012 bacillus thuringiensis Drugs 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 108010019077 beta-Amylase Proteins 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 235000013734 beta-carotene Nutrition 0.000 description 1
- TUPZEYHYWIEDIH-WAIFQNFQSA-N beta-carotene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCCC1(C)C)C=CC=C(/C)C=CC2=CCCCC2(C)C TUPZEYHYWIEDIH-WAIFQNFQSA-N 0.000 description 1
- 239000011648 beta-carotene Substances 0.000 description 1
- 229960002747 betacarotene Drugs 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 239000003225 biodiesel Substances 0.000 description 1
- 230000009141 biological interaction Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 101150005799 dagA gene Proteins 0.000 description 1
- 238000013502 data validation Methods 0.000 description 1
- 238000005034 decoration Methods 0.000 description 1
- 230000018044 dehydration Effects 0.000 description 1
- 238000006297 dehydration reaction Methods 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 235000014103 egg white Nutrition 0.000 description 1
- 210000000969 egg white Anatomy 0.000 description 1
- 235000013601 eggs Nutrition 0.000 description 1
- FPIQZBQZKBKLEI-UHFFFAOYSA-N ethyl 1-[[2-chloroethyl(nitroso)carbamoyl]amino]cyclohexane-1-carboxylate Chemical compound ClCCN(N=O)C(=O)NC1(C(=O)OCC)CCCCC1 FPIQZBQZKBKLEI-UHFFFAOYSA-N 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 238000001400 expression cloning Methods 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 108010061330 glucan 1,4-alpha-maltohydrolase Proteins 0.000 description 1
- 239000000174 gluconic acid Substances 0.000 description 1
- 235000012208 gluconic acid Nutrition 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 229940059442 hemicellulase Drugs 0.000 description 1
- 108010002430 hemicellulase Proteins 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000009655 industrial fermentation Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 239000010977 jade Substances 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 239000010985 leather Substances 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 235000021440 light beer Nutrition 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 238000005297 material degradation process Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000012092 media component Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 238000005374 membrane filtration Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- LVHBHZANLOWSRM-UHFFFAOYSA-N methylenebutanedioic acid Natural products OC(=O)CC(=C)C(O)=O LVHBHZANLOWSRM-UHFFFAOYSA-N 0.000 description 1
- 238000001471 micro-filtration Methods 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000013365 molecular weight analysis method Methods 0.000 description 1
- 239000005445 natural material Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 101150105920 npr gene Proteins 0.000 description 1
- 101150017837 nprM gene Proteins 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- 101150019841 penP gene Proteins 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 235000020030 perry Nutrition 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 235000020004 porter Nutrition 0.000 description 1
- 235000010241 potassium sorbate Nutrition 0.000 description 1
- 239000004302 potassium sorbate Substances 0.000 description 1
- 229940069338 potassium sorbate Drugs 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 101150108007 prs gene Proteins 0.000 description 1
- 101150086435 prs1 gene Proteins 0.000 description 1
- 101150070305 prsA gene Proteins 0.000 description 1
- 238000000197 pyrolysis Methods 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 101150025220 sacB gene Proteins 0.000 description 1
- 239000012488 sample solution Substances 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 108010063779 seryl-tyrosyl-prolyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 101150091813 shfl gene Proteins 0.000 description 1
- 238000010563 solid-state fermentation Methods 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 235000011044 succinic acid Nutrition 0.000 description 1
- 150000003444 succinic acids Chemical class 0.000 description 1
- 150000005846 sugar alcohols Polymers 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 229940032021 tetramune Drugs 0.000 description 1
- 238000001757 thermogravimetry curve Methods 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 238000001238 wet grinding Methods 0.000 description 1
- 210000004885 white matter Anatomy 0.000 description 1
- 101150052264 xylA gene Proteins 0.000 description 1
- 101150110790 xylB gene Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- OENHQHLEOONYIE-JLTXGRSLSA-N β-Carotene Chemical compound CC=1CCCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C OENHQHLEOONYIE-JLTXGRSLSA-N 0.000 description 1
- 150000003952 β-lactams Chemical class 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/06—Ethanol, i.e. non-beverage
- C12P7/065—Ethanol, i.e. non-beverage with microorganisms other than yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/75—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Bacillus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/02—Monosaccharides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/12—Disaccharides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/14—Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
- C12P7/04—Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
- C12P7/06—Ethanol, i.e. non-beverage
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/21—Serine endopeptidases (3.4.21)
- C12Y304/21062—Subtilisin (3.4.21.62)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明涉及具有蛋白酶活性的多肽和编码这些多肽的多核苷酸。本发明还涉及包含这些多核苷酸的核酸构建体、载体和宿主细胞,以及产生和使用这些多肽的方法。
Description
序列表的引用
本申请包含处于计算机可读形式的序列表,将其通过引用结合在本文中。
技术领域
本发明涉及具有蛋白酶活性的多肽和编码这些多肽的多核苷酸。本发明还涉及包含这些多核苷酸的核酸构建体、载体和宿主细胞,以及产生和使用这些多肽的方法。
背景技术
发酵产物(例如乙醇)典型地是通过以下过程产生的:首先在干磨或湿磨过程中研磨含淀粉材料,然后使用酶将材料降解成可发酵糖,并且最后使用发酵生物将糖直接或间接转化为所希望的发酵产物。例如通过从其他液体和/或固体分离所希望的发酵产物的蒸馏,从发酵醪(通常被称为“啤酒醪液”)中回收液体发酵产物。剩余部分称为“全酒糟”。例如通过离心将全酒糟脱水并且分离为固相和液相。固相称为“湿饼”(或“湿谷物”),并且液相(上清液)称为“酒糟水”。湿饼和酒糟水分别含有约35%和7%固体。将脱水的湿饼干燥以提供“干酒糟(Distillers Dried Grains)”(DDG),用作动物饲料中的营养素。典型地蒸发酒糟水,以提供冷凝物和浆料或可替代地可以作为“反流(backset)”被直接再循环至浆料槽。冷凝物可以在排放前送往甲烷转化器或者可以被再循环至浆料槽。可以将浆体共混入DDG或在干燥前添加至湿饼中,以产生DDGS(含可溶物的干酒糟)。
WO 2012/088303(诺维信公司(Novozymes))披露了在从4.5至5.0范围的pH,在从80℃至90℃范围的温度,使用在pH 4.5、85℃、0.12mM CaCl2下具有至少10的T1/2(min)的α-淀粉酶和具有超过20%的热稳定性值(确定为在80℃/70℃的相对活性)的蛋白酶的组合,通过液化含淀粉材料随后进行糖化和发酵来生产发酵产物的方法。
WO 2013/082486(诺维信公司)披露了在从高于5.0-7.0之间范围的pH,在高于初始糊化温度的温度,使用α-淀粉酶、具有超过20%的热稳定性值(确定为在80℃/70℃的相对活性)的蛋白酶、以及任选地产碳水化合物源的酶,通过液化含淀粉材料随后进行糖化和发酵来生产发酵产物的方法。使用来自强烈火球菌(Pyrococcus furiosus)(PfuS)的蛋白酶示例该方法。
WO 2014/209800(诺维信公司)披露了在高于初始糊化温度的温度,使用α-淀粉酶和高剂量的PfuS蛋白酶,通过液化含淀粉材料来生产发酵产物的方法。
越来越多的乙醇厂从作为副产物的酒糟水和/或浆体中提取油,供生物柴油生产或其他生物可再生产品中使用。发酵产物生产过程中的油回收/提取的大量工作已经集中在改善酒糟水中油的可提取性。油的有效去除通常通过己烷提取来完成。然而,己烷提取的使用由于需要高资金投入而未见广泛应用。因此,已经开发了改善从发酵产物生产过程进行油提取的其他方法。
WO 2011/126897(诺维信公司)披露了通过以下来回收油的方法:用α-淀粉酶将含淀粉材料转化为糊精;用产碳水化合物源的酶进行糖化以形成糖;使用发酵生物来发酵这些糖;其中发酵培养基包含半纤维素酶;蒸馏发酵产物,以形成全酒糟;将全酒糟分离为酒糟水和湿饼;并且从酒糟水回收油。发酵培养基可以进一步包含蛋白酶。
WO 2016/196202披露了来自高温球菌属(Thermococcus)的S8蛋白酶用于乙醇方法中。
本发明的一个目的是提供用于增加可从发酵产物生产过程中回收的油的量的改善方法,并且提供用于由含淀粉材料生产发酵产物(例如乙醇)的方法,这些方法相比于常规方法可以提供较高的发酵产物产量或其他优点。
发明内容
本发明涉及具有蛋白酶活性的多肽,该多肽选自由以下组成的组:
(a)多肽,该多肽与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%序列同一性;
(b)多肽,该多肽由以下多核苷酸编码,该多核苷酸在非常高严格条件下与(i)SEQID NO:1的成熟多肽编码序列,(ii)(i)或(ii)的全长互补序列杂交;
(c)多肽,该多肽由以下多核苷酸编码,该多核苷酸与SEQ ID NO:1的成熟多肽编码序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性;和
(d)(a)、(b)或(c)的多肽的片段,该片段具有蛋白酶活性。
本发明还涉及编码本发明的多肽的多核苷酸;包含这些多核苷酸的核酸构建体、重组表达载体、重组宿主细胞;以及产生这些多肽的方法。
本发明进一步涉及用于液化含淀粉材料的方法,该方法包括在至少α-淀粉酶和S8A减硫高温球菌(Thermococcus thioreducens)蛋白酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料。在另外的方面,本发明涉及由含淀粉材料生产发酵产物的方法,该方法包括以下步骤:a)在至少以下酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料:α-淀粉酶;和减硫高温球菌S8A蛋白酶;b)使用葡糖淀粉酶进行糖化;c)使用发酵生物进行发酵。
本发明进一步涉及从发酵产物生产中回收油的方法,该方法包括以下步骤:a)在至少以下酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料:α-淀粉酶;和本发明的减硫高温球菌S8A蛋白酶;b)使用葡糖淀粉酶进行糖化;c)使用发酵生物进行发酵;d)回收该发酵产物以形成全酒糟;e)将该全酒糟分离为酒糟水和湿饼;f)任选地将该酒糟水浓缩为浆体;其中从以下各项中回收油:在该方法的步骤a)之后液化的含淀粉材料;和/或该方法的发酵步骤c)的下游。
本发明进一步涉及酶组合物,该酶组合物包含本发明的减硫高温球菌S8A蛋白酶。
在仍另外的方面,本发明涉及减硫高温球菌S8A蛋白酶在液化含淀粉材料中的用途。
定义
S8A蛋白酶:术语“S8A蛋白酶”意指属于亚家族A的S8蛋白酶。枯草杆菌蛋白酶(EC3.4.21.62)是亚家族S8A中的一个亚组,然而,来自减硫高温球菌的本发明S8A蛋白酶是一种枯草杆菌蛋白酶样蛋白酶,其尚未被包括在IUBMB分类系统中。根据本发明的S8A蛋白酶水解底物Suc-Ala-Ala-Pro-Phe-pNA。对硝基苯胺(pNA)的释放导致在405nm处的吸光度增加,并且与酶活性成比例。
在一个方面,本发明的这些多肽具有SEQ ID NO:2的成熟多肽的至少20%,例如至少40%、至少50%、至少60%、至少70%、至少80%、至少90%、至少95%或至少100%的蛋白酶活性。在一个实施例中,蛋白酶活性可以通过如实例2中披露的动力学Suc-AAPF-pNA测定来确定。
等位基因变体:术语“等位基因变体”意指占据相同染色体基因座的基因的两种或更多种替代形式的任一种。等位基因变异通过突变而自然产生,并且可以导致群体内部的多态性。基因突变可以是沉默的(所编码的多肽无变化)或可以编码具有改变的氨基酸序列的多肽。多肽的等位基因变体是由基因的等位基因变体编码的多肽。
催化结构域:术语“催化结构域”意指酶的含有该酶的催化机构的区域。
cDNA:术语“cDNA”意指可以通过从获得自真核或原核细胞的成熟的、剪接的mRNA分子进行反转录而制备的DNA分子。cDNA缺乏可以存在于对应基因组DNA中的内含子序列。初始的初级RNA转录物是mRNA的前体,其在呈现为成熟的剪接的mRNA之前要通过一系列的步骤(包括剪接)进行加工。
编码序列:术语“编码序列”意指直接指定多肽的氨基酸序列的多核苷酸。编码序列的边界通常由可读框确定,该可读框以起始密码子(如ATG、GTG或TTG)开始并且以终止密码子(如TAA、TAG或TGA)结束。编码序列可为基因组DNA、cDNA、合成DNA或其组合。
控制序列:术语“控制序列”意指表达编码本发明的成熟多肽的多核苷酸所必需的核酸序列。每个控制序列对于编码该多肽的多核苷酸来说可以是天然的(即,来自相同基因)或外源的(即,来自不同基因),或相对于彼此是天然的或外源的。此类控制序列包括但不限于前导序列、多聚腺苷酸化序列、前肽序列、启动子、信号肽序列、和转录终止子。最少,控制序列包括启动子、以及转录和翻译终止信号。出于引入有利于将控制序列与编码多肽的多核苷酸的编码区连接的特异性限制性酶切位点的目的,这些控制序列可以提供有多个接头。
表达:术语“表达”包括涉及多肽产生的任何步骤,包括但不限于:转录、转录后修饰、翻译、翻译后修饰、以及分泌。
表达载体:术语“表达载体”意指直链或环状DNA分子,其包含编码多肽的多核苷酸并且可操作地连接至提供用于其表达的控制序列。
片段:术语“片段”意指具有从成熟多肽或结构域的氨基和/或羧基端缺失的一个或多个(例如,若干个)氨基酸的多肽;其中该片段具有蛋白酶活性。在一个方面,一个片段包含至少320个氨基酸残基(例如,SEQ ID NO:2的氨基酸102至422)。
宿主细胞:术语“宿主细胞”意指易于用包含本发明的多核苷酸的核酸构建体或表达载体进行转化、转染、转导等的任何细胞类型。术语“宿主细胞”涵盖由于复制期间出现的突变而与亲本细胞不完全相同的任何亲本细胞子代。
分离的:术语“分离的”意指处于自然界中不存在的形式或环境中的一种物质。分离的物质的非限制性实例包括(1)任何非天然存在的物质,(2)包括但不限于任何酶、变体、核酸、蛋白质、肽或辅因子的任何物质,该物质至少部分地从与其性质相关的一种或多种或所有天然存在的成分中去除;(3)相对于自然界中发现的物质通过人工修饰的任何物质;或(4)通过相对于与其天然相关的其他组分增加物质的量而修饰的任何物质(例如,宿主细胞中的重组产生;编码该物质的基因的多个拷贝;以及使用比与编码该物质的基因天然相关的启动子更强的启动子)。分离的物质可以存在于发酵液样品中;例如宿主细胞可以经遗传修饰以表达本发明的多肽。来自该宿主细胞的发酵液将包含分离的多肽。
成熟多肽:术语“成熟多肽”意指在翻译和任何翻译后修饰如N-末端加工、C-末端截短、糖基化作用、磷酸化作用等之后处于其最终形式的多肽。在一个方面,该成熟多肽是SEQ ID NO:2的氨基酸102至422。SEQ ID NO:2的氨基酸1至25是信号肽。氨基酸26至101是前肽。
在本领域中已知的是,宿主细胞可以产生由相同多核苷酸表达的两种或多种不同的成熟多肽(即,具有不同的C-末端和/或N-末端氨基酸)的混合物。在本领域中还已知的是,不同的宿主细胞以不同的方式加工多肽,且因此表达多核苷酸的一种宿主细胞当与表达相同多核苷酸的另一种宿主细胞相比时可产生不同的成熟多肽(例如,具有不同的C-末端和/或N-末端氨基酸)。如实例部分所示,通过纯化蛋白酶的MS-EDMAN数据确认N-末端。
成熟多肽编码序列:术语“成熟多肽编码序列”意指编码具有蛋白酶活性的成熟多肽的多核苷酸。在一个方面,成熟多肽编码序列是SEQ ID NO:1的核苷酸304至1266。
核酸构建体:术语“核酸构建体”意指单-链或双链的核酸分子,该核酸分子是从天然存在的基因中分离的,或以本来不存在于自然界中的方式被修饰成包含核酸的区段,或是合成的,该核酸分子包含一个或多个控制序列。
可操作地连接:术语“可操作地连接”意指如下的构型,在该构型中,控制序列被放置在相对于多核苷酸的编码序列适当的位置处,使得该控制序列引导该编码序列的表达。
序列同一性:两个氨基酸序列之间或两个核苷酸序列之间的关联度通过参数“序列同一性”来描述。
出于本发明的目的,使用如在EMBOSS软件包(EMBOSS:欧洲分子生物学开放软件包(The European Molecular Biology Open Software Suite),Rice等人,2000,TrendsGenet.[遗传学趋势]16:276-277)(优选版本5.0.0或更新版本)的尼德尔(Needle)程序中所实施的尼德尔-翁施(Needleman-Wunsch)算法(Needleman和Wunsch,1970,J.Mol.Biol.[分子生物学杂志]48:443-453)确定两个氨基酸序列之间的序列同一性。使用的参数是空位开放罚分10、空位延伸罚分0.5、以及EBLOSUM62(BLOSUM62的EMBOSS版本)取代矩阵。使用尼德尔(Needle)标记的“最长同一性”的输出(使用-nobrief选项获得)作为同一性百分比并且如下计算:
(同一的残基x 100)/(比对长度-比对中的空位总数)
出于本发明的目的,使用如在EMBOSS软件包(EMBOSS:欧洲分子生物学开放软件包(EMBOSS:The European Molecular Biology Open Software Suite),Rice等人,2000,同上)(优选版本5.0.0或更新版本)的尼德尔程序中所实施的尼德尔-翁施算法(Needleman和Wunsch,1970,同上)确定两个脱氧核糖核苷酸序列之间的序列同一性。使用的参数是空位开放罚分10,空位延伸罚分0.5,以及EDNAFULL(NCBI NUC4.4的EMBOSS版本)取代矩阵。使用尼德尔标记的“最长同一性”的输出(使用-nobrief选项获得)作为同一性百分比并且如下计算:
(同一的脱氧核糖核苷酸x 100)/(比对长度-比对中的空位总数)
严格条件:术语“非常低严格条件”意指对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃在5X SSPE、0.3%SDS、200微克/ml剪切并变性的鲑鱼精子DNA和25%甲酰胺中预杂交和杂交12至24小时。最后在45℃使用2X SSC、0.2%SDS将运载体材料洗涤三次,每次15分钟。
术语“低严格条件”意指对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃在5X SSPE、0.3%SDS、200微克/ml剪切并变性的鲑鱼精子DNA和25%甲酰胺中预杂交和杂交12至24小时。最后在50℃使用2X SSC、0.2%SDS将运载体材料洗涤三次,每次15分钟。
术语“中严格条件”意指对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃在5X SSPE、0.3%SDS、200微克/ml剪切并变性的鲑鱼精子DNA和35%甲酰胺中预杂交和杂交12至24小时。最后在55℃使用2X SSC、0.2%SDS将运载体材料洗涤三次,每次15分钟。
术语“中-高严格条件”意指对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃在5X SSPE、0.3%SDS、200微克/ml剪切并变性的鲑鱼精子DNA和35%甲酰胺中预杂交和杂交12至24小时。最后在60℃使用2X SSC、0.2%SDS将运载体材料洗涤三次,每次15分钟。
术语“高严格条件”意指对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃在5X SSPE、0.3%SDS、200微克/ml剪切并变性的鲑鱼精子DNA和50%甲酰胺中预杂交和杂交12至24小时。最后在65℃使用2X SSC、0.2%SDS将运载体材料洗涤三次,每次15分钟。
术语“非常高严格条件”意指对于长度为至少100个核苷酸的探针而言,遵循标准DNA印迹程序,在42℃在5X SSPE、0.3%SDS、200微克/ml剪切并变性的鲑鱼精子DNA和50%甲酰胺中预杂交和杂交12至24小时。最后在70℃使用2X SSC、0.2%SDS将运载体材料洗涤三次,每次15分钟。
子序列:术语“子序列”意指具有从成熟多肽编码序列的5'端和/或3'端缺失的一个或多个(例如,若干个)核苷酸的多核苷酸;其中该子序列编码具有蛋白酶活性的片段。
变体:术语“变体”意指具有蛋白酶活性的、在一个或多个(例如,若干个)位置处包含改变(即,取代、插入和/或缺失)的多肽。取代意指用不同的氨基酸替代占用某一位置的氨基酸;缺失意指去除占用某一位置的氨基酸;并且插入意指在邻接并且紧随占用某一位置的氨基酸之后添加氨基酸。在描述变体中,以下所述的命名法适于方便参考。采用了已接受的IUPAC单字母或三字母的氨基酸缩写。
取代对于氨基酸取代,使用以下命名法:原始氨基酸、位置、被取代的氨基酸。因此,将在位置226处的苏氨酸被丙氨酸取代表示为“Thr226Ala”或“T226A”。多个突变由加号(“+”)分开,例如“Gly205Arg+Ser411Phe”或“G205R+S411F”代表在位置205和位置411处的甘氨酸(G)和丝氨酸(S)分别被精氨酸(R)和苯丙氨酸(F)取代。
缺失对于氨基酸缺失,使用以下命名法:原有氨基酸、位置、*。因此,将在位置195处的甘氨酸的缺失表示为“Gly195*”或“G195*”。多个缺失由加号(“+”)分开,例如,“Gly195*+Ser411*”或“G195*+S411*”。
插入对于氨基酸插入,使用以下命名法:原始氨基酸、位置、原始氨基酸、插入的氨基酸。因此,将在位置195处的甘氨酸之后插入赖氨酸表示为“Gly195GlyLys”或“G195GK”。将多个氨基酸的插入表示为[原始氨基酸、位置、原始氨基酸、插入的氨基酸#1、插入的氨基酸#2;等]。例如,将在位置195处的甘氨酸之后插入赖氨酸和丙氨酸表示为“Gly195GlyLysAla”或“G195GKA”。
多种改变。包含多种改变的变体由加号(“+”)分开,例如,“Arg170Tyr+Gly195Glu”或“R170Y+G195E”代表在位置170和位置195处的精氨酸和甘氨酸分别被酪氨酸和谷氨酸取代。
不同改变。在可以于一个位置处引入不同改变的情况下,不同的改变由逗号分开,例如,“Arg170Tyr,Glu”代表用酪氨酸或谷氨酸取代在位置170处的精氨酸。因此,“Tyr167Gly,Ala+Arg170Gly,Ala”表示以下变体:
“Tyr167Gly+Arg170Gly”、“Tyr167Gly+Arg170Ala”、“Tyr167Ala+Arg170Gly”、和“Tyr167Ala+Arg170Ala”。
具体实施方式
具有蛋白酶活性的多肽
在一个实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性的多肽,该多肽具有蛋白酶活性。在一个方面,这些多肽与SEQ ID NO:2的成熟多肽相差多达10个(例如1个、2个、3个、4个、5个、6个、7个、8个、9个、或10个)氨基酸。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少75%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少80%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少85%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少90%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少95%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少96%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少97%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少98%的蛋白酶活性。
在一个具体的实施例中,本发明涉及与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%的序列同一性的多肽,并且其中该多肽具有SEQ ID NO:2的成熟多肽的至少99%的蛋白酶活性。
SEQ ID NO:1的多核苷酸或其子序列,以及SEQ ID NO:2的多肽或其片段可根据本领域熟知的方法用于设计核酸探针以鉴定和克隆来自不同属或种的株系的、编码具有蛋白酶活性的多肽的DNA。特别地,可以遵循标准DNA印迹程序,使用此类探针来与感兴趣的细胞的基因组DNA或cDNA杂交,以便鉴定和分离其中的相应基因。此类探针可明显短于完整序列,但是长度应为至少15,如至少25、至少35、或至少70个核苷酸。优选地,该核酸探针长度为至少100个核苷酸,例如长度为至少200个核苷酸、至少300个核苷酸、至少400个核苷酸、至少500个核苷酸、至少600个核苷酸、至少700个核苷酸、至少800个核苷酸、或至少900个核苷酸。DNA和RNA探针两者都可使用。典型地将探针进行标记(例如,用32P、3H、35S、生物素、或抗生物素蛋白),用于检测相应的基因。此类探针涵盖于本发明中。
可以筛选从此类其他菌株制备的基因组DNA或cDNA文库的与上述探针杂交并编码具有蛋白酶活性的多肽的DNA。来自此类其他菌株的基因组DNA或其他DNA可通过琼脂糖或聚丙烯酰胺凝胶电泳或其他分离技术来分离。可将来自文库的DNA或分离的DNA转移到并固定在硝化纤维素或其他合适的运载体材料上。为了鉴定与SEQ ID NO:1或其子序列杂交的克隆或DNA,在DNA印迹中使用运载体材料。
出于本发明的目的,杂交表示多核苷酸与对应于以下的标记的核酸探针杂交:(i)SEQ ID NO:1;(ii)SEQ ID NO:1的成熟多肽编码序列;(iii)其全长互补序列;或(iv)其子序列;杂交是在非常低的至非常高的严格条件下进行。可以使用例如X-射线胶片或本领域已知的任何其他检测手段来检测在这些条件下核酸探针杂交的分子。
在一个方面,核酸探针是SEQ ID NO:1的核苷酸1至1266。在另一个方面,核酸探针是编码以下的多核苷酸:SEQ ID NO:2的多肽;其成熟多肽;或其片段。在另一个方面,核酸探针是SEQ ID NO:1。
在另一个实施例中,本发明涉及具有蛋白酶活性的多肽,该多肽由以下多核苷酸编码,该多核苷酸与SEQ ID NO:1的成熟多肽编码序列具有至少80%,如至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性。在另外的实施例中,该多肽已经被分离。
在另一个实施例中,本发明涉及在一个或多个(例如,若干个)位置处包含取代、缺失、和/或插入的SEQ ID NO:2的成熟多肽的变体。在一个实施例中,引入SEQ ID NO:2的成熟多肽中的氨基酸取代、缺失和/或插入的数目多达10个,例如1、2、3、4、5、6、7、8、9或10个。这些氨基酸改变可以具有微小性质,即,不会显著地影响蛋白质的折叠和/或活性的保守氨基酸取代或插入;典型地为1-30个氨基酸的小缺失;小的氨基-末端或羧基末端延伸,例如氨基末端的甲硫氨酸残基;多达20-25个残基的小接头肽;或小的延伸,其通过改变净电荷或另一官能(例如聚组氨酸段、抗原表位或结合结构域)来促进纯化。
保守取代的实例是在下组之内:碱性氨基酸(精氨酸、赖氨酸及组氨酸)、酸性氨基酸(谷氨酸和天冬氨酸)、极性氨基酸(谷氨酰胺和天冬酰胺)、疏水性氨基酸(亮氨酸、异亮氨酸及缬氨酸)、芳香族氨基酸(苯丙氨酸、色氨酸及酪氨酸)及小氨基酸(甘氨酸、丙氨酸、丝氨酸、苏氨酸及甲硫氨酸)。通常不会改变比活性的氨基酸取代是本领域已知的并且例如由H.Neurath和R.L.Hill,1979,于The Proteins[蛋白质],Academic Press[学术出版社],纽约中描述。常见取代为Ala/Ser、Val/Ile、Asp/Glu、Thr/Ser、Ala/Gly、Ala/Thr、Ser/Asn、Ala/Val、Ser/Gly、Tyr/Phe、Ala/Pro、Lys/Arg、Asp/Asn、Leu/Ile、Leu/Val、Ala/Glu和Asp/Gly。
可以根据本领域中已知的程序,例如定点诱变或丙氨酸扫描诱变(Cunningham和Wells,1989,Science[科学]244:1081-1085)来鉴定多肽中的必需氨基酸。在后一项技术中,在该分子中的每个残基处引入单个丙氨酸突变,并且测试所得分子的蛋白酶活性以鉴定对于该分子的活性关键的氨基酸残基。还参见,Hilton等人,1996,J.Biol.Chem.[生物化学杂志]271:4699-4708。酶或其他生物学相互作用的活性部位还可通过对结构的物理分析来确定,如由下述技术确定:核磁共振、晶体学(crystallography)、电子衍射、或光亲和标记,连同对推定的接触位点(contract site)氨基酸进行突变。参见,例如,de Vos等人,1992,Science[科学]255:306-312;Smith等人,1992,J.Mol.Biol.[分子生物学杂志]224:899-904;Wlodaver等人,1992,FEBS Lett.[欧洲生化学会联合会快报]309:59-64。还可以从与相关多肽的比对来推断必需氨基酸的身份。
使用已知的诱变、重组和/或改组方法、随后进行相关的筛选程序可以做出单或多氨基酸取代、缺失和/或插入并对其进行测试,这些相关的筛选程序例如由Reidhaar-Olson和Sauer,1988,Science[科学]241:53-57;Bowie和Sauer,1989,Proc.Natl.Acad.Sci.USA[美国国家科学院院刊]86:2152-2156;WO 95/17413;或WO 95/22625中披露的那些。其他可以使用的方法包括易错PCR、噬菌体展示(例如Lowman等人,1991,Biochemistry[生物化学]30:10832-10837;美国专利号5,223,409;WO 92/06204)以及区域定向诱变(Derbyshire等人,1986,Gene[基因]46:145;Ner等人,1988,DNA 7:127)。
诱变/改组方法可以与高通量、自动化的筛选方法组合以检测由宿主细胞表达的克隆的、诱变的多肽的活性(Ness等人,1999,Nature Biotechnology[自然生物技术]17:893-896)。可从宿主细胞回收编码活性多肽的诱变的DNA分子,并使用本领域的标准方法快速测序。这些方法允许快速确定多肽中各个氨基酸残基的重要性。
该多肽可以是杂合多肽,其中一种多肽的区域在另一种多肽的区域的N-末端或C-末端处融合。
该多肽可以是融合多肽或可切割的融合多肽,其中另一个多肽在本发明多肽的N-末端或C-末端处融合。通过将编码另一种多肽的多核苷酸融合于本发明的多核苷酸来产生融合多肽。用于产生融合多肽的技术是本领域已知的,且包括连接编码多肽的编码序列使得它们符合读框,而且融合多肽的表达处于一个或多个相同的启动子和终止子的控制之下。还可以使用内含肽技术构建融合多肽,其中在翻译后产生融合多肽(Cooper等人,1993,EMBO J.[欧洲分子生物学学会杂志]12:2575-2583;Dawson等人,1994,Science[科学]266:776-779)。
融合多肽可以进一步包含两种多肽之间的切割位点。在融合蛋白分泌之时,该位点被切割,从而释放出这两种多肽。切割位点的实例包括但不限于在以下文献中披露的位点:Martin等人,2003,J.Ind.Microbiol.Biotechnol.[工业微生物生物技术杂志]3:568-576;Svetina等人,2000,J.Biotechnol.[生物技术杂志]76:245-251;Rasmussen-Wilson等人,1997,Appl.Environ.Microbiol.[应用与环境微生物学]63:3488-3493;Ward等人,1995,Biotechnology[生物技术]13:498-503;和Contreras等人,1991,Biotechnology[生物技术]9:378-381;Eaton等人,1986,Biochemistry[生物化学]25:505-512;Collins-Racie等人,1995,Biotechnology[生物技术]13:982-987;Carter等人,1989,Proteins[蛋白质];Structure,Function,and Genetics[结构、功能以及遗传学]6:240-248;以及Stevens,2003,Drug Discovery World[世界药物发现]4:35-48。具有蛋白酶活性的多肽的来源
可以从高温球菌属的微生物获得本发明的具有蛋白酶活性的多肽。
在另一个方面,该多肽是减硫高温球菌多肽。
这些物种的菌株可容易地在许多培养物保藏中心为公众所获得,如美国典型培养物保藏中心(American Type Culture Collection,ATCC)、德国微生物和细胞培养物保藏中心(Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH,DSMZ)、荷兰菌种保藏中心(Centraalbureau Voor Schimmelcultures,CBS)以及美国农业研究服务专利培养物保藏中心北方地区研究中心(Agricultural Research Service Patent CultureCollection,Northern Regional Research Center,NRRL)。
可以使用以上提到的探针从其他来源,包括从自然界(例如,土壤、堆肥、水等)分离的微生物或直接从天然材料(例如,土壤、堆肥、水等)获得的DNA样品鉴定和获得该多肽。用于从天然生境中直接分离微生物和DNA的技术是本领域熟知的。然后可以通过类似地筛选另一微生物的基因组DNA或cDNA文库或混合的DNA样品来获得编码该多肽的多核苷酸。一旦已经用一种或多种探针检测到编码多肽的多核苷酸,则可以通过利用本领域普通技术人员已知的技术(参见例如,Sambrook等人,1989,同上)分离或克隆多核苷酸。
多核苷酸
本发明还涉及编码本发明的多肽的多核苷酸,如本文中所述。在一个实施例中,编码本发明的多肽的多核苷酸已经被分离。
用于分离或克隆多核苷酸的技术是本领域已知的且包括从基因组DNA或cDNA或其组合进行分离。可以例如通过使用熟知的聚合酶链式反应(PCR)或表达文库的抗体筛选来检测具有共有结构特征的克隆DNA片段,实现从基因组DNA克隆多核苷酸。参见例如,Innis等人,1990,PCR:AGuide to Methods and Application[PCR:方法和应用指南],AcademicPress[学术出版社],纽约。可以使用其他核酸扩增程序如连接酶链式反应(LCR)、连接激活转录(LAT)和基于多核苷酸的扩增(NASBA)。这些多核苷酸可以克隆自高温球菌尤其是减硫高温球菌的菌株或相关生物,并且因此,例如可以是该多核苷酸多肽编码区的等位基因变体或物种变体。
核酸构建体
本发明还涉及核酸构建体,其包含可操作地连接至一个或多个控制序列的本发明的多核苷酸,在与控制序列相容的条件下,该一个或多个控制序列指导该编码序列在合适的宿主细胞中的表达。
在一个具体的实施例中,至少一种控制序列与编码本发明的变体的多核苷酸是异源的。因此,该核酸构建体在自然界中是不可见的。
可用许多方式操作多核苷酸以提供多肽的表达。取决于表达载体,在多核苷酸插入载体之前对其进行操作可以是希望的或必需的。用于利用重组DNA方法修饰多核苷酸的技术是本领域熟知的。
该控制序列可为启动子,即,被宿主细胞识别用于表达编码本发明的多肽的多核苷酸的多核苷酸。该启动子包含转录控制序列,其介导该多肽的表达。该启动子可以是在宿主细胞中显示出转录活性的任何多核苷酸,包括变体、截短型及杂合型启动子,并且可以从编码与该宿主细胞同源或异源的细胞外或细胞内多肽的基因获得。
用于在细菌宿主细胞中指导本发明核酸构建体的转录的合适的启动子的实例是从以下基因中获得的启动子:解淀粉芽孢杆菌α-淀粉酶基因(amyQ)、地衣芽孢杆菌α-淀粉酶基因(amyL)、地衣芽孢杆菌青霉素酶基因(penP)、嗜热脂肪芽孢杆菌产麦芽糖淀粉酶基因(amyM)、枯草芽孢杆菌果聚糖蔗糖酶基因(sacB)、枯草芽孢杆菌xylA和xylB基因、苏云金芽孢杆菌cryIIIA基因(Agaisse和Lereclus,1994,Molecular Microbiology[分子微生物学]13:97-107)、大肠杆菌lac操纵子、大肠杆菌trc启动子(Egon等人,1988,基因69:301-315)、天蓝链霉菌琼脂水解酶基因(dagA)和原核β-内酰胺酶基因(Villa-Kamaroff等人,1978,Natl.Acad.Sci.USA[美国国家科学院院刊]75:3727-3731)以及tac启动子(DeBoer等人,1983,Natl.Acad.Sci.USA[美国国家科学院院刊]80:21-25)。其他启动子描述于Gilbert等人,1980,Scientific American[科学美国人]242:74-94的“Useful proteinsfrom recombinant bacteria[来自重组细菌的有用蛋白质]”;和在Sambrook等人,1989,同上。串联启动子的实例披露于WO 99/43835中。
控制序列也可为由宿主细胞识别以终止转录的转录终止子。该终止子可操作地连接到编码该多肽的多核苷酸的3'-末端。在宿主细胞中有功能的任何终止子可用于本发明中。
细菌宿主细胞的优选终止子从针对以下的基因获得:克劳氏芽孢杆菌碱性蛋白酶(aprH)、地衣芽孢杆菌α-淀粉酶(amyL)、和大肠杆菌核糖体RNA(rrnB)。
控制序列还可以是在启动子下游并且在基因编码序列上游的mRNA稳定子区域,它增加该基因的表达。
合适的mRNA稳定子区的实例从以下获得:苏云金芽孢杆菌cryIIIA基因(WO 94/25612)和枯草芽孢杆菌SP82基因(Hue等人,1995,Journal of Bacteriology[细菌学杂志]177:3465-3471)。
控制序列也可以是前导序列,即对宿主细胞翻译很重要的mRNA的非翻译区域。该前导序列可操作地连接至编码该多肽的多核苷酸的5’-末端。可以使用在宿主细胞中有功能的任何前导序列。
控制序列也可为编码与多肽的N-末端连接的信号肽并指导多肽进入细胞的分泌途径的信号肽编码区。多核苷酸的编码序列的5’-端可固有地包含在翻译阅读框中与编码多肽的编码序列的区段天然地连接的信号肽编码序列。可替代地,编码序列的5'-端可包含对于编码序列为外源的信号肽编码序列。在编码序列天然地不包含信号肽编码序列的情况下,可能需要外源信号肽编码序列。可替代地,外源信号肽编码序列可以单纯地替代天然信号肽编码序列以便增强多肽的分泌。然而,可以使用指导已表达多肽进入宿主细胞的分泌途径的任何信号肽编码序列。
用于细菌宿主细胞的有效信号肽编码序列是从芽孢杆菌NCIB 11837生麦芽糖淀粉酶、地衣芽孢杆菌枯草杆菌蛋白酶、地衣芽孢杆菌β-内酰胺酶、嗜热脂肪芽孢杆菌α-淀粉酶、嗜热脂肪芽孢杆菌中性蛋白酶(nprT、nprS、nprM)和枯草芽孢杆菌prsA的基因获得的信号肽编码序列。另外的信号肽由Simonen和Palva,1993,Microbiological Reviews[微生物评论]57:109-137描述。
控制序列也可以是编码位于多肽N-末端处的前肽的前肽编码序列。所得的多肽被称为前体酶(proenzyme)或多肽原(或在一些情况下被称为酶原(zymogen))。多肽原通常是无活性的并且可通过催化切割或自身催化切割来自多肽原的前肽而转化为活性多肽。前肽编码序列可以从以下的基因获得:枯草芽孢杆菌碱性蛋白酶(aprE)、枯草芽孢杆菌中性蛋白酶(nprT)、嗜热毁丝霉漆酶(WO 95/33836)、米黑根毛霉天冬氨酸蛋白酶和酿酒酵母α-因子。
在信号肽序列和前肽序列两者都存在的情况下,该前肽序列位于紧邻多肽的N-末端且该信号肽序列位于紧邻该前肽序列的N-末端。
表达载体
本发明还涉及包含本发明的多核苷酸、启动子、以及转录和翻译终止信号的重组表达载体。多个核苷酸和控制序列可连接在一起以产生重组表达载体,其可包括一个或多个便利的限制位点以允许编码该多肽的多核苷酸在此类位点处的插入或取代。在一个具体的实施例中,至少一种控制序列与本发明的多核苷酸是异源的。可替代地,可以通过将多核苷酸或包含该多核苷酸的核酸构建体插入用于表达的适当载体中而表达该多核苷酸。在产生该表达载体时,该编码序列位于该载体中,这样使得该编码序列与该用于表达的适当控制序列可操作地连接。
重组表达载体可以是可以方便地经受重组DNA程序并且可以引起多核苷酸表达的任何载体(例如,质粒或病毒)。载体的选择将典型地取决于载体与待引入载体的宿主细胞的相容性。载体可以是线性或闭合环状质粒。
载体可以是自主复制载体,即作为染色体外实体存在的载体,其复制独立于染色体复制,例如质粒、染色体外元件、微染色体或人工染色体。载体可以包含用于确保自我复制的任何手段。可替代地,载体可以是这样的载体,当它引入宿主细胞中时整合入基因组中并与其中已整合了它的一个或多个染色体一起复制。此外,可以使用单独的载体或质粒或两个或更多个载体或质粒,其共同包含待引入宿主细胞基因组的总DNA,或可以使用转座子。
载体优选地包含允许方便地选择转化细胞、转染细胞、转导细胞等细胞的一个或多个选择性标志。选择性标志是一种基因,其产物提供了杀生物剂抗性或病毒抗性、对重金属抗性、对营养缺陷型的原养型等。
细菌选择性标志的实例是地衣芽孢杆菌或枯草芽孢杆菌dal基因、或赋予抗生素抗性(如氨苄青霉素、氯霉素、卡那霉素、新霉素、大观霉素、或四环素抗性)的标志。
选择性标志可以是如WO 2010/039889中所述的双选择性标志系统。在一个方面,双选择性标志是hph-tk双选择性标志系统。
载体优选地包含允许载体整合到宿主细胞的基因组中或载体在细胞中独立于基因组自主复制的一个或多个元件。
对于整合到该宿主细胞基因组中,该载体可以依靠编码该多肽的多核苷酸序列或用于通过同源或非同源重组整合到该基因组中的该载体的任何其他元件。可替代地,该载体可包含用于指导通过同源重组而整合入宿主细胞基因组中的染色体中的精确位置处的另外的多核苷酸。为了增加在精确位置处整合的可能性,整合元件应当包含足够数目的核酸,如100至10,000个碱基对、400至10,000个碱基对和800至10,000个碱基对,该核酸与对应的靶序列具有高度序列同一性以增强同源重组的概率。整合元件可以是与宿主细胞基因组内的靶序列同源的任何序列。此外,整合元件可以是非编码或编码的多核苷酸。另一方面,载体可以通过非同源重组整合入宿主细胞的基因组中。
为了自主复制,载体还可以另外包含复制起点,该复制起点使得载体在讨论中的宿主细胞中自主复制成为可能。复制起点可以是在细胞中发挥作用的介导自主复制的任何质粒复制子。术语“复制起点”或“质粒复制子”意指使质粒或载体能够在体内复制的多核苷酸。
细菌复制起点的实例是允许在大肠杆菌中复制的质粒pBR322、pUC19、pACYC177、和pACYC184的复制起点,以及允许在芽孢杆菌属中复制的质粒pUB110、pE194、pTA1060、和pAMβ1的复制起点。
可将本发明多核苷酸的多于一个拷贝插入宿主细胞以增加多肽的产生。通过将序列的至少一个另外的拷贝整合到宿主细胞基因组中或者通过包括与该多核苷酸一起的可扩增的选择性标志基因可以获得多核苷酸的增加的拷贝数目,其中通过在适当的选择性试剂的存在下培养细胞可以选择含有选择性标志基因的经扩增的拷贝以及由此该多核苷酸的另外的拷贝的细胞。
用于连接以上所述的元件以构建本发明的重组表达载体的程序是本领域的普通技术人员熟知的(参见例如,Sambrook等人,1989,见上文)。宿主细胞
本发明还涉及重组宿主细胞,这些宿主细胞包含可操作地连接至一个或多个控制序列的本发明的多核苷酸,该一个或多个控制序列指导本发明的多肽的产生。在一个实施例中,该一种或多种控制序列与本发明的多核苷酸是异源的。将包含多核苷酸的构建体或载体引入宿主细胞中,这样使得该构建体或载体作为染色体整合体或作为自主复制的染色体外载体维持,如较早前所述。术语“宿主细胞”涵盖由于复制期间出现的突变而与亲本细胞不完全相同的任何亲本细胞子代。宿主细胞的选择将在很大程度上取决于编码该多肽的基因及其来源。
该宿主细胞可以是在本发明的多肽的重组产生中有用的任何细胞,例如原核细胞或真核细胞。
原核宿主细胞可以是任何革兰氏阳性细菌。革兰氏阳性细菌包括但不限于:芽孢杆菌属、梭菌属、肠球菌属、土芽孢杆菌属、乳杆菌属、乳球菌属、大洋芽孢杆菌属(Oceanobacillus)、葡萄球菌属、链球菌属、以及链霉菌属。革兰氏阴性细菌包括但不限于弯曲杆菌属、大肠杆菌、黄杆菌属、梭杆菌属、螺杆菌属、泥杆菌属、奈瑟氏菌属、假单孢菌属、沙门氏菌属和脲原体属。
细菌宿主细胞可以是任何芽孢杆菌属细胞,包括但不限于嗜碱芽孢杆菌、解淀粉芽孢杆菌、短芽孢杆菌、环状芽孢杆菌、克劳氏芽孢杆菌、凝结芽孢杆菌、坚硬芽孢杆菌、灿烂芽孢杆菌、迟缓芽孢杆菌、地衣芽孢杆菌、巨大芽孢杆菌、短小芽孢杆菌、嗜热脂肪芽孢杆菌、枯草芽孢杆菌以及苏云金芽孢杆菌细胞。
将DNA引入芽孢杆菌属细胞中可以通过以下来实现:原生质体转化(参见,例如,Chang和Cohen,1979,Mol.Gen.Genet.[分子遗传学与基因组学]168:111-115)、感受态细胞转化(参见例如,Young和Spizizen,1961,J.Bacteriol.[细菌学杂志]81:823-829;或Dubnau和Davidoff-Abelson,1971,J.Mol.Biol.[分子生物学杂志]56:209-221)、电穿孔(参见例如,Shigekawa和Dower,1988,Biotechniques[生物技术]6:742-751)或接合(参见例如,Koehler和Thorne,1987,J.Bacteriol.[细菌学杂志]169:5271-5278)。将DNA引入大肠杆菌细胞中可以通过以下来实现:原生质体转化(参见例如,Hanahan,1983,J.Mol.Biol.[分子生物学杂志]166:557-580)或电穿孔(参见例如,Dower等人,1988,Nucleic AcidsRes.[核酸研究]16:6127-6145)。将DNA引入链霉菌属细胞中可以通过以下来实现:原生质体转化、电穿孔(参见例如,Gong等人,2004,Folia Microbiol.(Praha)[叶线形微生物学(布拉格)]49:399-405)、接合(参见例如,Mazodier等人,1989,J.Bacteriol.[细菌学杂志]171:3583-3585)、或转导(参见例如,Burke等人,2001,Proc.Natl.Acad.Sci.USA[美国国家科学院院刊]98:6289-6294)。将DNA引入假单孢菌属细胞中可以通过以下来实现:电穿孔(参见例如,Choi等人,2006,J.Microbiol.Methods[微生物学方法杂志]64:391-397)或接合(参见例如,Pinedo和Smets,2005,Appl.Environ.Microbiol.[应用与环境微生物学]71:51-57)。将DNA引入链球菌属细胞中可以通过以下来实现:天然感受态(naturalcompetence)(参见例如,Perry和Kuramitsu,1981,Infect.Immun.[感染与免疫]32:1295-1297)、原生质体转化(参见,例如,Catt和Jollick,1991,Microbios[微生物学]68:189-207)、电穿孔(参见,例如,Buckley等人,1999,Appl.Environ.Microbiol.[应用与环境微生物学]65:3800-3804)、或接合(参见,例如,Clewell,1981,Microbiol.Rev.[微生物学评论]45:409-436)。然而,可以使用本领域中已知的将DNA引入宿主细胞中的任何方法。
产生方法
本发明还涉及产生本发明的多肽的方法,这些方法包括(a)在有益于产生该多肽的条件下培养细胞,该细胞以其野生型形式产生该多肽;和任选地(b)回收该多肽。在一个方面,该细胞是减硫高温球菌细胞,特别是DSM14981。
本发明还涉及产生本发明的多肽的方法,这些方法包括(a)在有益于产生该多肽的条件下培养本发明的重组宿主细胞;和任选地(b)回收该多肽。
宿主细胞是在适合使用本领域已知的方法产生多肽的营养培养基中培养的。例如,可以通过摇瓶培养、或在实验室或工业发酵器中小规模或大规模发酵(包括连续、分批、补料分批或固态发酵)培养细胞,该培养在合适的培养基中并且在允许表达和/或分离多肽的条件下进行。使用本领域中已知的程序,培养发生在包含碳和氮来源及无机盐的合适的营养培养基中。合适的培养基可从商业供应商获得或可以根据公开的组成(例如,在美国典型培养物保藏中心的目录中)制备。如果多肽被分泌到该营养培养基中,那么可以直接从该培养基中回收该多肽。如果多肽不进行分泌,那么其可以从细胞裂解液中进行回收。
可以使用本领域已知的方法来回收多肽。例如,可通过常规方法,包括但不限于,收集、离心、过滤、提取、喷雾干燥、蒸发或沉淀,从营养培养基回收多肽。在一个方面,回收包含多肽的发酵液。
可以通过本领域已知的多种程序纯化多肽以获得基本上纯的多肽,这些程序包括但不限于:色谱法(例如,离子交换、亲和、疏水、色谱聚焦和尺寸排阻)、电泳程序(例如,制备型等电聚焦)、差示溶解度(例如,硫酸铵沉淀)、SDS-PAGE、或提取(参见例如,ProteinPurification[蛋白质纯化],Janson和Ryden编辑,VCH Publishers[VCH出版公司],纽约,1989)。
在一个替代性方面,不回收多肽,而是将表达该多肽的本发明的宿主细胞用作多肽的来源。
发酵液配制品或细胞组合物
本发明还涉及包含本发明的多肽的发酵液配制品或细胞组合物。该发酵液产物进一步包含在发酵过程中使用的另外的成分,例如像细胞(包括含有编码本发明的多肽的基因的宿主细胞,这些宿主细胞用于产生感兴趣的多肽)、细胞碎片、生物质、发酵培养基和/或发酵产物。在一些实施例中,该组合物是含有一种或多种有机酸、杀灭的细胞和/或细胞碎片以及培养基的细胞杀灭的全培养液。
如本文使用的术语“发酵液”是指由细胞发酵产生的、不经历或经历最少的回收和/或纯化的制剂。例如,当微生物培养物在允许蛋白质合成(例如,由宿主细胞表达酶)并且将蛋白质分泌到细胞培养基中的碳限制条件下孵育生长到饱和时,产生发酵液。该发酵液可以含有在发酵结束时得到的发酵材料的未分级的或分级的内容物。典型地,该发酵液是未分级的并且包含用过的培养基以及例如通过离心去除微生物细胞(例如,丝状真菌细胞)之后存在的细胞碎片。在一些实施例中,该发酵液含有用过的细胞培养基、胞外酶以及有活力的和/或无活力的微生物细胞。
在一个实施例中,该发酵液配制品和细胞组合物包含第一有机酸组分(包含至少一种1-5个碳的有机酸和/或其盐)和第二有机酸组分(包含至少一种6个或更多个碳的有机酸和/或其盐)。在一个具体实施例中,该第一有机酸组分是乙酸、甲酸、丙酸、其盐,或前述中的两种或更多种的混合物;并且该第二有机酸组分是苯甲酸、环己烷羧酸、4-甲基戊酸、苯乙酸、其盐,或前述中的两种或更多种的混合物。
在一个方面,该组合物含有一种或多种有机酸,并且任选地进一步含有杀灭的细胞和/或细胞碎片。在一个实施例中,从细胞杀灭的全培养液中去除这些杀灭的细胞和/或细胞碎片,以提供不含这些组分的组合物。
这些发酵液配制品或细胞组合物可以进一步包含防腐剂和/或抗微生物(例如,抑菌)剂,包括但不限于山梨醇、氯化钠、山梨酸钾、以及本领域已知的其他试剂。
该细胞杀灭的全培养液或组合物可以含有在发酵结束时得到的发酵材料的未分级的内容物。典型地,该细胞杀灭的全培养液或组合物包含用过的培养基以及在微生物细胞(例如,丝状真菌细胞)生长至饱和、在碳限制条件下孵育以允许蛋白合成之后存在的细胞碎片。在一些实施例中,该细胞杀灭的全培养液或组合物含有用过的细胞培养基、胞外酶和杀灭的丝状真菌细胞。在一些实施例中,可以使用本领域已知的方法来使细胞杀灭的全培养液或组合物中存在的微生物细胞透性化和/或裂解。
如本文所述的全培养液或细胞组合物通常是液体,但是可以含有不溶性组分,如杀灭的细胞、细胞碎片、培养基组分和/或一种或多种不溶性酶。在一些实施例中,可以去除不溶性组分以提供澄清的液体组合物。
本发明的全培养液配制品和细胞组合物可以通过WO 90/15861或WO2010/096673中所述的方法来产生。
酶组合物
本发明还涉及包含本发明的多肽的组合物。
这些组合物可以包含本发明的蛋白酶作为主要酶组分,例如单组分组合物。可替代地,组合物可以包含多种酶活性,如选自由以下组成的组的一种或多种(例如,若干种)酶:α-淀粉酶、葡糖淀粉酶、β-淀粉酶、支链淀粉酶。
组合物可以根据本领域已知的方法制备并且可以呈液体或干燥组合物的形式。可以根据本领域已知的方法稳定组合物。
下面给出了本发明的组合物的优选的用途的实例。
本发明的酶组合物包含适用于本发明的方法中液化步骤的α-淀粉酶和减硫高温球菌S8A蛋白酶。
在一个具体的实施例中,本发明涉及一种酶组合物,该酶组合物包含:
α-淀粉酶和减硫高温球菌S8A蛋白酶,特别是与SEQ ID NO:2的
成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性的蛋白酶。
在一个优选的实施例中,α-淀粉酶与蛋白酶之间的比率在1:1和1:50之间(微克α-淀粉酶:微克蛋白酶)的范围内,更特别地在1:3和1:40之间,例如约1:4(微克α-淀粉酶:微克蛋白酶)的范围内。
在一个优选的实施例中,本发明的酶组合物包含葡糖淀粉酶,并且α-淀粉酶与葡糖淀粉酶之间的比率在1:1和1:10之间,例如约1:2(微克α-淀粉酶:微克葡糖淀粉酶)。
α-淀粉酶优选地是细菌酸稳定的α-淀粉酶。特别地,α-淀粉酶来自微小杆菌属物种或芽孢杆菌属物种(例如像嗜热脂肪芽孢杆菌或地衣芽孢杆菌)。
在一个实施例中,该α-淀粉酶来自芽孢杆菌属,如嗜热脂肪芽孢杆菌菌株,特别是嗜热脂肪芽孢杆菌α-淀粉酶的变体,如WO 99/019467中的SEQ ID NO:3或本文的SEQ IDNO:4中所示的α-淀粉酶。
在一个实施例中,该嗜热脂肪芽孢杆菌α-淀粉酶或其变体是截短的,优选地截短以具有约491个氨基酸,例如从480-495个氨基酸。
在一个实施例中,嗜热脂肪芽孢杆菌α-淀粉酶在从位置179至位置182的范围内的两个位置处,如在位置I181+G182、R179+G180、G180+I181、R179+I181或G180+G182,优选I181+G182处具有缺失,和任选地N193F取代(使用SEQ ID NO:4进行编号)。
在一个实施例中,该嗜热脂肪芽孢杆菌α-淀粉酶在位置S242处具有取代,优选S242Q取代。
在一个实施例中,该嗜热脂肪芽孢杆菌α-淀粉酶在位置E188处具有取代,优选E188P取代。
在一个实施例中,α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组,该变体除在从位置179至位置182的区域中的双缺失(特别是I181*+G182*)和任选地N193F之外还具有以下突变:
-V59A+Q89R+G112D+E129V+K177L+R179E+K220P+N224L+Q254S; |
-V59A+Q89R+E129V+K177L+R179E+H208Y+K220P+N224L+Q254S; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+D269E+D281N; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+I270L; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+H274K; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+Y276F; |
-V59A+E129V+R157Y+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-V59A+E129V+K177L+R179E+H208Y+K220P+N224L+S242Q+Q254S; |
-59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+H274K; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+Y276F; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+D281N; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+M284T; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+G416V; |
-V59A+E129V+K177L+R179E+K220P+N224L+Q254S; |
-V59A+E129V+K177L+R179E+K220P+N224L+Q254S+M284T; |
-A91L+M96I+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-E129V+K177L+R179E; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+Y276F+L427M; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+M284T; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+N376*+I377*; |
-E129V+K177L+R179E+K220P+N224L+Q254S; |
-E129V+K177L+R179E+K220P+N224L+Q254S+M284T; |
-E129V+K177L+R179E+S242Q; |
-E129V+K177L+R179V+K220P+N224L+S242Q+Q254S; |
-K220P+N224L+S242Q+Q254S; |
-M284V; |
-V59A Q89R+E129V+K177L+R179E+Q254S+M284V。 |
在一个实施例中,该α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组,该变体具有以下突变:
-I181*+G182*+N193F+E129V+K177L+R179E;
-I181*+G182*+N193F+V59A+Q89R+E129V+K177L+R179E+H208Y+K220P+N224L+Q254S;
-I181*+G182*+N193F+V59AQ89R+E129V+K177L+R179E+Q254S+M284V;和
-I181*+G182*+N193F+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S(使用SEQID NO:4进行编号)。
在一个实施例中,该α-淀粉酶变体与SEQ ID NO:4的多肽具有至少75%的同一性,优选至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
在一个优选的实施例中,本发明的酶组合物包含减硫高温球菌S8A蛋白酶,该蛋白酶与SEQ ID NO:2的氨基酸102至422具有至少80%,如至少85%,如至少90%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%、或至少100%的同一性。
在一个实施例中,该酶组合物进一步包含葡糖淀粉酶。
在一个实施例中,该葡糖淀粉酶源自青霉属菌株,尤其是如WO2011/127802中的SEQ ID NO:2披露的草酸青霉菌菌株。
在一个实施例中,该葡糖淀粉酶与WO 2011/127802中的SEQ ID NO:2的成熟多肽具有至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%或100%的同一性。
在一个实施例中,该葡糖淀粉酶是如本文WO 2011/127802中的SEQ ID NO:2披露的草酸青霉菌葡糖淀粉酶的、具有K79V取代的变体,例如WO2013/053801中披露的变体。
在一个实施例中,该葡糖淀粉酶是草酸青霉菌葡糖淀粉酶,该草酸青霉菌葡糖淀粉酶具有K79V取代,并且进一步具有以下取代中的一个:
-P11F+T65A+Q327F
-P2N+P4S+P11F+T65A+Q327F。
在一个实施例中,该组合物进一步包含支链淀粉酶。
在一个实施例中,本发明的组合物包含嗜热脂肪芽孢杆菌α-淀粉酶和减硫高温球菌S8A蛋白酶。在一个实施例中,α-淀粉酶与蛋白酶之间的比率在从1:1和1:50(微克α-淀粉酶:微克蛋白酶)的范围内。
在一个实施例中,α-淀粉酶与蛋白酶之间的比率在1:3和1:40之间,例如约1:4(微克α-淀粉酶:微克蛋白酶)的范围内。
在一个实施例中,α-淀粉酶与葡糖淀粉酶之间的比率在1:1和1:10之间,例如约1:2(微克α-淀粉酶:微克葡糖淀粉酶)。
本发明的方法
本发明涉及从发酵产物生产过程中回收油的方法,以及用于由含淀粉材料生产发酵产物的方法。
诸位发明人已经发现,当在液化中结合α-淀粉酶和来自减硫高温球菌的蛋白酶时,在用于由含淀粉材料生产发酵产物的方法中可以获得增加的乙醇产量。因此,在一个方面,本发明涉及用于液化含淀粉材料的方法,该方法包括在至少α-淀粉酶和本发明的S8A减硫高温球菌蛋白酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料。
还发现本发明的乙醇方法可以在SSF中减少或不添加氮源(例如尿素)来有效的运行。
生产本发明的发酵产物的方法
在一个具体的方面,本发明涉及用于由含淀粉材料生产发酵产物的方法,这些方法包括以下步骤:
a)在至少以下酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料:
-α-淀粉酶;和
-来自减硫高温球菌的S8A蛋白酶;
b)使用葡糖淀粉酶进行糖化;
c)使用发酵生物进行发酵。
在一个实施例中,该发酵产物是在发酵之后回收。在一个优选的实施例中,该发酵产物是在发酵之后回收,例如通过蒸馏。在一个实施例中,该发酵产物是醇,优选乙醇,尤其是燃料乙醇、饮用乙醇和/或工业乙醇。
本发明的回收/提取油的方法
在另一个具体方面,本发明涉及从发酵产物生产过程中回收油的方法,该方法包括以下步骤:
a)在至少以下酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料:
-α-淀粉酶;和
-来自减硫高温球菌的S8A蛋白酶;
b)使用葡糖淀粉酶进行糖化;
c)使用发酵生物进行发酵。
d)回收该发酵产物以形成全酒糟;
e)将该全酒糟分离为酒糟水和湿饼;
f)任选地将该酒糟水浓缩为浆体;
其中从以下各项中回收油:
-在步骤a)之后液化的含淀粉材料;和/或
-发酵步骤c)的下游。
在一个实施例中,在液化含淀粉材料期间和/或之后回收/提取该油。在一个实施例中,从全酒糟中回收该油。在一个实施例中,从酒糟水中回收该油。在一个实施例中,从浆体中回收该油。
在本发明的方法的一个优选的实施例中,糖化和发酵同时进行。
在一个优选的实施例中,在步骤a)-c)中,例如在糖化步骤b)、或发酵步骤c)或同时糖化和发酵(SSF)期间,没有氮化合物(例如尿素)存在和/或没有添加氮化合物(例如尿素)。
在一个实施例中,在步骤a)-c)中,例如在糖化步骤b)或发酵步骤c)或同时糖化和发酵(SSF)期间,存在和/或添加10-1,000ppm,如50-800ppm、如100-600ppm、如200-500ppm氮化合物(优选尿素)。
在一个实施例中,在液化步骤a)中存在和/或添加在0.5至100微克之间的减硫高温球菌S8A蛋白酶/克DS(干固体)。在一个实施例中,在液化步骤a)中存在和/或添加在1至50微克之间的减硫高温球菌S8A蛋白酶/克DS(干固体)。在一个实施例中,在液化步骤a)中存在和/或添加在2至40微克之间的减硫高温球菌S8A蛋白酶/克DS。在一个实施例中,在液化步骤a)中存在和/或添加在4至25微克之间的减硫高温球菌S8A蛋白酶/克DS。在一个实施例中,在液化步骤a)中存在和/或添加在5至20微克之间的减硫高温球菌S8A蛋白酶/克DS。在一个实施例中,在液化步骤a)中存在和/或添加约或大于1微克的减硫高温球菌S8A蛋白酶/克DS。在一个实施例中,在液化步骤a)中存在和/或添加约或大于2微克的减硫高温球菌S8A蛋白酶/克DS。在一个实施例中,在液化步骤a)中存在和/或添加约或大于5微克的减硫高温球菌S8A蛋白酶/克DS。
在液化中存在的和/或添加的α-淀粉酶
在本发明的方法(即,油回收方法和发酵产物生产过程)中,液化步骤a)期间所添加的α-淀粉酶可以是任何α-淀粉酶。
优选地是细菌α-淀粉酶,其在液化过程中使用的温度下典型地是稳定的。
在一个实施例中,α-淀粉酶来自微小杆菌属或芽孢杆菌属的菌株。
在一个优选的实施例中,α-淀粉酶来自嗜热脂肪芽孢杆菌的菌株,如在WO 99/019467中所示的SEQ ID NO:3或在本文的SEQ ID NO:4中所示的序列。在一个实施例中,α-淀粉酶是在本文的SEQ ID NO:4中所示的嗜热脂肪芽孢杆菌α-淀粉酶,如与本文的SEQ IDNO:4具有至少80%,如至少85%、如至少90%、如至少95%、如至少96%、如至少97%、如至少98%、如至少99%的同一性的嗜热脂肪芽孢杆菌α-淀粉酶。
在一个实施例中,嗜热脂肪芽孢杆菌α-淀粉酶或其变体是截短的,优选地在C-末端,优选地截短以具有约491个氨基酸,如从480至495个氨基酸。
在一个实施例中,嗜热脂肪芽孢杆菌α-淀粉酶在从位置179至位置182的范围内的两个位置处,如在位置I181+G182、R179+G180、G180+I181、R179+I181或G180+G182,优选I181+G182处具有缺失,和任选地N193F取代(使用SEQ ID NO:4进行编号)。
在一个实施例中,该嗜热脂肪芽孢杆菌α-淀粉酶在位置S242处具有取代,优选S242Q取代。
在一个实施例中,该嗜热脂肪芽孢杆菌α-淀粉酶在位置E188处具有取代,优选E188P取代。
在一个实施例中,α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组,该变体除在从位置179至位置182的区域中的双缺失(特别是I181*+G182*)和任选地N193F之外还具有以下突变:
-V59A+Q89R+G112D+E129V+K177L+R179E+K220P+N224L+Q254S; |
-V59A+Q89R+E129V+K177L+R179E+H208Y+K220P+N224L+Q254S; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+D269E+D281N; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+I270L; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+H274K; |
-V59A+Q89R+E129V+K177L+R179E+K220P+N224L+Q254S+Y276F; |
-V59A+E129V+R157Y+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-V59A+E129V+K177L+R179E+H208Y+K220P+N224L+S242Q+Q254S; |
-59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+H274K; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+Y276F; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+D281N; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+M284T; |
-V59A+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+G416V; |
-V59A+E129V+K177L+R179E+K220P+N224L+Q254S; |
-V59A+E129V+K177L+R179E+K220P+N224L+Q254S+M284T; |
-A91L+M96I+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-E129V+K177L+R179E; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+Y276F+L427M; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+M284T; |
-E129V+K177L+R179E+K220P+N224L+S242Q+Q254S+N376*+I377*; |
-E129V+K177L+R179E+K220P+N224L+Q254S; |
-E129V+K177L+R179E+K220P+N224L+Q254S+M284T; |
-E129V+K177L+R179E+S242Q; |
-E129V+K177L+R179V+K220P+N224L+S242Q+Q254S; |
-K220P+N224L+S242Q+Q254S; |
-M284V; |
-V59AQ89R+E129V+K177L+R179E+Q254S+M284V。 |
在一个优选的实施例中,α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组:
-I181*+G182*+N193F+E129V+K177L+R179E;
-I181*+G182*+N193F+V59A+Q89R+E129V+K177L+R179E+H208Y+K220P+N224L+Q254S;
-I181*+G182*+N193F+V59AQ89R+E129V+K177L+R179E+Q254S+M284V;和
-I181*+G182*+N193F+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S(使用SEQID NO:4进行编号)。
根据本发明,α-淀粉酶变体与本文的SEQ ID NO:4的多肽具有至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
根据本发明,α-淀粉酶能以0.1至100微克/克DS,如0.5至50微克/克DS、如1至25微克/克DS、如1至10微克/克DS、如2至5微克/克DS的浓度存在和/或添加。
在一个实施例中,在液化中存在和/或添加从1至50微克,特别地是从2至40微克,特别地是从4至25微克,特别地是从5至20微克减硫高温球菌S8A蛋白酶/克DS,并且在液化中存在和/或添加从1至10微克嗜热脂肪芽孢杆菌α-淀粉酶。
在一个实施例中,该减硫高温球菌蛋白酶选自:
a)多肽,该多肽包含SEQ ID NO:2的氨基酸102至422或由其组成;
b)多肽,该多肽与SEQ ID NO:2的氨基酸102至422具有至少80%、至少85、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性。
在液化中存在的和/或添加的葡糖淀粉酶
在一个实施例中,在本发明的方法(即,油回收方法和发酵产物生产过程)中的液化步骤a)中,存在和/或添加葡糖淀粉酶。
在一个优选的实施例中,在液化步骤a)中存在的和/或添加的葡糖淀粉酶源自青霉属菌株,尤其是如WO 2011/127802中的SEQ ID NO:2披露的草酸青霉菌菌株。
在一个实施例中,该葡糖淀粉酶与WO 2011/127802中的SEQ ID NO:2中所示的成熟多肽具有至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%或100%的同一性。
在一个优选的实施例中,该葡糖淀粉酶是WO 2011/127802中的SEQ ID NO:2所示的、具有K79V取代的草酸青霉菌葡糖淀粉酶,如WO 2013/053801中披露的变体。
在一个优选的实施例中,在液化中存在的和/或添加的葡糖淀粉酶是草酸青霉菌葡糖淀粉酶,该草酸青霉菌葡糖淀粉酶具有K79V取代并且优选进一步具有以下取代中的一个:
-P11F+T65A+Q327F;
-P2N+P4S+P11F+T65A+Q327F。
在一个实施例中,葡糖淀粉酶变体与WO 2011/127802中的SEQ ID NO:2的多肽的成熟部分或本文的SEQ ID NO:10具有至少75%的同一性,优选至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
该葡糖淀粉酶能以从0.1至100微克EP/g,如0.5至50微克EP/g,如1至25微克EP/g,如2至12微克EP/g DS的量添加。
在糖化和/或发酵中存在的和/或添加的葡糖淀粉酶
在本发明的方法(即,油回收方法和发酵产物生产过程)中,在糖化和/或发酵,优选同时糖化和发酵(SSF)中,存在和/或添加葡糖淀粉酶。
在一个实施例中,在糖化和/或发酵中存在的和/或添加的葡糖淀粉酶是真菌来源的,优选地来自曲霉属的菌株,优选黑曲霉、泡盛曲霉或米曲霉;或木霉属的菌株,优选里氏木霉;或篮状菌属的菌株,优选埃默森篮状菌;或栓菌属的菌株,优选瓣环栓菌(T.cingulata);或密孔菌属的菌株;或粘褶菌属的菌株,如篱边粘褶菌或密粘褶菌;或黑层孔属(Nigrofomes)的菌株。
在一个实施例中,葡糖淀粉酶是源自篮状菌属,如埃默森篮状菌的菌株,如在本文的SEQ ID NO:5中所示的菌株。
在一个实施例中,该葡糖淀粉酶选自由以下组成的组:
(i)包含本文的SEQ ID NO:5的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与本文的SEQ ID NO:5的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
在一个实施例中,该葡糖淀粉酶是源自栓菌属,如瓣环栓菌的菌株,如在本文的SEQ ID NO:6中所示的菌株。
在一个实施例中,该葡糖淀粉酶选自由以下组成的组:
(i)包含本文的SEQ ID NO:6的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与本文的SEQ ID NO:6的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
在一个实施例中,该葡糖淀粉酶是源自密孔菌属的菌株,特别是描述于WO 2011/066576中的血红密孔菌的菌株(SEQ ID NO 2、4或6),如示WO2011/066576中的SEQ ID NO:4中所示的菌株。
在一个实施例中,葡糖淀粉酶是源自粘褶菌属的菌株,如篱边粘褶菌或密粘褶菌的菌株,特别是如在WO 2011/068803中描述的粘褶菌属的菌株(SEQ ID NO:2、4、6、8、10、12、14或16)。在一个优选的实施例中,该葡糖淀粉酶是在WO 2011/068803中的SEQ ID NO:2或本文的SEQ ID NO:6中所示的篱边粘褶菌。
在一个优选的实施例中,葡糖淀粉酶源自篱边粘褶菌,如在本文的SEQ ID NO:6中所示的篱边粘褶菌。在一个实施例中,该葡糖淀粉酶选自由以下组成的组:
(i)包含本文的SEQ ID NO:6的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与本文的SEQ ID NO:6的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
在另一个实施例中,葡糖淀粉酶源自密粘褶菌,如在本文的SEQ ID NO:7中所示的密粘褶菌。在一个实施例中,该葡糖淀粉酶选自由以下组成的组:
(i)包含本文的SEQ ID NO:7的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与本文的SEQ ID NO:7的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
在一个实施例中,该葡糖淀粉酶源自黑层孔属(Nigrofomes)的菌株,特别是披露于WO 2012/064351中的黑层孔属物种的菌株。
在一个实施例中,该葡糖淀粉酶能以如下量添加到糖化和/或发酵中:0.0001至20AGU/g DS,优选0.001至10AGU/g DS,尤其是在0.01至5AGU/g DS之间,如0.1至2AGU/gDS。
包含葡糖淀粉酶的可商购组合物包括AMG 200L;AMG 300L;SANTMSUPER、SANTMEXTRA L、SPIRIZYMETMPLUS、SPIRIZYMETMFUEL、SPIRIZYMETMB4U、SPIRIZYMETMULTRA、SPIRIZYMETMEXCEL和AMGTME(来自诺维信公司(Novozymes A/S));OPTIDEXTM300、GC480、GC417(来自杜邦公司(DuPont.));AMIGASETM和AMIGASETMPLUS(来自帝斯曼公司(DSM));G-ZYMETMG900、G-ZYMETM和G990ZR(来自杜邦公司)。
根据本发明的一个优选的实施例,在糖化和/或发酵中,葡糖淀粉酶是与α-淀粉酶相结合存在的和/或添加的。下面描述了合适的α-淀粉酶的实例。
在糖化和/或发酵中存在和/或添加的α-淀粉酶
在一个实施例中,在本发明的方法中,在糖化和/或发酵中存在和/或添加α-淀粉酶。在一个优选的实施例中,α-淀粉酶是真菌或细菌起源的。在一个优选的实施例中,α-淀粉酶是真菌酸稳定的α-淀粉酶。真菌酸性稳定的α-淀粉酶是在3.0至7.0的pH范围内,并且优选地在3.5至6.5的pH范围内具有活性的α-淀粉酶,包括在约4.0、4.5、5.0、5.5、和6.0的pH时的活性。
在一个优选的实施例中,在糖化和/或发酵中存在的和/或添加的α-淀粉酶源自根毛霉属的菌株,优选菌株微小根毛霉,如在WO 2013/006756的SEQ ID NO:3中所示的菌株,如具有黑曲霉接头和淀粉结合结构域的微小根毛霉α-淀粉酶杂合体,如在本文的SEQ IDNO:8中所示的杂合体或其变体。
在一个实施例中,在糖化和/或发酵中存在的和/或添加的α-淀粉酶选自由以下组成的组:
(i)包含本文的SEQ ID NO:8的多肽的α-淀粉酶;
(ii)包含以下氨基酸序列的α-淀粉酶,该氨基酸序列与本文的SEQ ID NO:8的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
在一个优选的实施例中,该α-淀粉酶是在SEQ ID NO:8中所示的具有以下取代或取代的组合中的至少一个的α-淀粉酶的变体:D165M;Y141W;Y141R;K136F;K192R;P224A;P224R;S123H+Y141W;G20S+Y141W;A76G+Y141W;G128D+Y141W;G128D+D143N;P219C+Y141W;N142D+D143N;Y141W+K192R;Y141W+D143N;Y141W+N383R;Y141W+P219C+A265C;Y141W+N142D+D143N;Y141W+K192R V410A;G128D+Y141W+D143N;Y141W+D143N+P219C;Y141W+D143N+K192R;G128D+D143N+K192R;Y141W+D143N+K192R+P219C;G128D+Y141W+D143N+K192R;或G128D+Y141W+D143N+K192R+P219C(使用SEQ ID NO:8进行编号)。
在一个实施例中,源自具有黑曲霉葡糖淀粉酶接头和淀粉结合结构域(SBD)的微小根毛霉的α-淀粉酶,优选如本文的SEQ ID NO:8披露的,优选具有一个或多个以下取代:G128D、D143N,优选G128D+D143N(使用SEQ ID NO:8进行编号)。
在一个实施例中,在糖化和/或发酵中存在的和/或添加的α-淀粉酶变体与本文的SEQ ID NO:8的多肽具有至少75%的同一性,优选至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
在一个优选的实施例中,在糖化和/或发酵期间存在的和/或添加的葡糖淀粉酶与α-淀粉酶之间的比率优选地可以在500:1至1:1,如从250:1至1:1、如从100:1至1:1、如从100:2至100:50、如从100:3至100:70的范围内。
在液化和/或糖化和/或发酵中存在的和/或添加的支链淀粉酶。
在液化步骤a)和/或糖化步骤b)或发酵步骤c)或同时糖化和发酵期间可以存在和/或添加支链淀粉酶。
支链淀粉酶(E.C.3.2.1.41,支链淀粉6-葡聚糖-水解酶)是去分支酶,其特征在于它们在例如分枝淀粉和支链淀粉中水解α-1,6-糖苷键的能力。
根据本发明考虑的支链淀粉酶包括来自美国专利号4,560,651(特此通过引用结合)中披露的淀粉脱支芽孢杆菌(Bacillus amyloderamificans)的支链淀粉酶、如WO 01/51620(特此通过引用结合)中的SEQ ID NO:2披露的支链淀粉酶、如WO 01/151620(特此通过引用结合)中的SEQ ID NO:4披露的脱支芽孢杆菌(Bacillus deramificans)的支链淀粉酶,以及来自如WO01/51620中的SEQ ID NO:6披露的嗜酸性支链淀粉芽孢杆菌的支链淀粉酶,以及还有描述于FEMS Mic.Let.[FEMS微生物学通讯](1994)115,97-106中的支链淀粉酶。
根据本发明,该支链淀粉酶能以有效量添加,包括约0.0001至10mg酶蛋白/克DS的优选量,优选0.0001至0.10mg酶蛋白/克DS,更优选0.0001至0.010mg酶蛋白/克DS。支链淀粉酶活性可以确定为NPUN。用于确定NPUN的测定法描述于下文的“材料与方法”-部分中。
合适的可商购支链淀粉酶产品包括PROMOZYME D、PROMOZYMETMD2(诺维信公司,丹麦)、OPTIMAX L-300(杰能科公司(Genencor Int.),美国)、以及AMANO 8(安能满公司(Amano),日本)。
本发明的方法的另外方面
在液化步骤a)之前,本发明的方法(包括提取/回收油的方法和用于生产发酵产物的方法)可以包括以下步骤:
i)优选地通过干磨来减小该含淀粉材料的粒度;
ii)形成包含该含淀粉材料和水的浆料。
在一个实施例中,至少50%,优选至少70%,更优选至少80%,尤其是至少90%的含淀粉材料适合通过具有#6筛网的筛子。
在一个实施例中,液化期间的pH是在高于4.5至6.5之间,如4.5至5.0、如约4.8,或在5.0至6.2之间的pH,如5.0至6.0,如在5.0至5.5之间,如约5.2、如约5.4、如约5.6、如约5.8。
在一个实施例中,液化期间的温度高于初始糊化温度,优选在从70℃至100℃的范围内,如在75℃至95℃之间、如在75℃至90℃之间,优选在80℃至90℃之间,尤其是约85℃。
在一个实施例中,在步骤a)中的液化之前进行喷射蒸煮步骤。在一个实施例中,喷射蒸煮是在110℃至145℃之间,优选120℃至140℃,如125℃至135℃,优选约130℃的温度进行约1至15分钟,优选约3至10分钟,尤其是约5分钟。
在一个优选的实施例中,糖化和发酵顺序进行或同时进行。
在一个实施例中,糖化是在从20℃至75℃,优选从40℃至70℃,如约60℃的温度,并且在4和5之间的pH进行。
在一个实施例中,发酵、或同时糖化和发酵(SSF)在从25℃至40℃,如从28℃至35℃、如从30℃至34℃,优选约32℃的温度进行。在一个实施例中,发酵进行6至120小时,特别是24至96小时。
在一个优选的实施例中,该发酵产物是在发酵之后回收,例如通过蒸馏。
在一个实施例中,该发酵产物是醇,优选乙醇,尤其是燃料乙醇、饮用乙醇和/或工业乙醇。
在一个实施例中,含淀粉起始材料是全谷物。在一个实施例中,含淀粉材料选自玉米、小麦、大麦、黑麦、蜀黍、西米、木薯、树薯、木薯淀粉、高粱、稻以及马铃薯的组。
在一个实施例中,发酵生物是酵母,优选酵母属的菌株,尤其是酿酒酵母的菌株。
在一个实施例中,步骤(a)中的温度高于初始糊化温度,如在80℃至90℃之间的温度,如约85℃。
在一个实施例中,本发明的方法在糖化步骤b)之前进一步包括预糖化步骤,该预糖化步骤在30℃至65℃之间的温度进行40至90分钟。在一个实施例中,糖化是在从20℃至75℃,优选从40℃至70℃,如约60℃的温度,并且在4和5之间的pH进行。在一个实施例中,发酵步骤c)或同时糖化和发酵(SSF)(即,步骤b)和c))在从25℃至40℃,如从28℃至35℃、如从30℃至34℃,优选约32℃的温度进行。在一个实施例中,发酵步骤c)或同时糖化和发酵(SSF)(即,步骤b)和c))进行6至120小时,特别是24至96小时。
在一个实施例中,步骤e)中的分离是通过离心(优选倾滗式离心机)、过滤,优选使用压滤机、螺旋压榨机、板框压榨机、重力浓缩机或脱水机进行。
在一个实施例中,发酵产物是通过蒸馏回收。
发酵培养基
进行发酵的环境通常称为“发酵培养基(fermentation media或fermentationmedium)”。发酵培养基包括发酵底物,即被发酵生物代谢的碳水化合物源。根据本发明,发酵培养基可以包含用于一种或多种发酵生物的一种或多种营养素和生长刺激剂。营养素和生长刺激剂在发酵领域中广泛使用,且包括氮源如氨;尿素、维生素和矿物质或其组合。
发酵生物
术语“发酵生物”是指适用于发酵过程中并且能够生产所希望的发酵产物的任何生物,包括细菌和真菌生物,尤其是酵母。尤其合适的发酵生物能够使糖(如葡萄糖或麦芽糖)直接或间接发酵成(即,转化成)所希望的发酵产物(如乙醇)。发酵生物的实例包括真菌生物,如酵母。优选的酵母包括酵母属物种的菌株,特别是酿酒酵母。
在发酵(如SSF)期间,有活力的发酵生物的合适的浓度在本领域是熟知的,或可以由本领域的技术人员容易地确定。在一个实施例中,将发酵生物(如乙醇发酵酵母(例如,酿酒酵母))添加至发酵培养基中,使得每mL的发酵培养基的有活力的发酵生物(如酵母)计数在从105至1012,优选从107至1010的范围内,尤其是约5x107个。
可商购的酵母的实例包括例如RED STARTM和ETHANOL REDTM酵母(可获得自富酶泰斯公司/乐斯福公司(Fermentis/Lesaffre),美国)、FALI(可获得自弗莱希曼酵母公司(Fleischmann’s Yeast),美国)、SUPERSTART和THERMOSACCTM新鲜酵母(可获得自乙醇技术公司(Ethanol Technology),威斯康星州,美国)、BIOFERM AFT和XR(可获得自NABC-北美生物制品公司(North American Bioproducts Corporation),佐治亚州,美国)、GERT STRAND(可获得自格特·斯特兰德公司(Gert Strand AB),瑞典)以及FERMIOL(可获得自帝斯曼特殊产品公司(DSM Specialties))。
含淀粉材料
根据本发明可以使用任何合适的含淀粉材料。通常基于所希望的发酵产物来选择起始材料。适用于本发明的方法中的含淀粉材料的实例包括全谷物、玉米、小麦、大麦、黑麦、蜀黍、西米、木薯、木薯淀粉、高粱、稻、豌豆、豆类或甜薯或其混合物或者由其衍生的淀粉,或谷类。还考虑了糯型(waxy type)与非糯型(non-waxy type)的玉米与大麦。在一个优选的实施例中,用于根据本发明的乙醇生产的含淀粉材料是玉米或小麦。
发酵产物
术语“发酵产物”意指通过包括使用发酵生物进行的发酵步骤在内的方法生产的产物。根据本发明考虑的发酵产物包括醇(例如,乙醇、甲醇、丁醇;多元醇如甘油、山梨醇和肌醇);有机酸(例如,柠檬酸、乙酸、衣康酸、乳酸、琥珀酸、葡糖酸);酮(例如,丙酮);氨基酸(例如,谷氨酸);气体(例如,H2和CO2);抗生素(例如,青霉素和四环素);酶;维生素(例如,核黄素、B12、β-胡萝卜素);和激素。在一个优选的实施例中,发酵产物是乙醇,例如,燃料乙醇;饮用乙醇,即中性饮用酒精;或用于消费性醇工业(例如,啤酒和酒)、乳品工业(例如,发酵的乳制品)、皮革工业和烟草工业的工业乙醇或产物。优选的啤酒类型包含爱尔啤酒(ale)、烈性黑啤酒(stout)、波特啤酒(porter)、拉格啤酒(lager)、苦啤酒(bitter)、麦芽酒(maltliquor)、低麦芽啤酒(happoushu)、高醇啤酒、低醇啤酒、低热量啤酒或清淡啤酒。优选地,本发明的方法是用于生产醇,如乙醇。根据本发明获得的发酵产物(如乙醇)可以用作典型地与汽油共混的燃料。然而,在乙醇的情况下,它还可以用作饮用乙醇。
发酵产物的回收
发酵或SSF之后,可以将发酵产物与发酵培养基分开。可以蒸馏浆料以提取所希望的发酵产物(例如,乙醇)。可替代地,可以通过微滤或膜过滤技术从发酵培养基中提取所希望的发酵产物。还可以通过汽提或本领域熟知的其他方法来回收发酵产物。
油的回收
根据本发明,在液化期间和/或之后,从该全酒糟中,从该酒糟水中,或从该浆体中回收油。可以通过提取回收油。在一个实施例中,通过己烷提取回收油。还可以使用本领域熟知的其他油回收技术。
在以下编号的实施例中进一步定义本发明:
1.一种多肽,该多肽具有蛋白酶活性,该多肽选自由以下组成的组:
(a)多肽,该多肽与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%序列同一性;
(b)多肽,该多肽由以下多核苷酸编码,该多核苷酸在非常高严格条件下与(i)SEQID NO:1的成熟多肽编码序列,(ii)(i)或(ii)的全长互补序列杂交;
(c)多肽,该多肽由以下多核苷酸编码,该多核苷酸与SEQ ID NO:1的成熟多肽编码序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性;和
(d)(a)、(b)或(c)的多肽的片段,该片段具有蛋白酶活性。
2.如实施例1所述的多肽,该多肽与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%序列同一性。
3.如实施例1或2所述的多肽,该多肽由以下多核苷酸编码,该多核苷酸在非常高严格条件下与(i)SEQ ID NO:1的成熟多肽编码序列,或(ii)(i)的全长互补序列杂交。
4.如实施例1-3中任一项所述的多肽,该多肽由以下多核苷酸编码,该多核苷酸与SEQ ID NO:1的成熟多肽编码序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%序列同一性。
5.如实施例1-4中任一项所述的多肽,该多肽包含SEQ ID NO:2或SEQ ID NO:2的成熟多肽或由其组成。
6.如实施例5所述的多肽,其中该成熟多肽是SEQ ID NO:2的氨基酸102至422。
7.如实施例1-6中任一项所述的多肽,该多肽是SEQ ID NO:2的成熟多肽的变体,该变体在一个或多个(若干)位置处包括取代、缺失、和/或插入。
8.如实施例1所述的多肽,该多肽是SEQ ID NO:2的片段,其中该片段具有蛋白酶活性。
9.一种多核苷酸,该多核苷酸编码如实施例1-8中任一项所述的多肽。
10.一种核酸构建体或重组表达载体,该核酸构建体或重组表达载体包含如实施例9所述的多核苷酸,该多核苷酸可操作地连接至指导该多肽在表达宿主中的产生的一个或多个异源控制序列。
11.一种重组宿主细胞,该重组宿主细胞包含如实施例9所述的多核苷酸,该多核苷酸可操作地连接至指导该多肽的产生的一个或多个异源控制序列。
12.一种组合物,该组合物包含如实施例1-8中任一项所述的多肽。
13.一种产生如实施例1-8中任一项所述的多肽的方法,该方法包括:
(a)在有益于产生该多肽的条件下培养细胞,该细胞以其野生型形式产生该多肽;和
(b)任选地回收该多肽。
14.一种产生具有蛋白酶活性的多肽的方法,该方法包括:
(a)在有益于产生该多肽的条件下培养如实施例11所述的宿主细胞;和(b)任选地回收该多肽。
15.一种用于液化含淀粉材料的方法,该方法包括在至少α-淀粉酶和如实施例1-8中任一项所述的S8A减硫高温球菌蛋白酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料。
16.一种用于由含淀粉材料生产发酵产物的方法,该方法包括以下步骤:
a)在至少以下酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料:
-α-淀粉酶;和
-S8A减硫高温球菌蛋白酶;
b)使用葡糖淀粉酶进行糖化;
c)使用发酵生物进行发酵。
17.一种从如实施例16所披露的方法中回收油的方法,该方法进一步包括如下步骤:
d)回收该发酵产物以形成全酒糟;
e)将该全酒糟分离为酒糟水和湿饼;
f)任选地将该酒糟水浓缩为浆体;
其中从以下各项中回收油:
-在如实施例15所披露的方法的步骤a)之后液化的含淀粉材料;和/或
-如实施例15所披露的方法的发酵步骤c)的下游。
18.如实施例17所述的方法,其中在液化含淀粉材料期间和/或之后回收油。
19.如实施例18所述的方法,其中从该全酒糟中回收油。
20.如实施例17中任一项所述的方法,其中从该酒糟水中回收油。
21.如实施例17所述的方法,其中从该浆体中回收油。
22.如实施例16-21中任一项所述的方法,其中糖化和发酵同时进行。
23.如实施例16-22中任一项所述的方法,其中在步骤a)-c)中,例如在糖化步骤b)、发酵步骤c)或同时糖化和发酵(SSF)期间,没有氮化合物存在和/或没有添加氮化合物。
24.如实施例16-22中任一项所述的方法,其中在步骤a)-c)中,例如在糖化步骤b)或发酵步骤c)、或同时糖化和发酵(SSF)中存在和/或添加10至1,000ppm,如50至800ppm、如100至600ppm、如200-500ppm的氮化合物,优选尿素。
25.如实施例15-24中任一项所述的方法,其中在步骤a)中的该α-淀粉酶来自芽孢杆菌属,例如嗜热脂肪芽孢杆菌菌株,特别是嗜热脂肪芽孢杆菌α-淀粉酶的变体,如SEQ IDNO:4中所示的α-淀粉酶。
26.如实施例25所述的方法,其中该嗜热脂肪芽孢杆菌α-淀粉酶或其变体是截短的,优选地截短以具有约491个氨基酸,如从480至495个氨基酸。
27.如实施例25或26中任一项所述的方法,其中该嗜热脂肪芽孢杆菌α-淀粉酶在从位置179至位置182的范围内的两个位置处,如在位置I181+G182、R179+G180、G180+I181、R179+I181或G180+G182,优选I181+G182处具有缺失,和任选地N193F取代,(使用SEQ IDNO:4进行编号)。
28.如实施例25-27中任一项所述的方法,其中该嗜热脂肪芽孢杆菌α-淀粉酶在位置S242处具有取代,优选S242Q取代。
29.如实施例25-28中任一项所述的方法,其中该嗜热脂肪芽孢杆菌α-淀粉酶在位置E188处具有取代,优选E188P取代。
30.如实施例25-29中任一项所述的方法,其中该α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组,这些变体除I181*+G182*以及任选地N193F之外具有以下突变:
31.如实施例25-30中任一项所述的方法,其中该α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组:
-I181*+G182*+N193F+E129V+K177L+R179E;
-I181*+G182*+N193F+V59A+Q89R+E129V+K177L+R179E+H208Y+K220P+N224L+Q254S;
-I181*+G182*+N193F+V59AQ89R+E129V+K177L+R179E+Q254S+M284V;和
-I181*+G182*+N193F+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S(使用SEQID NO:4进行编号)。
32.如实施例25-31中任一项所述的方法,其中该α-淀粉酶变体与SEQ ID NO:4的多肽具有至少75%的同一性,优选至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
33.如实施例25-32中任一项所述的方法,其中该α-淀粉酶以0.1至100微克/克DS,如0.5至50微克/克DS、如1至25微克/克DS、如1至10微克/克DS、如2至5微克/克DS的浓度存在和/或添加。
34.如实施例15-33中任一项所述的方法,其中在液化中存在和/或添加从1至50微克、特别地是从2至40微克、特别地是从4-25微克、特别地是从5至20微克减硫高温球菌S8A蛋白酶/克DS。
35.如实施例15-34中任一项所述的方法,其中该减硫高温球菌蛋白酶选自:
a)多肽,该多肽包含SEQ ID NO:2的氨基酸102至422或由其组成;
b)多肽,该多肽与SEQ ID NO:2的氨基酸102至422具有至少80%、至少85、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性。
36.如实施例16-35中任一项所述的方法,进一步地,其中在糖化步骤b)和/或发酵步骤c)中存在的和/或添加的该葡糖淀粉酶是真菌起源的,优选地来自曲霉属的菌株,优选黑曲霉、泡盛曲霉或米曲霉;或木霉属的菌株,优选里氏木霉;或篮状菌属的菌株,优选埃默森篮状菌;或栓菌属的菌株,优选瓣环栓菌;或密孔菌属的菌株;或粘褶菌属的菌株,如篱边粘褶菌或密粘褶菌;或黑层孔属的菌株。
37.如实施例36所述的方法,其中该葡糖淀粉酶源自埃默森篮状菌,如在本文的SEQ ID NO:5中所示的埃默森篮状菌。
38.如实施例37所述的方法,其中该葡糖淀粉酶选自由以下组成的组:
(i)包含SEQ ID NO:5的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与SEQ ID NO:5的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
39.如实施例36所述的方法,其中该葡糖淀粉酶源自篱边粘褶菌,如在SEQ ID NO:6中所示的篱边粘褶菌。
40.如实施例39所述的方法,其中该葡糖淀粉酶选自由以下组成的组:
(i)包含SEQ ID NO:6的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与SEQ ID NO:6的多肽具有至少60%、至少70%,例如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
41.如实施例36所述的方法,其中该葡糖淀粉酶源自密粘褶菌,如在SEQ ID NO:7中所示的密粘褶菌。
42.如实施例41所述的方法,其中该葡糖淀粉酶选自由以下组成的组:(i)包含SEQID NO:7的多肽的葡糖淀粉酶;
(ii)包含以下氨基酸序列的葡糖淀粉酶,该氨基酸序列与SEQ ID NO:7的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
43.如实施例16-42中任一项所述的方法,其中该葡糖淀粉酶是在糖化和/或发酵过程中与α-淀粉酶结合存在的。
44.如实施例43所述的方法,其中在糖化和/或发酵中存在的该α-淀粉酶是真菌或细菌起源的。
45.如实施例43或44所述的方法,其中在糖化和/或发酵中存在的和/或添加的该α-淀粉酶源自根毛霉属的菌株,优选菌株微小根毛霉,如具有黑曲霉接头和淀粉结合结构域的微小根毛霉α-淀粉酶杂合体,如在SEQ ID NO:8中所示的杂合体。
46.如实施例45所述的方法,其中在糖化和/或发酵中存在的该α-淀粉酶选自由以下组成的组:
(i)包含SEQ ID NO:8的多肽的α-淀粉酶;
(ii)包含以下氨基酸序列的α-淀粉酶,该氨基酸序列与SEQ ID NO:8的多肽具有至少60%、至少70%,如至少75%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性。
47.如实施例44-46中任一项所述的方法,其中该α-淀粉酶源自具有黑曲霉葡糖淀粉酶接头和淀粉结合域(SBD)的微小根毛霉,优选如SEQ ID NO:8披露的,优选具有以下取代中的一个或多个:G128D、D143N,优选G128D+D143N(使用SEQ ID NO:8进行编号)。
48.如实施例16-47中任一项所述的方法,该方法在液化步骤a)之前进一步包括以下步骤:
i)优选地通过干磨来减小该含淀粉材料的粒度;
ii)形成包含该含淀粉材料和水的浆料。
49.如实施例16-48中任一项所述的方法,其中至少50%,优选至少70%,更优选至少80%,尤其是至少90%的含淀粉材料适合通过具有#6筛网的筛子。
50.如实施例15-49中任一项所述的方法,其中液化中的pH在高于4.5至6.5之间,如约4.8,或在5.0至6.2之间的pH,如5.0至6.0,如在5.0至5.5之间,如约5.2、如约5.4、如约5.6、如约5.8。
51.如实施例51-50中任一项所述的方法,其中液化中的温度高于初始糊化温度,如在从70℃至100℃的范围内,如在75℃至95℃之间、如在75℃至90℃之间,优选在80℃至90℃之间,尤其是约85℃。
52.如实施例15-51中任一项所述的方法,其中在步骤a)中的液化之前进行喷射蒸煮步骤。
53.如实施例52所述的方法,其中该喷射蒸煮是在110℃至145℃之间,优选120℃至140℃,如125℃至135℃,优选约130℃的温度进行约1至15分钟,优选约3至10分钟,尤其是约5分钟。
54.如实施例16-53中任一项所述的方法,其中糖化是在从20℃至75℃,优选从40℃至70℃,如约60℃的温度,并且在4和5之间的pH下进行。
55.如实施例16-54中任一项所述的方法,其中发酵或同时糖化和发酵(SSF)在从25℃至40℃,如从28℃至35℃、如从30℃至34℃,优选约32℃的温度进行。
56.如实施例16-55中任一项所述的方法,其中该发酵产物是在发酵之后例如通过蒸馏回收。
57.如实施例16-56中任一项所述的方法,其中该发酵产物是醇,优选乙醇,尤其是燃料乙醇、饮用乙醇和/或工业乙醇。
58.如实施例16-57中任一项所述的方法,其中该含淀粉起始材料是全谷物。
59.如实施例16-58中任一项所述的方法,其中该含淀粉材料源自玉米、小麦、大麦、黑麦、蜀黍、西米、木薯、树薯、木薯淀粉、高粱、稻或马铃薯。
60.如实施例16-59中任一项所述的方法,其中该发酵生物是酵母,优选酵母属的菌株,尤其是酿酒酵母的菌株。
61.如实施例15-60中任一项所述的方法,其中在液化中α-淀粉酶与蛋白酶之间的比率在1:1和1:50(微克α-淀粉酶:微克蛋白酶)之间的范围内,例如在1:3和1:40之间,例如约1:4(微克α-淀粉酶:微克蛋白酶)。
62.一种酶组合物,该酶组合物包含α-淀粉酶和减硫高温球菌S8A蛋白酶,优选地如实施例1-8所述的多肽。
63.如实施例62所述的酶组合物,其中α-淀粉酶与蛋白酶之间的比率在1:1和1:50(微克α-淀粉酶:微克蛋白酶)之间的范围内,例如在1:3和1:40之间,例如约1:4(微克α-淀粉酶:微克蛋白酶)。
64.如实施例62-64中任一项所述的酶组合物,其中该酶组合物包含葡糖淀粉酶,并且α-淀粉酶与葡糖淀粉酶之间的比率在1:1和1:10之间,例如约1:2(微克α-淀粉酶:微克葡糖淀粉酶)。
65.如实施例62-64中任一项所述的酶组合物,其中该α-淀粉酶是细菌α-淀粉酶,特别地是源自芽孢杆菌或微小杆菌物种,例如像地衣芽孢杆菌或嗜热脂肪芽孢杆菌的细菌α-淀粉酶。
66.如实施例62-65中任一项所述的酶组合物,其中该α-淀粉酶来自嗜热脂肪芽孢杆菌的菌株,特别是嗜热脂肪芽孢杆菌α-淀粉酶的变体,如在SEQ ID NO:4中所示的α-淀粉酶。
67.如实施例62-66中任一项所述的酶组合物,其中该嗜热脂肪芽孢杆菌α-淀粉酶或其变体是截短的,优选地截短以具有约491个氨基酸,如从480-495个氨基酸。
68.如实施例62-67中任一项所述的酶组合物,其中该嗜热脂肪芽孢杆菌α-淀粉酶在从位置179至位置182的范围内的两个位置处,如在位置I181+G182、R179+G180、G180+I181、R179+I181或G180+G182,优选I181+G182处具有缺失,和任选地N193F取代,(使用SEQID NO:4进行编号)。
69.如实施例62-68中任一项所述的酶组合物,其中该嗜热脂肪芽孢杆菌α-淀粉酶在位置S242处具有取代,优选S242Q取代。
70.如实施例62-69中任一项所述的酶组合物,其中该嗜热脂肪芽孢杆菌α-淀粉酶在位置E188处具有取代,优选E188P取代。
71.如实施例62-70中任一项所述的酶组合物,其中该α-淀粉酶选自嗜热脂肪芽孢杆菌α-淀粉酶变体的组,这些变体除缺失I181*+G182*以及任选地N193F之外具有以下突变:
72.如实施例62-71中任一项所述的酶组合物,其中该α-淀粉酶选自具有以下突变的嗜热脂肪芽孢杆菌α-淀粉酶变体的组:
-I181*+G182*+N193F+E129V+K177L+R179E;
-I181*+G182*+N193F+V59A+Q89R+E129V+K177L+R179E+H208Y+K220P+N224L+Q254S;
-I181*+G182*+N193F+V59A Q89R+E129V+K177L+R179E+Q254S+M284V;和
-I181*+G182*+N193F+E129V+K177L+R179E+K220P+N224L+S242Q+Q254S(使用SEQID NO:4进行编号)。
73.如实施例62-72中任一项所述的酶组合物,其中该α-淀粉酶变体与SEQ ID NO:4的多肽具有至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
74.如实施例62-73中任一项所述的酶组合物,其中该减硫高温球菌S8A蛋白酶与SEQ ID NO:2的氨基酸102至422具有至少80%,如至少85%,如至少90%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%的同一性。
75.如实施例60所述的方法,其中该酵母细胞表达葡糖淀粉酶,例如如实施例36-42所述的葡糖淀粉酶。
76.如实施例15-60所述的方法,其中在液化中存在或添加葡糖淀粉酶。
77.如实施例76所述的方法,其中在液化中存在的和/或添加的葡糖淀粉酶是草酸青霉菌葡糖淀粉酶,该草酸青霉菌葡糖淀粉酶具有K79V取代(使用SEQ ID NO:10进行编号),并且进一步具有以下取代的组合中的一个:
-P11F+T65A+Q327F;或
-P2N+P4S+P11F+T65A+Q327F(使用SEQ ID NO:10进行编号),并且其中该葡糖淀粉酶与SEQ ID NO:10的多肽具有至少75%的同一性,优选至少80%,更优选至少85%,更优选至少90%,更优选至少91%,更优选至少92%,甚至更优选至少93%,最优选至少94%,并且甚至最优选至少95%,如甚至至少96%、至少97%、至少98%、至少99%,但小于100%的同一性。
78.减硫高温球菌S8A蛋白酶在液化含淀粉材料中的用途。
79.如实施例78所述的用途,其中该减硫高温球菌S8A蛋白酶与SEQ ID NO:2的氨基酸102至422具有至少80%,如至少85%,如至少90%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%的同一性。
本发明进一步通过以下实例进行描述。
实例
在实例中使用的酶和酵母:
α-淀粉酶BE369(AA369):本文中披露为SEQ ID NO:4并且其进一步具有以下突变的嗜热脂肪芽孢杆菌α-淀粉酶:I181*+G182*+N193F+V59A+Q89R+E129V+K177L+R179E+Q254S+M284V,截短至491个氨基酸(使用SEQ ID NO:4进行编号)。
葡糖淀粉酶PoAMG:草酸青霉菌葡糖淀粉酶的成熟部分,如WO 2011/127802中的SEQ ID NO:2披露的以及本文中显示的SEQ ID NO:10。
葡糖淀粉酶PoAMG498(GA498):具有以下突变的草酸青霉菌葡糖淀粉酶变体:K79V+P2N+P4S+P11F+T65A+Q327F(使用SEQ ID NO:10进行编号)。
葡糖淀粉酶X:包含以下的共混物:如WO 99/28448中的SEQ ID NO:34(本文中的SEQ ID NO:5)披露的埃默森篮状菌葡糖淀粉酶、如WO 06/69289中的SEQ ID NO:2(本文中的SEQ ID NO:9)披露的瓣环栓菌葡糖淀粉酶、以及本文中的SEQ ID NO:8披露的具有黑曲霉葡糖淀粉酶接头和淀粉结合结构域(SBD)的微小根毛霉α-淀粉酶,其具有以下取代:G128D+D143N(使用SEQ ID NO:8进行编号)(以AGU:AGU:FAU-F表示的活性比为约28:7:1)。
酵母:来自美国富酶泰斯公司(Fermentis)的ETHANOL REDTM。
测定
蛋白酶测定
1)动力学Suc-AAPF-pNA测定:
pNA底物:Suc-AAPF-pNA(巴亨公司(Bachem)L-1400)。
温度:室温(25℃)
测定缓冲液:100mM琥珀酸,100mM HEPES,100mM CHES,100mM CABS,1mM CaCl2,150mM KCl,0.01%Triton X-100,用HCl或NaOH调节至pH值2.0、3.0、4.0、5.0、6.0、7.0、8.0、9.0、10.0、和11.0。
将20μl蛋白酶(稀释于0.01%Triton X-100中)与100μl测定缓冲液混合。通过添加100μl pNA底物(50mg,溶解在1.0ml DMSO中,并进一步地用0.01%Triton X-100稀释45倍)开始进行测定。监测OD405的增加作为蛋白酶活性的量度。
2)端点Suc-AAPF-pNA AK测定:
pNA底物:Suc-AAPF-pNA(巴亨公司(Bachem)L-1400)。
温度:受控的(测定温度)。
测定缓冲液:100mM琥珀酸,100mM HEPES,100mM CHES,100mM CABS,1mM CaCl2,150mM KCl,0.01%Triton X-100,pH 9.0。
将200μl pNA底物(50mg,溶解在1.0ml DMSO中,并进一步地用测定缓冲液稀释45倍)用移液管移至艾本德管(Eppendorf)中并置于冰上。添加20μl蛋白酶样品(稀释于0.01%Triton X-100中)。通过将艾本德管转移至设置为测定温度的艾本德恒温混匀仪中来启动测定。将管在艾本德恒温混匀仪上在最高振摇速率(1400rpm)下孵育15分钟。通过转移管返回至冰浴并添加600μl 500mM琥珀酸/NaOH(pH 3.5)来停止孵育。通过涡旋混合艾本德管后,将200μl混合物转移至微量滴定板中。读取OD405作为蛋白酶活性的量度。在测定中包括空白缓冲液(buffer blind)(代替酶)。
实例1:来自减硫高温球菌DSM 14981的S8蛋白酶1的克隆与表达
基因
S8蛋白酶多肽编码序列的基因组DNA序列克隆自注释为减硫高温球菌DSM 14981的古细菌菌株。基因组DNA序列和推导的氨基酸序列分别在SEQ ID NO:1和SEQ ID NO:2中示出。
表达克隆
编码S8蛋白酶1多肽(SEQ ID NO 1)的1269bp基因作为StringsTm线性DNA片段订购自赛默飞世尔科技公司(Thermo Fisher Scientific)。将5’和3’区域与StringsTm DNA线性片段融合以允许其在SOE-PCR中直接使用。编码减硫高温球菌DSM 14981的S8蛋白酶1多肽的线性DNA片段通过SOE-PCR与调节元件和同源区域融合以重组到枯草芽孢杆菌基因组中。线性整合构建体是SOE-PCR融合产物(Horton,R.M.,Hunt,H.D.,Ho,S.N.,Pullen,J.K.以及Pease,L.R.(1989),Engineering hybrid genes withoutthe use of restriction enzymes,gene splicing by overlap extension[不使用限制酶,通过重叠延伸的基因剪接的工程化杂种基因]Gene[基因]77:61-68),该融合产物由两个枯草芽孢杆菌染色体区域之间的基因连同强启动子与氯霉素抗性标志的融合制备。SOEPCR方法也描述于专利申请WO2003095658中。
在三联启动子系统(如WO 99/43835中所述)的控制下表达该基因,该启动子系统由包含稳定化序列的地衣芽孢杆菌α-淀粉酶基因(amyL)启动子、解淀粉芽孢杆菌α-淀粉酶基因(amyQ)启动子和苏云金芽孢杆菌cryIIIA启动子组成。该SOE-PCR产物被转化进枯草芽孢杆菌中并且在染色体上通过同源重组将其整合到果胶裂解酶位点中。随后将包含该整合表达构建体的重组枯草芽孢杆菌克隆在液体培养基中进行生长。将培养液离心(20000x g,20min)并且将上清液小心地与沉淀物滗析分开,并用于酶的纯化。
实例2:来自减硫高温球菌的S8蛋白酶1的纯化和表征
来自减硫高温球菌的S8蛋白酶1的纯化
将培养液离心(20000x g,20min)并且将上清液小心地与沉淀物滗析分开。将上清液通过耐洁(Nalgene)0.2μm过滤装置过滤以便除去剩余的芽孢杆菌宿主细胞。将固体(NH4)2SO4添加至0.2μm滤液中达到1.0M(NH4)2SO4的最终浓度,并且将该酶溶液施加至在50mM H3BO3、10mM MES、2mM CaCl2、1.0M(NH4)2SO4、pH 6.0中平衡的苯基-琼脂糖FF(高取代)柱(来自通用电气医疗集团(GE Healthcare))上。在用平衡缓冲液充分洗涤柱之后,将蛋白酶用线性梯度在平衡缓冲液与75%(50mM H3BO3,10mM MES,2mM CaCl2,pH 6.0)+25%异丙醇之间经三个柱体积进行洗脱。分析来自柱的级分针对蛋白酶活性(使用在pH 9的动力学Suc-AAPF-pNA测定),并且合并蛋白酶活性峰。将来自苯基-琼脂糖柱的合并物转移至G25Sephadex柱(来自通用电气医疗集团)上的10mM Tris/HCl、1mM CaCl2、pH 9.0中,并将G25转移酶应用于在10mM Tris/HCl、1mM CaCl2、pH 9.0中平衡的Q-琼脂糖FF柱(来自通用电气医疗集团)。在将柱用平衡缓冲液充分地洗涤之后,将蛋白酶用在平衡缓冲液与10mMTris/HCl、1mM CaCl2、750mM NaCl、pH 9.0之间的线性梯度经五个柱体积洗脱。分析来自柱的级分的蛋白酶活性(使用在pH 9下的动力学Suc-AAPF-pNA测定),并且将活性级分合并并用脱矿质水稀释10倍。将该稀释的蛋白酶合并物施加到SOURCE 30Q柱(来自通用电气医疗集团)上,该柱以10mM Tris/HCl、1mM CaCl2(pH 9.0)平衡。在将柱用平衡缓冲液充分地洗涤之后,将蛋白酶用在平衡缓冲液与10mM Tris/HCl、1mM CaCl2、750mM NaCl、pH 9.0之间的线性梯度经五个柱体积洗脱。分析来自柱的级分的蛋白酶活性(使用在pH 9下的动力学Suc-AAPF-pNA测定),并且通过SDS-PAGE进一步分析活性级分。将级分(在考马斯染色的SDS-PAGE凝胶上有约37kDa的一条主要带)合并。合并物是纯化的制剂并且用于进一步表征。
来自减硫高温球菌的S8蛋白酶1的表征
使用动力学Suc-AAPF-pNA测定来获得来自减硫高温球菌的S8蛋白酶1的pH-活性曲线和pH-稳定性曲线。对于pH-稳定性曲线,将蛋白酶在不同测定缓冲液中稀释10倍以达到这些缓冲液的pH值并且然后在37℃孵育2小时。孵育之后,通过稀释于pH 9.0测定缓冲液中,使该蛋白酶孵育的pH转至pH 9.0,之后对残余活性进行测定。使用端点Suc-AAPF-pNA测定以获得在pH 9.0下的温度-活性曲线。
结果示于下表1-3中。对于表1,活性相对于酶的最佳pH。对于表2,活性是相对于样品的残余活性,将这些样品保持在稳定条件(5℃,pH 9.0)下。对于表3,活性相对于酶在pH9.0的最佳温度。
表1:pH活性曲线
表2:pH-稳定性曲线(在37℃2小时之后的残余活性)
表3:在pH 9.0下的温度活性曲线
来自减硫高温球菌的S8蛋白酶1的其他特征
抑制剂:PMSF。
确定N-末端序列的测定开始于SEQ ID NO:2中的位置102。
如通过SDS-PAGE确定的相对分子量约是Mr=37kDa。
通过完整分子量分析确定的观察到的分子量是33153.3Da。
将成熟序列(来自EDMAN N-末端测序数据和完整MS数据)确定为SEQ ID NO:2的氨基酸102至422。
来自这一成熟序列的计算的分子量是33152.3Da。
实例3:通过差示扫描量热法确定Td。
通过差示扫描量热法(DSC)使用VP-毛细管差示扫描量热仪(微量热公司(MicroCal Inc.),皮斯卡特维(Piscataway),新泽西州,美国),确定S8蛋白酶1和来自强烈火球菌(本文注释为PfuS)的参比丝氨酸蛋白酶(如SEQ ID NO:3披露的)的热稳定性。将用PfuS作为参比,因为它先前已示出具有良好的热稳定性并且适用于含淀粉材料的液化(WO2012/088303)。在200K/小时的恒定的程序化加热速率下,在加热缓冲液(50mM乙酸盐缓冲液,pH 4.5,2mM CaCl2)中的酶溶液(约0.5mg/ml)后获得的热分析图(Cp对T)中,热变性温度Td(℃)取自变性峰(主要的吸热峰)的顶端。将样品溶液和参比溶液(约0.2ml)从10℃的储存条件下装载到量热仪中(参比溶液:不含酶的缓冲液),并且在20℃热预平衡20分钟,随后从20℃至100℃进行DSC扫描。以约+/-1℃的精确度确定变性温度。在这些条件下获得的S8蛋白酶1和PfuS的Td总结于表4中。
表4:通过差示扫描量热法确定Td
样品 | Td |
S8蛋白酶1 | 112.2℃ |
PfuS | 90.4℃+96.5℃ |
实例4:玉米面筋水解产物
将含有约30%(w/v)干固体(DS)的玉米湿面筋在15mM乙酸盐缓冲液(pH 5)中稀释至5%(w/v)DS并搅拌直至完全溶解。将100ml 5%(w/v)DS(相当于5g DS)转移到具有三个挡板的500ml摇瓶中,并且每g DS添加500μg蛋白酶。将样品在设定为125rpm的旋转台上在50℃孵育24小时。在24小时长的孵育后,将玉米面筋水解产物通过0.45μm过滤器过滤,并添加苯基甲磺酰氟至最终浓度为500μM。然后将玉米面筋水解产物进行如下所述的游离氨基酸分析。表5总结了玉米面筋水解产物中蛋白酶释放的游离氨基酸的总量。
游离氨基酸分析
首先在3kDa滤膜上洗涤样品,并收集含有游离氨基酸的贯流物。使用WatersAccQ-Tag Ultra方法通过柱前衍生进行氨基酸分析。简而言之,将氨基酸通过AccQ-TagUltra试剂衍生化并用反相UPLC(沃特斯公司(Waters Corp.),米尔福德(Milford),马萨诸塞州)分离,并且基于UV吸光度定量衍生物。
表5:玉米面筋水解产物中的总游离氨基酸
实例5:减硫高温球菌蛋白酶用于乙醇生产的用途
测试本发明的成熟蛋白酶(SEQ ID NO:2的氨基酸102至422),用于对玉米粉浆料的常规乙醇方法中,包括液化步骤,随后是同时糖化和发酵。
液化:将全玉米粉、酒糟水和自来水的七个浆体制备成目标是32.50%干固体(DS)的120g的总重量;将酒糟水以30%反流重量/浆料重量共混。初始浆料pH约为5.2并且用45%w/v氢氧化钾或40%v/v硫酸调节为5.0。将固定剂量的α-淀粉酶BE369(2.1μg EP/gDS)和葡糖淀粉酶Po AMG498(4.5μg EP/g DS)施加到所有的浆体上并且与如下的来自嗜热高温球菌(Thermococcus litoralis)(Tl)的S8蛋白酶(SEQ ID NO:11)或来自减硫高温球菌(Tt)的S8蛋白酶(SEQ ID NO:2的氨基酸102至422)合并以评估液化期间的蛋白酶处理效果:
对照:α-淀粉酶BE369+葡糖淀粉酶PoAMG498
α-淀粉酶BE369+葡糖淀粉酶PoAMG498+0.5μg/g DS Tl蛋白酶
α-淀粉酶BE369+葡糖淀粉酶PoAMG498+1μg/g DS Tl蛋白酶
α-淀粉酶BE369+葡糖淀粉酶PoAMG498+3μg/g DS Tl蛋白酶
α-淀粉酶BE369+葡糖淀粉酶PoAMG498+0.5μg/g DS Tt蛋白酶
α-淀粉酶BE369+葡糖淀粉酶PoAMG498+1μg/g DS Tt蛋白酶
α-淀粉酶BE369+葡糖淀粉酶PoAMG498+3μg/g DS Tt蛋白酶
向每个罐中添加水和酶,并且然后在装入Labomat之前密封每个罐并且充分混合。在设置为以下条件的Labomat中孵育所有样品:以5℃/min斜升,15分钟斜升至80℃,保持1min,以1℃/min斜升至85℃并且保持103min,40rpm向左持续30秒并且向右持续30秒。一旦完成液化,在进行发酵之前,将所有的罐在冰浴中冷却约20分钟。
同时糖化和发酵(SSF):将青霉素添加至每个醪液中至最终浓度为3ppm并且调节pH至5.0。接下来,将该醪液的部分转移到试管中。用1/64”钻头钻所有的试管,以允许CO释放。将尿素添加到管的一半中至浓度为500ppm。此外,在所有处理中如保证尿素与无尿素醪液包含相等固体所需要的,通过添加水来保持等量固体。通过添加葡糖淀粉酶X(0.60AGU/gDS)、水和再水化的酵母来启动发酵。通过将5.5g的ETHANOL REDTM混合到100mL的32℃自来水中持续至少15分钟并且每个试管给予100μl进行酵母再水化。
HPLC分析:HPLC分析使用与Bio-Rad HPX-87H离子排阻柱(300mm x 7.8mm)和Bio-Rad阳离子H保护柱(Bio-Rad Cation H guard cartridge)结合的Agilent 1100/1200。流动相是0.6ml/min流速的0.005M硫酸和处理过的样品,其中柱和RI检波器分别温度为65℃和55℃。在54小时后通过每个处理舍弃3个管进行发酵取样。将每个管通过用50μl的40%v/v H2SO4失活、涡旋、以1460×g离心10分钟,并且通过0.45μm沃特曼PP过滤器(Whatman PPfilter)过滤进行处理。在HPLC分析之前和期间将样品储存在4℃。该方法使用针对DP4+、DP3、DP2、葡萄糖、果糖、乙酸、乳酸、甘油以及乙醇(%w/v)的校准标准品对分析物进行定量。使用四点校准(包括来源)进行定量。
获得的乙醇产量示于下表6和7中。
表6.用于氮受限的(无尿素)发酵的最终乙醇
处理 | 蛋白酶剂量(μg/g DS) | EtOH(%w/v) |
BE369+PoAMG(对照) | 0 | 11.272 |
对照+Tl | 0.5 | 12.0768 |
对照+Tl | 1 | 12.6484 |
对照+Tl | 3 | 13.2986 |
对照+Tt | 0.5 | 12.3314 |
对照+Tt | 1 | 12.8282 |
对照+Tt | 3 | 13.4724 |
表7用于基于尿素(500ppm)的发酵的最终乙醇
处理 | 蛋白酶剂量(μg/g DS) | EtOH(%w/v) |
BE369+PoAMG(对照) | 0 | 13.489 |
对照+Tl | 0.5 | 13.5632 |
对照+Tl | 1 | 13.524 |
对照+Tl | 3 | 13.5262 |
对照+Tt | 0.5 | 13.6232 |
对照+Tt | 1 | 13.547 |
对照+Tt | 3 | 13.5976 |
实例6:减硫高温球菌蛋白酶用于乙醇生产的用途
测试本发明的成熟蛋白酶(SEQ ID NO:2的氨基酸102至422),用于对玉米粉浆料的常规乙醇方法中,包括液化步骤,随后是同时糖化和发酵。
液化:将全玉米粉、酒糟水和自来水的浆体制备成目标是32.50%干固体(DS)的120g的总重量;将酒糟水以30%反流重量/浆料重量共混。初始浆料pH约为5.2并且用45%w/v氢氧化钾或40%v/v硫酸调节为5.0。将固定剂量的α-淀粉酶BE369(2.1μg EP/g DS)施加到所有的浆体上并且与如下的来自嗜热高温球菌(Tl)的S8蛋白酶(SEQ ID NO:11)或来自减硫高温球菌的S8蛋白酶(SEQ ID NO:2的氨基酸102至422)合并以评估液化期间的蛋白酶处理效果:
对照:α-淀粉酶BE369
α-淀粉酶BE369+0.5μg/g DS Tl蛋白酶
α-淀粉酶BE369+1μg/g DS Tl蛋白酶
α-淀粉酶BE369+3μg/g DS Tl蛋白酶
α-淀粉酶BE369+15μg/g DS Tl蛋白酶
α-淀粉酶BE369+0.5μg/g DS Tt蛋白酶
α-淀粉酶BE369+1μg/g DS Tt蛋白酶
α-淀粉酶BE369+3μg/g DS Tt蛋白酶
α-淀粉酶BE369+15μg/g DS Tt蛋白酶
向每个罐中添加水和酶,并且然后在装入Labomat之前密封每个罐并且充分混合。在设置为以下条件的Labomat中孵育所有样品:以5℃/min斜升,15分钟斜升至80℃,保持1min,以1℃/min斜升至85℃并且保持103min,40rpm向左持续30秒并且向右持续30秒。一旦完成液化,在进行发酵之前,将所有的罐在冰浴中冷却约20分钟。
同时糖化和发酵(SSF):将青霉素添加至每个醪液中至最终浓度为3ppm并且调节pH至5.0。接下来,将该醪液的部分转移到试管中。用1/64”钻头钻所有的试管,以允许CO释放。将尿素添加到管的一半中至浓度为500ppm。此外,在所有处理中如保证尿素与无尿素醪液包含相等固体所需要的,通过添加水来保持等量固体。通过添加葡糖淀粉酶X(0.60AGU/gDS)、水和再水化的酵母来启动发酵。通过将5.5g的ETHANOL REDTM混合到100mL的32℃自来水中持续至少15分钟并且每个试管给予100μl进行酵母再水化。
HPLC分析:HPLC分析使用与Bio-Rad HPX-87H离子排阻柱(300mm x 7.8mm)和Bio-Rad阳离子H保护柱(Bio-Rad Cation H guard cartridge)结合的Agilent 1100/1200。流动相是0.6ml/min流速的0.005M硫酸和处理过的样品,其中柱和RI检波器分别温度为65℃和55℃。在54小时后通过每个处理舍弃3个管进行发酵取样。将每个管通过用50μl的40%v/v H2SO4失活、涡旋、以1460×g离心10分钟,并且通过0.45μm沃特曼PP过滤器(Whatman PPfilter)过滤进行处理。在HPLC分析之前和期间将样品储存在4℃。该方法使用针对DP4+、DP3、DP2、葡萄糖、果糖、乙酸、乳酸、甘油以及乙醇(%w/v)的校准标准品对分析物进行定量。使用四点校准(包括来源)进行定量。
获得的乙醇产量示于下表8和9中。
表8.用于氮受限的(无尿素)发酵的最终乙醇
表9.用于基于尿素(500ppm)的发酵的最终乙醇
实例7:减硫高温球菌蛋白酶用于乙醇生产的用途
测试本发明的成熟蛋白酶(SEQ ID NO:2的氨基酸102至422),用于对玉米粉浆料的常规乙醇方法中,包括液化步骤,随后是同时糖化和发酵。
液化:将全玉米粉、酒糟水和自来水的浆体制备成目标是32.50%干固体(DS)的120g的总重量;将酒糟水以30%反流重量/浆料重量共混。初始浆料pH约为5.2并且用45%w/v氢氧化钾或40%v/v硫酸调节为5.0。将固定剂量的α-淀粉酶BE369(2.1μg EP/g DS)施加到所有的浆体上并且与如下的来自嗜热高温球菌的S8蛋白酶(SEQ ID NO:11)或来自减硫高温球菌的S8蛋白酶(SEQ ID NO:2的氨基酸102至422)合并以评估液化期间的蛋白酶处理效果:
对照:α-淀粉酶
α-淀粉酶BE369+0.5μg/g DS Tl蛋白酶
α-淀粉酶BE369+5.0μg/g DS Tl蛋白酶
α-淀粉酶BE369+5.0μg/g DS Tt蛋白酶
向每个罐中添加水和酶,并且然后在装入Labomat之前密封每个罐并且充分混合。在设置为以下条件的Labomat中孵育所有样品:以5℃/min斜升,15分钟斜升至80℃,保持1min,以1℃/min斜升至85℃并且保持103min,40rpm向左持续30秒并且向右持续30秒。一旦完成液化,在进行发酵之前,将所有的罐在冰浴中冷却约20分钟。
同时糖化和发酵(SSF):将青霉素添加至每个醪液中至最终浓度为3ppm并且调节pH至5.0。接下来,将该醪液的部分转移到试管中。用1/64”钻头钻所有的试管,以允许CO释放。将尿素添加到管的一半中至浓度为500ppm。此外,在所有处理中如保证尿素与无尿素醪液包含相等固体所需要的,通过添加水来保持等量固体。通过添加葡糖淀粉酶X(0.60AGU/gDS)、水和再水化的酵母来启动发酵。通过将5.5g的ETHANOL REDTM混合到100mL的32℃自来水中持续至少15分钟并且每个试管给予100μl进行酵母再水化。
HPLC分析:HPLC分析使用与Bio-Rad HPX-87H离子排阻柱(300mm x 7.8mm)和Bio-Rad阳离子H保护柱(Bio-Rad Cation H guard cartridge)结合的Agilent 1100/1200。流动相是0.6ml/min流速的0.005M硫酸和处理过的样品,其中柱和RI检波器分别温度为65℃和55℃。在54小时后通过每个处理舍弃3个管进行发酵取样。将每个管通过用50μl的40%v/v H2SO4失活、涡旋、以1460×g离心10分钟,并且通过0.45μm沃特曼PP过滤器(Whatman PPfilter)过滤进行处理。在HPLC分析之前和期间将样品储存在4℃。该方法使用针对DP4+、DP3、DP2、葡萄糖、果糖、乙酸、乳酸、甘油以及乙醇(%w/v)的校准标准品对分析物进行定量。使用四点校准(包括来源)进行定量。
获得的乙醇产量示于下表中。
表10.用于氮受限的(无尿素)发酵的最终乙醇
表11.用于基于尿素(500ppm)的发酵的最终乙醇
序列表
<110> 诺维信公司(Novozymes A/S)
<120> 具有蛋白酶活性的多肽和编码其的多核苷酸
<130> 14255-WO-PCT
<160> 11
<170> PatentIn版本3.5
<210> 1
<211> 1269
<212> DNA
<213> 减硫高温球菌(Thermococcus thioreducens)
<400> 1
atgggcagga aggatataac aattgctcta gtggccctga ttgtgctttc ccttttaggg 60
gttccagcga cggcagaaaa gcctgagctt gttagagtga tagtgcacgt ggacagggga 120
cacttcaaca cggcagacgt tgccacgata ggcggccacg ttgtttatca gtttaagctg 180
atagacgcgg tagtagtgga agtgccttca acagccgtgg gaaggctcaa gaagcttccg 240
ggagtcaaaa tggtagagtt tgaccacaag gcgaggatac ttgccgggcc accctcctgg 300
ctcggaggtg gacagccttc ccagcagatt ccgtggggaa tcagcagagt cagagccccg 360
gatgtatggg gcataaccga tggctctgga ggtgttattg aggtcgccgt tcttgatact 420
ggggttgact acgaccatcc ggatctggct ggtaatatag catggtgtgt tagcactctc 480
cggggcaggg ttacaacaaa tccagcccag tgtaaagacc agaatggtca tgggacacat 540
gttataggga caatagccgc gctcaacaat gacatcggcg ttgttggtgt tgctcccggg 600
gttgaaatat actccatcag ggttctggat gcaagcggga gcggttccta cagcgatata 660
gccatcggaa tcgaacaggc cctccttggc cccgatggaa ttctcgacaa ggacggcgac 720
gggataatcg tcggcgaccc ggacgacgat gccgcagaag ttataagcat gtccctcgga 780
ggcccaacgg acgaccagta tctccacgac atgattatca cggcatacaa ctacggtgtg 840
gttatagtgg cagcgagcgg caacgaggga gcttccagtc ccagctatcc cgccgcatat 900
cctgaggtca tagccgttgg tgcgagtgat gtaaacgatc agatcgcttc ctggagcaac 960
agacagccag aagttagtgc tccgggcgtt gacattctaa gcacctaccc ggacgacacc 1020
tacgagaccc tcagcggaac cagcatggca acgccacacg tcagcggtgt ggttgctctc 1080
atacaggcgg cctactacaa caagtacggc aaggttctcc cggttggaac cttcgacgac 1140
atgggaacca acaccgtcag gggaatcctc cacgttacgg ccgatgacct tggggacgct 1200
ggctgggaca tatactacgg ctacggaata gtccgggcag acttagccgt tcaggcggcc 1260
atcggctaa 1269
<210> 2
<211> 422
<212> PRT
<213> 减硫高温球菌(Thermococcus thioreducens)
<400> 2
Met Gly Arg Lys Asp Ile Thr Ile Ala Leu Val Ala Leu Ile Val Leu
1 5 10 15
Ser Leu Leu Gly Val Pro Ala Thr Ala Glu Lys Pro Glu Leu Val Arg
20 25 30
Val Ile Val His Val Asp Arg Gly His Phe Asn Thr Ala Asp Val Ala
35 40 45
Thr Ile Gly Gly His Val Val Tyr Gln Phe Lys Leu Ile Asp Ala Val
50 55 60
Val Val Glu Val Pro Ser Thr Ala Val Gly Arg Leu Lys Lys Leu Pro
65 70 75 80
Gly Val Lys Met Val Glu Phe Asp His Lys Ala Arg Ile Leu Ala Gly
85 90 95
Pro Pro Ser Trp Leu Gly Gly Gly Gln Pro Ser Gln Gln Ile Pro Trp
100 105 110
Gly Ile Ser Arg Val Arg Ala Pro Asp Val Trp Gly Ile Thr Asp Gly
115 120 125
Ser Gly Gly Val Ile Glu Val Ala Val Leu Asp Thr Gly Val Asp Tyr
130 135 140
Asp His Pro Asp Leu Ala Gly Asn Ile Ala Trp Cys Val Ser Thr Leu
145 150 155 160
Arg Gly Arg Val Thr Thr Asn Pro Ala Gln Cys Lys Asp Gln Asn Gly
165 170 175
His Gly Thr His Val Ile Gly Thr Ile Ala Ala Leu Asn Asn Asp Ile
180 185 190
Gly Val Val Gly Val Ala Pro Gly Val Glu Ile Tyr Ser Ile Arg Val
195 200 205
Leu Asp Ala Ser Gly Ser Gly Ser Tyr Ser Asp Ile Ala Ile Gly Ile
210 215 220
Glu Gln Ala Leu Leu Gly Pro Asp Gly Ile Leu Asp Lys Asp Gly Asp
225 230 235 240
Gly Ile Ile Val Gly Asp Pro Asp Asp Asp Ala Ala Glu Val Ile Ser
245 250 255
Met Ser Leu Gly Gly Pro Thr Asp Asp Gln Tyr Leu His Asp Met Ile
260 265 270
Ile Thr Ala Tyr Asn Tyr Gly Val Val Ile Val Ala Ala Ser Gly Asn
275 280 285
Glu Gly Ala Ser Ser Pro Ser Tyr Pro Ala Ala Tyr Pro Glu Val Ile
290 295 300
Ala Val Gly Ala Ser Asp Val Asn Asp Gln Ile Ala Ser Trp Ser Asn
305 310 315 320
Arg Gln Pro Glu Val Ser Ala Pro Gly Val Asp Ile Leu Ser Thr Tyr
325 330 335
Pro Asp Asp Thr Tyr Glu Thr Leu Ser Gly Thr Ser Met Ala Thr Pro
340 345 350
His Val Ser Gly Val Val Ala Leu Ile Gln Ala Ala Tyr Tyr Asn Lys
355 360 365
Tyr Gly Lys Val Leu Pro Val Gly Thr Phe Asp Asp Met Gly Thr Asn
370 375 380
Thr Val Arg Gly Ile Leu His Val Thr Ala Asp Asp Leu Gly Asp Ala
385 390 395 400
Gly Trp Asp Ile Tyr Tyr Gly Tyr Gly Ile Val Arg Ala Asp Leu Ala
405 410 415
Val Gln Ala Ala Ile Gly
420
<210> 3
<211> 412
<212> PRT
<213> 强烈火球菌(Pyrococcus furiosus)
<400> 3
Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gln Val Met Ala Thr
1 5 10 15
Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly Ile Thr Ile Gly Ile
20 25 30
Ile Asp Thr Gly Ile Asp Ala Ser His Pro Asp Leu Gln Gly Lys Val
35 40 45
Ile Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr Pro Tyr Asp Asp
50 55 60
His Gly His Gly Thr His Val Ala Ser Ile Ala Ala Gly Thr Gly Ala
65 70 75 80
Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly Ala Lys Leu Ala
85 90 95
Gly Ile Lys Val Leu Gly Ala Asp Gly Ser Gly Ser Ile Ser Thr Ile
100 105 110
Ile Lys Gly Val Glu Trp Ala Val Asp Asn Lys Asp Lys Tyr Gly Ile
115 120 125
Lys Val Ile Asn Leu Ser Leu Gly Ser Ser Gln Ser Ser Asp Gly Thr
130 135 140
Asp Ala Leu Ser Gln Ala Val Asn Ala Ala Trp Asp Ala Gly Leu Val
145 150 155 160
Val Val Val Ala Ala Gly Asn Ser Gly Pro Asn Lys Tyr Thr Ile Gly
165 170 175
Ser Pro Ala Ala Ala Ser Lys Val Ile Thr Val Gly Ala Val Asp Lys
180 185 190
Tyr Asp Val Ile Thr Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly
195 200 205
Arg Leu Lys Pro Glu Val Val Ala Pro Gly Asn Trp Ile Ile Ala Ala
210 215 220
Arg Ala Ser Gly Thr Ser Met Gly Gln Pro Ile Asn Asp Tyr Tyr Thr
225 230 235 240
Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ile Ala
245 250 255
Ala Leu Leu Leu Gln Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys
260 265 270
Thr Ala Leu Ile Glu Thr Ala Asp Ile Val Lys Pro Asp Glu Ile Ala
275 280 285
Asp Ile Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr Lys Ala Ile Asn
290 295 300
Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly Tyr Val Ala Asn Lys
305 310 315 320
Gly Ser Gln Thr His Gln Phe Val Ile Ser Gly Ala Ser Phe Val Thr
325 330 335
Ala Thr Leu Tyr Trp Asp Asn Ala Asn Ser Asp Leu Asp Leu Tyr Leu
340 345 350
Tyr Asp Pro Asn Gly Asn Gln Val Asp Tyr Ser Tyr Thr Ala Tyr Tyr
355 360 365
Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro Thr Asp Gly Thr Trp Thr
370 375 380
Ile Lys Val Val Ser Tyr Ser Gly Ser Ala Asn Tyr Gln Val Asp Val
385 390 395 400
Val Ser Asp Gly Ser Leu Ser Gln Pro Gly Ser Ser
405 410
<210> 4
<211> 515
<212> PRT
<213> 嗜热脂肪芽孢杆菌
<400> 4
Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr Leu
1 5 10 15
Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn
20 25 30
Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys
35 40 45
Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp
50 55 60
Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr
65 70 75 80
Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met
85 90 95
Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly
100 105 110
Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln
115 120 125
Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp Phe
130 135 140
Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His
145 150 155 160
Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg Ile Tyr
165 170 175
Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu
180 185 190
Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205
Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn
210 215 220
Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys
225 230 235 240
Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly
245 250 255
Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys
260 265 270
Leu His Asn Tyr Ile Thr Lys Thr Asn Gly Thr Met Ser Leu Phe Asp
275 280 285
Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala
290 295 300
Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro
305 310 315 320
Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln
325 330 335
Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala
340 345 350
Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp
355 360 365
Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile
370 375 380
Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His
385 390 395 400
Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Val
405 410 415
Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430
Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val
435 440 445
Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser
450 455 460
Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp
465 470 475 480
Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Arg Pro Ile Thr Thr
485 490 495
Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val
500 505 510
Ala Trp Pro
515
<210> 5
<211> 591
<212> PRT
<213> 埃默森篮状菌
<400> 5
Ala Thr Gly Ser Leu Asp Ser Phe Leu Ala Thr Glu Thr Pro Ile Ala
1 5 10 15
Leu Gln Gly Val Leu Asn Asn Ile Gly Pro Asn Gly Ala Asp Val Ala
20 25 30
Gly Ala Ser Ala Gly Ile Val Val Ala Ser Pro Ser Arg Ser Asp Pro
35 40 45
Asn Tyr Phe Tyr Ser Trp Thr Arg Asp Ala Ala Leu Thr Ala Lys Tyr
50 55 60
Leu Val Asp Ala Phe Ile Ala Gly Asn Lys Asp Leu Glu Gln Thr Ile
65 70 75 80
Gln Gln Tyr Ile Ser Ala Gln Ala Lys Val Gln Thr Ile Ser Asn Pro
85 90 95
Ser Gly Asp Leu Ser Thr Gly Gly Leu Gly Glu Pro Lys Phe Asn Val
100 105 110
Asn Glu Thr Ala Phe Thr Gly Pro Trp Gly Arg Pro Gln Arg Asp Gly
115 120 125
Pro Ala Leu Arg Ala Thr Ala Leu Ile Ala Tyr Ala Asn Tyr Leu Ile
130 135 140
Asp Asn Gly Glu Ala Ser Thr Ala Asp Glu Ile Ile Trp Pro Ile Val
145 150 155 160
Gln Asn Asp Leu Ser Tyr Ile Thr Gln Tyr Trp Asn Ser Ser Thr Phe
165 170 175
Asp Leu Trp Glu Glu Val Glu Gly Ser Ser Phe Phe Thr Thr Ala Val
180 185 190
Gln His Arg Ala Leu Val Glu Gly Asn Ala Leu Ala Thr Arg Leu Asn
195 200 205
His Thr Cys Ser Asn Cys Val Ser Gln Ala Pro Gln Val Leu Cys Phe
210 215 220
Leu Gln Ser Tyr Trp Thr Gly Ser Tyr Val Leu Ala Asn Phe Gly Gly
225 230 235 240
Ser Gly Arg Ser Gly Lys Asp Val Asn Ser Ile Leu Gly Ser Ile His
245 250 255
Thr Phe Asp Pro Ala Gly Gly Cys Asp Asp Ser Thr Phe Gln Pro Cys
260 265 270
Ser Ala Arg Ala Leu Ala Asn His Lys Val Val Thr Asp Ser Phe Arg
275 280 285
Ser Ile Tyr Ala Ile Asn Ser Gly Ile Ala Glu Gly Ser Ala Val Ala
290 295 300
Val Gly Arg Tyr Pro Glu Asp Val Tyr Gln Gly Gly Asn Pro Trp Tyr
305 310 315 320
Leu Ala Thr Ala Ala Ala Ala Glu Gln Leu Tyr Asp Ala Ile Tyr Gln
325 330 335
Trp Lys Lys Ile Gly Ser Ile Ser Ile Thr Asp Val Ser Leu Pro Phe
340 345 350
Phe Gln Asp Ile Tyr Pro Ser Ala Ala Val Gly Thr Tyr Asn Ser Gly
355 360 365
Ser Thr Thr Phe Asn Asp Ile Ile Ser Ala Val Gln Thr Tyr Gly Asp
370 375 380
Gly Tyr Leu Ser Ile Val Glu Lys Tyr Thr Pro Ser Asp Gly Ser Leu
385 390 395 400
Thr Glu Gln Phe Ser Arg Thr Asp Gly Thr Pro Leu Ser Ala Ser Ala
405 410 415
Leu Thr Trp Ser Tyr Ala Ser Leu Leu Thr Ala Ser Ala Arg Arg Gln
420 425 430
Ser Val Val Pro Ala Ser Trp Gly Glu Ser Ser Ala Ser Ser Val Pro
435 440 445
Ala Val Cys Ser Ala Thr Ser Ala Thr Gly Pro Tyr Ser Thr Ala Thr
450 455 460
Asn Thr Val Trp Pro Ser Ser Gly Ser Gly Ser Ser Thr Thr Thr Ser
465 470 475 480
Ser Ala Pro Cys Thr Thr Pro Thr Ser Val Ala Val Thr Phe Asp Glu
485 490 495
Ile Val Ser Thr Ser Tyr Gly Glu Thr Ile Tyr Leu Ala Gly Ser Ile
500 505 510
Pro Glu Leu Gly Asn Trp Ser Thr Ala Ser Ala Ile Pro Leu Arg Ala
515 520 525
Asp Ala Tyr Thr Asn Ser Asn Pro Leu Trp Tyr Val Thr Val Asn Leu
530 535 540
Pro Pro Gly Thr Ser Phe Glu Tyr Lys Phe Phe Lys Asn Gln Thr Asp
545 550 555 560
Gly Thr Ile Val Trp Glu Asp Asp Pro Asn Arg Ser Tyr Thr Val Pro
565 570 575
Ala Tyr Cys Gly Gln Thr Thr Ala Ile Leu Asp Asp Ser Trp Gln
580 585 590
<210> 6
<211> 556
<212> PRT
<213> 篱边粘褶菌
<400> 6
Gln Ser Val Asp Ser Tyr Val Ser Ser Glu Gly Pro Ile Ala Lys Ala
1 5 10 15
Gly Val Leu Ala Asn Ile Gly Pro Asn Gly Ser Lys Ala Ser Gly Ala
20 25 30
Ser Ala Gly Val Val Val Ala Ser Pro Ser Thr Ser Asp Pro Asp Tyr
35 40 45
Trp Tyr Thr Trp Thr Arg Asp Ser Ser Leu Val Phe Lys Ser Leu Ile
50 55 60
Asp Gln Tyr Thr Thr Gly Ile Asp Ser Thr Ser Ser Leu Arg Thr Leu
65 70 75 80
Ile Asp Asp Phe Val Thr Ala Glu Ala Asn Leu Gln Gln Val Ser Asn
85 90 95
Pro Ser Gly Thr Leu Thr Thr Gly Gly Leu Gly Glu Pro Lys Phe Asn
100 105 110
Val Asp Glu Thr Ala Phe Thr Gly Ala Trp Gly Arg Pro Gln Arg Asp
115 120 125
Gly Pro Ala Leu Arg Ser Thr Ala Leu Ile Thr Tyr Gly Asn Trp Leu
130 135 140
Leu Ser Asn Gly Asn Thr Ser Tyr Val Thr Ser Asn Leu Trp Pro Ile
145 150 155 160
Ile Gln Asn Asp Leu Gly Tyr Val Val Ser Tyr Trp Asn Gln Ser Thr
165 170 175
Tyr Asp Leu Trp Glu Glu Val Asp Ser Ser Ser Phe Phe Thr Thr Ala
180 185 190
Val Gln His Arg Ala Leu Arg Glu Gly Ala Ala Phe Ala Thr Ala Ile
195 200 205
Gly Gln Thr Ser Gln Val Ser Ser Tyr Thr Thr Gln Ala Asp Asn Leu
210 215 220
Leu Cys Phe Leu Gln Ser Tyr Trp Asn Pro Ser Gly Gly Tyr Ile Thr
225 230 235 240
Ala Asn Thr Gly Gly Gly Arg Ser Gly Lys Asp Ala Asn Thr Leu Leu
245 250 255
Ala Ser Ile His Thr Tyr Asp Pro Ser Ala Gly Cys Asp Ala Ala Thr
260 265 270
Phe Gln Pro Cys Ser Asp Lys Ala Leu Ser Asn Leu Lys Val Tyr Val
275 280 285
Asp Ser Phe Arg Ser Val Tyr Ser Ile Asn Ser Gly Val Ala Ser Asn
290 295 300
Ala Ala Val Ala Thr Gly Arg Tyr Pro Glu Asp Ser Tyr Gln Gly Gly
305 310 315 320
Asn Pro Trp Tyr Leu Thr Thr Phe Ala Val Ala Glu Gln Leu Tyr Asp
325 330 335
Ala Leu Asn Val Trp Glu Ser Gln Gly Ser Leu Glu Val Thr Ser Thr
340 345 350
Ser Leu Ala Phe Phe Gln Gln Phe Ser Ser Gly Val Thr Ala Gly Thr
355 360 365
Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Thr Leu Thr Ser Ala Ile Lys
370 375 380
Asn Phe Ala Asp Gly Phe Val Ala Ile Asn Ala Lys Tyr Thr Pro Ser
385 390 395 400
Asn Gly Gly Leu Ala Glu Gln Tyr Ser Lys Ser Asp Gly Ser Pro Leu
405 410 415
Ser Ala Val Asp Leu Thr Trp Ser Tyr Ala Ser Ala Leu Thr Ala Phe
420 425 430
Glu Ala Arg Asn Asn Thr Gln Phe Ala Gly Trp Gly Ala Ala Gly Leu
435 440 445
Thr Val Pro Ser Ser Cys Ser Gly Asn Ser Gly Gly Pro Thr Val Ala
450 455 460
Val Thr Phe Asn Val Asn Ala Glu Thr Val Trp Gly Glu Asn Ile Tyr
465 470 475 480
Leu Thr Gly Ser Val Asp Ala Leu Glu Asn Trp Ser Ala Asp Asn Ala
485 490 495
Leu Leu Leu Ser Ser Ala Asn Tyr Pro Thr Trp Ser Ile Thr Val Asn
500 505 510
Leu Pro Ala Ser Thr Ala Ile Glu Tyr Lys Tyr Ile Arg Lys Asn Asn
515 520 525
Gly Ala Val Thr Trp Glu Ser Asp Pro Asn Asn Ser Ile Thr Thr Pro
530 535 540
Ala Ser Gly Ser Thr Thr Glu Asn Asp Thr Trp Arg
545 550 555
<210> 7
<211> 559
<212> PRT
<213> 密粘褶菌
<400> 7
Gln Ser Val Asp Ser Tyr Val Gly Ser Glu Gly Pro Ile Ala Lys Ala
1 5 10 15
Gly Val Leu Ala Asn Ile Gly Pro Asn Gly Ser Lys Ala Ser Gly Ala
20 25 30
Ala Ala Gly Val Val Val Ala Ser Pro Ser Lys Ser Asp Pro Asp Tyr
35 40 45
Trp Tyr Thr Trp Thr Arg Asp Ser Ser Leu Val Phe Lys Ser Leu Ile
50 55 60
Asp Gln Tyr Thr Thr Gly Ile Asp Ser Thr Ser Ser Leu Arg Ser Leu
65 70 75 80
Ile Asp Ser Phe Val Ile Ala Glu Ala Asn Ile Gln Gln Val Ser Asn
85 90 95
Pro Ser Gly Thr Leu Thr Thr Gly Gly Leu Gly Glu Pro Lys Phe Asn
100 105 110
Val Asp Glu Thr Ala Phe Thr Gly Ala Trp Gly Arg Pro Gln Arg Asp
115 120 125
Gly Pro Ala Leu Arg Ala Thr Ala Leu Ile Thr Tyr Gly Asn Trp Leu
130 135 140
Leu Ser Asn Gly Asn Thr Thr Trp Val Thr Ser Thr Leu Trp Pro Ile
145 150 155 160
Ile Gln Asn Asp Leu Asn Tyr Val Val Gln Tyr Trp Asn Gln Thr Thr
165 170 175
Phe Asp Leu Trp Glu Glu Val Asn Ser Ser Ser Phe Phe Thr Thr Ala
180 185 190
Val Gln His Arg Ala Leu Arg Glu Gly Ala Ala Phe Ala Thr Lys Ile
195 200 205
Gly Gln Thr Ser Ser Val Ser Ser Tyr Thr Thr Gln Ala Ala Asn Leu
210 215 220
Leu Cys Phe Leu Gln Ser Tyr Trp Asn Pro Thr Ser Gly Tyr Ile Thr
225 230 235 240
Ala Asn Thr Gly Gly Gly Arg Ser Gly Lys Asp Ala Asn Thr Leu Leu
245 250 255
Ala Ser Ile His Thr Tyr Asp Pro Ser Ala Gly Cys Asp Ala Thr Thr
260 265 270
Phe Gln Pro Cys Ser Asp Lys Ala Leu Ser Asn Leu Lys Val Tyr Val
275 280 285
Asp Ser Phe Arg Ser Val Tyr Ser Ile Asn Ser Gly Ile Ala Ser Asn
290 295 300
Ala Ala Val Ala Thr Gly Arg Tyr Pro Glu Asp Ser Tyr Gln Gly Gly
305 310 315 320
Asn Pro Trp Tyr Leu Thr Thr Phe Ala Val Ala Glu Gln Leu Tyr Asp
325 330 335
Ala Leu Asn Val Trp Ala Ala Gln Gly Ser Leu Asn Val Thr Ser Ile
340 345 350
Ser Leu Pro Phe Phe Gln Gln Phe Ser Ser Ser Val Thr Ala Gly Thr
355 360 365
Tyr Ala Ser Ser Ser Thr Thr Tyr Thr Thr Leu Thr Ser Ala Ile Lys
370 375 380
Ser Phe Ala Asp Gly Phe Val Ala Ile Asn Ala Gln Tyr Thr Pro Ser
385 390 395 400
Asn Gly Gly Leu Ala Glu Gln Phe Ser Arg Ser Asn Gly Ala Pro Val
405 410 415
Ser Ala Val Asp Leu Thr Trp Ser Tyr Ala Ser Ala Leu Thr Ala Phe
420 425 430
Glu Ala Arg Asn Asn Thr Gln Phe Ala Gly Trp Gly Ala Val Gly Leu
435 440 445
Thr Val Pro Thr Ser Cys Ser Ser Asn Ser Gly Gly Gly Gly Gly Ser
450 455 460
Thr Val Ala Val Thr Phe Asn Val Asn Ala Gln Thr Val Trp Gly Glu
465 470 475 480
Asn Ile Tyr Ile Thr Gly Ser Val Asp Ala Leu Ser Asn Trp Ser Pro
485 490 495
Asp Asn Ala Leu Leu Leu Ser Ser Ala Asn Tyr Pro Thr Trp Ser Ile
500 505 510
Thr Val Asn Leu Pro Ala Ser Thr Ala Ile Gln Tyr Lys Tyr Ile Arg
515 520 525
Lys Asn Asn Gly Ala Val Thr Trp Glu Ser Asp Pro Asn Asn Ser Ile
530 535 540
Thr Thr Pro Ala Ser Gly Ser Val Thr Glu Asn Asp Thr Trp Arg
545 550 555
<210> 8
<211> 583
<212> PRT
<213> 人工
<220>
<223> 杂合α-淀粉酶,其包含来自融合至接头的微小根毛霉α-淀粉酶催化结构域和来自黑曲霉葡糖淀粉酶的淀粉结合结构域
<400> 8
Ala Thr Ser Asp Asp Trp Lys Gly Lys Ala Ile Tyr Gln Leu Leu Thr
1 5 10 15
Asp Arg Phe Gly Arg Ala Asp Asp Ser Thr Ser Asn Cys Ser Asn Leu
20 25 30
Ser Asn Tyr Cys Gly Gly Thr Tyr Glu Gly Ile Thr Lys His Leu Asp
35 40 45
Tyr Ile Ser Gly Met Gly Phe Asp Ala Ile Trp Ile Ser Pro Ile Pro
50 55 60
Lys Asn Ser Asp Gly Gly Tyr His Gly Tyr Trp Ala Thr Asp Phe Tyr
65 70 75 80
Gln Leu Asn Ser Asn Phe Gly Asp Glu Ser Gln Leu Lys Ala Leu Ile
85 90 95
Gln Ala Ala His Glu Arg Asp Met Tyr Val Met Leu Asp Val Val Ala
100 105 110
Asn His Ala Gly Pro Thr Ser Asn Gly Tyr Ser Gly Tyr Thr Phe Gly
115 120 125
Asp Ala Ser Leu Tyr His Pro Lys Cys Thr Ile Asp Tyr Asn Asp Gln
130 135 140
Thr Ser Ile Glu Gln Cys Trp Val Ala Asp Glu Leu Pro Asp Ile Asp
145 150 155 160
Thr Glu Asn Ser Asp Asn Val Ala Ile Leu Asn Asp Ile Val Ser Gly
165 170 175
Trp Val Gly Asn Tyr Ser Phe Asp Gly Ile Arg Ile Asp Thr Val Lys
180 185 190
His Ile Arg Lys Asp Phe Trp Thr Gly Tyr Ala Glu Ala Ala Gly Val
195 200 205
Phe Ala Thr Gly Glu Val Phe Asn Gly Asp Pro Ala Tyr Val Gly Pro
210 215 220
Tyr Gln Lys Tyr Leu Pro Ser Leu Ile Asn Tyr Pro Met Tyr Tyr Ala
225 230 235 240
Leu Asn Asp Val Phe Val Ser Lys Ser Lys Gly Phe Ser Arg Ile Ser
245 250 255
Glu Met Leu Gly Ser Asn Arg Asn Ala Phe Glu Asp Thr Ser Val Leu
260 265 270
Thr Thr Phe Val Asp Asn His Asp Asn Pro Arg Phe Leu Asn Ser Gln
275 280 285
Ser Asp Lys Ala Leu Phe Lys Asn Ala Leu Thr Tyr Val Leu Leu Gly
290 295 300
Glu Gly Ile Pro Ile Val Tyr Tyr Gly Ser Glu Gln Gly Phe Ser Gly
305 310 315 320
Gly Ala Asp Pro Ala Asn Arg Glu Val Leu Trp Thr Thr Asn Tyr Asp
325 330 335
Thr Ser Ser Asp Leu Tyr Gln Phe Ile Lys Thr Val Asn Ser Val Arg
340 345 350
Met Lys Ser Asn Lys Ala Val Tyr Met Asp Ile Tyr Val Gly Asp Asn
355 360 365
Ala Tyr Ala Phe Lys His Gly Asp Ala Leu Val Val Leu Asn Asn Tyr
370 375 380
Gly Ser Gly Ser Thr Asn Gln Val Ser Phe Ser Val Ser Gly Lys Phe
385 390 395 400
Asp Ser Gly Ala Ser Leu Met Asp Ile Val Ser Asn Ile Thr Thr Thr
405 410 415
Val Ser Ser Asp Gly Thr Val Thr Phe Asn Leu Lys Asp Gly Leu Pro
420 425 430
Ala Ile Phe Thr Ser Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Pro
435 440 445
Thr Gly Ser Gly Ser Val Thr Ser Thr Ser Lys Thr Thr Ala Thr Ala
450 455 460
Ser Lys Thr Ser Thr Ser Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr
465 470 475 480
Ala Val Ala Val Thr Phe Asp Leu Thr Ala Thr Thr Thr Tyr Gly Glu
485 490 495
Asn Ile Tyr Leu Val Gly Ser Ile Ser Gln Leu Gly Asp Trp Glu Thr
500 505 510
Ser Asp Gly Ile Ala Leu Ser Ala Asp Lys Tyr Thr Ser Ser Asp Pro
515 520 525
Leu Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Glu Ser Phe Glu Tyr
530 535 540
Lys Phe Ile Arg Ile Glu Ser Asp Asp Ser Val Glu Trp Glu Ser Asp
545 550 555 560
Pro Asn Arg Glu Tyr Thr Val Pro Gln Ala Cys Gly Thr Ser Thr Ala
565 570 575
Thr Val Thr Asp Thr Trp Arg
580
<210> 9
<211> 556
<212> PRT
<213> 瓣环栓菌
<400> 9
Gln Ser Ser Ala Ala Asp Ala Tyr Val Ala Ser Glu Ser Pro Ile Ala
1 5 10 15
Lys Ala Gly Val Leu Ala Asn Ile Gly Pro Ser Gly Ser Lys Ser Asn
20 25 30
Gly Ala Lys Ala Gly Ile Val Ile Ala Ser Pro Ser Thr Ser Asn Pro
35 40 45
Asn Tyr Leu Tyr Thr Trp Thr Arg Asp Ser Ser Leu Val Phe Lys Ala
50 55 60
Leu Ile Asp Gln Phe Thr Thr Gly Glu Asp Thr Ser Leu Arg Thr Leu
65 70 75 80
Ile Asp Glu Phe Thr Ser Ala Glu Ala Ile Leu Gln Gln Val Pro Asn
85 90 95
Pro Ser Gly Thr Val Ser Thr Gly Gly Leu Gly Glu Pro Lys Phe Asn
100 105 110
Ile Asp Glu Thr Ala Phe Thr Asp Ala Trp Gly Arg Pro Gln Arg Asp
115 120 125
Gly Pro Ala Leu Arg Ala Thr Ala Ile Ile Thr Tyr Ala Asn Trp Leu
130 135 140
Leu Asp Asn Lys Asn Thr Thr Tyr Val Thr Asn Thr Leu Trp Pro Ile
145 150 155 160
Ile Lys Leu Asp Leu Asp Tyr Val Ala Ser Asn Trp Asn Gln Ser Thr
165 170 175
Phe Asp Leu Trp Glu Glu Ile Asn Ser Ser Ser Phe Phe Thr Thr Ala
180 185 190
Val Gln His Arg Ala Leu Arg Glu Gly Ala Thr Phe Ala Asn Arg Ile
195 200 205
Gly Gln Thr Ser Val Val Ser Gly Tyr Thr Thr Gln Ala Asn Asn Leu
210 215 220
Leu Cys Phe Leu Gln Ser Tyr Trp Asn Pro Thr Gly Gly Tyr Ile Thr
225 230 235 240
Ala Asn Thr Gly Gly Gly Arg Ser Gly Lys Asp Ala Asn Thr Val Leu
245 250 255
Thr Ser Ile His Thr Phe Asp Pro Ala Ala Gly Cys Asp Ala Val Thr
260 265 270
Phe Gln Pro Cys Ser Asp Lys Ala Leu Ser Asn Leu Lys Val Tyr Val
275 280 285
Asp Ala Phe Arg Ser Ile Tyr Ser Ile Asn Ser Gly Ile Ala Ser Asn
290 295 300
Ala Ala Val Ala Thr Gly Arg Tyr Pro Glu Asp Ser Tyr Met Gly Gly
305 310 315 320
Asn Pro Trp Tyr Leu Thr Thr Ser Ala Val Ala Glu Gln Leu Tyr Asp
325 330 335
Ala Leu Ile Val Trp Asn Lys Leu Gly Ala Leu Asn Val Thr Ser Thr
340 345 350
Ser Leu Pro Phe Phe Gln Gln Phe Ser Ser Gly Val Thr Val Gly Thr
355 360 365
Tyr Ala Ser Ser Ser Ser Thr Phe Lys Thr Leu Thr Ser Ala Ile Lys
370 375 380
Thr Phe Ala Asp Gly Phe Leu Ala Val Asn Ala Lys Tyr Thr Pro Ser
385 390 395 400
Asn Gly Gly Leu Ala Glu Gln Tyr Ser Arg Ser Asn Gly Ser Pro Val
405 410 415
Ser Ala Val Asp Leu Thr Trp Ser Tyr Ala Ala Ala Leu Thr Ser Phe
420 425 430
Ala Ala Arg Ser Gly Lys Thr Tyr Ala Ser Trp Gly Ala Ala Gly Leu
435 440 445
Thr Val Pro Thr Thr Cys Ser Gly Ser Gly Gly Ala Gly Thr Val Ala
450 455 460
Val Thr Phe Asn Val Gln Ala Thr Thr Val Phe Gly Glu Asn Ile Tyr
465 470 475 480
Ile Thr Gly Ser Val Pro Ala Leu Gln Asn Trp Ser Pro Asp Asn Ala
485 490 495
Leu Ile Leu Ser Ala Ala Asn Tyr Pro Thr Trp Ser Ile Thr Val Asn
500 505 510
Leu Pro Ala Ser Thr Thr Ile Glu Tyr Lys Tyr Ile Arg Lys Phe Asn
515 520 525
Gly Ala Val Thr Trp Glu Ser Asp Pro Asn Asn Ser Ile Thr Thr Pro
530 535 540
Ala Ser Gly Thr Phe Thr Gln Asn Asp Thr Trp Arg
545 550 555
<210> 10
<211> 595
<212> PRT
<213> 草酸青霉菌
<400> 10
Arg Pro Asp Pro Lys Gly Gly Asn Leu Thr Pro Phe Ile His Lys Glu
1 5 10 15
Gly Glu Arg Ser Leu Gln Gly Ile Leu Asp Asn Leu Gly Gly Arg Gly
20 25 30
Lys Lys Thr Pro Gly Thr Ala Ala Gly Leu Phe Ile Ala Ser Pro Asn
35 40 45
Thr Glu Asn Pro Asn Tyr Tyr Tyr Thr Trp Thr Arg Asp Ser Ala Leu
50 55 60
Thr Ala Lys Cys Leu Ile Asp Leu Phe Glu Asp Ser Arg Ala Lys Phe
65 70 75 80
Pro Ile Asp Arg Lys Tyr Leu Glu Thr Gly Ile Arg Asp Tyr Lys Ser
85 90 95
Ser Gln Ala Ile Leu Gln Ser Val Ser Asn Pro Ser Gly Thr Leu Lys
100 105 110
Asp Gly Ser Gly Leu Gly Glu Pro Lys Phe Glu Ile Asp Leu Asn Pro
115 120 125
Phe Ser Gly Ala Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg
130 135 140
Ala Thr Ala Met Ile Thr Tyr Ala Asn Tyr Leu Ile Ser His Gly Gln
145 150 155 160
Lys Ser Asp Val Ser Gln Val Met Trp Pro Ile Ile Ala Asn Asp Leu
165 170 175
Ala Tyr Val Gly Gln Tyr Trp Asn Asn Thr Gly Phe Asp Leu Trp Glu
180 185 190
Glu Val Asp Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala
195 200 205
Leu Val Glu Gly Ser Gln Leu Ala Lys Lys Leu Gly Lys Ser Cys Asp
210 215 220
Ala Cys Asp Ser Gln Pro Pro Gln Ile Leu Cys Phe Leu Gln Ser Phe
225 230 235 240
Trp Asn Gly Lys Tyr Ile Thr Ser Asn Ile Asn Thr Gln Ala Ser Arg
245 250 255
Ser Gly Ile Asp Leu Asp Ser Val Leu Gly Ser Ile His Thr Phe Asp
260 265 270
Pro Glu Ala Ala Cys Asp Asp Ala Thr Phe Gln Pro Cys Ser Ala Arg
275 280 285
Ala Leu Ala Asn His Lys Val Tyr Val Asp Ser Phe Arg Ser Ile Tyr
290 295 300
Lys Ile Asn Ala Gly Leu Ala Glu Gly Ser Ala Ala Asn Val Gly Arg
305 310 315 320
Tyr Pro Glu Asp Val Tyr Gln Gly Gly Asn Pro Trp Tyr Leu Ala Thr
325 330 335
Leu Gly Ala Ser Glu Leu Leu Tyr Asp Ala Leu Tyr Gln Trp Asp Arg
340 345 350
Leu Gly Lys Leu Glu Val Ser Glu Thr Ser Leu Ser Phe Phe Lys Asp
355 360 365
Phe Asp Ala Thr Val Lys Ile Gly Ser Tyr Ser Arg Asn Ser Lys Thr
370 375 380
Tyr Lys Lys Leu Thr Gln Ser Ile Lys Ser Tyr Ala Asp Gly Phe Ile
385 390 395 400
Gln Leu Val Gln Gln Tyr Thr Pro Ser Asn Gly Ser Leu Ala Glu Gln
405 410 415
Tyr Asp Arg Asn Thr Ala Ala Pro Leu Ser Ala Asn Asp Leu Thr Trp
420 425 430
Ser Phe Ala Ser Phe Leu Thr Ala Thr Gln Arg Arg Asp Ala Val Val
435 440 445
Pro Pro Ser Trp Gly Ala Lys Ser Ala Asn Lys Val Pro Thr Thr Cys
450 455 460
Ser Ala Ser Pro Val Val Gly Thr Tyr Lys Ala Pro Thr Ala Thr Phe
465 470 475 480
Ser Ser Lys Thr Lys Cys Val Pro Ala Lys Asp Ile Val Pro Ile Thr
485 490 495
Phe Tyr Leu Ile Glu Asn Thr Tyr Tyr Gly Glu Asn Val Phe Met Ser
500 505 510
Gly Asn Ile Thr Ala Leu Gly Asn Trp Asp Ala Lys Lys Gly Phe Pro
515 520 525
Leu Thr Ala Asn Leu Tyr Thr Gln Asp Gln Asn Leu Trp Phe Ala Ser
530 535 540
Val Glu Phe Ile Pro Ala Gly Thr Pro Phe Glu Tyr Lys Tyr Tyr Lys
545 550 555 560
Val Glu Pro Asn Gly Asp Ile Thr Trp Glu Lys Gly Pro Asn Arg Val
565 570 575
Phe Val Ala Pro Thr Gly Cys Pro Val Gln Pro His Ser Asn Asp Val
580 585 590
Trp Gln Phe
595
<210> 11
<211> 424
<212> PRT
<213> 嗜热高温球菌
<400> 11
Met Glu Phe Asn Lys Val Phe Ser Leu Leu Leu Val Phe Val Val Leu
1 5 10 15
Gly Ala Thr Ala Gly Ile Val Gly Ala Val Ser Ala Glu Lys Val Arg
20 25 30
Val Ile Ile Thr Ile Asp Lys Asp Phe Asn Glu Asn Ser Val Phe Ala
35 40 45
Leu Gly Gly Asn Val Val Ala Arg Gly Lys Val Phe Pro Ile Val Ile
50 55 60
Ala Glu Leu Ser Pro Arg Ala Val Glu Arg Leu Lys Asn Ala Lys Gly
65 70 75 80
Val Val Arg Val Glu Tyr Asp Ala Glu Val Gln Val Leu Lys Gly Lys
85 90 95
Ser Pro Gly Ala Gly Lys Pro Lys Pro Ser Gln Pro Ala Gln Thr Ile
100 105 110
Pro Trp Gly Ile Glu Arg Ile Lys Ala Pro Asp Val Trp Ser Ile Thr
115 120 125
Asp Gly Ser Ser Ser Gly Val Ile Glu Val Ala Ile Leu Asp Thr Gly
130 135 140
Ile Asp Tyr Asp His Pro Asp Leu Ala Ala Asn Leu Ala Trp Gly Val
145 150 155 160
Ser Val Leu Arg Gly Lys Val Ser Thr Lys Pro Lys Asp Tyr Lys Asp
165 170 175
Gln Asn Gly His Gly Thr His Val Ala Gly Thr Val Ala Ala Leu Asn
180 185 190
Asn Asp Ile Gly Val Val Gly Val Ala Pro Ala Val Glu Ile Tyr Ala
195 200 205
Val Arg Val Leu Asp Ala Ser Gly Arg Gly Ser Tyr Ser Asp Ile Ile
210 215 220
Leu Gly Ile Glu Gln Ala Leu Leu Gly Pro Asp Gly Val Leu Asp Ser
225 230 235 240
Asp Gly Asp Gly Ile Ile Val Gly Asp Pro Asp Asp Asp Ala Ala Glu
245 250 255
Val Ile Ser Met Ser Leu Gly Gly Leu Ser Asp Val Gln Ala Phe His
260 265 270
Asp Ala Ile Ile Glu Ala Tyr Asn Tyr Gly Val Val Ile Val Ala Ala
275 280 285
Ser Gly Asn Glu Gly Ala Ser Ser Pro Ser Tyr Pro Ala Ala Tyr Pro
290 295 300
Glu Val Ile Ala Val Gly Ala Thr Asp Val Asn Asp Gln Val Pro Trp
305 310 315 320
Trp Ser Asn Arg Gly Val Glu Val Ser Ala Pro Gly Val Asp Val Leu
325 330 335
Ser Thr Tyr Pro Asp Asp Ser Tyr Glu Thr Leu Ser Gly Thr Ser Met
340 345 350
Ala Thr Pro His Val Ser Gly Val Val Ala Leu Ile Gln Ala Ala Tyr
355 360 365
Tyr Asn Lys Tyr Gly Ser Val Leu Pro Val Gly Thr Phe Asp Asp Asn
370 375 380
Thr Met Ser Thr Val Arg Gly Ile Leu His Ile Thr Ala Asp Asp Leu
385 390 395 400
Gly Ser Ser Gly Trp Asp Ala Asp Tyr Gly Tyr Gly Ile Val Arg Ala
405 410 415
Asp Leu Ala Val Gln Ala Val Asn
420
Claims (15)
1.一种多肽,该多肽具有蛋白酶活性,该多肽选自由以下组成的组:
(a)多肽,该多肽与SEQ ID NO:2的成熟多肽具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%序列同一性;
(b)多肽,该多肽由以下多核苷酸编码,该多核苷酸在非常高严格条件下与(i)SEQ IDNO:1的成熟多肽编码序列,(ii)(i)或(ii)的全长互补序列杂交;
(c)多肽,该多肽由以下多核苷酸编码,该多核苷酸与SEQ ID NO:1的成熟多肽编码序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性;
(d)(a)、(b)、(c)或(d)的多肽的片段,该片段具有蛋白酶活性。
2.如权利要求1所述的多肽,其中该成熟多肽是SEQ ID NO:2的氨基酸102至422。
3.一种多核苷酸,该多核苷酸编码如权利要求1-2中任一项所述的多肽。
4.一种核酸构建体或重组表达载体,该核酸构建体或重组表达载体包含如权利要求3所述的多核苷酸,该多核苷酸可操作地连接至指导该多肽在表达宿主中的产生的一个或多个异源控制序列。
5.一种重组宿主细胞,该重组宿主细胞包含如权利要求3所述的多核苷酸,该多核苷酸可操作地连接至指导该多肽的产生的一个或多个异源控制序列。
6.一种产生具有蛋白酶活性的多肽的方法,该方法包括(a)在有益于产生该多肽的条件下培养如权利要求5所述的宿主细胞,和(b)任选地回收该多肽。
7.一种用于液化含淀粉材料的方法,该方法包括在至少α-淀粉酶和S8A减硫高温球菌蛋白酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料。
8.一种用于由含淀粉材料生产发酵产物的方法,该方法包括以下步骤:
a)在至少以下酶的存在下,在高于初始糊化温度的温度,液化该含淀粉材料:
-α-淀粉酶;和
-S8A减硫高温球菌蛋白酶;
b)使用葡糖淀粉酶进行糖化;
c)使用发酵生物进行发酵。
9.一种通过如权利要求8所述的方法从发酵产物生产中回收油的方法,该方法进一步包括以下步骤:
d)回收该发酵产物以形成全酒糟;
e)将该全酒糟分离为酒糟水和湿饼;
f)任选地将该酒糟水浓缩为浆体;
其中从以下各项中回收油:
-在如权利要求8所述的方法的步骤a)之后液化的含淀粉材料;和/或
-如权利要求8所述的方法的发酵步骤c)的下游。
10.如权利要求8-9中任一项所述的方法,其中在液化中存在和/或添加从1至50微克、特别地是从2至40微克、特别地是从4至25微克、特别地是从5至20微克减硫高温球菌S8A蛋白酶/克DS。
11.如权利要求8-10中任一项所述的方法,其中该减硫高温球菌蛋白酶选自:
a)多肽,该多肽包含SEQ ID NO:2的氨基酸102至422或由其组成;或
b)多肽,该多肽与SEQ ID NO:2的氨基酸102至422具有至少80%、至少85、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列同一性。
12.如实施例8-11中任一项所述的方法,其中该发酵产物是醇,优选乙醇,尤其是燃料乙醇、饮用乙醇和/或工业乙醇。
13.一种酶组合物,该酶组合物包含如权利要求1-2中任一项所述的S8A蛋白酶。
14.如权利要求13所述的酶组合物,该酶组合物进一步包含α-淀粉酶。
15.减硫高温球菌S8A蛋白酶在液化含淀粉材料中的用途。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662425655P | 2016-11-23 | 2016-11-23 | |
US62/425,655 | 2016-11-23 | ||
PCT/US2017/062718 WO2018098124A1 (en) | 2016-11-23 | 2017-11-21 | Polypeptides having protease activity and polynucleotides encoding same |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110072997A true CN110072997A (zh) | 2019-07-30 |
Family
ID=60766140
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201780060772.6A Pending CN110072997A (zh) | 2016-11-23 | 2017-11-21 | 具有蛋白酶活性的多肽和编码其的多核苷酸 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20190367952A1 (zh) |
EP (1) | EP3545086A1 (zh) |
CN (1) | CN110072997A (zh) |
CA (1) | CA3039809A1 (zh) |
WO (1) | WO2018098124A1 (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112601818A (zh) * | 2018-07-25 | 2021-04-02 | 诺维信公司 | 用于生产乙醇的表达酶的酵母 |
US20220010340A1 (en) * | 2020-07-10 | 2022-01-13 | Lallemand Hungary Liquidity Management Llc | Bacterial-derived nitrogen source for ethanol fermentation |
WO2023225459A2 (en) | 2022-05-14 | 2023-11-23 | Novozymes A/S | Compositions and methods for preventing, treating, supressing and/or eliminating phytopathogenic infestations and infections |
WO2024137246A1 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Carbohydrate esterase family 1 (ce1) polypeptides having ferulic acid esterase and/or acetyl xylan esterase activity and polynucleotides encoding same |
WO2024137248A1 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Compositions comprising arabinofuranosidases and a xylanase, and use thereof for increasing hemicellulosic fiber solubilization |
WO2024137252A1 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Process for reducing syrup viscosity in the backend of a process for producing a fermentation product |
WO2024137250A1 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Carbohydrate esterase family 3 (ce3) polypeptides having acetyl xylan esterase activity and polynucleotides encoding same |
WO2024137704A2 (en) | 2022-12-19 | 2024-06-27 | Novozymes A/S | Processes for producing fermentation products using fiber-degrading enzymes with engineered yeast |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013082486A1 (en) * | 2011-12-02 | 2013-06-06 | Novozymes A/S | Processes for producing fermentation products |
CN103620029A (zh) * | 2011-06-24 | 2014-03-05 | 诺维信公司 | 具有蛋白酶活性的多肽和编码它们的多核苷酸 |
CN104011204A (zh) * | 2011-12-20 | 2014-08-27 | 诺维信公司 | 枯草杆菌酶变体和编码它们的多核苷酸 |
WO2014209800A1 (en) * | 2013-06-24 | 2014-12-31 | Novozymes A/S | Processes for recovering oil from fermentation product processes and processes for producing fermentation products |
CN104812905A (zh) * | 2012-11-30 | 2015-07-29 | 诺维信公司 | 用于生产发酵产物的方法 |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4560651A (en) | 1981-04-20 | 1985-12-24 | Novo Industri A/S | Debranching enzyme product, preparation and use thereof |
US5223409A (en) | 1988-09-02 | 1993-06-29 | Protein Engineering Corp. | Directed evolution of novel binding proteins |
AU641169B2 (en) | 1989-06-13 | 1993-09-16 | Genencor International, Inc. | A method for killing cells without cell lysis |
IL99552A0 (en) | 1990-09-28 | 1992-08-18 | Ixsys Inc | Compositions containing procaryotic cells,a kit for the preparation of vectors useful for the coexpression of two or more dna sequences and methods for the use thereof |
FR2704860B1 (fr) | 1993-05-05 | 1995-07-13 | Pasteur Institut | Sequences de nucleotides du locus cryiiia pour le controle de l'expression de sequences d'adn dans un hote cellulaire. |
DE4343591A1 (de) | 1993-12-21 | 1995-06-22 | Evotec Biosystems Gmbh | Verfahren zum evolutiven Design und Synthese funktionaler Polymere auf der Basis von Formenelementen und Formencodes |
US5605793A (en) | 1994-02-17 | 1997-02-25 | Affymax Technologies N.V. | Methods for in vitro recombination |
CN1192108C (zh) | 1994-06-03 | 2005-03-09 | 诺沃奇梅兹生物技术有限公司 | 纯化的毁丝霉属漆酶及编码该酶的核酸 |
AU9434398A (en) | 1997-10-13 | 1999-05-03 | Novo Nordisk A/S | Alpha-amylase mutants |
JP4718005B2 (ja) | 1997-11-26 | 2011-07-06 | ノボザイムス アクティーゼルスカブ | 熱安定グルコアミラーゼ |
US5955310A (en) | 1998-02-26 | 1999-09-21 | Novo Nordisk Biotech, Inc. | Methods for producing a polypeptide in a bacillus cell |
EP1250423B1 (en) | 2000-01-12 | 2008-09-03 | Novozymes A/S | Pullulanase variants and methods for preparing such variants with predetermined properties |
WO2003095658A1 (en) | 2002-05-07 | 2003-11-20 | Novozymes A/S | Homologous recombination into bacterium for the generation of polynucleotide libraries |
WO2006069289A2 (en) | 2004-12-22 | 2006-06-29 | Novozymes North America, Inc | Polypeptides having glucoamylase activity and polynucleotides encoding same |
US20110223671A1 (en) | 2008-09-30 | 2011-09-15 | Novozymes, Inc. | Methods for using positively and negatively selectable genes in a filamentous fungal cell |
BRPI1008890A2 (pt) | 2009-02-20 | 2015-08-25 | Danisco Us Inc | Formulações de caldos de fermentação |
MX2012006096A (es) | 2009-11-30 | 2012-07-03 | Novozymes North America Inc | Polipeptidos que tienen actividad glucoamilasa y polinucleotidos que codifican para los mismos. |
CA2782036C (en) | 2009-12-01 | 2019-01-15 | Novozymes A/S | Polypeptides having glucoamylase activity and polynucleotides encoding same |
KR20110069283A (ko) * | 2009-12-17 | 2011-06-23 | 한국해양연구원 | Thermococcus sp. NA1 으로부터의 유용한 유전자들 |
EP3070171B1 (en) | 2010-03-30 | 2018-06-13 | Novozymes A/S | Process for enhancing by-products from fermentation processes |
DK2558484T3 (en) | 2010-04-14 | 2016-03-29 | Novozymes As | Polypeptides having glucoamylase activity and polynucleotides encoding them |
CN103298932B (zh) | 2010-11-08 | 2016-08-10 | 诺维信公司 | 具有葡糖淀粉酶活性的多肽和编码该多肽的多核苷酸 |
ES2673940T3 (es) | 2010-12-22 | 2018-06-26 | Novozymes North America, Inc. | Proceso para producir productos de fermentación a partir de materiales que contienen almidón |
EP2729572B1 (en) | 2011-07-06 | 2019-03-20 | Novozymes A/S | Alpha amylase variants and polynucleotides encoding same |
WO2013053801A1 (en) | 2011-10-11 | 2013-04-18 | Novozymes A/S | Glucoamylase variants and polynucleotides encoding same |
WO2016196202A1 (en) | 2015-05-29 | 2016-12-08 | Novozymes A/S | Polypeptides having protease activity and polynucleotides encoding same |
-
2017
- 2017-11-21 US US16/461,846 patent/US20190367952A1/en not_active Abandoned
- 2017-11-21 WO PCT/US2017/062718 patent/WO2018098124A1/en unknown
- 2017-11-21 CA CA3039809A patent/CA3039809A1/en not_active Abandoned
- 2017-11-21 EP EP17817955.2A patent/EP3545086A1/en not_active Withdrawn
- 2017-11-21 CN CN201780060772.6A patent/CN110072997A/zh active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103620029A (zh) * | 2011-06-24 | 2014-03-05 | 诺维信公司 | 具有蛋白酶活性的多肽和编码它们的多核苷酸 |
WO2013082486A1 (en) * | 2011-12-02 | 2013-06-06 | Novozymes A/S | Processes for producing fermentation products |
CN104011204A (zh) * | 2011-12-20 | 2014-08-27 | 诺维信公司 | 枯草杆菌酶变体和编码它们的多核苷酸 |
CN104812905A (zh) * | 2012-11-30 | 2015-07-29 | 诺维信公司 | 用于生产发酵产物的方法 |
WO2014209800A1 (en) * | 2013-06-24 | 2014-12-31 | Novozymes A/S | Processes for recovering oil from fermentation product processes and processes for producing fermentation products |
CN105339501A (zh) * | 2013-06-24 | 2016-02-17 | 诺维信公司 | 用于从发酵产物过程中回收油的方法以及用于生产发酵产物的方法 |
Non-Patent Citations (1)
Title |
---|
HONG S.J,ET AL: "Thermococus thioreducens DSM 14981 genome sequencing", 《UNIPROTKB》 * |
Also Published As
Publication number | Publication date |
---|---|
WO2018098124A1 (en) | 2018-05-31 |
EP3545086A1 (en) | 2019-10-02 |
CA3039809A1 (en) | 2018-05-31 |
US20190367952A1 (en) | 2019-12-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110072997A (zh) | 具有蛋白酶活性的多肽和编码其的多核苷酸 | |
CN103946378B (zh) | 葡糖淀粉酶变体和编码它们的多核苷酸 | |
DK2521774T3 (en) | ALFA AMYLASE VARANTS AND POLYNUCLEOTIDES CODING THESE | |
US11427811B2 (en) | Polypeptides having pullulanase activity suitable for use in liquefaction | |
CN103857794B (zh) | α‑淀粉酶变体以及编码它们的多核苷酸 | |
CN103890170B (zh) | α‑淀粉酶变体以及对其进行编码的多核苷酸 | |
CN105339494A (zh) | 普鲁兰酶嵌合体以及编码它们的多核苷酸 | |
CN105164253A (zh) | 葡糖淀粉酶变体和编码它们的多核苷酸 | |
EP2748189B1 (en) | Polypeptides having alpha-amylase activity and polynucleotides encoding same | |
EP3720956B1 (en) | Alpha-amylase variants and polynucleotides encoding same | |
CN108350444A (zh) | 丝氨酸蛋白酶用于改进乙醇产量的用途 | |
WO2020187883A1 (en) | Polypeptides having pullulanase activity suitable for use in liquefaction | |
CN110651048A (zh) | 葡糖淀粉酶变体和编码它们的多核苷酸 | |
EP3487995A1 (en) | Serine protease variants and polynucleotides encoding same | |
WO2016196202A1 (en) | Polypeptides having protease activity and polynucleotides encoding same | |
WO2016087327A1 (en) | Polypeptides having pullulanase activity comprising the x25, x45 and cbm41 domains | |
CN102482658B (zh) | 具有异淀粉酶活性的多肽和编码该多肽的多核苷酸 | |
WO2014202793A1 (en) | Production of polypeptides without secretion signal in bacillus | |
US20200248159A1 (en) | Polypeptides having protease activity and polynucleotides encoding same | |
CN108603203A (zh) | 低泡发酵方法 | |
CN105121638A (zh) | 葡糖淀粉酶变体和编码它们的多核苷酸 | |
US20180208916A1 (en) | Alpha-amylase variants and polynucleotides encoding same | |
CN110177873A (zh) | α-淀粉酶变体以及对其进行编码的多核苷酸 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190730 |