CN111770996A - 突变β-葡萄糖苷酶 - Google Patents
突变β-葡萄糖苷酶 Download PDFInfo
- Publication number
- CN111770996A CN111770996A CN201980015241.4A CN201980015241A CN111770996A CN 111770996 A CN111770996 A CN 111770996A CN 201980015241 A CN201980015241 A CN 201980015241A CN 111770996 A CN111770996 A CN 111770996A
- Authority
- CN
- China
- Prior art keywords
- gly
- ala
- leu
- val
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108010047754 beta-Glucosidase Proteins 0.000 title claims abstract description 108
- 102000006995 beta-Glucosidase Human genes 0.000 title claims abstract description 108
- 230000000694 effects Effects 0.000 claims abstract description 75
- 239000002028 Biomass Substances 0.000 claims abstract description 66
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims abstract description 17
- 235000009582 asparagine Nutrition 0.000 claims abstract description 17
- 229960001230 asparagine Drugs 0.000 claims abstract description 17
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims abstract description 12
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 102
- 238000000034 method Methods 0.000 claims description 54
- 108091033319 polynucleotide Proteins 0.000 claims description 50
- 102000040430 polynucleotide Human genes 0.000 claims description 50
- 239000002157 polynucleotide Substances 0.000 claims description 50
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 29
- 239000013598 vector Substances 0.000 claims description 29
- 229920001184 polypeptide Polymers 0.000 claims description 28
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 28
- 238000004519 manufacturing process Methods 0.000 claims description 24
- 238000001179 sorption measurement Methods 0.000 claims description 24
- 125000000539 amino acid group Chemical group 0.000 claims description 22
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 16
- 235000000346 sugar Nutrition 0.000 claims description 16
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 12
- 238000006467 substitution reaction Methods 0.000 claims description 10
- 239000004472 Lysine Substances 0.000 claims description 7
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 7
- 239000004473 Threonine Substances 0.000 claims description 7
- 239000003795 chemical substances by application Substances 0.000 claims description 7
- 235000013922 glutamic acid Nutrition 0.000 claims description 7
- 239000004220 glutamic acid Substances 0.000 claims description 7
- 241000894006 Bacteria Species 0.000 claims description 6
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 claims description 6
- 239000007858 starting material Substances 0.000 claims 1
- 150000001413 amino acids Chemical group 0.000 abstract description 12
- 101001017764 Homo sapiens Lipopolysaccharide-responsive and beige-like anchor protein Proteins 0.000 description 91
- 102100033353 Lipopolysaccharide-responsive and beige-like anchor protein Human genes 0.000 description 91
- 101000586271 Solea senegalensis Osteocalcin 1 Proteins 0.000 description 91
- 102000004190 Enzymes Human genes 0.000 description 72
- 108090000790 Enzymes Proteins 0.000 description 72
- 229940088598 enzyme Drugs 0.000 description 72
- 108090000623 proteins and genes Proteins 0.000 description 36
- 108010059892 Cellulase Proteins 0.000 description 31
- 108020004414 DNA Proteins 0.000 description 31
- 241000499912 Trichoderma reesei Species 0.000 description 21
- 239000000243 solution Substances 0.000 description 21
- 229940106157 cellulase Drugs 0.000 description 20
- 239000000203 mixture Substances 0.000 description 20
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 19
- 108010037850 glycylvaline Proteins 0.000 description 19
- 239000002773 nucleotide Chemical group 0.000 description 17
- 125000003729 nucleotide group Chemical group 0.000 description 17
- 230000035772 mutation Effects 0.000 description 16
- 238000006243 chemical reaction Methods 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- 229920005610 lignin Polymers 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 14
- 102000004169 proteins and genes Human genes 0.000 description 14
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 13
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 13
- 235000001014 amino acid Nutrition 0.000 description 13
- 108010026333 seryl-proline Proteins 0.000 description 13
- 241000880493 Leptailurus serval Species 0.000 description 12
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 12
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 12
- 229940024606 amino acid Drugs 0.000 description 12
- 108010047857 aspartylglycine Proteins 0.000 description 12
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 12
- 108010010147 glycylglutamine Proteins 0.000 description 12
- 108010079364 N-glycylalanine Proteins 0.000 description 11
- 108010077245 asparaginyl-proline Proteins 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 235000018102 proteins Nutrition 0.000 description 11
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 10
- 241000228215 Aspergillus aculeatus Species 0.000 description 10
- 101150027351 BGL gene Proteins 0.000 description 10
- 241000193830 Bacillus <bacterium> Species 0.000 description 10
- 102000005575 Cellulases Human genes 0.000 description 10
- 108010084185 Cellulases Proteins 0.000 description 10
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- 241000223261 Trichoderma viride Species 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 229920002678 cellulose Polymers 0.000 description 10
- 239000001913 cellulose Substances 0.000 description 10
- 108010081551 glycylphenylalanine Proteins 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 108010029020 prolylglycine Proteins 0.000 description 10
- 238000010298 pulverizing process Methods 0.000 description 10
- 241000228245 Aspergillus niger Species 0.000 description 9
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 9
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 9
- 108010012058 leucyltyrosine Proteins 0.000 description 9
- 244000005700 microbiome Species 0.000 description 9
- 241000122821 Aspergillus kawachii Species 0.000 description 8
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 8
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 8
- 241001480714 Humicola insolens Species 0.000 description 8
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 8
- 241000223259 Trichoderma Species 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 239000003513 alkali Substances 0.000 description 8
- 108010038633 aspartylglutamate Proteins 0.000 description 8
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 8
- 108010092114 histidylphenylalanine Proteins 0.000 description 8
- 239000002609 medium Substances 0.000 description 8
- 230000028327 secretion Effects 0.000 description 8
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 7
- 238000002741 site-directed mutagenesis Methods 0.000 description 7
- 239000007974 sodium acetate buffer Substances 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- 230000014616 translation Effects 0.000 description 7
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 6
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 6
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 6
- 108010076441 Ala-His-His Proteins 0.000 description 6
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 6
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 6
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 6
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 6
- 241000609240 Ambelania acida Species 0.000 description 6
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 6
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 6
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 6
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 6
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 6
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 6
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 6
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 6
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 6
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 6
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 6
- 101100111428 Aspergillus niger (strain CBS 513.88 / FGSC A1513) bglA gene Proteins 0.000 description 6
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 6
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 6
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 6
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 6
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 6
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 6
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 6
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 6
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 6
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 6
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 6
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 6
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 6
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 6
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 6
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 6
- QLQDIJBYJZKQPR-BQBZGAKWSA-N Gly-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN QLQDIJBYJZKQPR-BQBZGAKWSA-N 0.000 description 6
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 6
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 6
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 6
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 6
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 6
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 6
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 6
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 6
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 6
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 6
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 6
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 6
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 6
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 6
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 6
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 6
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 6
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 6
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 6
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 6
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 6
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 6
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 6
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 6
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 6
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 6
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 6
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 6
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 6
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 6
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 6
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 6
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 6
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 6
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 6
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 6
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 6
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 6
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 6
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 6
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 6
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 6
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 6
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 6
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 6
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 6
- FRMFMFNMGQGMNB-BVSLBCMMSA-N Tyr-Pro-Trp Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FRMFMFNMGQGMNB-BVSLBCMMSA-N 0.000 description 6
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 6
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 6
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 6
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 6
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010036533 arginylvaline Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 239000010905 bagasse Substances 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 6
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 6
- 108010002430 hemicellulase Proteins 0.000 description 6
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 6
- 108010079317 prolyl-tyrosine Proteins 0.000 description 6
- 210000001938 protoplast Anatomy 0.000 description 6
- 239000000725 suspension Substances 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 6
- 239000002023 wood Substances 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 5
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 5
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 5
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 5
- 241000228212 Aspergillus Species 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 5
- 229920002488 Hemicellulose Polymers 0.000 description 5
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 5
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 5
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 5
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 5
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- 241000282849 Ruminantia Species 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 5
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 5
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 5
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 5
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 239000008103 glucose Substances 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 229940059442 hemicellulase Drugs 0.000 description 5
- 102000039446 nucleic acids Human genes 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 5
- 150000007523 nucleic acids Chemical class 0.000 description 5
- 229920000136 polysorbate Polymers 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- SRVJKTDHMYAMHA-WUXMJOGZSA-N thioacetazone Chemical compound CC(=O)NC1=CC=C(\C=N\NC(N)=S)C=C1 SRVJKTDHMYAMHA-WUXMJOGZSA-N 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 4
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 4
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 4
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 4
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 4
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 4
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 4
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 4
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 4
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 4
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 4
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 4
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 4
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 4
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 4
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 4
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 4
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 4
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 4
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 4
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 4
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 4
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 4
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 4
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 4
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 4
- CGWHAXBNGYQBBK-JBACZVJFSA-N Glu-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)C1=CC=C(O)C=C1 CGWHAXBNGYQBBK-JBACZVJFSA-N 0.000 description 4
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 4
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 4
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 4
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 4
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 4
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 4
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 4
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 4
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 4
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 4
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 4
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 4
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 4
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 4
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 4
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 4
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 4
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 4
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 4
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 4
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 4
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 4
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 4
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 4
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 4
- 241000223257 Thermomyces Species 0.000 description 4
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 4
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 4
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 4
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 4
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 4
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 4
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 4
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 4
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 4
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 4
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 4
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 239000012228 culture supernatant Substances 0.000 description 4
- 108010038658 exo-1,4-beta-D-xylosidase Proteins 0.000 description 4
- 239000013613 expression plasmid Substances 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 3
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 3
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 3
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 3
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 3
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 3
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 3
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 3
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 3
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 3
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 3
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 3
- 229920003043 Cellulose fiber Polymers 0.000 description 3
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 3
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 3
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 3
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 3
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 3
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 3
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 3
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 3
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 3
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 3
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 3
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 3
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 3
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 3
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 3
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 3
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 3
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 3
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 3
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 3
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 3
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 3
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 3
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 3
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 3
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 3
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 3
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 3
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 3
- 241000187747 Streptomyces Species 0.000 description 3
- 241000228178 Thermoascus Species 0.000 description 3
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 3
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 3
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 3
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 3
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 3
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 3
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 3
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 3
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical class N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000006911 enzymatic reaction Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 230000003301 hydrolyzing effect Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 239000002244 precipitate Substances 0.000 description 3
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 3
- 230000000644 propagated effect Effects 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 2
- 240000006439 Aspergillus oryzae Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000194103 Bacillus pumilus Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 101710099628 Beta-glucosidase 1 Proteins 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- 241000186321 Cellulomonas Species 0.000 description 2
- 241000193403 Clostridium Species 0.000 description 2
- 229910021580 Cobalt(II) chloride Inorganic materials 0.000 description 2
- 241000195493 Cryptophyta Species 0.000 description 2
- GUBGYTABKSRVRQ-CUHNMECISA-N D-Cellobiose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-CUHNMECISA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 240000003133 Elaeis guineensis Species 0.000 description 2
- 235000001950 Elaeis guineensis Nutrition 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 2
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 2
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 2
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 2
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 2
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 2
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 2
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 241000218652 Larix Species 0.000 description 2
- 235000005590 Larix decidua Nutrition 0.000 description 2
- 241000207925 Leonurus Species 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- 241000186781 Listeria Species 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 2
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 2
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 241000959173 Rasamsonia emersonii Species 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 241000235527 Rhizopus Species 0.000 description 2
- 240000005384 Rhizopus oryzae Species 0.000 description 2
- 235000013752 Rhizopus oryzae Nutrition 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 2
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- 241000191940 Staphylococcus Species 0.000 description 2
- 241000203640 Thermomonospora Species 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 2
- LHNNQVXITHUCAB-QTKMDUPCSA-N Thr-Met-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O LHNNQVXITHUCAB-QTKMDUPCSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 2
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 2
- WCTYCXZYBNKEIV-SXNHZJKMSA-N Trp-Glu-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 WCTYCXZYBNKEIV-SXNHZJKMSA-N 0.000 description 2
- GFUOTIPYXKAPAH-BVSLBCMMSA-N Trp-Pro-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GFUOTIPYXKAPAH-BVSLBCMMSA-N 0.000 description 2
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 2
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 2
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- -1 and for this purpose Substances 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- NGPGDYLVALNKEG-UHFFFAOYSA-N azanium;azane;2,3,4-trihydroxy-4-oxobutanoate Chemical compound [NH4+].[NH4+].[O-]C(=O)C(O)C(O)C([O-])=O NGPGDYLVALNKEG-UHFFFAOYSA-N 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 229910052927 chalcanthite Inorganic materials 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- YPHMISFOHDHNIV-FSZOTQKASA-N cycloheximide Chemical compound C1[C@@H](C)C[C@H](C)C(=O)[C@@H]1[C@H](O)CC1CC(=O)NC(=O)C1 YPHMISFOHDHNIV-FSZOTQKASA-N 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 229910052564 epsomite Inorganic materials 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000000706 filtrate Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000011565 manganese chloride Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 229910052603 melanterite Inorganic materials 0.000 description 2
- 238000002515 oligonucleotide synthesis Methods 0.000 description 2
- 239000000123 paper Substances 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 2
- 229910000368 zinc sulfate Inorganic materials 0.000 description 2
- 239000011686 zinc sulphate Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- IFBHRQDFSNCLOZ-RMPHRYRLSA-N 4-nitrophenyl beta-D-glucoside Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=CC=C([N+]([O-])=O)C=C1 IFBHRQDFSNCLOZ-RMPHRYRLSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 101710152845 Arabinogalactan endo-beta-1,4-galactanase Proteins 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 241000223651 Aureobasidium Species 0.000 description 1
- 241000193375 Bacillus alcalophilus Species 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 241000186146 Brevibacterium Species 0.000 description 1
- 241000178334 Caldicellulosiruptor Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 244000060011 Cocos nucifera Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 244000301850 Cupressus sempervirens Species 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 108010001817 Endo-1,4-beta Xylanases Proteins 0.000 description 1
- 101710147028 Endo-beta-1,4-galactanase Proteins 0.000 description 1
- 101710112457 Exoglucanase Proteins 0.000 description 1
- 101710098246 Exoglucanase 2 Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- 108090000856 Lyases Proteins 0.000 description 1
- 102000004317 Lyases Human genes 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 239000008118 PEG 6000 Substances 0.000 description 1
- 101100434648 Petromyzon marinus SDS-1 gene Proteins 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- 229920002584 Polyethylene Glycol 6000 Polymers 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 229920001131 Pulp (paper) Polymers 0.000 description 1
- 241000522615 Pyrococcus horikoshii Species 0.000 description 1
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241000605036 Selenomonas Species 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 1
- 241001215623 Talaromyces cellulolyticus Species 0.000 description 1
- 241000203780 Thermobifida fusca Species 0.000 description 1
- 108090001109 Thermolysin Proteins 0.000 description 1
- 241000204652 Thermotoga Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000011111 cardboard Substances 0.000 description 1
- 101150052795 cbh-1 gene Proteins 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 125000001547 cellobiose group Chemical group 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000010485 coping Effects 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 235000012343 cottonseed oil Nutrition 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- MTHSVFCYNBDYFN-UHFFFAOYSA-N diethylene glycol Chemical compound OCCOCCO MTHSVFCYNBDYFN-UHFFFAOYSA-N 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000007515 enzymatic degradation Effects 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 238000005227 gel permeation chromatography Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000010335 hydrothermal treatment Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 208000024191 minimally invasive lung adenocarcinoma Diseases 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 108010038196 saccharide-binding proteins Proteins 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- VWDWKYIASSYTQR-UHFFFAOYSA-N sodium nitrate Inorganic materials [Na+].[O-][N+]([O-])=O VWDWKYIASSYTQR-UHFFFAOYSA-N 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 229910052938 sodium sulfate Inorganic materials 0.000 description 1
- 235000011152 sodium sulphate Nutrition 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000010902 straw Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 150000004823 xylans Chemical class 0.000 description 1
- 150000008495 β-glucosides Chemical class 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2434—Glucanases acting on beta-1,4-glucosidic bonds
- C12N9/2445—Beta-glucosidase (3.2.1.21)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/14—Fungi; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/02—Monosaccharides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/14—Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01021—Beta-glucosidase (3.2.1.21)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Physics & Mathematics (AREA)
- Mycology (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Botany (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Cell Biology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供一种能够更高效地将生物质糖化的突变β‑葡萄糖苷酶。所述突变β‑葡萄糖苷酶包含如下的氨基酸序列,其为与序列编号1至少80%序列相同的氨基酸序列,并且在选自对应于序列编号1的787位、790位、和797位的位置中的至少1个位置具有天冬酰胺,并且所述突变β‑葡萄糖苷酶具有β‑葡萄糖苷酶活性。
Description
技术领域
本发明涉及一种突变β-葡萄糖苷酶。
背景技术
已知有从含有纤维素的生物质材料(以下有时称为“生物质”)中的纤维素制造糖,将其通过发酵法等转换为以乙醇为首的能源或化学品的技术。从应对近年来的环境问题的观点考虑,关于生物质在工业上的利用的各种技术开发得到发展,利用生物质的燃料或化学品的大规模制造也得到了实现。
生物质由纤维素纤维与将其包围并主要含有木聚糖的半纤维素及木质素构成。在以生物质为原料的糖的制造中,提高纤维素、半纤维素的糖化效率是重要的,为此,需要将纤维素、半纤维素水解的纤维素酶、半纤维素酶等的生物质糖化酶。例如,为了将纤维素高效地分解为葡萄糖,至少需要(1)从结晶性和非结晶性的纤维素纤维的末端以纤维二糖单元将糖切出的纤维二糖水解酶(CBH)、(2)对非结晶性纤维素发挥作用并在纤维素链导入切口的内切葡聚糖酶(EG)、和(3)将由这些酶生成的纤维二糖水解而生成葡萄糖的β-葡萄糖苷酶(BGL)的3种纤维素酶协调作用。
作为产生用于生物质的糖化的纤维素酶的菌,有名的是木霉属(Trichoderma)微生物,例如里氏木霉(Trichoderma reesei)或绿色木霉(Trichoderma viride)。然而,使用了源自木霉属微生物的纤维素酶的生物质糖化中,被指出β-葡萄糖苷酶活性相对较低(非专利文献1)。开发了在木霉属微生物中表达棘孢曲霉(Aspergillus aculeatus)的β-葡萄糖苷酶来提高源自木霉属微生物的酶的生物质糖化活性的技术(例如非专利文献1、专利文献1)。
生物质的酶分解时,生物质糖化酶吸附于作为底物的纤维素或半纤维素。这样的生物质糖化酶对生物质底物的吸附被称为生产性吸附。另一方面,生物质糖化酶也吸附于未被该酶分解的成分,例如木质素、灰分、其它成分等,这被称为非特异性吸附。用糖化酶将生物质糖化之后的残渣成分主要为木质素,因此,生物质糖化酶对木质素的非特异性吸附可能成为使该酶活性降低的原因。报道了木质素的存在使黑曲霉(Aspergillus niger)的β-葡萄糖苷酶的活性降低(非专利文献2、非专利文献3、非专利文献4)。
报道了在木质素存在下的酶活性提高的生物质糖化酶。在专利文献2中,公开了源自嗜热单胞菌(Thermobifida fusca)等微生物的纤维二糖水解酶(CBH)II中,通过取代为带负电荷的氨基酸、除去带正电荷的氨基酸等的改变,降低了向非纤维素原材料的吸附。在专利文献3中,公开了降低了由木质素引起的失活的里氏木霉家族6纤维素酶的突变体。在专利文献4中,公开了改变连接肽而提高木质素存在下的活性和/或抑制与木质素的结合的纤维素酶突变体。在专利文献5中,公开了提高木质素存在下的活性和/或抑制与木质素的结合的里氏木霉Cel6A的碳水化合物结合模块(CBM)的突变体。
(专利文献1)国际公开公报第2013/115305号
(专利文献2)日本特表2011-523854号公报
(专利文献3)国际公开公报第2010/012102号
(专利文献4)国际公开公报第2010/096931号
(专利文献5)国际公开公报第2011/097713号
(非专利文献1)森川康、生物质分解酶研究的前沿、2012、CMC出版、p10-19
(非专利文献2)J Biol Chem,2013,288(46):32991-33005
(非专利文献3)Analyst,2009,134(11):2267-2272
(非专利文献4)Biotechnol Prog,2007,23(2):398-406
发明内容
本发明提供一种选自下述(i)和(ii)的突变β-葡萄糖苷酶,
(i)由在序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列中选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基被取代为天冬酰胺的氨基酸序列构成,并且具有β-葡萄糖苷酶活性的多肽;
(ii)包含该(i)的多肽,并且具有β-葡萄糖苷酶活性的多肽。
另外,本发明提供一种编码该突变β-葡萄糖苷酶的多核苷酸。
另外,本发明提供一种含有该多核苷酸的载体。
另外,本发明提供一种包含该多核苷酸或载体的转化体。
另外,本发明提供一种含有该突变β-葡萄糖苷酶的生物质糖化剂。
另外,本发明提供一种糖的制造方法,其包括用该突变β-葡萄糖苷酶将生物质糖化的步骤。
另外,本发明提供一种突变β-葡萄糖苷酶的制造方法,其中,所述方法包括:在包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽中,将选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基取代为天冬酰胺的步骤。
附图说明
图1:与糖化残渣反应后的突变β-葡萄糖苷酶的相对活性。白色条柱:酶浓度5.6μg/mL,黑色条柱:酶浓度55.8μg/mL。
具体实施方式
在本说明书中,“氨基酸残基”是指构成蛋白质的20种氨基酸残基,丙氨酸(Ala或A)、精氨酸(Arg或R)、天冬酰胺(Asn或N)、天冬氨酸(Asp或D)、半胱氨酸(Cys或C)、谷酰胺(Gln或Q)、谷氨酸(Glu或E)、甘氨酸(Gly或G)、组氨酸(His或H)、异亮氨酸(Ile或I)、亮氨酸(Leu或L)、赖氨酸(Lys或K)、蛋氨酸(Met或M)、苯丙氨酸(Phe或F)、脯氨酸(Pro或P)、丝氨酸(Ser或S)、苏氨酸(Thr或T)、色氨酸(Trp或W)、酪氨酸(Tyr或Y)和缬氨酸(Val或V)。
在本说明书中,氨基酸序列和核苷酸序列间的同一性能够通过李普曼-皮尔逊法(Lipman-Pearson法;Science,1985,227:1435-41)来计算。具体而言,能够通过使用遗传信息处理软件Genetyx-Win(Ver.5.1.1;软件开发)的同源性分析(Search homology)程序,将Unit size to compare(ktup)设为2进行分析来算出。
在本说明书中,关于氨基酸序列和核苷酸序列记载的“至少80%的同一性”是指80%以上、优选为85%以上、更优选为90%以上、进一步优选为95%以上、进一步优选为98%以上、进一步优选为99%以上、进一步优选为99.5%以上、进一步优选为99.6%以上、进一步优选为99.7%以上、进一步优选为99.8%以上的同一性。
在本说明书中,关于氨基酸的缺失、取代或添加的“1~多个”是指优选为1~160个、更优选为1~80个、进一步优选为1~40个、进一步优选为1~20个、进一步优选为1~10个、进一步优选为1~5个。
在本说明书中,氨基酸序列和核苷酸序列上的“对应的位置”能够通过将目标序列与参照序列(例如,序列编号1所示的氨基酸序列)以对各氨基酸序列或核苷酸序列中存在的保守氨基酸残基或核苷酸赋予最大的相同性的方式排列(比对)来确定。比对能够使用公知的算法来执行,其步骤是本领域技术人员所公知。例如,比对能够通过以默认设定使用Clustal W多重比对程序(Nucleic Acids Res,1994,22:4673-4680)来进行。或者,也能够使用Clustal W的修订版的Clustal W2或Clustal omega。Clustal W、Clustal W2和Clustal omega例如能够在欧洲生物信息研究所(European Bioinformatics Institute:EBI[www.ebi.ac.uk/index.html])或国立遗传学研究所运营的日本DNA数据库(DDBJ[www.ddbj.nig.ac.jp/Welcome-j.html])的网站上利用。
只要是本领域技术人员,就能够将上述得到的比对根据需要以成为最优的比对的方式进行微调。这样的最优比对优选考虑氨基酸序列的相似性、插入的间隙的频度等来确定。这里,氨基酸序列的相似性是指将2个氨基酸序列进行比对时该双方的序列中相同或相似的氨基酸残基存在的位置的个数相对于全长氨基酸残基数的比例(%)。相似的氨基酸残基是指构成蛋白质的20种氨基酸之中在极性、电荷的方面具有相互相似的性质,如发生所谓的保守性取代的氨基酸残基。由这样的相似的氨基酸残基构成的组是本领域技术人员所熟知的,例如,可以分别列举精氨酸和赖氨酸;谷氨酸和天冬氨酸;丝氨酸和苏氨酸;谷酰胺和天冬酰胺;缬氨酸、亮氨酸和异亮氨酸等,但不限定于这些。
通过上述的比对在与参照序列的任意的位置相对应的位置对准的目标序列的氨基酸残基或核苷酸的位置可以看作与该任意的位置“对应的位置”,这里的该氨基酸残基或核苷酸称为“对应的位置的氨基酸残基”或“对应的位置的核苷酸”。
在本说明书中,“β-葡萄糖苷酶”(或者也称为“BGL”)是指具有β-葡萄糖苷酶活性的多肽。在本说明书中,“β-葡萄糖苷酶活性”是指水解糖的β-糖苷键的活性,优选是指水解纤维二糖而生成葡萄糖的活性。蛋白质的β-葡萄糖苷酶活性能够通过利用例如pNP(对硝基苯酚,p-Nitrophenol)法测定从4-硝基苯基-β-D-吡喃葡糖苷由于酶降解而游离的pNP的量来确定。β-葡萄糖苷酶活性的测定的具体步骤在后述的实施例中详述。
在本说明书中,“生物质”是指包含植物、藻类生产的纤维素的纤维素类生物质。作为生物质的具体例子,可以列举选自:由落叶松或落羽杉等的针叶树或油棕(树干部)、柏树等的阔叶树等得到的各种木材;木屑等的木材的加工物或粉碎物;由木材制造的木浆、由棉籽周围的纤维得到的棉短绒纸浆等的纸浆类;报纸、瓦楞纸板、杂志、打印纸等的纸类;蔗渣(甘蔗的果渣)、棕榈空果束(EFB)、稻草、玉米秸秆或叶等植物的茎、叶、果簇等;稻壳、棕榈壳、椰子壳等的植物壳类;藻类等中的一种以上。这些之中,从获取容易性和原料成本的观点考虑,优选木材、木材的加工物或粉碎物、植物的茎、叶、果簇等,更优选蔗渣、EFB、油棕(树干部),更优选蔗渣。该生物质可以单独使用或混合2种以上使用。另外,上述生物质也可以被干燥。
在本说明书中,“糖化残渣”是指将生物质中的纤维素、半纤维素用糖化酶糖化后的残留固体成分。作为糖化残渣的成分,例如可以列举木质素。酶的“糖化残渣吸附性”的高低例如能够通过使糖化残渣和酶反应,使酶吸附于该糖化残渣后,测定反应液的上清液中残留的该酶的活性,求出相对于没有与该糖化残渣反应时的酶活性的相对活性而算出。例如,糖化残渣吸附率能够以以下的式子表示。
糖化残渣吸附率(%)=100-相对活性(%)
相对活性(%)=(与糖化残渣反应后的酶试样的酶活性/未与糖化残渣反应的酶试样的酶活性)×100
上述相对活性越低的酶,具有越高的糖化残渣吸附性。作为糖化残渣,例如,能够使用公知的纤维素酶或纤维素酶制剂(例如,Novozymes公司制造的Cellic(注册商标)CTec2)将碱混合粉碎蔗渣在50℃下糖化24小时后的固体残渣、或木质素。糖化残渣与酶的反应中,例如能够采用在50℃下处理1小时。用于酶的糖化残渣吸附性的测定的具体步骤的例子在后述的实施例中详述。
本发明提供一种能够更高效地将生物质糖化的β-葡萄糖苷酶突变体。
本发明者们发现了非特异性吸附性低,在生物质糖化反应中能够维持高的活性的β-葡萄糖苷酶突变体。
本发明的突变β-葡萄糖苷酶的非特异性吸附性低,即使在生物质糖化反应中产生的糖化残渣存在下也能够维持高的活性。因此,本发明的突变β-葡萄糖苷酶在生物质糖化反应中发挥高的β-葡萄糖苷酶活性。通过使用本发明的突变β-葡萄糖苷酶,能够提高生物质的糖化效率。
本发明提供一种突变β-葡萄糖苷酶。本发明的突变β-葡萄糖苷酶能够通过改变具有序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列的β-葡萄糖苷酶,将规定的位置的氨基酸残基取代为天冬酰胺来制造。本发明的突变β-葡萄糖苷酶与该取代前的β-葡萄糖苷酶(亲本β-葡萄糖苷酶)相比,糖化残渣(例如木质素)吸附性(或非特异性吸附性)较低,在糖化残渣存在下酶活性也不易降低。另外,本发明的突变β-葡萄糖苷酶与亲本β-葡萄糖苷酶相比,相对于生物质的β-葡萄糖苷酶活性提高。
因此,在一个实施方式中,本发明提供一种突变β-葡萄糖苷酶,包含如下的氨基酸序列,其为与序列编号1所示的氨基酸序列至少80%相同的氨基酸序列,并且在选自对应于序列编号1的787位、790位、和797位的位置中的至少1个位置具有天冬酰胺,并且该突变β-葡萄糖苷酶具有β-葡萄糖苷酶活性。优选与序列编号1所示的氨基酸序列至少80%相同的氨基酸序列为与序列编号1所示的氨基酸序列90%以上相同的氨基酸序列。
在优选的实施方式中,本发明的突变β-葡萄糖苷酶选自下述(i)和(ii):
(i)由在序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列中选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基被取代为天冬酰胺的氨基酸序列构成,并且具有β-葡萄糖苷酶活性的多肽;
(ii)包含该(i)的多肽,并且具有β-葡萄糖苷酶活性的多肽。
优选具有与序列编号1所示的氨基酸序列至少80%的同一性的氨基酸序列为与序列编号1所示的氨基酸序列90%以上相同的氨基酸序列。
在其它的实施方式中,本发明提供一种突变β-葡萄糖苷酶的制造方法。该方法包括:在包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽中,将选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基取代为天冬酰胺的步骤。
优选具有与序列编号1所示的氨基酸序列至少80%的同一性的氨基酸序列为与序列编号1所示的氨基酸序列90%以上相同的氨基酸序列。
在本发明的突变β-葡萄糖苷酶或其制造方法中,作为该序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列的优选例子,可以列举序列编号1~3中任一者所示的氨基酸序列;序列编号4~6中任一者所示的氨基酸序列;和对于序列编号1~6中任一者所示的氨基酸序列缺失、取代或添加1~多个的氨基酸得到的(其中,与序列编号1至少80%、优选为90%以上相同的)氨基酸序列。
在本说明书中,有时将进行上述氨基酸取代前的、包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性(其中,对应于序列编号1的787位、790位和797位的位置的氨基酸残基不是天冬酰胺)的氨基酸序列的β-葡萄糖苷酶称为本发明的突变葡萄糖苷酶的亲本β-葡萄糖苷酶(或简单称为亲本β-葡萄糖苷酶或亲本BGL)。
作为该亲本BGL的例子,可以列举由序列编号1所示的氨基酸序列构成的棘孢曲霉(Aspergillus aculeatus)的β-葡萄糖苷酶1(在本说明书中也称为AaBGL1,GenBank:BAA10968.1、UniProtKB/Swiss-Prot:P48825.1)。
作为该亲本BGL的其它例子,可以列举由与序列编号1所示的氨基酸序列具有至少80%的同一性的氨基酸序列构成,并且具有β-葡萄糖苷酶活性的多肽。作为这样的多肽的例子,可以列举由与序列编号1所示的氨基酸序列具有至少80%的同一性的氨基酸序列构成的源自曲霉属的BGL,作为优选的例子,可以列举与序列编号1的氨基酸序列82.9%序列相同的源自黑曲霉(A.niger)CBS 513.88的BGL即An18g03570(序列编号2、GenBank:XP_001398816.1)、和与序列编号1的氨基酸序列82.5%序列相同的源自白曲霉(A.kawachii或A.awamori var.kawachi)NBRC4308的BGL(序列编号3、GenBank:BAA19913.1)。
作为该亲本BGL的另外其它例子,可以列举上述的AaBGL1、An18g03570、或源自白曲霉的BGL(序列编号1~3)的突变体。作为这样的突变体,可以列举由对序列编号1~3中任一者所示的氨基酸序列缺失、取代或添加1~多个的氨基酸得到的(其中,与序列编号1至少80%相同、优选为90%以上相同的)氨基酸序列构成的β-葡萄糖苷酶。这样的突变体可以为天然生成的突变体,也可以人工制作。
作为该亲本BGL的另外其它例子,可以列举连结有上述的AaBGL1、An18g03570、源自白曲霉的BGL(序列编号1~3)、或它们的突变体与分泌信号序列得到的BGL前蛋白。作为该前蛋白的优选例子,可以列举分别由序列编号4~6所示的氨基酸序列构成的、具有分泌信号序列的AaBGL1、An18g03570和源自白曲霉的BGL的前蛋白。作为该前蛋白的其它例子,可以列举由对序列编号4~6中任一者所示的氨基酸序列缺失、取代或添加1~多个的氨基酸得到的(其中,与序列编号1至少80%相同、优选为90%以上相同的)氨基酸序列构成的BGL前蛋白。
优选上述的亲本BGL在对应于序列编号1所示的氨基酸序列的787位的位置具有赖氨酸,在对应于790位的位置具有谷氨酸,并且在对应于797位的位置具有苏氨酸。
本发明的突变BGL例如能够通过使编码本发明的突变BGL的多核苷酸表达来生产。优选本发明的突变BGL能够由导入有编码该突变BGL的多核苷酸的转化体生产。即,将编码本发明的突变BGL的多核苷酸或含有其的载体导入宿主得到转化体之后,对该转化体用适当的培养基进行培养,使导入的多核苷酸表达,由此生产本发明的突变BGL。将生产的突变BGL从该培养物分离或精制,由此,能够获得本发明的突变BGL。
因此,本发明还提供一种编码本发明的突变BGL的多核苷酸、和含有其的载体。本发明还提供一种转化体的制造方法,其包括将编码本发明的突变BGL的多核苷酸或含有其的载体导入宿主的步骤。另外,本发明提供一种含有该多核苷酸或载体的转化体。另外,本发明提供一种突变BGL的制造方法,其包括培养该转化体的步骤。
编码本发明的突变BGL的多核苷酸可以包含单链或双链的DNA、cDNA、RNA或其它的人工核酸。该DNA、cDNA和RNA也可以化学合成。另外,该本发明的多核苷酸除了开放阅读框(ORF)之外,还可以包含非翻译区(UTR)的核苷酸序列。
编码本发明的突变BGL的多核苷酸能够基于该突变BGL的氨基酸序列基因工程或化学地合成。例如,编码本发明的突变木聚糖酶的多核苷酸能够通过在编码上述的亲本BGL的多核苷酸(以下,也称为亲本BGL基因)中将编码选自对应于序列编号1的787位、790位和797位的位置中的至少1个位置的氨基酸残基的核苷酸序列(密码子)突变为编码天冬酰胺的核苷酸序列(密码子)来制备。通过表达该突变的多核苷酸,能够得到取代对象的氨基酸残基被天冬酰胺取代的突变BGL。
作为亲本BGL基因的例子,可以列举编码上述的序列编号1所示的AaBGL1的多核苷酸、编码序列编号2所示的An18g03570的多核苷酸、和编码源自序列编号3所示的白曲霉的BGL的多核苷酸、以及编码具有这些的分泌信号序列的前蛋白(序列编号4~6)的多核苷酸等。作为优选例子,可以列举编码AaBGL1(序列编号1)或其前蛋白(序列编号4)的多核苷酸(序列编号7)。这些多核苷酸能够通过该领域中使用的任意的方法获得。例如,编码序列编号1所示的氨基酸序列的多核苷酸能够通过提取棘孢曲霉的全基因组DNA后,利用使用了基于序列编号7的序列设计的引物的PCR选择性地扩增目标核酸,对扩增的核酸进行精制而获得。
或者,作为亲本BGL基因的例子,可以列举编码由与序列编号1所示的氨基酸序列至少80%相同的氨基酸序列构成的上述亲本BGL的多核苷酸。作为这样的多核苷酸的例子,可以列举编码上述的AaBGL1、An18g03570或源自白曲霉的BGL(序列编号1~3)的突变体的多核苷酸。亲本BGL基因可以是天然存在的基因,也可以基于序列编号1~3的序列人工制作。例如,对编码序列编号1~3的氨基酸序列的基因(例如序列编号7)通过如紫外线照射或位点特异性突变导入这样的公知的突变导入法导入突变,调查所得到的突变基因编码的多肽的β-葡萄糖苷酶活性。通过选择编码具有所期望的活性的多肽的基因,能够得到亲本BGL基因。根据需要,能够通过测序等确认基因序列。这样的突变的方法是本领域技术人员公知的。
向亲本BGL基因导入目标的突变能够使用本领域技术人员公知的各种位点特异性突变导入法来进行。位点特异性突变导入法例如能够通过反向PCR法或退火法等(村松等编辑、《修订第4版新基因工程手册》、羊土公司、p.82-88)的任意的方法来进行。也能够根据需要,使用Stratagene公司的QuickChange II定点突变试剂盒(Site-Directed MutagenesisKit)、QuickChange多点突变试剂盒(Multi Site-Directed Mutagenesis Kit)等各种市售的位点特异性突变导入用试剂盒。或者,向亲本BGL基因的位点特异性突变导入能够通过SOE(通过重叠延伸的剪接,splicing by overlap extension)-PCR法(Horton R.M.et al,Gene,1989,77(1):61-68)、大引物PCR法(megaprimer method)等公知的方法进行。
例如,向亲本BGL基因的位点特异性突变导入能够使用包含合适导入的核苷酸突变的突变引物来进行。这样的突变引物可以以对包含编码亲本BGL基因内的取代对象的氨基酸残基的核苷酸序列的区域进行退火,并且代替编码该取代对象的氨基酸残基的核苷酸序列(密码子)而包含具有编码天冬酰胺的核苷酸序列(密码子)的核苷酸序列的方式进行设计。编码取代对象和取代后的氨基酸残基的核苷酸序列(密码子)是本领域技术人员能够基于通常的教科书等适当识别并选择的。
突变引物能够通过亚磷酰胺法(Nucleic Acids Research,1989,17:7059-7071)等公知的寡核苷酸合成法制作。另外,突变引物例如也能够使用市售的寡核苷酸合成装置(ABI公司制等)来制作。通过使用包含突变引物的引物对,以亲本BGL基因为模板DNA进行如上所述的位点特异性突变导入,能够得到编码目标的突变BGL的多核苷酸。
包含编码本发明的突变BGL的多核苷酸的载体能够通过将该基因导入载体来制作。作为合适导入该多核苷酸的载体的种类,没有特别限定,可以列举蛋白质生产所通常使用的载体,例如质粒、粘粒、噬菌体、病毒、YAC、BAC等。其中,优选质粒载体,更优选诱导蛋白质的高表达的质粒载体。本领域技术人员能够根据宿主细胞的种类来选择合适的载体。蛋白质表达用质粒载体可以根据宿主来制作,也可以使用市售品。作为载体的例子,可以列举酵母表达载体pNAN8142(Biosci Biotechnol Biochem,1996,60:383-389)、pMA91(BiosciBiotechnol Biochem,1998,62:1615-1618)等。
该载体可以含有包含DNA的复制起始区域的DNA片段或包含复制起点的DNA区域。或者,在该载体中,也可以在编码上述本发明的突变BGL的多核苷酸可操作地连结有启动子区域、终止子区域、或用于将所表达的蛋白质向细胞外分泌的分泌信号区域等控制区域。或者,也可以进一步导入有用于选择适当导入了该载体的宿主的标记基因(例如,氨苄青霉素、新霉素、卡那霉素、氯霉素等药剂的耐性基因)。或者,在将营养缺陷型株用于导入本发明的载体的宿主时,也可以使用包含编码所要求的营养的基因的载体。
作为该控制区域的优选例子,可以列举P-No8142启动子(Biosci BiotechnolBiochem,1996,60:383-389)、源自里氏木霉(Trichoderma reesei)的cbh1启动子序列(Curr Genet,1995,28(1):71-79)等。或者,也可以使用表达纤维二糖水解酶、内切葡聚糖酶、β-葡萄糖苷酶、木聚糖酶、β-木糖苷酶等糖化酶的启动子。或者,也可以使用丙酮酸脱羧酶、醇脱氢酶、丙酮酸激酶等的代谢途径的酶的启动子。
编码本发明的突变BGL的序列与上述控制区域、标记基因序列的连结能够通过上述的SOE-PCR法等方法进行。向载体导入基因序列的操作方法在该领域是公知的。启动子区域、终止子、分泌信号区域等的控制区域的种类没有特别限定,能够根据导入的宿主适当选择并使用通常所使用的启动子、分泌信号序列。
含有编码本发明的突变BGL的多核苷酸或包含其的载体的转化体只要将该载体导入宿主或将该多核苷酸导入宿主的基因组即可。作为向宿主导入载体的方法,能够使用原生质体法、电穿孔法等该领域通常使用的方法。通过以标记基因的表达、营养缺陷等为指标选择适当进行过导入的株,能够得到导入了载体的目标转化体。
作为将编码本发明的突变BGL的多核苷酸导入宿主的基因组的方法,没有特别限定,例如,可以列举使用了包含该多核苷酸的DNA片段的双重交叉法。该DNA片段可以在上述的宿主细胞中导入到表达量多的基因的启动子序列的下游,或者,也可以预先制作将该DNA片段和上述的控制区域可操作地连结而成的片段,将该连结片段导入宿主的基因组。另外,该DNA片段也可以预先与用于选择适当导入有本发明的多核苷酸的细胞的标记(药剂耐性基因、营养缺陷型互补基因等)连结。
在本说明书中,编码BGL的多核苷酸与控制区域“可操作地连结”是指该多核苷酸和控制区域以由该多核苷酸编码的BGL可以在该控制区域的控制下表达的方式配置。
作为该转化体的宿主的例子,可以列举酵母、丝状菌、细菌等的微生物。作为酵母的例子,可以列举米根霉(Rhizopus oryzae)、酿酒酵母(Saccharomyces cerevisiae)、毕赤酵母(Pichia pastoris)等。作为细菌的例子,可以列举大肠杆菌(Escherichia coli)、属于葡萄球菌属(Staphylococcus)、肠球菌属(Enterococcus)、李斯特菌属(Listeria)、芽孢杆菌属(Bacillus)的细菌等,其中,优选大肠杆菌和芽孢杆菌属细菌(例如,枯草芽孢杆菌或其突变株)。作为枯草芽孢杆菌突变株的例子,能够列举J Biosci Bioeng,2007,104(2):135-143中记载的蛋白酶9重缺陷株KA8AX、以及Biotechnol Lett,2011,33(9):1847-1852中记载的蛋白酶8重缺陷株中提高了蛋白质的折叠效率的D8PA株。作为丝状菌的例子,可以列举木霉属(Trichoderma)、曲霉属(Aspergillus)、根霉属(Rhizopus)等,其中,从酶生产性的观点考虑,优选木霉属。
如果使用适当的培养基培养将这样得到的导入有编码本发明的突变BGL的多核苷酸或包含其的载体的转化体,则该多核苷酸表达,生成本发明的突变BGL。本领域技术人员能够根据该转化体的微生物的种类适当选择该转化体的培养中使用的培养基。
或者,本发明的突变BGL也可以使用无细胞翻译系统,从编码本发明的突变BGL的多核苷酸或其转录产物表达。“无细胞翻译系统”是在将成为宿主的细胞机械破坏得到的悬浮液中添加蛋白质的翻译所需的氨基酸等的试剂,构建体外(in vitro)转录翻译系统或体外(in vitro)翻译系统。
上述培养物或无细胞翻译系统中生成的本发明的突变BGL能够通过单独使用或适当组合使用蛋白质精制中所使用的通常的方法,例如离心分离、硫酸铵沉淀、凝胶层析、离子交换层析、亲和层析等来分离或精制。此时,在转化体内的载体上可操作地连结有编码突变BGL的多核苷酸和分泌信号序列时,生成的BGL被分泌到菌体外,因此,能够更容易地从培养物回收。从培养物回收的BGL可以通过公知的方法进一步进行精制。
如后述的实施例所示的那样,本发明的突变β-葡萄糖苷酶与突变前的β-葡萄糖苷酶(亲本BGL)相比,糖化残渣(例如木质素)吸附性(或非特异性吸附性)较低,在糖化残渣存在下活性不易降低。并且,本发明的突变β-葡萄糖苷酶与亲本BGL相比,对于生物质的β-葡萄糖苷酶活性提高(图1、表2)。因此,本发明的突变β-葡萄糖苷酶作为生物质糖化用的酶是有用的。
因此,本发明进一步提供一种含有本发明的突变β-葡萄糖苷酶的生物质糖化剂。本发明还提供一种糖的制造方法,其包括用本发明的突变β-葡萄糖苷酶将生物质糖化的步骤。
本发明的生物质糖化剂优选为含有本发明的突变β-葡萄糖苷酶的生物质糖化用的酶组合物(以下,有时也称为本发明的酶组合物)。本发明的酶组合物含有本发明的突变BGL,并且从糖化效率的提高的观点考虑,优选还含有本发明的突变BGL以外的其它生物质糖化酶。该其它生物质糖化酶可以是源自动物、植物、微生物的酶。作为该其它生物质糖化酶的例子,可以列举内切葡聚糖酶、外切葡聚糖酶或纤维二糖水解酶、本发明的突变BGL以外的BGL等本发明的突变BGL以外的其它纤维素酶;或木聚糖酶、木糖苷酶、半乳聚糖酶等的半纤维素酶等,优选为本发明的突变BGL以外的其它纤维素酶,更优选为选自纤维二糖水解酶和内切葡聚糖酶中的1种以上。这些生物质糖化酶可以单独使用、也可以组合2种以上使用。从提高生物质糖化效率的观点考虑,本发明的酶组合物优选含有选自纤维二糖水解酶和内切葡聚糖酶中的1种以上。
作为本发明的酶组合物中所含的本发明的突变BGL以外的其它纤维素酶的具体例子,不限定于这些,可以列举源自里氏木霉(Trichoderma reesei)的纤维素酶;源自绿色木霉(Trichoderma viride)的纤维素酶;源自芽孢杆菌(Bacillus sp.)KSM-N145(FERM P-19727)、芽孢杆菌(Bacillus sp.)KSM-N252(FERM P-17474)、芽孢杆菌(Bacillus sp.)KSM-N115(FERM P-19726)、芽孢杆菌(Bacillus sp.)KSM-N440(FERM P-19728)、芽孢杆菌(Bacillus sp.)KSM-N659(FERM P-19730)等各种芽孢杆菌株的纤维素酶;源自极端嗜热古菌(Pyrococcus horikoshii)的耐热性纤维素酶;源自特异腐质霉(Humicola insolens)的纤维素酶等。这些之中,从提高糖化效率的观点考虑,优选源自里氏木霉、绿色木霉、或特异腐质霉的纤维素酶。另外,也可以使用使对上述的微生物外来性地导入的纤维素酶基因表达得到的重组纤维素酶。作为具体例子,可以列举对里氏木霉导入源自棘孢曲霉的β-葡萄糖苷酶基因得到的X3AB1株(J Ind Microbiol Biotechnol,2012,1741-1749)产生的纤维素酶JN11。或者,也可以在本发明的酶组合物中含有包含上述其它纤维素酶的纤维素酶制剂,与本发明的突变BGL并用。作为该纤维素酶制剂的具体例子,可以列举Celluclast(注册商标)1.5L(Novozymes公司制造)、TP-60(明治制果株式会社制造)、Cellic(注册商标)CTec2(Novozymes公司制造)、AccelleraseTMDUET(Genencor Inc.制造)、和Ultraflo(注册商标)L(Novozymes公司制造)等。
作为本发明的酶组合物中所含的纤维二糖水解酶的具体例子,可以列举源自里氏木霉、绿色木霉、或特异腐质霉的纤维二糖水解酶、源自极端嗜热古菌的耐热性纤维二糖水解酶等。这些之中,从提高糖化效率的观点考虑,优选源自里氏木霉、绿色木霉、或特异腐质霉的纤维二糖水解酶,更优选源自里氏木霉的纤维二糖水解酶。
作为本发明的酶组合物中所含的内切葡聚糖酶的具体例子,可以列举源自里氏木霉、解纤维枝顶孢霉(Acremonium celluloriticus)、特异腐质霉、热纤梭菌(Clostridiumthermocellum)、芽孢杆菌(Bacillus)、高温双歧菌(Thermobifida)、纤维单胞菌(Cellulomonas)的酶等。其中,从提高糖化效率的观点考虑,优选源自里氏木霉、特异腐质霉、芽孢杆菌、纤维单胞菌的内切葡聚糖酶,更优选源自里氏木霉的内切葡聚糖酶。
作为本发明的酶组合物中所含的本发明的突变BGL以外的BGL的例子,可以列举源自黑曲霉的BGL(例如,Novozymes公司制造的Novozyme 188或Megazyme公司制造的BGL)、以及源自里氏木霉或埃摩森青霉菌(Penicillium emersonii)的BGL等。其中,从提高生物质糖化效率的观点考虑,优选Novozyme 188、源自里氏木霉的BGL,更优选源自里氏木霉的BGL。
作为本发明的酶组合物中所含的半纤维素酶的例子,可以列举源自里氏木霉的半纤维素酶;源自芽孢杆菌KSM-N546(FERM P-19729)的木聚糖酶;源自黑曲霉、绿色木霉、特异腐质霉或嗜碱芽孢杆菌(Bacillus alcalophilus)的木聚糖酶;源自嗜热真菌(Thermomyces)、短柄霉(Aureobasidium)、链霉菌(Streptomyces)、梭状杆菌(Clostridium)、栖热孢菌(Thermotoga)、热子囊菌(Thermoascus)、解糖热纤维菌(Caldocellum)、或高温单胞菌(Thermomonospora)属的木聚糖酶;源自短小芽孢杆菌(Bacillus pumilus)的β-木糖苷酶;源自反刍兽月形单胞菌(Selenomonas ruminantium)的β-木糖苷酶等。其中,从提高糖化效率的观点考虑,优选源自芽孢杆菌、黑曲霉、绿色木霉或链霉菌的木聚糖酶、或源自反刍兽月形单胞菌的β-木糖苷酶,更优选源自芽孢杆菌或绿色木霉的木聚糖酶、或源自反刍兽月形单胞菌的β-木糖苷酶。或者,作为优选的半纤维素酶的例子,可以列举日本特开2013-243953号公报、日本特开2013-243954号公报、日本特开2015-167552号公报、日本特开2016-119877号公报、日本特开2017-012006号公报、或日本特开2017-035001号公报中记载的突变木聚糖酶。
本发明的生物质糖化剂(或本发明的酶组合物)中的本发明的突变BGL的含量在其总蛋白质量中优选为0.5质量%以上、更优选为1质量%以上、进一步优选为2质量%以上,并且优选为70质量%以下、更优选为50质量%以下、进一步优选为40质量%以下、更进一步优选为30质量%以下,或者在总蛋白质量中优选为0.5~70质量%、更优选为1~50质量%、进一步优选为2~40质量%、更进一步优选为2~30质量%。优选本发明的生物质糖化剂(或本发明的酶组合物)的总蛋白质量为3~25质量%。
本发明的酶组合物中的本发明的突变BGL以外的其它纤维素酶的含量在其总蛋白质量中优选为10质量%以上、更优选为30质量%以上、进一步优选为50质量%以上,并且优选为99质量%以下、更优选为95质量%以下,或者,在其总蛋白质量中优选为10~99质量%、更优选为30~95质量%、进一步优选为50~95质量%。
本发明的酶组合物中的内切葡聚糖酶的含量在其总蛋白质量中优选为1质量%以上、更优选为5质量%以上、进一步优选为10质量%以上,并且优选为70质量%以下、更优选为50质量%以下、进一步优选为40质量%以下,或者,在其总蛋白质量中优选为1~70质量%、更优选为5~50质量%、进一步优选为10~40质量%。
本发明的酶组合物中的半纤维素酶的含量在其总蛋白质量中优选为0.01质量以上%、更优选为0.1质量%以上、进一步优选为0.5以上质量%,并且优选为30质量%以下、更优选为20质量%以下,或者,在其总蛋白质量中优选为0.01~30质量%、更优选为0.1~20质量%、进一步优选为0.5~20质量%。
本发明的糖的制造方法包括用上述本发明的突变BGL将生物质糖化的步骤。在该方法中,作为本发明的突变BGL,也可以使用上述的本发明的酶组合物。该本发明的方法中的糖化处理的条件只要是不使本发明的突变BGL和并用的其它酶失活的条件即可,没有特别限定。本领域技术人员能够根据生物质的种类、前处理工序的步骤、使用的酶的种类适当确定适当的条件。
在该糖化处理中,优选在含有生物质的悬浮液中添加本发明的突变BGL。从提高糖化效率或糖生产效率(即缩短糖生产时间)的观点考虑,该悬浮液中的生物质的含量优选为0.5~20质量%、更优选为3~15质量%、进一步优选为5~10质量%。
本发明的突变BGL相对于该悬浮液的使用量能够根据生物质的种类、形状和量、以及并用的酶的种类和性质等来适当确定。优选相对于生物质质量,突变BGL的质量为0.04~600质量%、更优选为0.1~100质量%、进一步优选为0.1~50质量%。
从提高糖化效率或糖生产效率和降低生产成本的观点考虑,该糖化处理的反应pH优选为pH4~9、更优选为pH5~8、进一步优选为pH5~7。从提高糖化效率、提高糖化效率或糖生产效率、以及降低生产成本的观点考虑,该糖化处理的反应温度优选为20~90℃、更优选为25~85℃、进一步优选为30~80℃、进一步优选为40~75℃、进一步优选为45~65℃、进一步优选为45~60℃、进一步优选为50~60℃。该糖化处理的反应时间能够根据生物质的种类、形状和量、酶量等适当设定。从提高糖化效率或糖生产效率、以及降低生产成本的观点考虑,优选为1~5天、更优选为1~4天、进一步优选为1~3天。
另外,从提高生物质的糖化效率或糖生产效率的观点考虑,本发明的糖的制造方法优选在将生物质用本发明的突变BGL糖化之前,还包括对该生物质进行前处理的工序。作为该前处理,例如,可以列举选自碱处理、粉碎处理和水热处理中的1种以上。作为该前处理,从提高糖化效率的观点考虑,优选碱处理,从进一步提高糖化效率的观点考虑,优选进行碱处理和粉碎处理,更优选并行地进行碱处理和粉碎处理。该粉碎处理可以为湿式粉碎、也可以为干式粉碎,优选干式粉碎。更优选将固体的碱和生物质一起施加粉碎处理,并与碱处理并行地进行干式粉碎(碱混合粉碎处理)。
作为本发明的例示性实施方式,在本说明书中还公开了以下的物质、制造方法、用途、方法等。但是,本发明不限定于这些实施方式。
[1]一种突变β-葡萄糖苷酶,其中,选自下述(i)和(ii),
(i)由在序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列中选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基被取代为天冬酰胺的氨基酸序列构成,并且具有β-葡萄糖苷酶活性的多肽;
(ii)包含该(i)的多肽,并且具有β-葡萄糖苷酶活性的多肽。
[2]如[1]所述的突变β-葡萄糖苷酶,其中,优选所述序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列为:
序列编号1~6中任一者所示的氨基酸序列、或者
对序列编号1~6中任一者所示的氨基酸序列缺失、取代或添加1~多个的氨基酸得到的氨基酸序列。
[3]如[1]或[2]所述的突变β-葡萄糖苷酶,其中,优选与亲本β-葡萄糖苷酶相比,糖化残渣吸附性较低。
[4]如[1]~[3]中任一项所述的突变β-葡萄糖苷酶,其中,优选与亲本β-葡萄糖苷酶相比,对于生物质的β-葡萄糖苷酶活性提高。
[5]如[3]或[4]所述的突变β-葡萄糖苷酶,其中,优选所述亲本β-葡萄糖苷酶由与序列编号1所示的氨基酸序列至少80%相同的氨基酸序列构成,并且在对应于序列编号1所示的氨基酸序列的787位的位置具有赖氨酸,在对应于790位的位置具有谷氨酸,并在对应于797位的位置具有苏氨酸。
[6]一种多核苷酸,其中,编码该[1]~[5]中任一项所述的突变β-葡萄糖苷酶。
[7]一种载体,其中,含有该[6]所述的多核苷酸。
[8]一种转化体,其中,包含该[6]所述的多核苷酸或该[7]所述的载体。
[9]如[8]所述的转化体,其中,优选为丝状菌。
[10]一种生物质糖化剂,其中,含有该[1]~[5]中任一项所述的突变β-葡萄糖苷酶。
[11]一种糖的制造方法,其中,包括用该[1]~[5]中任一项所述的突变β-葡萄糖苷酶将生物质糖化的步骤。
[12]一种突变β-葡萄糖苷酶的制造方法,其中,所述方法包括:在包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽中,将选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基取代为天冬酰胺的步骤。
[13]如[12]所述的方法,其中,所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽在对应于序列编号1所示的氨基酸序列的787位的位置具有赖氨酸,在对应于790位的位置具有谷氨酸,并在对应于797位的位置具有苏氨酸。
[14]如[12]或[13]所述的方法,其中,优选所述突变β-葡萄糖苷酶与所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽相比,糖化残渣吸附性较低。
[15]如[12]~[14]中任一项所述的方法,其中,优选所述突变β-葡萄糖苷酶与所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽相比,对于生物质的β-葡萄糖苷酶活性提高。
[16]如[12]~[15]中任一项所述的方法,其中,优选所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽由序列编号1~6中任一者所示的氨基酸序列;或者对序列编号1~6中任一者所示的氨基酸序列缺失、取代或添加1~多个的氨基酸得到的氨基酸序列构成。
实施例
以下,利用实施例对本发明进行更详细地说明,但是本发明的技术范围不受该实施例限定。
实施例1突变BGL表达质粒的构建
构建编码对棘孢曲霉的β-葡萄糖苷酶1(AaBGL1;序列编号1)导入有以下的突变的突变体的表达质粒:S641T、R738T、R745N、K787N、E790N、V793N、T797N、D807T、V814N、S826N。
通过PCR-megaprimer法,对编码具有分泌信号序列的AaBGL1前蛋白的基因(AaBGL1基因、序列编号7)导入突变。即,使用表1所记载的各突变导入用引物和BGL1-R引物,以棘孢曲霉No.F-50株基因组DNA为模板,使用PrimeSTAR HS DNA Polymerase并按照所附的操作说明书,从AaBGL1基因的突变导入部位扩增下游侧的基因。扩增片段通过琼脂糖凝胶电泳分离并回收。将得到的扩增片段作为大引物(megaprimer)与BGL1-F引物一起使用,以F-50株基因组DNA为模板,以同样的方法进行PCR,扩增导入有突变的全长AaBGL1基因。将所得到的DNA片段在源自引物的Not I和Sph I位点切断,插入到丝状菌表达载体pNAN8142(Biosci Biotechnol Biochem,1996,60:383-389)的同位点,由此构建表达质粒。通过测序确认了在DNA片段中正确导入了突变。
[表1]
实施例2突变BGL表达株的制作
转化体的制作按照Gomi等的方法(Agric Biol Chem,1987,51:2549-2555)进行。即,向在MM(NH4 +)平板培养基(1.0%葡萄糖、0.3%酒石酸铵、0.13%KCl、0.13%MgSO4·7H2O、0.38%KH2PO4、0.00011%Mo7O24·4H2O、0.00011%H3BO3、0.00016%CoCl2·6H2O、0.00016%CuSO4·5H2O、0.005%EDTA、0.0005%FeSO4·7H2O、0.0005%MnCl2·4H2O、0.0022%ZnSO4·7H2O、pH6.5)中繁殖的米曲霉(A.oryzae)niaD300株中添加5mL的Tween/saline solution(0.1%(w/v)Tween(注册商标)80、0.01%NaCl),使用涂抹器将胞子悬浮。将该胞子悬浮液加入到MM(NH4 +)液体培养基200mL(500mL容量的带挡板的三角烧瓶)中,以30℃、160rpm振荡培养一晚。使其适度繁殖,结束培养,在Miracloth上收集菌体,用原生质体缓冲液(Protoplasting buffer,以下称为PB,0.8M NaCl、10mM NaH2PO4)清洗。将回收的菌体加入到50mL容量离心管中,在加入有Yatalase(Takara Bio Inc.制造)30mg、裂解酶(Lysing enzyme,Sigma-Ardrich Co.制造)50mg的PB 10mL中悬浮,一边缓慢振荡一边孵育(30℃、90min)。另外,每30分钟通过移液将菌体分散。其后,为了仅回收原生质体,用Miracloth进行过滤,对得到的滤液进行离心分离(4℃、2000rpm、5min)。将沉淀悬浮于Transformation buffer I(以下、TB I;0.8M NaCl、10mM Tris-HCl[pH7.5]、50mM CaCl2)10mL中,进行离心分离(4℃、2000rpm、5min)。其后,除去上清液,将沉淀悬浮于200μL的TB I后的混合物作为原生质体溶液。通过显微镜确认原生质体数,将大约107个/mL的原生质体用于此后的操作。
在含有实施例1中得到的表达质粒DNA约10μg的溶液中加入等量的2×TB I,将其添加到原生质体溶液中。进一步添加其0.2倍量的Transformation buffer II(以下、TBII;50%PEG6000、50mM Tris-HCl[pH7.5]、50mM CaCl2),缓慢混合并静置于冰上10min。其后添加1mL的TB II,在室温下静置15min。接着,添加10mL的TB I,进行离心分离(4℃、2000rpm、5min)。除去上清液,将沉淀悬浮于200μL的TB I,置于再生培养基(Regenerationmedium)(以下、RE;1.0%葡萄糖、0.3%NaNO3、4.68%NaCl、0.13%KCl、0.13%MgSO4·7H2O、0.38%KH2PO4、0.00011%Mo7O24·4H2O、0.00011%H3BO3、0.00016%CoCl2·6H2O、0.00016%CuSO4·5H2O、0.005%EDTA、0.0005%FeSO4·7H2O、0.0005%MnCl2·4H2O、0.0022%ZnSO4·7H2O、pH6.5),层叠顶层琼脂(RE、0.7%琼脂)。得到的转化体为了纯化核而进行单一(Monochrome)化。即,准备2个添加有200μL Tween/saline solution的微型管,从繁殖转化体的RE培养基板将附着于铂环的前端的转化体的胞子悬浮于一个Tween/saline solution中。从其中取2μL,与另一个Tween/saline solution混合,由此稀释100倍。从该溶液中将100μL铺展于MM(NO3 —、0.1%Triton X-100)(添加NaNO3来代替MM(NH4 +)的酒石酸铵,并进一步添加Triton X-100而成的)平板培养基中,在30℃下静置培养3~4天。在其中从繁殖的菌落中选择1株,分离在MM(NO3 —)平板培养基中。将得到的转化体在5mL的MM(NO3 —)液体培养基中在30℃下培养4天。
实施例3突变BGL的精制
从实施例2中得到的突变BGL表达株精制突变BGL酶,用pNP(对硝基苯酚,p-Nitrophenol)法测定其BGL活性。将培养上清液稀释,制备酶溶液。将1.5mM pNP-Glc溶液(100mM醋酸钠缓冲液、pH5.0)作为底物溶液。在37℃下孵育了5分钟的100μL的酶溶液中等量混合1.5mM的底物溶液,由此,开始酶反应。在37℃下反应10分钟后,添加2mL的1M Na2CO3溶液使反应停止。其后,测定405nm的吸光度,使用pNP的吸光系数(ε405nm=0.0185mL/nmol·cm-1)算出游离的pNP浓度,求出酶活性。在空白中,使用在酶溶液100μL中依次添加1MNa2CO3 2mL、底物溶液100μL的不发生酶反应的溶液。将1分钟中使1μmol的pNP游离的酶量定义为1Unit,通过以下的式子算出培养上清液的酶活性。
酶活性(Unit/mL)=A405/18.5×2.2mL/(10min)×(1/0.1mL)×培养上清液的稀释率
其结果,包含S641T、R738T、R745N、K787N、E790N、V793N、T797N、D807T、V814N、或S826N突变体的上清液显示BGL活性。
接着,从确认有BGL活性的培养上清液精制突变BGL。精制按照Baba等(AMBExpress,2015,5:3)的方法,按照以下的步骤进行。将各突变酶高表达株以5%Glc、1.5%Na2NO3作为单一碳源、作为氮源的MM(NO3 —)液体培养基1,200mL(200mL×6)培养3天之后,用布氏漏斗收集菌体。用5L的离子交换水清洗后,将菌体悬浮于1200mL的Releasing buffer(0.02%叠氮化钠、1mM PMSF、10μg/mL环己酰亚胺、20mM醋酸钠缓冲液、pH5.0)中,以30℃、160rpm振荡2天。振荡后,使用布氏漏斗回收滤液,作为粗酶液。使粗酶吸附于用20mM醋酸钠缓冲液(pH5.0)进行过平衡化的DEAE TOYOPEARL 650M,用4L的20mM醋酸钠缓冲液(pH5.0)清洗,接着用0-0.3M NaCl溶液(20mM醋酸钠缓冲液、pH5.0)1L的线性梯度进行洗脱。回收A280值处显示主峰的洗脱组分,提供给SDS-PAGE,确认有对应于AaBGL1的130kDa的条带的存在。接着,在回收的组分中以成为30%饱和的方式添加硫酸铵(Ammonium sulfate),使其吸附于预先用30%饱和硫酸铵(20mM醋酸钠缓冲液、pH5.0)平衡化的Butyl TOYOPEARL650M,用30-0%饱和硫酸铵溶液(20mM醋酸钠缓冲液、pH5.0)1L的线性梯度洗脱。回收A280值处显示主峰的洗脱组分,提供给SDS-PAGE,确认有对应于AaBGL1的130kDa的条带的存在。
实施例4向糖化残渣的吸附试验
对于实施例3中经过精制的突变BGL,调查向糖化残渣的吸附性。将碱处理蔗渣2g和源自里氏木霉PC-3-7株的纤维素酶培养液10mg(以成为5mg/g生物质的方式添加、Bradford法)溶解于含有0.2mg/mL叠氮化钠的40mL 50mM醋酸钠缓冲液(pH5.0)中,在50℃下使其反应24小时。将反应液离心分离(10,000rpm、4℃、10min),回收沉淀。将回收的沉淀用蒸馏水40mL悬浮,再次进行离心分离并清洗。将清洗后的沉淀悬浮于36mL的蒸馏水中,由此,制备纤维素酶糖化碱处理蔗渣的糖化残渣(约90%v/v)。对该糖化残渣400μL添加20mM醋酸钠缓冲液(pH5.0)50μL、以及5.6或55.8μg/mL的野生型AaBGL1(WT)或突变酶50μL,在50℃下一边以160rpm振荡1小时一边进行酶反应。反应后,将反应液离心分离(15,000rpm、4℃、5min)并回收上清液。以与实施例3同样的步骤使用pNP法测定回收的上清液的残留BGL活性,求出将不添加糖化残渣的反应液上清液的残留BGL活性设为100%时的相对活性(%)。
在图1中表示结果。调查的突变BGL中,K787N、E790N和T797N突变体与野生型AaBGL1(WT)相比,残留活性较高,向糖化残渣的吸附降低。
对于这些K787N、E790N和T797N突变体,调查了比活性。使用实施例3中精制的酶,以与实施例3同样的步骤通过pNP法测定其BGL活性,通过以下的式子算出酶的比活性。
比活性(Unit/mg)=酶活性(Unit/mL)/蛋白质浓度(mg/mL)
任一突变体与野生型相比,比活性均提高(表2)。因此,表示这些突变BGL与野生型酶相比,向糖化残渣的吸附性低,在糖化残渣存在下活性不易降低,因此,能够在生物质中发挥更高的β-葡萄糖苷酶活性。
[表2]
酶 | 比活性(U/mg) |
WT | 128 |
K787N | 201 |
E790N | 194 |
T797N | 198 |
以上,对本发明的实施方式进行了说明,但是这些应当理解为不意图将本发明限定于说明的特定的实施方式。本发明的范围内的各种其它变更和修正对本领域技术人员来说是显而易见的。本说明书中引用的文献和专利申请如在本说明书中完全记载的那样作为参考而引用。
序列表
<110> 花王株式会社
<120> 突变β-葡萄糖苷酶
<130> KS1610
<150> JP 2018-031809
<151> 2018-02-26
<160> 19
<170> PatentIn version 3.5
<210> 1
<211> 841
<212> PRT
<213> 棘孢曲霉
<400> 1
Asp Glu Leu Ala Phe Ser Pro Pro Phe Tyr Pro Ser Pro Trp Ala Asn
1 5 10 15
Gly Gln Gly Glu Trp Ala Glu Ala Tyr Gln Arg Ala Val Ala Ile Val
20 25 30
Ser Gln Met Thr Leu Asp Glu Lys Val Asn Leu Thr Thr Gly Thr Gly
35 40 45
Trp Glu Leu Glu Lys Cys Val Gly Gln Thr Gly Gly Val Pro Arg Leu
50 55 60
Asn Ile Gly Gly Met Cys Leu Gln Asp Ser Pro Leu Gly Ile Arg Asp
65 70 75 80
Ser Asp Tyr Asn Ser Ala Phe Pro Ala Gly Val Asn Val Ala Ala Thr
85 90 95
Trp Asp Lys Asn Leu Ala Tyr Leu Arg Gly Gln Ala Met Gly Gln Glu
100 105 110
Phe Ser Asp Lys Gly Ile Asp Val Gln Leu Gly Pro Ala Ala Gly Pro
115 120 125
Leu Gly Arg Ser Pro Asp Gly Gly Arg Asn Trp Glu Gly Phe Ser Pro
130 135 140
Asp Pro Ala Leu Thr Gly Val Leu Phe Ala Glu Thr Ile Lys Gly Ile
145 150 155 160
Gln Asp Ala Gly Val Val Ala Thr Ala Lys His Tyr Ile Leu Asn Glu
165 170 175
Gln Glu His Phe Arg Gln Val Ala Glu Ala Ala Gly Tyr Gly Phe Asn
180 185 190
Ile Ser Asp Thr Ile Ser Ser Asn Val Asp Asp Lys Thr Ile His Glu
195 200 205
Met Tyr Leu Trp Pro Phe Ala Asp Ala Val Arg Ala Gly Val Gly Ala
210 215 220
Ile Met Cys Ser Tyr Asn Gln Ile Asn Asn Ser Tyr Gly Cys Gln Asn
225 230 235 240
Ser Tyr Thr Leu Asn Lys Leu Leu Lys Ala Glu Leu Gly Phe Gln Gly
245 250 255
Phe Val Met Ser Asp Trp Gly Ala His His Ser Gly Val Gly Ser Ala
260 265 270
Leu Ala Gly Leu Asp Met Ser Met Pro Gly Asp Ile Thr Phe Asp Ser
275 280 285
Ala Thr Ser Phe Trp Gly Thr Asn Leu Thr Ile Ala Val Leu Asn Gly
290 295 300
Thr Val Pro Gln Trp Arg Val Asp Asp Met Ala Val Arg Ile Met Ala
305 310 315 320
Ala Tyr Tyr Lys Val Gly Arg Asp Arg Leu Tyr Gln Pro Pro Asn Phe
325 330 335
Ser Ser Trp Thr Arg Asp Glu Tyr Gly Phe Lys Tyr Phe Tyr Pro Gln
340 345 350
Glu Gly Pro Tyr Glu Lys Val Asn His Phe Val Asn Val Gln Arg Asn
355 360 365
His Ser Glu Val Ile Arg Lys Leu Gly Ala Asp Ser Thr Val Leu Leu
370 375 380
Lys Asn Asn Asn Ala Leu Pro Leu Thr Gly Lys Glu Arg Lys Val Ala
385 390 395 400
Ile Leu Gly Glu Asp Ala Gly Ser Asn Ser Tyr Gly Ala Asn Gly Cys
405 410 415
Ser Asp Arg Gly Cys Asp Asn Gly Thr Leu Ala Met Ala Trp Gly Ser
420 425 430
Gly Thr Ala Glu Phe Pro Tyr Leu Val Thr Pro Glu Gln Ala Ile Gln
435 440 445
Ala Glu Val Leu Lys His Lys Gly Ser Val Tyr Ala Ile Thr Asp Asn
450 455 460
Trp Ala Leu Ser Gln Val Glu Thr Leu Ala Lys Gln Ala Ser Val Ser
465 470 475 480
Leu Val Phe Val Asn Ser Asp Ala Gly Glu Gly Tyr Ile Ser Val Asp
485 490 495
Gly Asn Glu Gly Asp Arg Asn Asn Leu Thr Leu Trp Lys Asn Gly Asp
500 505 510
Asn Leu Ile Lys Ala Ala Ala Asn Asn Cys Asn Asn Thr Ile Val Val
515 520 525
Ile His Ser Val Gly Pro Val Leu Val Asp Glu Trp Tyr Asp His Pro
530 535 540
Asn Val Thr Ala Ile Leu Trp Ala Gly Leu Pro Gly Gln Glu Ser Gly
545 550 555 560
Asn Ser Leu Ala Asp Val Leu Tyr Gly Arg Val Asn Pro Gly Ala Lys
565 570 575
Ser Pro Phe Thr Trp Gly Lys Thr Arg Glu Ala Tyr Gly Asp Tyr Leu
580 585 590
Val Arg Glu Leu Asn Asn Gly Asn Gly Ala Pro Gln Asp Asp Phe Ser
595 600 605
Glu Gly Val Phe Ile Asp Tyr Arg Gly Phe Asp Lys Arg Asn Glu Thr
610 615 620
Pro Ile Tyr Glu Phe Gly His Gly Leu Ser Tyr Thr Thr Phe Asn Tyr
625 630 635 640
Ser Gly Leu His Ile Gln Val Leu Asn Ala Ser Ser Asn Ala Gln Val
645 650 655
Ala Thr Glu Thr Gly Ala Ala Pro Thr Phe Gly Gln Val Gly Asn Ala
660 665 670
Ser Asp Tyr Val Tyr Pro Glu Gly Leu Thr Arg Ile Ser Lys Phe Ile
675 680 685
Tyr Pro Trp Leu Asn Ser Thr Asp Leu Lys Ala Ser Ser Gly Asp Pro
690 695 700
Tyr Tyr Gly Val Asp Thr Ala Glu His Val Pro Glu Gly Ala Thr Asp
705 710 715 720
Gly Ser Pro Gln Pro Val Leu Pro Ala Gly Gly Gly Ser Gly Gly Asn
725 730 735
Pro Arg Leu Tyr Asp Glu Leu Ile Arg Val Ser Val Thr Val Lys Asn
740 745 750
Thr Gly Arg Val Ala Gly Asp Ala Val Pro Gln Leu Tyr Val Ser Leu
755 760 765
Gly Gly Pro Asn Glu Pro Lys Val Val Leu Arg Lys Phe Asp Arg Leu
770 775 780
Thr Leu Lys Pro Ser Glu Glu Thr Val Trp Thr Thr Thr Leu Thr Arg
785 790 795 800
Arg Asp Leu Ser Asn Trp Asp Val Ala Ala Gln Asp Trp Val Ile Thr
805 810 815
Ser Tyr Pro Lys Lys Val His Val Gly Ser Ser Ser Arg Gln Leu Pro
820 825 830
Leu His Ala Ala Leu Pro Lys Val Gln
835 840
<210> 2
<211> 841
<212> PRT
<213> 黑曲霉
<400> 2
Asp Glu Leu Ala Tyr Ser Pro Pro Tyr Tyr Pro Ser Pro Trp Ala Asn
1 5 10 15
Gly Gln Gly Asp Trp Ala Glu Ala Tyr Gln Arg Ala Val Asp Ile Val
20 25 30
Ser Gln Met Thr Leu Ala Glu Lys Val Asn Leu Thr Thr Gly Thr Gly
35 40 45
Trp Glu Leu Glu Leu Cys Val Gly Gln Thr Gly Gly Val Pro Arg Leu
50 55 60
Gly Ile Pro Gly Met Cys Ala Gln Asp Ser Pro Leu Gly Val Arg Asp
65 70 75 80
Ser Asp Tyr Asn Ser Ala Phe Pro Ala Gly Val Asn Val Ala Ala Thr
85 90 95
Trp Asp Lys Asn Leu Ala Tyr Leu Arg Gly Gln Ala Met Gly Gln Glu
100 105 110
Phe Ser Asp Lys Gly Ala Asp Ile Gln Leu Gly Pro Ala Ala Gly Pro
115 120 125
Leu Gly Arg Ser Pro Asp Gly Gly Arg Asn Trp Glu Gly Phe Ser Pro
130 135 140
Asp Pro Ala Leu Ser Gly Val Leu Phe Ala Glu Thr Ile Lys Gly Ile
145 150 155 160
Gln Asp Ala Gly Val Val Ala Thr Ala Lys His Tyr Ile Ala Tyr Glu
165 170 175
Gln Glu His Phe Arg Gln Ala Pro Glu Ala Gln Gly Tyr Gly Phe Asn
180 185 190
Ile Thr Glu Ser Gly Ser Ala Asn Leu Asp Asp Lys Thr Met His Glu
195 200 205
Leu Tyr Leu Trp Pro Phe Ala Asp Ala Ile Arg Ala Gly Ala Gly Ala
210 215 220
Val Met Cys Ser Tyr Asn Gln Ile Asn Asn Ser Tyr Gly Cys Gln Asn
225 230 235 240
Ser Tyr Thr Leu Asn Lys Leu Leu Lys Ala Glu Leu Gly Phe Gln Gly
245 250 255
Phe Val Met Ser Asp Trp Ala Ala His His Ala Gly Val Ser Gly Ala
260 265 270
Leu Ala Gly Leu Asp Met Ser Met Pro Gly Asp Val Asp Tyr Asp Ser
275 280 285
Gly Thr Ser Tyr Trp Gly Thr Asn Leu Thr Ile Ser Val Leu Asn Gly
290 295 300
Thr Val Pro Gln Trp Arg Val Asp Asp Met Ala Val Arg Ile Met Ala
305 310 315 320
Ala Tyr Tyr Lys Val Gly Arg Asp Arg Leu Trp Thr Pro Pro Asn Phe
325 330 335
Ser Ser Trp Thr Arg Asp Glu Tyr Gly Phe Lys Tyr Tyr Tyr Val Ser
340 345 350
Glu Gly Pro Tyr Glu Lys Val Asn Gln Phe Val Asn Val Gln Arg Asn
355 360 365
His Ser Glu Leu Ile Arg Arg Ile Gly Ala Asp Ser Thr Val Leu Leu
370 375 380
Lys Asn Asp Gly Ala Leu Pro Leu Thr Gly Lys Glu Arg Leu Val Ala
385 390 395 400
Leu Ile Gly Glu Asp Ala Gly Ser Asn Pro Tyr Gly Ala Asn Gly Cys
405 410 415
Ser Asp Arg Gly Cys Asp Asn Gly Thr Leu Ala Met Gly Trp Gly Ser
420 425 430
Gly Thr Ala Asn Phe Pro Tyr Leu Val Thr Pro Glu Gln Ala Ile Ser
435 440 445
Asn Glu Val Leu Lys Asn Lys Asn Gly Val Phe Thr Ala Thr Asp Asn
450 455 460
Trp Ala Ile Asp Gln Ile Glu Ala Leu Ala Lys Thr Ala Ser Val Ser
465 470 475 480
Leu Val Phe Val Asn Ala Asp Ser Gly Glu Gly Tyr Ile Asn Val Asp
485 490 495
Gly Asn Leu Gly Asp Arg Arg Asn Leu Thr Leu Trp Arg Asn Gly Asp
500 505 510
Asn Val Ile Lys Ala Ala Ala Ser Asn Cys Asn Asn Thr Ile Val Ile
515 520 525
Ile His Ser Val Gly Pro Val Leu Val Asn Glu Trp Tyr Asp Asn Pro
530 535 540
Asn Val Thr Ala Ile Leu Trp Gly Gly Leu Pro Gly Gln Glu Ser Gly
545 550 555 560
Asn Ser Leu Ala Asp Val Leu Tyr Gly Arg Val Asn Pro Gly Ala Lys
565 570 575
Ser Pro Phe Thr Trp Gly Lys Thr Arg Glu Ala Tyr Gln Asp Tyr Leu
580 585 590
Tyr Thr Glu Pro Asn Asn Gly Asn Gly Ala Pro Gln Glu Asp Phe Val
595 600 605
Glu Gly Val Phe Ile Asp Tyr Arg Gly Phe Asp Lys Arg Asn Glu Thr
610 615 620
Pro Ile Tyr Glu Phe Gly Tyr Gly Leu Ser Tyr Thr Thr Phe Asn Tyr
625 630 635 640
Ser Asn Leu Gln Val Glu Val Leu Ser Ala Pro Ala Tyr Glu Pro Ala
645 650 655
Ser Gly Glu Thr Glu Ala Ala Pro Thr Phe Gly Glu Val Gly Asn Ala
660 665 670
Ser Asp Tyr Leu Tyr Pro Asp Gly Leu Gln Arg Ile Thr Lys Phe Ile
675 680 685
Tyr Pro Trp Leu Asn Ser Thr Asp Leu Glu Ala Ser Ser Gly Asp Ala
690 695 700
Ser Tyr Gly Gln Asp Ala Ser Asp Tyr Leu Pro Glu Gly Ala Thr Asp
705 710 715 720
Gly Ser Ala Gln Pro Ile Leu Pro Ala Gly Gly Gly Ala Gly Gly Asn
725 730 735
Pro Arg Leu Tyr Asp Glu Leu Ile Arg Val Ser Val Thr Ile Lys Asn
740 745 750
Thr Gly Lys Val Ala Gly Asp Glu Val Pro Gln Leu Tyr Val Ser Leu
755 760 765
Gly Gly Pro Asn Glu Pro Lys Ile Val Leu Arg Gln Phe Glu Arg Ile
770 775 780
Thr Leu Gln Pro Ser Lys Glu Thr Gln Trp Ser Thr Thr Leu Thr Arg
785 790 795 800
Arg Asp Leu Ala Asn Trp Asn Val Glu Thr Gln Asp Trp Glu Ile Thr
805 810 815
Ser Tyr Pro Lys Met Val Phe Ala Gly Ser Ser Ser Arg Lys Leu Pro
820 825 830
Leu Arg Ala Ser Leu Pro Thr Val His
835 840
<210> 3
<211> 841
<212> PRT
<213> 白曲霉
<400> 3
Asp Glu Leu Ala Tyr Ser Pro Pro Tyr Tyr Pro Ser Pro Trp Ala Asn
1 5 10 15
Gly Gln Gly Asp Trp Ala Gln Ala Tyr Gln Arg Ala Val Asp Ile Val
20 25 30
Ser Gln Met Thr Leu Ala Glu Lys Val Asn Leu Thr Thr Gly Thr Gly
35 40 45
Trp Glu Leu Glu Leu Cys Val Gly Gln Thr Gly Gly Val Pro Arg Leu
50 55 60
Gly Val Pro Gly Met Cys Leu Gln Asp Ser Pro Leu Gly Val Arg Asp
65 70 75 80
Ser Asp Tyr Asn Ser Ala Phe Pro Ser Gly Met Asn Val Ala Ala Thr
85 90 95
Trp Asp Lys Asn Leu Ala Tyr Leu Arg Gly Lys Ala Met Gly Gln Glu
100 105 110
Phe Ser Asp Lys Gly Ala Asp Ile Gln Leu Gly Pro Ala Ala Gly Pro
115 120 125
Leu Gly Arg Ser Pro Asp Gly Gly Arg Asn Trp Glu Gly Phe Ser Pro
130 135 140
Asp Pro Ala Leu Ser Gly Val Leu Phe Ala Glu Thr Ile Lys Gly Ile
145 150 155 160
Gln Asp Ala Gly Val Val Ala Thr Ala Lys His Tyr Ile Ala Tyr Glu
165 170 175
Gln Glu His Phe Arg Gln Ala Pro Glu Ala Gln Gly Tyr Gly Phe Asn
180 185 190
Ile Ser Glu Ser Gly Ser Ala Asn Leu Asp Asp Lys Thr Met His Glu
195 200 205
Leu Tyr Leu Trp Pro Phe Ala Asp Ala Ile Arg Ala Gly Ala Gly Ala
210 215 220
Val Met Cys Ser Tyr Asn Gln Ile Asn Asn Ser Tyr Gly Cys Gln Asn
225 230 235 240
Ser Tyr Thr Leu Asn Lys Leu Leu Lys Ala Glu Leu Gly Phe Gln Gly
245 250 255
Phe Val Met Ser Asp Trp Ala Ala His His Ala Gly Val Ser Gly Ala
260 265 270
Leu Ala Gly Leu Asp Met Ser Met Pro Gly Asp Val Asp Tyr Asp Ser
275 280 285
Gly Thr Ser Tyr Trp Gly Thr Asn Leu Thr Val Ser Val Leu Asn Gly
290 295 300
Thr Val Pro Gln Trp Arg Val Asp Asp Met Ala Val Arg Ile Met Ala
305 310 315 320
Ala Tyr Tyr Lys Val Gly Arg Asp Arg Leu Trp Thr Pro Pro Asn Phe
325 330 335
Ser Ser Trp Thr Arg Asp Glu Tyr Gly Tyr Lys Tyr Tyr Tyr Val Ser
340 345 350
Glu Gly Pro Tyr Glu Lys Val Asn His Tyr Val Asn Val Gln Arg Asn
355 360 365
His Ser Glu Leu Ile Arg Arg Ile Gly Ala Asp Ser Thr Val Leu Leu
370 375 380
Lys Asn Asp Gly Ala Leu Pro Leu Thr Gly Lys Glu Arg Leu Val Ala
385 390 395 400
Leu Ile Gly Glu Asp Ala Gly Ser Asn Pro Tyr Gly Ala Asn Gly Cys
405 410 415
Ser Asp Arg Gly Cys Asp Asn Gly Thr Leu Ala Met Gly Trp Gly Ser
420 425 430
Gly Thr Ala Asn Phe Pro Tyr Leu Val Thr Pro Glu Gln Ala Ile Ser
435 440 445
Asn Glu Val Leu Lys Asn Lys Asn Gly Val Phe Thr Ala Thr Asp Asn
450 455 460
Trp Ala Ile Asp Gln Ile Glu Ala Leu Ala Lys Thr Ala Ser Val Ser
465 470 475 480
Leu Val Phe Val Asn Ala Asp Ser Gly Glu Gly Tyr Ile Asn Val Asp
485 490 495
Gly Asn Leu Gly Asp Arg Arg Asn Leu Thr Leu Trp Arg Asn Gly Asp
500 505 510
Asn Val Ile Lys Ala Ala Ala Ser Asn Cys Asn Asn Thr Ile Val Ile
515 520 525
Ile His Ser Val Gly Pro Val Leu Val Asn Glu Trp Tyr Asp Asn Pro
530 535 540
Asn Val Thr Ala Ile Leu Trp Gly Gly Leu Pro Gly Gln Glu Ser Gly
545 550 555 560
Asn Ser Leu Ala Asp Val Leu Tyr Gly Arg Val Asn Pro Gly Ala Lys
565 570 575
Ser Pro Phe Thr Trp Gly Lys Thr Arg Glu Ala Tyr Gln Asp Tyr Leu
580 585 590
Val Thr Glu Pro Asn Asn Gly Asn Gly Ala Pro Gln Glu Asp Phe Val
595 600 605
Glu Gly Val Phe Ile Asp Tyr Arg Gly Phe Asp Lys Arg Asn Glu Thr
610 615 620
Pro Ile Tyr Glu Phe Gly Tyr Gly Leu Ser Tyr Thr Thr Phe Asn Tyr
625 630 635 640
Ser Asn Leu Glu Val Gln Val Leu Ser Ala Pro Ala Tyr Glu Pro Ala
645 650 655
Ser Gly Glu Thr Glu Ala Ala Pro Thr Phe Gly Glu Val Gly Asn Ala
660 665 670
Ser Asn Tyr Leu Tyr Pro Asp Gly Leu Gln Lys Ile Thr Lys Phe Ile
675 680 685
Tyr Pro Trp Leu Asn Ser Thr Asp Leu Glu Ala Ser Ser Gly Asp Ala
690 695 700
Ser Tyr Gly Gln Asp Ser Ser Asp Tyr Leu Pro Glu Gly Ala Thr Asp
705 710 715 720
Gly Ser Ala Gln Pro Ile Leu Pro Ala Gly Gly Gly Pro Gly Gly Asn
725 730 735
Pro Arg Leu Tyr Asp Glu Leu Ile Arg Val Ser Val Thr Ile Lys Asn
740 745 750
Thr Gly Lys Val Ala Gly Asp Glu Val Pro Gln Leu Tyr Val Ser Leu
755 760 765
Gly Gly Pro Asn Glu Pro Lys Ile Val Leu Arg Gln Phe Glu Arg Ile
770 775 780
Thr Leu Gln Pro Ser Glu Glu Thr Lys Trp Ser Thr Thr Leu Thr Arg
785 790 795 800
Arg Asp Leu Ala Asn Trp Asn Val Glu Lys Gln Asp Trp Glu Ile Thr
805 810 815
Ser Tyr Pro Lys Met Val Phe Val Gly Ser Ser Ser Arg Lys Leu Pro
820 825 830
Leu Arg Ala Ser Leu Pro Thr Val His
835 840
<210> 4
<211> 860
<212> PRT
<213> 棘孢曲霉
<400> 4
Met Lys Leu Ser Trp Leu Glu Ala Ala Ala Leu Thr Ala Ala Ser Val
1 5 10 15
Val Ser Ala Asp Glu Leu Ala Phe Ser Pro Pro Phe Tyr Pro Ser Pro
20 25 30
Trp Ala Asn Gly Gln Gly Glu Trp Ala Glu Ala Tyr Gln Arg Ala Val
35 40 45
Ala Ile Val Ser Gln Met Thr Leu Asp Glu Lys Val Asn Leu Thr Thr
50 55 60
Gly Thr Gly Trp Glu Leu Glu Lys Cys Val Gly Gln Thr Gly Gly Val
65 70 75 80
Pro Arg Leu Asn Ile Gly Gly Met Cys Leu Gln Asp Ser Pro Leu Gly
85 90 95
Ile Arg Asp Ser Asp Tyr Asn Ser Ala Phe Pro Ala Gly Val Asn Val
100 105 110
Ala Ala Thr Trp Asp Lys Asn Leu Ala Tyr Leu Arg Gly Gln Ala Met
115 120 125
Gly Gln Glu Phe Ser Asp Lys Gly Ile Asp Val Gln Leu Gly Pro Ala
130 135 140
Ala Gly Pro Leu Gly Arg Ser Pro Asp Gly Gly Arg Asn Trp Glu Gly
145 150 155 160
Phe Ser Pro Asp Pro Ala Leu Thr Gly Val Leu Phe Ala Glu Thr Ile
165 170 175
Lys Gly Ile Gln Asp Ala Gly Val Val Ala Thr Ala Lys His Tyr Ile
180 185 190
Leu Asn Glu Gln Glu His Phe Arg Gln Val Ala Glu Ala Ala Gly Tyr
195 200 205
Gly Phe Asn Ile Ser Asp Thr Ile Ser Ser Asn Val Asp Asp Lys Thr
210 215 220
Ile His Glu Met Tyr Leu Trp Pro Phe Ala Asp Ala Val Arg Ala Gly
225 230 235 240
Val Gly Ala Ile Met Cys Ser Tyr Asn Gln Ile Asn Asn Ser Tyr Gly
245 250 255
Cys Gln Asn Ser Tyr Thr Leu Asn Lys Leu Leu Lys Ala Glu Leu Gly
260 265 270
Phe Gln Gly Phe Val Met Ser Asp Trp Gly Ala His His Ser Gly Val
275 280 285
Gly Ser Ala Leu Ala Gly Leu Asp Met Ser Met Pro Gly Asp Ile Thr
290 295 300
Phe Asp Ser Ala Thr Ser Phe Trp Gly Thr Asn Leu Thr Ile Ala Val
305 310 315 320
Leu Asn Gly Thr Val Pro Gln Trp Arg Val Asp Asp Met Ala Val Arg
325 330 335
Ile Met Ala Ala Tyr Tyr Lys Val Gly Arg Asp Arg Leu Tyr Gln Pro
340 345 350
Pro Asn Phe Ser Ser Trp Thr Arg Asp Glu Tyr Gly Phe Lys Tyr Phe
355 360 365
Tyr Pro Gln Glu Gly Pro Tyr Glu Lys Val Asn His Phe Val Asn Val
370 375 380
Gln Arg Asn His Ser Glu Val Ile Arg Lys Leu Gly Ala Asp Ser Thr
385 390 395 400
Val Leu Leu Lys Asn Asn Asn Ala Leu Pro Leu Thr Gly Lys Glu Arg
405 410 415
Lys Val Ala Ile Leu Gly Glu Asp Ala Gly Ser Asn Ser Tyr Gly Ala
420 425 430
Asn Gly Cys Ser Asp Arg Gly Cys Asp Asn Gly Thr Leu Ala Met Ala
435 440 445
Trp Gly Ser Gly Thr Ala Glu Phe Pro Tyr Leu Val Thr Pro Glu Gln
450 455 460
Ala Ile Gln Ala Glu Val Leu Lys His Lys Gly Ser Val Tyr Ala Ile
465 470 475 480
Thr Asp Asn Trp Ala Leu Ser Gln Val Glu Thr Leu Ala Lys Gln Ala
485 490 495
Ser Val Ser Leu Val Phe Val Asn Ser Asp Ala Gly Glu Gly Tyr Ile
500 505 510
Ser Val Asp Gly Asn Glu Gly Asp Arg Asn Asn Leu Thr Leu Trp Lys
515 520 525
Asn Gly Asp Asn Leu Ile Lys Ala Ala Ala Asn Asn Cys Asn Asn Thr
530 535 540
Ile Val Val Ile His Ser Val Gly Pro Val Leu Val Asp Glu Trp Tyr
545 550 555 560
Asp His Pro Asn Val Thr Ala Ile Leu Trp Ala Gly Leu Pro Gly Gln
565 570 575
Glu Ser Gly Asn Ser Leu Ala Asp Val Leu Tyr Gly Arg Val Asn Pro
580 585 590
Gly Ala Lys Ser Pro Phe Thr Trp Gly Lys Thr Arg Glu Ala Tyr Gly
595 600 605
Asp Tyr Leu Val Arg Glu Leu Asn Asn Gly Asn Gly Ala Pro Gln Asp
610 615 620
Asp Phe Ser Glu Gly Val Phe Ile Asp Tyr Arg Gly Phe Asp Lys Arg
625 630 635 640
Asn Glu Thr Pro Ile Tyr Glu Phe Gly His Gly Leu Ser Tyr Thr Thr
645 650 655
Phe Asn Tyr Ser Gly Leu His Ile Gln Val Leu Asn Ala Ser Ser Asn
660 665 670
Ala Gln Val Ala Thr Glu Thr Gly Ala Ala Pro Thr Phe Gly Gln Val
675 680 685
Gly Asn Ala Ser Asp Tyr Val Tyr Pro Glu Gly Leu Thr Arg Ile Ser
690 695 700
Lys Phe Ile Tyr Pro Trp Leu Asn Ser Thr Asp Leu Lys Ala Ser Ser
705 710 715 720
Gly Asp Pro Tyr Tyr Gly Val Asp Thr Ala Glu His Val Pro Glu Gly
725 730 735
Ala Thr Asp Gly Ser Pro Gln Pro Val Leu Pro Ala Gly Gly Gly Ser
740 745 750
Gly Gly Asn Pro Arg Leu Tyr Asp Glu Leu Ile Arg Val Ser Val Thr
755 760 765
Val Lys Asn Thr Gly Arg Val Ala Gly Asp Ala Val Pro Gln Leu Tyr
770 775 780
Val Ser Leu Gly Gly Pro Asn Glu Pro Lys Val Val Leu Arg Lys Phe
785 790 795 800
Asp Arg Leu Thr Leu Lys Pro Ser Glu Glu Thr Val Trp Thr Thr Thr
805 810 815
Leu Thr Arg Arg Asp Leu Ser Asn Trp Asp Val Ala Ala Gln Asp Trp
820 825 830
Val Ile Thr Ser Tyr Pro Lys Lys Val His Val Gly Ser Ser Ser Arg
835 840 845
Gln Leu Pro Leu His Ala Ala Leu Pro Lys Val Gln
850 855 860
<210> 5
<211> 860
<212> PRT
<213> 黑曲霉
<400> 5
Met Arg Phe Thr Ser Ile Glu Ala Val Ala Leu Thr Ala Val Ser Leu
1 5 10 15
Ala Ser Ala Asp Glu Leu Ala Tyr Ser Pro Pro Tyr Tyr Pro Ser Pro
20 25 30
Trp Ala Asn Gly Gln Gly Asp Trp Ala Glu Ala Tyr Gln Arg Ala Val
35 40 45
Asp Ile Val Ser Gln Met Thr Leu Ala Glu Lys Val Asn Leu Thr Thr
50 55 60
Gly Thr Gly Trp Glu Leu Glu Leu Cys Val Gly Gln Thr Gly Gly Val
65 70 75 80
Pro Arg Leu Gly Ile Pro Gly Met Cys Ala Gln Asp Ser Pro Leu Gly
85 90 95
Val Arg Asp Ser Asp Tyr Asn Ser Ala Phe Pro Ala Gly Val Asn Val
100 105 110
Ala Ala Thr Trp Asp Lys Asn Leu Ala Tyr Leu Arg Gly Gln Ala Met
115 120 125
Gly Gln Glu Phe Ser Asp Lys Gly Ala Asp Ile Gln Leu Gly Pro Ala
130 135 140
Ala Gly Pro Leu Gly Arg Ser Pro Asp Gly Gly Arg Asn Trp Glu Gly
145 150 155 160
Phe Ser Pro Asp Pro Ala Leu Ser Gly Val Leu Phe Ala Glu Thr Ile
165 170 175
Lys Gly Ile Gln Asp Ala Gly Val Val Ala Thr Ala Lys His Tyr Ile
180 185 190
Ala Tyr Glu Gln Glu His Phe Arg Gln Ala Pro Glu Ala Gln Gly Tyr
195 200 205
Gly Phe Asn Ile Thr Glu Ser Gly Ser Ala Asn Leu Asp Asp Lys Thr
210 215 220
Met His Glu Leu Tyr Leu Trp Pro Phe Ala Asp Ala Ile Arg Ala Gly
225 230 235 240
Ala Gly Ala Val Met Cys Ser Tyr Asn Gln Ile Asn Asn Ser Tyr Gly
245 250 255
Cys Gln Asn Ser Tyr Thr Leu Asn Lys Leu Leu Lys Ala Glu Leu Gly
260 265 270
Phe Gln Gly Phe Val Met Ser Asp Trp Ala Ala His His Ala Gly Val
275 280 285
Ser Gly Ala Leu Ala Gly Leu Asp Met Ser Met Pro Gly Asp Val Asp
290 295 300
Tyr Asp Ser Gly Thr Ser Tyr Trp Gly Thr Asn Leu Thr Ile Ser Val
305 310 315 320
Leu Asn Gly Thr Val Pro Gln Trp Arg Val Asp Asp Met Ala Val Arg
325 330 335
Ile Met Ala Ala Tyr Tyr Lys Val Gly Arg Asp Arg Leu Trp Thr Pro
340 345 350
Pro Asn Phe Ser Ser Trp Thr Arg Asp Glu Tyr Gly Phe Lys Tyr Tyr
355 360 365
Tyr Val Ser Glu Gly Pro Tyr Glu Lys Val Asn Gln Phe Val Asn Val
370 375 380
Gln Arg Asn His Ser Glu Leu Ile Arg Arg Ile Gly Ala Asp Ser Thr
385 390 395 400
Val Leu Leu Lys Asn Asp Gly Ala Leu Pro Leu Thr Gly Lys Glu Arg
405 410 415
Leu Val Ala Leu Ile Gly Glu Asp Ala Gly Ser Asn Pro Tyr Gly Ala
420 425 430
Asn Gly Cys Ser Asp Arg Gly Cys Asp Asn Gly Thr Leu Ala Met Gly
435 440 445
Trp Gly Ser Gly Thr Ala Asn Phe Pro Tyr Leu Val Thr Pro Glu Gln
450 455 460
Ala Ile Ser Asn Glu Val Leu Lys Asn Lys Asn Gly Val Phe Thr Ala
465 470 475 480
Thr Asp Asn Trp Ala Ile Asp Gln Ile Glu Ala Leu Ala Lys Thr Ala
485 490 495
Ser Val Ser Leu Val Phe Val Asn Ala Asp Ser Gly Glu Gly Tyr Ile
500 505 510
Asn Val Asp Gly Asn Leu Gly Asp Arg Arg Asn Leu Thr Leu Trp Arg
515 520 525
Asn Gly Asp Asn Val Ile Lys Ala Ala Ala Ser Asn Cys Asn Asn Thr
530 535 540
Ile Val Ile Ile His Ser Val Gly Pro Val Leu Val Asn Glu Trp Tyr
545 550 555 560
Asp Asn Pro Asn Val Thr Ala Ile Leu Trp Gly Gly Leu Pro Gly Gln
565 570 575
Glu Ser Gly Asn Ser Leu Ala Asp Val Leu Tyr Gly Arg Val Asn Pro
580 585 590
Gly Ala Lys Ser Pro Phe Thr Trp Gly Lys Thr Arg Glu Ala Tyr Gln
595 600 605
Asp Tyr Leu Tyr Thr Glu Pro Asn Asn Gly Asn Gly Ala Pro Gln Glu
610 615 620
Asp Phe Val Glu Gly Val Phe Ile Asp Tyr Arg Gly Phe Asp Lys Arg
625 630 635 640
Asn Glu Thr Pro Ile Tyr Glu Phe Gly Tyr Gly Leu Ser Tyr Thr Thr
645 650 655
Phe Asn Tyr Ser Asn Leu Gln Val Glu Val Leu Ser Ala Pro Ala Tyr
660 665 670
Glu Pro Ala Ser Gly Glu Thr Glu Ala Ala Pro Thr Phe Gly Glu Val
675 680 685
Gly Asn Ala Ser Asp Tyr Leu Tyr Pro Asp Gly Leu Gln Arg Ile Thr
690 695 700
Lys Phe Ile Tyr Pro Trp Leu Asn Ser Thr Asp Leu Glu Ala Ser Ser
705 710 715 720
Gly Asp Ala Ser Tyr Gly Gln Asp Ala Ser Asp Tyr Leu Pro Glu Gly
725 730 735
Ala Thr Asp Gly Ser Ala Gln Pro Ile Leu Pro Ala Gly Gly Gly Ala
740 745 750
Gly Gly Asn Pro Arg Leu Tyr Asp Glu Leu Ile Arg Val Ser Val Thr
755 760 765
Ile Lys Asn Thr Gly Lys Val Ala Gly Asp Glu Val Pro Gln Leu Tyr
770 775 780
Val Ser Leu Gly Gly Pro Asn Glu Pro Lys Ile Val Leu Arg Gln Phe
785 790 795 800
Glu Arg Ile Thr Leu Gln Pro Ser Lys Glu Thr Gln Trp Ser Thr Thr
805 810 815
Leu Thr Arg Arg Asp Leu Ala Asn Trp Asn Val Glu Thr Gln Asp Trp
820 825 830
Glu Ile Thr Ser Tyr Pro Lys Met Val Phe Ala Gly Ser Ser Ser Arg
835 840 845
Lys Leu Pro Leu Arg Ala Ser Leu Pro Thr Val His
850 855 860
<210> 6
<211> 860
<212> PRT
<213> 白曲霉
<400> 6
Met Arg Phe Thr Leu Ile Glu Ala Val Ala Leu Thr Ala Val Ser Leu
1 5 10 15
Ala Ser Ala Asp Glu Leu Ala Tyr Ser Pro Pro Tyr Tyr Pro Ser Pro
20 25 30
Trp Ala Asn Gly Gln Gly Asp Trp Ala Gln Ala Tyr Gln Arg Ala Val
35 40 45
Asp Ile Val Ser Gln Met Thr Leu Ala Glu Lys Val Asn Leu Thr Thr
50 55 60
Gly Thr Gly Trp Glu Leu Glu Leu Cys Val Gly Gln Thr Gly Gly Val
65 70 75 80
Pro Arg Leu Gly Val Pro Gly Met Cys Leu Gln Asp Ser Pro Leu Gly
85 90 95
Val Arg Asp Ser Asp Tyr Asn Ser Ala Phe Pro Ser Gly Met Asn Val
100 105 110
Ala Ala Thr Trp Asp Lys Asn Leu Ala Tyr Leu Arg Gly Lys Ala Met
115 120 125
Gly Gln Glu Phe Ser Asp Lys Gly Ala Asp Ile Gln Leu Gly Pro Ala
130 135 140
Ala Gly Pro Leu Gly Arg Ser Pro Asp Gly Gly Arg Asn Trp Glu Gly
145 150 155 160
Phe Ser Pro Asp Pro Ala Leu Ser Gly Val Leu Phe Ala Glu Thr Ile
165 170 175
Lys Gly Ile Gln Asp Ala Gly Val Val Ala Thr Ala Lys His Tyr Ile
180 185 190
Ala Tyr Glu Gln Glu His Phe Arg Gln Ala Pro Glu Ala Gln Gly Tyr
195 200 205
Gly Phe Asn Ile Ser Glu Ser Gly Ser Ala Asn Leu Asp Asp Lys Thr
210 215 220
Met His Glu Leu Tyr Leu Trp Pro Phe Ala Asp Ala Ile Arg Ala Gly
225 230 235 240
Ala Gly Ala Val Met Cys Ser Tyr Asn Gln Ile Asn Asn Ser Tyr Gly
245 250 255
Cys Gln Asn Ser Tyr Thr Leu Asn Lys Leu Leu Lys Ala Glu Leu Gly
260 265 270
Phe Gln Gly Phe Val Met Ser Asp Trp Ala Ala His His Ala Gly Val
275 280 285
Ser Gly Ala Leu Ala Gly Leu Asp Met Ser Met Pro Gly Asp Val Asp
290 295 300
Tyr Asp Ser Gly Thr Ser Tyr Trp Gly Thr Asn Leu Thr Val Ser Val
305 310 315 320
Leu Asn Gly Thr Val Pro Gln Trp Arg Val Asp Asp Met Ala Val Arg
325 330 335
Ile Met Ala Ala Tyr Tyr Lys Val Gly Arg Asp Arg Leu Trp Thr Pro
340 345 350
Pro Asn Phe Ser Ser Trp Thr Arg Asp Glu Tyr Gly Tyr Lys Tyr Tyr
355 360 365
Tyr Val Ser Glu Gly Pro Tyr Glu Lys Val Asn His Tyr Val Asn Val
370 375 380
Gln Arg Asn His Ser Glu Leu Ile Arg Arg Ile Gly Ala Asp Ser Thr
385 390 395 400
Val Leu Leu Lys Asn Asp Gly Ala Leu Pro Leu Thr Gly Lys Glu Arg
405 410 415
Leu Val Ala Leu Ile Gly Glu Asp Ala Gly Ser Asn Pro Tyr Gly Ala
420 425 430
Asn Gly Cys Ser Asp Arg Gly Cys Asp Asn Gly Thr Leu Ala Met Gly
435 440 445
Trp Gly Ser Gly Thr Ala Asn Phe Pro Tyr Leu Val Thr Pro Glu Gln
450 455 460
Ala Ile Ser Asn Glu Val Leu Lys Asn Lys Asn Gly Val Phe Thr Ala
465 470 475 480
Thr Asp Asn Trp Ala Ile Asp Gln Ile Glu Ala Leu Ala Lys Thr Ala
485 490 495
Ser Val Ser Leu Val Phe Val Asn Ala Asp Ser Gly Glu Gly Tyr Ile
500 505 510
Asn Val Asp Gly Asn Leu Gly Asp Arg Arg Asn Leu Thr Leu Trp Arg
515 520 525
Asn Gly Asp Asn Val Ile Lys Ala Ala Ala Ser Asn Cys Asn Asn Thr
530 535 540
Ile Val Ile Ile His Ser Val Gly Pro Val Leu Val Asn Glu Trp Tyr
545 550 555 560
Asp Asn Pro Asn Val Thr Ala Ile Leu Trp Gly Gly Leu Pro Gly Gln
565 570 575
Glu Ser Gly Asn Ser Leu Ala Asp Val Leu Tyr Gly Arg Val Asn Pro
580 585 590
Gly Ala Lys Ser Pro Phe Thr Trp Gly Lys Thr Arg Glu Ala Tyr Gln
595 600 605
Asp Tyr Leu Val Thr Glu Pro Asn Asn Gly Asn Gly Ala Pro Gln Glu
610 615 620
Asp Phe Val Glu Gly Val Phe Ile Asp Tyr Arg Gly Phe Asp Lys Arg
625 630 635 640
Asn Glu Thr Pro Ile Tyr Glu Phe Gly Tyr Gly Leu Ser Tyr Thr Thr
645 650 655
Phe Asn Tyr Ser Asn Leu Glu Val Gln Val Leu Ser Ala Pro Ala Tyr
660 665 670
Glu Pro Ala Ser Gly Glu Thr Glu Ala Ala Pro Thr Phe Gly Glu Val
675 680 685
Gly Asn Ala Ser Asn Tyr Leu Tyr Pro Asp Gly Leu Gln Lys Ile Thr
690 695 700
Lys Phe Ile Tyr Pro Trp Leu Asn Ser Thr Asp Leu Glu Ala Ser Ser
705 710 715 720
Gly Asp Ala Ser Tyr Gly Gln Asp Ser Ser Asp Tyr Leu Pro Glu Gly
725 730 735
Ala Thr Asp Gly Ser Ala Gln Pro Ile Leu Pro Ala Gly Gly Gly Pro
740 745 750
Gly Gly Asn Pro Arg Leu Tyr Asp Glu Leu Ile Arg Val Ser Val Thr
755 760 765
Ile Lys Asn Thr Gly Lys Val Ala Gly Asp Glu Val Pro Gln Leu Tyr
770 775 780
Val Ser Leu Gly Gly Pro Asn Glu Pro Lys Ile Val Leu Arg Gln Phe
785 790 795 800
Glu Arg Ile Thr Leu Gln Pro Ser Glu Glu Thr Lys Trp Ser Thr Thr
805 810 815
Leu Thr Arg Arg Asp Leu Ala Asn Trp Asn Val Glu Lys Gln Asp Trp
820 825 830
Glu Ile Thr Ser Tyr Pro Lys Met Val Phe Val Gly Ser Ser Ser Arg
835 840 845
Lys Leu Pro Leu Arg Ala Ser Leu Pro Thr Val His
850 855 860
<210> 7
<211> 2812
<212> DNA
<213> 棘孢曲霉
<400> 7
ccatggtacc cggatcctcg cttcaacatc tttctcaaat tctcgagagc gcaagctgtg 60
tggctggcta gctcgttgct tcctctttct tcagctacct ctacgccatc atgaagctca 120
gttggcttga ggcggctgcc ttgacggctg cttcagtcgt cagcgctgat gaactggcgt 180
tctctcctcc tttctacccc tctccgtggg ccaatggcca gggagagtgg gcggaagcct 240
accagcgtgc agtggccatt gtatcccaga tgactctgga tgagaaggtc aacctgacca 300
ccggaactgg atgggagctg gagaagtgcg tcggtcagac tggtggtgtc ccaagactga 360
acatcggtgg catgtgtctt caggacagtc ccttgggaat tcgtgatagt gactacaatt 420
cggctttccc tgctggtgtc aacgttgctg cgacatggga caagaacctt gcttatctac 480
gtggtcaggc tatgggtcaa gagttcagtg acaaaggaat tgatgttcaa ttgggaccgg 540
ccgcgggtcc cctcggcagg agccctgatg gaggtcgcaa ctgggaaggt ttctctccag 600
acccggctct tactggtgtg ctctttgcgg agacgattaa gggtattcaa gacgctggtg 660
tcgtggcgac agccaagcat tacattctca atgagcaaga gcatttccgc caggtcgcag 720
aggctgcggg ctacggattc aatatctccg acacgatcag ctctaacgtt gatgacaaga 780
ccattcatga aatgtacctc tggcccttcg cggatgccgt tcgcgccggc gttggcgcca 840
tcatgtgttc ctacaaccag atcaacaaca gctacggttg ccagaacagt tacactctga 900
acaagcttct gaaggccgag ctcggcttcc agggctttgt gatgtctgac tggggtgctc 960
accacagtgg tgttggctct gctttggccg gcttggatat gtcaatgcct ggcgatatca 1020
ccttcgattc tgccactagt ttctggggta ccaacctgac cattgctgtg ctcaacggta 1080
ccgtcccgca gtggcgcgtt gacgacatgg ctgtccgtat catggctgcc tactacaagg 1140
ttggccgcga ccgcctgtac cagccgccta acttcagctc ctggactcgc gatgaatacg 1200
gcttcaagta tttctacccc caggaagggc cctatgagaa ggtcaatcac tttgtcaatg 1260
tgcagcgcaa ccacagcgag gttattcgca agttgggagc agacagtact gttctactga 1320
agaacaacaa tgccctgccg ctgaccggaa aggagcgcaa agttgcgatc ctgggtgaag 1380
atgctggatc caactcgtac ggtgccaatg gctgctctga ccgtggctgt gacaacggta 1440
ctcttgctat ggcttggggt agcggcactg ccgaattccc atatctcgtg acccctgagc 1500
aggctattca agccgaggtg ctcaagcata agggcagcgt ctacgccatc acggacaact 1560
gggcgctgag ccaggtggag accctcgcta aacaagccag tgtctctctt gtatttgtca 1620
actcggacgc gggagagggc tatatctccg tggacggaaa cgagggcgac cgcaacaacc 1680
tcaccctctg gaagaacggc gacaacctca tcaaggctgc tgcaaacaac tgcaacaaca 1740
ccatcgttgt catccactcc gttggacctg ttttggttga cgagtggtat gaccacccca 1800
acgttactgc catcctctgg gcgggcttgc ctggccagga gtctggcaac tccttggctg 1860
acgtgctcta cggccgcgtc aacccgggcg ccaaatctcc attcacctgg ggcaagacga 1920
gggaggcgta cggggattac cttgtccgtg agctcaacaa cggcaacgga gctccccaag 1980
atgatttctc ggaaggtgtt ttcattgact accgcggatt cgacaagcgc aatgagaccc 2040
cgatctacga gttcggacat ggtctgagct acaccacttt caactactct ggccttcaca 2100
tccaggttct caacgcttcc tccaacgctc aagtagccac tgagactggc gccgctccca 2160
ccttcggaca agtcggcaat gcctctgact acgtgtaccc tgagggattg accagaatca 2220
gcaagttcat ctatccctgg cttaattcca cagacctgaa ggcctcatct ggcgacccgt 2280
actatggagt cgacaccgcg gagcacgtgc ccgagggtgc tactgatggc tctccgcagc 2340
ccgttctgcc tgccggtggt ggctctggtg gtaacccgcg cctctacgat gagttgatcc 2400
gtgtttcggt gacagtcaag aacactggtc gtgttgccgg tgatgctgtg cctcaattgt 2460
atgtttccct tggtggaccc aatgagccca aggttgtgtt gcgcaaattc gaccgcctca 2520
ccctcaagcc ctccgaggag acggtgtgga cgactaccct gacccgccgc gatctgtcta 2580
actgggacgt tgcggctcag gactgggtca tcacttctta cccgaagaag gtccatgttg 2640
gtagctcttc gcgtcagctg ccccttcacg cggcgctccc gaaggtgcaa tgagcagctg 2700
aaggtgttgt gaaggaaggg ctttgggcct cagcttcagc ttgcagctga agatgatgta 2760
tacatttttc ccaagtcgta gagactacga atttaatgac tatgatgctg tc 2812
<210> 8
<211> 38
<212> DNA
<213> 人工
<220>
<223> BGL1-F
<400> 8
aactgcaggc ggccgcatca tgaagctcag ttggcttg 38
<210> 9
<211> 27
<212> DNA
<213> 人工
<220>
<223> BGL1-R
<400> 9
aagcatgctc attgcacctt cgggagc 27
<210> 10
<211> 24
<212> DNA
<213> 人工
<220>
<223> FbgI S641T
<400> 10
ctttcaacta cactggcctt caca 24
<210> 11
<211> 27
<212> DNA
<213> 人工
<220>
<223> FbgI R738T
<400> 11
ctggtggtaa cgcgaccctc tacgatg 27
<210> 12
<211> 24
<212> DNA
<213> 人工
<220>
<223> FbgI R745N
<400> 12
gatgagttga tcaacgtttc ggtg 24
<210> 13
<211> 19
<212> DNA
<213> 人工
<220>
<223> Fbgi K787N
<400> 13
tcaccctcaa tccctccga 19
<210> 14
<211> 23
<212> DNA
<213> 人工
<220>
<223> FbgI E790N
<400> 14
caagccctcc aacgagacgg tgt 23
<210> 15
<211> 23
<212> DNA
<213> 人工
<220>
<223> Fbgi V793N
<400> 15
cgaggagacg aactggacga cta 23
<210> 16
<211> 23
<212> DNA
<213> 人工
<220>
<223> FbgI T797N
<400> 16
gtggacgact aacctgaccc gcc 23
<210> 17
<211> 23
<212> DNA
<213> 人工
<220>
<223> FbgI D807T
<400> 17
gtctaactgg actgttgcgg ctc 23
<210> 18
<211> 24
<212> DNA
<213> 人工
<220>
<223> Fbgi V814N
<400> 18
ctcaggactg gaacatcact tctt 24
<210> 19
<211> 23
<212> DNA
<213> 人工
<220>
<223> Fbgi S826N
<400> 19
ccatgttggt aactcttcgc gtc 23
Claims (14)
1.一种突变β-葡萄糖苷酶,其中,
选自下述(i)和(ii),
(i)由在序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列中选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基被取代为天冬酰胺的氨基酸序列构成,并且具有β-葡萄糖苷酶活性的多肽;
(ii)包含该(i)的多肽,并且具有β-葡萄糖苷酶活性的多肽。
2.如权利要求1所述的突变β-葡萄糖苷酶,其中,
与所述取代前的β-葡萄糖苷酶相比,糖化残渣吸附性较低。
3.如权利要求1或2所述的突变β-葡萄糖苷酶,其中,
与所述取代前的β-葡萄糖苷酶相比,对于生物质的β-葡萄糖苷酶活性提高。
4.如权利要求2或3所述的突变β-葡萄糖苷酶,其中,
所述取代前的β-葡萄糖苷酶由与序列编号1所示的氨基酸序列至少80%相同的氨基酸序列构成,并且在对应于序列编号1所示的氨基酸序列的787位的位置具有赖氨酸,在对应于790位的位置具有谷氨酸,并在对应于797位的位置具有苏氨酸。
5.一种多核苷酸,其中,
编码权利要求1~4中任一项所述的突变β-葡萄糖苷酶。
6.一种载体,其中,
含有权利要求5所述的多核苷酸。
7.一种转化体,其中,
包含权利要求5所述的多核苷酸或权利要求6所述的载体。
8.如权利要求7所述的转化体,其中,
所述转化体为丝状菌。
9.一种生物质糖化剂,其中,
含有权利要求1~4中任一项所述的突变β-葡萄糖苷酶。
10.一种糖的制造方法,其中,
包括:用权利要求1~4中任一项所述的突变β-葡萄糖苷酶将生物质糖化的步骤。
11.一种突变β-葡萄糖苷酶的制造方法,其中,
所述方法包括:在包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽中,将选自对应于序列编号1的787位的位置、对应于790位的位置、和对应于797位的位置中的至少1个位置的氨基酸残基取代为天冬酰胺的步骤。
12.如权利要求11所述的方法,其中,
所述突变β-葡萄糖苷酶与所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽相比,糖化残渣吸附性较低。
13.如权利要求11或12所述的方法,其中,
所述突变β-葡萄糖苷酶与所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽相比,对于生物质的β-葡萄糖苷酶活性提高。
14.如权利要求11~13中任一项所述的方法,其中,
所述包含序列编号1所示的氨基酸序列或与其具有至少80%的同一性的氨基酸序列并且具有β-葡萄糖苷酶活性的多肽在对应于序列编号1所示的氨基酸序列的787位的位置具有赖氨酸,在对应于790位的位置具有谷氨酸,并在对应于797位的位置具有苏氨酸。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018-031809 | 2018-02-26 | ||
JP2018031809A JP7051491B2 (ja) | 2018-02-26 | 2018-02-26 | 変異β-グルコシダーゼ |
PCT/JP2019/006542 WO2019163886A1 (ja) | 2018-02-26 | 2019-02-21 | 変異β-グルコシダーゼ |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111770996A true CN111770996A (zh) | 2020-10-13 |
CN111770996B CN111770996B (zh) | 2024-03-26 |
Family
ID=67687325
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201980015241.4A Active CN111770996B (zh) | 2018-02-26 | 2019-02-21 | 突变β-葡萄糖苷酶 |
Country Status (5)
Country | Link |
---|---|
US (1) | US11261436B2 (zh) |
JP (1) | JP7051491B2 (zh) |
CN (1) | CN111770996B (zh) |
PH (1) | PH12020551328A1 (zh) |
WO (1) | WO2019163886A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7051491B2 (ja) * | 2018-02-26 | 2022-04-11 | 花王株式会社 | 変異β-グルコシダーゼ |
WO2023225459A2 (en) | 2022-05-14 | 2023-11-23 | Novozymes A/S | Compositions and methods for preventing, treating, supressing and/or eliminating phytopathogenic infestations and infections |
CN117210442B (zh) * | 2023-05-30 | 2024-05-28 | 西北大学 | 一种用于修饰人参皂苷的β-糖苷酶SS-BGL突变体及其应用 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100041104A1 (en) * | 2008-06-06 | 2010-02-18 | Danisco Us Inc. | Compositions and Methods Comprising Cellulase Variants with Reduced Affinity to Non-Cellulosic Materials |
US20100221778A1 (en) * | 2009-02-27 | 2010-09-02 | Iogen Energy Corporation | Novel lignin-resistant cellulase enzymes |
US20140127759A1 (en) * | 2011-07-14 | 2014-05-08 | Toray Industries, Inc. | Mutant b-glucosidase, enzyme composition for decomposing biomass, and method of producing sugar solution |
US20140186897A1 (en) * | 2011-03-24 | 2014-07-03 | Asahi Glass Company, Limited | TRANSFORMANT OF YEAST OF GENUS SCHIZOSACCHAROMYCES, METHOD FOR PRODUCING SAME, METHOD FOR PRODUCING ß-GLUCOSIDASE, AND CELLULOSE DEGRADATION METHOD |
CN107709559A (zh) * | 2015-06-26 | 2018-02-16 | 花王株式会社 | 新型木聚糖酶 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2318523A4 (en) | 2008-08-01 | 2011-11-23 | Iogen Energy Corp | CELLULASE OF FAMILY 6 WITH INACTIVATION BY LOWERED LIGNIN |
CN105316303B (zh) | 2009-11-20 | 2019-05-31 | 丹尼斯科美国公司 | 具有改善的特性的β-葡萄糖苷酶I变体 |
WO2011097713A1 (en) | 2010-02-11 | 2011-08-18 | Iogen Energy Corporation | Carbohydrate binding modules with reduced binding to lignin |
JP2011223962A (ja) * | 2010-04-22 | 2011-11-10 | Osaka Prefecture Univ | 新規セルラーゼ |
WO2013115305A1 (ja) | 2012-01-31 | 2013-08-08 | 国立大学法人長岡技術科学大学 | セルロース系バイオマスからの糖及びアルコールの製造方法、並びに該方法に用いられる微生物 |
JP6449573B2 (ja) * | 2014-07-04 | 2019-01-09 | 花王株式会社 | β−グルコシダーゼ |
JP7051491B2 (ja) * | 2018-02-26 | 2022-04-11 | 花王株式会社 | 変異β-グルコシダーゼ |
-
2018
- 2018-02-26 JP JP2018031809A patent/JP7051491B2/ja active Active
-
2019
- 2019-02-21 CN CN201980015241.4A patent/CN111770996B/zh active Active
- 2019-02-21 WO PCT/JP2019/006542 patent/WO2019163886A1/ja active Application Filing
- 2019-02-21 US US16/971,341 patent/US11261436B2/en active Active
-
2020
- 2020-08-26 PH PH12020551328A patent/PH12020551328A1/en unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100041104A1 (en) * | 2008-06-06 | 2010-02-18 | Danisco Us Inc. | Compositions and Methods Comprising Cellulase Variants with Reduced Affinity to Non-Cellulosic Materials |
US20100221778A1 (en) * | 2009-02-27 | 2010-09-02 | Iogen Energy Corporation | Novel lignin-resistant cellulase enzymes |
US20140186897A1 (en) * | 2011-03-24 | 2014-07-03 | Asahi Glass Company, Limited | TRANSFORMANT OF YEAST OF GENUS SCHIZOSACCHAROMYCES, METHOD FOR PRODUCING SAME, METHOD FOR PRODUCING ß-GLUCOSIDASE, AND CELLULOSE DEGRADATION METHOD |
US20140127759A1 (en) * | 2011-07-14 | 2014-05-08 | Toray Industries, Inc. | Mutant b-glucosidase, enzyme composition for decomposing biomass, and method of producing sugar solution |
CN107709559A (zh) * | 2015-06-26 | 2018-02-16 | 花王株式会社 | 新型木聚糖酶 |
Non-Patent Citations (2)
Title |
---|
RUCHI AGRAWAL ET AL.: "Development of a b-glucosidase hyperproducing mutant by combined chemical and UV mutagenesis" * |
王婧等: "产β -葡萄糖苷酶酿酒酵母菌株的化学诱变选育及产酶条件优化" * |
Also Published As
Publication number | Publication date |
---|---|
JP2019146499A (ja) | 2019-09-05 |
PH12020551328A1 (en) | 2021-09-01 |
WO2019163886A1 (ja) | 2019-08-29 |
US20210017511A1 (en) | 2021-01-21 |
JP7051491B2 (ja) | 2022-04-11 |
US11261436B2 (en) | 2022-03-01 |
CN111770996B (zh) | 2024-03-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7007645B2 (ja) | 変異型bxl1遺伝子を有するトリコデルマ属真菌及びそれを使用したキシロオリゴ糖とグルコースの製造方法 | |
KR20120106774A (ko) | 동시 당화 발효 반응의 효율을 향상시키는 방법 | |
US11236375B2 (en) | Mutant strain Clostridium thermocellum for producing cellulase and xylanase and preparation method thereof | |
CN111770996B (zh) | 突变β-葡萄糖苷酶 | |
CN107709559B (zh) | 新型木聚糖酶 | |
JP7208762B2 (ja) | 新規アラビノフラノシダーゼ | |
US20170327854A1 (en) | Methods for cellobiosan utilization | |
WO2021029828A1 (en) | Talaromyces pinophilus strain for producing cellulolytic enzymes | |
CN111757939B (zh) | 突变β-葡萄糖苷酶 | |
JP6518107B2 (ja) | 転写因子変異株 | |
JP6470967B2 (ja) | 変異キシラナーゼ | |
JP6849749B2 (ja) | 新規キシラナーゼ | |
WO2015076260A1 (ja) | 耐熱性キシラナーゼ | |
KR101236558B1 (ko) | 귤 표면에서 분리한 신규한 포마 종 균주 및 이로부터 생산되는 섬유소 분해 효소 및 베타글루코시다아제 | |
EP3133155B1 (en) | Hyperthermostable endoglucanase | |
JP2023145205A (ja) | バイオマス糖化のための酵素組成物 | |
Part | Chuannan Long, Yijin Cheng, Jingjing Cui, Jian Liu, Lihui Gan, Bin Zeng & Minnan Long |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |