CN113106114B - 调控里氏木霉蛋白表达效率的因子、调控方法及应用 - Google Patents
调控里氏木霉蛋白表达效率的因子、调控方法及应用 Download PDFInfo
- Publication number
- CN113106114B CN113106114B CN202010032308.XA CN202010032308A CN113106114B CN 113106114 B CN113106114 B CN 113106114B CN 202010032308 A CN202010032308 A CN 202010032308A CN 113106114 B CN113106114 B CN 113106114B
- Authority
- CN
- China
- Prior art keywords
- trichoderma reesei
- protein
- gene
- strain
- target gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 245
- 241000499912 Trichoderma reesei Species 0.000 title claims abstract description 129
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 104
- 238000000034 method Methods 0.000 title claims abstract description 52
- 230000014509 gene expression Effects 0.000 title claims abstract description 49
- 230000001105 regulatory effect Effects 0.000 title abstract description 22
- 108010059892 Cellulase Proteins 0.000 claims description 54
- 229940106157 cellulase Drugs 0.000 claims description 51
- 210000001938 protoplast Anatomy 0.000 claims description 30
- 238000012258 culturing Methods 0.000 claims description 22
- 101100028161 Porcine circovirus 2 ORF4 gene Proteins 0.000 claims description 20
- 101150052795 cbh-1 gene Proteins 0.000 claims description 15
- 108091033409 CRISPR Proteins 0.000 claims description 14
- 210000004027 cell Anatomy 0.000 claims description 12
- 101001065065 Aspergillus awamori Feruloyl esterase A Proteins 0.000 claims description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 9
- 230000000692 anti-sense effect Effects 0.000 claims description 9
- 239000003153 chemical reaction reagent Substances 0.000 claims description 9
- 150000007523 nucleic acids Chemical class 0.000 claims description 9
- 239000002773 nucleotide Substances 0.000 claims description 9
- 125000003729 nucleotide group Chemical group 0.000 claims description 9
- 108020004459 Small interfering RNA Proteins 0.000 claims description 8
- 230000030279 gene silencing Effects 0.000 claims description 8
- 238000002744 homologous recombination Methods 0.000 claims description 8
- 230000006801 homologous recombination Effects 0.000 claims description 8
- 230000002452 interceptive effect Effects 0.000 claims description 8
- 108091070501 miRNA Proteins 0.000 claims description 8
- 102000039446 nucleic acids Human genes 0.000 claims description 8
- 108020004707 nucleic acids Proteins 0.000 claims description 8
- 239000004055 small Interfering RNA Substances 0.000 claims description 8
- 230000001965 increasing effect Effects 0.000 claims description 7
- 238000012217 deletion Methods 0.000 claims description 6
- 230000037430 deletion Effects 0.000 claims description 6
- 239000002679 microRNA Substances 0.000 claims description 6
- 238000010453 CRISPR/Cas method Methods 0.000 claims description 5
- 238000005516 engineering process Methods 0.000 claims description 5
- 238000010362 genome editing Methods 0.000 claims description 5
- 230000005764 inhibitory process Effects 0.000 claims description 5
- 230000008685 targeting Effects 0.000 claims description 5
- 238000003776 cleavage reaction Methods 0.000 claims description 4
- 239000013604 expression vector Substances 0.000 claims description 4
- 238000003780 insertion Methods 0.000 claims description 4
- 230000037431 insertion Effects 0.000 claims description 4
- 230000007017 scission Effects 0.000 claims description 4
- 125000003275 alpha amino acid group Chemical group 0.000 claims 5
- 230000028327 secretion Effects 0.000 abstract description 13
- 230000001276 controlling effect Effects 0.000 abstract description 7
- 238000013518 transcription Methods 0.000 abstract description 6
- 230000035897 transcription Effects 0.000 abstract description 6
- 230000015572 biosynthetic process Effects 0.000 abstract description 3
- 238000003786 synthesis reaction Methods 0.000 abstract description 3
- 102000004190 Enzymes Human genes 0.000 description 48
- 108090000790 Enzymes Proteins 0.000 description 48
- 229940088598 enzyme Drugs 0.000 description 48
- 230000000694 effects Effects 0.000 description 37
- 238000000855 fermentation Methods 0.000 description 32
- 230000004151 fermentation Effects 0.000 description 32
- 108010047754 beta-Glucosidase Proteins 0.000 description 27
- 102000006995 beta-Glucosidase Human genes 0.000 description 27
- 108020004414 DNA Proteins 0.000 description 25
- 239000001963 growth medium Substances 0.000 description 25
- 239000007788 liquid Substances 0.000 description 23
- 239000000243 solution Substances 0.000 description 22
- 238000004519 manufacturing process Methods 0.000 description 21
- 238000012216 screening Methods 0.000 description 14
- 101000870438 Streptococcus gordonii UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase stabilizing protein GtfB Proteins 0.000 description 11
- 101000645119 Vibrio campbellii (strain ATCC BAA-1116 / BB120) Nucleotide-binding protein VIBHAR_03667 Proteins 0.000 description 11
- 238000002703 mutagenesis Methods 0.000 description 11
- 231100000350 mutagenesis Toxicity 0.000 description 11
- 229920001184 polypeptide Polymers 0.000 description 11
- 108090000765 processed proteins & peptides Proteins 0.000 description 11
- 102000004196 processed proteins & peptides Human genes 0.000 description 11
- 239000013598 vector Substances 0.000 description 11
- 239000001888 Peptone Substances 0.000 description 10
- 108010080698 Peptones Proteins 0.000 description 10
- 235000019319 peptone Nutrition 0.000 description 10
- 108091027544 Subgenomic mRNA Proteins 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 239000002609 medium Substances 0.000 description 9
- 108091033319 polynucleotide Proteins 0.000 description 9
- 102000040430 polynucleotide Human genes 0.000 description 9
- 239000002157 polynucleotide Substances 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- 230000002222 downregulating effect Effects 0.000 description 8
- 239000000725 suspension Substances 0.000 description 8
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 7
- 239000008103 glucose Substances 0.000 description 7
- 239000000411 inducer Substances 0.000 description 7
- 239000000123 paper Substances 0.000 description 7
- 102000005575 Cellulases Human genes 0.000 description 6
- 108010084185 Cellulases Proteins 0.000 description 6
- 102100031725 Cortactin-binding protein 2 Human genes 0.000 description 6
- 102000004366 Glucosidases Human genes 0.000 description 6
- 108010056771 Glucosidases Proteins 0.000 description 6
- 108020005004 Guide RNA Proteins 0.000 description 6
- 101710197985 Probable protein Rev Proteins 0.000 description 6
- 244000061456 Solanum tuberosum Species 0.000 description 6
- 235000002595 Solanum tuberosum Nutrition 0.000 description 6
- 150000001413 amino acids Chemical group 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 230000000903 blocking effect Effects 0.000 description 6
- 229940041514 candida albicans extract Drugs 0.000 description 6
- 238000009776 industrial production Methods 0.000 description 6
- 239000012138 yeast extract Substances 0.000 description 6
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 5
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 5
- 241000452385 Trichoderma reesei RUT C-30 Species 0.000 description 5
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 5
- 239000004202 carbamide Substances 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000006872 improvement Effects 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 238000011068 loading method Methods 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 229920000136 polysorbate Polymers 0.000 description 5
- 239000000843 powder Substances 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 238000011218 seed culture Methods 0.000 description 5
- 239000007787 solid Substances 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 4
- 102100030988 Angiotensin-converting enzyme Human genes 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 102000016397 Methyltransferase Human genes 0.000 description 4
- 108060004795 Methyltransferase Proteins 0.000 description 4
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 4
- 108090000882 Peptidyl-Dipeptidase A Proteins 0.000 description 4
- 244000046052 Phaseolus vulgaris Species 0.000 description 4
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 238000012271 agricultural production Methods 0.000 description 4
- 238000009395 breeding Methods 0.000 description 4
- 235000010633 broth Nutrition 0.000 description 4
- 239000001913 cellulose Substances 0.000 description 4
- 235000010980 cellulose Nutrition 0.000 description 4
- 229920002678 cellulose Polymers 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 230000007062 hydrolysis Effects 0.000 description 4
- 238000006460 hydrolysis reaction Methods 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 229910017053 inorganic salt Inorganic materials 0.000 description 4
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 4
- 239000008108 microcrystalline cellulose Substances 0.000 description 4
- 229940016286 microcrystalline cellulose Drugs 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 239000002994 raw material Substances 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 238000011426 transformation method Methods 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 239000002699 waste material Substances 0.000 description 4
- 238000010354 CRISPR gene editing Methods 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 239000011086 glassine Substances 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 239000002054 inoculum Substances 0.000 description 3
- 230000006916 protein interaction Effects 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- BHZOKUMUHVTPBX-UHFFFAOYSA-M sodium acetic acid acetate Chemical compound [Na+].CC(O)=O.CC([O-])=O BHZOKUMUHVTPBX-UHFFFAOYSA-M 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- UDIODXADSSQKTM-GUHNCMMLSA-N (5ar,8ar,9r)-5-[[(2r,4ar,6r,7r,8r,8as)-7,8-dihydroxy-2-methyl-4,4a,6,7,8,8a-hexahydropyrano[3,2-d][1,3]dioxin-6-yl]oxy]-9-(4-hydroxy-3,5-dimethoxyphenyl)-5a,6,8a,9-tetrahydro-5h-[2]benzofuro[6,5-f][1,3]benzodioxol-8-one;(7s,9s)-7-[(2r,4s,5s,6s)-4-amino-5- Chemical compound ClCCN(CCCl)P1(=O)NCCCO1.O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1.COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3C(O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@H](C)OC[C@H]4O3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 UDIODXADSSQKTM-GUHNCMMLSA-N 0.000 description 2
- 101150111329 ACE-1 gene Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- 108010088751 Albumins Proteins 0.000 description 2
- 102000009027 Albumins Human genes 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- 108010017384 Blood Proteins Proteins 0.000 description 2
- 102000004506 Blood Proteins Human genes 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 108010022355 Fibroins Proteins 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- 102000003939 Membrane transport proteins Human genes 0.000 description 2
- 108090000301 Membrane transport proteins Proteins 0.000 description 2
- 101100329389 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cre-1 gene Proteins 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 2
- 229920001872 Spider silk Polymers 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000012136 culture method Methods 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 238000001952 enzyme assay Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010002430 hemicellulase Proteins 0.000 description 2
- 150000002443 hydroxylamines Chemical class 0.000 description 2
- 230000004957 immunoregulator effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 238000001243 protein synthesis Methods 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000009962 secretion pathway Effects 0.000 description 2
- 239000007974 sodium acetate buffer Substances 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 238000001086 yeast two-hybrid system Methods 0.000 description 2
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- IESDGNYHXIOKRW-YXMSTPNBSA-N (2s)-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s,3r)-2-amino-3-hydroxybutanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-YXMSTPNBSA-N 0.000 description 1
- IDCPFAYURAQKDZ-UHFFFAOYSA-N 1-nitroguanidine Chemical compound NC(=N)N[N+]([O-])=O IDCPFAYURAQKDZ-UHFFFAOYSA-N 0.000 description 1
- CWPDVLMKCHLLPS-JVPBZIDWSA-N 2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]propanoyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 CWPDVLMKCHLLPS-JVPBZIDWSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 101150073246 AGL1 gene Proteins 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- QXNGSPZMGFEZNO-QRTARXTBSA-N Asn-Val-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QXNGSPZMGFEZNO-QRTARXTBSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- BDWIZLQVVWQMTB-XKBZYTNZSA-N Cys-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)O BDWIZLQVVWQMTB-XKBZYTNZSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 101001065501 Escherichia phage MS2 Lysis protein Proteins 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- 101150059802 KU80 gene Proteins 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 101100342585 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) mus-51 gene Proteins 0.000 description 1
- 101100074054 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) mus-52 gene Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- GHNVJQZQYKNTDX-HJWJTTGWSA-N Phe-Ile-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O GHNVJQZQYKNTDX-HJWJTTGWSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 101100297422 Schizosaccharomyces pombe (strain 972 / ATCC 24843) phd1 gene Proteins 0.000 description 1
- 101100342589 Schizosaccharomyces pombe (strain 972 / ATCC 24843) pku70 gene Proteins 0.000 description 1
- 101100074057 Schizosaccharomyces pombe (strain 972 / ATCC 24843) pku80 gene Proteins 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- HIWPGCMGAMJNRG-ACCAVRKYSA-N Sophorose Natural products O([C@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O)[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HIWPGCMGAMJNRG-ACCAVRKYSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 1
- KIMOCKLJBXHFIN-YLVFBTJISA-N Trp-Ile-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O)=CNC2=C1 KIMOCKLJBXHFIN-YLVFBTJISA-N 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- GFUOTIPYXKAPAH-BVSLBCMMSA-N Trp-Pro-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GFUOTIPYXKAPAH-BVSLBCMMSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000002154 agricultural waste Substances 0.000 description 1
- 238000007605 air drying Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 229940100198 alkylating agent Drugs 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 239000007640 basal medium Substances 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- HIWPGCMGAMJNRG-UHFFFAOYSA-N beta-sophorose Natural products OC1C(O)C(CO)OC(O)C1OC1C(O)C(O)C(O)C(CO)O1 HIWPGCMGAMJNRG-UHFFFAOYSA-N 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 108010091735 fibrinogen peptide 6A Proteins 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 101150044508 key gene Proteins 0.000 description 1
- 101150085005 ku70 gene Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000031700 light absorption Effects 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 230000002101 lytic effect Effects 0.000 description 1
- 238000002705 metabolomic analysis Methods 0.000 description 1
- 230000001431 metabolomic effect Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 238000004537 pulping Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- PZDOWFGHCNHPQD-VNNZMYODSA-N sophorose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](C=O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O PZDOWFGHCNHPQD-VNNZMYODSA-N 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000004753 textile Substances 0.000 description 1
- 108010001055 thymocartin Proteins 0.000 description 1
- XPFJYKARVSSRHE-UHFFFAOYSA-K trisodium;2-hydroxypropane-1,2,3-tricarboxylate;2-hydroxypropane-1,2,3-tricarboxylic acid Chemical compound [Na+].[Na+].[Na+].OC(=O)CC(O)(C(O)=O)CC(O)=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O XPFJYKARVSSRHE-UHFFFAOYSA-K 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010025432 tyrosyl-alanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 230000009750 upstream signaling Effects 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 235000015099 wheat brans Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/37—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/14—Fungi; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2434—Glucanases acting on beta-1,4-glucosidic bonds
- C12N9/2437—Cellulases (3.2.1.4; 3.2.1.74; 3.2.1.91; 3.2.1.150)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2434—Glucanases acting on beta-1,4-glucosidic bonds
- C12N9/2445—Beta-glucosidase (3.2.1.21)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01073—Feruloyl esterase (3.1.1.73)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01021—Beta-glucosidase (3.2.1.21)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Mycology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Chemical & Material Sciences (AREA)
- Botany (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Gastroenterology & Hepatology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明涉及调控里氏木霉蛋白表达效率的因子、调控方法及应用。所述的调控里氏木霉蛋白表达效率的因子包括影响里氏木霉表达、合成或分泌蛋白质效率的非转录调控因子。本发明还公开了它们在里氏木霉表达、合成与分泌蛋白质中的应用。本发明还公开了围绕本发明的新发现而构建的重组的里氏木霉菌株。
Description
技术领域
本发明属于生物技术领域,更具体地,本发明涉及调控里氏木霉蛋白表达效率的因子、调控方法及应用。
背景技术
里氏木霉(Trichoderma reesei)是纤维素酶工业中利用最多的菌株。里氏木霉菌株Qm6a为各个高产纤维素酶菌株的出发株,为了获取高产纤维素酶的新菌株,Qm6a经过两轮直线加速器的诱变和筛选得到了更高效的产纤维素酶突变菌株Qm9414,但是仍然受到碳代谢阻遏,产酶状况会随着发酵时间增加,葡萄糖浓度升高,纤维素酶的合成逐渐受到抑制。为了获得更高的纤维素酶产量,对Qm6a重新进行育种,通过三轮诱变和筛选得到了抗碳代谢阻遏的高产菌株Rut-C30:第一轮通过紫外诱变和抗代谢阻遏筛选获得了菌株M7;第二轮对M7通过N-硝基胍诱变得到突变株NG14,相比Qm9414,NG14胞外蛋白产量和滤纸酶活分别提高了1倍和4倍;第三轮在NG14的基础上经过紫外诱变和抗代谢阻遏筛选的得到了更高效的产纤维素酶菌株Rut-C30。此外,在NG14的基础上紫外诱变获取了另一突变株RL-P37,另一工业上广泛应用的纤维素酶高产菌株CL-847是由Qm9414诱变而来。由于其高产纤维素酶的能力,在工业上被广泛用于纺织、造纸、制浆及生物能源等领域。该菌目前最高发酵水平可达100g/L的蛋白产量,具有很强的蛋白分泌能力,同时对人畜安全,已经被开发成表达同源蛋白和异源蛋白的良好真菌宿主。在纤维素酶工业应用中,里氏木霉的产酶和蛋白分泌能力的进一步提高有助于纤维素酶应用市场的进一步打开。
虽然传统的诱变育种可以获得不少优良的工业菌株,但Kubicek等认为,从1978年到1991年单纯通过传统诱变方法获得里氏木霉的突变株,在产酶能力上已经没有大的改观。近年来,随着分子生物学的进一步发展,人们已经逐渐从常规的诱变育种和改善发酵条件等传统生物学转向利用比较基因组学、蛋白质组学、转录组学和代谢组学等系统生物学,以此来分析了解一些高产菌株重要突变基因的功能和相关分子机制,并利用这些机制进行进一步遗传改造以获取更符合工业需求的生产菌株。然而,里氏木霉产纤维素酶的真正诱导机制还并未发现。此外里氏木霉突变株强大蛋白分泌能力也未被解析。只有弄清楚这些机理,才能从根源上改造里氏木霉,使其具有更高的纤维素酶产量以及蛋白分泌能力。
因此,有必要对里氏木霉进行更加深入的改造,提高其产酶能力和蛋白分泌能力,降低纤维素酶的工业生产成本,进而获得有用的工业菌株。
发明内容
本发明的目的在于提供调控里氏木霉蛋白表达效率的因子、调控方法及应用。
在本发明的第一方面,提供一种提高里氏木霉的蛋白表达效率的方法,包括:下调里氏木霉基因组中的靶基因,所述靶基因选自:编码ORF4蛋白的基因,编码β-葡萄糖苷酶的基因。
在一个优选例中,所述的蛋白是里氏木霉内源蛋白或异源蛋白。
在另一优选例中,所述的内源蛋白包括(但不限于):纤维素酶,半纤维素酶,β葡萄糖苷酶,糖类透过酶,蛋白合成分泌途径相关酶。
在另一优选例中,所述的异源蛋白包括非纤维素酶分泌蛋白;较佳地,包括(但不限于):异源葡萄糖苷酶,异源阿魏酸酯酶FEA,结构蛋白(如蜘蛛丝蛋白、蚕丝蛋白等),功能蛋白(如免疫调节蛋白、血清白蛋白等)。
在另一优选例中,所述的异源蛋白的编码基因插入(替换)到里氏木霉基因组中;或,所述的异源蛋白的编码基因由表达载体引入到里氏木霉细胞内。
在另一优选例中,所述的异源蛋白的编码基因插入(替换)到里氏木霉基因组中cbh1位置。
在另一优选例中,所述的里氏木霉菌株的出发菌株为Rut-C30,Qm9414,RC30-8。
在另一优选例中,下调里氏木霉基因组中的靶基因包括:抑制、敲除、沉默或突变所述的靶基因,或抑制所述的靶基因编码的蛋白的表达或活性;或将下调所述靶基因的下调剂转入里氏木霉中(较佳地包括抑制或敲除所述的靶基因或抑制所述的靶基因编码的蛋白的表达或活性的作用的试剂);或调节所述靶基因的上游信号通路或上游基因,下调所述靶基因。
在另一优选例中,通过采用CRISPR/Cas技术进行基因编辑以敲除所述的靶基因;或通过同源重组技术,对靶基因所在区段进行缺失或插入的操作以敲除所述的靶基因;或通过干扰分子干扰靶基因的表达;所述的干扰分子是以所述靶基因或其转录本为抑制或沉默靶标的dsRNA、反义核酸、小干扰RNA、微小RNA,或能表达或形成所述dsRNA、反义核酸、小干扰RNA、微小RNA的构建物。
在另一优选例中,所述的ORF4蛋白具有SEQ ID NO:2所示的氨基酸序列,还包括其同源物(同源序列)。
在另一优选例中,所述β-葡萄糖苷酶具有SEQ ID NO:15所示的氨基酸序列,还包括其同源物(同源序列)。
在另一优选例中,通过引入阿魏酸酯酶FEA表达框同源替换编码ORF4蛋白的基因,从而敲除该编码ORF4蛋白的基因,同时引入阿魏酸酯酶FEA表达框。
在另一优选例中,所述编码ORF4蛋白的基因具有SEQ ID NO:1所示的核苷酸序列,基于该序列,靶向于其第257-277位,使该靶位置发生剪切,破坏基因功能。
在另一优选例中,所述编码β-葡萄糖苷酶的基因具有SEQ ID NO:3所示的核苷酸序列,基于该序列,靶向于其第2607-2626位,使该靶位置发生剪切,破坏基因功能。
在本发明的另一方面,提供靶基因的调节剂的用途,用于提高里氏木霉的蛋白表达效率;所述调节剂包括下调剂;所述下调剂为靶向于ORF4蛋白或其编码基因或靶向于内源葡萄糖苷酶或其编码基因的下调剂。
在一个优选例中,所述的下调剂包括:靶向且敲除靶基因的CRISPR/Cas基因编辑试剂(如SEQ ID NO:14所示的sgRNA);或通过同源重组对靶基因进行缺失或插入操作、从而敲除所述的靶基因的试剂(如针对靶基因的同源臂);或干扰分子,其包括以所述靶基因或其转录本为抑制或沉默靶标的dsRNA、反义核酸、小干扰RNA、微小RNA,或能表达或形成所述dsRNA、反义核酸、小干扰RNA、微小RNA的构建物。
在本发明的另一方面,提供一种重组的里氏木霉菌株或其孢子、菌丝体、原生质体,其基因组中的靶基因被下调;其中,下调的靶基因选自:编码ORF4蛋白的基因,编码内源β-葡萄糖苷酶的基因。
在一个优选例中,所述的里氏木霉菌株或其孢子、菌丝体、原生质体中,所述靶基因被抑制、敲除、沉默或突变,或该靶基因编码的蛋白的表达或活性被抑制;或所述里氏木霉菌株或其孢子、菌丝体、原生质体中,转入了所述靶基因的下调剂;较佳地,所述靶基因的下调剂包括:抑制或敲除所述的靶基因,或抑制所述的靶基因编码的蛋白的表达或活性的作用的试剂。
在另一优选例中,所述的里氏木霉菌株或其孢子、菌丝体、原生质体,其中还包括异源蛋白的编码基因;较佳地,所述异源蛋白的编码基因由表达载体引入到里氏木霉细胞内,或插入(同源替换)到里氏木霉基因组中,如(但不限于)插入到的cbh1位置。
在本发明的另一方面,提供前面任一所述的里氏木霉菌株或其孢子、菌丝体、原生质体的用途,用于表达里氏木霉内源蛋白或异源蛋白。
在本发明的另一方面,提供一种重组表达异源蛋白方法,包括:将异源蛋白的编码基因插入到任一所述的重组的里氏木霉菌株或其孢子、菌丝体、原生质体的基因组中,培养该里氏木霉菌株或其孢子、菌丝体、原生质体,从而表达异源蛋白;较佳地,所述的异源蛋白的编码基因插入(同源替换)到里氏木霉或其孢子、菌丝体、原生质体基因组中的cbh1位置。
在本发明的另一方面,提供一种重组表达里氏木霉内源蛋白方法,包括:培养所述的重组的里氏木霉菌株或其孢子、菌丝体、原生质体,从而重组表达里氏木霉内源蛋白(如纤维素酶)。
在本发明的另一方面,提供一种用于蛋白表达的试剂盒,其中包含:前面任一所述的重组的里氏木霉菌株或其孢子、菌丝体、原生质体;或靶基因的下调剂(如用于敲除靶基因的质粒,或RNA干扰试剂),所述靶基因选自:编码ORF4蛋白的基因,编码内源β-葡萄糖苷酶的基因。
在一个优选例中,所述的内源蛋白包括(但不限于):纤维素酶,半纤维素酶,β葡萄糖苷酶,糖类透过酶,蛋白合成分泌途径相关酶。
在另一优选例中,所述的异源蛋白包括非纤维素酶分泌蛋白;较佳地,包括(但不限于):异源葡萄糖苷酶,异源阿魏酸酯酶FEA,结构蛋白(如蜘蛛丝蛋白、蚕丝蛋白等),功能蛋白(如免疫调节蛋白、血清白蛋白等)。
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。
附图说明
图1、利用酵母双杂系统验证orf4和ACE1存在蛋白互作。阳性对照为已经报道存在蛋白互作的Lae1和Vel1,实验组为AD-orf4和BD-ACE1,阴性对照为AD-orf4和BD空载体,以及AD空载体和BD-ACE1。显蓝色说明存在蛋白互作。
具体实施方式
里氏木霉是商业纤维素酶的主要生产微生物,目前工业上应用的常见菌株虽然相比野生株纤维素酶产量已经有了一定的提高,但是还有一定的上升空间。加上里氏木霉产纤维素酶的诱导机制还未被真正解析,因此寻找新的产酶以及蛋白分析相关的关键因子显得尤为重要。本发明人经过深入的研究,发现里氏木霉基因组中的一些靶基因与该菌株的内源或外源蛋白产量和/或活性密切相关,下调这些靶基因,可以极为显著地提高该菌株的内源或外源蛋白产量和/或活性。因此,所述靶基因或调节所述靶基因的物质和方法可应用于实现里氏木霉的改良,提高里氏木霉的蛋白表达效率,增强其表达的内源或外源蛋白产量和/或活性。
在本发明中,“ORF4多肽(蛋白)”指具有SEQ ID NO:2序列的多肽,还包括具有与ORF4多肽相同功能的、SEQ ID NO:2序列的变异形式或同源物。这些变异形式包括(但并不限于):若干个(如1-50个,较佳地1-30个,更佳地1-20个,最佳地1-10个,还更佳如1-8个、1-5个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加或缺失一个或数个(通常为20个以内,较佳地为10个以内,更佳地为5个以内)氨基酸。
在本发明中,“β-葡萄糖苷酶”指具有SEQ ID NO:15序列的多肽,还包括具有与β-葡萄糖苷酶多肽相同功能的、SEQ ID NO:15序列的变异形式或同源物。这些变异形式包括(但并不限于):若干个(如1-50个,较佳地1-30个,更佳地1-20个,最佳地1-10个,还更佳如1-8个、1-5个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加或缺失一个或数个(通常为20个以内,较佳地为10个以内,更佳地为5个以内)氨基酸。
任何与所述的ORF4多肽或β-葡萄糖苷酶同源性高(比如与SEQ ID NO:2或15所示的序列的同源性为85%或更高;优选的,同源性为90%或更高;更优选的,同源性为95%或更高,如同源性98%或99%)的、且具ORF4多肽或β-葡萄糖苷酶相同功能的蛋白也包括在本发明内。
尽管本发明的具体实施例中,列举了具体的ORF4多肽或β-葡萄糖苷酶,但是应理解,由于里氏木霉存在一些不同的变种,它们之间序列高度保守,来自于这些菌中的同源多肽或多核苷酸也应被包含在本发明中,对这些同源多肽或多核苷酸(同源物)的如本发明类似或相同的调控方法也应被包含在本发明中。比对序列相同性的方法和工具也是本领域周知的,例如BLAST。
本发明人通过里氏木霉基因组逐步敲除,发现敲除株的蛋白分泌能力有所增加。再对这段序列进行单基因的筛选工作,最终发现该区域的甲基转移酶和产纤维素酶的调控有关,通过实验验证,该甲基转移酶酶可通过调控纤维素酶的抑制因子ACE1来调控纤维素酶的分泌。而且其他位点的基因也和纤维素酶的产量有关。该甲基转移酶的敲除可以使里氏木霉Rut-C30的FPU酶活提高25%以上。因此是一个产纤维素酶的关键因子,可用于里氏木霉工业菌株改造,非常具有应用价值。另一方面,本发明人还进一步对一个预测为β-葡萄糖苷酶的基因进行敲除,发现敲除株的蛋白分泌能力还能进一步提高。通过实验发现,该β-葡萄糖苷酶对里氏木霉诱导产酶的最强诱导物槐糖有较强的水解能力,可能导致胞内诱导物下降,和上述甲基转移酶都是诱导产纤维素酶的关键因子。
因此,本发明首次确定里氏木霉基因组中的一些靶基因与该菌株的内源或外源蛋白产量和/或活性密切相关。基于该新发现,本发明提供了一种提高里氏木霉的蛋白表达效率的方法,所述方法包括:下调里氏木霉基因组中的靶基因;其中,所述下调的靶基因选自:编码ORF4蛋白的基因,编码内源β-葡萄糖苷酶的基因。
在得知了所述的靶基因后,可以采用本领域人员熟知的多种方法来调节(下调或变异)它们。包括但不限于抑制、敲除、沉默或突变所述的靶基因,或抑制所述的靶基因编码的蛋白的表达或活性;或将下调所述靶基因的下调剂转入里氏木霉中;或通过定点突变试剂来进行靶向突变。
可以采用多种本领域已知的方法在里氏木霉株中下调所述靶基因,包括基因沉默、基因阻断、基因敲除、基因抑制等。这些方法均被包含在本发明中。
例如,可以通过基于同源重组的基因插入或缺失的基因阻断技术来改造基因组中的靶基因,从而使得靶基因被阻断;也可以针对靶基因设计干扰性RNA或反义核苷酸来使靶基因表达抑制或沉默。
一种下调靶基因的方法是基因阻断技术,在本发明的优选实施方式中,体外构建靶基因阻断质粒,通过同源重组的方法,在里氏木霉株染色体靶基因中插入其他无关元件,从而使得染色体上的靶基因不再能够编码活性的蛋白质。当进行基因阻断时,无关元件的选择是本领域技术人员易于选择到的,例如应用一些抗性基因。一种基因阻断(敲除)的方法例如可参见Genetic Manipulation of Streptomyces:a Laboratory Manual中所记载的。
在设计用于进行基因阻断或敲除的构建物时,同时包含抗性筛选基因是优选的,从而有利于后续筛选出发生基因被阻断或敲除的菌株。
本发明还提供了下调所述靶基因的里氏木霉或其孢子、菌丝体、原生质体,更特别的是采用CRISPR/Cas技术缺失靶基因后获得的里氏木霉或其孢子、菌丝体、原生质体,该菌株或其孢子、菌丝体、原生质体不表达所述靶基因或表达量显著性降低。本发明还涉及所述菌株或其孢子、菌丝体、原生质体的用途,用于提高里氏木霉的蛋白表达效率。
基于本发明人的工作,本发明还提供了一种用于蛋白表达的试剂盒,其中包含:本发明的遗传工程化(重组)的里氏木霉或其孢子、菌丝体、原生质体。
本发明还提供了一种用于蛋白表达的试剂盒,其中包含:所述靶基因的下调剂,如用于敲除靶基因的质粒,或RNA干扰试剂,或定点突变试剂等。
所述的用于蛋白表达的试剂盒中,还可以包括其它应用于里氏木霉生产过程的试剂,例如里氏木霉的基础培养基。
所述的用于蛋白表达的试剂盒中,还可以包括使用说明书,说明培养所述里氏木霉的方法,或说明利用所述下调剂或定点突变试剂下调里氏木霉中靶基因的方法。
在本发明实施例中,本发明人通过对Rut-C30的基因组SEQ ID NO:1中的ORF进行敲除发现,这段序列和里氏木霉纤维素酶产量及蛋白分泌能力相关。本发明人还在该区域同源重组表达异源基因获得了比其他位点表达更高的产量,在此基础上进一步敲除一个预测为β-葡萄糖苷酶的基因SEQ ID NO:3可以进一步提高产量。因此,本发明涉及的基因及非编码多核苷酸序列对里氏木霉菌株具有很好的工业应用前景。
在本发明的具体实施例中,在里氏木霉Rut-C30菌株中敲除ORF4的多核苷酸序列后,其孢子生产的纤维素酶的酶活高于15IU/mL(U/mL);较佳地高于18IU/mL。在本发明的另一实施例中,在里氏木霉Rut-C30菌株中阿魏酸酯酶FEA表达框(SEQ ID NO:5)在cbh1位点同源重组表达后,进一步敲除SEQ ID NO:1所示的多核苷酸序列后,其孢子生产的纤维素酶的酶活高于200IU/mL;较佳地高于220IU/mL。
在本发明的另一具体实施例中,在里氏木霉已经缺失SEQ ID NO:1的高产异源β-葡萄糖苷酶菌株U10中进一步还敲除SEQ ID NO:3所示的多核苷酸,其孢子生产的β-葡萄糖苷酶酶活(pNPGase)高于2000IU/ml;较佳地高于2500IU/mL。
通过本发明的改造获得高产菌株,加上转化子发酵培养的碳源是麸皮等简单易得的农用废弃物,在提高纤维素酶制剂的水解效率方面有着十分明显的作用,如应用在食品其他行业生产中,也将大大降低其生产成本,有着更为广泛的工业应用潜力。
进一步地,本发明的活性位点,还可以用于异源基因的重组表达。通过农杆菌介导的转化、PEG介导的原生质体转化、基于ku70/ku80、CRSIPR/Cas9等手段将异源基因表达框引入到基因组中,优选引入到cbh1位置,取代该序列的部分或全部序列进行重组表达,进行进一步改良而获得产量更高或酶系更为优化的衍生菌株或者其他重组蛋白的工程菌株。这些工程菌株还可以进一步建立无筛选标记的基因操作,在工业上有着巨大的应用前景。
更进一步的,本发明的菌株可以作为进一步优化衍生菌株或者其他重组蛋白的工程菌株的基础,构建无标记筛选系统后,可作为分子操作的对象,可以敲入和敲除基因等遗传操作,进行有目的性的遗传改造,或过表达里氏木霉表达量过少的纤维素酶基因,如过表达纤维素酶激活因子Xyr1等,或敲除纤维素酶阻遏因子Cre1等,获得进一步高产的工程菌株。
本发明的改造菌株是活体细胞,一旦获得了本发明菌株的孢子、菌丝、原生质体及其相关的还有活体细胞的培养混合物,就可以用接种传代、再生等手段来大批量地获得本发明的菌株。这通常是将其接种到固体平板培养基或液体培养基中进行菌株的扩大培养而获得本发明的活体细胞。而获得的活体细胞可进一步进行实验室驯化、遗传育种和分子遗传操作等来获得突变体和转化子。也可利用本发明作为异源表达的宿主细胞。
本领域的技术人员熟知的方法能用于诱变本发明的活体细胞,而造成活体细胞的基因编码改变、酶活特性和形态学上的改变。这些方法包括利用射线、粒子、激光、紫外光等物理方法,利用烷化剂、碱基类似物(base analog)、羟胺(hydroxylamine)、吖啶色素等化学诱变方法。诱变可是以上一种方法或多种方法的多代诱变,且不限于这些方法。基于本发明提供的菌株,可以进一步进行物理化学等方式进行育种,也可以导入新的纤维素酶基因和相关调控基因,获得的突变体和转化子产酶性能可获得进一步提高,所述的育种方法为上述的一种或一种以上相结合。
本领域的技术人员熟知的方法将本发明的多核苷酸序列用于构建表达构建物(载体)和进一步改造宿主细胞。例如,对于菌株中已发现或新发现的与纤维素酶生产相关的信号途径、信号通路及其中涉及的蛋白进行进一步的改良(例如增加有益因子的表达,减少有害因子的表达)。
用重组DNA转化宿主细胞可用本领域技术人员熟知的常规技术进行。所用的步骤在本领域众所周知。如农杆菌介导的真菌转化方法,原生质体转化方法、电击转化方法、CRISPR/Cas9基因组编辑法、基因枪法等,且不限于这些方法。
本发明通过改造获得的里氏木霉优化菌株能够利用含有木质纤维素的工农业生产废弃物,如麸皮、玉米浆、豆饼粉等廉价原料,通过液态或固态发酵生产纤维素酶,其优势在于所产纤维素酶活力高,组分合理,水解木质纤维素能力强。本发明为解决纤维素资源利用中酶活力不高、酶组分不合理导致的生产成本高,糖化效率低等问题提供了新的菌种资源。
本发明通过引入FEA表达框同源替换cbh1后进一步敲除SEQ ID NO:1获得的里氏木霉工程菌株能够利用含有木质纤维素的工农业生产废弃物,如麸皮、玉米浆、豆饼粉等廉价原料,通过液态或固态发酵生产纤维素酶,其优势在于所产纤维素酶活力高,组分合理,水解木质纤维素能力强。相同发酵条件下,该菌株是随机引入FEA表达框产量的2倍。本发明为解决纤维素资源利用中酶活力不高、酶组分不合理导致的生产成本高,糖化效率低等问题提供了新的菌种资源。
本发明通过敲除SEQ ID NO:3获得的里氏木霉优化菌株能够利用含有木质纤维素的工农业生产废弃物,如麸皮、玉米浆、豆饼粉等廉价原料,通过液态或固态发酵生产纤维素酶,其优势在于所产纤维素酶活力高,组分合理,水解木质纤维素能力强。本发明为解决纤维素资源利用中酶活力不高、酶组分不合理导致的生产成本高,糖化效率低等问题提供了新的菌种资源。
本发明的方法获得的里氏木霉优化菌株能够利用含有木质纤维素的工农业生产废弃物,如豆饼粉等廉价原料,通过液态或固态发酵生产纤维素酶,其优势在于所产纤维素酶显著下降。本发明的敲除菌株可为表达非纤维素酶分泌蛋白提供表达宿主。
本发明在制备纤维素酶和异源蛋白方面的应用可以通过液态发酵实现,作为一个优选实施例方式,发酵方法包括:(1)将里氏木霉菌种在土豆培养基制成的斜面或平板活化后,制成浓度为106~108mL-1的孢子悬液,按10%的接种量接入沙堡氏培养基(种子培养基:1%酵母膏,1%蛋白胨,4%葡萄糖)中,28℃,200rpm振荡培养,得种子液,然后将种子液以10%接种量接入液体发酵培养基中,初始pH 5.0,装液量为10mL于50mL三角瓶,于28℃,200rpm摇床中培养5-7天;(2)将步骤(1)获得的发酵液离心分离,取上清液作为粗酶液;所述的诱导型发酵培养基是含有5%诱导物(3%微晶纤维素和2%麸皮)的无机盐培养液(0.4%KH2PO4,0.28%(NH4)2SO4,0.06%MgSO4·7H2O,0.05%CaCl2,0.06%urea,0.3%tryptone,0.1%Tween 80,0.5%CaCO3,0.001%FeSO4·7H2O,0.00032%MnSO4·H2O,0.00028%ZnSO4·7H2O,0.0004%CoCl2)。
应用于培养本发明的菌株的培养基和培养方法并不限于以上公布的那些,其它常规应用于培养里氏木霉的培养基和培养方法也可以应用于本发明中。
如上所述发酵体系可以进行体系放大进行工业生产,根据体系的大小不同,本领域技术人员根据所掌握的一般知识可以进行适当的调整以有利于菌株的生长或生产。
如上所述液体发酵粗酶液,可经超滤、盐析或有机溶剂沉淀等方法获得较纯的纤维素酶、酶粉或其他异源重组蛋白。通过发酵并测定所产纤维素酶的滤纸酶活(FPA)、CMC酶活、β-葡萄糖苷酶酶活及其他重组蛋白的活性。应理解,分离和纯化纤维素酶、重组蛋白的方法不受限于本发明中所提供的那些,其它本领域技术人员已知的方法也可应用于本发明中。
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如J.萨姆布鲁克等编著,分子克隆实验指南,第三版,科学出版社,2002中所述的条件,或按照制造厂商所建议的条件。
实施例1、SEQ ID NO:1多核苷酸的获得
1、培养里氏木霉菌株
将里氏木霉Rut-C30菌种(购自ATCC,菌株编号:ATCC 56765)接种在土豆培养基制成的平板上,置于28℃培养7天后,制成浓度为106~108个/mL的孢子悬液,按10%(v/v)的接种量接入沙堡氏培养基(种子培养基(v/v):1%酵母膏,1%蛋白胨,4%葡萄糖)中,28℃,200rpm振荡培养,得菌液。
2、抽提里氏木霉基因组DNA
收集菌液到离心管,12000g离心获得菌丝。弃去上清液,加入0.5ml酚:氯仿:异戊醇(v:v:v=25:24:1),再加入0.5ml DES(0.1M Tris-HCl,pH 8.0,1.0%SDS,2%Triton X-100,10mM EDTA,0.1M NaCl)。加入0.1g beads后,放入Fastprep,6M/S,破碎30s,间隔5分钟后重复一次。裂解后的菌液12000g,4℃,离心10min。吸取上清0.4ml,加入等体积的异丙醇,-20℃沉淀30分钟。12000g,4℃,离心10min,弃上清。沉淀用75%乙醇洗涤后,12000g,4℃,离心5min,弃上清,晾干5min,之后加入200μl无菌水。
3、orf4多核苷酸(SEQ ID NO:1)的克隆
从上述分离到的里氏木霉基因组DNA,以正向引物为:5’ATGTCCCAATACGACTCCATCGG 3’(SEQ ID NO:6),其5’端添加XbaI识别位点:TCTAGA;反向引物为5’TTACAACCGCAACTAAAACAC 3’(SEQ ID NO:7),其5’端添加XbaI识别位点。
将PCR产物纯化后用XbaI酶切,应用AxygenPCR产物柱回收试剂盒回收酶切的DNA片段,将该DNA片段和经同样酶切的回收的载体pHDt/sk(含有Kan和hyg筛选标记,购自Sigma公司)去磷酸化处理后,用T4 DNA连接酶在16℃下连接过夜,得到载体pHDt/sk-full。并通过测序验证序列正确与否。
SEQ ID NO:1(orf4)核酸序列:
ATGTCCCAATACGACTCCATCGGCGCCTCCTATAATGTCCTTGAGCAACTTGCCTACAGAGCTGTGGAGAAGTACAACGTGTACACAACTATTAAGCCACTACTCAGACCGGGGGTGCATGTTCTAGATCTTGCCTGCGGTACCGGCTTCTACTCATCACATCTTCTTGCATGGGGCGCTGAAGACGTCCTCGGCGTGGATATATCATCTGCAATGCTTGAGAGTGCCGTATCTCGATTATCATCAGAGATTTCTGCGGGAAAGGCACAATTCGTGTTGGGGGATGGTGTCATTCCACAGTCGTTCGCCCCAGACAACAGTCGAGGATTTTTCGGCATGGTCTTTGGTGCGTGGTTTCTCAATTATGCGTCGAGCAAAGACCAATTGGTTGCCATGTTCACAAATATCTCCCTCAACCTCACAGTGGATGGCGTTTTTGTGGGAGTTGTCCCTTATCCAACAAATGATATCCAGGAACGGGCCAAAAACTCTGAGCAAATCTCACTACGGCAGTATTTCCCACGCAACGAGTATGTAGAAGAGCTGAGCTCAGGCGACGGCTGGAGCTTACGGGTATTTCAAGTGATGATGGGGTGGATTTCATGACCTGGCATATGAAGAAAAGCGTTTACGAGGAAGCCGCAAGACTGGGAGGTTTGCGGGGCAAATTGGAGTGGCGCGATGAGACTCTGGGAACAGAAAATTCAGAGCAGCAATTTGGTTTGACAGCTGACGAGTGGTCAATCAGAGTACAAAACCCTCACTTGGGAATATTGGTCGCCTGGAAGAGTTGGTTCAATACTGGAAAGAGAACAAATCGTTAAGCTGAGAGTAGTAACAGGAACGGAGGCTATGGAGCATCTTCATAATACCTTTGGCAGTTAACAAACGTCATCAATTCATAACTGAGCAACTATATACCTGATACTAGACTGGTTATCATGGTATCTTAGGAGAGGGGTTGAAGAAGGTAAGTGTTTTAGTTGCGGTTGTAA
SEQ ID NO:2(orf4)多肽序列:
MSQYDSIGASYNVLEQLAYRAVEKYNVYTTIKPLLRPGVHVLDLACGTGFYSSHLLAWGAEDVLGVDISSAMLESAVSRLSSEISAGKAQFVLGDGVIPQSFAPDNSRGFFGMVFGAWFLNYASSKDQLVAMFTNISLNLTVDGVFVGVVPYPTNDIQERAKNSEQISLRQYFPRNELVIMLRL
实施例2、orf4功能研究
1、里氏木霉CRISPR/Cas9底盘细胞的构建
通过对酿脓链球菌(Streptococcus pyogenes)的Cas9编码基因的密码子进行分析,以人工优化的Cas9基因(SEQ ID NO:4;其中第4117-4137位为入核信号)为模板,通过引物Cas9F(ATGGACAAGAAGTACAGCATTGG(SEQ ID NO:8))和Cas9R(TTAGACCTTGCGCTTCTTCTTGGG(SEQ ID NO:9))PCR分离获得该序列。以pDHt/sk载体为骨架,构建密码子优化的Cas9基因表达载体。
以里氏木霉基因组DNA为模板,利用正向引物:5’ACGACGGCCAGTGCCAAGCTTAGGACTTCCAGGGCTACTTG 3’(SEQ ID NO:10)和反向引物:5’GATTGTGCTGTAGCTGCGCTGCTTTGATCGTTTTGAGGTGC 3’(SEQ ID NO:11)扩增获得Ppdc作为优化的Cas9基因表达的启动子。
以里氏木霉基因组DNA为模板,利用正向引物5’AGAAGAAGAGGAAGGTGTGACCCGGCATGAAGTCTGACCG 3’(SEQ ID NO:16)和反向引物5’TAATTGCGCGGATCCTCTAGATGGACGCCTCGATGTCTTCC 3’(SEQ ID NO:17)扩增Tpdc作为终止子。
采用一步法克隆的方法构建载体,通过验证序列成功构建包含诱导型启动子的重组载体pDHt/sk-ppdc-Cas9-tpdc。
将上述构建好的质粒转化入根癌农杆菌AGL1中,在根癌农杆菌的介导下转化入里氏木霉Rut-C30中,用潮霉素作为筛选压力,将长出的转化子接种于SDB培养液中,2天后抽提DNA为模板,通过载体上的启动子处设计正向引物5’TACGTCGGCCCCCTGGCC 3’(SEQ IDNO:12)和Cas9基因的反向引物为5’GAGGTTGTCAAACTTGCGCTGCG 3’(SEQ ID NO:13)PCR鉴定阳性转化子。获得的转化子分别命名为C30-pc。
2.orf4的功能性敲除
建立sgRNA:5’TAATACGACTCACTATAGGGAAAGGCACAATTCGTGTGTTTTAGAGCTAGAAATAGC3’(SEQ ID NO:14),其靶向于SEQ ID NO:1(在基因的第257-277位发生剪切破坏其基因功能)。
将构建好的sgRNA连入pMD18T(购自Takara公司,Amp抗性),以此为模板,获得的体外转录模板用苯酚:氯仿:异戊醇(25:24:1)抽提纯化,最后用Nuclease free water溶解DNA,即可用于gRNA的体外转录。
gRNA的体外转录,具体方法参考试剂盒说明书。将C30-pc孢子接种于PDA平板,28℃培养7d,使用0.85%NaCl+0.02%吐温80洗下孢子。用灭菌的镊子在PDA平板上盖一张玻璃纸,将其铺平,在玻璃纸上加入150ul左右的孢子悬液,用玻棒涂布均匀,做6个同样的处理,28度培养约14h。用裂解酶(Sigma#L1412 lysing enzymes)裂解细胞壁,除去细胞壁碎片后即可获得原生质体悬液,与纯化gRNA片段混合后用PEG介导的原生质转化法转化,涂布于的筛选平板,置28℃培养,5天左右可观察到菌丝长出。待转化子长出后,验证后即可获得对应sgRNA的敲除子。
3.敲除子酶活筛选
(1)将里氏木霉菌种接种在土豆培养基制成的平板上置于28℃培养7天后,制成浓度为106~108个/mL的孢子悬液,按10%(v/v)的接种量接入沙堡氏培养基(种子培养基(v/v):1%酵母膏,1%蛋白胨,4%葡萄糖)中,28℃,200rpm振荡培养,得种子液,然后将种子液以10%(v/v)接种量接入液体发酵培养基中,初始pH 5.0,装液量为10mL于50mL三角瓶,于28℃,200rpm摇床中培养5-7天;
(2)将步骤(1)获得的发酵液离心分离,取上清液作为粗酶液;所述的发酵培养基是含有5%诱导物(3%微晶纤维素和2%麸皮)的无机盐培养液(0.4%KH2PO4,0.28%(NH4)2SO4,0.06%MgSO4·7H2O,0.05%CaCl2,0.06%尿素,0.3%蛋白胨,0.1%Tween 80,0.5%CaCO3,0.001%FeSO4·7H2O,0.00032%MnSO4·H2O,0.00028%ZnSO4·7H2O,0.0004%CoCl2),
(3)在0.05mol/L pH5.0的醋酸-醋酸钠或柠檬酸-柠檬酸钠缓冲液中,将里氏木霉出发株及其敲除子粗酶液加入到滤纸中,50~60℃水解60min,测定滤纸上的纤维素酶的酶活(单位:Filter Paper Unit,FPU)。测得酶活如表1。
表1、里氏木霉转化子及其敲除菌株的滤纸酶活
由表1结果可见,orf4敲除使得纤维素酶的酶活显著性升高,因此其与纤维素酶的产酶相关,是关键基因。
实施例3、SEQ ID NO:1编码的蛋白的互作蛋白筛选
利用酵母双杂交载体pGADT7 AD Vector,pGBKT7 DNA-BD Vector(购于Takara公司)分别构建orf4(即SEQ ID NO:1)和其他已知转录调控因子包括:ACE1(GenBank登录号AF190793),Vel1(GenBank登录号XM_006966084),Hda1(GenBank登录号XP_006968036),Cre1(GenBank登录号AAB01677),Lae1(GenBank登录号AFX86442等,得到AD-orf4,AD-ACE1,AD-Vel1,AD-Hda1,AD-Cre1,AD-Lae1,AD-Cre1以及BD-orf4,BD-ACE1,BD-Vib1,BD-Hda1,BD-Cre1,BD-Lae1,BD-Cre1等。将不同组合的AD和BD载体转化的AH109酵母菌株(购自Takara),并分别做好空白对照,分别涂布于含有SD+Dropout+Ade+His培养基筛选平板。30℃培养2~3天后,挑取菌落少许至SD+Dropout+3-AT+X-α-GAL营养缺陷培养基,30℃培养2-3天,观察酵母菌能生长同时变蓝色的说明两个蛋白之间存在互作。
结果发现,共转化AD-orf4和BD-ACE1的转化子能够生长并显蓝色,如图1所示,说明orf4和ACE1存在互作。
ACE1是一个纤维素酶抑制因子,该结果提示orf4通过对ACE1某些位点进行修饰,进而激活其抑制纤维素酶转录的功能。
实施例4、利用orf4表达异源蛋白
以实施例2获得的C30-pc菌株为出发株,利用CRISPR/Cas9技术将阿魏酸酯酶FEA(SEQ ID NO:5所示的核苷酸)同源替换里氏木霉cbh1位置,继而敲除SEQ ID NO:1,获得的转化子命名为U10FEA1~U10FEA5,仅同源替换里氏木霉cbh1位置而不敲除SEQ ID NO:1的菌株作为对照(FEAΔ1)。将转化子和C30-pc接种到在土豆培养基制成的平板上置于28℃培养7天后,制成浓度为106~108个/mL的孢子悬液,按10%(v/v)的接种量接入沙堡氏培养基(种子培养基(v/v):1%酵母膏,1%蛋白胨,4%葡萄糖)中,28℃,200rpm振荡培养,得种子液,然后将种子液以10%(v/v)接种量接入液体发酵培养基中,初始pH 5.0,装液量为10mL于50mL三角瓶,于28℃,200rpm摇床中培养5-7天;获得的发酵液离心分离,取上清液作为粗酶液;所述的发酵培养基是含有5%诱导物(3%微晶纤维素和2%麸皮)的无机盐培养液(0.4%KH2PO4,0.28%(NH4)2SO4,0.06%MgSO4·7H2O,0.05%CaCl2,0.06%尿素,0.3%蛋白胨,0.1%Tween 80,0.5%CaCO3,0.001%FeSO4·7H2O,0.00032%MnSO4·H2O,0.00028%ZnSO4·7H2O,0.0004%CoCl2)。将发酵液用0.05mol/L pH6.0的醋酸-醋酸钠缓冲液稀释,取50ul加入到0.625mM pNPF(先用DMSO配成10mM贮存液,取0.5ml溶于7.5ml含有5%TritonX-100的0.05mol/L pH6.0的醋酸-醋酸钠缓冲液中,pNPF的制备按照文献AnalyticalBiochemistry387(2009)128-129所述制备)中,37℃水解10min,离心后取上清液于OD405读数。结果测得阿魏酸酯酶FEA酶活如表2所示。
表2、C30-pc及其转化子的pNPF酶活
由表2结果可见,替换cbh1后进一步敲除orf4(SEQ ID NO:1)可以获得比仅替换cbh1位点更高的异源蛋白表达量。敲除orf4是提高异源蛋白表达产量的一个优选策略。
实施例5、通过对内源β-葡萄糖苷酶的操作提高胞外分泌蛋白水平
以里氏木霉菌株Qm9414(购自ATCC 26921)为出发株,用实施例1中的方法,敲除β-葡萄糖苷酶编码基因(SEQ ID NO:3所示的核苷酸),设计sgRNA:5’TAATACGACTCACTATAGGC AAGTATCCGTATCCCGAGTTTTAGAGCTAGAAATAGC 3’(SEQ ID NO:19),对SEQ ID NO:3所示的核苷酸的第2607-2626位进行剪切,破坏其功能,获得转化子Δbgl1~Δbgl3。
β-葡萄糖苷酶的基因(SEQ ID NO:3):
ATGACCTCGTTTCACGACGGCGTCAAGTTGTCCACTGTGACGTGTGTCCTGTCCGGCTTGGTGGCTCTAGGAAGTGCCGGACCTACTGCTGCGTCGGCAAATGCACAAGTAGCGGCGGCTGCGGCTGCCCAAGCCTGGGTTCCAGATGGCTACTATGTGCCGCCGTATTATCCTGCGCCGTACGGTGGATGGGTGGAGGATTGGCAAGAGAGTTATACCAAGGCCAAGGCTCTGGTCGATTCCATGACTCTGGCAGAAAAGACCAATATCACTGCAGGAACTGGCATCTATATGGGTGAGTGGTGGCTTCCTTGAGGATGAGAGGCAGTTTCAGACTGTGTGTGGATGAGCGAGTAATTGTATGCTAACCAGCGAGAACAGGGTGAGTTATTTGGAGCCTTGCCAACGTACAGACCGAAGCCTCTTCAAGGGATGAAAGGCTTTTGTCAGAACTTCAGCAAACGGGTCTTGTACACCATATGCTGGGATGATGATAACCCATTGAGAGATTATAAACTAACACTGATCAAGGCGATGTGCCGGAAACACAGGCTCTGCGTTTCGAGTTTCCTTTCCCCAGCTGTGTCTCAACGACAGCCCGGCGGGTGTCCGACATGCGGACAATGTAACAGCTTTTCCTGACGGAATCACCGTCGGCGCAACGTTCGACAAGGCCCTCATGTACAAGCGCGGCGTGGCCATTGGAAAGGAGAACCGCGGAAAGGGCGTCAATGTCTGGCTGGGACCAACTGTTGGGCCGATAGGCCGCAAGCCAAAGGGCGGCCGCAACTGGGAAGGCTTCGGTGCTGATCCTGTGCTCCAGGCCGTCGGTGCTCGAGAGACGATCAAGGGCGTTCAAGAGCAGGGGGTCATTGCGACTATCAAACACTTTATCGGCAACGAGCAGGAGATGTACCGCATGTACAATCCCTTTCAATACGCATACAGCTCAAATATTGGTACGTGTTTCCAGCGGAATGAAGAGTAAAAGTGAGATAAGATGCTGACAATGAGTTGACGGGTAAAAGACGATAGGACGCTGCATGAAGTATATGCCTGGCCATTCGCCGAGGGCATCCGAGCGGGTGTTGGCGCCGTCATGATGGCTTACAATGCTGTAAGTGATTCTGAGACGAGTGGCTTGATTGTCCAGCAGCTAATTGCTTTTCTCCCGCCAGGTGAACGGGACTGCGTGCTCTCAACACCCATATCTTATGAGTGCCCTTCTCAAGGATGAGATGGGCTTTCAAGGCTTCATCATGACCGACTGGCTCGCTCACATGTCTGGCGTCGCTTCTGCCATTGCCGGCCTGGATATGGACATGCCTGGCGACGTTCAGATTCCCTTCTTCGGCGGCTCATACTGGATGTATGAGCTCACTCGGTCTGCTCTTAATGGTTCGGTTCCCATGGATCGCATCAACGACGCGGCTACACGCATAGCCGCTGCGTGGTACAAGATGGGCCAGGACAAGGGATTCCCCGCGACGAATTTCGACACCAACTCTCGTGCTGCCTTCAACCCGCTGTATCCTGCCGCGCTGCCACTTTCGCCATTTGGCATCACAAACGAGTTTGTACCGGTCCAAGACGATCACGACGTAATTGCTCGCCAAATTTCTCAGGAAGCAATCACCCTGTTGAAGAACGACGGCGACATCCTCCCTCTATCGCCTTCGCAGCACCTCAAGGTTTTCGGCACCGATGCTCAGAAAAACCCAGACGGCATCAATTCATGCACAGATCGCAATTGCAACAAGGGAACTCTGGGCCAAGGATGGGGCTCGGGAACTGTCGATTACCCCTATTTGGACGATCCCATCTCAGCTATCACTGCAGAGGCCGACAATGTCACGTTTTACAACACGGACAAGTTCCCTTCTGTCGGCGAAGTATCAGATAGCGACGTCGCCATCGTCTTTGTCAACTCGGATGCTGGCGAAAACACCTACACCGTGGAAGGCAACCACGGCGACCGTGACAAATCCGGCTTGTACGCATGGCACGACGGCGATAAGCTTGTGCAGGACGCCGCGAGCAAGTTCAGCAACGTCATTGTCGTGATACACACTGTTGGCCCCCTGATCCTCGAGAAGTGGATCGATCTGCCATCCGTCAAGGCGGTGCTCGTCGCACATCTGCCCGGCCAGGAAGCCGGCAAGTCTCTTACAAACGTCCTCTTTGGACACGCCTCGCCGTGCGGACACTTGCCTTATTCCATCACGAAAGAGGAGGACGACCTTCCCAAAAGCGTCACTACGCTCATTGACTCGGAGTTCCTCAACCAGCCCCAAGACACATACACAGAGGGTCTGTACATTGACTACCGCTGGCTCAACAAGAACAAGACCAAGCCTCGCTACGCCTTTGGCCACGGTCTGAGCTACACAAACTTCACCTTCAAGGCAGCATCCATCAAACAGGTCGCCAGGCTGAGCGCATACCCGCCGGCTCGCCCAGCCAAGGGCAGCACCCCCGATTTTGCGCAATCCATTCCATCCGCAAGCGAAGCCGTTGCGCCGTCAGGCTTCGGCAAGATCCCCCGCTACATCTACTCTTGGCTGTCGCAGGGCGATGCCAACCGGGCCATTTCAGACGGCAAGACGGGCA AGTATCCGTATCCCGACGGCTACTCTACCACCCAGAAGCCCGGCGCTCGCGCTGGCGGCGGCGAGGGAGGCAACCCCGCGCTCTGGGACGTTGCGTACAGCCTCACGGTGACGGTGCAGAATACGGGCGATGAGTATGCTGGCAAGGCCTCCGTGCAGGCATATCTCCAATTCCCGGACGATATCGACTACGATACGCCAATCATTCAGCTCCGTGACTTTGAAAAGACAAAGGAGCTAAAGCCGGGGGAGACGACGACTGTGACGCTGACTCTTACACGCAAGGACGTCAGTGTGTGGGACGTGGTGGCGCAGGATTGGAAGGTTCCTGCGGTTGACGGAGGGTATAAGGTGTGGATTGGGGATGCCAGTGATTCGTTAAGTATTGTATGTCATACGGATACGTTGGAGTGTGAGACTGGTGTTGTTGGGCCTGTGTAG
β-葡萄糖苷酶的蛋白(SEQ ID NO:15):
MTSFHDGVKLSTVTCVLSGLVALGSAGPTAASANAQVAAAAAAQAWVPDGYYVPPYYPAPYGGWVEDWQESYTKAKALVDSMTLAEKTNITAGTGIYMGERCAGNTGSAFRVSFPQLCLNDSPAGVRHADNVTAFPDGITVGATFDKALMYKRGVAIGKENRGKGVNVWLGPTVGPIGRKPKGGRNWEGFGADPVLQAVGARETIKGVQEQGVIATIKHFIGNEQEMYRMYNPFQYAYSSNIDDRTLHEVYAWPFAEGIRAGVGAVMMAYNAVNGTACSQHPYLMSALLKDEMGFQGFIMTDWLAHMSGVASAIAGLDMDMPGDVQIPFFGGSYWMYELTRSALNGSVPMDRINDAATRIAAAWYKMGQDKGFPATNFDTNSRAAFNPLYPAALPLSPFGITNEFVPVQDDHDVIARQISQEAITLLKNDGDILPLSPSQHLKVFGTDAQKNPDGINSCTDRNCNKGTLGQGWGSGTVDYPYLDDPISAITAEADNVTFYNTDKFPSVGEVSDSDVAIVFVNSDAGENTYTVEGNHGDRDKSGLYAWHDGDKLVQDAASKFSNVIVVIHTVGPLILEKWIDLPSVKAVLVAHLPGQEAGKSLTNVLFGHASPCGHLPYSITKEEDDLPKSVTTLIDSEFLNQPQDTYTEGLYIDYRWLNKNKTKPRYAFGHGLSYTNFTFKAASIKQVARLSAYPPARPAKGSTPDFAQSIPSASEAVAPSGFGKIPRYIYSWLSQGDANRAISDGKTGKYPYPDGYSTTQKPGARAGGGEGGNPALWDVAYSLTVTVQNTGDEYAGKASVQAYLQFPDDIDYDTPIIQLRDFEKTKELKPGETTTVTLTLTRKDVSVWDVVAQDWKVPAVDGGYKVWIGDASDSLSIVCHTDTLECETGVVGPV
将转化子和Qm9414接种到在土豆培养基制成的平板上置于28℃培养7天后,制成浓度为106~108个/mL的孢子悬液,按10%(v/v)的接种量接入沙堡氏培养基(种子培养基(v/v):1%酵母膏,1%蛋白胨,4%葡萄糖)中,28℃,200rpm振荡培养,得种子液,然后将种子液以10%(v/v)接种量接入液体发酵培养基中,初始pH 5.0,装液量为10mL于50mL三角瓶,于28℃,200rpm摇床中培养5-7天;获得的发酵液离心分离,取上清液作为粗酶液;所述的发酵培养基是含有5%诱导物(3%微晶纤维素和2%麸皮)的无机盐培养液(0.4%KH2PO4,0.28%(NH4)2SO4,0.06%MgSO4·7H2O,0.05%CaCl2,0.06%尿素,0.3%蛋白胨,0.1%Tween80,0.5%CaCO3,0.001%FeSO4·7H2O,0.00032%MnSO4·H2O,0.00028%ZnSO4·7H2O,0.0004%CoCl2)。酶活检测具体如表3所述。
表3、Qm9414及其转化子的发酵液的纤维素酶酶活
由表3的结果可见,敲除内源β-葡萄糖苷酶可显著提高内源纤维素酶的产量。
实施例6、通过对内源β-葡萄糖苷酶的操作进一步提高外源蛋白产量
以过表达外源β葡萄糖苷酶trbgls的里氏木霉菌株(专利号201210258194.6,以RC30-8为出发株),用实施例1中的方法进行对SEQ ID NO:1序列进行敲除,获得U10。以U10为出发株,将内源β-葡萄糖苷酶编码基因SEQ ID NO:3所示的核苷酸敲除(sgRNA:5’TAATACGACTCACTATAGGCAAGTATCCGTATCCCGAGTTTTAGAGCTAGAAATAGC 3’(SEQ ID NO:18);剪切位置:SEQ ID NO:3所示的核苷酸的第2607-2626位),获得蛋白功能被破坏的菌株,获得的转化子命名为U10ΔB1。
将转化子和U10接种到在土豆培养基制成的平板上置于28℃培养7天后,制成浓度为106~108个/mL的孢子悬液,按10%(v/v)的接种量接入沙堡氏培养基(种子培养基(v/v):1%酵母膏,1%蛋白胨,4%葡萄糖)中,28℃,200rpm振荡培养,得种子液,然后将种子液以10%(v/v)接种量接入液体发酵培养基中,初始pH 5.0,装液量为10mL于50mL三角瓶,于28℃,200rpm摇床中培养5-7天;获得的发酵液离心分离,取上清液作为粗酶液;所述的发酵培养基是含有5%诱导物(3%微晶纤维素和2%麸皮)的无机盐培养液(0.4%KH2PO4,0.28%(NH4)2SO4,0.06%MgSO4·7H2O,0.05%CaCl2,0.06%尿素,0.3%蛋白胨,0.1%Tween 80,0.5%CaCO3,0.001%FeSO4·7H2O,0.00032%MnSO4·H2O,0.00028%ZnSO4·7H2O,0.0004%CoCl2)。酶活检测具体如下所述:
以对硝基苯-β-D-葡萄糖苷(pNPG)为底物检测异源葡萄糖苷酶的酶活及理化特性,具体操作如下:
(1)对硝基苯酚(pNP)标准曲线制备
取8*3排列的PCR板,按表4加入溶液,共8组,每组3个平行重复。
表4
pNP浓度为10mg/ml。上表每份标样加100μl 1M NaCO3,终止反应并显色,每管吸取100μl于酶标板中,用酶标仪测405nm光吸收,标样编号0为空白对照。各样品值减空白后制备标线。
(2)异源葡萄糖苷酶的标准酶活测定
在100μl反应体系中,加入50μl浓度为4mM的pNPG,然后加入浓度为0.1M的NaAc/HAc(pH5.0)缓冲液稀释至一定稀释度的酶液50μl,60℃反应10分钟,再加入100μl 1MNaCO3终止反应并显色(对照为在上述反应体系中先加入100μl 1M NaCO3后再加酶液),用酶标仪测405nm光吸收值,样品测定值减去对照后利用标准曲线计算酶活单位(U)。结果如表5。
酶活单位(U)定义:1U为每分钟催化pNPG产生1μmol pNP所需的酶量。
表5、U10及其转化子的发酵液的pNPG酶活
由表5的结果可见,敲除内源β-葡萄糖苷酶可以进一步提高外源蛋白(异源葡萄糖苷酶)产量。
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。
序列表
<110> 中国科学院上海生命科学研究院
<120> 调控里氏木霉蛋白表达效率的因子、调控方法及应用
<130> 185371
<160> 19
<170> SIPOSequenceListing 1.0
<210> 1
<211> 995
<212> DNA
<213> 里氏木霉(Trichoderma reesei)
<400> 1
atgtcccaat acgactccat cggcgcctcc tataatgtcc ttgagcaact tgcctacaga 60
gctgtggaga agtacaacgt gtacacaact attaagccac tactcagacc gggggtgcat 120
gttctagatc ttgcctgcgg taccggcttc tactcatcac atcttcttgc atggggcgct 180
gaagacgtcc tcggcgtgga tatatcatct gcaatgcttg agagtgccgt atctcgatta 240
tcatcagaga tttctgcggg aaaggcacaa ttcgtgttgg gggatggtgt cattccacag 300
tcgttcgccc cagacaacag tcgaggattt ttcggcatgg tctttggtgc gtggtttctc 360
aattatgcgt cgagcaaaga ccaattggtt gccatgttca caaatatctc cctcaacctc 420
acagtggatg gcgtttttgt gggagttgtc ccttatccaa caaatgatat ccaggaacgg 480
gccaaaaact ctgagcaaat ctcactacgg cagtatttcc cacgcaacga gtatgtagaa 540
gagctgagct caggcgacgg ctggagctta cgggtatttc aagtgatgat ggggtggatt 600
tcatgacctg gcatatgaag aaaagcgttt acgaggaagc cgcaagactg ggaggtttgc 660
ggggcaaatt ggagtggcgc gatgagactc tgggaacaga aaattcagag cagcaatttg 720
gtttgacagc tgacgagtgg tcaatcagag tacaaaaccc tcacttggga atattggtcg 780
cctggaagag ttggttcaat actggaaaga gaacaaatcg ttaagctgag agtagtaaca 840
ggaacggagg ctatggagca tcttcataat acctttggca gttaacaaac gtcatcaatt 900
cataactgag caactatata cctgatacta gactggttat catggtatct taggagaggg 960
gttgaagaag gtaagtgttt tagttgcggt tgtaa 995
<210> 2
<211> 184
<212> PRT
<213> 里氏木霉(Trichoderma reesei)
<400> 2
Met Ser Gln Tyr Asp Ser Ile Gly Ala Ser Tyr Asn Val Leu Glu Gln
1 5 10 15
Leu Ala Tyr Arg Ala Val Glu Lys Tyr Asn Val Tyr Thr Thr Ile Lys
20 25 30
Pro Leu Leu Arg Pro Gly Val His Val Leu Asp Leu Ala Cys Gly Thr
35 40 45
Gly Phe Tyr Ser Ser His Leu Leu Ala Trp Gly Ala Glu Asp Val Leu
50 55 60
Gly Val Asp Ile Ser Ser Ala Met Leu Glu Ser Ala Val Ser Arg Leu
65 70 75 80
Ser Ser Glu Ile Ser Ala Gly Lys Ala Gln Phe Val Leu Gly Asp Gly
85 90 95
Val Ile Pro Gln Ser Phe Ala Pro Asp Asn Ser Arg Gly Phe Phe Gly
100 105 110
Met Val Phe Gly Ala Trp Phe Leu Asn Tyr Ala Ser Ser Lys Asp Gln
115 120 125
Leu Val Ala Met Phe Thr Asn Ile Ser Leu Asn Leu Thr Val Asp Gly
130 135 140
Val Phe Val Gly Val Val Pro Tyr Pro Thr Asn Asp Ile Gln Glu Arg
145 150 155 160
Ala Lys Asn Ser Glu Gln Ile Ser Leu Arg Gln Tyr Phe Pro Arg Asn
165 170 175
Glu Leu Val Ile Met Leu Arg Leu
180
<210> 3
<211> 3050
<212> DNA
<213> 里氏木霉(Trichoderma reesei)
<400> 3
atgacctcgt ttcacgacgg cgtcaagttg tccactgtga cgtgtgtcct gtccggcttg 60
gtggctctag gaagtgccgg acctactgct gcgtcggcaa atgcacaagt agcggcggct 120
gcggctgccc aagcctgggt tccagatggc tactatgtgc cgccgtatta tcctgcgccg 180
tacggtggat gggtggagga ttggcaagag agttatacca aggccaaggc tctggtcgat 240
tccatgactc tggcagaaaa gaccaatatc actgcaggaa ctggcatcta tatgggtgag 300
tggtggcttc cttgaggatg agaggcagtt tcagactgtg tgtggatgag cgagtaattg 360
tatgctaacc agcgagaaca gggtgagtta tttggagcct tgccaacgta cagaccgaag 420
cctcttcaag ggatgaaagg cttttgtcag aacttcagca aacgggtctt gtacaccata 480
tgctgggatg atgataaccc attgagagat tataaactaa cactgatcaa ggcgatgtgc 540
cggaaacaca ggctctgcgt ttcgagtttc ctttccccag ctgtgtctca acgacagccc 600
ggcgggtgtc cgacatgcgg acaatgtaac agcttttcct gacggaatca ccgtcggcgc 660
aacgttcgac aaggccctca tgtacaagcg cggcgtggcc attggaaagg agaaccgcgg 720
aaagggcgtc aatgtctggc tgggaccaac tgttgggccg ataggccgca agccaaaggg 780
cggccgcaac tgggaaggct tcggtgctga tcctgtgctc caggccgtcg gtgctcgaga 840
gacgatcaag ggcgttcaag agcagggggt cattgcgact atcaaacact ttatcggcaa 900
cgagcaggag atgtaccgca tgtacaatcc ctttcaatac gcatacagct caaatattgg 960
tacgtgtttc cagcggaatg aagagtaaaa gtgagataag atgctgacaa tgagttgacg 1020
ggtaaaagac gataggacgc tgcatgaagt atatgcctgg ccattcgccg agggcatccg 1080
agcgggtgtt ggcgccgtca tgatggctta caatgctgta agtgattctg agacgagtgg 1140
cttgattgtc cagcagctaa ttgcttttct cccgccaggt gaacgggact gcgtgctctc 1200
aacacccata tcttatgagt gcccttctca aggatgagat gggctttcaa ggcttcatca 1260
tgaccgactg gctcgctcac atgtctggcg tcgcttctgc cattgccggc ctggatatgg 1320
acatgcctgg cgacgttcag attcccttct tcggcggctc atactggatg tatgagctca 1380
ctcggtctgc tcttaatggt tcggttccca tggatcgcat caacgacgcg gctacacgca 1440
tagccgctgc gtggtacaag atgggccagg acaagggatt ccccgcgacg aatttcgaca 1500
ccaactctcg tgctgccttc aacccgctgt atcctgccgc gctgccactt tcgccatttg 1560
gcatcacaaa cgagtttgta ccggtccaag acgatcacga cgtaattgct cgccaaattt 1620
ctcaggaagc aatcaccctg ttgaagaacg acggcgacat cctccctcta tcgccttcgc 1680
agcacctcaa ggttttcggc accgatgctc agaaaaaccc agacggcatc aattcatgca 1740
cagatcgcaa ttgcaacaag ggaactctgg gccaaggatg gggctcggga actgtcgatt 1800
acccctattt ggacgatccc atctcagcta tcactgcaga ggccgacaat gtcacgtttt 1860
acaacacgga caagttccct tctgtcggcg aagtatcaga tagcgacgtc gccatcgtct 1920
ttgtcaactc ggatgctggc gaaaacacct acaccgtgga aggcaaccac ggcgaccgtg 1980
acaaatccgg cttgtacgca tggcacgacg gcgataagct tgtgcaggac gccgcgagca 2040
agttcagcaa cgtcattgtc gtgatacaca ctgttggccc cctgatcctc gagaagtgga 2100
tcgatctgcc atccgtcaag gcggtgctcg tcgcacatct gcccggccag gaagccggca 2160
agtctcttac aaacgtcctc tttggacacg cctcgccgtg cggacacttg ccttattcca 2220
tcacgaaaga ggaggacgac cttcccaaaa gcgtcactac gctcattgac tcggagttcc 2280
tcaaccagcc ccaagacaca tacacagagg gtctgtacat tgactaccgc tggctcaaca 2340
agaacaagac caagcctcgc tacgcctttg gccacggtct gagctacaca aacttcacct 2400
tcaaggcagc atccatcaaa caggtcgcca ggctgagcgc atacccgccg gctcgcccag 2460
ccaagggcag cacccccgat tttgcgcaat ccattccatc cgcaagcgaa gccgttgcgc 2520
cgtcaggctt cggcaagatc ccccgctaca tctactcttg gctgtcgcag ggcgatgcca 2580
accgggccat ttcagacggc aagacgggca agtatccgta tcccgacggc tactctacca 2640
cccagaagcc cggcgctcgc gctggcggcg gcgagggagg caaccccgcg ctctgggacg 2700
ttgcgtacag cctcacggtg acggtgcaga atacgggcga tgagtatgct ggcaaggcct 2760
ccgtgcaggc atatctccaa ttcccggacg atatcgacta cgatacgcca atcattcagc 2820
tccgtgactt tgaaaagaca aaggagctaa agccggggga gacgacgact gtgacgctga 2880
ctcttacacg caaggacgtc agtgtgtggg acgtggtggc gcaggattgg aaggttcctg 2940
cggttgacgg agggtataag gtgtggattg gggatgccag tgattcgtta agtattgtat 3000
gtcatacgga tacgttggag tgtgagactg gtgttgttgg gcctgtgtag 3050
<210> 4
<211> 4140
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
atggacaaga agtacagcat tggcctggac attggcacga actcggtcgg ctgggccgtc 60
atcacggacg agtacaaggt cccctccaag aagtttaagg tcctgggcaa caccgaccgc 120
cactccatca agaagaacct cattggcgcc ctgctcttcg actccggcga gaccgccgag 180
gccacccgcc tcaagcgcac cgcccgccgc cgatacacgc gccgcaagaa ccgcatctgc 240
tacctgcagg agattttctc caacgagatg gccaaggtcg acgactcctt ctttcaccgc 300
ctggaggagt cgttcctcgt cgaggaagac aagaagcacg agcgccaccc catctttggc 360
aacattgtcg acgaggtcgc ctaccacgag aagtacccca cgatctacca cctgcgcaag 420
aagctcgtcg actccaccga caaggccgac ctccgcctga tctacctcgc cctggcccac 480
atgattaagt tccgcggcca ctttctgatc gagggcgacc tcaaccccga caacagcgac 540
gtcgacaagc tgttcatcca gctcgtccag acctacaacc agctctttga ggagaacccc 600
attaacgcct ccggcgtcga cgccaaggcc atcctctcgg cccgcctctc caagagccgc 660
cgactcgaga acctgatcgc ccagctgccc ggcgagaaga agaacggcct gttcggcaac 720
ctcatcgccc tctccctggg cctcaccccc aacttcaagt cgaactttga cctcgccgag 780
gacgccaagc tgcagctctc caaggacacc tacgacgacg acctggacaa cctcctggcc 840
cagatcggcg accagtacgc cgacctgttc ctcgccgcca agaacctgtc cgacgccatc 900
ctcctgtcgg acattctccg cgtcaacacc gagattacga aggcccctct ctccgcctcg 960
atgatcaagc gctacgacga gcaccaccag gacctgaccc tgctcaaggc cctggtccgc 1020
cagcagctcc ccgagaagta caaggagatc ttctttgacc agagcaagaa cggctacgcc 1080
ggctacatcg acggcggcgc tagccaagag gagttctaca agtttatcaa gcccattctg 1140
gagaagatgg acggcacgga ggagctcctg gtcaagctca accgcgagga cctcctgcgc 1200
aagcagcgca ccttcgacaa cggcagcatc ccccaccaga ttcacctcgg cgagctgcac 1260
gccatcctcc gccgacaaga ggacttctac ccctttctca aggacaaccg cgagaagatc 1320
gagaagattc tgacgttccg catcccctac tacgtcggcc ccctggcccg cggcaacagc 1380
cgctttgcct ggatgacccg caagtccgag gagaccatca cgccctggaa cttcgaggaa 1440
gtcgtcgaca agggcgcctc ggcccagtcc ttcatcgagc gcatgaccaa ctttgacaag 1500
aacctgccca acgagaaggt cctccccaag cactcgctcc tgtacgagta cttcaccgtc 1560
tacaacgagc tcacgaaggt caagtacgtc accgagggca tgcgcaagcc cgccttcctg 1620
tcgggcgagc agaagaaggc catcgtcgac ctcctgttta agaccaaccg caaggtcacg 1680
gtcaagcagc tcaaggaaga ctacttcaag aagattgagt gctttgacag cgtcgagatc 1740
tccggcgtcg aggaccgctt taacgcctcc ctgggcacct accacgacct cctgaagatc 1800
attaaggaca aggacttcct ggacaacgag gagaacgagg acatcctcga ggacattgtc 1860
ctgaccctca cgctgtttga ggaccgcgag atgatcgagg agcgcctgaa gacgtacgcc 1920
cacctcttcg acgacaaggt catgaagcag ctcaagcgcc gccgatacac cggctggggc 1980
cgcctgagcc gcaagctcat caacggcatt cgcgacaagc agtcgggcaa gacgatcctc 2040
gacttcctga agagcgacgg cttcgccaac cgcaacttta tgcagctgat tcacgacgac 2100
tccctcacct tcaaggaaga catccagaag gcccaggtct ccggccaggg cgactccctg 2160
cacgagcaca tcgccaacct cgccggcagc cccgccatca agaagggcat tctgcagacc 2220
gtcaaggtcg tcgacgagct cgtcaaggtc atgggccgcc acaagcccga gaacatcgtc 2280
attgagatgg cccgcgagaa ccagaccacg cagaagggcc agaagaacag ccgcgagcgc 2340
atgaagcgca tcgaggaagg catcaaggag ctgggctccc agatcctcaa ggagcacccc 2400
gtcgagaaca cccagctgca gaacgagaag ctctacctgt actacctcca gaacggccgc 2460
gacatgtacg tcgaccagga gctggacatt aaccgcctct cggactacga cgtcgaccac 2520
atcgtccccc agagcttcct gaaggacgac tccatcgaca acaaggtcct cacccgcagc 2580
gacaagaacc gcggcaagag cgacaacgtc ccctccgagg aagtcgtcaa gaagatgaag 2640
aactactggc gccagctcct gaacgccaag ctgatcacgc agcgcaagtt tgacaacctc 2700
accaaggccg agcgaggcgg cctctcggag ctggacaagg ccggcttcat caagcgccag 2760
ctggtcgaga cccgccagat cacgaagcac gtcgcccaga ttctcgactc gcgcatgaac 2820
acgaagtacg acgagaacga caagctgatc cgcgaggtca aggtcattac cctgaagtcg 2880
aagctcgtca gcgacttccg caaggacttc cagttttaca aggtccgcga gatcaacaac 2940
taccaccacg cccacgacgc ctacctcaac gccgtcgtcg gcaccgccct gatcaagaag 3000
taccccaagc tcgagtccga gttcgtctac ggcgactaca aggtctacga cgtccgcaag 3060
atgatcgcca agtccgagca ggagattggc aaggccaccg ccaagtactt cttttactcg 3120
aacatcatga acttctttaa gaccgagatc accctcgcca acggcgagat ccgcaagcgc 3180
cccctcattg agaccaacgg cgagaccggc gagatcgtct gggacaaggg ccgcgacttc 3240
gccaccgtcc gcaaggtcct cagcatgccc caggtcaaca tcgtcaagaa gaccgaggtc 3300
cagacgggcg gcttctcgaa ggagagcatt ctgcccaagc gcaactccga caagctcatc 3360
gcccgcaaga aggactggga ccccaagaag tacggtggct tcgactcccc caccgtcgcc 3420
tactcggtcc tggtcgtcgc caaggtcgag aagggcaagt cgaagaagct caagagcgtc 3480
aaggagctcc tgggcatcac cattatggag cgcagctcct tcgagaagaa ccccatcgac 3540
tttctcgagg ccaagggcta caaggaagtc aagaaggacc tgatcattaa gctccccaag 3600
tactccctct tcgagctgga gaacggccgc aagcgcatgc tcgcctccgc cggcgagctc 3660
cagaagggca acgagctcgc cctgcccagc aagtacgtca acttcctcta cctggccagc 3720
cactacgaga agctcaaggg ctcccccgag gacaacgagc agaagcagct gtttgtcgag 3780
cagcacaagc actacctcga cgagatcatt gagcagattt ccgagttctc gaagcgcgtc 3840
atcctggccg acgccaacct ggacaaggtc ctcagcgcct acaacaagca ccgcgacaag 3900
cccatccgcg agcaggccga gaacatcatt cacctcttca ccctgaccaa cctcggcgcc 3960
cccgccgcct tcaagtactt tgacaccacg atcgaccgca agcgctacac ctcgacgaag 4020
gaagtcctgg acgccaccct catccaccag agcattaccg gcctctacga gacgcgcatc 4080
gacctcagcc agctcggcgg cgactcccgc gccgacccca agaagaagcg caaggtctaa 4140
<210> 5
<211> 846
<212> DNA
<213> 链霉菌属(Streptomyces sp. )
<400> 5
atgaagcaat tctctgcaaa attcgctctc gcgctgtctg cggccgccgg gcaggccttg 60
gcggcctcta cgcagggcat ctccgaagac ctctacaacc gtctggttga gatggccacc 120
atctcgcaag cggcctacgc caacatgtgc aacatcccct cgaccatcac cgtcggggag 180
aagatctaca acgcccagac cgacatcaac ggatgggtgc tccgcgacga cagcaccaag 240
gaaatcatca ccgtcttccg cggcaccggc agcgacacca acctgcagct cgacaccaac 300
tacaccctca cgcccttcag caccttctcc gagtgcagcg gctgcgaggt gcacggcggg 360
tactttatcg gctggtcctc tgtccaagac caagtcatgt cgcttgtcaa agaacaggcc 420
gaccagtacc cggactacac gctgactgtc accggccaca gtctgggagc gtcaatggcg 480
actctcgctg ccgcccagct gtccggtaca tacgacaaca tcaccctgta cacgttcggc 540
gagccgcgca gcggcaacga ggccttcgcg tcgtacatga acgacaagtt caccgccacg 600
agcgcggaca cgaccaagta cttccgggtc actcattcca acgacggtat cccgaacctg 660
cccccggctg agcagggcta tgtccatggc ggtgtggagt actggagcgt ggacccctac 720
agtgcccaga acacctatgt ctgcactggg gatgaggtcc agtgctgtga ggcgcagggc 780
ggacagggcg tcaacgatgc gcacaccact tactttggaa tgaccagcgg agcttgcacc 840
tggtaa 846
<210> 6
<211> 23
<212> DNA
<213> 引物(Primer)
<400> 6
atgtcccaat acgactccat cgg 23
<210> 7
<211> 21
<212> DNA
<213> 引物(Primer)
<400> 7
ttacaaccgc aactaaaaca c 21
<210> 8
<211> 23
<212> DNA
<213> 引物(Primer)
<400> 8
atggacaaga agtacagcat tgg 23
<210> 9
<211> 24
<212> DNA
<213> 引物(Primer)
<400> 9
ttagaccttg cgcttcttct tggg 24
<210> 10
<211> 41
<212> DNA
<213> 引物(Primer)
<400> 10
acgacggcca gtgccaagct taggacttcc agggctactt g 41
<210> 11
<211> 41
<212> DNA
<213> 引物(Primer)
<400> 11
gattgtgctg tagctgcgct gctttgatcg ttttgaggtg c 41
<210> 12
<211> 18
<212> DNA
<213> 引物(Primer)
<400> 12
tacgtcggcc ccctggcc 18
<210> 13
<211> 23
<212> DNA
<213> 引物(Primer)
<400> 13
gaggttgtca aacttgcgct gcg 23
<210> 14
<211> 57
<212> DNA
<213> 向导RNA(sgRNA)
<400> 14
taatacgact cactataggg aaaggcacaa ttcgtgtgtt ttagagctag aaatagc 57
<210> 15
<211> 895
<212> PRT
<213> 里氏木霉(Trichoderma reesei)
<400> 15
Met Thr Ser Phe His Asp Gly Val Lys Leu Ser Thr Val Thr Cys Val
1 5 10 15
Leu Ser Gly Leu Val Ala Leu Gly Ser Ala Gly Pro Thr Ala Ala Ser
20 25 30
Ala Asn Ala Gln Val Ala Ala Ala Ala Ala Ala Gln Ala Trp Val Pro
35 40 45
Asp Gly Tyr Tyr Val Pro Pro Tyr Tyr Pro Ala Pro Tyr Gly Gly Trp
50 55 60
Val Glu Asp Trp Gln Glu Ser Tyr Thr Lys Ala Lys Ala Leu Val Asp
65 70 75 80
Ser Met Thr Leu Ala Glu Lys Thr Asn Ile Thr Ala Gly Thr Gly Ile
85 90 95
Tyr Met Gly Glu Arg Cys Ala Gly Asn Thr Gly Ser Ala Phe Arg Val
100 105 110
Ser Phe Pro Gln Leu Cys Leu Asn Asp Ser Pro Ala Gly Val Arg His
115 120 125
Ala Asp Asn Val Thr Ala Phe Pro Asp Gly Ile Thr Val Gly Ala Thr
130 135 140
Phe Asp Lys Ala Leu Met Tyr Lys Arg Gly Val Ala Ile Gly Lys Glu
145 150 155 160
Asn Arg Gly Lys Gly Val Asn Val Trp Leu Gly Pro Thr Val Gly Pro
165 170 175
Ile Gly Arg Lys Pro Lys Gly Gly Arg Asn Trp Glu Gly Phe Gly Ala
180 185 190
Asp Pro Val Leu Gln Ala Val Gly Ala Arg Glu Thr Ile Lys Gly Val
195 200 205
Gln Glu Gln Gly Val Ile Ala Thr Ile Lys His Phe Ile Gly Asn Glu
210 215 220
Gln Glu Met Tyr Arg Met Tyr Asn Pro Phe Gln Tyr Ala Tyr Ser Ser
225 230 235 240
Asn Ile Asp Asp Arg Thr Leu His Glu Val Tyr Ala Trp Pro Phe Ala
245 250 255
Glu Gly Ile Arg Ala Gly Val Gly Ala Val Met Met Ala Tyr Asn Ala
260 265 270
Val Asn Gly Thr Ala Cys Ser Gln His Pro Tyr Leu Met Ser Ala Leu
275 280 285
Leu Lys Asp Glu Met Gly Phe Gln Gly Phe Ile Met Thr Asp Trp Leu
290 295 300
Ala His Met Ser Gly Val Ala Ser Ala Ile Ala Gly Leu Asp Met Asp
305 310 315 320
Met Pro Gly Asp Val Gln Ile Pro Phe Phe Gly Gly Ser Tyr Trp Met
325 330 335
Tyr Glu Leu Thr Arg Ser Ala Leu Asn Gly Ser Val Pro Met Asp Arg
340 345 350
Ile Asn Asp Ala Ala Thr Arg Ile Ala Ala Ala Trp Tyr Lys Met Gly
355 360 365
Gln Asp Lys Gly Phe Pro Ala Thr Asn Phe Asp Thr Asn Ser Arg Ala
370 375 380
Ala Phe Asn Pro Leu Tyr Pro Ala Ala Leu Pro Leu Ser Pro Phe Gly
385 390 395 400
Ile Thr Asn Glu Phe Val Pro Val Gln Asp Asp His Asp Val Ile Ala
405 410 415
Arg Gln Ile Ser Gln Glu Ala Ile Thr Leu Leu Lys Asn Asp Gly Asp
420 425 430
Ile Leu Pro Leu Ser Pro Ser Gln His Leu Lys Val Phe Gly Thr Asp
435 440 445
Ala Gln Lys Asn Pro Asp Gly Ile Asn Ser Cys Thr Asp Arg Asn Cys
450 455 460
Asn Lys Gly Thr Leu Gly Gln Gly Trp Gly Ser Gly Thr Val Asp Tyr
465 470 475 480
Pro Tyr Leu Asp Asp Pro Ile Ser Ala Ile Thr Ala Glu Ala Asp Asn
485 490 495
Val Thr Phe Tyr Asn Thr Asp Lys Phe Pro Ser Val Gly Glu Val Ser
500 505 510
Asp Ser Asp Val Ala Ile Val Phe Val Asn Ser Asp Ala Gly Glu Asn
515 520 525
Thr Tyr Thr Val Glu Gly Asn His Gly Asp Arg Asp Lys Ser Gly Leu
530 535 540
Tyr Ala Trp His Asp Gly Asp Lys Leu Val Gln Asp Ala Ala Ser Lys
545 550 555 560
Phe Ser Asn Val Ile Val Val Ile His Thr Val Gly Pro Leu Ile Leu
565 570 575
Glu Lys Trp Ile Asp Leu Pro Ser Val Lys Ala Val Leu Val Ala His
580 585 590
Leu Pro Gly Gln Glu Ala Gly Lys Ser Leu Thr Asn Val Leu Phe Gly
595 600 605
His Ala Ser Pro Cys Gly His Leu Pro Tyr Ser Ile Thr Lys Glu Glu
610 615 620
Asp Asp Leu Pro Lys Ser Val Thr Thr Leu Ile Asp Ser Glu Phe Leu
625 630 635 640
Asn Gln Pro Gln Asp Thr Tyr Thr Glu Gly Leu Tyr Ile Asp Tyr Arg
645 650 655
Trp Leu Asn Lys Asn Lys Thr Lys Pro Arg Tyr Ala Phe Gly His Gly
660 665 670
Leu Ser Tyr Thr Asn Phe Thr Phe Lys Ala Ala Ser Ile Lys Gln Val
675 680 685
Ala Arg Leu Ser Ala Tyr Pro Pro Ala Arg Pro Ala Lys Gly Ser Thr
690 695 700
Pro Asp Phe Ala Gln Ser Ile Pro Ser Ala Ser Glu Ala Val Ala Pro
705 710 715 720
Ser Gly Phe Gly Lys Ile Pro Arg Tyr Ile Tyr Ser Trp Leu Ser Gln
725 730 735
Gly Asp Ala Asn Arg Ala Ile Ser Asp Gly Lys Thr Gly Lys Tyr Pro
740 745 750
Tyr Pro Asp Gly Tyr Ser Thr Thr Gln Lys Pro Gly Ala Arg Ala Gly
755 760 765
Gly Gly Glu Gly Gly Asn Pro Ala Leu Trp Asp Val Ala Tyr Ser Leu
770 775 780
Thr Val Thr Val Gln Asn Thr Gly Asp Glu Tyr Ala Gly Lys Ala Ser
785 790 795 800
Val Gln Ala Tyr Leu Gln Phe Pro Asp Asp Ile Asp Tyr Asp Thr Pro
805 810 815
Ile Ile Gln Leu Arg Asp Phe Glu Lys Thr Lys Glu Leu Lys Pro Gly
820 825 830
Glu Thr Thr Thr Val Thr Leu Thr Leu Thr Arg Lys Asp Val Ser Val
835 840 845
Trp Asp Val Val Ala Gln Asp Trp Lys Val Pro Ala Val Asp Gly Gly
850 855 860
Tyr Lys Val Trp Ile Gly Asp Ala Ser Asp Ser Leu Ser Ile Val Cys
865 870 875 880
His Thr Asp Thr Leu Glu Cys Glu Thr Gly Val Val Gly Pro Val
885 890 895
<210> 16
<211> 40
<212> DNA
<213> 引物(Primer)
<400> 16
agaagaagag gaaggtgtga cccggcatga agtctgaccg 40
<210> 17
<211> 41
<212> DNA
<213> 引物(Primer)
<400> 17
taattgcgcg gatcctctag atggacgcct cgatgtcttc c 41
<210> 18
<211> 57
<212> DNA
<213> 向导RNA(sgRNA)
<400> 18
taatacgact cactataggc aagtatccgt atcccgagtt ttagagctag aaatagc 57
<210> 19
<211> 57
<212> DNA
<213> 向导RNA(sgRNA)
<400> 19
taatacgact cactataggc aagtatccgt atcccgagtt ttagagctag aaatagc 57
Claims (13)
1. 一种提高里氏木霉的蛋白表达效率的方法,其特征在于,敲除或沉默里氏木霉基因组中的靶基因,所述靶基因为编码ORF4蛋白的基因;所述ORF4蛋白的氨基酸序列如SEQ IDNO: 2所示;所述里氏木霉的蛋白是里氏木霉内源蛋白或异源蛋白,所述里氏木霉内源蛋白为纤维素酶;所述异源蛋白为SEQ ID NO: 5所示序列的异源阿魏酸酯酶FEA;所述的异源蛋白的编码基因插入到里氏木霉基因组中cbh1位置。
2.如权利要求1所述的方法,其特征在于,通过采用CRISPR/Cas技术进行基因编辑以敲除所述的靶基因。
3.如权利要求1所述的方法,其特征在于,通过同源重组技术,对靶基因所在区段进行缺失或插入的操作以敲除所述的靶基因。
4.如权利要求1所述的方法,其特征在于,通过干扰分子干扰靶基因的表达;所述的干扰分子是以所述靶基因或其转录本为抑制或沉默靶标的dsRNA、反义核酸、小干扰RNA、微小RNA,或表达或形成所述dsRNA、反义核酸、小干扰RNA、微小RNA的构建物。
5. 如权利要求1所述的方法,其特征在于,所述编码ORF4蛋白的基因的核苷酸序列如SEQ ID NO: 1所示,基于该序列,靶向于其第257-277位,使该靶位置发生剪切,破坏基因功能。
6. 靶基因的调节剂的用途,用于提高里氏木霉的蛋白表达效率;所述调节剂为下调剂,其靶向于ORF4蛋白或其编码基因;所述ORF4蛋白的氨基酸序列如SEQ ID NO: 2所示;所述的蛋白是里氏木霉内源蛋白或异源蛋白,所述里氏木霉内源蛋白为纤维素酶;所述异源蛋白为SEQ ID NO: 5所示序列的异源阿魏酸酯酶FEA;所述下调剂为靶向且敲除靶基因的CRISPR/Cas基因编辑试剂,或通过同源重组对靶基因进行缺失或插入操作、从而敲除所述的靶基因的试剂,或干扰分子;所述的异源蛋白的编码基因插入到里氏木霉基因组中cbh1位置。
7.如权利要求6所述的用途,其特征在于,所述的干扰分子为:以所述靶基因或其转录本为抑制或沉默靶标的dsRNA、反义核酸、小干扰RNA、微小RNA,或表达或形成所述dsRNA、反义核酸、小干扰RNA、微小RNA的构建物。
8. 一种重组的里氏木霉菌株或其孢子、菌丝体、原生质体的用途,用于表达里氏木霉内源蛋白或异源蛋白,所述重组的里氏木霉菌株的基因组中的靶基因被敲除或沉默;其中,敲除或沉默的靶基因为编码ORF4蛋白的基因;所述ORF4蛋白的氨基酸序列如SEQ ID NO: 2所示;所述里氏木霉内源蛋白为纤维素酶;所述异源蛋白为SEQ ID NO: 5所示序列的异源阿魏酸酯酶FEA;所述的异源蛋白的编码基因插入到里氏木霉基因组中cbh1位置。
9.如权利要求8所述的里氏木霉菌株或其孢子、菌丝体、原生质体的用途,其特征在于,所述的里氏木霉菌株或其孢子、菌丝体、原生质体中,所述靶基因被敲除或沉默。
10.如权利要求8所述的里氏木霉菌株或其孢子、菌丝体、原生质体的用途,其特征在于,所述异源蛋白的编码基因由表达载体引入到里氏木霉细胞内,插入到里氏木霉基因组中cbh1位置。
11. 一种重组表达异源蛋白方法,其特征在于,所述方法包括:将异源蛋白的编码基因插入到重组的里氏木霉菌株或其孢子、菌丝体、原生质体中,所述重组的里氏木霉菌株的基因组中的靶基因被敲除或沉默;其中,敲除或沉默的靶基因为编码ORF4蛋白的基因;所述ORF4蛋白的氨基酸序列如SEQ ID NO: 2所示;所述异源蛋白为SEQ ID NO: 5所示序列异源阿魏酸酯酶FEA;所述的异源蛋白的编码基因插入到里氏木霉或其孢子、菌丝体、原生质体基因组中的cbh1位置;
培养该里氏木霉菌株或其孢子、菌丝体、原生质体,从而表达异源蛋白。
12.一种重组表达里氏木霉内源蛋白方法,其特征在于,所述方法包括:培养重组的里氏木霉菌株或其孢子、菌丝体、原生质体,从而重组表达里氏木霉内源蛋白;所述里氏木霉内源蛋白为纤维素酶;
其中,所述重组的里氏木霉菌株的基因组中的靶基因被敲除或沉默;敲除或沉默的靶基因为编码ORF4蛋白的基因;所述ORF4蛋白的氨基酸序列如SEQ ID NO: 2所示。
13.如权利要求12所述的方法,其特征在于,所述的重组的里氏木霉菌株或其孢子、菌丝体、原生质体被置于试剂盒中。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010032308.XA CN113106114B (zh) | 2020-01-13 | 2020-01-13 | 调控里氏木霉蛋白表达效率的因子、调控方法及应用 |
EP21741052.1A EP4092128A4 (en) | 2020-01-13 | 2021-01-13 | FACTOR-REGULATING PROTEIN EXPRESSION EFFICIENCY OF TRICHODERMA REESEI AND METHOD FOR REGULATION AND USE THEREOF |
PCT/CN2021/071349 WO2021143696A1 (zh) | 2020-01-13 | 2021-01-13 | 调控里氏木霉蛋白表达效率的因子、调控方法及应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010032308.XA CN113106114B (zh) | 2020-01-13 | 2020-01-13 | 调控里氏木霉蛋白表达效率的因子、调控方法及应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113106114A CN113106114A (zh) | 2021-07-13 |
CN113106114B true CN113106114B (zh) | 2024-04-12 |
Family
ID=76708863
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010032308.XA Active CN113106114B (zh) | 2020-01-13 | 2020-01-13 | 调控里氏木霉蛋白表达效率的因子、调控方法及应用 |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4092128A4 (zh) |
CN (1) | CN113106114B (zh) |
WO (1) | WO2021143696A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3523415A1 (en) * | 2016-10-04 | 2019-08-14 | Danisco US Inc. | Protein production in filamentous fungal cells in the absence of inducing substrates |
CN115820746B (zh) * | 2022-11-08 | 2023-08-18 | 华南理工大学 | 激酶基因在调控丝状真菌菌丝形态中的应用 |
CN117343950B (zh) * | 2023-12-06 | 2024-02-06 | 中国农业科学院北京畜牧兽医研究所 | 一种提高里氏木霉表达外源蛋白表达水平的方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102220369A (zh) * | 2011-05-11 | 2011-10-19 | 天津大学 | 里氏木霉β-葡萄糖苷酶基因BGL1的重组载体、重组菌及在重组菌中的表达 |
WO2011161063A1 (en) * | 2010-06-21 | 2011-12-29 | Technische Universität Wien | Leaa from trichoderma reesei |
CN105861338A (zh) * | 2016-05-04 | 2016-08-17 | 东南大学 | 一种高产纤维素酶里氏木霉工程菌及其制备方法与应用 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6936449B2 (en) * | 2001-03-14 | 2005-08-30 | Genencor International, Inc. | Regulatable growth of filamentous fungi |
AU2012229063A1 (en) * | 2011-03-15 | 2013-09-19 | The Regents Of The University Of California | Mutant cells for protein secretion and lignocellulose degradation |
CN102787108B (zh) * | 2012-07-27 | 2013-12-04 | 中国科学院微生物研究所 | 来源于里氏木霉的一个蛋白质及其基因的用途 |
US20180215797A1 (en) * | 2015-08-13 | 2018-08-02 | Glykos Finland Oy | Regulatory Protein Deficient Trichoderma Cells and Methods of Use Thereof |
EP3523415A1 (en) * | 2016-10-04 | 2019-08-14 | Danisco US Inc. | Protein production in filamentous fungal cells in the absence of inducing substrates |
WO2018093752A1 (en) * | 2016-11-15 | 2018-05-24 | Danisco Us Inc. | Filamentous fungi with improved protein production |
MX2019012940A (es) * | 2017-05-03 | 2019-12-16 | Firmenich Incorporated | Metodos para producir edulcorantes de alta intensidad. |
-
2020
- 2020-01-13 CN CN202010032308.XA patent/CN113106114B/zh active Active
-
2021
- 2021-01-13 WO PCT/CN2021/071349 patent/WO2021143696A1/zh unknown
- 2021-01-13 EP EP21741052.1A patent/EP4092128A4/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011161063A1 (en) * | 2010-06-21 | 2011-12-29 | Technische Universität Wien | Leaa from trichoderma reesei |
CN102220369A (zh) * | 2011-05-11 | 2011-10-19 | 天津大学 | 里氏木霉β-葡萄糖苷酶基因BGL1的重组载体、重组菌及在重组菌中的表达 |
CN105861338A (zh) * | 2016-05-04 | 2016-08-17 | 东南大学 | 一种高产纤维素酶里氏木霉工程菌及其制备方法与应用 |
Non-Patent Citations (3)
Title |
---|
The putative β-glucosidase BGL3I regulates cellulase induction in Trichoderma reesei;Gen Zou等;《Biotechnology for Biofuels volume》(第11期);第1-14页 * |
甲基转移酶sMET调控里氏木霉产纤维素酶的机制研究;邹根等;《多彩菌物 美丽中国——中国菌物学会2019年学术年会论文摘要》;20190831;第304页 * |
非转录因子调控里氏木霉产纤维素酶的机制研究;邹根等;《中国菌物学会2018年学术年会论文汇编》;20180831;第249页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113106114A (zh) | 2021-07-13 |
EP4092128A1 (en) | 2022-11-23 |
EP4092128A4 (en) | 2024-02-14 |
WO2021143696A1 (zh) | 2021-07-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113106114B (zh) | 调控里氏木霉蛋白表达效率的因子、调控方法及应用 | |
CN105586355B (zh) | 在镶片镰孢的酶缺陷突变体中产生多肽的方法 | |
EP0758020A2 (en) | The use of homologous amdS genes as selectable markers | |
JP7376505B2 (ja) | 粘性が低下した表現型を含む糸状菌株 | |
EP3384002A1 (en) | Method of producing proteins in filamentous fungi with decreased clr2 activity | |
US20080070277A1 (en) | Homologous Amds Genes as Selectable Marker | |
CN110760537A (zh) | 用于快速筛选重组菌株的重组表达载体及应用 | |
US9512415B2 (en) | Method for protein production in filamentous fungi | |
WO2011151513A1 (en) | Method for improved protein production in filamentous fungi | |
KR20220097451A (ko) | 향상된 단백질 생산성 표현형을 포함하는 진균 균주 및 이의 방법 | |
US20230174998A1 (en) | Compositions and methods for enhanced protein production in filamentous fungal cells | |
CN108070609B (zh) | 利用里氏木霉作为宿主表达重组蛋白的方法 | |
CA3004977A1 (en) | Method of producing proteins in filamentous fungi with decreased clr1 activity | |
US9499826B2 (en) | Production of proteins in filamentous fungi | |
CN108048479B (zh) | 一种利用里氏木霉作为宿主表达重组蛋白的方法 | |
CN113755509A (zh) | 溶血磷脂酶变体及其构建方法和在黑曲霉菌株中的表达 | |
WO2019167788A1 (ja) | 変異β-グルコシダーゼ | |
CN108148853B (zh) | 一种基因组编辑载体、其组成的基因组编辑系统及应用 | |
CN113201055B (zh) | 蛋白pox01387及其相关生物材料与应用 | |
KR101547333B1 (ko) | 메타게놈 유래 신규 셀룰라아제 | |
WO2023074901A1 (ja) | エリスリトール誘導性プロモーター、及びこれを用いた目的物質の製造方法 | |
CN111088244B (zh) | 蛋白酶基因在促进纤维素酶生产和复杂氮源利用中的应用 | |
CN111944779B (zh) | 一种海藻糖合成双功能酶编码基因TvTPS/TPP及其应用 | |
WO2024102556A1 (en) | Filamentous fungal strains comprising enhanced protein productivity phenotypes and methods thereof | |
KR20240110854A (ko) | 진균 세포에서의 향상된 단백질 생산을 위한 조성물 및 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |