CN113789314B - Application of ribosome sigma factor B and mutant thereof in improving yield of lipstatin - Google Patents
Application of ribosome sigma factor B and mutant thereof in improving yield of lipstatin Download PDFInfo
- Publication number
- CN113789314B CN113789314B CN202111260753.2A CN202111260753A CN113789314B CN 113789314 B CN113789314 B CN 113789314B CN 202111260753 A CN202111260753 A CN 202111260753A CN 113789314 B CN113789314 B CN 113789314B
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- leu
- arg
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- OQMAKWGYQLJJIA-CUOOPAIESA-N lipstatin Chemical compound CCCCCC[C@H]1[C@H](C[C@H](C\C=C/C\C=C/CCCCC)OC(=O)[C@H](CC(C)C)NC=O)OC1=O OQMAKWGYQLJJIA-CUOOPAIESA-N 0.000 title claims abstract description 39
- SIKWOTFNWURSAY-UHFFFAOYSA-N Lipstatin Natural products CCCCCCC1C(CC(CC=CCC=CCCCCC)C(=O)OC(CC(C)C)NC=O)OC1=O SIKWOTFNWURSAY-UHFFFAOYSA-N 0.000 title claims abstract description 38
- 101000703737 Arabidopsis thaliana RNA polymerase sigma factor sigA Proteins 0.000 title claims abstract description 23
- 101000634105 Arabidopsis thaliana RNA polymerase sigma factor sigB Proteins 0.000 title claims abstract description 23
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 24
- 239000002773 nucleotide Substances 0.000 claims abstract description 16
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 16
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 11
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 3
- 210000003705 ribosome Anatomy 0.000 claims description 26
- 239000013612 plasmid Substances 0.000 claims description 23
- 241000894006 Bacteria Species 0.000 claims description 16
- 210000004027 cell Anatomy 0.000 claims description 2
- 230000009261 transgenic effect Effects 0.000 claims description 2
- 230000002018 overexpression Effects 0.000 abstract description 10
- 238000012216 screening Methods 0.000 abstract description 4
- 150000001413 amino acids Chemical group 0.000 description 33
- 229940024606 amino acid Drugs 0.000 description 23
- 108010047495 alanylglycine Proteins 0.000 description 14
- 241000946767 Streptomyces toxytricini Species 0.000 description 12
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 12
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 12
- 241000187747 Streptomyces Species 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 101710198277 RNA polymerase sigma factor sigA Proteins 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 108010087924 alanylproline Proteins 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 108020004414 DNA Proteins 0.000 description 8
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 7
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 6
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 6
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 6
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 6
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 6
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000000855 fermentation Methods 0.000 description 6
- 230000004151 fermentation Effects 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 6
- 238000000034 method Methods 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- 108091023040 Transcription factor Proteins 0.000 description 5
- 102000040945 Transcription factor Human genes 0.000 description 5
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 4
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 4
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 4
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 4
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 4
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 4
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 4
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 4
- WPIKRJDRQVFRHP-TUSQITKMSA-N Leu-Trp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O WPIKRJDRQVFRHP-TUSQITKMSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 4
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 4
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 3
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 3
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 108040006206 sigma factor activity proteins Proteins 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 239000005660 Abamectin Substances 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 2
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 2
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 2
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 2
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 2
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 2
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 2
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- CTKINSOISVBQLD-UHFFFAOYSA-N Glycidol Chemical compound OCC1CO1 CTKINSOISVBQLD-UHFFFAOYSA-N 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 2
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 2
- UXZMINKIEWBEQU-SZMVWBNQSA-N His-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N UXZMINKIEWBEQU-SZMVWBNQSA-N 0.000 description 2
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 2
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 2
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 2
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 2
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- MCPXQHVVCPTRIM-HJOGWXRNSA-N Pro-Trp-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)[C@@H]1CCCN1 MCPXQHVVCPTRIM-HJOGWXRNSA-N 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 241001468227 Streptomyces avermitilis Species 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 2
- RCMHSGRBJCMFLR-BPUTZDHNSA-N Trp-Met-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 RCMHSGRBJCMFLR-BPUTZDHNSA-N 0.000 description 2
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 2
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 2
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 2
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- XZNUGFQTQHRASN-XQENGBIVSA-N apramycin Chemical compound O([C@H]1O[C@@H]2[C@H](O)[C@@H]([C@H](O[C@H]2C[C@H]1N)O[C@@H]1[C@@H]([C@@H](O)[C@H](N)[C@@H](CO)O1)O)NC)[C@@H]1[C@@H](N)C[C@@H](N)[C@H](O)[C@H]1O XZNUGFQTQHRASN-XQENGBIVSA-N 0.000 description 2
- 229950006334 apramycin Drugs 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- RRZXIRBKKLTSOM-XPNPUAGNSA-N avermectin B1a Chemical compound C1=C[C@H](C)[C@@H]([C@@H](C)CC)O[C@]11O[C@H](C\C=C(C)\[C@@H](O[C@@H]2O[C@@H](C)[C@H](O[C@@H]3O[C@@H](C)[C@H](O)[C@@H](OC)C3)[C@@H](OC)C2)[C@@H](C)\C=C\C=C/2[C@]3([C@H](C(=O)O4)C=C(C)[C@@H](O)[C@H]3OC\2)O)C[C@H]4C1 RRZXIRBKKLTSOM-XPNPUAGNSA-N 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000003766 bioinformatics method Methods 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 238000007865 diluting Methods 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 235000013312 flour Nutrition 0.000 description 2
- 238000012268 genome sequencing Methods 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108010037379 ribosome releasing factor Proteins 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 235000012424 soybean oil Nutrition 0.000 description 2
- 239000008223 sterile water Substances 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- 239000004395 L-leucine Substances 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 102000019280 Pancreatic lipases Human genes 0.000 description 1
- 108050006759 Pancreatic lipases Proteins 0.000 description 1
- 101100397225 Schizosaccharomyces pombe (strain 972 / ATCC 24843) isp3 gene Proteins 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000003579 anti-obesity Effects 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- VEZXCJBBBCKRPI-UHFFFAOYSA-N beta-propiolactone Chemical compound O=C1CCO1 VEZXCJBBBCKRPI-UHFFFAOYSA-N 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 235000019577 caloric intake Nutrition 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 235000021588 free fatty acids Nutrition 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 210000004347 intestinal mucosa Anatomy 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960003136 leucine Drugs 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 150000002759 monoacylglycerols Chemical class 0.000 description 1
- AHLBNYSZXLDEJQ-FWEHEUNISA-N orlistat Chemical compound CCCCCCCCCCC[C@H](OC(=O)[C@H](CC(C)C)NC=O)C[C@@H]1OC(=O)[C@H]1CCCCCC AHLBNYSZXLDEJQ-FWEHEUNISA-N 0.000 description 1
- 229960001243 orlistat Drugs 0.000 description 1
- 229940116369 pancreatic lipase Drugs 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 229960000380 propiolactone Drugs 0.000 description 1
- 238000011218 seed culture Methods 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 231100000167 toxic agent Toxicity 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- UFTFJSFQGQCHQW-UHFFFAOYSA-N triformin Chemical compound O=COCC(OC=O)COC=O UFTFJSFQGQCHQW-UHFFFAOYSA-N 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/127—RNA-directed RNA polymerase (2.7.7.48), i.e. RNA replicase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/76—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Actinomyces; for Streptomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/07048—RNA-directed RNA polymerase (2.7.7.48), i.e. RNA replicase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/101—Plasmid DNA for bacteria
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention discloses a ribosome sigma factor B and application of a mutant thereof in improving the yield of lipstatin. The nucleotide sequence of the ribosome sigma factor B is shown as SEQ ID NO.7, and the amino acid sequence of the encoded protein is shown as SEQ ID NO. 8. The ribosome sigma factor B and the mutant overexpression thereof obtained by screening can improve the yield of lipstatin.
Description
The application is a divisional application with the application date 2020-10-29 and the application number 202011176931.9, and the patent name is 'application of a ribosome sigma factor, mutant thereof and coded protein in improving the yield of lipstatin'.
Technical Field
The invention belongs to the technical field of microbial genetic engineering, and particularly relates to application of a ribosome sigma factor and a mutant thereof in improving the yield of lipstatin.
Background
The tetrahydroderivative of lipstatin, orlistat, is currently the most popular weight-reducing product for use by more than 4 million people worldwide as an anti-obesity therapeutic agent that is currently certified by the U.S. FDA, eu EMA and chinese SFDA without suppressing appetite, acting on the central nervous system. As a long-acting specific pancreatic lipase inhibitor, it can prevent triglyceride from hydrolyzing into free fatty acid and monoacylglycerol which can be absorbed by small intestinal mucosa, thereby reducing caloric intake and controlling body weight. The lipstatin is originally isolated from streptomycete Streptomyces toxytricini and has a beta-propiolactone structural unit, wherein the 2 and 3 positions of the lipstatin are respectively replaced by a 6-carbon alkyl hydrocarbon chain and a 13-carbon alkyl hydrocarbon chain, and the hydroxyl group at the C5 position of the 13-carbon alkyl hydrocarbon chain forms an ester bond with N-formoxyl-L-leucine. Because of the complex molecular structure of lipstatin, the chemical synthesis process is complex, and a great deal of manpower and material resources are consumed. At present, the lipstatin is mainly produced by adopting a semi-biological fermentation process, so that compared with a chemical method, the use of toxic solvents is reduced, the pollution to the environment is greatly reduced, and toxic substance residues are greatly reduced.
At present, research on the synthesis of lipstatin by lipstatin producing bacteria at home and abroad mainly adopts a method of combining a single factor experiment and an orthogonal design experiment to optimize a fermentation medium and fermentation parameters, such as the influence of biotin and ATP addition on lipid and lipstatin synthesis (Dong Huijun and the like, the influence of biotin and ATP addition on the synthesis of toxic trichlungin lipid and lipstatin, journal of Chinese medical industry, 2014, 01, pages 19-24).
Ribosomal sigma transcription factors are important components of RNA polymerase and play an important role in the transcription process of prokaryotes. Overexpression of frr as in Streptomyces avermitilis increases avermectin production (Li L et al, (2010) Over expression of ribosome recycling factor causes increased production of avermectin in Streptomyces avermitilis strains.J Ind Microbiol Biotechnol 37 (7): 673-679), but no ribosomal sigma transcription factor affecting lipstatin synthesis is currently reported.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a ribosome sigma factor and a mutant thereof and application of the encoded protein in improving the yield of lipstatin, the invention compares and analyzes the ribosome factors through genome sequencing and bioinformatics analysis to find out 15 different ribosome factors, then through evaluation, the over-expression of the ribosome sigma transcription factors A and B in Streptomyces toxytricini can improve the yield of the lipstatin, and provides the mutants of the two ribosome sigma transcription factors A and B and application of the mutants in the production of the lipstatin.
In order to achieve the above purpose, the technical scheme adopted by the invention for solving the technical problems is as follows:
use of a ribosomal sigma factor, either ribosomal sigma factor a or ribosomal sigma factor B, for increasing the yield of lipstatin.
Further, the nucleotide sequence of the ribosome sigma factor A is shown as SEQ ID NO.5, and the amino acid sequence of the encoded protein is shown as SEQ ID NO. 6.
Further, the nucleotide sequence of the ribosome sigma factor B is shown as SEQ ID NO.7, and the amino acid sequence of the encoded protein is shown as SEQ ID NO. 8.
A mutant of the ribosome sigma factor A, the nucleotide sequence of which is shown in SEQ ID NO. 1.
Further, the amino acid sequence of the protein encoded by the nucleotide is shown as SEQ ID NO.2, and compared with the amino acid sequence of the normal ribosome sigma factor A, the amino acid at 19 th position is replaced by R, the amino acid at 327 th position is replaced by G, the amino acid at 338 th position is replaced by G, the amino acid at 344 th position is replaced by F, the amino acid at 347 th position is replaced by S, and the amino acid at 538 th position is replaced by T.
A mutant of the ribosome sigma factor B, the nucleotide sequence of which is shown in SEQ ID NO. 3.
Further, the amino acid sequence of the protein encoded by the nucleotide is shown as SEQ ID NO.4, and compared with the amino acid sequence of the normal ribosome sigma factor B, the amino acid at 95 th position is replaced by T, the amino acid at 274 th position is replaced by S, the amino acid at 275 th position is replaced by T, the amino acid at 351 th position is replaced by V, and the amino acid at 411 th position is replaced by D.
The application of the mutant of the ribosome sigma factor A or the coded protein thereof in improving the yield of lipstatin.
The application of the mutant of the ribosome sigma factor B or the coded protein thereof in improving the yield of lipstatin.
A plasmid comprising the aforementioned ribosomal sigma factor, a mutant of ribosomal sigma factor a, or a mutant of ribosomal sigma factor B.
A transgenic cell line comprising the aforementioned ribosomal sigma factor, a mutant of ribosomal sigma factor a, or a mutant of ribosomal sigma factor B.
An engineering bacterium comprising the above-mentioned ribosomal sigma factor, a mutant of ribosomal sigma factor A, or a mutant of ribosomal sigma factor B.
Further, overexpression is performed by introducing an expression vector containing the coding gene into Streptomyces Streptomyces toxytricini.
Further, the gene or mutant gene is located in a plasmid or chromosome.
The beneficial effects of the invention are as follows:
screening researches on 10 ribosomal factors show that the ribosomal sigma factors A and B and mutant genes thereof are overexpressed in streptomycete Streptomyces toxytricini, so that the method is biologically safe for genetically engineered bacteria (the overexpression in the bacteria does not influence the growth of the bacteria), and meanwhile, the capability of producing lipstatin by streptomycete Streptomyces toxytricini can be effectively improved. Experimental data indicate that overexpression of the original gene at Streptomyces toxytricini increases the ability of product epstein, especially their mutant genes, further increases the ability of product epstein.
Drawings
FIG. 1 shows a map of the plasmid vector pKC1139-liprA,
FIG. 2 is a map of the plasmid vector pKC1139-liprB,
FIG. 3 is a map of the plasmid vector pKC1139-liprA-mut,
FIG. 4 is a map of plasmid vector pKC1139-liprB-mut,
Fig. 5 is a standard plot of lipstatin concentration.
Detailed Description
The following description of the embodiments of the present invention is provided to facilitate understanding of the present invention by those skilled in the art, but it should be understood that the present invention is not limited to the scope of the embodiments, and all the inventions which make use of the inventive concept are protected by the spirit and scope of the present invention as defined and defined in the appended claims to those skilled in the art.
1. Material
Streptomyces toxytricini As a typical Streptomyces, the endomycelia are branched, the aerial mycelia are slightly thick, and part of aerial mycelia are differentiated into spiral spore filaments, so that the characteristic of typical Streptomyces is achieved.
The formula of the culture medium comprises:
seed culture medium: 10g of soya bean flour, 5g of Bacto soytone, 5mL of glycidol, 10mL of soya oil, and 2mL of Triton X-100[pH 6.5]per liter.
Fermentation culture: 30g of soya bean flour, 14mL of glycidol, 1g of Bacto soytone, 1mL of Triton X-100, and 60mL of soya oil[pH 7.0]per liter.
2. Detection method
(1) Sample pretreatment
1mL of the fermentation broth was placed in a 10mL centrifuge tube, 9mL of methanol was added, the mixture was stirred in a vortex mixer, sonicated for 30min, centrifuged (12000 rpm,5 min), and then filtered through a 0.22 μm filter, and the filtrate (20. Mu.L) was taken for HPLC analysis.
(2) Preparation of standards
Lipstatin standard (0.125 g/L,0.25g/L,0.5g/L,1g/L,2 g/L) was configured
(3) HPLC detection conditions
Chromatographic column: YMC-Pack ODS-A; detection wavelength: 205nm; mobile phase: acetonitrile: water = 90:10; flow rate: 1.0mL/min; column temperature: 40 ℃.
(4) Drawing of lipstatin standard curve
And (3) carrying out HPLC detection on the standard substances with different concentrations according to the conditions, and drawing a standard curve of the concentration of the lipstatin with peak areas. And drawing a standard curve of the lipstatin by taking the measured peak area A as an ordinate and recording the mass concentration C (g/L) of the lipstatin as an abscissa. See fig. 5, yielding the regression equation y=5e+6x+101960, r 2 =0.996, absorbance was a good linear relationship with mass concentration. Calculating the sample yield according to the lipstatin standard curve after the liquid phase is ended
Example 1 obtaining of strains
Diluting and coating the frozen strain on ISP3 plate culture medium, placing in a 30 ℃ hydration incubator for inversion culture, and after bacterial colony grows out,single colonies were picked, re-cultured and transferred to 2 passages. Taking a fresh long plate, washing surface spores with 10mL of sterile water, placing in a triangular flask with glass beads, culturing at 28deg.C under shaking at 200r/min for 2-3 h, filtering, serial diluting, and making into spore with concentration of about 1×10 7 Single spore suspension per mL was used.
EXAMPLE 2 Strain genome analysis
The strain Streptomyces toxytricini is sent to the China large gene company for whole genome sequencing, the comparison analysis is carried out on the ribosomal factors through genome sequencing and bioinformatics analysis, 15 different ribosomal factors are found, and through screening, the overexpression of the ribosomal sigma transcription factors A and B can improve the yield of lipstatin.
Based on the ribosome sigma transcription factors A and B obtained by screening, respectively obtaining a mutant gene of the ribosome sigma factor A and a mutant gene of the ribosome sigma factor B through substitution of nucleotides, wherein the nucleotide sequences of the mutant genes are respectively shown as SEQ ID NO.1 and SEQ ID NO. 3;
meanwhile, corresponding amino acid mutants are obtained, specifically:
mutant of ribosomal sigma factor a: the 19 th amino acid of the amino acid sequence coded by the ribosome sigma transcription factor A is replaced by R, the 327 th amino acid is replaced by G, the 338 th amino acid is replaced by G, the 344 th amino acid is replaced by F, the 347 rd amino acid is replaced by S, and the 538 th amino acid is replaced by T.
Mutant of ribosomal sigma factor B: the 95 th amino acid of the amino acid sequence coded by the ribosome sigma transcription factor B is replaced by T, the 274 th amino acid is replaced by S, the 275 th amino acid is replaced by T, the 351 th amino acid is replaced by V, and the 411 th amino acid is replaced by D.
The effect of overexpression of the substituted mutants in the strain on lipstatin production was each verified.
EXAMPLE 3 construction of plasmid
1. Construction of pKC1139-liprA
The primers of the primer pairs liprA-XbaI-F and liprA-EcoRV-R in the table 1 are respectively utilized, streptomyces toxytricini genome is used as a template, and through PCR amplification, xbaI and EcoRV enzyme cutting sites are introduced to obtain liprA fragments, and after electrophoresis verification, dpnI enzyme treatment and electrophoresis gel recovery, the purified liprA fragments are obtained. The sequence of the liprA gene is shown as SEQ ID No.5, and the encoded amino acid sequence is shown as SEQ ID No. 6.
The purified liprA fragment and pKC1139 plasmid were double digested with XbaI and EcoRV, respectively, and the double digested product of the liprA fragment was ligated with the double digested product of pKC1139 plasmid overnight at 4 ℃ with T4 ligase. The ligation product was transformed into E.coli DH 5. Alpha. And spread on LB solid plates containing 50mg/L of apramycin, after 16h of incubation, colony PCR detection was performed, and after correct sequencing, the positive bacteria obtained were designated E.coli/pKC1139-liprA. Plasmid pKC1139-liprA was extracted with plasmid kit for use, and the plasmid map is shown in FIG. 1.
2、pKC1139-liprA-mut:
The nucleotide sequence is shown as SEQ ID NO:1 to obtain a liprA-mut fragment, and constructing with the same primers according to the construction process of pKC1139-liprA to obtain plasmid pKC1139-liprA-mut. The plasmid map is shown in FIG. 3.
3. Construction of pKC 1139-liprB:
the primers of the primer pairs liprB-XbaI-F and liprB-EcoRV-R in the table 1 are respectively utilized, streptomyces toxytricini genome is used as a template, and through PCR amplification, xbaI and EcoRV enzyme cutting sites are introduced to obtain liprB fragments, and after electrophoresis verification, dpnI enzyme treatment and electrophoresis gel recovery, purified liprB fragments are obtained. The sequence of the liprB gene is shown as SEQ ID No.7, and the encoded amino acid sequence is shown as SEQ ID No. 8.
The purified liprB fragment and pKC1139 plasmid were double digested with XbaI and EcoRV, respectively, and the double digested product of the liprB fragment was ligated with the double digested product of the pKC1139 plasmid overnight at 4℃with T4 ligase. The ligation product was transformed into E.coli DH 5. Alpha. And spread on LB solid plates containing 50mg/L of apramycin, after 16h of incubation, colony PCR detection was performed, and after correct sequencing, the obtained positive bacteria were designated E.coli/pKC1139-liprB. Plasmid pKC1139-liprB was extracted with plasmid kit and the plasmid map is shown in FIG. 2.
4、pKC1139-liprB-mut:
The nucleotide sequence is shown as SEQ ID NO:3 to obtain a liprB-mut fragment, and constructing the plasmid pKC1139-liprB-mut by using the same primer according to the construction process of pKC1139-liprB. The plasmid map is shown in FIG. 4.
TABLE 1 primers
EXAMPLE 4 construction of plasmid-containing vector Strain
1. Preparation of E.coli ET12567/pUZ8002 containing exogenous plasmid
And transforming the constructed plasmid into escherichia coli ET12567/pUZ8002 to obtain donor bacteria containing recombinant plasmid. The donor bacteria were inoculated in 5mL of LB medium (containing 25. Mu.g/mL Km, cm and 50. Mu.g/mL Am), incubated overnight at 37℃and transferred to 10mL of LB medium (containing 25. Mu.g/mL Km, cm and 50. Mu.g/mL Am) at 1:50-100, and incubated at 37℃to OD 600 =0.3 to 0.4, cultured for about 3 hours, collected by centrifugation, washed twice with drug-free LB, and suspended in 1mL of drug-free LB medium. ET12567/pUZ8002 containing the corresponding plasmid was obtained.
2. Treatment of spores of heterologous expression strains
Streptomyces toxytricini culturing for about 7d, washing spores with 5mL of 10% glycerol, placing into 50mL test tube, shaking, mixing, filtering absorbent cotton in clean test tube, centrifuging at 3500rpm for 10min, adding 1mL of 2 XYT, and thermally activating at 50deg.C for 10min.
3. Coli ET12567/pUZ8002 and Streptomyces toxytricini strain conjugal transfer and isolation identification of strain
Respectively mixing equal amount of Streptomyces spore liquid and Escherichia coli bacterial liquid 0.5mL, and coating with 10mM MgCl 2 Is cultured overnight at 28℃with sterile water containing Am (50. Mu.g/mL) and Tri (100. Mu.g/mL) and cultured at 28℃for 7-10 d, and the positive colonies grown are picked up on IWL4 or MS and containing Am (50. Mu.g/mL) and Tri (100. Mu.g/mL)After subculturing in R2YE medium, the corresponding colonies were picked. SK/pKC1139 (control bacterium), SK/pKC1139-liprA (abbreviated SK 1), SK/pKC1139-liprA-mut (abbreviated SK 2), SK/pKC1139-liprB (abbreviated SK 3), SK/pKC1139-liprB-mut (abbreviated SK 4).
EXAMPLE 5 evaluation of different strains
Control bacteria and SK1, SK2, SK3, SK4 were subjected to shake flask fermentation, each in 3 replicates, and then subjected to the assay and specific price of lipstatin, the results of which are shown in Table 2. From the test results in table 2, it is found that the ability of the experimental group bacteria to produce lipstatin is improved compared with the control bacteria, thereby demonstrating that the overexpression of ribosomal sigma factors a and B and mutant genes thereof can improve the yield of lipstatin.
TABLE 2 Liprastatin production by different strains
Strain | Lipstatin yield (g/L) |
Control bacteria | 1.12±0.11 |
SK1 | 1.43±0.17 |
SK2 | 1.67±0.15 |
SK3 | 1.27±0.14 |
SK4 | 1.56±0.12 |
Sequence listing
<110> university of light chemical industry in Sichuan
<120> application of ribosome sigma factor B and mutant thereof in improving lipstatin yield
<160> 12
<170> SIPOSequenceListing 1.0
<210> 1
<211> 2061
<212> DNA
<213> Streptomyces sp
<400> 1
atgacttcag agcacacggc agagctggtc gcggcggccc gcgcgggaga cctccgcgcg 60
caggacgagc tggtcagcgc gtatctgccg ctggtctaca acatcgtcgg gcgcgccatg 120
aacggctccg tcgacgtgga cgacgtcgtg caggacacga tgctgcgggc cctcgacggc 180
ctcggcaccc tccgttcgga cgacagcttc cgctcgtggt tggtggccat cgcgatgaac 240
cgggtgcggg cgtactggca ggcccggcgc accgctcccg gcgagagcgg tctggaggcc 300
gcctgggagc tcgccgaccc gggggcggac ttcgtcgacc tgaccgtcgt ccggctggcc 360
ctggaggggc agcgccgcga gacggcgcgc gccacccgct ggctggagcc ggacgaccgg 420
gcgctgctgt cgctgtggtg gctggagtgc gcgggggagc tgacccgggg cgaggtggcc 480
gcggcgctgg agctgacgcc gcagcacacg gccgtgcggg tgcagcggat gaaggcgcag 540
ctggagtccg cgcgcgtcgt ggagggggcg ctggcggcgc agccgccgtg cgaggcgctg 600
gccgcggtga cggcctcctg ggacggccag ccgtccactc tgtggcgcaa gcgaatagcc 660
cggcacgccc gcgagtgcct gcggtgcgcc gggctgtgga acggtctgct cccggcggag 720
gggctgctgg cgggcctggc gctcgtcccg gtgtcggcgg cgctgctggc cggggtgcgg 780
tcggccgccg cgggcggttt cgcccccgcg ggcgcggcgt acgcggcggg cgtgcccgca 840
gacggccagg cggtgtccgc cggctgggcg cccgccgacg gccaggcggc atccggcggc 900
tggacgcccg cgtacgaagc accgcccggc ggcacgggcg gcggcggcca cggcttcgcc 960
ggcggtaccg gtacggacgg cggcaacggc ttggcttccg gtgcaagtgc gggtgacggc 1020
ggcagcggct tcggcgccgg tgccgggccg cacggcggcg gcaacggctt cgcttccggt 1080
gcgggtgacg gcggcaacgg cttggcttcc ggtgcaagtg cgggtgacgg cggcagcggc 1140
cccggcgccg gtacgcgcag tggcgacgac ggtttcggcg ccggggcgca cgggggcggc 1200
agcggcttcg gcggcggtga cggcggtccg ggtggcggcg aggccgtggc ccggcttgtg 1260
ccggccggtt ccgcggaggg aggtgccgat ggcgcggcgg gcggcggccg gagcgcgttg 1320
cgcaagcggc ggcgcagtcg gcggcgggcc gtcgggggtg ccgtgctcgc cgcctgcgtc 1380
gcgggcggcg gcctcgccta cctgggcggc ttccccggct ccgccggcga ggacggcgga 1440
gccgccccgg cagctccgct gaacgcgctg tccgcaggcg agtccgccgg cccgtcgccg 1500
agcggcccgg cccagccgtc cgcgtccgct tcggcctcgc cgtccccgtc ggccaccgcg 1560
tcgccctcgg tgtcggcgtc ggcgagccct tccggcgcgg gcaaggcgtc cactggcgcg 1620
ccgaccccgt ccccgtccgc gccgcgcccc gcgtcgccgg ccccggtgcc cgccccgcag 1680
ggaccggtgg gccaggtcgt cgccctcgtc aacgccgagc gggcgaaggc gggctgcggc 1740
ccgctgaagg acgacgcgca gctgcggaag gccgcgcagc ggcactccga ggacatggcg 1800
agccggaact tcttctcgca cacggccccg gacggctccg acccgggcga gcggacgacg 1860
gccgccggct accggtgggc gacgtacggc gagaacatag cgcgcgggca gagcaccgcc 1920
gagtcggtca tgaactcgtg gatgaacagc gacggccacc gcgcgaacat cctgaactgc 1980
tccttcaagg acataggcgt cggactccac cagggccccg gcggcccgtg gtggacgcag 2040
aacttcggcg cccgcatgtg a 2061
<210> 2
<211> 686
<212> PRT
<213> Streptomyces sp
<400> 2
Met Thr Ser Glu His Thr Ala Glu Leu Val Ala Ala Ala Arg Ala Gly
1 5 10 15
Asp Leu Arg Ala Gln Asp Glu Leu Val Ser Ala Tyr Leu Pro Leu Val
20 25 30
Tyr Asn Ile Val Gly Arg Ala Met Asn Gly Ser Val Asp Val Asp Asp
35 40 45
Val Val Gln Asp Thr Met Leu Arg Ala Leu Asp Gly Leu Gly Thr Leu
50 55 60
Arg Ser Asp Asp Ser Phe Arg Ser Trp Leu Val Ala Ile Ala Met Asn
65 70 75 80
Arg Val Arg Ala Tyr Trp Gln Ala Arg Arg Thr Ala Pro Gly Glu Ser
85 90 95
Gly Leu Glu Ala Ala Trp Glu Leu Ala Asp Pro Gly Ala Asp Phe Val
100 105 110
Asp Leu Thr Val Val Arg Leu Ala Leu Glu Gly Gln Arg Arg Glu Thr
115 120 125
Ala Arg Ala Thr Arg Trp Leu Glu Pro Asp Asp Arg Ala Leu Leu Ser
130 135 140
Leu Trp Trp Leu Glu Cys Ala Gly Glu Leu Thr Arg Gly Glu Val Ala
145 150 155 160
Ala Ala Leu Glu Leu Thr Pro Gln His Thr Ala Val Arg Val Gln Arg
165 170 175
Met Lys Ala Gln Leu Glu Ser Ala Arg Val Val Glu Gly Ala Leu Ala
180 185 190
Ala Gln Pro Pro Cys Glu Ala Leu Ala Ala Val Thr Ala Ser Trp Asp
195 200 205
Gly Gln Pro Ser Thr Leu Trp Arg Lys Arg Ile Ala Arg His Ala Arg
210 215 220
Glu Cys Leu Arg Cys Ala Gly Leu Trp Asn Gly Leu Leu Pro Ala Glu
225 230 235 240
Gly Leu Leu Ala Gly Leu Ala Leu Val Pro Val Ser Ala Ala Leu Leu
245 250 255
Ala Gly Val Arg Ser Ala Ala Ala Gly Gly Phe Ala Pro Ala Gly Ala
260 265 270
Ala Tyr Ala Ala Gly Val Pro Ala Asp Gly Gln Ala Val Ser Ala Gly
275 280 285
Trp Ala Pro Ala Asp Gly Gln Ala Ala Ser Gly Gly Trp Thr Pro Ala
290 295 300
Tyr Glu Ala Pro Pro Gly Gly Thr Gly Gly Gly Gly His Gly Phe Ala
305 310 315 320
Gly Gly Thr Gly Thr Asp Gly Gly Asn Gly Leu Ala Ser Gly Ala Ser
325 330 335
Ala Gly Asp Gly Gly Ser Gly Phe Gly Ala Gly Ala Gly Pro His Gly
340 345 350
Gly Gly Asn Gly Phe Ala Ser Gly Ala Gly Asp Gly Gly Asn Gly Leu
355 360 365
Ala Ser Gly Ala Ser Ala Gly Asp Gly Gly Ser Gly Pro Gly Ala Gly
370 375 380
Thr Arg Ser Gly Asp Asp Gly Phe Gly Ala Gly Ala His Gly Gly Gly
385 390 395 400
Ser Gly Phe Gly Gly Gly Asp Gly Gly Pro Gly Gly Gly Glu Ala Val
405 410 415
Ala Arg Leu Val Pro Ala Gly Ser Ala Glu Gly Gly Ala Asp Gly Ala
420 425 430
Ala Gly Gly Gly Arg Ser Ala Leu Arg Lys Arg Arg Arg Ser Arg Arg
435 440 445
Arg Ala Val Gly Gly Ala Val Leu Ala Ala Cys Val Ala Gly Gly Gly
450 455 460
Leu Ala Tyr Leu Gly Gly Phe Pro Gly Ser Ala Gly Glu Asp Gly Gly
465 470 475 480
Ala Ala Pro Ala Ala Pro Leu Asn Ala Leu Ser Ala Gly Glu Ser Ala
485 490 495
Gly Pro Ser Pro Ser Gly Pro Ala Gln Pro Ser Ala Ser Ala Ser Ala
500 505 510
Ser Pro Ser Pro Ser Ala Thr Ala Ser Pro Ser Val Ser Ala Ser Ala
515 520 525
Ser Pro Ser Gly Ala Gly Lys Ala Ser Thr Gly Ala Pro Thr Pro Ser
530 535 540
Pro Ser Ala Pro Arg Pro Ala Ser Pro Ala Pro Val Pro Ala Pro Gln
545 550 555 560
Gly Pro Val Gly Gln Val Val Ala Leu Val Asn Ala Glu Arg Ala Lys
565 570 575
Ala Gly Cys Gly Pro Leu Lys Asp Asp Ala Gln Leu Arg Lys Ala Ala
580 585 590
Gln Arg His Ser Glu Asp Met Ala Ser Arg Asn Phe Phe Ser His Thr
595 600 605
Ala Pro Asp Gly Ser Asp Pro Gly Glu Arg Thr Thr Ala Ala Gly Tyr
610 615 620
Arg Trp Ala Thr Tyr Gly Glu Asn Ile Ala Arg Gly Gln Ser Thr Ala
625 630 635 640
Glu Ser Val Met Asn Ser Trp Met Asn Ser Asp Gly His Arg Ala Asn
645 650 655
Ile Leu Asn Cys Ser Phe Lys Asp Ile Gly Val Gly Leu His Gln Gly
660 665 670
Pro Gly Gly Pro Trp Trp Thr Gln Asn Phe Gly Ala Arg Met
675 680 685
<210> 3
<211> 1566
<212> DNA
<213> Streptomyces sp
<400> 3
atggggctca tggacgcgga ccacgcgggc ctggtcgtcg cggctcaggc cggggacgac 60
cgggcgcgcg aagagctgat cgccgcctac ctgccgctgg tctacaacat cgtccggcgg 120
gcgctgagcg cacatgccga cgtcgacgac gtcgtccagg agacgctgtt gcgcgtggtg 180
cgcgaccttc ctgccctgcg cgctcccgac agcttccgct cctggctggt gtcgatcacg 240
ctccgccaga tacagaccca ctggcagcgg cagcgcgcgg tcaccaaccg gaccgccgtc 300
atcgacgagg cgctcgacct gccggatgcc ggcttcgaac ccgaggacgc gacgatcctg 360
cgcctccgtg tgtcggacga gcgccgtcgg atcgccgaag ccggccggtg gctcgacccg 420
gaccaccgcg ccctgctgtc cctctggtgg caggagcacg ccgggctgct gacccgggag 480
gacatcgcgg ccgcgacggg cctcaccgcc gcccacgccg gagtgcgcct gcagcgcatg 540
cgcgagcagc tggacctgag ccggacgatc gtcgccgccc tggaggccca cccgcgatgc 600
ccgcagctgg gcgagaccgt cgccggctgg gacggcctgc gtacctcggt gtggcggaag 660
cggatcgcgc ggcacacccg cgactgcccg gtctgcacgg cgacgacggc gaaccgggtc 720
cctgccgagc agctgctgct cggcctcgcg ccgctggccg tccccgccgg actcctcgcc 780
acgctgaccg ccaagggcct gctgtcgggt tcggccgcga gcaccgccgc gcctgccatg 840
gcgccggtcg cggccggcgc ggcggtgaag gcaggcggtc tgcacggcgc ggcgaccggc 900
aaactccacg cggtgaccgc tcacccgctg gcaagcatcg ccgccggtgc ggtgctcatc 960
accggagccg ccacgtacgc ggcctggccg gagccgacgc cccgggcgcc cggcgtcacc 1020
gccgccccca ccgtcgcccc cacggccgcc gttcccgcgc cggtcccgtc gggcaccccc 1080
acgccggccg caccgccgcc ggcgagtccg tccgccgtcc ccgcgggcac ggttccgctg 1140
ggcgcgcaca cactggagtc cgtcgaccag cccgacctct acctgacgta cgccggcgac 1200
ttcgcgacgc tcggccgggc cgccgactcc gacagcacgc aggcacggca gcgcgtcacg 1260
ttcacggtgg tccgggggct ggccgacggg cggtgcgtca ccttccgcgc ggccgacggc 1320
cgttacctgc ggcaccacta cctgcggctg cggctgagca ccgacgacgg cagcgaactc 1380
ttccggaagg acgcgacctt ctgcccccgc cccggagcgg tcgcggggtc ggtgaccctg 1440
tactcccaca actacccggg atcggtcgtc cgccaccgcg acggcggcat ctggctcgac 1500
ggctccgacg gcacccgggc cttcgccggc caggcctcct tcgtcgtccg caaggcccgg 1560
ccctga 1566
<210> 4
<211> 521
<212> PRT
<213> Streptomyces sp
<400> 4
Met Gly Leu Met Asp Ala Asp His Ala Gly Leu Val Val Ala Ala Gln
1 5 10 15
Ala Gly Asp Asp Arg Ala Arg Glu Glu Leu Ile Ala Ala Tyr Leu Pro
20 25 30
Leu Val Tyr Asn Ile Val Arg Arg Ala Leu Ser Ala His Ala Asp Val
35 40 45
Asp Asp Val Val Gln Glu Thr Leu Leu Arg Val Val Arg Asp Leu Pro
50 55 60
Ala Leu Arg Ala Pro Asp Ser Phe Arg Ser Trp Leu Val Ser Ile Thr
65 70 75 80
Leu Arg Gln Ile Gln Thr His Trp Gln Arg Gln Arg Ala Val Thr Asn
85 90 95
Arg Thr Ala Val Ile Asp Glu Ala Leu Asp Leu Pro Asp Ala Gly Phe
100 105 110
Glu Pro Glu Asp Ala Thr Ile Leu Arg Leu Arg Val Ser Asp Glu Arg
115 120 125
Arg Arg Ile Ala Glu Ala Gly Arg Trp Leu Asp Pro Asp His Arg Ala
130 135 140
Leu Leu Ser Leu Trp Trp Gln Glu His Ala Gly Leu Leu Thr Arg Glu
145 150 155 160
Asp Ile Ala Ala Ala Thr Gly Leu Thr Ala Ala His Ala Gly Val Arg
165 170 175
Leu Gln Arg Met Arg Glu Gln Leu Asp Leu Ser Arg Thr Ile Val Ala
180 185 190
Ala Leu Glu Ala His Pro Arg Cys Pro Gln Leu Gly Glu Thr Val Ala
195 200 205
Gly Trp Asp Gly Leu Arg Thr Ser Val Trp Arg Lys Arg Ile Ala Arg
210 215 220
His Thr Arg Asp Cys Pro Val Cys Thr Ala Thr Thr Ala Asn Arg Val
225 230 235 240
Pro Ala Glu Gln Leu Leu Leu Gly Leu Ala Pro Leu Ala Val Pro Ala
245 250 255
Gly Leu Leu Ala Thr Leu Thr Ala Lys Gly Leu Leu Ser Gly Ser Ala
260 265 270
Ala Ser Thr Ala Ala Pro Ala Met Ala Pro Val Ala Ala Gly Ala Ala
275 280 285
Val Lys Ala Gly Gly Leu His Gly Ala Ala Thr Gly Lys Leu His Ala
290 295 300
Val Thr Ala His Pro Leu Ala Ser Ile Ala Ala Gly Ala Val Leu Ile
305 310 315 320
Thr Gly Ala Ala Thr Tyr Ala Ala Trp Pro Glu Pro Thr Pro Arg Ala
325 330 335
Pro Gly Val Thr Ala Ala Pro Thr Val Ala Pro Thr Ala Ala Val Pro
340 345 350
Ala Pro Val Pro Ser Gly Thr Pro Thr Pro Ala Ala Pro Pro Pro Ala
355 360 365
Ser Pro Ser Ala Val Pro Ala Gly Thr Val Pro Leu Gly Ala His Thr
370 375 380
Leu Glu Ser Val Asp Gln Pro Asp Leu Tyr Leu Thr Tyr Ala Gly Asp
385 390 395 400
Phe Ala Thr Leu Gly Arg Ala Ala Asp Ser Asp Ser Thr Gln Ala Arg
405 410 415
Gln Arg Val Thr Phe Thr Val Val Arg Gly Leu Ala Asp Gly Arg Cys
420 425 430
Val Thr Phe Arg Ala Ala Asp Gly Arg Tyr Leu Arg His His Tyr Leu
435 440 445
Arg Leu Arg Leu Ser Thr Asp Asp Gly Ser Glu Leu Phe Arg Lys Asp
450 455 460
Ala Thr Phe Cys Pro Arg Pro Gly Ala Val Ala Gly Ser Val Thr Leu
465 470 475 480
Tyr Ser His Asn Tyr Pro Gly Ser Val Val Arg His Arg Asp Gly Gly
485 490 495
Ile Trp Leu Asp Gly Ser Asp Gly Thr Arg Ala Phe Ala Gly Gln Ala
500 505 510
Ser Phe Val Val Arg Lys Ala Arg Pro
515 520
<210> 5
<211> 2061
<212> DNA
<213> Streptomyces sp
<400> 5
atgacttcag agcacacggc agagctggtc gcggcggccc gcgcgggaga cctccacgcg 60
caggacgagc tggtcagcgc gtatctgccg ctggtctaca acatcgtcgg gcgcgccatg 120
aacggctccg tcgacgtgga cgacgtcgtg caggacacga tgctgcgggc cctcgacggc 180
ctcggcaccc tccgttcgga cgacagcttc cgctcgtggt tggtggccat cgcgatgaac 240
cgggtgcggg cgtactggca ggcccggcgc accgctcccg gtgagagcgg tctggaggcc 300
gcctgggagc tcgccgaccc gggggcggac ttcgtcgacc tgaccgtcgt ccggctggcc 360
ctggaggggc agcgccgcga gacggcgcgc gcgacccgct ggctggagcc ggacgaccgg 420
gcgctgctgt cgctgtggtg gctggagtgc gcgggggagc tgacccgggg cgaggtggcc 480
gcggcgctgg agctgacgcc gcagcacacg gccgtgcggg tgcagcggat gaaggcgcag 540
ctggagtccg cgcgcgtggt ggagggggcg ctggcggcgc agccgccgtg cgaggcgctg 600
gccgcggtga cggcctcctg ggacgggcag ccgtccactc tgtggcgcaa gcgaatagcc 660
cggcacgccc gcgagtgcct gcggtgcgcc gggctgtgga acggtctgct cccggcggag 720
gggctgctgg cgggcctggc gctcgtcccg gtgtcggcgg cgctgctggc cggggtgcgg 780
tcggccgccg cgggcggttt cgcccccgcg ggcgcggcgt acgcggcggg cgtgcccgca 840
gacggccagg cggtgtccgc cggctgggcg cccgccgacg gccaggcggc atccggcggc 900
tggacgcccg cgtacgaagc accgcccggc ggcacgggcg gcggcggcca cggcttcgcc 960
ggcggtaccg gtacggacga cggcaacggc ttggcttccg gtgcaagtgc ggatgacggc 1020
ggcagcggcc tcggcgccgg tgccgggccg cacggcggcg gcaacggctt cgcttccggt 1080
gcgggtgacg gcggcaacgg cttggcttcc ggtgcaagtg cgggtgacgg cggcagcggc 1140
gcgggtgacg gcacgcgcag tggcgacgac ggtttcggcg ccggggcgca cgggggcggc 1200
agcggcttcg gcggcggtga cggcggtccg ggtggcggcg aggccgtggc ccggcttgtg 1260
ccggccggtt ccgcggaggg aggtgccgat ggcgcggcgg gcggcggccg gagcgcgttg 1320
cgcaagcggc ggcgcagtcg gcggcgggcc gtcgggggtg ccgtgctcgc cgcctgcgtc 1380
gcgggcggcg gcctcgccta cctgggcggc ttccccggct ccgccggcga ggacggcgga 1440
gccgccccgg cagctccgct gaacgcgctg tccgcaggcg agtccgccgg cccgtcgccg 1500
agcggcccgg cccagccatc cgcgtccgct tcggcctcgc cgtccccgtc ggccaccgcg 1560
tcgccctcgg tgtcggcgtc ggcgagccct tccggcgcgg gcaaggcgtc cagtggcgcg 1620
ccgaccccgt ccccgtccgc gccgcgcccc gcgtcgccgg ccccggtgcc cgccccgcag 1680
ggaccggtgg gccaggtcgt cgccctcgtc aacgccgagc gggcgaaggc gggctgcggc 1740
ccgctgaagg acgacgcgca gctgcggaag gccgcgcagc ggcactccga ggacatggcg 1800
agccggaact tcttctcgca cacggccccg gacggctccg acccgggcga gcggacgacg 1860
gccgccggct accggtgggc gacgtacggc gagaacatag cgcgcgggca gagcaccgcc 1920
gagtcggtca tgaactcgtg gatgaacagc gacggccacc gcgcgaacat cctgaactgc 1980
tccttcaagg acataggcgt cggactccac cagggccccg gcggcccgtg gtggacgcag 2040
aacttcggcg cccgcatgtg a 2061
<210> 6
<211> 686
<212> PRT
<213> Streptomyces sp
<400> 6
Met Thr Ser Glu His Thr Ala Glu Leu Val Ala Ala Ala Arg Ala Gly
1 5 10 15
Asp Leu His Ala Gln Asp Glu Leu Val Ser Ala Tyr Leu Pro Leu Val
20 25 30
Tyr Asn Ile Val Gly Arg Ala Met Asn Gly Ser Val Asp Val Asp Asp
35 40 45
Val Val Gln Asp Thr Met Leu Arg Ala Leu Asp Gly Leu Gly Thr Leu
50 55 60
Arg Ser Asp Asp Ser Phe Arg Ser Trp Leu Val Ala Ile Ala Met Asn
65 70 75 80
Arg Val Arg Ala Tyr Trp Gln Ala Arg Arg Thr Ala Pro Gly Glu Ser
85 90 95
Gly Leu Glu Ala Ala Trp Glu Leu Ala Asp Pro Gly Ala Asp Phe Val
100 105 110
Asp Leu Thr Val Val Arg Leu Ala Leu Glu Gly Gln Arg Arg Glu Thr
115 120 125
Ala Arg Ala Thr Arg Trp Leu Glu Pro Asp Asp Arg Ala Leu Leu Ser
130 135 140
Leu Trp Trp Leu Glu Cys Ala Gly Glu Leu Thr Arg Gly Glu Val Ala
145 150 155 160
Ala Ala Leu Glu Leu Thr Pro Gln His Thr Ala Val Arg Val Gln Arg
165 170 175
Met Lys Ala Gln Leu Glu Ser Ala Arg Val Val Glu Gly Ala Leu Ala
180 185 190
Ala Gln Pro Pro Cys Glu Ala Leu Ala Ala Val Thr Ala Ser Trp Asp
195 200 205
Gly Gln Pro Ser Thr Leu Trp Arg Lys Arg Ile Ala Arg His Ala Arg
210 215 220
Glu Cys Leu Arg Cys Ala Gly Leu Trp Asn Gly Leu Leu Pro Ala Glu
225 230 235 240
Gly Leu Leu Ala Gly Leu Ala Leu Val Pro Val Ser Ala Ala Leu Leu
245 250 255
Ala Gly Val Arg Ser Ala Ala Ala Gly Gly Phe Ala Pro Ala Gly Ala
260 265 270
Ala Tyr Ala Ala Gly Val Pro Ala Asp Gly Gln Ala Val Ser Ala Gly
275 280 285
Trp Ala Pro Ala Asp Gly Gln Ala Ala Ser Gly Gly Trp Thr Pro Ala
290 295 300
Tyr Glu Ala Pro Pro Gly Gly Thr Gly Gly Gly Gly His Gly Phe Ala
305 310 315 320
Gly Gly Thr Gly Thr Asp Asp Gly Asn Gly Leu Ala Ser Gly Ala Ser
325 330 335
Ala Asp Asp Gly Gly Ser Gly Leu Gly Ala Gly Ala Gly Pro His Gly
340 345 350
Gly Gly Asn Gly Phe Ala Ser Gly Ala Gly Asp Gly Gly Asn Gly Leu
355 360 365
Ala Ser Gly Ala Ser Ala Gly Asp Gly Gly Ser Gly Ala Gly Asp Gly
370 375 380
Thr Arg Ser Gly Asp Asp Gly Phe Gly Ala Gly Ala His Gly Gly Gly
385 390 395 400
Ser Gly Phe Gly Gly Gly Asp Gly Gly Pro Gly Gly Gly Glu Ala Val
405 410 415
Ala Arg Leu Val Pro Ala Gly Ser Ala Glu Gly Gly Ala Asp Gly Ala
420 425 430
Ala Gly Gly Gly Arg Ser Ala Leu Arg Lys Arg Arg Arg Ser Arg Arg
435 440 445
Arg Ala Val Gly Gly Ala Val Leu Ala Ala Cys Val Ala Gly Gly Gly
450 455 460
Leu Ala Tyr Leu Gly Gly Phe Pro Gly Ser Ala Gly Glu Asp Gly Gly
465 470 475 480
Ala Ala Pro Ala Ala Pro Leu Asn Ala Leu Ser Ala Gly Glu Ser Ala
485 490 495
Gly Pro Ser Pro Ser Gly Pro Ala Gln Pro Ser Ala Ser Ala Ser Ala
500 505 510
Ser Pro Ser Pro Ser Ala Thr Ala Ser Pro Ser Val Ser Ala Ser Ala
515 520 525
Ser Pro Ser Gly Ala Gly Lys Ala Ser Ser Gly Ala Pro Thr Pro Ser
530 535 540
Pro Ser Ala Pro Arg Pro Ala Ser Pro Ala Pro Val Pro Ala Pro Gln
545 550 555 560
Gly Pro Val Gly Gln Val Val Ala Leu Val Asn Ala Glu Arg Ala Lys
565 570 575
Ala Gly Cys Gly Pro Leu Lys Asp Asp Ala Gln Leu Arg Lys Ala Ala
580 585 590
Gln Arg His Ser Glu Asp Met Ala Ser Arg Asn Phe Phe Ser His Thr
595 600 605
Ala Pro Asp Gly Ser Asp Pro Gly Glu Arg Thr Thr Ala Ala Gly Tyr
610 615 620
Arg Trp Ala Thr Tyr Gly Glu Asn Ile Ala Arg Gly Gln Ser Thr Ala
625 630 635 640
Glu Ser Val Met Asn Ser Trp Met Asn Ser Asp Gly His Arg Ala Asn
645 650 655
Ile Leu Asn Cys Ser Phe Lys Asp Ile Gly Val Gly Leu His Gln Gly
660 665 670
Pro Gly Gly Pro Trp Trp Thr Gln Asn Phe Gly Ala Arg Met
675 680 685
<210> 7
<211> 1566
<212> DNA
<213> Streptomyces sp
<400> 7
atggggctca tggacgcgga ccacgcgggc ctggtcgtcg cggctcaggc cggggacgac 60
cgggcgcgcg aagagctgat cgccgcctac ctgccgctgg tctacaacat cgtccggcgg 120
gcgctgagcg cacatgccga cgtcgacgac gtcgtccagg agacgctgtt gcgcgtggtg 180
cgcgaccttc ctgccctgcg cgctcccgac agcttccgct cctggctggt gtcgatcacg 240
ctccgccaga tacagaccca ctggcagcgg cagcgcgcgg tcgccaaccg gaccgccgtc 300
atcgacgagg cgctcgacct gccggatgcc ggcttcgaac ccgaggacgc gacgatcctg 360
cgcctccgtg tgtcggacga gcgccgtcgg atcgccgaag ccggccggtg gctcgacccg 420
gaccaccgcg ccctgctgtc cctctggtgg caggagcacg ccgggctgct gacccgggag 480
gacatcgcgg ccgcgacggg cctcaccgcc gcccacgccg gagtgcgcct gcagcgcatg 540
cgcgagcagc tggacctgag ccggacgatc gtcgccgccc tggaggccca cccgcgatgc 600
ccgcagctgg gcgagaccgt cgccggctgg gacggcctgc gtacctcggt gtggcggaag 660
cggatcgcgc ggcacacccg cgactgcccg gtctgcacgg cgacgacggc gaaccgggtc 720
cctgccgagc agctgctgct cggcctcgcg ccgctggccg tccccgccgg actcctcgcc 780
acgctgaccg ccaagggcct gctgtcgggt tcggccgcgg gcgccgccgc gcctgccatg 840
gcgccggtcg cggccggcgc ggcggtgaag gcaggcggtc tgcacggcgc ggcgaccggc 900
aaactccacg cggtgaccgc tcacccgctg gcaagcatcg ccgccggtgc ggtgctcatc 960
accggagccg cgacgtacgc agcctggccg gagccgacgc cccgggcgcc cggcgtcacc 1020
gccgccccca ccgtcgcgcc cacggccgcc gctcccgcgc cggtcccgtc gggcaccccc 1080
acgccggccg caccgccacc ggcgagtccg tccgccgtcc ccgcgggcac ggttccgctg 1140
ggcgcgcaca cactggagtc cgtcgaccag cccgacctct acctgacgta cgccggcgac 1200
ttcgcgacgc tcggccgggc cgccgactcc ggcagcacgc aggcacggca gcgcgtcacg 1260
ttcacggtgg tccgggggct ggccgacggg cggtgcgtca ccttccgcgc ggccgacggc 1320
cgttacctgc ggcaccacta cctgcggctg cggctgagca ccgacgacgg cagcgaactc 1380
ttccggaagg acgcgacctt ctgcccccgc cccggagcgg tcgcggggtc ggtgaccctg 1440
tactcccaca actacccggg atcggtcgtc cgccaccgcg acggcggcat ctggctcgac 1500
ggctccgacg gcacccgggc cttcgccggc caggcctcct tcgtcgtccg caaggcccgg 1560
ccctga 1566
<210> 8
<211> 521
<212> PRT
<213> Streptomyces sp
<400> 8
Met Gly Leu Met Asp Ala Asp His Ala Gly Leu Val Val Ala Ala Gln
1 5 10 15
Ala Gly Asp Asp Arg Ala Arg Glu Glu Leu Ile Ala Ala Tyr Leu Pro
20 25 30
Leu Val Tyr Asn Ile Val Arg Arg Ala Leu Ser Ala His Ala Asp Val
35 40 45
Asp Asp Val Val Gln Glu Thr Leu Leu Arg Val Val Arg Asp Leu Pro
50 55 60
Ala Leu Arg Ala Pro Asp Ser Phe Arg Ser Trp Leu Val Ser Ile Thr
65 70 75 80
Leu Arg Gln Ile Gln Thr His Trp Gln Arg Gln Arg Ala Val Ala Asn
85 90 95
Arg Thr Ala Val Ile Asp Glu Ala Leu Asp Leu Pro Asp Ala Gly Phe
100 105 110
Glu Pro Glu Asp Ala Thr Ile Leu Arg Leu Arg Val Ser Asp Glu Arg
115 120 125
Arg Arg Ile Ala Glu Ala Gly Arg Trp Leu Asp Pro Asp His Arg Ala
130 135 140
Leu Leu Ser Leu Trp Trp Gln Glu His Ala Gly Leu Leu Thr Arg Glu
145 150 155 160
Asp Ile Ala Ala Ala Thr Gly Leu Thr Ala Ala His Ala Gly Val Arg
165 170 175
Leu Gln Arg Met Arg Glu Gln Leu Asp Leu Ser Arg Thr Ile Val Ala
180 185 190
Ala Leu Glu Ala His Pro Arg Cys Pro Gln Leu Gly Glu Thr Val Ala
195 200 205
Gly Trp Asp Gly Leu Arg Thr Ser Val Trp Arg Lys Arg Ile Ala Arg
210 215 220
His Thr Arg Asp Cys Pro Val Cys Thr Ala Thr Thr Ala Asn Arg Val
225 230 235 240
Pro Ala Glu Gln Leu Leu Leu Gly Leu Ala Pro Leu Ala Val Pro Ala
245 250 255
Gly Leu Leu Ala Thr Leu Thr Ala Lys Gly Leu Leu Ser Gly Ser Ala
260 265 270
Ala Gly Ala Ala Ala Pro Ala Met Ala Pro Val Ala Ala Gly Ala Ala
275 280 285
Val Lys Ala Gly Gly Leu His Gly Ala Ala Thr Gly Lys Leu His Ala
290 295 300
Val Thr Ala His Pro Leu Ala Ser Ile Ala Ala Gly Ala Val Leu Ile
305 310 315 320
Thr Gly Ala Ala Thr Tyr Ala Ala Trp Pro Glu Pro Thr Pro Arg Ala
325 330 335
Pro Gly Val Thr Ala Ala Pro Thr Val Ala Pro Thr Ala Ala Ala Pro
340 345 350
Ala Pro Val Pro Ser Gly Thr Pro Thr Pro Ala Ala Pro Pro Pro Ala
355 360 365
Ser Pro Ser Ala Val Pro Ala Gly Thr Val Pro Leu Gly Ala His Thr
370 375 380
Leu Glu Ser Val Asp Gln Pro Asp Leu Tyr Leu Thr Tyr Ala Gly Asp
385 390 395 400
Phe Ala Thr Leu Gly Arg Ala Ala Asp Ser Gly Ser Thr Gln Ala Arg
405 410 415
Gln Arg Val Thr Phe Thr Val Val Arg Gly Leu Ala Asp Gly Arg Cys
420 425 430
Val Thr Phe Arg Ala Ala Asp Gly Arg Tyr Leu Arg His His Tyr Leu
435 440 445
Arg Leu Arg Leu Ser Thr Asp Asp Gly Ser Glu Leu Phe Arg Lys Asp
450 455 460
Ala Thr Phe Cys Pro Arg Pro Gly Ala Val Ala Gly Ser Val Thr Leu
465 470 475 480
Tyr Ser His Asn Tyr Pro Gly Ser Val Val Arg His Arg Asp Gly Gly
485 490 495
Ile Trp Leu Asp Gly Ser Asp Gly Thr Arg Ala Phe Ala Gly Gln Ala
500 505 510
Ser Phe Val Val Arg Lys Ala Arg Pro
515 520
<210> 9
<211> 28
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 9
gctctagaat gacttcagag cacacggc 28
<210> 10
<211> 28
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 10
gcgatatctc acatgcgggc gccgaagt 28
<210> 11
<211> 28
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 11
gctctagaat ggggctcatg gacgcgga 28
<210> 12
<211> 28
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 12
gcgatatctc agggccgggc cttgcgga 28
Claims (6)
1. The application of the ribosome sigma factor B in improving the yield of lipstatin is characterized in that the nucleotide sequence of the ribosome sigma factor B is shown as SEQ ID NO.7, and the amino acid sequence of the encoded protein is shown as SEQ ID NO. 8.
2. A mutant of a ribosome sigma factor B, which is characterized in that the nucleotide sequence of the mutant is shown as SEQ ID NO. 3; the amino acid sequence of the protein coded by the nucleotide is shown as SEQ ID NO. 4.
3. Use of a mutant of ribosomal sigma factor B according to claim 2 for increasing the yield of lipstatin.
4. A plasmid comprising the nucleotide sequence of the mutant ribosomal sigma factor B of claim 2.
5. A transgenic cell line comprising the nucleotide sequence of the mutant ribosomal sigma factor B of claim 2.
6. An engineered bacterium comprising the nucleotide sequence of the mutant of ribosomal sigma factor B of claim 2.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111260753.2A CN113789314B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor B and mutant thereof in improving yield of lipstatin |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011176931.9A CN112342203B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor, mutant thereof and protein obtained by encoding to increase yield of lipstatin |
CN202111260753.2A CN113789314B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor B and mutant thereof in improving yield of lipstatin |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011176931.9A Division CN112342203B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor, mutant thereof and protein obtained by encoding to increase yield of lipstatin |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113789314A CN113789314A (en) | 2021-12-14 |
CN113789314B true CN113789314B (en) | 2023-05-02 |
Family
ID=74357118
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011176931.9A Active CN112342203B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor, mutant thereof and protein obtained by encoding to increase yield of lipstatin |
CN202111260753.2A Active CN113789314B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor B and mutant thereof in improving yield of lipstatin |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011176931.9A Active CN112342203B (en) | 2020-10-29 | 2020-10-29 | Application of ribosome sigma factor, mutant thereof and protein obtained by encoding to increase yield of lipstatin |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN112342203B (en) |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6238887B1 (en) * | 1997-10-03 | 2001-05-29 | Brigham & Women's Hospital | Ribosome recycling factor (FRR) of Staphylococcus aureus |
EP2141236A1 (en) * | 2008-07-03 | 2010-01-06 | KRKA, D.D., Novo Mesto | Process for production of lipstatin and microorganisms therefore |
CN101386829B (en) * | 2008-10-20 | 2010-10-20 | 中国农业大学 | Avid kyowamycin genetic engineering bacterium and use thereof |
CN103468603B (en) * | 2013-07-17 | 2015-02-11 | 杭州华东医药集团新药研究院有限公司 | Streptomyces toxytricini and applications of streptomyces toxytricini in preparation of lipstatin |
CN103820510B (en) * | 2014-01-17 | 2016-01-20 | 浙江工业大学 | A kind of fermentable produces method and the substratum of Lipstatin |
CN103937847B (en) * | 2014-03-04 | 2017-11-14 | 鲁南新时代生物技术有限公司 | A kind of method of fermenting and producing Lipstatin |
CN108753861B (en) * | 2018-06-08 | 2022-02-01 | 福建省微生物研究所 | Culture medium and method for producing lipstatin by fermenting streptomyces toxytricini |
CN110903358B (en) * | 2020-01-07 | 2020-05-12 | 中国科学院天津工业生物技术研究所 | Ribosomal factor and its mutant and its use in preparing vitamin B12In (1) |
-
2020
- 2020-10-29 CN CN202011176931.9A patent/CN112342203B/en active Active
- 2020-10-29 CN CN202111260753.2A patent/CN113789314B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN112342203B (en) | 2021-12-28 |
CN112342203A (en) | 2021-02-09 |
CN113789314A (en) | 2021-12-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2012118225A1 (en) | A method for producing pyrroloquinoline quinone using a bacterium of the genus methylobacterium or hyphomicrobium | |
CN112746036B (en) | Streptomyces and method for producing pseudouridine by fermenting same | |
CN114891764B (en) | Phospholipase, gene thereof, engineering bacteria and preparation method | |
CN112029701B (en) | Genetically engineered bacterium and application thereof in preparation of 22-hydroxy-23, 24-bis-cholesta-4-en-3-one | |
CN117778439A (en) | Genetically engineered bacterium for producing lincomycin, construction method and application | |
CN113789314B (en) | Application of ribosome sigma factor B and mutant thereof in improving yield of lipstatin | |
CN109055417B (en) | Recombinant microorganism, preparation method thereof and application thereof in production of coenzyme Q10 | |
CN113462628B (en) | Gene engineering bacterium for producing heme as well as construction method and application thereof | |
CN112195129B (en) | Violacein biosynthesis gene cluster and application thereof | |
CN112409372B (en) | Yuxithromycin analogue, preparation method and application thereof | |
CN112410353B (en) | fkbS gene, genetic engineering bacterium containing fkbS gene, and preparation method and application of fkbS gene | |
CN113755420A (en) | Genetic engineering bacterium for increasing content of butanedione and acetoin in yoghourt and application | |
CN109097315B (en) | Genetically engineered bacterium for high-yield lipopeptide and construction method and application thereof | |
CN110551739A (en) | Pyrazolomycin biosynthesis gene cluster, recombinant bacterium and application thereof | |
CN107541481B (en) | Genetic engineering bacterium for producing epirubicin and application thereof | |
CN114369582B (en) | Brucella bifidus source ester synthetase JG536_25355, coding gene and application | |
CN116042416B (en) | Multi-gene over-expression streptomycete engineering strain for high-yield epsilon-polylysine, method and application | |
CN116445515B (en) | Gene cluster participating in Li Pusi statin and structural analogue synthesis and application thereof | |
CN113604413B (en) | Recombinant strain, preparation method and application | |
CN116286575B (en) | Method for efficiently expressing raw starch alpha-amylase by using bacillus subtilis | |
CN114717135B (en) | Streptomyces pratensis and method for producing pseudouridine and 1-methyl-pseudouridine by fermentation of Streptomyces pratensis | |
CN118063531B (en) | Preparation and application of macrolide compound PA-46101s C-E | |
CN115125179B (en) | Genetic engineering bacteria for producing rapamycin and application thereof | |
CN112410274B (en) | Genetic engineering bacterium for producing ascomycin and preparation method and application thereof | |
CN118291337A (en) | Integrated bacillus licheniformis for expressing phospholipase D and construction method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |