CN109851670B - Metallothionein gene MT18, metallothionein obtained by encoding metallothionein gene MT18, expression and application of metallothionein - Google Patents
Metallothionein gene MT18, metallothionein obtained by encoding metallothionein gene MT18, expression and application of metallothionein Download PDFInfo
- Publication number
- CN109851670B CN109851670B CN201910290545.3A CN201910290545A CN109851670B CN 109851670 B CN109851670 B CN 109851670B CN 201910290545 A CN201910290545 A CN 201910290545A CN 109851670 B CN109851670 B CN 109851670B
- Authority
- CN
- China
- Prior art keywords
- cys
- gly
- ala
- ser
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000157 Metallothionein Proteins 0.000 title claims abstract description 59
- 102000003792 Metallothionein Human genes 0.000 title claims description 20
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 title claims description 17
- BDOSMKKIYDKNTQ-UHFFFAOYSA-N cadmium atom Chemical compound [Cd] BDOSMKKIYDKNTQ-UHFFFAOYSA-N 0.000 claims abstract description 21
- 229910052793 cadmium Inorganic materials 0.000 claims abstract description 19
- 229910001385 heavy metal Inorganic materials 0.000 claims abstract description 16
- 239000002689 soil Substances 0.000 claims abstract description 12
- 241000588724 Escherichia coli Species 0.000 claims description 20
- 108090000623 proteins and genes Proteins 0.000 claims description 11
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 claims description 4
- 229910052802 copper Inorganic materials 0.000 claims description 4
- 239000010949 copper Substances 0.000 claims description 4
- 238000005067 remediation Methods 0.000 claims description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 238000011109 contamination Methods 0.000 claims description 2
- 239000013598 vector Substances 0.000 claims 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 229910017052 cobalt Inorganic materials 0.000 claims 1
- 239000010941 cobalt Substances 0.000 claims 1
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 claims 1
- 239000013604 expression vector Substances 0.000 claims 1
- 239000011133 lead Substances 0.000 claims 1
- 239000002351 wastewater Substances 0.000 claims 1
- 244000005700 microbiome Species 0.000 abstract description 5
- 230000000694 effects Effects 0.000 abstract description 4
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 34
- 108010016616 cysteinylglycine Proteins 0.000 description 33
- RATXDYWHIYNZLE-DCAQKATOSA-N Met-Lys-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N RATXDYWHIYNZLE-DCAQKATOSA-N 0.000 description 32
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 28
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 24
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 23
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 22
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 21
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 18
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 17
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 14
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 13
- 108010015792 glycyllysine Proteins 0.000 description 13
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 12
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 230000012010 growth Effects 0.000 description 11
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 10
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 10
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 10
- 238000012163 sequencing technique Methods 0.000 description 9
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 8
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 8
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 6
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 6
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 6
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 6
- 229910052751 metal Inorganic materials 0.000 description 6
- 239000002184 metal Substances 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 5
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 5
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 5
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 5
- 241000196324 Embryophyta Species 0.000 description 5
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 5
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 5
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 5
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 5
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 5
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 5
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 5
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 5
- 239000012620 biological material Substances 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 238000000034 method Methods 0.000 description 5
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 4
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 4
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- 239000002028 Biomass Substances 0.000 description 4
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 4
- UEHCDNYDBBCQEL-CIUDSAMLSA-N Cys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N UEHCDNYDBBCQEL-CIUDSAMLSA-N 0.000 description 4
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 4
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 4
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 4
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 230000002401 inhibitory effect Effects 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 241000007910 Acaryochloris marina Species 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 3
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 3
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 3
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 3
- 241000192682 Calothrix sp. Species 0.000 description 3
- JIVJXVJMOBVCJF-ZLUOBGJFSA-N Cys-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)N JIVJXVJMOBVCJF-ZLUOBGJFSA-N 0.000 description 3
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 3
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 3
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 3
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 3
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 3
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 3
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 3
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 3
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 3
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 3
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 3
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 3
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 3
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 3
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 3
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 3
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 3
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 3
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 3
- 241000192710 Microcystis aeruginosa Species 0.000 description 3
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 3
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 3
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 3
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 3
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 3
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 238000001179 sorption measurement Methods 0.000 description 3
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- 241001677621 Aliterella atlantica Species 0.000 description 2
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 2
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 2
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- 238000007400 DNA extraction Methods 0.000 description 2
- 241000892911 Geitlerinema Species 0.000 description 2
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 2
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 2
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- QNILDNVBIARMRK-XVYDVKMFSA-N His-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N QNILDNVBIARMRK-XVYDVKMFSA-N 0.000 description 2
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 2
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 2
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- 241000218314 Liriodendron tulipifera Species 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 2
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 2
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 2
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 241000190950 Rhodopseudomonas palustris Species 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 2
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 2
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 2
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- 241000682509 Tolypothrix campylonemoides Species 0.000 description 2
- CGDZGRLRXPNCOC-SRVKXCTJSA-N Tyr-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CGDZGRLRXPNCOC-SRVKXCTJSA-N 0.000 description 2
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 229910052785 arsenic Inorganic materials 0.000 description 2
- RQNWIZPPADIBDY-UHFFFAOYSA-N arsenic atom Chemical compound [As] RQNWIZPPADIBDY-UHFFFAOYSA-N 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 238000003766 bioinformatics method Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 229910017604 nitric acid Inorganic materials 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 235000018102 proteins Nutrition 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- IAMNNSSEBXDJMN-CIUDSAMLSA-N Asp-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N IAMNNSSEBXDJMN-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- 241000453722 Chamaesiphon minutus Species 0.000 description 1
- 241000907165 Coleofasciculus chthonoplastes Species 0.000 description 1
- 241001464430 Cyanobacterium Species 0.000 description 1
- 241000199492 Cyanobacterium aponinum Species 0.000 description 1
- 241001464431 Cyanobacterium stanieri Species 0.000 description 1
- 241000159506 Cyanothece Species 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 1
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- 241000192601 Fischerella Species 0.000 description 1
- 241000637228 Geminocystis Species 0.000 description 1
- 241000637231 Geminocystis herdmanii Species 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- 241000999742 Halomicronema hongdechloris Species 0.000 description 1
- 241000813894 Hapalosiphonaceae Species 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- CZXKZMQKXQZDEX-YUMQZZPRSA-N His-Gly-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N CZXKZMQKXQZDEX-YUMQZZPRSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 241000568637 Hydrocoleum Species 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 241000029491 Kamptonema Species 0.000 description 1
- 241001105211 Leptolyngbya ohadii Species 0.000 description 1
- 241000215469 Leptolyngbya sp. Species 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- 241001659204 Limnoraphis robusta Species 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 241001226557 Lyngbya confervoides Species 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- 241000243392 Mastigocladopsis repens Species 0.000 description 1
- 241000823566 Mastigocoleus testarum Species 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241001430258 Methylobacterium radiotolerans Species 0.000 description 1
- 241000264662 Microcoleus vaginatus Species 0.000 description 1
- 241000186366 Mycobacterium bovis Species 0.000 description 1
- 241000211133 Mycobacterium caprae Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 241000763357 Oscillatoria brevis Species 0.000 description 1
- 241000122845 Phormidesmis priestleyi Species 0.000 description 1
- 241000576909 Phormidium tenue Species 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- 241000183024 Populus tremula Species 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- 241000192509 Pseudanabaena sp. Species 0.000 description 1
- 241001575211 Rivularia <snail> Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241000387897 Scytonema tolypothrichoides Species 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- 241000530636 Spirulina subsalsa Species 0.000 description 1
- 241001464991 Stanieria cyanosphaera Species 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- 241001313706 Thermosynechococcus Species 0.000 description 1
- 241001453191 Thermosynechococcus vulcanus Species 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- 241000758869 Tolypothrix bouteillei Species 0.000 description 1
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 1
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- 101710120355 Uncharacterized membrane protein ycf78 Proteins 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 241000511384 Xenococcus sp. Species 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000000184 acid digestion Methods 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 101150081303 arsM gene Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 230000007773 growth pattern Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- QSHDDOUJBYECFT-UHFFFAOYSA-N mercury Chemical compound [Hg] QSHDDOUJBYECFT-UHFFFAOYSA-N 0.000 description 1
- 229910052753 mercury Inorganic materials 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Images
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention discloses a metallothionein gene MT18, the base sequence of which is shown as SEQ ID NO: 1, derived from soil microorganisms, the metallothionein gene has good heavy metal tolerance and can still survive and grow in a cadmium solution with the concentration of 1.0M. And has obvious effect on enriching cadmium metal, thereby having great application potential in industries of heavy metal pollution treatment and the like.
Description
Technical Field
The invention belongs to the technical field of genetic engineering, and particularly relates to a metallothionein gene, metallothionein obtained by encoding the metallothionein gene, and expression and application of the metallothionein gene.
Background
Metallothionein (MT) is a metal binding protein commonly existing in organisms. Metallothioneins are low molecular weight proteins with metal binding capacity and high induction properties. The cysteine-rich short peptide has high affinity to various heavy metals. It is a protein with low molecular mass and extremely high cysteine residue and metal content. The metals bound thereto are mainly cadmium, copper and zinc, widely present in various organisms from microorganisms to humans, and the structures thereof are highly conserved.
Heavy metal bioremediation is a technology for heavy metal treatment by using biological materials, and heavy metal resistance genes are important resources for bioremediation. The biological material based on gene modification can break through the character limitation of natural biological materials, obtain the optimized combination of biomass, growth speed and repair efficiency, and the heavy metal resistance gene from microorganisms has shown great potential in the aspect of development of high-efficiency repair biological materials. As early as 1998, Nature Biotechnology (Rugh et al, 1998) reported that Populus tremula (yellow poplar, also known as Liriodendron tulipifera) stably expresses the bacterial mercury reducing gene merA, and that this transgenic plant exhibits an eco-friendly plant repair pathway. Arsenic methylation gene arsM from Rhodopseudomonas palustris (Rhodopseudomonas palustris) has recently been used to construct genetically engineered bacteria that demonstrate the potential for efficient remediation of arsenic contaminated soil (Chen et al, 2014). A few studies have also shown the potential of genetically engineered biomaterials in cadmium contamination remediation. For example, Shim et al transferred the yeast cadmium resistance gene YCF1 into poplar, promoted the growth of the plant in cadmium contaminated soil, and also enhanced the cadmium enrichment capacity of the plant (Shim et al, 2013); kang et al transferred three microbial cadmium resistance genes SpPCS, GshI and MntA into Escherichia coli, so that the cadmium enrichment capacity of the transgenic strain was increased by 25 times (Kang et al, 2007).
At the present stage, the metallothionein source with heavy metal tolerance/enrichment effect is usually from animals and plants, and the metallothionein source from microorganisms is rarely researched.
Disclosure of Invention
The invention aims to obtain a metallothionein gene with heavy metal tolerance/enrichment effect, so as to obtain metallothionein coded by the gene, and further apply the gene to the field of heavy metal pollution treatment.
The first aspect of the present invention provides a metallothionein gene, wherein the base sequence of the metallothionein gene is shown as SEQ ID NO: 1, and the amino acid sequence obtained by the gene coding is shown as SEQ ID NO: 2, respectively. The metallothionein gene is obtained by the following steps:
s1, using a copper solution to pollute soil, and culturing for 14 days at a constant temperature of 25 ℃ in a dark place
S2, extracting the whole genome of the soil by using a commercial DNA extraction kit
S3, using a second generation Illumia sequencing platform HiSeq to carry out complete sequencing on genome
S4, performing quality inspection, filtration and assembly on the sequencing sequence by a general bioinformatics method
S5, searching all related metallothionein gene sequences from public gene databases UniProt and GenBank
S6, manually checking the reliability of the metallothionein gene sequences, summarizing the reliable metallothionein gene sequences after numbering, and translating by software to form a metallothionein amino acid sequence database, wherein the metallothionein amino acid sequence database is shown in SEQ ID NO: 3 to SEQ ID NO: 54, a first electrode;
s7, searching possible metallothionein gene sequences in the soil genome assembled by sequencing by using a bioinformatics retrieval method BLAST, and taking the metallothionein amino acid sequence database as a retrieval template
S8, manually checking the obtained possible metallothionein gene sequence, checking nucleotide preference, metal binding site and other information of the sequence, and further determining the metallothionein gene sequence with high reliability
S9, function verification
9.1, chemically synthesizing the high-reliability metallothionein gene sequence to obtain a high-reliability suspected metallothionein sequence, directly connecting the high-reliability suspected metallothionein sequence to a pET28a (+) plasmid, and then transforming escherichia coli BL21(DE3) to obtain a transformed strain;
9.2 determination of cadmium resistance of said E.coli BL21(DE3) and said transformed strain by Dropassay test, using said E.coli BL21(DE3) as a control, said highly reliable metallothionein gene sequence being considered as a functional/active metallothionein gene when the transformed strain survives at minimal inhibitory concentration;
9.3 obtaining the growth curve of the Escherichia coli BL21(DE3) by measuring the absorbance value at 600nm in the growth tube;
9.4 harvesting the E.coli BL21 in the growth curve assay (DE 3);
9.5 digestion of the E.coli BL21(DE3) with strong acid, determination of the solution cadmium concentration using atomic absorption and calculation of the cadmium uptake in the E.coli BL21(DE 3).
The invention has the advantages and beneficial effects that:
the metallothionein gene disclosed by the invention is derived from soil microorganisms, has good heavy metal tolerance and can still survive and grow in a 1.0M cadmium solution. And has obvious effect on enriching cadmium metal, thereby having great application potential in industries of heavy metal pollution treatment and the like.
Drawings
FIG. 1 shows the growth of E.coli in various cadmium solutions.
FIG. 2 is a graph showing the growth of E.coli according to the present disclosure.
For a person skilled in the art, other relevant figures can be obtained from the above figures without inventive effort.
Detailed Description
In order to make the technical solution of the present invention better understood, the technical solution of the present invention is further described below with reference to specific examples.
Example one
The process for screening metallothionein genes comprises the following steps:
s1, using a copper solution to pollute soil, and culturing for 14 days at a constant temperature of 25 ℃ in a dark place;
s2, operating and extracting a Soil whole genome by using a commercial DNA extraction Kit (DNeasy PowerMax Soil Kit) according to a flow specified by a specification, and determining the DNA quality by gel electrophoresis;
s3, submitting commercial sequencing, and performing total sequencing on a genome by using a second-generation Illumia sequencing platform HiSeq
S4, performing quality inspection, filtration and assembly on the sequencing sequence by a general bioinformatics method
S5, searching all related metallothionein gene sequences from public gene databases UniProt and GenBank
S6, manually checking the reliability of the metallothionein gene sequences, summarizing the reliable metallothionein gene sequences after numbering, and translating by software to form a metallothionein amino acid sequence database, wherein the metallothionein amino acid sequence database is shown in SEQ ID NO: 3 to SEQ ID NO: 54, a first electrode;
s7, searching possible metallothionein gene sequences in the soil genome assembled by sequencing by using a bioinformatics retrieval method BLAST, taking a metallothionein amino acid sequence database as a retrieval template,
s8, manually checking the obtained possible metallothionein gene sequence, checking nucleotide preference, metal binding site and other information of the sequence, and further determining the reliable metallothionein gene sequence, wherein the base sequence of the reliable metallothionein gene sequence is shown as SEQ ID NO: 1, numbered MT 18.
Example two
9.1 chemically synthesizing the high-reliability metallothionein gene sequence MT18 to obtain the high-reliability suspected metallothionein sequence shown as SEQ ID NO: 2 (completed by GENEWIZ, Suzhou, a commercial company), the high-reliability suspected metallothionein sequence is directly connected to a pET28a (+) plasmid, and the plasmid is transformed into Escherichia coli (e.coli) BL21(DE3) through electrotransformation to obtain a transformed strain;
EXAMPLE III
Detailed procedure of function verification.
9.2 Minimum Inhibitory Concentration (MIC), the MIC value of Cd was assessed by measuring the ability of bacterial cells to grow in medium at different Cd concentrations. First, MT18 containing the transformed strain and its corresponding blank (empty pET28a (+)) were grown on Lauria Bertani (LB) -agar plates overnight at 29 ℃. Then, the recombinant plasmids containing bacterial colonies were spread onto LB-plate plates containing various concentrations (0-16mM) of Cd, including kanamycin (50mg/L), and an inducer (IPTG-isopopyl-B-D-1-thiogalactopyranoside, 100 mg/L). ) Their MICs were determined.
Determining a blank control Escherichia coli (e.coli) BL21(DE3) and Cd minimum inhibitory concentration, and determining said Escherichia coli BL21(DE3) by DropAssay; referring to FIG. 1, the blank E.coli (E.coli) BL21(DE3) labeled as pET28a was not viable at a Cd concentration of 1.0mmol/L and its minimum inhibitory concentration was Cd1.0 mmol/L; the transformed strain, the reference number of which is MT18, can survive when the concentration of Cd is 1.1mmol/L and 1.2mmol/L, and the minimum inhibition concentration of Cd is 1.3mmol/L proves that the transformed strain has good cadmium tolerance.
9.3 growth patterns (growth curves) for the more tolerant strains in different Cd solutions were screened.
For this assay, first, the stored bacterial cells were transferred to LB-agar plates containing kanamycin and IPTG and incubated overnight at 37 ℃ for activation. Then, individual bacterial colonies were transferred to 100mL LB liquid medium containing the antibiotics kanamycin (50. mu.g/mL), IPTG (100. mu.g/mL) and 1.0mM Cd. The bacterial samples were continuously shaken at 37 deg.C, 160 ℃ and 180 rpm. After 12-14 hours of shaking, bacterial samples were examined every 1 hour for absorbance at 600nm (optical density OD600) for 24 hours by a nanodrop spectrophotometer. All readings were plotted in FIG. 2 to obtain a growth curve for each bacterium in 1.0mM Cd.
As shown in fig. 2, PET 1-PET 3 are blank control Escherichia coli BL21(DE3), the transformed strain is labeled MT18 in the figure, during the growth process, the absorbance (number of colonies) of the transformed strain is higher than that of three blank control Escherichia coli samples, and the transformed strain maintains higher growth, reaches the highest peak in 15 th hour, the number of colonies of the transformed strain is 20% higher than that of the blank control Escherichia coli, and the Cd tolerance performance and the lifetime are proved to be good under the Cd environment.
9.4, 9.5Cd binding assay
After examination of the OD600, bacterial cells were collected using a high-speed centrifuge. Will collectThe biomass of (2) was incubated at 65 ℃ for 24-48 hours for drying. Then adding the dried biomassIn the tube. Biomass was digested by using nitric acid (65% strength) overnight. After complete digestion, heat is applied to remove the liquid. Then 2ml of 3% nitric acid was added to each tube. After 14-16 hours, the solution was filtered and transferred to a clean and dry tube. After completion of the acid digestion, the cadmium concentration in all samples was determined using an atomic absorption spectrophotometer (TAS-990). By detecting that the cadmium adsorption value of a blank control escherichia coli sample is 93.63g/kg, the cadmium adsorption value of the transformation strain is 119.64g/kg, and the cadmium adsorption capacity of the transformation strain is stronger than that of the blank control escherichia coli sample.
SEQUENCE LISTING
<110> research center of agricultural resources of institute of genetics and developmental biology of Chinese academy of sciences
<120> metallothionein gene MT18, metallothionein obtained by coding metallothionein gene MT18, expression and application thereof
<130> 1
<160> 54
<170> PatentIn version 3.5
<210> 1
<211> 279
<212> DNA
<213> Unknown
<220>
<223> unknown
<400> 1
atgttgcgat gctggattga ttcactgatc gaccgggagg ttcccatggc tgtctgcgaa 60
gtgtgcggaa acgactacta cctgtccttc gaagttgtca tggccggatc gcgccatgtg 120
ttcgacagct tcgagtgtgc catccatgcg ctcgcaccgg tctgcgagca ctgcggctgc 180
aagatcgtgg gccacggaat cgaggcggag gggacgttct actgctgcgc ttcatgtgcc 240
cgcaaggaag ggatcgtggc cgcggtggat cacgtgtga 279
<210> 2
<211> 92
<212> PRT
<213> Unknown
<220>
<223> unknown
<400> 2
Met Leu Arg Cys Trp Ile Asp Ser Leu Ile Asp Arg Glu Val Pro Met
1 5 10 15
Ala Val Cys Glu Val Cys Gly Asn Asp Tyr Tyr Leu Ser Phe Glu Val
20 25 30
Val Met Ala Gly Ser Arg His Val Phe Asp Ser Phe Glu Cys Ala Ile
35 40 45
His Ala Leu Ala Pro Val Cys Glu His Cys Gly Cys Lys Ile Val Gly
50 55 60
His Gly Ile Glu Ala Glu Gly Thr Phe Tyr Cys Cys Ala Ser Cys Ala
65 70 75 80
Arg Lys Glu Gly Ile Val Ala Ala Val Asp His Val
85 90
<210> 3
<211> 57
<212> PRT
<213> Thermosynechococcus vulcanus
<400> 3
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro His Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Asn Asp Ala Ile Met Val Asp Gly Lys Pro Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asn Gly Thr Cys Lys Glu Asn Ser Gly Cys Gly
35 40 45
His Ala Gly Cys Gly Cys Gly Ser Ala
50 55
<210> 4
<211> 54
<212> PRT
<213> Microcystis aeruginosa
<400> 4
Met Ile Ala Val Thr Thr Met Lys Cys Ala Cys Gly Ser Cys Thr Cys
1 5 10 15
Gln Val Ser Ile Ala Asp Ala Ile Lys Lys Asn Asp Gln Tyr Tyr Cys
20 25 30
Cys Gln Ala Cys Ala Asn Gly His Val Lys Glu Lys Gly Cys Gly His
35 40 45
Pro Gly Cys Val Cys Gly
50
<210> 5
<211> 57
<212> PRT
<213> Thermosynechococcus sp.
<400> 5
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro His Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Asp Ala Ile Met Val Asp Gly Lys Pro Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asn Gly Thr Cys Lys Glu Ser Asn Gly Cys Gly
35 40 45
His Ser Gly Cys Gly Cys Gly Ser Ala
50 55
<210> 6
<211> 57
<212> PRT
<213> Methylobacterium radiotolerans
<400> 6
Met Ala Ser Val Asp Val Glu Met Val Lys Cys Ala Cys Gln Asp Cys
1 5 10 15
Val Cys Val Ile Pro Val Ala Lys Ala Val Ser Arg Asp Gly Lys Ala
20 25 30
Tyr Cys Cys Asp Asp Cys Ala Asp Gly His Lys Asp His Ala Gly Cys
35 40 45
Glu His Ala Gly Cys Ala Cys His Gly
50 55
<210> 7
<211> 56
<212> PRT
<213> Microcystis aeruginosa
<400> 7
Met Ile Ala Val Thr Met Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Leu Ile Ala Asp Ala Ile Lys Lys Asn Asp Gln Tyr Tyr Cys
20 25 30
Ser Gln Ala Cys Ala Asn Gly His Val Asn Glu Asn Glu Lys Gly Cys
35 40 45
Gly His Gln Gly Cys Gly Cys Val
50 55
<210> 8
<211> 56
<212> PRT
<213> Microcystis aeruginosa
<400> 8
Met Ile Ala Val Thr Met Met Lys Cys Ala Cys Lys Pro Cys Leu Cys
1 5 10 15
Val Val Ser Ile Ala Asp Ala Ile Lys Glu Asn Asp Lys Tyr Tyr Cys
20 25 30
Ser Gln Ala Cys Ala Asn Gly His Val Asn Glu Asn Glu Lys Gly Cys
35 40 45
Gly His Gln Gly Cys Gly Cys Val
50 55
<210> 9
<211> 56
<212> PRT
<213> Synechococcus
<400> 9
Met Thr Ser Thr Thr Leu Val Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Asn Val Asp Pro Ser Lys Ala Ile Asp Arg Asn Gly Leu Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asp Gly His Thr Gly Gly Ser Lys Gly Cys Gly
35 40 45
His Thr Gly Cys Asn Cys His Gly
50 55
<210> 10
<211> 53
<212> PRT
<213> Mycobacterium tuberculosis
<400> 10
Met Arg Val Ile Arg Met Thr Asn Tyr Glu Ala Gly Thr Leu Leu Thr
1 5 10 15
Cys Ser His Glu Gly Cys Gly Cys Arg Val Arg Ile Glu Val Pro Cys
20 25 30
His Cys Ala Gly Ala Gly Asp Ala Tyr Arg Cys Thr Cys Gly Asp Glu
35 40 45
Leu Ala Pro Val Lys
50
<210> 11
<211> 48
<212> PRT
<213> Mycobacterium caprae
<400> 11
Met Thr Asn Tyr Glu Ala Gly Thr Leu Leu Thr Cys Ser His Glu Gly
1 5 10 15
Cys Gly Cys Arg Val Arg Ile Glu Val Pro Cys His Cys Ala Gly Ala
20 25 30
Gly Asp Ala Tyr Arg Cys Thr Cys Gly Asp Glu Leu Ala Pro Val Lys
35 40 45
<210> 12
<211> 53
<212> PRT
<213> Mycobacterium bovis
<400> 12
Met Arg Val Ile Arg Met Thr Asn Tyr Glu Ala Gly Thr Leu Leu Thr
1 5 10 15
Cys Ser His Glu Gly Cys Gly Cys Arg Val Arg Ile Glu Val Pro Cys
20 25 30
His Cys Ala Gly Ala Gly Asp Ala Tyr Arg Cys Thr Cys Gly Asp Glu
35 40 45
Leu Ala Pro Val Lys
50
<210> 13
<211> 55
<212> PRT
<213> Leptolyngbya sp.
<400> 13
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Asp Ile Ser Lys Ala Ile Gln Lys Asp Gly Gln Tyr Tyr Cys
20 25 30
Ser Glu Gly Cys Ala Ser Gly His Gly Asp Asn Ser Lys Gly Cys Gly
35 40 45
His Thr Gly Cys Asn Cys His
50 55
<210> 14
<211> 54
<212> PRT
<213> filamentous cyanobacterium
<400> 14
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Asp Ile Ser Lys Ala Ile Glu Lys Glu Gly Gln Tyr Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Ser Glu Gly Ser Thr Gly Cys Gly
35 40 45
His Pro Gly Cys Asn Cys
50
<210> 15
<211> 54
<212> PRT
<213> Halomicronema hongdechloris
<400> 15
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Asp Ser Cys Leu Cys
1 5 10 15
Ile Val Asp Thr Ser Lys Ala Val Glu Lys Glu Gly His Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Pro Glu Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Thr Cys His
50
<210> 16
<211> 55
<212> PRT
<213> Geitlerinema sp.
<400> 16
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Asp Thr Asp Lys Ala Val Glu Lys Asp Gly Gln Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Pro Asp Gly Ser Gly Cys Gly His
35 40 45
Gln Gly Cys Thr Cys His Ala
50 55
<210> 17
<211> 55
<212> PRT
<213> Phormidium tenue
<400> 17
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Asp Ser Cys Leu Cys
1 5 10 15
Val Val Asp Thr Ser Gln Ala Val Glu Lys Asp Gly His Tyr Phe Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Pro Glu Gly Ser Ala Gly Cys Gly
35 40 45
His Pro Gly Cys Gly Cys Asn
50 55
<210> 18
<211> 54
<212> PRT
<213> Lyngbya confervoides
<400> 18
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Asp Ser Cys Leu Cys
1 5 10 15
Ile Val Asn Thr Ser Lys Ala Val Glu Lys Glu Gly His Tyr Tyr Cys
20 25 30
Ser Asp Ala Cys Ala Asn Gly His Pro Glu Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Thr Cys His
50
<210> 19
<211> 54
<212> PRT
<213> Kamptonema
<400> 19
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Ser Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Thr Glu Ala Ile Glu Lys Asn Gly Gln Tyr Tyr Cys
20 25 30
Ser Asn Ala Cys Ala Asp Gly His Pro Asn Gly Thr Gly Cys Gly His
35 40 45
Ala Gly Cys Gly Cys His
50
<210> 20
<211> 48
<212> PRT
<213> Spirulina subsalsa
<400> 20
Val Lys Cys Ala Cys Ser Thr Cys Glu Cys Met Val Ser Pro Asp Lys
1 5 10 15
Ala Ile Glu Lys Asp Gly Lys Tyr Tyr Cys Gly Glu Ala Cys Ala Asn
20 25 30
Gly His Thr Asp Gly Ser His Gly Cys Gly His Pro Gly Cys Asn Cys
35 40 45
<210> 21
<211> 54
<212> PRT
<213> Acaryochloris marina
<400> 21
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Thr Ile Ser Asp Ala Ile Gln Lys Gly Gly Gln Tyr Phe Cys
20 25 30
Gly Gln Ala Cys Ala Asp Gly His Pro Ser Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys His
50
<210> 22
<211> 54
<212> PRT
<213> Acaryochloris marina
<400> 22
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Thr Ile Ser Asp Ala Ile Gln Lys Gly Gly Gln Tyr Phe Cys
20 25 30
Gly Gln Ala Cys Ala Asp Gly His Pro Ser Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys His
50
<210> 23
<211> 53
<212> PRT
<213> Tolypothrix campylonemoides
<400> 23
Met Thr Asn Val Thr Gln Leu Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Asp Ala Ile Gln Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Gln Thr Met Gln Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 24
<211> 53
<212> PRT
<213> Tolypothrix campylonemoides
<400> 24
Met Thr Asn Val Thr Gln Leu Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Asp Ala Ile Gln Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Gln Thr Met Gln Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 25
<211> 53
<212> PRT
<213> Calothrix sp.
<400> 25
Met Thr Thr Val Thr Met Met Lys Cys Ala Cys Glu Arg Cys Leu Cys
1 5 10 15
Val Val Ser Thr Ala Asp Ala Ile Glu Lys Glu Gly Lys Tyr Tyr Cys
20 25 30
Ser Gln Ala Cys Ala Asp Gly His Lys Asp Glu Lys Gly Cys Ala His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 26
<211> 52
<212> PRT
<213> Cyanobacterium aponinum
<400> 26
Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys Ile
1 5 10 15
Ile Asp Ile Ser Gln Ala Ile Ser Arg Asp Gly His Tyr Tyr Cys Ser
20 25 30
Thr Ala Cys Ala Glu Gly His Lys Glu Gly Glu Gly Cys Gly His Ser
35 40 45
Gly Cys Gly Cys
50
<210> 27
<211> 53
<212> PRT
<213> Scytonema tolypothrichoides
<400> 27
Met Thr Ser Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Glu Asn Ala Ile Gln Lys Asp Glu Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Lys Thr Met Lys Gly Cys Gly His
35 40 45
Asn Gly Cys Gly Cys
50
<210> 28
<211> 53
<212> PRT
<213> Limnoraphis robusta
<400> 28
Met Thr Ser Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Ser Ala Ile Lys Lys Asp Gly Lys Pro Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Ser Asn Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Thr Cys
50
<210> 29
<211> 54
<212> PRT
<213> Oscillatoria brevis
<400> 29
Met Thr Thr Val Thr Gln Ile Lys Cys Ala Cys Pro Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Thr Glu Ala Ile Glu Lys Ser Gly Lys Ser Tyr Cys
20 25 30
Ser Ser Ala Cys Ala Asp Gly His Pro Asn Gly Thr Gly Cys Gly His
35 40 45
Thr Gly Cys Glu Cys His
50
<210> 30
<211> 53
<212> PRT
<213> Fischerella
<400> 30
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Ile Glu Asp Ala Ile Gln Lys Asp Asn Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asp Gly His Gln Thr Thr Lys Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 31
<211> 54
<212> PRT
<213> Leptolyngbya ohadii
<400> 31
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Ser Asp Cys Leu Cys
1 5 10 15
Ile Val Asn Leu Asn Asp Ala Ile Met Lys Asp Gly Lys Ala Tyr Cys
20 25 30
Gly Asp Ala Cys Ala Asn Gly His Thr Gly Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys His
50
<210> 32
<211> 53
<212> PRT
<213> Acaryochloris marina
<400> 32
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Thr Ile Ser Asp Ala Ile Gln Lys Gly Gly Gln Tyr Phe Cys
20 25 30
Gly Gln Ala Cys Ala Asp Gly His Pro Ser Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Arg Cys
50
<210> 33
<211> 54
<212> PRT
<213> Phormidesmis priestleyi
<400> 33
Met Thr Ala Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Thr Thr Glu Gly Ala Val Gln Lys Asp Gly Lys Leu Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asp Gly His Pro Asn Gly His Gly Asp Cys Gly
35 40 45
His Lys Gly Cys Thr Cys
50
<210> 34
<211> 54
<212> PRT
<213> Microcoleus vaginatus
<400> 34
Met Thr Thr Ala Thr Gln Thr Lys Cys Ala Cys Pro Ser Cys Ser Cys
1 5 10 15
Val Val Asn Val Ser Glu Ala Ile Glu Lys Asp Gly Lys Thr Tyr Cys
20 25 30
Ser Ser Ala Cys Ala Asp Gly His Pro Asn Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Glu Cys His
50
<210> 35
<211> 54
<212> PRT
<213> Geminocystis herdmanii
<400> 35
Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys Ile Val
1 5 10 15
Asp Ile Ala Ser Ala Ile Gln Lys Asp Asn Gln Tyr Phe Cys Ser Asp
20 25 30
Ala Cys Ala Asn Gly His Lys Glu Gly Thr Thr Gly Cys Ser His Ser
35 40 45
Gly Cys Gly Cys His Gly
50
<210> 36
<211> 54
<212> PRT
<213> Geitlerinema sp.
<400> 36
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Asn Leu Ser Asp Ala Val His Lys Asp Glu Lys Tyr Tyr Cys
20 25 30
Cys Glu Ala Cys Ala Asn Gly His Gln Ser Gly Asp Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys His
50
<210> 37
<211> 53
<212> PRT
<213> Mastigocladopsis repens
<400> 37
Met Thr Ser Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Glu Asp Ala Ile Gln Lys Asp Asp Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Gln Thr Met Lys Gly Cys Gly His
35 40 45
Asn Gly Cys Gly Cys
50
<210> 38
<211> 52
<212> PRT
<213> Cyanothece sp.
<400> 38
Thr Val Thr Gln Met Lys Cys Ala Cys Ser Ser Cys Val Cys Ile Val
1 5 10 15
Asp Leu Ser Asp Ala Ile Gln Lys Asp Gly Lys Tyr Tyr Cys Ser Asp
20 25 30
Ala Cys Ala Asn Gly His Pro Asp Gly Ala Gly Cys Ser His His Gly
35 40 45
Cys Glu Cys His
50
<210> 39
<211> 53
<212> PRT
<213> Mastigocoleus testarum
<400> 39
Met Ala Asp Val Thr Ser Met Lys Cys Ala Cys Ala Asp Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Lys Asp Ala Ile Ala Lys Asn Gly Gln Tyr Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asn Gly His Val Asp Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Lys Cys
50
<210> 40
<211> 53
<212> PRT
<213> Calothrix sp.
<400> 40
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Ser Ala Val Met Lys Glu Gly Lys Pro Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Ala Asp Gly Lys Gly Cys Gly His
35 40 45
Thr Gly Cys Glu Cys
50
<210> 41
<211> 56
<212> PRT
<213> Pseudanabaena sp.
<400> 41
Met Ala Ser Ala Thr Leu Val Lys Cys Ala Cys Ser Lys Cys Leu Cys
1 5 10 15
Val Ile Asp Pro Ser Asp Ala Ile Glu Ala Asn Gly Lys Tyr Tyr Cys
20 25 30
Cys Lys Ala Cys Ala Ser Gly His Val Asp Gly Thr Asn Asp Ser His
35 40 45
Cys Ser Asp Val Gly Cys Glu Cys
50 55
<210> 42
<211> 58
<212> PRT
<213> Geminocystis sp.
<400> 42
Met Thr Thr Ala Thr Ile Thr Gln Met Lys Cys Ala Cys Pro Ser Cys
1 5 10 15
Leu Cys Ile Val Asp Ile Gly Thr Ala Leu Gln Lys Glu Gly Lys Tyr
20 25 30
Phe Cys Ser Thr Ala Cys Ala Glu Gly His Lys Glu Gly Thr Thr Gly
35 40 45
Cys Ser His Thr Gly Cys Gly Cys Asn Gly
50 55
<210> 43
<211> 53
<212> PRT
<213> Calothrix sp.
<400> 43
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Ser Ala Val Met Lys Glu Gly Lys Pro Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Gln Asp Gly Lys Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys
50
<210> 44
<211> 53
<212> PRT
<213> Rivularia sp.
<400> 44
Met Ala Ala Val Asp Leu Met Lys Cys Ala Cys Asp Lys Cys Leu Cys
1 5 10 15
Ile Val Lys Val Glu Thr Ala Ile Asp Arg Asp Gly Lys His Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Lys Thr Ile Thr Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 45
<211> 55
<212> PRT
<213> Coleofasciculus chthonoplastes
<400> 45
Met Thr Thr Ala Thr Gln Thr Gln Cys Ala Cys Asp Ser Cys Ala Cys
1 5 10 15
Met Val Ser Thr Asp Ser Ala Val Gln Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Ser Asp Ala Cys Ala Asn Gly His Pro Asn Gly Ala Gly Cys Gly His
35 40 45
Ser Gly Cys Glu Cys His Ala
50 55
<210> 46
<211> 53
<212> PRT
<213> Stanieria cyanosphaera
<400> 46
Met Ser Thr Val Thr Ser Met Lys Cys Ala Cys Asp Arg Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Asp Ala Val Lys Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Cys Glu Ala Cys Ala Asn Gly His Thr Asp Gly Ser Gly Cys Gly His
35 40 45
Gln Gly Cys Gly Cys
50
<210> 47
<211> 54
<212> PRT
<213> Hydrocoleum sp.
<400> 47
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Ile Glu Ser Ala Val Lys Lys Asn Gly Gln Asn Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Asn His Pro Asp Gly Ala Gly Cys Gly His
35 40 45
Glu Gly Cys Glu Cys Asn
50
<210> 48
<211> 53
<212> PRT
<213> Chamaesiphon minutus
<400> 48
Met Ser Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Asp Ala Ile Val Lys Asp Gly Lys His Tyr Cys
20 25 30
Gly Asp Ala Cys Ala Asn Gly His Pro Ala Gly Gln Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys
50
<210> 49
<211> 55
<212> PRT
<213> Tolypothrix bouteillei
<400> 49
Met Thr Thr Val Ser Gln Met Lys Cys Ala Cys Lys Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Ser Asp Ala Leu Met Lys Asp Gly Lys Ala Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Thr Asn Gly Glu Cys Cys Gly His
35 40 45
Thr Gly Cys Asp Cys His Ala
50 55
<210> 50
<211> 54
<212> PRT
<213> Hapalosiphonaceae
<400> 50
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Thr Asp Ala Val Ile Lys Asp Gly Lys Pro Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Pro Asn Gly Glu Gly Cys Gly His
35 40 45
Gln Gly Cys Gly Cys His
50
<210> 51
<211> 53
<212> PRT
<213> Aliterella atlantica
<400> 51
Met Thr Thr Ala Thr Gln Thr Gln Cys Ala Cys Glu Ser Cys His Cys
1 5 10 15
Pro Val Ser Glu Thr Glu Ala Val Gln Lys Asp Gly Lys Thr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Gln Gly His Pro Asp Gly Lys Gly Cys Gly His
35 40 45
Ala Gly Cys Asp Cys
50
<210> 52
<211> 53
<212> PRT
<213> Aliterella atlantica
<400> 52
Met Thr Thr Ala Thr Gln Thr Gln Cys Ala Cys Glu Ser Cys His Cys
1 5 10 15
Pro Val Ser Glu Thr Glu Ala Val Gln Lys Asp Gly Lys Thr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Gln Gly His Pro Asp Gly Lys Gly Cys Gly His
35 40 45
Ala Gly Cys Asp Cys
50
<210> 53
<211> 53
<212> PRT
<213> Cyanobacterium stanieri
<400> 53
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys
1 5 10 15
Ile Val Asn Leu Ser Asp Ala Ile Gln Lys Asn Asp His Tyr Tyr Cys
20 25 30
Cys Gln Ala Cys Ala Asp Gly His Pro Asn Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys
50
<210> 54
<211> 53
<212> PRT
<213> Xenococcus sp.
<400> 54
Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys Ile Val
1 5 10 15
Asn Val Ser Asp Ala Ile Ser Lys Glu Gly Lys Tyr Tyr Cys Ser Asp
20 25 30
Ala Cys Ala Lys Gly His Ser Glu Gly Ala Gly Cys Ser His Ala Gly
35 40 45
Cys Gly Cys His Ala
50
Claims (9)
1. A metallothionein gene, characterized in that the base sequence thereof is as shown in SEQ ID NO: 1 is shown.
2. A metallothionein encoded by the gene of claim 1, having an amino acid sequence as set forth in SEQ ID NO: 2, respectively.
3. A recombinant vector comprising the metallothionein gene according to claim 1.
4. The recombinant vector according to claim 3, wherein the vector is an E.coli expression vector.
5. A recombinant strain comprising the metallothionein gene of claim 1.
6. The recombinant strain of claim 5, wherein the strain is E.coli.
7. Use of the metallothionein gene according to claim 1, the metallothionein gene according to claim 2 or the recombinant strain according to any one of claims 5 to 6 in the treatment of heavy metal contamination.
8. The use according to claim 7, wherein the use is in heavy metal pollution remediation of wastewater and/or in heavy metal pollution remediation of soil.
9. Use according to claim 8, wherein the heavy metal is copper, cobalt, lead and/or cadmium.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910290545.3A CN109851670B (en) | 2019-04-11 | 2019-04-11 | Metallothionein gene MT18, metallothionein obtained by encoding metallothionein gene MT18, expression and application of metallothionein |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910290545.3A CN109851670B (en) | 2019-04-11 | 2019-04-11 | Metallothionein gene MT18, metallothionein obtained by encoding metallothionein gene MT18, expression and application of metallothionein |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109851670A CN109851670A (en) | 2019-06-07 |
CN109851670B true CN109851670B (en) | 2021-12-31 |
Family
ID=66903868
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910290545.3A Expired - Fee Related CN109851670B (en) | 2019-04-11 | 2019-04-11 | Metallothionein gene MT18, metallothionein obtained by encoding metallothionein gene MT18, expression and application of metallothionein |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109851670B (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5824512A (en) * | 1996-11-22 | 1998-10-20 | The United States Of America As Represented By The Secretary Of The Navy | Bacteria expressing metallothionein gene into the periplasmic space, and method of using such bacteria in environment cleanup |
CN101892187A (en) * | 2009-12-30 | 2010-11-24 | 华中农业大学 | Recombinant Pseudomonas putida CH01 capable of adsorbing heavy metal cadmium and application thereof |
CN106519023A (en) * | 2016-06-03 | 2017-03-22 | 哈尔滨工业大学(威海) | Gene cloning method, expressing method and application of psychrophilic yeast metallothionein |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7273962B2 (en) * | 2001-09-06 | 2007-09-25 | Mgp Biotechnologies, Llc | Compositions and methods for removing heavy metals from contaminated samples using membranes provided with purified metallothionein (MT) proteins |
-
2019
- 2019-04-11 CN CN201910290545.3A patent/CN109851670B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5824512A (en) * | 1996-11-22 | 1998-10-20 | The United States Of America As Represented By The Secretary Of The Navy | Bacteria expressing metallothionein gene into the periplasmic space, and method of using such bacteria in environment cleanup |
CN101892187A (en) * | 2009-12-30 | 2010-11-24 | 华中农业大学 | Recombinant Pseudomonas putida CH01 capable of adsorbing heavy metal cadmium and application thereof |
CN106519023A (en) * | 2016-06-03 | 2017-03-22 | 哈尔滨工业大学(威海) | Gene cloning method, expressing method and application of psychrophilic yeast metallothionein |
Non-Patent Citations (5)
Title |
---|
"Advances in metallothionein structure and functions";Milan Vasak;《Journal of Trace Elements in Medicine and Biology》;20051231;第19卷;第13-17页 * |
"Metagenomics-Guided Discovery of Potential Bacterial Metallothionein Genes from the Soil Microbiome That Confer Cu and/or Cd Resistance";Xiaofang Li et al.;《Applied and Environmental Microbiology》;20200417;第86卷(第9期);第1-16页 * |
"Uncultured bacterium clone MT18 metallothionein gene,complete cds,Accession:MT035821";Li X et al.;《EMBL-EBI》;20200319;第1页 * |
"大型真菌重金属富集能力与机制研究进展";陈苗苗 等;《农业资源与环境学报》;20171130;第34卷(第6期);第499-508页 * |
"微生物修复技术在重金属污染治理中的研究进展";薛高尚 等;《中国农学通报》;20121231;第28卷(第11期);第266-271页 * |
Also Published As
Publication number | Publication date |
---|---|
CN109851670A (en) | 2019-06-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ballot et al. | Diversity of cyanobacteria and cyanotoxins in Hartbeespoort Dam, South Africa | |
Giovagnetti et al. | The evolution of the photoprotective antenna proteins in oxygenic photosynthetic eukaryotes | |
CN101781653B (en) | Jujube tree metallothionein gene and application thereof to treatment of heavy metal pollution | |
CN107177599B (en) | Encoding gene for enhancing tolerance of plant to cadmium poison and reducing cadmium content of plant and application | |
CN110205332B (en) | Encoding gene for enhancing tolerance of plant to cadmium poison and reducing cadmium content of plant and application | |
CN112094838B (en) | Application of glucose-6-phosphate isomerase in regulation and control of plant starch content and biomass | |
Li et al. | Functional expression of MxIRT1, from Malus xiaojinensis, complements an iron uptake deficient yeast mutant for plasma membrane targeting via membrane vesicles trafficking process | |
CN111349632A (en) | Rice bacterial leaf blight resistant genes Xa2, Xa14 and Xa45(t) and application thereof | |
CN109207483B (en) | Watermelon disease-resistant gene Cltlp3 and coding protein and application thereof | |
CN109943571B (en) | Metallothionein gene MT20, metallothionein obtained by encoding metallothionein gene MT20, expression and application of metallothionein | |
CN112322648A (en) | ABC transporter gene MRP1S and preparation method and application thereof | |
WO2018017828A1 (en) | Biocatalyst comprising photoautotrophic organisms producing recombinant enzyme for degradation of harmful algal bloom toxins | |
CN109851670B (en) | Metallothionein gene MT18, metallothionein obtained by encoding metallothionein gene MT18, expression and application of metallothionein | |
CN110105446B (en) | Metallothionein gene MT27, metallothionein obtained by encoding metallothionein gene MT27, expression and application of metallothionein | |
Goh et al. | NRC immune receptor networks show diversified hierarchical genetic architecture across plant lineages | |
CN110004152B (en) | Metallothionein gene MT16, metallothionein obtained by coding metallothionein gene MT16, expression and application of metallothionein | |
Bai et al. | Research advances in mechanisms of arsenic hyperaccumulation of Pteris vittata: perspectives from plant physiology, molecular biology, and phylogeny | |
CN113004382B (en) | Application of EmBP1 gene or protein thereof | |
CN101942468B (en) | Gene LcGST for enhancing transgenic organism salt tolerance, preparation thereof and protein coded by using same | |
CN109207496A (en) | A kind of heavy metal cadmium binding proteins specific gene BjHMA4R and its coding albumen and application | |
Geiß et al. | Investigations on cyanobacterial diversity in a shallow estuary (Southern Baltic Sea) including genes relevant to salinity resistance and iron starvation acclimation | |
CN102604901B (en) | Heavy-metal mercury-resistance related protein DbsMerA and encoding genes and application thereof | |
CN108588109B (en) | Recombinant expression vector of C2H2 type transcription factor gene asr1 and application thereof | |
CN105037516B (en) | Corn OXS2 gene family, its coding albumen and application | |
CN102417899A (en) | Metallic cadmium resistance associated protein KdpD and coding gene and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20211231 |