CN109943571A - A metallothionein gene mt20, the metallothionein encoded thereby, and expression and application thereof - Google Patents
A metallothionein gene mt20, the metallothionein encoded thereby, and expression and application thereof
- Publication number
- CN109943571A CN201910289750.8A
- Authority
- CN
- China
- Prior art keywords
- cys
- gly
- ala
- ser
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000157 Metallothionein Proteins 0.000 title claims abstract description 50
- 102000003792 Metallothionein Human genes 0.000 title claims description 17
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 title claims description 12
- BDOSMKKIYDKNTQ-UHFFFAOYSA-N cadmium atom Chemical compound [Cd] BDOSMKKIYDKNTQ-UHFFFAOYSA-N 0.000 claims abstract description 20
- 229910052793 cadmium Inorganic materials 0.000 claims abstract description 18
- 229910001385 heavy metal Inorganic materials 0.000 claims abstract description 16
- 241000588724 Escherichia coli Species 0.000 claims description 17
- 239000002689 soil Substances 0.000 claims description 11
- 108090000623 proteins and genes Proteins 0.000 claims description 10
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 claims description 4
- 229910052802 copper Inorganic materials 0.000 claims description 4
- 239000010949 copper Substances 0.000 claims description 4
- 239000013598 vector Substances 0.000 claims 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 229910017052 cobalt Inorganic materials 0.000 claims 1
- 239000010941 cobalt Substances 0.000 claims 1
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 claims 1
- 239000013604 expression vector Substances 0.000 claims 1
- 239000011133 lead Substances 0.000 claims 1
- 239000002351 wastewater Substances 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 4
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 34
- RATXDYWHIYNZLE-DCAQKATOSA-N Met-Lys-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N RATXDYWHIYNZLE-DCAQKATOSA-N 0.000 description 32
- 108010016616 cysteinylglycine Proteins 0.000 description 32
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 28
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 24
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 23
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 22
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 21
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 18
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 17
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 14
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 13
- 108010015792 glycyllysine Proteins 0.000 description 13
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 12
- 230000012010 growth Effects 0.000 description 12
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 10
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 10
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 10
- 230000001580 bacterial effect Effects 0.000 description 9
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 8
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 8
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 8
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- 238000000034 method Methods 0.000 description 7
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 6
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 6
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 6
- 229910052751 metal Inorganic materials 0.000 description 6
- 239000002184 metal Substances 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 5
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 5
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 5
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 5
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 5
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 5
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 5
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 5
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 5
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 5
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 5
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 5
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 5
- 239000012620 biological material Substances 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 4
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 4
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- 239000002028 Biomass Substances 0.000 description 4
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 4
- UEHCDNYDBBCQEL-CIUDSAMLSA-N Cys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N UEHCDNYDBBCQEL-CIUDSAMLSA-N 0.000 description 4
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 4
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 4
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 241000007910 Acaryochloris marina Species 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 3
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 3
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 3
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 3
- 241000192682 Calothrix sp. Species 0.000 description 3
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 3
- JIVJXVJMOBVCJF-ZLUOBGJFSA-N Cys-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)N JIVJXVJMOBVCJF-ZLUOBGJFSA-N 0.000 description 3
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 3
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 3
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 3
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 3
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 3
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 3
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 3
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 3
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 3
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 3
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 3
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 3
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 3
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 3
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 3
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 3
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 3
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 3
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 3
- 241000192710 Microcystis aeruginosa Species 0.000 description 3
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 3
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 3
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 3
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 3
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 3
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 150000002739 metals Chemical class 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 238000001179 sorption measurement Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- 241001677621 Aliterella atlantica Species 0.000 description 2
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 2
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 2
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 2
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- 238000007400 DNA extraction Methods 0.000 description 2
- 241000892911 Geitlerinema Species 0.000 description 2
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 2
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 2
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- QNILDNVBIARMRK-XVYDVKMFSA-N His-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N QNILDNVBIARMRK-XVYDVKMFSA-N 0.000 description 2
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 2
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 2
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- 241000218314 Liriodendron tulipifera Species 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 2
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 2
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 2
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 2
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- COLJZWUVZIXSSS-CIUDSAMLSA-N Ser-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N COLJZWUVZIXSSS-CIUDSAMLSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 2
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 2
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 2
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- 241000682509 Tolypothrix campylonemoides Species 0.000 description 2
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 238000003766 bioinformatics method Methods 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 229910017604 nitric acid Inorganic materials 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 235000018102 proteins Nutrition 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- IAMNNSSEBXDJMN-CIUDSAMLSA-N Asp-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N IAMNNSSEBXDJMN-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- 241000453722 Chamaesiphon minutus Species 0.000 description 1
- 241000907165 Coleofasciculus chthonoplastes Species 0.000 description 1
- 241001464430 Cyanobacterium Species 0.000 description 1
- 241000199492 Cyanobacterium aponinum Species 0.000 description 1
- 241001464431 Cyanobacterium stanieri Species 0.000 description 1
- 241000159506 Cyanothece Species 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- OJQJUQUBJGTCRY-WFBYXXMGSA-N Cys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N OJQJUQUBJGTCRY-WFBYXXMGSA-N 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- VNXXMHTZQGGDSG-CIUDSAMLSA-N Cys-His-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O VNXXMHTZQGGDSG-CIUDSAMLSA-N 0.000 description 1
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- 101000844746 Drosophila melanogaster Drosomycin Proteins 0.000 description 1
- 101150015738 Fev gene Proteins 0.000 description 1
- 241000192601 Fischerella Species 0.000 description 1
- 241000637228 Geminocystis Species 0.000 description 1
- 241000637231 Geminocystis herdmanii Species 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- 241000999742 Halomicronema hongdechloris Species 0.000 description 1
- 241000813894 Hapalosiphonaceae Species 0.000 description 1
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 1
- CZXKZMQKXQZDEX-YUMQZZPRSA-N His-Gly-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N CZXKZMQKXQZDEX-YUMQZZPRSA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- 241000568637 Hydrocoleum Species 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 241000029491 Kamptonema Species 0.000 description 1
- 241001105211 Leptolyngbya ohadii Species 0.000 description 1
- 241000215469 Leptolyngbya sp. Species 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- 241001659204 Limnoraphis robusta Species 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 241001226557 Lyngbya confervoides Species 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- 241000243392 Mastigocladopsis repens Species 0.000 description 1
- 241000823566 Mastigocoleus testarum Species 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241001430258 Methylobacterium radiotolerans Species 0.000 description 1
- 241000264662 Microcoleus vaginatus Species 0.000 description 1
- 241000186366 Mycobacterium bovis Species 0.000 description 1
- 241000211133 Mycobacterium caprae Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 241000763357 Oscillatoria brevis Species 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- 241000122845 Phormidesmis priestleyi Species 0.000 description 1
- 241000576909 Phormidium tenue Species 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- 102100037681 Protein FEV Human genes 0.000 description 1
- 241000192509 Pseudanabaena sp. Species 0.000 description 1
- 241000190950 Rhodopseudomonas palustris Species 0.000 description 1
- 241001575211 Rivularia <snail> Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241000387897 Scytonema tolypothrichoides Species 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- 241000530636 Spirulina subsalsa Species 0.000 description 1
- 241001464991 Stanieria cyanosphaera Species 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- 241001313706 Thermosynechococcus Species 0.000 description 1
- 241001453191 Thermosynechococcus vulcanus Species 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
Landscapes
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
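The invention discloses a metallothionein gene, MT20, whose nucleotide sequence is shown in SEQ ID NO:1. The gene originates from soil microorganisms and confers good heavy-metal tolerance: E. coli carrying it can still survive and grow in a 1.0 mM cadmium solution. It also produces a marked cadmium-enrichment effect, and therefore has considerable application potential in fields such as heavy-metal pollution remediation.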
Description
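Technical Field
The invention belongs to the technical field of genetic engineering, and specifically relates to a metallothionein gene, the metallothionein it encodes, and the expression and application thereof.
Background Art
Metallothioneins (MTs) are a class of metal-binding proteins found widely in living organisms. They are low-molecular-weight, highly inducible proteins with metal-binding capacity: short, cysteine-rich peptides with high affinity for a range of heavy metals and a very high content of cysteine residues and bound metal. The metals they bind are mainly cadmium, copper and zinc. Metallothioneins occur in organisms ranging from microorganisms to humans, and their structure is highly conserved.
Heavy-metal bioremediation is a technology that uses biological materials to treat heavy-metal contamination, and heavy-metal resistance genes are an important resource for it. Genetically modified biological materials can overcome the trait limitations of natural ones and achieve an optimized combination of biomass, growth rate and remediation efficiency. Heavy-metal resistance genes of microbial origin have already shown great potential in the development of efficient remediation materials. As early as 1998, Nature Biotechnology (Rugh et al., 1998) reported yellow poplar (tulip tree) stably expressing the bacterial mercuric reductase gene merA; this transgenic plant demonstrated an eco-friendly phytoremediation route. The arsenic methylation gene arsM from Rhodopseudomonas palustris was more recently used to construct an engineered bacterium that showed potential for efficient remediation of arsenic-contaminated soil (Chen et al., 2014). A few studies have also shown the potential of genetically modified biological materials for remediating cadmium contamination. For example, Shim et al. transferred the yeast cadmium resistance gene YCF1 into poplar, which promoted the plant's growth in cadmium-contaminated soil and enhanced its cadmium accumulation (Shim et al., 2013); Kang et al. transferred three microbial cadmium resistance genes, SpPCS, GshI and MntA, into E. coli, increasing the cadmium accumulation of the transgenic strain 25-fold (Kang et al., 2007).
At present, metallothioneins with heavy-metal tolerance/enrichment effects are mostly derived from animals and plants, and metallothioneins of microbial origin have been studied comparatively little.
Summary of the Invention
The object of the invention is to obtain a metallothionein gene with heavy-metal tolerance/enrichment effects, thereby obtain the metallothionein it encodes, and then apply it in the field of heavy-metal pollution remediation.
A first aspect of the invention provides a metallothionein gene whose nucleotide sequence is shown in SEQ ID NO:1; the amino acid sequence encoded by the gene is shown in SEQ ID NO:2. The invention isolates the above metallothionein gene through the following steps: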
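S1. Contaminate soil with a copper solution and incubate it in the dark at a constant 25 °C for 14 days;
S2. Extract total soil genomic DNA using a commercial DNA extraction kit;
S3. Sequence the whole genome on the second-generation Illumina HiSeq sequencing platform;
S4. Perform quality control, filtering and assembly of the sequencing reads using general-purpose bioinformatics methods;
S5. Search the public gene databases UniProt and GenBank for all relevant metallothionein gene sequences;
S6. Manually check the reliability of the metallothionein gene sequences, number and pool the reliable ones, and translate them with software to form a metallothionein amino acid sequence database; the database is given in SEQ ID NO:3 to SEQ ID NO:54;
S7. Using the bioinformatics search tool BLAST, search the sequenced and assembled soil genome for candidate metallothionein gene sequences, with the metallothionein amino acid sequence database as the search template;
S8. Manually check the candidate metallothionein gene sequences obtained, verifying information such as nucleotide usage bias and metal-binding sites, to further determine the high-confidence metallothionein gene sequences;
S9. Functional verification:
9.1 Chemically synthesize the high-confidence metallothionein gene sequence to obtain the high-confidence putative metallothionein sequence, ligate it directly into the pET28a(+) plasmid, and then transform E. coli BL21(DE3) to obtain a transformed strain;
9.2 Determine the cadmium resistance of the E. coli BL21(DE3) and of the transformed strain by drop assay, using the E. coli BL21(DE3) as the control; if the transformed strain survives at the minimum inhibitory concentration of the control, the high-confidence metallothionein gene sequence is deemed a functional/active metallothionein gene;
9.3 Obtain the growth curve of the E. coli BL21(DE3) by measuring the absorbance at 600 nm in the growth tubes;
9.4 Harvest the E. coli BL21(DE3) from the growth-curve measurement;
9.5 Digest the E. coli BL21(DE3) with strong acid, measure the cadmium concentration of the solution by atomic absorption, and calculate the cadmium uptake of the E. coli BL21(DE3).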
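The search in step S7 can be illustrated with a small sketch. It assumes the BLAST search was run with tabular output (the twelve default columns of BLAST+ `-outfmt 6`) and then keeps contigs whose hits pass an E-value and a query-coverage threshold. The thresholds, contig names and hit rows are hypothetical and are not taken from the patent; only the query lengths match the lengths given for SEQ ID NO:3, 4 and 10.

```python
# Sketch only: filter BLAST tabular hits (BLAST+ "-outfmt 6" default columns).
# Thresholds and example rows are illustrative assumptions, not patent data.
from typing import NamedTuple

FIELDS = ("qseqid sseqid pident length mismatch gapopen "
          "qstart qend sstart send evalue bitscore").split()


class Hit(NamedTuple):
    query: str       # reference metallothionein, e.g. an entry of SEQ ID NO:3-54
    subject: str     # contig from the assembled soil metagenome
    identity: float  # percent identity
    length: int      # alignment length
    evalue: float


def parse_hits(text):
    """Parse tab-separated BLAST lines into Hit records."""
    hits = []
    for line in text.strip().splitlines():
        row = dict(zip(FIELDS, line.split("\t")))
        hits.append(Hit(row["qseqid"], row["sseqid"], float(row["pident"]),
                        int(row["length"]), float(row["evalue"])))
    return hits


def candidate_contigs(hits, query_lengths, max_evalue=1e-5, min_coverage=0.5):
    """Return contigs with at least one hit passing both thresholds."""
    keep = set()
    for h in hits:
        coverage = h.length / query_lengths[h.query]
        if h.evalue <= max_evalue and coverage >= min_coverage:
            keep.add(h.subject)
    return sorted(keep)


# Hypothetical hits: two pass, one fails on E-value, one fails on coverage.
example = "\n".join([
    "SEQ3\tcontig_12\t48.0\t50\t26\t0\t5\t54\t100\t249\t2e-09\t55.1",
    "SEQ4\tcontig_12\t45.0\t40\t22\t0\t8\t47\t130\t249\t1e-06\t44.0",
    "SEQ3\tcontig_77\t35.0\t40\t26\t0\t1\t40\t10\t129\t0.5\t25.0",
    "SEQ10\tcontig_90\t60.0\t10\t4\t0\t1\t10\t1\t30\t1e-08\t30.0",
])
lengths = {"SEQ3": 57, "SEQ4": 54, "SEQ10": 53}
print(candidate_contigs(parse_hits(example), lengths))
```

Contigs that survive such a filter would still go through the manual check of step S8; the filter only narrows the list.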
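The advantages and beneficial effects of the invention are as follows:
The metallothionein gene disclosed by the invention originates from soil microorganisms and confers good heavy-metal tolerance: E. coli carrying it can still survive and grow in a 1.0 mM cadmium solution. It also produces a marked cadmium-enrichment effect, and therefore has considerable application potential in fields such as heavy-metal pollution remediation.
Brief Description of the Drawings
Figure 1 shows the growth of the E. coli disclosed in the invention in cadmium solutions of different concentrations.
Figure 2 shows the growth curves of the E. coli disclosed in the invention.
A person of ordinary skill in the art can obtain other related drawings from the above drawings without creative effort.
Detailed Description of the Embodiments
To help those skilled in the art better understand the solution of the invention, the technical solution is further described below with reference to specific examples.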
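Example 1
Screening procedure by which the metallothionein gene was obtained:
S1. Soil was contaminated with a copper solution and incubated in the dark at a constant 25 °C for 14 days;
S2. Total soil genomic DNA was extracted with a commercial DNA extraction kit (DNeasy PowerMax Soil Kit) following the procedure specified in its manual, and DNA quality was checked by gel electrophoresis;
S3. The DNA was submitted for commercial sequencing, and the whole genome was sequenced on the second-generation Illumina HiSeq sequencing platform;
S4. Quality control, filtering and assembly of the sequencing reads were performed using general-purpose bioinformatics methods;
S5. The public gene databases UniProt and GenBank were searched for all relevant metallothionein gene sequences;
S6. The reliability of the metallothionein gene sequences was checked manually; the reliable sequences were numbered, pooled and translated with software to form a metallothionein amino acid sequence database, which is given in SEQ ID NO:3 to SEQ ID NO:54;
S7. Using the bioinformatics search tool BLAST, the sequenced and assembled soil genome was searched for candidate metallothionein gene sequences, with the metallothionein amino acid sequence database as the search template;
S8. The candidate metallothionein gene sequences obtained were checked manually, verifying information such as nucleotide usage bias and metal-binding sites, to further determine the high-confidence metallothionein gene sequence. The nucleotide sequence of this high-confidence metallothionein gene is shown in SEQ ID NO:1, and it is designated MT20.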
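The metal-binding-site check in S8 is described only as a manual inspection. As a rough illustration of what such a check might look at, the sketch below counts cysteines and Cys-X-Cys / Cys-X-X-Cys spacings in the MT20 protein (SEQ ID NO:2, transcribed into one-letter code). The choice of these motifs as a proxy for metal-binding sites is an assumption; the patent does not state its criteria.

```python
# Sketch only: cysteine statistics as a crude proxy for metal-binding sites.
# The choice of motifs is an assumption; the patent describes a manual check.
import re

# SEQ ID NO:2 (MT20 protein) in one-letter code, transcribed from the listing.
MT20 = ("MAECAWCHNEYDKTFSVTYGDQSYVFDCFECAITMLAPTC"
        "AHCGCRIIGHGDEVDGRFYCCASCVRAAEPVDLHDRATAG"
        "HVGGTR")


def cysteine_profile(protein):
    """Count cysteines and overlapping CxC / CxxC motifs (x = any non-Cys residue)."""
    return {
        "length": len(protein),
        "cys": protein.count("C"),
        "CxC": len(re.findall(r"(?=C[^C]C)", protein)),
        "CxxC": len(re.findall(r"(?=C[^C]{2}C)", protein)),
    }


profile = cysteine_profile(MT20)
print(profile)
print(f"cysteine fraction: {profile['cys'] / profile['length']:.3f}")
```

The same function could be run over the reference entries SEQ ID NO:3 to SEQ ID NO:54 to compare a candidate against known bacterial metallothioneins.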
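Example 2
9.1 The high-confidence metallothionein gene sequence MT20 was chemically synthesized (by the commercial company GENEWIZ, Suzhou) to obtain the high-confidence putative metallothionein sequence, whose amino acid sequence is shown in SEQ ID NO:2. The synthesized sequence was ligated directly into the pET28a(+) plasmid, and the plasmid was introduced into the host Escherichia coli (E. coli) BL21(DE3) by electroporation to obtain the transformed strain;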
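The relationship between SEQ ID NO:1 and SEQ ID NO:2 can be checked directly. The sketch below translates the 261-nt MT20 sequence with the standard genetic code and compares the result with the 86-residue protein from the sequence listing. Both sequences are transcribed from the listing; the use of the standard code (rather than a host-specific table) is an assumption.

```python
# Sketch only: translate SEQ ID NO:1 with the standard genetic code and
# compare the result with SEQ ID NO:2. Sequences are transcribed from the listing.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

MT20_DNA = (
    "atggccgaatgcgcgtggtgccacaacgagtacgacaaaacgttttctgtgacgtacgga"
    "gatcagtcgtacgtctttgattgcttcgaatgcgcgatcacgatgctggcgccaacgtgc"
    "gctcactgtggctgtcgcatcatcggacacggcgatgaggtggacgggcgcttctactgc"
    "tgcgcttcctgtgtaagggccgccgaaccggtggatctgcacgaccgcgcgactgctggc"
    "cacgtgggtggcacacgctag"
)
MT20_PROTEIN = (
    "MAECAWCHNEYDKTFSVTYGDQSYVFDCFECAITMLAPTC"
    "AHCGCRIIGHGDEVDGRFYCCASCVRAAEPVDLHDRATAG"
    "HVGGTR"
)


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(len(MT20_DNA), len(translate(MT20_DNA)))
print(translate(MT20_DNA) == MT20_PROTEIN)
```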
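Example 3
Detailed procedure for functional verification.
9.2 Minimum inhibitory concentration (MIC). The MIC of Cd was assessed by measuring the ability of the bacterial cells to grow on media with different Cd concentrations. First, the transformed strain carrying MT20 and its corresponding blank control (empty pET28a(+)) were grown overnight on Luria-Bertani (LB) agar plates at 29 °C. Bacterial colonies carrying the recombinant plasmid were then spread onto LB plates containing various concentrations of Cd (0-16 mM), kanamycin (50 mg/L) and the inducer IPTG (isopropyl β-D-1-thiogalactopyranoside, 100 mg/L), and their MICs were determined.
The Cd MIC of the blank-control Escherichia coli (E. coli) BL21(DE3) was determined by drop assay. As shown in Figure 1, the blank-control E. coli BL21(DE3) (labeled pET28a in the figure) could not survive at a Cd concentration of 0.8 mmol/L, so its MIC is 0.8 mmol/L Cd. The transformed strain (labeled MT20 in the figure) survived at Cd concentrations of both 0.8 mmol/L and 0.9 mmol/L, and its MIC is 1.0 mmol/L Cd, demonstrating that the transformed strain has good cadmium tolerance.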
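The MIC reading above can be written as a simple rule: the MIC is the lowest tested concentration at which no growth is observed. The following sketch applies that rule to the observations stated for Figure 1. The function name is hypothetical, and the entry for MT20 at 1.0 mmol/L is inferred from the reported MIC rather than stated as a separate observation.

```python
# Sketch only: MIC = lowest tested concentration at which no growth is seen.
# Observations are those stated for Figure 1; other concentrations are omitted.

def minimum_inhibitory_concentration(growth_by_conc):
    """growth_by_conc maps Cd concentration (mmol/L) to True if the strain grew."""
    inhibited = [conc for conc, grew in growth_by_conc.items() if not grew]
    return min(inhibited) if inhibited else None


control_pet28a = {0.8: False}              # empty-vector control
mt20 = {0.8: True, 0.9: True, 1.0: False}  # 1.0 inferred from the reported MIC

print(minimum_inhibitory_concentration(control_pet28a))
print(minimum_inhibitory_concentration(mt20))
```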
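9.3 Growth pattern (growth curve), used to screen for the more tolerant strain in Cd solution.
For this assay, stored bacterial cells were first transferred to LB agar plates containing kanamycin and IPTG and incubated overnight at 37 °C for activation. A single bacterial colony was then transferred into 100 mL of LB liquid medium containing the antibiotic kanamycin (50 μg/mL), IPTG (100 μg/mL) and 1.0 mM Cd. The cultures were shaken continuously at 37 °C and 160-180 rpm. After 12-14 hours of shaking, the absorbance of the bacterial samples at 600 nm (optical density, OD600) was measured on a NanoDrop spectrophotometer every hour for 24 hours. All readings were plotted as Figure 2 to obtain the growth curve of each bacterium in 1.0 mM Cd.
As shown in Figure 2, PET1 to PET3 are all blank-control Escherichia coli (E. coli) BL21(DE3), and the transformed strain is labeled MT20. Throughout growth, the absorbance (population size) of the transformed strain was higher than that of all three blank-control samples and kept rising strongly, peaking at hour 13, when the population of the transformed strain was about 50% higher than that of the blank-control E. coli. This demonstrates good tolerance and longer survival in a Cd environment.
9.4, 9.5 Cd binding assay
After the OD600 measurements, the bacterial cells were collected with a high-speed centrifuge. The collected biomass was incubated at 65 °C for 24-48 hours to dry it, and the dried biomass was then placed in tubes. The biomass was digested overnight with nitric acid (65%). After complete digestion, the tubes were heated to remove the liquid, and 2 mL of 3% nitric acid was added to each tube. After 14-16 hours, the solutions were filtered and transferred to clean, dry test tubes. Once acid digestion was complete, the cadmium concentration of every sample was measured with an atomic absorption spectrophotometer (TAS-990). The measured cadmium adsorption was 93.63 g/kg for the blank-control E. coli samples and 159.02 g/kg for the transformed strain, so the transformed strain adsorbs cadmium more strongly than the blank control.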
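The conversion from an atomic-absorption reading to cadmium content per unit dry biomass is not spelled out in the text. A minimal sketch of one plausible calculation follows, assuming the reading is in mg/L, the final volume is the 2 mL of 3% nitric acid, and the dry mass is weighed in mg. The function name and its example arguments are hypothetical; only the two reported adsorption values come from the text.

```python
# Sketch only: Cd content per dry biomass and the fold difference between strains.
# The conversion inputs are hypothetical; only the reported g/kg values are from the text.

def cd_content_g_per_kg(cd_mg_per_l, volume_ml, dry_mass_mg):
    """Convert an AAS reading (mg/L) in a known volume to g Cd per kg dry biomass."""
    cd_mg = cd_mg_per_l * volume_ml / 1000.0  # mg of Cd in the digest
    return cd_mg / dry_mass_mg * 1000.0       # mg/mg equals kg/kg; times 1000 gives g/kg


REPORTED_CONTROL = 93.63  # g/kg, blank-control E. coli
REPORTED_MT20 = 159.02    # g/kg, transformed strain

fold = REPORTED_MT20 / REPORTED_CONTROL
print(f"MT20 / control = {fold:.2f}")
print(f"relative increase = {(fold - 1) * 100:.0f}%")
```

With the reported values the transformed strain holds roughly 1.7 times as much cadmium per unit dry mass as the control.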
SEQUENCE LISTING
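<110> Center for Agricultural Resources Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences
<120> Metallothionein gene MT20, metallothionein encoded thereby, and expression and application thereof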
<130> 1
<160> 54
<170> PatentIn version 3.5
<210> 1
<211> 261
<212> DNA
<213> Unknown
<220>
<223> unknown
<400> 1
atggccgaat gcgcgtggtg ccacaacgag tacgacaaaa cgttttctgt gacgtacgga 60
gatcagtcgt acgtctttga ttgcttcgaa tgcgcgatca cgatgctggc gccaacgtgc 120
gctcactgtg gctgtcgcat catcggacac ggcgatgagg tggacgggcg cttctactgc 180
tgcgcttcct gtgtaagggc cgccgaaccg gtggatctgc acgaccgcgc gactgctggc 240
cacgtgggtg gcacacgcta g 261
<210> 2
<211> 86
<212> PRT
<213> Unknown
<220>
<223> unknown
<400> 2
Met Ala Glu Cys Ala Trp Cys His Asn Glu Tyr Asp Lys Thr Phe Ser
1 5 10 15
Val Thr Tyr Gly Asp Gln Ser Tyr Val Phe Asp Cys Phe Glu Cys Ala
20 25 30
Ile Thr Met Leu Ala Pro Thr Cys Ala His Cys Gly Cys Arg Ile Ile
35 40 45
Gly His Gly Asp Glu Val Asp Gly Arg Phe Tyr Cys Cys Ala Ser Cys
50 55 60
Val Arg Ala Ala Glu Pro Val Asp Leu His Asp Arg Ala Thr Ala Gly
65 70 75 80
His Val Gly Gly Thr Arg
85
<210> 3
<211> 57
<212> PRT
<213> Thermosynechococcus vulcanus
<400> 3
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro His Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Asn Asp Ala Ile Met Val Asp Gly Lys Pro Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asn Gly Thr Cys Lys Glu Asn Ser Gly Cys Gly
35 40 45
His Ala Gly Cys Gly Cys Gly Ser Ala
50 55
<210> 4
<211> 54
<212> PRT
<213> Microcystis aeruginosa
<400> 4
Met Ile Ala Val Thr Thr Met Lys Cys Ala Cys Gly Ser Cys Thr Cys
1 5 10 15
Gln Val Ser Ile Ala Asp Ala Ile Lys Lys Asn Asp Gln Tyr Tyr Cys
20 25 30
Cys Gln Ala Cys Ala Asn Gly His Val Lys Glu Lys Gly Cys Gly His
35 40 45
Pro Gly Cys Val Cys Gly
50
<210> 5
<211> 57
<212> PRT
<213> Thermosynechococcus sp.
<400> 5
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro His Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Asp Ala Ile Met Val Asp Gly Lys Pro Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asn Gly Thr Cys Lys Glu Ser Asn Gly Cys Gly
35 40 45
His Ser Gly Cys Gly Cys Gly Ser Ala
50 55
<210> 6
<211> 57
<212> PRT
<213> Methylobacterium radiotolerans
<400> 6
Met Ala Ser Val Asp Val Glu Met Val Lys Cys Ala Cys Gln Asp Cys
1 5 10 15
Val Cys Val Ile Pro Val Ala Lys Ala Val Ser Arg Asp Gly Lys Ala
20 25 30
Tyr Cys Cys Asp Asp Cys Ala Asp Gly His Lys Asp His Ala Gly Cys
35 40 45
Glu His Ala Gly Cys Ala Cys His Gly
50 55
<210> 7
<211> 56
<212> PRT
<213> Microcystis aeruginosa
<400> 7
Met Ile Ala Val Thr Met Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Leu Ile Ala Asp Ala Ile Lys Lys Asn Asp Gln Tyr Tyr Cys
20 25 30
Ser Gln Ala Cys Ala Asn Gly His Val Asn Glu Asn Glu Lys Gly Cys
35 40 45
Gly His Gln Gly Cys Gly Cys Val
50 55
<210> 8
<211> 56
<212> PRT
<213> Microcystis aeruginosa
<400> 8
Met Ile Ala Val Thr Met Met Lys Cys Ala Cys Lys Pro Cys Leu Cys
1 5 10 15
Val Val Ser Ile Ala Asp Ala Ile Lys Glu Asn Asp Lys Tyr Tyr Cys
20 25 30
Ser Gln Ala Cys Ala Asn Gly His Val Asn Glu Asn Glu Lys Gly Cys
35 40 45
Gly His Gln Gly Cys Gly Cys Val
50 55
<210> 9
<211> 56
<212> PRT
<213> Synechococcus
<400> 9
Met Thr Ser Thr Thr Leu Val Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Asn Val Asp Pro Ser Lys Ala Ile Asp Arg Asn Gly Leu Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asp Gly His Thr Gly Gly Ser Lys Gly Cys Gly
35 40 45
His Thr Gly Cys Asn Cys His Gly
50 55
<210> 10
<211> 53
<212> PRT
<213> Mycobacterium tuberculosis
<400> 10
Met Arg Val Ile Arg Met Thr Asn Tyr Glu Ala Gly Thr Leu Leu Thr
1 5 10 15
Cys Ser His Glu Gly Cys Gly Cys Arg Val Arg Ile Glu Val Pro Cys
20 25 30
His Cys Ala Gly Ala Gly Asp Ala Tyr Arg Cys Thr Cys Gly Asp Glu
35 40 45
Leu Ala Pro Val Lys
50
<210> 11
<211> 48
<212> PRT
<213> Mycobacterium caprae
<400> 11
Met Thr Asn Tyr Glu Ala Gly Thr Leu Leu Thr Cys Ser His Glu Gly
1 5 10 15
Cys Gly Cys Arg Val Arg Ile Glu Val Pro Cys His Cys Ala Gly Ala
20 25 30
Gly Asp Ala Tyr Arg Cys Thr Cys Gly Asp Glu Leu Ala Pro Val Lys
35 40 45
<210> 12
<211> 53
<212> PRT
<213> Mycobacterium bovis
<400> 12
Met Arg Val Ile Arg Met Thr Asn Tyr Glu Ala Gly Thr Leu Leu Thr
1 5 10 15
Cys Ser His Glu Gly Cys Gly Cys Arg Val Arg Ile Glu Val Pro Cys
20 25 30
His Cys Ala Gly Ala Gly Asp Ala Tyr Arg Cys Thr Cys Gly Asp Glu
35 40 45
Leu Ala Pro Val Lys
50
<210> 13
<211> 55
<212> PRT
<213> Leptolyngbya sp.
<400> 13
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Asp Ile Ser Lys Ala Ile Gln Lys Asp Gly Gln Tyr Tyr Cys
20 25 30
Ser Glu Gly Cys Ala Ser Gly His Gly Asp Asn Ser Lys Gly Cys Gly
35 40 45
His Thr Gly Cys Asn Cys His
50 55
<210> 14
<211> 54
<212> PRT
<213> filamentous cyanobacterium
<400> 14
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Asp Ile Ser Lys Ala Ile Glu Lys Glu Gly Gln Tyr Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Ser Glu Gly Ser Thr Gly Cys Gly
35 40 45
His Pro Gly Cys Asn Cys
50
<210> 15
<211> 54
<212> PRT
<213> Halomicronema hongdechloris
<400> 15
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Asp Ser Cys Leu Cys
1 5 10 15
Ile Val Asp Thr Ser Lys Ala Val Glu Lys Glu Gly His Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Pro Glu Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Thr Cys His
50
<210> 16
<211> 55
<212> PRT
<213> Geitlerinema sp.
<400> 16
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Asp Thr Asp Lys Ala Val Glu Lys Asp Gly Gln Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Pro Asp Gly Ser Gly Cys Gly His
35 40 45
Gln Gly Cys Thr Cys His Ala
50 55
<210> 17
<211> 55
<212> PRT
<213> Phormidium tenue
<400> 17
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Asp Ser Cys Leu Cys
1 5 10 15
Val Val Asp Thr Ser Gln Ala Val Glu Lys Asp Gly His Tyr Phe Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Pro Glu Gly Ser Ala Gly Cys Gly
35 40 45
His Pro Gly Cys Gly Cys Asn
50 55
<210> 18
<211> 54
<212> PRT
<213> Lyngbya confervoides
<400> 18
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Asp Ser Cys Leu Cys
1 5 10 15
Ile Val Asn Thr Ser Lys Ala Val Glu Lys Glu Gly His Tyr Tyr Cys
20 25 30
Ser Asp Ala Cys Ala Asn Gly His Pro Glu Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Thr Cys His
50
<210> 19
<211> 54
<212> PRT
<213> Kamptonema
<400> 19
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Ser Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Thr Glu Ala Ile Glu Lys Asn Gly Gln Tyr Tyr Cys
20 25 30
Ser Asn Ala Cys Ala Asp Gly His Pro Asn Gly Thr Gly Cys Gly His
35 40 45
Ala Gly Cys Gly Cys His
50
<210> 20
<211> 48
<212> PRT
<213> Spirulina subsalsa
<400> 20
Val Lys Cys Ala Cys Ser Thr Cys Glu Cys Met Val Ser Pro Asp Lys
1 5 10 15
Ala Ile Glu Lys Asp Gly Lys Tyr Tyr Cys Gly Glu Ala Cys Ala Asn
20 25 30
Gly His Thr Asp Gly Ser His Gly Cys Gly His Pro Gly Cys Asn Cys
35 40 45
<210> 21
<211> 54
<212> PRT
<213> Acaryochloris marina
<400> 21
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Thr Ile Ser Asp Ala Ile Gln Lys Gly Gly Gln Tyr Phe Cys
20 25 30
Gly Gln Ala Cys Ala Asp Gly His Pro Ser Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys His
50
<210> 22
<211> 54
<212> PRT
<213> Acaryochloris marina
<400> 22
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Thr Ile Ser Asp Ala Ile Gln Lys Gly Gly Gln Tyr Phe Cys
20 25 30
Gly Gln Ala Cys Ala Asp Gly His Pro Ser Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys His
50
<210> 23
<211> 53
<212> PRT
<213> Tolypothrix campylonemoides
<400> 23
Met Thr Asn Val Thr Gln Leu Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Asp Ala Ile Gln Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Gln Thr Met Gln Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 24
<211> 53
<212> PRT
<213> Tolypothrix campylonemoides
<400> 24
Met Thr Asn Val Thr Gln Leu Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Asp Ala Ile Gln Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Gln Thr Met Gln Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 25
<211> 53
<212> PRT
<213> Calothrix sp.
<400> 25
Met Thr Thr Val Thr Met Met Lys Cys Ala Cys Glu Arg Cys Leu Cys
1 5 10 15
Val Val Ser Thr Ala Asp Ala Ile Glu Lys Glu Gly Lys Tyr Tyr Cys
20 25 30
Ser Gln Ala Cys Ala Asp Gly His Lys Asp Glu Lys Gly Cys Ala His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 26
<211> 52
<212> PRT
<213> Cyanobacterium aponinum
<400> 26
Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys Ile
1 5 10 15
Ile Asp Ile Ser Gln Ala Ile Ser Arg Asp Gly His Tyr Tyr Cys Ser
20 25 30
Thr Ala Cys Ala Glu Gly His Lys Glu Gly Glu Gly Cys Gly His Ser
35 40 45
Gly Cys Gly Cys
50
<210> 27
<211> 53
<212> PRT
<213> Scytonema tolypothrichoides
<400> 27
Met Thr Ser Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Glu Asn Ala Ile Gln Lys Asp Glu Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Lys Thr Met Lys Gly Cys Gly His
35 40 45
Asn Gly Cys Gly Cys
50
<210> 28
<211> 53
<212> PRT
<213> Limnoraphis robusta
<400> 28
Met Thr Ser Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Ser Ala Ile Lys Lys Asp Gly Lys Pro Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Gly His Ser Asn Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Thr Cys
50
<210> 29
<211> 54
<212> PRT
<213> Oscillatoria brevis
<400> 29
Met Thr Thr Val Thr Gln Ile Lys Cys Ala Cys Pro Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Thr Glu Ala Ile Glu Lys Ser Gly Lys Ser Tyr Cys
20 25 30
Ser Ser Ala Cys Ala Asp Gly His Pro Asn Gly Thr Gly Cys Gly His
35 40 45
Thr Gly Cys Glu Cys His
50
<210> 30
<211> 53
<212> PRT
<213> Fischerella
<400> 30
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Ile Glu Asp Ala Ile Gln Lys Asp Asn Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asp Gly His Gln Thr Thr Lys Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 31
<211> 54
<212> PRT
<213> Leptolyngbya ohadii
<400> 31
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Ser Asp Cys Leu Cys
1 5 10 15
Ile Val Asn Leu Asn Asp Ala Ile Met Lys Asp Gly Lys Ala Tyr Cys
20 25 30
Gly Asp Ala Cys Ala Asn Gly His Thr Gly Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys His
50
<210> 32
<211> 53
<212> PRT
<213> Acaryochloris marina
<400> 32
Met Ala Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Thr Ile Ser Asp Ala Ile Gln Lys Gly Gly Gln Tyr Phe Cys
20 25 30
Gly Gln Ala Cys Ala Asp Gly His Pro Ser Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Arg Cys
50
<210> 33
<211> 54
<212> PRT
<213> Phormidesmis priestleyi
<400> 33
Met Thr Ala Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Thr Thr Glu Gly Ala Val Gln Lys Asp Gly Lys Leu Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asp Gly His Pro Asn Gly His Gly Asp Cys Gly
35 40 45
His Lys Gly Cys Thr Cys
50
<210> 34
<211> 54
<212> PRT
<213> Microcoleus vaginatus
<400> 34
Met Thr Thr Ala Thr Gln Thr Lys Cys Ala Cys Pro Ser Cys Ser Cys
1 5 10 15
Val Val Asn Val Ser Glu Ala Ile Glu Lys Asp Gly Lys Thr Tyr Cys
20 25 30
Ser Ser Ala Cys Ala Asp Gly His Pro Asn Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Glu Cys His
50
<210> 35
<211> 54
<212> PRT
<213> Geminocystis herdmanii
<400> 35
Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys Ile Val
1 5 10 15
Asp Ile Ala Ser Ala Ile Gln Lys Asp Asn Gln Tyr Phe Cys Ser Asp
20 25 30
Ala Cys Ala Asn Gly His Lys Glu Gly Thr Thr Gly Cys Ser His Ser
35 40 45
Gly Cys Gly Cys His Gly
50
<210> 36
<211> 54
<212> PRT
<213> Geitlerinema sp.
<400> 36
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Asn Leu Ser Asp Ala Val His Lys Asp Glu Lys Tyr Tyr Cys
20 25 30
Cys Glu Ala Cys Ala Asn Gly His Gln Ser Gly Asp Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys His
50
<210> 37
<211> 53
<212> PRT
<213> Mastigocladopsis repens
<400> 37
Met Thr Ser Val Thr Gln Met Lys Cys Ala Cys Glu Pro Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Glu Asp Ala Ile Gln Lys Asp Asp Lys Tyr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Gln Thr Met Lys Gly Cys Gly His
35 40 45
Asn Gly Cys Gly Cys
50
<210> 38
<211> 52
<212> PRT
<213> Cyanothece sp.
<400> 38
Thr Val Thr Gln Met Lys Cys Ala Cys Ser Ser Cys Val Cys Ile Val
1 5 10 15
Asp Leu Ser Asp Ala Ile Gln Lys Asp Gly Lys Tyr Tyr Cys Ser Asp
20 25 30
Ala Cys Ala Asn Gly His Pro Asp Gly Ala Gly Cys Ser His His Gly
35 40 45
Cys Glu Cys His
50
<210> 39
<211> 53
<212> PRT
<213> Mastigocoleus testarum
<400> 39
Met Ala Asp Val Thr Ser Met Lys Cys Ala Cys Ala Asp Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Lys Asp Ala Ile Ala Lys Asn Gly Gln Tyr Tyr Cys
20 25 30
Ser Glu Val Cys Ala Asn Gly His Val Asp Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Lys Cys
50
<210> 40
<211> 53
<212> PRT
<213> Calothrix sp.
<400> 40
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Ser Ala Val Met Lys Glu Gly Lys Pro Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Ala Asp Gly Lys Gly Cys Gly His
35 40 45
Thr Gly Cys Glu Cys
50
<210> 41
<211> 56
<212> PRT
<213> Pseudanabaena sp.
<400> 41
Met Ala Ser Ala Thr Leu Val Lys Cys Ala Cys Ser Lys Cys Leu Cys
1 5 10 15
Val Ile Asp Pro Ser Asp Ala Ile Glu Ala Asn Gly Lys Tyr Tyr Cys
20 25 30
Cys Lys Ala Cys Ala Ser Gly His Val Asp Gly Thr Asn Asp Ser His
35 40 45
Cys Ser Asp Val Gly Cys Glu Cys
50 55
<210> 42
<211> 58
<212> PRT
<213> Geminocystis sp.
<400> 42
Met Thr Thr Ala Thr Ile Thr Gln Met Lys Cys Ala Cys Pro Ser Cys
1 5 10 15
Leu Cys Ile Val Asp Ile Gly Thr Ala Leu Gln Lys Glu Gly Lys Tyr
20 25 30
Phe Cys Ser Thr Ala Cys Ala Glu Gly His Lys Glu Gly Thr Thr Gly
35 40 45
Cys Ser His Thr Gly Cys Gly Cys Asn Gly
50 55
<210> 43
<211> 53
<212> PRT
<213> Calothrix sp.
<400> 43
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Ser Ala Val Met Lys Glu Gly Lys Pro Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Gln Asp Gly Lys Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys
50
<210> 44
<211> 53
<212> PRT
<213> Rivularia sp.
<400> 44
Met Ala Ala Val Asp Leu Met Lys Cys Ala Cys Asp Lys Cys Leu Cys
1 5 10 15
Ile Val Lys Val Glu Thr Ala Ile Asp Arg Asp Gly Lys His Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Glu Gly His Lys Thr Ile Thr Gly Cys Gly His
35 40 45
Ser Gly Cys Gly Cys
50
<210> 45
<211> 55
<212> PRT
<213> Coleofasciculus chthonoplastes
<400> 45
Met Thr Thr Ala Thr Gln Thr Gln Cys Ala Cys Asp Ser Cys Ala Cys
1 5 10 15
Met Val Ser Thr Asp Ser Ala Val Gln Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Ser Asp Ala Cys Ala Asn Gly His Pro Asn Gly Ala Gly Cys Gly His
35 40 45
Ser Gly Cys Glu Cys His Ala
50 55
<210> 46
<211> 53
<212> PRT
<213> Stanieria cyanosphaera
<400> 46
Met Ser Thr Val Thr Ser Met Lys Cys Ala Cys Asp Arg Cys Leu Cys
1 5 10 15
Val Val Ser Leu Glu Asp Ala Val Lys Lys Asp Gly Lys Tyr Tyr Cys
20 25 30
Cys Glu Ala Cys Ala Asn Gly His Thr Asp Gly Ser Gly Cys Gly His
35 40 45
Gln Gly Cys Gly Cys
50
<210> 47
<211> 54
<212> PRT
<213> Hydrocoleum sp.
<400> 47
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Ile Glu Ser Ala Val Lys Lys Asn Gly Gln Asn Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Asn Asn His Pro Asp Gly Ala Gly Cys Gly His
35 40 45
Glu Gly Cys Glu Cys Asn
50
<210> 48
<211> 53
<212> PRT
<213> Chamaesiphon minutus
<400> 48
Met Ser Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Ile Val Ser Leu Ser Asp Ala Ile Val Lys Asp Gly Lys His Tyr Cys
20 25 30
Gly Asp Ala Cys Ala Asn Gly His Pro Ala Gly Gln Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys
50
<210> 49
<211> 55
<212> PRT
<213> Tolypothrix bouteillei
<400> 49
Met Thr Thr Val Ser Gln Met Lys Cys Ala Cys Lys Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Ser Asp Ala Leu Met Lys Asp Gly Lys Ala Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Thr Asn Gly Glu Cys Cys Gly His
35 40 45
Thr Gly Cys Asp Cys His Ala
50 55
<210> 50
<211> 54
<212> PRT
<213> Hapalosiphonaceae
<400> 50
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Glu Ser Cys Leu Cys
1 5 10 15
Val Val Ser Leu Thr Asp Ala Val Ile Lys Asp Gly Lys Pro Tyr Cys
20 25 30
Gly Glu Ala Cys Ala Asn Gly His Pro Asn Gly Glu Gly Cys Gly His
35 40 45
Gln Gly Cys Gly Cys His
50
<210> 51
<211> 53
<212> PRT
<213> Aliterella atlantica
<400> 51
Met Thr Thr Ala Thr Gln Thr Gln Cys Ala Cys Glu Ser Cys His Cys
1 5 10 15
Pro Val Ser Glu Thr Glu Ala Val Gln Lys Asp Gly Lys Thr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Gln Gly His Pro Asp Gly Lys Gly Cys Gly His
35 40 45
Ala Gly Cys Asp Cys
50
<210> 52
<211> 53
<212> PRT
<213> Aliterella atlantica
<400> 52
Met Thr Thr Ala Thr Gln Thr Gln Cys Ala Cys Glu Ser Cys His Cys
1 5 10 15
Pro Val Ser Glu Thr Glu Ala Val Gln Lys Asp Gly Lys Thr Tyr Cys
20 25 30
Ser Glu Ala Cys Ala Gln Gly His Pro Asp Gly Lys Gly Cys Gly His
35 40 45
Ala Gly Cys Asp Cys
50
<210> 53
<211> 53
<212> PRT
<213> Cyanobacterium stanieri
<400> 53
Met Thr Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys
1 5 10 15
Ile Val Asn Leu Ser Asp Ala Ile Gln Lys Asn Asp His Tyr Tyr Cys
20 25 30
Cys Gln Ala Cys Ala Asp Gly His Pro Asn Gly Ser Gly Cys Gly His
35 40 45
Thr Gly Cys Gly Cys
50
<210> 54
<211> 53
<212> PRT
<213> Xenococcus sp.
<400> 54
Thr Val Thr Gln Met Lys Cys Ala Cys Pro Ser Cys Leu Cys Ile Val
1 5 10 15
Asn Val Ser Asp Ala Ile Ser Lys Glu Gly Lys Tyr Tyr Cys Ser Asp
20 25 30
Ala Cys Ala Lys Gly His Ser Glu Gly Ala Gly Cys Ser His Ala Gly
35 40 45
Cys Gly Cys His Ala
50
Claims (9)
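1. A metallothionein gene, characterized in that its nucleotide sequence is as shown in SEQ ID NO:1.
2. A metallothionein, characterized in that the metallothionein is encoded by the gene of claim 1 and its amino acid sequence is as shown in SEQ ID NO:2.
3. A recombinant vector comprising the metallothionein gene of claim 1.
4. The recombinant vector according to claim 3, characterized in that the vector is an E. coli expression vector.
5. A recombinant strain comprising the metallothionein gene of claim 1.
6. The recombinant strain according to claim 5, characterized in that the strain is E. coli.
7. Use of the metallothionein gene of claim 1, the metallothionein of claim 2, or the recombinant strain of claim 5 or 6 in heavy-metal pollution remediation.
8. The use according to claim 7, characterized in that the use is in the remediation of heavy-metal pollution in wastewater and/or in soil.
9. The use according to claim 8, characterized in that the heavy metal is copper, cobalt, lead and/or cadmium.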
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910289750.8A CN109943571B (zh) | 2019-04-11 | 2019-04-11 | Metallothionein gene MT20, metallothionein encoded thereby, and expression and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910289750.8A CN109943571B (zh) | 2019-04-11 | 2019-04-11 | Metallothionein gene MT20, metallothionein encoded thereby, and expression and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109943571A (zh) | 2019-06-28 |
CN109943571B (zh) | 2022-09-02 |
Family
ID=67014793
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910289750.8A Expired - Fee Related CN109943571B (zh) | 2019-04-11 | 2019-04-11 | Metallothionein gene MT20, metallothionein encoded thereby, and expression and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109943571B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110386671A (zh) * | 2019-07-29 | 2019-10-29 | 中国科学院地理科学与资源研究所 | Method for in-situ remediation of polluted river and lake water enhanced by transgenic plants |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003319787A (ja) * | 2002-02-27 | 2003-11-11 | Mitsubishi Chemicals Corp | Method for absorbing heavy metals |
JP2004000092A (ja) * | 2002-05-31 | 2004-01-08 | Foundation For Nara Institute Of Science & Technology | Polypeptide that specifically binds heavy metals and gene encoding the polypeptide |
CN101048221A (zh) * | 2004-10-19 | 2007-10-03 | MGP生物工艺有限公司 | Compositions and methods for removing heavy metals from contaminated samples using membranes provided with purified metallothionein (MT) |
CN101781653A (zh) * | 2009-07-15 | 2010-07-21 | 山西省农业生物技术研究中心 | Jujube metallothionein gene and its application in treating heavy-metal pollution |
US20110220570A1 (en) * | 2009-08-20 | 2011-09-15 | Ruiz Oscar N | Heavy metal remediation system |
CN108129556A (zh) * | 2017-12-20 | 2018-06-08 | 中国农业科学院生物技术研究所 | Rice-derived cadmium-binding protein, its encoding gene and application |
Non-Patent Citations (3)
Title |
---|
LI, X., et al.: "Uncultured bacterium clone MT20 metallothionein gene, complete cds, MT035823.1", NCBI GenBank * |
XIAOFANG LI, et al.: "Metagenomics-Guided Discovery of Potential Bacterial Metallothionein Genes from the Soil Microbiome That Confer Cu and/or Cd Resistance", Appl Environ Microbiol. * |
LI Tingting, et al.: "Research progress on metallothionein", Journal of Anhui Agricultural Sciences * |
Also Published As
Publication number | Publication date |
---|---|
CN109943571B (zh) | 2022-09-02 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20220902 |