AU2020102633A4 - Oxidation resistant red fluorescent proteins and use thereof - Google Patents
Oxidation resistant red fluorescent proteins and use thereof Download PDFInfo
- Publication number
- AU2020102633A4 AU2020102633A4 AU2020102633A AU2020102633A AU2020102633A4 AU 2020102633 A4 AU2020102633 A4 AU 2020102633A4 AU 2020102633 A AU2020102633 A AU 2020102633A AU 2020102633 A AU2020102633 A AU 2020102633A AU 2020102633 A4 AU2020102633 A4 AU 2020102633A4
- Authority
- AU
- Australia
- Prior art keywords
- protein
- red fluorescent
- proteins
- val
- fusion
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 108010054624 red fluorescent protein Proteins 0.000 title claims abstract description 31
- 230000003647 oxidation Effects 0.000 title claims abstract description 16
- 238000007254 oxidation reaction Methods 0.000 title claims abstract description 16
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 76
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 74
- 235000018102 proteins Nutrition 0.000 claims abstract description 73
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 19
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 19
- 239000000700 radioactive tracer Substances 0.000 claims abstract description 5
- 235000018417 cysteine Nutrition 0.000 claims description 35
- 150000001945 cysteines Chemical class 0.000 claims description 25
- 230000004927 fusion Effects 0.000 claims description 16
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 12
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 10
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 4
- 102000004190 Enzymes Human genes 0.000 claims description 3
- 108090000790 Enzymes Proteins 0.000 claims description 3
- 235000001014 amino acid Nutrition 0.000 claims description 3
- 150000001413 amino acids Chemical class 0.000 claims description 3
- 108010078791 Carrier Proteins Proteins 0.000 claims description 2
- 101710172711 Structural protein Proteins 0.000 claims description 2
- 102000040945 Transcription factor Human genes 0.000 claims description 2
- 108091023040 Transcription factor Proteins 0.000 claims description 2
- 230000000975 bioactive effect Effects 0.000 abstract description 4
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 abstract description 4
- 238000004040 coloring Methods 0.000 abstract description 3
- 108091006047 fluorescent proteins Proteins 0.000 description 25
- 102000034287 fluorescent proteins Human genes 0.000 description 25
- 238000000746 purification Methods 0.000 description 9
- 102200081846 rs7902757 Human genes 0.000 description 7
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 6
- 230000001413 cellular effect Effects 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000002835 absorbance Methods 0.000 description 5
- CVXBEEMKQHEXEN-UHFFFAOYSA-N carbaryl Chemical compound C1=CC=C2C(OC(=O)NC)=CC=CC2=C1 CVXBEEMKQHEXEN-UHFFFAOYSA-N 0.000 description 5
- 229960005286 carbaryl Drugs 0.000 description 5
- 230000005284 excitation Effects 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 235000014393 valine Nutrition 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 150000003680 valines Chemical class 0.000 description 4
- 241000242771 Entacmaea quadricolor Species 0.000 description 3
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 3
- 235000013339 cereals Nutrition 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 238000003018 immunoassay Methods 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- 102100033400 4F2 cell-surface antigen heavy chain Human genes 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 2
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 2
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 2
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 2
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 2
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 2
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 2
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 2
- 101000800023 Homo sapiens 4F2 cell-surface antigen heavy chain Proteins 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 2
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 2
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 2
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 2
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 2
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- LEVWYRKDKASIDU-IMJSIDKUSA-N cystine group Chemical group C([C@@H](C(=O)O)N)SSC[C@@H](C(=O)O)N LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 239000002917 insecticide Substances 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 235000004400 serine Nutrition 0.000 description 2
- 238000012916 structural analysis Methods 0.000 description 2
- 238000012800 visualization Methods 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 1
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101150097410 R3 gene Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 108010003723 Single-Domain Antibodies Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- 241001416177 Vicugna pacos Species 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000010859 live-cell imaging Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 235000006109 methionine Nutrition 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
- C07K14/43595—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from coelenteratae, e.g. medusae
Abstract
We disclosed a red fluorescent protein R3, which is oxidation resistant and
oligomer-formation resistant, constructed by removing its cysteine residues. The R3 protein
5 is highly soluble and displays intense red color and fluorescence. We also disclosed a fusion
protein of the R3 protein joined by a nanobody as an example of coloring a bioactive protein.
The stable red fluorescent protein, as a tracer molecule, will be a valuable tool in location
and quantitation of bioactive proteins.
Description
Oxidation resistant red fluorescent proteins and use thereof
Technical field
The field of this invention is biological technology, and more specifically, chromoproteins and fluorescent proteins.
Background
Proteins are usually colorless, because the twenty natural amino acid residues do not absorb visible light appreciably under normal circumstance. Most of the absorbance comes from aromatic residues, such as tryptophan, histidine, phenylalanine, and tyrosine, and proteins with high number of aromatic residues can show their presence when detected in UV region near 280 nm. Fluorescent proteins are a special class of chromoproteins and fluorescent proteins exist in nature. The chromophore is formed by chemical reactions of amino acid residues, such as Ser65-Tyr66-Gly67 in avGFP, due to their spatial proximity and oxidation. The distinct color and fluorescence of these unique proteins has been widely used in biological applications as tracing signals (Cubitt, A. B., R. Heim, S. R. Adams, A. E. Boyd, L. A. Gross and R. Y. Tsien (1995). "Understanding, improving and using green fluorescent proteins." Trends Biochem Sci 20(11): 448-455.). The gene of a fluorescent protein can be easily fused to another gene to produce a fusion protein, which can be expressed inside cells to trace the function of individual proteins or produced in large quality in pure form as biological reagents to study structure and function of proteins, protein-protein interactions, and a range of other biological properties. Fluorescent proteins have cysteines, and their number and locations are unusually not conserved among proteins from different species. Cysteines are chemically active, and they can be oxidized, or react with another cysteine to form a covalent disulfide bond. This chemical activity can cause at least two problems: inactivation by oxidation and oligomer-formation. Oxidation of cysteine residue will cause structural change, leading to change in structure integrity, light absorbance, and fluorescence emission. Formation of unnatural and unintended disulfide bonds results in oligomers, complicating analysis, protein purification, and protein stability. These oxidation and oligomer-formation problems are particularly troublesome when the fluorescent protein or its fusion protein will be used in multiple cellular compartments, oxidative cellular environments, and biological membranes. (Costantini, L. M., M. Baloban, M. L. Markwardt, M. Rizzo, F. Guo, V. V. Verkhusha and E. L. Snapp (2015). "A palette of fluorescent proteins optimized for diverse cellular environments." Nat Commun 6: 7670.). Replacement of cysteine in fluorescent proteins have been attempted, and successful examples exist. The replacement includes Ser, Ala, Val, Met, among others. However, cysteines are usually buried inside a hydrophobic interior, and outcome of the replacement is not predictable, ranging from decreased brightness with single cysteine mutation to complete loss of fluorescence.
Most fluorescent proteins emit green fluorescence, which is noticeable on the dark background. Genetic mutations around the chromophore have expanded the excitation and emission wavelengths to yellow and red region, to created fluorescent proteins cover entire visible spectral range and beyond. This is significant because dark colors, such as red, are more noticeable on a light background. We intend to produce cysteine-free red fluorescent proteins that have intense color and fluorescence and be oxidation resistant and oligomer-formation resistant, to make it stable and easy to use on a light background.
Summary
The object of the present disclosure is to provide a red fluorescent protein that is oxidation resistant and oligomer-formation resistant, due to its lack of cysteines. Itself or its fusion protein can be used as a tracer for location and quantitation. In the first aspect of the present disclosure, it provides the method of making the red fluorescent protein by replacing the cysteine or cystines with an amino acid residue, selected from 20 natural amino acids minus cysteine. In the second aspect of the present disclosure, it provides the method of making the red fluorescent protein, based on a known red fluorescent protein, its sequence set forth by SEQ ID NO: 1, by replacing the cysteine or cystines with an amino acid residue, preferably from the group of amino acids serine, alanine, valine, and methionine. In the third aspect of the present disclosure, it provides the method of making the red fluorescent protein, based on the known red fluorescent protein, its sequence set forth by SEQ ID NO: 1, by replacing its cysteine-30 and cysteine-176 with two valines. The newly created red fluorescent protein R3 contains: (a) the sequence set forth by SEQ ID NO: 2; or (b) a sequence with identity of 90% or more to SEQ ID NO: 2. The fourth aspect of the present disclosure provides a fusion protein, and said fusion protein comprises: (a) the red fluorescent protein according to the first aspect, the second aspect, or the third aspect of the present disclosure; and (b) a fusion part selected from the group comprising antibody, enzyme, transporter, structural protein, and transcription factor. In another preferred embodiment, said antibody is selected from the group comprising nanobody (single domain antibody), antibody, antibody scFv fragment, antibody Fab fragment, and antibody Fc fragment. In another preferred embodiment, said enzyme is selected from the group comprising protease, phosphatase, and kinase. In another preferred embodiment, said fusion part is a protein that has affinity towards another protein target on cell surface or inside cell. In another embodiment, said fusion protein has the red fluorescent protein and the fusion part linked together by a short linker sequence consisting of glycine, serine, or combinations of the two. In another embodiment, said fusion protein has the red fluorescent protein first and the fusion part second, or in reverse order. In another preferred embodiment, said fusion protein contains multivalent (such as bivalent) the red fluorescent protein and the fusion part. The fifth aspect of the present disclosure provides an expression system to produce the red fluorescent protein or the fusion protein comprising the steps of: (a) constructing an expression system for said proteins under a condition suitable for producing the proteins, thereby obtaining a culture containing said proteins; and (b) isolating or recovering said proteins from said culture. In another preferred embodiment, said expression system includes prokaryote and eukaryote. In another preferred embodiment, said expression is carried out in E coli cells or yeast cells. The sixth aspect of the present disclosure provides uses of the red fluorescent protein according to the first, the second, or the third aspect of the present disclosure; or the fusion protein according to the fourth aspect of the present disclosure. In another preferred embodiment, said uses comprise: (a) coloring or labeling the fusion part for visualization to facilitate its purification, localization, and detection; (b) quantitating proteins in western blot, ELISA assay, and other bioassays; (c) bioimaging cellular proteins under microscope or other devices; (d) incorporating into an assay kit for detection and quantitation; (e) revealing abundance and distribution of a target protein in live animal or human for diagnostics. These and other features, aspects, and advantages of the present invention will become better understood with reference to the following description and claims.
Description of the drawings
Figure 1. This drawing is to illustrate the selection of proper residues to replace cysteines. Sequence alignment between the fluorescent proteins mCherry and mRuby3 showes the homology between the two proteins. The identical residues in the two proteins are shaded, and the two cysteines (C30 and C176) are highlighted by underline and in bold. This alignment provides the rationale for the equivalent locations of C30 in mRuby3 at 144 in mCherry and C176 in mRuby3 at A180 in mCherry. Figure 2. This drawing is to illustrate the locations of two replaced cysteines (C30V, C176V) in the R3 protein in 3-D space, based on the X-ray structure of mCherry (2h5q). Both C30 and C176 are located on the beta-sheets with their sidechains pointing inwards of the beta barrel. The inwards orientation makes its selection of replacement residues more critical to avoid changes in its structural integrity, because inwards orientation, in comparison with outwards one, will have less space to accommodate the replacement residues. This inwards orientation also suggests the vulnerability of inactivation by oxidation of cysteines, emphasizing the need to make it oxidation resistant. Figure 3. This drawing is to disclose the protein sequence of R3 protein, with a his-tag at its C-terminus, used in our expression system. A skilled person can keep the R3 protein core sequence and add additional sequences or tags for its expression. Figure 4. This drawing is to show the actual purification process of the R3 protein on nickel column. Due to its intense red color, protein movement on the column is clearly visible, following the time sequence of 1-4, indicated by arrows. Figure 5. This drawing is to show the size and purity of the purified red fluorescent protein R3 and the purified fusion protein R3C, as analyzed by PAGE analysis. The R3 protein is shown on the far-right lane (lane 2, 27 kDa), and the R3C protein is shown on the far-left lane (lane 1, 40 kDa). Two marker lanes are in the middle with their molecular weight (kDa) indicated. Figure 6. This drawing is to provide the actual color and fluorescence properties of the purified R3 protein. The UV absorbance or the excitation for fluorescence is shown as dotted line. The fluorescence emission is shown as solid line. This R3 protein has intense red color (maximum UV absorbance at 555 nm), and strong fluorescence (excitation maximum at 555 nm and emission maximum at 590 nm). Figure 7. This drawing is to disclose the protein sequence of the R3C protein. In our example, we have the R3 protein at N-terminus, followed by the fusion part (nanobody C), with a his-tag at the C-terminus. The R3 protein and the fusion part is joined with a short linker. A skilled person can change the linker, or the order between the R3 protein core sequence and the fusion part; and add additional sequences or tags for its expression.
Detailed description
The current invention describes the generation and characterization of cysteine-free red fluorescent proteins and their uses in fusion with another protein. Lock of cysteines in the fluorescent protein renders it much more stable due to resistance to oxidation and to oligomer-formation. The intense red color makes it very visible on light background and suitable for visualization with naked eyes. Its fusion with antibodies or other bioactive proteins will find many uses in scientific research and disease diagnostics. The fluorescent protein mRuby3 is a monomeric fluorescent protein reported with relative high extinction coefficient at 128,000 at 528 nm (Bajar, B. T., E. S. Wang, A. J. Lam, B. B. Kim, C. L. Jacobs, E. S. Howe, M. W. Davidson, M. Z. Lin and J. Chu (2016). "Improving brightness and photostability of green and red fluorescent proteins for live cell imaging and FRET reporting." Sci Rep 6: 20889.). It has origin from Entacmaea quadricolor, and no study has been done to replace its two cysteines at C30 and C176. Several X-ray structures of fluorescent proteins have been solved, and they are single domain protein and have a common beta barrel structure with the chromophore sitting inside the barrel. Both cysteines of mRuby3 are in the middle of beta-sheets, facing inward, according to our modeling based on the X-ray structure of mCherry 2h5q. Although most fluorescent proteins have 1-3 cysteines, the cysteine residues are chemically active. The cysteines can be oxidized to impact the structural integrity or react with another cysteine intramolecularly or intermolecularly to form unintended disulfide bonds to complicate analysis and purification. For example, cysteine-containing proteins usually form complex mixtures of proteins in different sizes when expressed in E. coli or during protein in vitro refolding. Oligomer-formation inside cells also complicates analysis (Costantini, L. M., M. Baloban, M. L.
Markwardt, M. Rizzo, F. Guo, V. V. Verkhusha and E. L. Snapp (2015). "A palette of fluorescent proteins optimized for diverse cellular environments." Nat Commun 6: 7670.). To overcome these problems, cysteine-free fluorescent proteins have been generated and showed improvements (Costantini, L. M., M. Baloban, M. L. Markwardt, M. Rizzo, F. Guo, V. V. Verkhusha and E. L. Snapp (2015). "A palette of fluorescent proteins optimized for diverse cellular environments." Nat Commun 6: 7670.). However, no cysteine-free red fluorescent protein mRuby3 or other red fluorescent proteins had been produced and tested. The impact of removal of their two naturally existing cysteines in mRuby3 on its stability and color and fluorescence properties was unknown. Cysteine replacement in other oxidation resistant fluorescent proteins showed that Cys can be replaced with Ser, Ala, and Val. There is no rule for which is better than the other, and each needs to be tested. According to Nagano et al. (Nagano, N., M. Ota and K. Nishikawa (1999). "Strong hydrophobic nature of cysteine residues in proteins." FEBS Lett 458(1): 69-71.), although similar in size, Ser may not be always a good replacement for Cys due to its hydrophilic nature. Based on our own structural analysis, what we decided to do was to replace the two cysteines with two valines, to produce the new cysteine-free fluorescent protein cf-mRuby3 (C30V, C176V) (we call it R3 protein). We constructed a highly efficient E. coli expression system with a his tag at its C-terminus. The R3 protein expresses at high level and easily purified on nickel column. It is highly soluble and has intense red color with absorbance maximum at 555 nm and fluorescence emission at 590 nm. The R3 protein (239 AA, 27 kDa) is small enough to be easily fused to another protein with its intense red color as a visible tracer. To demonstrate the feasibility, we fused the R3 gene with that of a nanobody that has affinity towards insecticide carbaryl (Liu, Z., K. Wang, S. Wu, Z. Wang, G. Ding, X. Hao, Q. X. Li, J. Li, S. J. Gee, B. D. Hammock and T. Xu (2019). "Development of an immunoassay for the detection of carbaryl in cereals based on a camelid variable heavy-chain antibody domain." J Sci Food Agric 99(9): 4383-4390.), to create the fusion protein R3C (364 AA, 40 kDa), as an example. The R3C protein was colored, highly expressed, and easily purified to homogeneity. Omission of the easily oxidizable cysteines in R3 protein, its intense red color and fluorescence, high solubility, and good stability, make it an ideal tracer in many biological applications.
The main advantages of the present invention include
(a) The fluorescent protein is oxidation-resistant, (b) The fluorescent protein is free from oligomer-formation, (c) This fluorescent protein has an intense red color that is easily visible on light background, (d) The protein is fluorescent with excitation maximum at 555 nm and emission maximum at 590 nm.
The present invention is further described in combination with specific embodiments. It should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention.
Example 1: Location of cysteines in mRuby3 for mutation by sequence alignment and structural analysis. mCherry is a fluorescent protein with high sequence homology with mRuby3 (Figure 1), and its X-ray structure has been reported with the PDB access code of 2h5q. mRuby3 has two cysteines, C30 and C176. Many residues before and after these two cysteines are identical between mCherry and mRuby3, so that it is reasonable to assume that C30 in mRuby3 occupies the position of 144 in mCherry and C176 occupies the position of A180 in mCherry. Therefore, both cysteines (C30 and C176) of mRuby3 project towards inside as shown in Figure 2. There are limited spaces to replace both cysteines with two valines with similar hydrophobic properties, resulting our newly designed red fluorescent protein R3 (Figures 2 and 3).
Example 2: Expression and purification of the red fluorescent protein R3. We replaced two cysteines (C30, C176) in mRuby3 with two valines, to create the R3 protein, and the protein sequence is shown in Figure 3. We decided to express the R3 protein in E. coli, so that we designed a synthetic gene with codons optimized for E. coli expression. Additionally, to facilitate its purification, a his-tag was added to its C-terminus. This protein was highly expressed under a T7 promoter in E. coli strain BL21(DE3) and easily purified on nickel affinity column. One desirable feature of this protein is its intense red color, seen even before breaking the cells and clearly seen when purified the protein from mixtures on the column (Figure 4). The movement of the red protein during purification on column was easily visible. The protein is highly soluble with its correct molecular weight (27 kDa), as assessed by comparing with markers on PAGE gel (Figure 5). It has the excitation maximum at 555 nm and emission maximum at 590 nm (Figure 6).
Example 3: Expression and purification of the fusion proteinR3C. The nanobody C is a nanobody reported in the literature with affinity for insecticide Carbaryl. This nanobody was obtained from immunized alpaca and has been used to establish an immunoassay for the detection of carbaryl in cereal for food safety (Liu, Z., K. Wang, S. Wu, Z. Wang, G. Ding, X. Hao, Q. X. Li, J. Li, S. J. Gee, B. D. Hammock and T. Xu (2019). "Development of an immunoassay for the detection of carbaryl in cereals based on a camelid variable heavy-chain antibody domain." J Sci Food Agric 99(9): 4383-4390.). The R3 protein has the intense red color and its fusion with nanobody C will colorize the nanobody and demonstrate the feasibility of coloring a bioactive protein by making the fusion protein. Of cause, beside color detection, fluorescence of the R3 protein can be used for tracing in a similar situation. The fusion was constructed by placing the R3 protein at the N-terminus, and the nanobody C as the fusion part at the C-terminus, with a short linker of GGG in between. The protein sequence of the fusion protein R3C is shown in Figure 7. Similar to the purification of the R3 protein, the R3C protein was purified with the colored protein visible during purification process. The purified protein has the expected molecular weight (40 kDa) when analyzed on PAGE (Figure 5).
Although the present invention has been described in considerable details with reference to certain preferred versions thereof, other versions are possible. Therefore, the spirit and scope of the appended claims should not be limited to the description of the preferred versions contained therein.
SEQUENCE LISTING 07 Oct 2020
<110> Next Medicine Co., Ltd.
<120> Oxidation resistant red fluorescent proteins and use thereof
<130> temp 2020102633
<160> 2
<170> PatentIn version 3.5
<210> 1 <211> 227 <212> PRT <213> Entacmaea quadricolor
<400> 1
Val Ser Lys Gly Glu Glu Leu Ile Lys Glu Asn Met Arg Met Lys Val 1 5 10 15
Val Met Glu Gly Ser Val Asn Gly His Gln Phe Lys Cys Thr Gly Glu 20 25 30
Gly Glu Gly Arg Pro Tyr Glu Gly Val Gln Thr Met Arg Ile Lys Val 35 40 45
Ile Glu Gly Gly Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Thr Ser 50 55 60
Phe Met Tyr Gly Ser Arg Thr Phe Ile Lys Tyr Pro Ala Asp Ile Pro 65 70 75 80
Asp Phe Phe Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Val 85 90 95
Thr Arg Tyr Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Thr Ser
100 105 110 07 Oct 2020
Leu Glu Asp Gly Glu Leu Val Tyr Asn Val Lys Val Arg Gly Val Asn 115 120 125
Phe Pro Ser Asn Gly Pro Val Met Gln Lys Lys Thr Lys Gly Trp Glu 2020102633
130 135 140
Pro Asn Thr Glu Met Met Tyr Pro Ala Asp Gly Gly Leu Arg Gly Tyr 145 150 155 160
Thr Asp Ile Ala Leu Lys Val Asp Gly Gly Gly His Leu His Cys Asn 165 170 175
Phe Val Thr Thr Tyr Arg Ser Lys Lys Thr Val Gly Asn Ile Lys Met 180 185 190
Pro Gly Val His Ala Val Asp His Arg Leu Glu Arg Ile Glu Glu Ser 195 200 205
Asp Asn Glu Thr Tyr Val Val Gln Arg Glu Val Ala Val Ala Lys Tyr 210 215 220
Ser Asn Leu 225
<210> 2 <211> 227 <212> PRT <213> Entacmaea quadricolor
<400> 2
Val Ser Lys Gly Glu Glu Leu Ile Lys Glu Asn Met Arg Met Lys Val 1 5 10 15
Val Met Glu Gly Ser Val Asn Gly His Gln Phe Lys Val Thr Gly Glu 20 25 30
Gly Glu Gly Arg Pro Tyr Glu Gly Val Gln Thr Met Arg Ile Lys Val 35 40 45 2020102633
Ile Glu Gly Gly Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Thr Ser 50 55 60
Phe Met Tyr Gly Ser Arg Thr Phe Ile Lys Tyr Pro Ala Asp Ile Pro 65 70 75 80
Asp Phe Phe Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Val 85 90 95
Thr Arg Tyr Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Thr Ser 100 105 110
Leu Glu Asp Gly Glu Leu Val Tyr Asn Val Lys Val Arg Gly Val Asn 115 120 125
Phe Pro Ser Asn Gly Pro Val Met Gln Lys Lys Thr Lys Gly Trp Glu 130 135 140
Pro Asn Thr Glu Met Met Tyr Pro Ala Asp Gly Gly Leu Arg Gly Tyr 145 150 155 160
Thr Asp Ile Ala Leu Lys Val Asp Gly Gly Gly His Leu His Val Asn 165 170 175
Phe Val Thr Thr Tyr Arg Ser Lys Lys Thr Val Gly Asn Ile Lys Met 180 185 190
Pro Gly Val His Ala Val Asp His Arg Leu Glu Arg Ile Glu Glu Ser 195 200 205
Asp Asn Glu Thr Tyr Val Val Gln Arg Glu Val Ala Val Ala Lys Tyr 210 215 220 2020102633
Ser Asn Leu
Claims (6)
1. A red fluorescent protein, wherein said protein has its cysteine or cysteines replaced by other natural amino acids. The absence of cysteines in the protein makes it oxidation resistant and oligomer-formation resistant.
2. A red fluorescent protein, wherein said protein contains a protein sequence of SEQ ID NO: 1 and its cysteines C30 and C176 replaced individually or in combination by the preferred residues Ser, Ala, Val, and Met.
3. A red fluorescent protein, wherein said protein contains the sequence of SEQ ID NO: 2, or said protein has sequence identity of 90% or more to SEQ ID NO: 2.
4. A fusion protein, wherein said fusion protein contains: (a) the red florescent protein according to claim 1, claim 2, or claim 3; and (b) a fusion part comprising antibody, enzyme, transporter, structural protein, and transcription factor.
5. A method for producing the red fluorescent protein according to claim 1, claim 2, or claim 3; or the fusion protein according to claim 4; comprising steps of: (a) constructing their expression system under a condition suitable for producing said proteins, thereby obtaining a culture containing said proteins; and (b) isolating or recovering said proteins from said culture.
6. Use of the red fluorescent protein according to claims, claim 2, or claim 3; or the fusion protein according to claim 4; as a tracer for location and quantitation.
Figures 2020102633
Figure 1
Figure 2
Figure 3
Figure 5 Figure 4
Figure 7 Figure 6
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2020102633A AU2020102633A4 (en) | 2020-10-07 | 2020-10-07 | Oxidation resistant red fluorescent proteins and use thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2020102633A AU2020102633A4 (en) | 2020-10-07 | 2020-10-07 | Oxidation resistant red fluorescent proteins and use thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2020102633A4 true AU2020102633A4 (en) | 2020-11-26 |
Family
ID=73458038
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2020102633A Ceased AU2020102633A4 (en) | 2020-10-07 | 2020-10-07 | Oxidation resistant red fluorescent proteins and use thereof |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU2020102633A4 (en) |
-
2020
- 2020-10-07 AU AU2020102633A patent/AU2020102633A4/en not_active Ceased
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6780599B2 (en) | Methods of detecting interactions between proteins, peptides or libraries thereof using fusion proteins | |
US20050208624A1 (en) | Fluorescent protein | |
EP1994149B1 (en) | Novel fluorescent proteins and methods for using same | |
US20060068451A1 (en) | Dimeric fluorescent polypeptides | |
Peelle et al. | Characterization and use of green fluorescent proteins from Renilla mulleri and Ptilosarcus guernyi for the human cell display of functional peptides | |
US20070224647A2 (en) | Monitor protein for measuring processing of protein | |
US7678893B2 (en) | Fluorescent proteins from Copepoda species and methods for using same | |
AU2020102633A4 (en) | Oxidation resistant red fluorescent proteins and use thereof | |
US9279021B2 (en) | Method for increasing the expression and/or stability of a protein in a cell and a peptide for use in such method | |
US7833749B2 (en) | Isolated photoprotein mtClytin, and use thereof | |
US7544484B2 (en) | Nucleic acids encoding membrane-binding proteins and methods of using same | |
US8563703B2 (en) | Fluorescent proteins and methods for using same | |
Suzuki et al. | Multipurpose indicator in cells using intramolecular FRET between GFP and chemical compounds | |
WO2008041107A2 (en) | Fluorescent proteins from the ctenophora phylum and methods of use thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGI | Letters patent sealed or granted (innovation patent) | ||
MK22 | Patent ceased section 143a(d), or expired - non payment of renewal fee or expiry |