WO2020023488A1 - Single molecule sequencing identification of post-translational modifications on proteins - Google Patents
Single molecule sequencing identification of post-translational modifications on proteins Download PDFInfo
- Publication number
- WO2020023488A1 WO2020023488A1 PCT/US2019/042998 US2019042998W WO2020023488A1 WO 2020023488 A1 WO2020023488 A1 WO 2020023488A1 US 2019042998 W US2019042998 W US 2019042998W WO 2020023488 A1 WO2020023488 A1 WO 2020023488A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- peptide
- protein
- amino acid
- post
- translational modification
- Prior art date
Links
- 0 CC(Cc(cc1)ccc1O)C(*)=O Chemical compound CC(Cc(cc1)ccc1O)C(*)=O 0.000 description 3
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
- G01N33/6818—Sequencing of polypeptides
- G01N33/6824—Sequencing of polypeptides involving N-terminal degradation, e.g. Edman degradation
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
- G01N33/6818—Sequencing of polypeptides
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/58—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances
- G01N33/582—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances with fluorescent label
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2440/00—Post-translational modifications [PTMs] in chemical analysis of biological material
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2440/00—Post-translational modifications [PTMs] in chemical analysis of biological material
- G01N2440/12—Post-translational modifications [PTMs] in chemical analysis of biological material alkylation, e.g. methylation, (iso-)prenylation, farnesylation
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2440/00—Post-translational modifications [PTMs] in chemical analysis of biological material
- G01N2440/14—Post-translational modifications [PTMs] in chemical analysis of biological material phosphorylation
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2440/00—Post-translational modifications [PTMs] in chemical analysis of biological material
- G01N2440/18—Post-translational modifications [PTMs] in chemical analysis of biological material citrullination
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2440/00—Post-translational modifications [PTMs] in chemical analysis of biological material
- G01N2440/26—Post-translational modifications [PTMs] in chemical analysis of biological material nitrosylation
Definitions
- Post-translational modifications (PTMs) of proteins are covalent attachments of chemical moieties on the side chains of select amino acids or the N and C terminus of a peptide or a protein.
- the activity and functions of many proteins are modulated by the nature of their PTMs.
- Some non-limiting examples of PTMs include phosphorylation, glycosylation, alkylation, acylation, hydroxylation, or the attachment of a cofactor or nucleotide.
- PTMs include phosphorylation, glycosylation, alkylation, acylation, hydroxylation, or the attachment of a cofactor or nucleotide.
- One such example is the C-terminal domain of the Epidermal growth factor receptor (EGFR) family of proteins that contains approximately 20 tyrosine residues capable of being phosphorylated.
- EGFR Epidermal growth factor receptor
- the downstream processes can range from cell proliferation, differentiation, anti-apoptosis (survival), adhesion, migration, and angiogenesis (Huang et al. , 2011). Understanding and mapping these sites is thus critical not only to better understand cell signaling pathways, but also develop the current therapeutic drugs.
- mapping post- translational modifications have been intrinsically challenging due to their low abundance and sample heterogeneity.
- the current methods do not allow for precise determination of the specific location of PTMs while also allowing for quantitative determination of the PTMs. Therefore, there remains an unmet need to identify methods which allow from improved detection of PTMs in a protein or peptide.
- the present disclosure provides methods and systems for protein or peptide sequencing and/or protein or peptide identification. Methods and systems of the present disclosure may be used to sequence a protein or peptide for the determination of a post- translational modification(s) and the location(s) of such post-translational modification(s).
- the present disclosure provides methods of identifying a post translational modification on an amino acid residue of a peptide or protein, the method comprising:
- the post translational modification on the amino acid residue is phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation.
- the post translational modification on the amino acid residue is phosphorylation on tyrosine, serine, or threonine.
- the post translational modification on the amino acid residue is phosphorylation on a serine.
- the post translational modification on the amino acid residue is phosphorylation on a threonine.
- the post translational modification on the amino acid residue is an /V-glycosylation.
- the post translational modification on the amino acid residue is glycosylation of asparagine or arginine. In other embodiments, the post translational modification on the amino acid residue is an O- glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of serine, threonine, or tyrosine. In other embodiments, the post translational modification on the amino acid residue is trimethylation. In some embodiments, the post translational modification on the amino acid residue is trimethylation of lysine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine or tyrosine.
- the post translation modification on the amino acid residue is nitrosylation of a cysteine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation of a tyrosine. In other embodiments, the post translation modification on the amino acid residue is citrullination. In other embodiments, the post translation modification on the amino acid residue is sulfenylation. In some embodiments, the post translational modification on the amino acid residue is sulfenylation of a cysteine.
- the post translation modification is on an amino acid residue of a protein. In other embodiments, the post translation modification is on an amino acid residue of a peptide.
- the labeling reagent comprises a thiol group. In some embodiments, the labeling reagent comprises two thiol groups. In some embodiments, the labeling reagent comprises an amine reactive group such as a succinimidyl ester. In some embodiments, the labeling reagent comprises a glyoxal group. In some embodiments, the labeling reagent comprises a l,3-cycloalkanedione group such as a 1,3- hexanedione.
- the labeling reagent is a fluorophore, oligonucleotide, or peptide-nucleic acid.
- the labeling reagent is a fluorophore.
- the labeling reagent is a thiol containing fluorophore.
- the fluorophore is a xanthene dye such as a rhodamine dye.
- the methods involve treating the peptide or protein with the labeling reagent comprises:
- the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with a base.
- the base is a rare earth metal hydroxide such as Ba(OH)2.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with an activating agent and a base.
- the activating agent is a carbodiimide such as l-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC).
- the base is a heteroaromatic base such as an imidazole.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with silver oxide (Ag20). In some embodiments, the peptide or protein comprising a trimethyl post translational modification is treated with silver oxide in the presence of heat. In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with a base. In some embodiments, the base is a nitrogenous base such as diisopropylethylamine or trimethylamine.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a glycosylation post translational modification with an oxidizing agent.
- the oxidizing agent is a hypervalent iodide reagent such as sodium periodate.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with a reducing agent.
- the reducing agent is disulfide reducing agent such as dithiothreitol.
- the reducing agent further comprises heme.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with phosphine.
- the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted a triarylphosphine.
- the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a phosphine. In some embodiments, the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the phosphine is covalently linked to the labeling reagent.
- the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a glyoxal group.
- the glyoxal group is covalently linked to the labeling reagent.
- the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a l,3-cycloalkanedione such as a l,3-cyclohexanedione.
- the l,3-cycloalkanedione is covalently bonded to the labeling reagent.
- the reactive group on the reactive peptide or protein is a double bond.
- the reactive peptide or protein is treated with the labeling reagent comprising a thiolene-click reaction to form a labeled peptide or protein.
- the reactive peptide or protein is treated with the labeling reagent with a double bond in the presence of an olefin metathesis reagent to form a labeled peptide or protein.
- the reactive peptide or protein is treated with the labeling reagent comprising a cycloaddition reaction to form a labeled peptide or protein.
- the reactive group on the reactive peptide or protein is an aldehyde.
- the labeling reagent is treated with the reactive group on the reactive peptide or protein comprising nucleophilic addition, nucleophilic substitution, or radical addition.
- the labeling reagent forms a thioether when treated with the reactive group on the reactive peptide or protein.
- the labeling reagent forms a dithiane.
- the reactive peptide or protein is treated with the labeling reagent to form an amide bond. In some embodiments, the amide bond formation provides the labeled peptide or protein.
- the reactive peptide or protein is treated with the labeling reagent to form a disulfide bond. In some embodiments, the disulfide bond formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a heterocycloalkane. In some embodiments, the heterocycloalkyl group formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a thioether bond. In some embodiments, the thioether bond formation provides the labeled peptide of protein. [0018] In some embodiments, the sequencing comprises a fluorosequencing method.
- the sequencing is at a single molecular level.
- the fluorosequencing method comprises labeling at least one amino acid of the peptide or protein which does not contain a post translational modification with a second labeling reagent.
- the fluorosequencing method comprises labeling one, two, three, four, or five distinct amino acids of the peptide or protein which do not contain a post translation modification.
- each amino acid is labeled with a distinct second labeling reagent.
- the peptide or protein is bound to a solid support such as a surface.
- the solid support is a resin, a bead, or a modified glass surface.
- the solid support is the modified glass surface such as an aminosilicate surface.
- the fluorosequencing method further comprises removing at least one amino acid residue of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing two or more consecutive amino acid residues of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the fluorosequencing method comprises sequentially removing from 1 to 20 amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the amino acid residues are removed by Edman degradation. In some embodiments, the amino acid residue is removed by treating the /V- terminal amino acid residue with a thiourea and an acid, microwave irradiation, or heat. In some embodiments, the amino acid residues are removed by an enzyme.
- the peptide or protein is digested by a protease. In some embodiments, the peptide or protein is digested by a protease before labeling the amino acid comprising the post translational modification. In some embodiments, the peptide or protein is obtained from a biological sample. In some embodiments, the biological sample is a cell-free biological sample. In some embodiments, the biological sample is derived from blood. In other embodiments, the biological sample is derived from urine. In other embodiments, the biological sample is derived from mucous. In other embodiments, the biological sample is derived from saliva.
- a covalent bond between the post translational modification on the amino acid residue of the peptide or protein and the labeling reagent is formed.
- the labeling reagent or derivative thereof is directly covalently bonded to the amino acid residue.
- the labeling reagent or derivative thereof is covalently coupled to the amino acid residue through an intermediary molecule.
- the present disclosure provides methods of determining the status of a disease or disorder in a subject, the method comprising:
- the methods further comprise obtaining a biological sample from the subject.
- determining the status of a disease or disorder is determining the prognosis of the patient that has the disease.
- determining the status of a disease or disorder is diagnosing the patient with the disease.
- determining the status of a disease or disorder is determining if the patient is at risk of having the disease.
- the change in post translation modification of a protein or peptide is a change in the phosphorylation of the protein. In other embodiments, the change in post translation modification of a protein or peptide is a change in the trimethylation of the protein. In other embodiments, the change in post translation modification of a protein or peptide is a change in the glycosylation of the protein. In other embodiments, the change in post translation modification of a protein or peptide is a change in the nitrosylation of the protein. In some embodiments, the change in post translation modification of a protein or peptide is a change in the citrullination of the protein. In some embodiments, the change in post translation modification of a protein or peptide is a change in the sulfenylation of the protein.
- the biological sample is a cell-free biological sample such as saliva, mucous, urine, serum, plasma, or whole blood.
- the method conveys the presence of one or more post translational modifications. In some embodiments, the method conveys the presence of two or more post translation modifications. In some embodiments, the method conveys the absence of one or more post translational modifications. In some embodiments, the method conveys the absence of one or more post translational modifications and the presence of one or more post translational modifications.
- the method conveys the type of the post translational modification in the protein. In some embodiments, the method conveys the identity of the post translational modification in the protein. In some embodiments, the method conveys the quantity of the post translational modification in the protein. In some embodiments, the method conveys the position of the post translational modification in the protein. In some embodiments, the subject is a mammal such as a human.
- the method further comprises enriching the protein before determining the type, identity, quantity, or position of the post translational modifications.
- the protein is enriched by purification of the biological sample.
- the protein is subjected to degradation before determining the types or identities of the post translational modifications.
- the protein is degraded by a protease.
- the protein is immobilized on a solid support.
- the solid support is a surface.
- the solid support is a resin, a bead, or a modified glass surface.
- the solid support is the modified glass surface such as an aminosilicate surface.
- the method comprises determining the type, identity, quantity, or position of post translational modification on two or more peptides or proteins.
- the present disclosure provides methods for determining the status of a disease or disorder in a subject, the method comprising: detecting a change in a type, identity, quantity, or position of the post translational modifications on the protein or peptide using the methods described herein related to the disease or disorder.
- the methods further comprise obtaining a biological sample from the subject.
- the present disclosure provides modified peptides or proteins comprising a peptide or protein comprising one or more post translational modifications, wherein at least one post translational modification of said peptide or protein comprising one or more post translational modifications is altered with at least a first labeling moiety, thereby forming a labeled peptide or protein comprising one or more post translational modifications.
- the at least the first labeling moiety is a fluorophore.
- the peptide or protein comprises a second labeling moiety attached to one or more amino acid residues of the peptide or protein.
- the second labeling moiety is a fluorophore.
- said at least one post translational modification is selected from the group consisting of phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, trimethylation, or any combination thereof.
- each post translational modification selected from the group consisting of phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation is altered by a distinct labeling moiety.
- the modified peptide or protein comprises from 3 amino acid residues to about 250 amino acid residues. In some embodiments, the modified peptide or protein comprises from 5 amino acid residues to about 100 amino acid residues. In some embodiments, the modified peptide or protein comprises from about 7 amino acid residues to about 50 amino acid residues.
- the first labeling reagent replaces the post translational modification on the amino acid residue.
- the post translation modification is on an amino acid residue of a protein.
- the post translation modification is on an amino acid residue of a peptide.
- the first labeling reagent comprises a thiol group.
- the first labeling reagent comprises two thiol groups.
- the first labeling reagent comprises an amine reactive group such as a succinimidyl ester.
- the first labeling reagent comprises a glyoxal group.
- the first labeling reagent comprises a l,3-cycloalkanedione group such as a l,3-hexanedione.
- the first or second labeling reagent are a fluorophore, oligonucleotide, or peptide-nucleic acid.
- the one of the first or second labeling reagent is a fluorophore.
- the labeling reagent is a thiol containing fluorophore.
- the fluorophore is a xanthene dye such as a rhodamine dye.
- the second labeling moiety is attached to a different type of amino acid of the peptide or protein than the first labeling moiety.
- the methods further comprise one or more additional labeling moieties attached to one or more distinct amino acids of the peptide or protein.
- the peptide or protein is immobilized adjacent to a solid support.
- the solid support is a surface.
- the solid support is a resin, a bead, or a modified glass surface.
- the solid support is a modified glass surface such as an aminosilicate surface.
- the peptide or protein has been degraded by a protease.
- the post translation modification is phosphorylation of the peptide or protein.
- the post translation modification is trimethylation of the peptide or protein.
- the post translation modification is glycosylation of the peptide or protein.
- the post translation modification is nitrosylation of the peptide or protein.
- the post translation modification is citrullination of the peptide or protein.
- the post translation modification is sulfenylation of the peptide or protein.
- the post translational modification on the amino acid residue is phosphorylation on tyrosine, serine, or threonine. In some embodiments, the post translational modification on the amino acid residue is phosphorylation on a serine. In other embodiments, the post translational modification on the amino acid residue is phosphorylation on a threonine. In other embodiments, the post translational modification on the amino acid residue is an /V-glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of asparagine or arginine. In other embodiments, the post translational modification on the amino acid residue is an O- glycosylation.
- the post translational modification on the amino acid residue is glycosylation of serine, threonine, or tyrosine. In other embodiments, the post translational modification on the amino acid residue is trimethylation. In some embodiments, the post translational modification on the amino acid residue is trimethylation of lysine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine or tyrosine. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation of a tyrosine.
- the post translation modification on the amino acid residue is citrullination. In other embodiments, the post translation modification on the amino acid residue is sulfenylation. In some embodiments, the post translational modification on the amino acid residue is sulfenylation of a cysteine.
- the present disclosure provides methods of sequencing a peptide or protein comprising:
- the post translational modification on the amino acid residue is phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation.
- the post translational modification on the amino acid residue is phosphorylation on tyrosine, serine, or threonine.
- the post translational modification on the amino acid residue is phosphorylation on a serine.
- the post translational modification on the amino acid residue is phosphorylation on a threonine.
- the post translational modification on the amino acid residue is an /V-glycosylation.
- the post translational modification on the amino acid residue is glycosylation of asparagine or arginine. In other embodiments, the post translational modification on the amino acid residue is an O- glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of serine, threonine, or tyrosine. In other embodiments, the post translational modification on the amino acid residue is trimethylation. In some embodiments, the post translational modification on the amino acid residue is trimethylation of lysine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine or tyrosine.
- the post translation modification on the amino acid residue is nitrosylation of a cysteine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation of a tyrosine. In other embodiments, the post translation modification on the amino acid residue is citrullination. In other embodiments, the post translation modification on the amino acid residue is sulfenylation. In some embodiments, the post translational modification on the amino acid residue is sulfenylation of a cysteine.
- the labeling reagent replaces the post translational modification on the amino acid residue.
- the post translation modification is on an amino acid residue of a protein.
- the post translation modification is on an amino acid residue of a peptide.
- the labeling reagent comprises a thiol group.
- the labeling reagent comprises two thiol groups.
- the labeling reagent comprises an amine reactive group such as a succinimidyl ester.
- the labeling reagent comprises a glyoxal group.
- the labeling reagent comprises a 1,3- cycloalkanedione group such as a l,3-hexanedione.
- the labeling reagent is a fluorophore, oligonucleotide, or peptide-nucleic acid. In some embodiments, the labeling reagent is a fluorophore. In some embodiments, the labeling reagent is a thiol containing fluorophore. In some embodiments, the fluorophore is a xanthene dye such as a rhodamine dye.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with a base.
- the base is a rare earth metal hydroxide such as Ba(OH)2.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with an activating agent and a base.
- the activating agent is a carbodiimide such as l-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC).
- the base is a heteroaromatic base such as an imidazole.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with silver oxide (Ag20). In some embodiments, the peptide or protein comprising a trimethyl post translational modification is treated with silver oxide in the presence of heat. In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with a base. In some embodiments, the base is a nitrogenous base such as diisopropylethylamine or trimethylamine.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a glycosylation post translational modification with an oxidizing agent.
- the oxidizing agent is a hypervalent iodide reagent such as sodium periodate.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with a reducing agent.
- the reducing agent is disulfide reducing agent such as dithiothreitol.
- the reducing agent further comprises heme.
- the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with phosphine.
- the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted a triarylphosphine.
- the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a phosphine. In some embodiments, the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the phosphine is covalently linked to the labeling reagent.
- the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a glyoxal group.
- the glyoxal group is covalently linked to the labeling reagent.
- the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a l,3-cycloalkanedione such as a l,3-cyclohexanedione.
- the l,3-cycloalkanedione is covalently bonded to the labeling reagent.
- the reactive group on the reactive peptide or protein is a double bond.
- the reactive peptide or protein is treated with the labeling reagent comprising a thiolene-click reaction to form a labeled peptide or protein.
- the reactive peptide or protein is treated with the labeling reagent with a double bond in the presence of an olefin metathesis reagent to form a labeled peptide or protein.
- the reactive peptide or protein is treated with the labeling reagent comprising a cycloaddition reaction to form a labeled peptide or protein.
- the reactive group on the reactive peptide or protein is an aldehyde.
- the labeling reagent is treated with the reactive group on the reactive peptide or protein comprising nucleophilic addition, nucleophilic substitution, or radical addition.
- the labeling reagent forms a thioether when treated with the reactive group on the reactive peptide or protein.
- the labeling reagent forms a dithiane.
- the reactive peptide or protein is treated with the labeling reagent to form an amide bond. In some embodiments, the amide bond formation provides the labeled peptide or protein.
- the reactive peptide or protein is treated with the labeling reagent to form a disulfide bond. In some embodiments, the disulfide bond formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a heterocycloalkane. In some embodiments, the heterocycloalkyl group formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a thioether bond. In some embodiments, the thioether bond formation provides the labeled peptide of protein.
- the sequencing comprises a fluorosequencing method. In some embodiments, the sequencing is at a single molecular level. In some embodiments, the fluorosequencing method comprises labeling at least one amino acid of the peptide or protein which does not contain a post translational modification with a second labeling reagent. In some embodiments, the fluorosequencing method comprises labeling one, two, three, four, or five distinct amino acids of the peptide or protein which do not contain a post translation modification. In some embodiments, each amino acid is labeled with a distinct second labeling reagent.
- the peptide or protein is bound to a solid support such as a surface.
- the solid support is a resin, a bead, or a modified glass surface.
- the solid support is the modified glass surface such as an aminosilicate surface.
- the fluorosequencing method further comprises removing at least one amino acid residue of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing two or more consecutive amino acid residues of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the fluorosequencing method comprises sequentially removing from 1 to 20 amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the amino acid residues are removed by Edman degradation.
- the amino acid residue is removed by treating the /V- terminal amino acid residue with a thiourea and an acid, microwave irradiation, or heat. In some embodiments, the amino acid residues are removed by an enzyme. [0056] In some embodiments, the peptide or protein is digested by a protease. In some embodiments, the peptide or protein is digested by a protease before labeling the amino acid comprising the post translational modification.
- the present disclosure provides methods for polypeptide sequence identification, comprising:
- said first polypeptide is a protein.
- the present disclosure provides methods for processing or analyzing a protein or peptide containing or suspected of containing at least one post-translational modification, comprising: (A) sequencing said protein or peptide, and
- said sequencing comprises subjecting said protein or peptide to degradation conditions to sequentially remove amino acid sub-units from said protein or peptide, and detecting at least a subset of said amino acid sub-units. In some embodiments, less than all amino acid sub-units of said peptide or protein are labeled, and wherein said sequencing comprises detecting a subset of said amino acid sub-units.
- said at least one post-translational modification is identified during said sequencing. In some embodiments, said at least one post-translational modification is identified prior to said sequencing.
- said protein or peptide is obtained from a sample and processed to label said at least one post-translational modification.
- said sample is a cell-free sample.
- said sequencing comprises labeling said at least one post-translational modification of said protein or peptide with a label, and detecting said label to thereby identify said at least one post- translational modification on said protein or peptide.
- the present disclosure provides methods for processing or analyzing a protein or peptide, comprising subjecting said protein or peptide to conditions sufficient to specifically label different post-translational modifications of said protein or peptide, and detecting labels corresponding to said different post-translational modifications of said protein or peptide to thereby detect said different post-translational modifications of said protein or peptide.
- said different post-translational modifications comprise phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation.
- “essentially free,” in terms of a specified component may refer to a specified component being absent from a composition or the component is present as a contaminant or in trace amounts. The total amount of the specified component resulting from any unintended contamination of a composition can be below 0.1%. In some embodiments, a composition in which no amount of the specified component can be detected with standard analytical methods.
- “a” or“an” may refer to one or more.
- the words“a” or“an” when used in conjunction with the word“comprising”, the words“a” or“an” may refer to one or more than one.
- “another” or“a further” may refer to at least a second or more.
- the term“about” is used to indicate that a value includes the inherent variation of error for the device, the method being employed to determine the value, or the variation that exists among the study subjects. In some embodiments, the term“about” refers to ⁇ 5% of the listed value.
- FIG. 1 Correct identification of phosphoserine residues on synthetic CTD heptad peptide by fluorosequencing.
- (Top) Phosphoserine is present at the 2 nd position.
- (Bottom) Phosphoserine is present at the 5 th position.
- Representative raw imaging data are shown for two individual peptide molecules from each experiment. For each individual molecule, the images are organized as a horizontal strip of consecutive TIRF micrographs (each corresponding to a square of 3 x 3 microns) centered on the peptide molecule. Each image represents one successive observation of emitted fluorescent light from that molecule after a round of Edman chemistry.
- a sharp reduction in fluorescence follows the Edman cycle in which the ammo acid with the attached fluorescent dye was removed, thus revealing the amino acid sequence position of the phosphorylated residue m the original peptide.
- the heatmap denotes the frequency histogram, tallying the counts of individual peptide molecules having lost fluorescence after every Edman degradation cycle over the background counts.
- the phosphorylated serine residue in the 2 nd position (top) and 5 th position (bottom) have significantly higher counts of fluorescent loss at the 2 nd and 5 th position, respectively, when analyzed by the fluorosequencing method.
- FIG. 2 shows fluorosequencing position counts between two biological samples. Proteins from two different HEK-293T samples were digested, labeled, and sequenced on the fluorosequencing platform. Read counts were observed to be highly correlated between these biological replicates (Pearson coefficient 0.9582). Data is counts and plotted on a loglO scale DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS
- the present disclosure provides methods of typing, identifying, quantifying, or locating a post translational modification (PTM) in a peptide or protein. These methods may be used to determine the type, location, quantity, or position of a PTM such as phosphorylation, glycosylation, or alkylation in a peptide or protein. These methods may be used in conjunction with a fluorosequencing method such as those which include labeling of the post translational modification with a labeling moiety such as a fluorophore. These methods may further include the removal of one or more amino acid residues from the peptide or protein. In some aspects, these methods may be used to determine the progression or status of a disease or disorder in a patient.
- PTM post translational modification
- Fluorosequencing has been found to provide single molecule resolution for the sequencing of proteins of interest (Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461,034; U.S. Patent Application Serial No. 15/510,962).
- fluorosequencing is introduction of a fluorophore or other label into specific amino acid residues of the peptide sequence. This can involve the introduction of one or more amino acid residues with a unique labeling moiety.
- one, two, three, four, five, or more different amino acids residues are labeled with a labeling moiety.
- the labeling moiety that may be used include fluorophores, chromophores, or a quencher.
- Each of these amino acid residues may include cysteine, lysine, glutamic acid, aspartic acid, tryptophan, tyrosine, serine, threonine, arginine, histidine, methionine, asparagine, and glutamine.
- Each of these amino acid residues may be labeled with a different labeling moiety.
- multiple amino acid residues may be labeled with the same labeling moiety such as aspartic acid and glutamic acid or asparagine and glutamine. While this technique may be used with labeling moieties such as those described above, it is also contemplated that other labeling moiety may be used in fluorosequencing-like methods such as synthetic oligonucleotides or peptide-nucleic acid may be used. In particular, the labeling moiety used in the instant applications may be suitable to withstand the conditions of removing one or more of the amino acid residues.
- labeling moieties that may be used in the instant methods include those which emit a fluorescence signal in the red to infrared spectra such as an Alexa Fluor® dye, an Atto dye, a rhodamine dye, or other similar dyes. Examples of each of these dyes which were capable of withstanding the conditions of removing the amino acid residues include Alexa Fluor® 405, Rhodamine B, tetramethyl rhodamine, Alexa Fluor 555, Atto647N, and (5)6-napthofluorescein. In other aspects, it is contemplated that the labeling moiety may be a fluorescent peptide or protein or a quantum dot.
- oligonucleotides or oligonucleotide derivatives may be used as the labeling moiety for the peptides.
- thiolated oligonucleotides may be coupled to peptides using the presented methods.
- Commonly available thiol modifications are 5' thiol modifications, 3' thiol modifications, and dithiol modifications and each of these modifications may be used to modify the peptide.
- the peptides may be subjected to Edman degradation (Edman el al, 1950) and the oligonucleotides may be used to determine the presence of a specific amino acid residue in the remaining peptide sequence.
- the labeling moiety may be a peptide-nucleic acid.
- the peptide-nucleic acid may be attached to the peptide sequence on specific amino acid residues.
- One element of fluorosequencing is the removal of the labeled peptides through such techniques such as Edman degradation and subsequent visualization to detect a reduction in fluorescence, indicating a specific amino acid has been cleaved. Removal of each amino acid residue is carried out through a variety of different techniques including Edman degradation and proteolytic cleavage.
- the techniques include using Edman degradation to remove the terminal amino acid residue.
- the techniques involve using an enzyme to remove the terminal amino acid residue. These terminal amino acid residues may be removed from either the C terminus or the N terminus of the peptide chain. In situations in which Edman degradation is used, the amino acid residue at the N terminus of the peptide chain is removed.
- the methods of sequencing or imaging the peptide sequence may comprise immobilizing the peptide on a surface.
- the peptide may be immobilized using an cysteine residue, the N terminus, or the C terminus.
- the peptide is immobilized by reacting the cysteine residue with the surface.
- the present disclosure contemplates immobilizing the peptides on a surface such as a surface that is optically transparent across the visible spectra, the infrared spectra, or a combination thereof possesses a refractive index between 1.3 and 1.6, is between 10 to 50 nm thick, is chemically resistant to organic solvents as well as strong acid such as trifluoroacetic acid, or any combination thereof.
- a large range of substrates like fluoropolymers (Teflon-AF (Dupont), Cytop® (Asahi Glass, Japan)), aromatic polymers (polyxylenes (Parylene, Kisco, Calif.), polystyrene, polymethmethylacrytate) and metal surfaces (Gold coating)), coating schemes (spin-coating, dip-coating, electron beam deposition for metals, thermal vapor deposition and plasma enhanced chemical vapor deposition) and functionalization methodologies (polyallylamine grafting, use of ammonia gas in PECVD, doping of long chain end-functionalized fluorous alkanes etc) may be used in the methods described herein as a useful surface.
- a 20 nm thick, optically transparent fluoropolymer surface made of Cytop® may be used in the methods described herein.
- the surfaces used herein may be further derivatized with a variety of fluoroalkanes that will sequester peptides for sequencing and modified targets for selection.
- an aminosilane modified surfaces may be used in the methods described herein.
- the methods described herein may comprise immobilizing the peptides on the surface of beads, resins, gels, quartz particles, glass beads, or combinations thereof.
- the methods contemplate using peptides that have been immobilized on the surface of Tentagel® beads, Tentagel® resins, or other similar beads or resins.
- the surface used herein may be coated with a polymer, such as polyethylene glycol.
- the surface is amine functionalized.
- the surface is thiol functionalized.
- Each of these sequencing techniques involves imaging the peptide sequence to determine the presence of one or more labeling moiety on the peptide sequence. In some embodiments, these images are taken after each removal of an amino acid residue and used to determine the location of the specific amino acid in the peptide sequence. In some embodiments, the methods can result in the elucidation of the location of the specific amino acid in the peptide sequence. These methods may be used to determine the locations of specific amino acid residues in the peptide sequence or these results may be used to determine the entire list of amino acid residues in the peptide sequence. The methods may involve determining the location of one or more amino acid residues in the peptide sequence and comparing these locations to specific peptide sequences and determining the entire list of amino acid residues in the peptide sequence.
- the methods may comprise labeling one or more additional amino acid residues which do not contain a post translational modification.
- These amino acids may be labeled with a labeling moiety which is different from the label used to label the amino acid residue containing the post translational modification. If more than one position on the peptide is labeled, it is contemplated that the amino acids are labeled in the following order: cysteine, lysine, N terminus, C terminus, amino acids with carboxylic acid groups on the side chain, tryptophan, or any combination thereof. It is contemplated that one or more of these particular amino acids may be labeled or all of these amino acid residues may be labeled with different labels.
- the imaging methods used in the sequencing techniques may involve a variety of different methods such as fluorimetry and fluorescence microscopy.
- the fluorescent methods may employ such fluorescent techniques such as fluorescence polarization, Forster resonance energy transfer (FRET), or time-resolved fluorescence.
- fluorescence microscopy may be used to determine the presence of one or more fluorophores in the single molecule quantity.
- imaging methods may be used to determine the presence or absence of a label on a specific peptide sequence. After repeated cycles of removing an amino acid residue and imaging the peptide sequence, the position of the labeled amino acid residue can be determined in the peptide.
- the present methods comprise labeling and determining the presence and position, location, quantity, type of a post translational modification of a peptide sequence, or any combination thereof.
- Post translational modifications are used to refer to a covalent modification of a protein or peptide through enzymatic or non-enzymatic modification of the protein or peptide.
- the post translational modification includes both natural as well as non-natural modifications.
- Post translational modifications may be used to describe a variety of different types of covalent modifications including a modification to the side chain of an amino acid or cleaving of peptide (or amide) bonds, or as a result of oxidative stress. Often post translational modifications are attached to the side chain of an amino acid.
- side chains of amino acids which contain a nucleophilic side chain are often the site of a post translational modification.
- the side chains of amino acids, which may be modified include nucleophilic sites such as the hydroxyl groups of amino acids serine, threonine, and tyrosine, the amine group of amino acids lysine, arginine, and histidine, the thiol group of cysteine, and the carboxylic acid group of aspartate and glutamine.
- post translational modifications include addition of a hydrophobic group such as alkylation which may be used to introduce one or more alkyl such as methyl groups, acylation which may be used to introduce one or more acyl group such as acetylation, formylation, or acylation with a fatty acid, or prenylation which introduces a isoprenoid group.
- Other post translational modifications may include the introduction of a cofactor or translation factors such as a flavin moiety, a heme moiety, lipoylation, or diphthamide formation.
- Other post translation modification may comprise the introduction of another protein such as SUMOylation, which attaches a SUMO protein, or ubiquitination, which attaches the protein ubiquitin.
- Post translational modifications may further comprise the introduction of a chemical group to an existing amino acid residue.
- chemical groups which can be used to modify an amino acid residue include acylation, alkylation, amide bond formulation, carboxylation, glycosylation, hydroxylation, iodination, phosphorylation, nitrosylation, sulfmylation, sulfenylation, sulfation, or succinylation.
- the present methods may be used to determine the presence of one or more of these post translational modifications.
- the post translational modification is an alkylation specifically a methylation to introduce a mono, di or trimethylamine group to the side chain of the lysine residue.
- the post translational modification is the phosphorylation of a hydroxyl group on tyrosine, threonine, or serine residue especially a threonine or a serine residue.
- the post translational modification is a glycosylation of a nitrogen or oxygen atom in the side chain of an amino acid.
- the peptides or proteins with a post translational modification described herein may be obtained from a biological sample.
- These biological samples may be obtained from an animal or plant source.
- One potential animal source is a mammal source such as a sample obtained from a human.
- the human source may be obtained from a baby, an adolescent, or an adult human.
- These biological samples may include cell-free samples.
- a cell-free sample may be a sample which is free of cells, substantially free of cells or essentially free of cells.
- a cell-free biological sample may include a protein(s), peptide(s), amino acid(s), a nucleic acid molecule(s) (e.g., ribonucleic acid molecule or deoxyribonucleic acid molecule), or any combination thereof. While a sample may be denoted as cell-free, the sample may contain a small number of cells or cell debris while still being considered cell- free.
- these samples may include less than or equal to about 50 cells or fewer per milliliter of sample, 45 cells per milliliter, 40 cells per milliliter, 35 cells per milliliter, 30 cells per milliliter, 25 cells per milliliter, 20 cells per milliliter, 15 cells per milliliter, 10 cells per milliliter, 5 cells per milliliter, 1 cell per milliliter, or less.
- these samples may include greater than or equal to about 1 cell per milliliter, 5 cells per milliliter, 10 cells per milliliter, 15 cells per milliliter, 20 cells per milliliter, 25 cells per milliliter, 30 cells per milliliter, 35 cells per milliliter, 40 cells per milliliter, 45 cells per milliliter, 45 cells per milliliter, 50 cells per milliliter, or more.
- Such cell-free samples may include blood (e.g., whole blood), serum, plasma, saliva, urine, or mucous, for example.
- amino acid in general refers to organic compounds that contain at least one amino group,— NFh which may be present in its ionized form,— NH 3 + , and one carboxyl group,— COOH, which may be present in its ionized form,— COO .
- carboxylic acids are deprotonated at neutral pH, having the basic formula of NH2CHRCOOH.
- An amino acid and thus a peptide has an N (amino)-terminal residue region and a C (carboxy)-terminal residue region.
- Types of amino acids include at least 20 that are considered“natural” as they comprise the majority of biological proteins in mammals and include amino acid such as lysine, cysteine, tyrosine, threonine, etc.
- Amino acids may also be grouped based upon their side chains such as those with a carboxylic acid groups (at neutral pH), including aspartic acid or aspartate (Asp; D) and glutamic acid or glutamate (Glu; E); and basic amino acids (at neutral pH), including lysine (Lys; L), arginine (Arg; N), and histidine (His; H).
- terminal is referred to as singular terminus and plural termini.
- side chains refers to unique structures attached to the alpha carbon (attaching the amine and carboxylic acid groups of the amino acid) that render uniqueness to each type of amino acid.
- R groups have a variety of shapes, sizes, charges, and reactivities, such as charged polar side chains, either positively or negatively charged, such as lysine (+), arginine (+), histidine (+), aspartate (-) and glutamate (-), amino acids can also be basic, such as lysine, or acidic, such as glutamic acid; uncharged polar side chains have hydroxyl, amide, or thiol groups, such as cysteine having a chemically reactive side chain, i.e.
- Non-polar hydrophobic amino acid side chains include the amino acid glycine; alanine, valine, leucine, and isoleucine having aliphatic hydrocarbon side chains ranging in size from a methyl group for alanine to isomeric butyl groups for leucine and isoleucine; methionine (Met) has a thiol ether side chain, proline (Pro) has a cyclic pyrrolidine side group.
- Phenylalanine (with its phenyl moiety) (Phe) and typtophan (Trp) (with its indole group) contain aromatic side groups, which are characterized by bulk as well as nonpolarity.
- Amino acids can also be referred to by a name or 3-letter code or 1 -letter code, for example, Cysteine; Cys; C, Lysine; Lys; K, Tryptophan; Trp; W, respectively.
- Amino acids may be classified as nutritionally essential or nonessential, with the caveat that nonessential vs. essential may vary from organism to organism or vary during different developmental stages. Nonessential or conditional amino acids for a particular organism is one that is synthesized adequately in the body, typically in a pathway using enzymes encoded by several genes, as substrates allow for protein synthesis.
- Essential amino acids are amino acids that the organism is not unable to produce or not able to produce enough naturally, via de novo pathways, for example lysine in humans. Humans obtain essential amino acids through their diet, including synthetic supplements, meat, plants and other organisms.
- “Unnatural” amino acids are those not naturally encoded or found in the genetic code nor produced via de novo pathways in mammals and plants. They can be synthesized by adding side chains not normally found or rarely found on amino acids in nature.
- b amino acids which have their amino group bonded to the b carbon rather than the a carbon as in the 20 standard biological amino acids, are unnatural amino acids.
- a common naturally occurring b amino acid is b-alanine.
- the term the terms“amino acid sequence”,“peptide”,“peptide sequence”,“polypeptide”, and“polypeptide sequence” are used interchangeably herein to refer to at least two amino acids or amino acid analogs that are covalently linked by a peptide (amide) bond or an analog of a peptide bond.
- the term peptide includes oligomers and polymers of amino acids or amino acid analogs.
- the term peptide also includes molecules that are commonly referred to as peptides, which generally contain from about two (2) to about twenty (20) amino acids.
- the term peptide also includes molecules that are commonly referred to as polypeptides, which generally contain from about twenty (20) to about fifty amino acids (50).
- peptide also includes molecules that are commonly referred to as proteins, which generally contain from about fifty (50) to about three thousand (3000) amino acids.
- the amino acids of the peptide may be /.-amino acids or //-amino acids.
- a peptide, polypeptide or protein may be synthetic, recombinant or naturally occurring.
- a synthetic peptide is a peptide that is produced by artificially in vitro.
- the term“subset” refers to the A-terminal amino acid residue of an individual peptide molecule.
- A“subset” of individual peptide molecules with an N- terminal lysine residue is distinguished from a“subset” of individual peptide molecules with an A-terminal residue that is not lysine.
- the term“substituted” may refer to a compound in which one or more hydrogen atoms on the parent molecule has been replaced with another group such that the group does not substantially alter the essential function for which the compound. More specifically, the term“substituted” means that the referenced group may be substituted with one or more additional group(s) individually and independently selected from alkyl, cycloalkyl, aryl, heteroaryl, heterocycloalkyl, -OH, alkoxy, aryloxy, alkylthio, arylthio, alkylsulfoxide, arylsulfoxide, alkylsulfone, arylsulfone, -CN, alkyne, Ci-C6alkylalkyne, halo, acyl, acyloxy, -CO2H, -C02-alkyl, nitro, haloalkyl, fluoroalkyl, and amino, including mono- and di-substituted
- the protecting groups that may form the protective derivatives of the above substituents are found in sources such as Greene and Wuts, above.
- a non-limiting list of possible chemical groups includes -OH, -F, -Cl, -Br, -I, -NH 2 , -NO2, -CO2H, -CO2CH3, -CO2CH2CH3, -CN, -SH, -OCH3, -OCH2CH3, -C(0)CH 3 , -NHCH3, -NHCH2CH3, -N(CH 3 ) 2 , -C(0)NH 2 , -C(0)NHCH 3 , -C(0)N(CH 3 ) 2 , -OC(0)CH 3 , -NHC(0)CH 3 , -S(0) 2 OH, or -S(0) 2 NH 2 .
- fluorescence refers to the emission of visible light by a substance that has absorbed light of a different wavelength.
- fluorescence provides a non-destructive way of tracking, analyzing, or a combination of tracking and analyzing biological molecules based on the fluorescent emission at a specific wavelength.
- Proteins including antibodies
- peptides including nucleic acid, oligonucleotides (including single stranded and double stranded primers) may be“labeled” with a variety of extrinsic fluorescent molecules referred to as fluorophores.
- sequencing of peptides“at the single molecule level” refers to amino acid sequence information obtained from individual (i.e. single) peptide molecules in a mixture of diverse peptide molecules.
- the present disclosure may not be limited to methods where the amino acid sequence information obtained from an individual peptide molecule is the complete or contiguous amino acid sequence of an individual peptide molecule. In some embodiment, it is sufficient that partial amino acid sequence information is obtained, allowing for identification of the peptide or protein. Partial amino acid sequence information, including for example the pattern of a specific amino acid residue (i.e. lysine) within individual peptide molecules, may be sufficient to uniquely identify an individual peptide molecule.
- a pattern of amino acids such as X-X-X-Lys-X-X-X-X-Lys-X-Lys, which indicates the distribution of lysine molecules within an individual peptide molecule, may be searched against a specific proteome of a given organism to identify the individual peptide molecule. It is not intended that sequencing of peptides at the single molecule level be limited to identifying the pattern of lysine residues in an individual peptide molecule; sequence information for any amino acid residue (including multiple amino acid residues) may be used to identify individual peptide molecules in a mixture of diverse peptide molecules.
- single molecule resolution refers to the ability to acquire data (including, for example, amino acid sequence information) from individual peptide molecules in a mixture of diverse peptide molecules.
- the mixture of diverse peptide molecules may be immobilized on a solid surface (including, for example, a glass slide, or a glass slide whose surface has been chemically modified).
- this may include the ability to simultaneously record the fluorescent intensity of multiple individual (i.e. single) peptide molecules distributed across the glass surface.
- optical devices that can be applied in this manner. For example, a conventional microscope equipped with total internal reflection illumination and an intensified charge- couple device (CCD) detector is available (see Braslaysky el al, 2003).
- Imaging with a high sensitivity CCD camera allows the instrument to simultaneously record the fluorescent intensity of multiple individual (i.e. single) peptide molecules distributed across a surface.
- image collection may be performed using an image splitter that directs light through two band pass filters (one suitable for each fluorescent molecule) to be recorded as two side-by-side images on the CCD surface.
- Using a motorized microscope stage with automated focus control to image multiple stage positions in the flow cell may allow millions of individual single peptides (or more) to be sequenced in one experiment.
- Attribution probability mass function for a given fluorosequence, the posterior probability mass function of its source proteins, i.e. the set of probabilities P(pi/fi) of each source protein pi, given an observed fluorosequence fi.
- the peptide was precipitated with cold ether and centrifuged for 10 mins at 8000 ref.
- the pellet was resuspended in acetonitrile/water (1 : 1 v:v mixture) and purified by high-performance liquid chromatography (Shimadzu Inc.) with an Agilent® Zorbax® column (4.6 c 250 mm) operating at 10 mL/min flow rate with a gradient of 5-95% methanol (0.1% formic acid) over 90 minutes.
- the fraction containing the peptide was collected, and the volume reduced using a rotary evaporator before lyophilization.
- Atto647N-SH Single dye-thiol reagent Atto647N-SH was prepared by reacting the Atto647N-S-S-Atto647N reagent with 1 mM tris(2-carboxyethyl)phosphine (TCEP) and incubating it for 1 h at 60 °C.
- TCEP tris(2-carboxyethyl)phosphine
- the TCEP addition to break the disulfide linkage in the dye-thiol reagent can be performed prior to the addition of the dye-thiol reagent to the mixture.
- the entire contents of the reaction was then diluted to 2 mL with acetonitrile/water mixture (1 : 1 v:v), and HPLC separated (as above).
- the fluorescent fractions monitored at 640 nm absorbance by the diode-array detector on HPLC, were then collected, as they correspond to the phosphorylated peptide.
- labeled phosphorylated peptide was lyophilized.
- the phosphate group present on any modified amino acids can be labeled by the EDC/Imidazole reaction mechanism (shown in Scheme 1).
- the reaction has been described for oligonucleotides and can also be used for labeling pyrophosphates on amino acids as well and has been adapted from Wang el al. , 1993.
- the phosphorylated peptide is reacted with 0.1 M imidazole, 0.1 M EDC and 0.25 M of donor amine (fluorophore) in pH 7.5 buffer such as PBS buffer (e.g., ⁇ 10 mM).
- the reaction is kept at 50 °C for 20 minutes.
- the labeled peptide is subsequently purified and sequenced by single molecule sequencing method.
- Scheme 1 Pan Modification of Phosphorylated Amino Acid Residues
- HEK-293T Human Embryonic kidney 293 transgenic (HEK-293T) cells were cultured and lysed using a modified RIPA buffer. Proteins were quantified and isolated from the cell lysate prior to labeling. Proteins were then denatured, and digested with the protease trypsin at a 1:50 ratio of trypsin enzyme to protein. Following digestion, a 10 kDa filter was used to filter out peptides. All phosphorylated serines and threonines in solution were then labeled using the following techniques. Phosphorylated residues were converted to the beta-eliminated variants using Ba(OH)2. A Michael addition reaction was then used to couple the fluorophore Atto 647N with a thiol modification to the beta-eliminated resides. Fluorescently labeled peptides were then purified and lyophilized.
- Fluorosequencing allows for low abundance variations of protein/peptide molecules to be identified and is described in Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461034; U.S. Patent Application Serial No. 15/150,962.
- This method relies on specific labeling of amino acids with fluorophores to determine its position in the peptide chain. This method can be similarly extended to identify the positions of modified amino acids by use of sugar specific fluorophores.
- the concept for labeling glyocosylated amino acids is a two-step process.
- the first step oxidizes the alcohol groups of sugar moieties to aldehydes.
- the second step then reacts the dithiol reagent with the aldehyde group of the sugar molecule. It has been shown that l,3-dithiane does not degrade when exposed to sequencing conditions, thus the inventors identified ways to modify fluorophores to have a 1, 3-dithiol tether to label glycosylated amino acids.
- Atto647N-SH Single dye-thiol reagent Atto647N-SH was prepared by reacting the Atto647N-S-S-Atto647N reagent with 1 mM tris(2-carboxyethyl)phosphine (TCEP) and incubating it for 1 h at 60 °C.
- TCEP tris(2-carboxyethyl)phosphine
- Fluorosequencing has been shown to precisely map the positions of fluorescently labeled amino acid residues on peptides at a sensitivity of a single molecule, and may be useful for the identification of lysine trimethylation as described in Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461034; U.S. Patent
- Nitric oxide is a cell-signaling molecule that is synthesized by a family of enzymes known as nitric oxide synthetases. NO can react with metalloproteins or covalently modify tyrosine and cysteine residues through oxidation or production of reactive nitrogen species. Nitrosylation is this category of post-translational modification that produce a covalent addition of L'-nitrosylation on cysteines or nitration on tyrosine residues (See Scheme 7). Detecting and quantifying the modification have implications for better understanding of the signaling processes during stress or inflammation and developing diagnostics (Abello et al, 2009).
- Protein/peptide isolation Proteins are harvested from the cells using protocols common in molecular biology (Lee, 2017) and digested into peptides by common proteases, such as trypsin or GluC. In some scenarios it is feasible to fix cells by treating it with cold methanol (-20 °C) or other methods of cell fixation. Following fixation, the cells may be directly reacted with the reagent to label surface accessible PTM.
- Blocking free thiols In order to carry out the L'-nitrosylation labeling reaction, the free thiols present on cysteine should be blocked. Two common reagents used in the procedure are iodoacetamide and A-methy 1 mal ei mi de. 2-20 mM of the reagent is used at pH 7.5 buffer in order to block thiols on the peptides.
- Labeling the SNO group Up to 3 mM of reagent (with or without fluorophore) is incubated with the peptides or fixed cells for from about 30 mins to about 2 hours at room temperature. The excess reagent is separated by rinsing/HPLC separation or other methods such as dialysis. 4. Fluorosequencing: Fluorosequencing is performed on the fluorescently labeled peptides.
- This method can thus localize the residues of modification and quantify the stoichiometry of PTM labeling of the cysteine residue.
- Other variants of ligation of fluorophore with the intermediate phosphine adduct can be performed such as dehydroalanine formation as indicated in literature (Devari e-Baez el al, 2013).
- the common chemical derivatization strategy for nitrotyrosine, used in mass-spectrometry proteomics is a two-step process.
- the first step is the reduction of the nitro group to the amino group followed by covalently labeling the amino group with a specialized reagent.
- the other amino groups on the peptides/proteins are blocked, typically by acetylation (Abello et ctl, 2010; Devarie-Baez et ctl, 2013).
- This strategy (See Scheme 8) can be directly adapted for labeling the nitrotyrosine group with a distinct fluorophore for fluorosequencing.
- a method for labeling the nitrotyrosine for fluorosequencing application is described as follows: 1. Protein/peptide isolation: The isolated proteins and peptides are solubilized in sodium phosphate buffer (pH 7.5). The digested proteins or peptides can be lyophilized prior to analysis. The approximate concentration of the peptide is 10 mM.
- Acetylation of amines All the free amines and other nucleophiles are acetylated by incubating 190 pL of the nitrated peptide with NHS-Acetate (final concentration of 25 mM) for 2 h at room temperature. The //-acetylations were reversed and excess reagent hydrolyzed by boiling the reaction for 15 minutes.
- Citrullination is a post-translational modification caused by enzyme Protein Arginine deiminase (PAD) where the arginine side chain is converted to citrulline (process called deimination).
- PAD Protein Arginine deiminase
- the conversion leads to a change in the mass by lDa, the loss of the positive charge and two potential hydrogen bond donors.
- the modification has a major effect on protein structure and stability and is implicated in autoimmune disorders, neurodegenerative diseases and in tumor biology (Gy orgy et al, 2006).
- the small mass change overlaps with the isotopic distribution of unmodified Arginine residues in peptide mass-spectrometry, making its identification challenging.
- a chemoselective strategy for targeting citrullinated residue has been demonstrated.
- a phenylglyoxal reagent reacts with arginine (under basic) and citrulline (under acidic conditions) forming a five membered ring.
- the reagent additionally binds to homocitrulline and cysteine, the thiohemiacetal ring formed with cysteine is hydrolysed in neutral pH.
- Protein/peptide isolation The isolated proteins are digested or the peptide is isolated according to standard well optimized procedures. About 50 mM of citrullinated peptides is lyophilized or solubilized in 50 mM HEPES buffer (pH 7.5)
- citrulline containing peptide was incubated with 5 mM phenylglyoxal reagent and 20% Trichloroacetic acid (pH ⁇ l) for 3 hours at 37 °C.
- the phenylglyoxal reagent can be directly coupled with a fluorophore or contain a handle (click handle) for subsequent reaction with a fluorophore.
- Rhodamine-Phenylglyoxal reagent Selective labeling of citrullinated residue by Rhodamine-Phenylglyoxal reagent.
- A Reaction conditions for labeling of citrullinated residue.
- B Rhodamine - phenylglyoxal reagent used for fluorescently labeling citrullinated residues for fluorosequencing.
- Sulfenic acid is one of a specific oxidative modification of cysteine residue which is formed upon reaction of the thiol side chain with mild oxidizing environment.
- the modification is a readout of early stages of reactive oxygen species formation, the intermediate step for formation of disulfide bond formation and also involved in redox signaling (Poole et al, 2004).
- the unstable nature of the bond under commonly used ionization conditions in mass spectrometers makes localizing and quantifying the modification extremely challenging.
- the reactive nature of the group enables chemical coupling and enrichment of the modified peptides (Poole et al. , 2007; Reddie et al. , 2008) feasible.
- the principle is the selective reaction of the sulfenic acid with dimedone (5,5- dimethyl-l,3-cyclohexanedione) which has been linked to several fluorescent reagents (See Scheme 11). Additionally, a biotin labeled reagent may be used (Millipore; Cat # NS1226- 1MG).
- Protein/peptide isolation The proteins were digested or the peptides were isolated using common standardized procedures. About 1-10 pmol peptides were lyophibzed or solubilized in phosphate buffer (pH 7; 25 mM) and 1 mM EDTA. 2. Labeling of sulfenic acid: The fluorescent reagent was added to a concentration of 5 mM and incubated for 2 h at 37 °C. The reagent can be two halves - one with an azide handle and the second with a fluorophore that specifically reacts with the linker.
- troponin is a diagnostic biomarker for cardiac dysregulation (Wijnker et al, 2014).
- the site-specific nature of the phosphorylation is an important diagnostic and therapeutic marker for understanding and treating heart failures (Zhang et al. , 2012).
- the diagnosis may range from exercise to a disease state as severe as cardiac myopathy.
- the methods presented above can be easily adopted to assess the phosphorylation state of a number of potential phosphorylation related biomarkers.
- the first step would be to perform a standard antibody pulldown for the protein of interest, i.e. troponin.
- the enriched protein may be digested into shorter peptides using a protease, such as GluC or trypsin, producing peptides of a specific length.
- the phosphorylation sites can then be labelled on the peptide molecules as described in Example 1.
- This methodology may also be applied to assessing the methylation or glycosylation of any protein as well, providing new biomarkers for diseases which are characterized by post-translational modifications of the proteins.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Urology & Nephrology (AREA)
- Biomedical Technology (AREA)
- Hematology (AREA)
- Immunology (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Food Science & Technology (AREA)
- Pathology (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Biotechnology (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Microbiology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present disclosure provides methods of selectively label an amino acid residue on a peptide by replacing a post translational modification with a labeling moiety and sequencing the peptide to obtain the location of the amino acid residue and the identity of the post translational modification. In some aspects, the disclosure also provides methods of identifying the position, quantity, the identity of a post translational modification, or any combination thereof, in peptides which may be used for therapeutic purposes.
Description
DESCRIPTION
SINGLE MOLECULE SEQUENCING IDENTIFICATION OF POST- TRANSLATIONAL MODIFICATIONS ON PROTEINS
[0001] This application claims the benefit of priority to United States Provisional Application Serial No. 62/702,318, filed on July 23, 2018, the entire content of which is hereby incorporated by reference.
[0002] This invention was made with government support under Grant Nos. R35 GM122480 and OD009572 awarded by the National Institutes of Health. The government has certain rights in the invention.
BACKGROUND
[0003] Post-translational modifications (PTMs) of proteins are covalent attachments of chemical moieties on the side chains of select amino acids or the N and C terminus of a peptide or a protein. The activity and functions of many proteins are modulated by the nature of their PTMs. Some non-limiting examples of PTMs include phosphorylation, glycosylation, alkylation, acylation, hydroxylation, or the attachment of a cofactor or nucleotide. Of the many different types of PTMs, one important class of chemical modifications - phosphorylation - is ubiquitous and extensively studied. This is due to their important role in cell-signaling and in diagnosing diseased states (Ardito et a/., 2017; Stowell et al, 2015). Detecting and mapping the amino acid residues modified by PTMs is biologically important to study with its understanding translating into effective disease treatments.
[0004] One such example is the C-terminal domain of the Epidermal growth factor receptor (EGFR) family of proteins that contains approximately 20 tyrosine residues capable of being phosphorylated. Depending on the combination of these phosphorylated sites in an activated cell, the downstream processes can range from cell proliferation, differentiation, anti-apoptosis (survival), adhesion, migration, and angiogenesis (Huang et al. , 2011). Understanding and mapping these sites is thus critical not only to better understand cell signaling pathways, but also develop the current therapeutic drugs. However, mapping post- translational modifications have been intrinsically challenging due to their low abundance and sample heterogeneity. The current methods do not allow for precise determination of the specific location of PTMs while also allowing for quantitative determination of the PTMs.
Therefore, there remains an unmet need to identify methods which allow from improved detection of PTMs in a protein or peptide.
SUMMARY
[0005] The present disclosure provides methods and systems for protein or peptide sequencing and/or protein or peptide identification. Methods and systems of the present disclosure may be used to sequence a protein or peptide for the determination of a post- translational modification(s) and the location(s) of such post-translational modification(s).
[0006] In some aspects, the present disclosure provides methods of identifying a post translational modification on an amino acid residue of a peptide or protein, the method comprising:
(A) treating the peptide or protein with a labeling reagent under conditions such that the labeling reagent interacts with the post translational modification on the amino acid residue of the peptide or protein, to covalently couple the labeling reagent or derivative thereof to the amino acid residue and yield a labeled peptide or protein; and
(B) sequencing the labeled peptide or protein.
[0007] In some embodiments, the post translational modification on the amino acid residue is phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation. In some embodiments, the post translational modification on the amino acid residue is phosphorylation on tyrosine, serine, or threonine. In some embodiments, the post translational modification on the amino acid residue is phosphorylation on a serine. In other embodiments, the post translational modification on the amino acid residue is phosphorylation on a threonine. In other embodiments, the post translational modification on the amino acid residue is an /V-glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of asparagine or arginine. In other embodiments, the post translational modification on the amino acid residue is an O- glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of serine, threonine, or tyrosine. In other embodiments, the post translational modification on the amino acid residue is trimethylation. In some embodiments, the post translational modification on the amino acid residue is trimethylation of lysine. In other embodiments, the post translation modification on the amino acid residue is
nitrosylation. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine or tyrosine. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation of a tyrosine. In other embodiments, the post translation modification on the amino acid residue is citrullination. In other embodiments, the post translation modification on the amino acid residue is sulfenylation. In some embodiments, the post translational modification on the amino acid residue is sulfenylation of a cysteine.
[0008] In some embodiments, the post translation modification is on an amino acid residue of a protein. In other embodiments, the post translation modification is on an amino acid residue of a peptide. In some embodiments, the labeling reagent comprises a thiol group. In some embodiments, the labeling reagent comprises two thiol groups. In some embodiments, the labeling reagent comprises an amine reactive group such as a succinimidyl ester. In some embodiments, the labeling reagent comprises a glyoxal group. In some embodiments, the labeling reagent comprises a l,3-cycloalkanedione group such as a 1,3- hexanedione.
[0009] In some embodiments, the labeling reagent is a fluorophore, oligonucleotide, or peptide-nucleic acid. In some embodiments, the labeling reagent is a fluorophore. In some embodiments, the labeling reagent is a thiol containing fluorophore. In some embodiments, the fluorophore is a xanthene dye such as a rhodamine dye.
[0010] In some embodiments, the methods involve treating the peptide or protein with the labeling reagent comprises:
(0 reacting the peptide or protein under conditions such that the post translational modification on the peptide or protein is converted to a reactive group to form a reactive peptide or protein;
(//) reacting the labeling reagent with the reactive peptide or protein to form the labeled peptide or protein.
[0011] In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with a base. In some embodiments, the base is a rare earth metal hydroxide such as Ba(OH)2.
[0012] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with an activating agent and a base. In some embodiments, the activating agent is a carbodiimide such as l-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC). In some embodiments, the base is a heteroaromatic base such as an imidazole.
[0013] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with silver oxide (Ag20). In some embodiments, the peptide or protein comprising a trimethyl post translational modification is treated with silver oxide in the presence of heat. In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with a base. In some embodiments, the base is a nitrogenous base such as diisopropylethylamine or trimethylamine.
[0014] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a glycosylation post translational modification with an oxidizing agent. In some embodiments, the oxidizing agent is a hypervalent iodide reagent such as sodium periodate.
[0015] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with a reducing agent. In some embodiments, the reducing agent is disulfide reducing agent such as dithiothreitol. In some embodiments, the reducing agent further comprises heme. In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with phosphine. In some embodiments, the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted a triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a phosphine. In some embodiments, the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an
unsubstituted or substituted triphenylphosphine. In some embodiments, the phosphine is covalently linked to the labeling reagent.
[0016] In some embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a glyoxal group. In some embodiments, the glyoxal group is covalently linked to the labeling reagent. In other embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a l,3-cycloalkanedione such as a l,3-cyclohexanedione. In some embodiments, the l,3-cycloalkanedione is covalently bonded to the labeling reagent. In some embodiments, the reactive group on the reactive peptide or protein is a double bond. In some embodiments, the reactive peptide or protein is treated with the labeling reagent comprising a thiolene-click reaction to form a labeled peptide or protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent with a double bond in the presence of an olefin metathesis reagent to form a labeled peptide or protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent comprising a cycloaddition reaction to form a labeled peptide or protein.
[0017] In some embodiments, the reactive group on the reactive peptide or protein is an aldehyde. In some embodiments, the labeling reagent is treated with the reactive group on the reactive peptide or protein comprising nucleophilic addition, nucleophilic substitution, or radical addition. In some embodiments, the labeling reagent forms a thioether when treated with the reactive group on the reactive peptide or protein. In some embodiments, the labeling reagent forms a dithiane. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form an amide bond. In some embodiments, the amide bond formation provides the labeled peptide or protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a disulfide bond. In some embodiments, the disulfide bond formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a heterocycloalkane. In some embodiments, the heterocycloalkyl group formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a thioether bond. In some embodiments, the thioether bond formation provides the labeled peptide of protein.
[0018] In some embodiments, the sequencing comprises a fluorosequencing method. In some embodiments, the sequencing is at a single molecular level. In some embodiments, the fluorosequencing method comprises labeling at least one amino acid of the peptide or protein which does not contain a post translational modification with a second labeling reagent. In some embodiments, the fluorosequencing method comprises labeling one, two, three, four, or five distinct amino acids of the peptide or protein which do not contain a post translation modification. In some embodiments, each amino acid is labeled with a distinct second labeling reagent.
[0019] In some embodiments, the peptide or protein is bound to a solid support such as a surface. In some embodiments, the solid support is a resin, a bead, or a modified glass surface. In some embodiments, the solid support is the modified glass surface such as an aminosilicate surface.
[0020] In some embodiments, the fluorosequencing method further comprises removing at least one amino acid residue of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing two or more consecutive amino acid residues of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the fluorosequencing method comprises sequentially removing from 1 to 20 amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the amino acid residues are removed by Edman degradation. In some embodiments, the amino acid residue is removed by treating the /V- terminal amino acid residue with a thiourea and an acid, microwave irradiation, or heat. In some embodiments, the amino acid residues are removed by an enzyme.
[0021] In some embodiments, the peptide or protein is digested by a protease. In some embodiments, the peptide or protein is digested by a protease before labeling the amino acid comprising the post translational modification. In some embodiments, the peptide or protein is obtained from a biological sample. In some embodiments, the biological sample is a cell-free biological sample. In some embodiments, the biological sample is derived from blood. In other embodiments, the biological sample is derived from urine. In other
embodiments, the biological sample is derived from mucous. In other embodiments, the biological sample is derived from saliva.
[0022] In some embodiments, a covalent bond between the post translational modification on the amino acid residue of the peptide or protein and the labeling reagent is formed. In some embodiments, the labeling reagent or derivative thereof is directly covalently bonded to the amino acid residue. In some embodiments, the labeling reagent or derivative thereof is covalently coupled to the amino acid residue through an intermediary molecule.
[0023] In still another aspect, the present disclosure provides methods of determining the status of a disease or disorder in a subject, the method comprising:
(A) detecting a change in a type, identity, quantity, or position of a post translational modification or a plurality of post translational modifications on a protein or peptide using the methods described herein; and
(B) determining the status of the disease or disorder in the subject according to at least said change.
[0024] In some embodiments, the methods further comprise obtaining a biological sample from the subject. In some embodiments, determining the status of a disease or disorder is determining the prognosis of the patient that has the disease. In other embodiments, determining the status of a disease or disorder is diagnosing the patient with the disease. In other embodiments, determining the status of a disease or disorder is determining if the patient is at risk of having the disease.
[0025] In some embodiments, the change in post translation modification of a protein or peptide is a change in the phosphorylation of the protein. In other embodiments, the change in post translation modification of a protein or peptide is a change in the trimethylation of the protein. In other embodiments, the change in post translation modification of a protein or peptide is a change in the glycosylation of the protein. In other embodiments, the change in post translation modification of a protein or peptide is a change in the nitrosylation of the protein. In some embodiments, the change in post translation modification of a protein or peptide is a change in the citrullination of the protein. In some
embodiments, the change in post translation modification of a protein or peptide is a change in the sulfenylation of the protein.
[0026] In some embodiments, the biological sample is a cell-free biological sample such as saliva, mucous, urine, serum, plasma, or whole blood. In some embodiments, the method conveys the presence of one or more post translational modifications. In some embodiments, the method conveys the presence of two or more post translation modifications. In some embodiments, the method conveys the absence of one or more post translational modifications. In some embodiments, the method conveys the absence of one or more post translational modifications and the presence of one or more post translational modifications.
[0027] In some embodiments, the method conveys the type of the post translational modification in the protein. In some embodiments, the method conveys the identity of the post translational modification in the protein. In some embodiments, the method conveys the quantity of the post translational modification in the protein. In some embodiments, the method conveys the position of the post translational modification in the protein. In some embodiments, the subject is a mammal such as a human.
[0028] In some embodiments, the method further comprises enriching the protein before determining the type, identity, quantity, or position of the post translational modifications. In some embodiments, the protein is enriched by purification of the biological sample. In some embodiments, the protein is subjected to degradation before determining the types or identities of the post translational modifications. In some embodiments, the protein is degraded by a protease.
[0029] In some embodiments, the protein is immobilized on a solid support. In some embodiments, the solid support is a surface. In some embodiments, the solid support is a resin, a bead, or a modified glass surface. In some embodiments, the solid support is the modified glass surface such as an aminosilicate surface.
[0030] In some embodiments, the method comprises determining the type, identity, quantity, or position of post translational modification on two or more peptides or proteins.
[0031] In yet another aspect, the present disclosure provides methods for determining the status of a disease or disorder in a subject, the method comprising:
detecting a change in a type, identity, quantity, or position of the post translational modifications on the protein or peptide using the methods described herein related to the disease or disorder.
[0032] In some embodiments, the methods further comprise obtaining a biological sample from the subject.
[0033] In still another aspect, the present disclosure provides modified peptides or proteins comprising a peptide or protein comprising one or more post translational modifications, wherein at least one post translational modification of said peptide or protein comprising one or more post translational modifications is altered with at least a first labeling moiety, thereby forming a labeled peptide or protein comprising one or more post translational modifications.
[0034] In some embodiments, the at least the first labeling moiety is a fluorophore. In some embodiments, the peptide or protein comprises a second labeling moiety attached to one or more amino acid residues of the peptide or protein. In some embodiments, the second labeling moiety is a fluorophore. In some embodiments, said at least one post translational modification is selected from the group consisting of phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, trimethylation, or any combination thereof. In some embodiments, each post translational modification selected from the group consisting of phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation is altered by a distinct labeling moiety. In some embodiments, the modified peptide or protein comprises from 3 amino acid residues to about 250 amino acid residues. In some embodiments, the modified peptide or protein comprises from 5 amino acid residues to about 100 amino acid residues. In some embodiments, the modified peptide or protein comprises from about 7 amino acid residues to about 50 amino acid residues.
[0035] In some embodiments, the first labeling reagent replaces the post translational modification on the amino acid residue. In some embodiments, the post translation modification is on an amino acid residue of a protein. In other embodiments, the post translation modification is on an amino acid residue of a peptide. In some embodiments, the first labeling reagent comprises a thiol group. In some embodiments, the first labeling reagent comprises two thiol groups. In some embodiments, the first labeling reagent comprises an amine reactive group such as a succinimidyl ester. In some embodiments, the
first labeling reagent comprises a glyoxal group. In some embodiments, the first labeling reagent comprises a l,3-cycloalkanedione group such as a l,3-hexanedione.
[0036] In some embodiments, the first or second labeling reagent are a fluorophore, oligonucleotide, or peptide-nucleic acid. In some embodiments, the one of the first or second labeling reagent is a fluorophore. In some embodiments, the labeling reagent is a thiol containing fluorophore. In some embodiments, the fluorophore is a xanthene dye such as a rhodamine dye.
[0037] In some embodiments, the second labeling moiety is attached to a different type of amino acid of the peptide or protein than the first labeling moiety. In some embodiments, the methods further comprise one or more additional labeling moieties attached to one or more distinct amino acids of the peptide or protein.
[0038] In some embodiments, the peptide or protein is immobilized adjacent to a solid support. In some embodiments, the solid support is a surface. In some embodiments, the solid support is a resin, a bead, or a modified glass surface. In some embodiments, the solid support is a modified glass surface such as an aminosilicate surface.
[0039] In some embodiments, the peptide or protein has been degraded by a protease. In some embodiments, the post translation modification is phosphorylation of the peptide or protein. In other embodiments, the post translation modification is trimethylation of the peptide or protein. In other embodiments, the post translation modification is glycosylation of the peptide or protein. In other embodiments, the post translation modification is nitrosylation of the peptide or protein. In other embodiments, the post translation modification is citrullination of the peptide or protein. In other embodiments, the post translation modification is sulfenylation of the peptide or protein.
[0040] In some embodiments, the post translational modification on the amino acid residue is phosphorylation on tyrosine, serine, or threonine. In some embodiments, the post translational modification on the amino acid residue is phosphorylation on a serine. In other embodiments, the post translational modification on the amino acid residue is phosphorylation on a threonine. In other embodiments, the post translational modification on the amino acid residue is an /V-glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of asparagine or arginine. In other embodiments, the post translational modification on the amino acid residue is an O-
glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of serine, threonine, or tyrosine. In other embodiments, the post translational modification on the amino acid residue is trimethylation. In some embodiments, the post translational modification on the amino acid residue is trimethylation of lysine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine or tyrosine. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation of a tyrosine. In other embodiments, the post translation modification on the amino acid residue is citrullination. In other embodiments, the post translation modification on the amino acid residue is sulfenylation. In some embodiments, the post translational modification on the amino acid residue is sulfenylation of a cysteine.
[0041] In another aspect, the present disclosure provides methods of sequencing a peptide or protein comprising:
(A) obtaining a cell-free biological sample and separating the peptide or protein from the cell-free biological sample;
(B) labeling the peptide or protein under conditions sufficient to interact with at least one amino acid residue of the peptide or protein associated with a post translational modification with a first labeling moiety to form at least one labeled amino acid residue of the peptide or protein;
(C) subjecting the peptide or protein to conditions sufficient to remove one or more individual amino acid residues from the peptide or protein; and
(D) detecting at least one signal from the at least one labeled amino acid residue, thereby identifying the sequence of the peptide or protein.
[0042] In some embodiments, the post translational modification on the amino acid residue is phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation. In some embodiments, the post translational modification on the amino acid residue is phosphorylation on tyrosine, serine, or threonine. In some embodiments, the post translational modification on the amino acid residue is phosphorylation on a serine. In other embodiments, the post translational modification on the amino acid residue is
phosphorylation on a threonine. In other embodiments, the post translational modification on the amino acid residue is an /V-glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of asparagine or arginine. In other embodiments, the post translational modification on the amino acid residue is an O- glycosylation. In some embodiments, the post translational modification on the amino acid residue is glycosylation of serine, threonine, or tyrosine. In other embodiments, the post translational modification on the amino acid residue is trimethylation. In some embodiments, the post translational modification on the amino acid residue is trimethylation of lysine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine or tyrosine. In some embodiments, the post translation modification on the amino acid residue is nitrosylation of a cysteine. In other embodiments, the post translation modification on the amino acid residue is nitrosylation of a tyrosine. In other embodiments, the post translation modification on the amino acid residue is citrullination. In other embodiments, the post translation modification on the amino acid residue is sulfenylation. In some embodiments, the post translational modification on the amino acid residue is sulfenylation of a cysteine.
[0043] In some embodiments, the labeling reagent replaces the post translational modification on the amino acid residue. In some embodiments, the post translation modification is on an amino acid residue of a protein. In other embodiments, the post translation modification is on an amino acid residue of a peptide. In some embodiments, the labeling reagent comprises a thiol group. In some embodiments, the labeling reagent comprises two thiol groups. In some embodiments, the labeling reagent comprises an amine reactive group such as a succinimidyl ester. In some embodiments, the labeling reagent comprises a glyoxal group. In some embodiments, the labeling reagent comprises a 1,3- cycloalkanedione group such as a l,3-hexanedione.
[0044] In some embodiments, the labeling reagent is a fluorophore, oligonucleotide, or peptide-nucleic acid. In some embodiments, the labeling reagent is a fluorophore. In some embodiments, the labeling reagent is a thiol containing fluorophore. In some embodiments, the fluorophore is a xanthene dye such as a rhodamine dye.
[0045] In some embodiments, the methods further comprise labeling the peptide or protein with the first labeling moiety comprises:
(/) treating the peptide or protein under conditions such that the post translational modification on the peptide or protein is converted to a reactive group to form a reactive peptide or protein;
(//) treating the first labeling moiety with the reactive peptide or protein to form a labeled peptide or protein.
[0046] In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with a base. In some embodiments, the base is a rare earth metal hydroxide such as Ba(OH)2.
[0047] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with an activating agent and a base. In some embodiments, the activating agent is a carbodiimide such as l-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC). In some embodiments, the base is a heteroaromatic base such as an imidazole.
[0048] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with silver oxide (Ag20). In some embodiments, the peptide or protein comprising a trimethyl post translational modification is treated with silver oxide in the presence of heat. In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with a base. In some embodiments, the base is a nitrogenous base such as diisopropylethylamine or trimethylamine.
[0049] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a glycosylation post translational modification with an oxidizing agent. In some embodiments, the oxidizing agent is a hypervalent iodide reagent such as sodium periodate.
[0050] In other embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with a reducing agent. In some embodiments, the reducing agent is disulfide reducing agent such as dithiothreitol. In some embodiments, the reducing agent further comprises heme. In some embodiments, the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with phosphine. In some embodiments, the phosphine is an unsubstituted or substituted trialkylphosphine or an
unsubstituted or substituted a triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a phosphine. In some embodiments, the phosphine is an unsubstituted or substituted trialkylphosphine or an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triarylphosphine. In some embodiments, the phosphine is an unsubstituted or substituted triphenylphosphine. In some embodiments, the phosphine is covalently linked to the labeling reagent.
[0051] In some embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a glyoxal group. In some embodiments, the glyoxal group is covalently linked to the labeling reagent. In other embodiments, the methods involve contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a l,3-cycloalkanedione such as a l,3-cyclohexanedione. In some embodiments, the l,3-cycloalkanedione is covalently bonded to the labeling reagent. In some embodiments, the reactive group on the reactive peptide or protein is a double bond. In some embodiments, the reactive peptide or protein is treated with the labeling reagent comprising a thiolene-click reaction to form a labeled peptide or protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent with a double bond in the presence of an olefin metathesis reagent to form a labeled peptide or protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent comprising a cycloaddition reaction to form a labeled peptide or protein.
[0052] In some embodiments, the reactive group on the reactive peptide or protein is an aldehyde. In some embodiments, the labeling reagent is treated with the reactive group on the reactive peptide or protein comprising nucleophilic addition, nucleophilic substitution, or radical addition. In some embodiments, the labeling reagent forms a thioether when treated with the reactive group on the reactive peptide or protein. In some embodiments, the labeling reagent forms a dithiane. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form an amide bond. In some embodiments, the amide bond formation provides the labeled peptide or protein. In some embodiments, the reactive peptide
or protein is treated with the labeling reagent to form a disulfide bond. In some embodiments, the disulfide bond formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a heterocycloalkane. In some embodiments, the heterocycloalkyl group formation provides the labeled peptide of protein. In some embodiments, the reactive peptide or protein is treated with the labeling reagent to form a thioether bond. In some embodiments, the thioether bond formation provides the labeled peptide of protein.
[0053] In some embodiments, the sequencing comprises a fluorosequencing method. In some embodiments, the sequencing is at a single molecular level. In some embodiments, the fluorosequencing method comprises labeling at least one amino acid of the peptide or protein which does not contain a post translational modification with a second labeling reagent. In some embodiments, the fluorosequencing method comprises labeling one, two, three, four, or five distinct amino acids of the peptide or protein which do not contain a post translation modification. In some embodiments, each amino acid is labeled with a distinct second labeling reagent.
[0054] In some embodiments, the peptide or protein is bound to a solid support such as a surface. In some embodiments, the solid support is a resin, a bead, or a modified glass surface. In some embodiments, the solid support is the modified glass surface such as an aminosilicate surface.
[0055] In some embodiments, the fluorosequencing method further comprises removing at least one amino acid residue of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing two or more consecutive amino acid residues of the peptide or protein. In some embodiments, the fluorosequencing method comprises sequentially removing amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the fluorosequencing method comprises sequentially removing from 1 to 20 amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed. In some embodiments, the amino acid residues are removed by Edman degradation. In some embodiments, the amino acid residue is removed by treating the /V- terminal amino acid residue with a thiourea and an acid, microwave irradiation, or heat. In some embodiments, the amino acid residues are removed by an enzyme.
[0056] In some embodiments, the peptide or protein is digested by a protease. In some embodiments, the peptide or protein is digested by a protease before labeling the amino acid comprising the post translational modification.
[0057] In yet another aspect, the present disclosure provides methods for polypeptide sequence identification, comprising:
(A) obtaining a first polypeptide from a cell-free biological sample of a subject;
(B) using said first polypeptide to generate a second polypeptide immobilized to a support, wherein said second polypeptide comprises labeled amino acids;
(C) subjecting said second polypeptide to conditions sufficient to remove amino acids from said polypeptide; and
(D) during or subsequent to removal of said amino acids from said polypeptide, detecting signals from at least a subset of said labeled amino acids, thereby identifying a sequence of said second polypeptide to determine a sequence of said first polypeptide from said cell-free biological sample.
[0058] In some embodiment, less than all types of amino acids of said second polypeptide are labeled. In some embodiments, said first polypeptide is a protein.
[0059] In still yet another aspect, the present disclosure provides methods for processing or analyzing a protein or peptide containing or suspected of containing at least one post-translational modification, comprising: (A) sequencing said protein or peptide, and
(B) identifying said at least one post-translational modification in at least one amino acid subunit of said protein or peptide, or derivative thereof.
[0060] In some embodiments, said sequencing comprises subjecting said protein or peptide to degradation conditions to sequentially remove amino acid sub-units from said protein or peptide, and detecting at least a subset of said amino acid sub-units. In some embodiments, less than all amino acid sub-units of said peptide or protein are labeled, and wherein said sequencing comprises detecting a subset of said amino acid sub-units. In some embodiments, said at least one post-translational modification is identified during said sequencing. In some embodiments, said at least one post-translational modification is
identified prior to said sequencing. In some embodiments, said protein or peptide is obtained from a sample and processed to label said at least one post-translational modification. In some embodiments, said sample is a cell-free sample. In some embodiments, said sequencing comprises labeling said at least one post-translational modification of said protein or peptide with a label, and detecting said label to thereby identify said at least one post- translational modification on said protein or peptide.
[0061] In yet another aspect, the present disclosure provides methods for processing or analyzing a protein or peptide, comprising subjecting said protein or peptide to conditions sufficient to specifically label different post-translational modifications of said protein or peptide, and detecting labels corresponding to said different post-translational modifications of said protein or peptide to thereby detect said different post-translational modifications of said protein or peptide.
[0062] In some embodiments, said different post-translational modifications comprise phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation. [0063] As used herein,“essentially free,” in terms of a specified component, may refer to a specified component being absent from a composition or the component is present as a contaminant or in trace amounts. The total amount of the specified component resulting from any unintended contamination of a composition can be below 0.1%. In some embodiments, a composition in which no amount of the specified component can be detected with standard analytical methods.
[0064] As used herein in the specification and claims,“a” or“an” may refer to one or more. As used herein in the specification and claims, when used in conjunction with the word“comprising”, the words“a” or“an” may refer to one or more than one. As used herein, in the specification and claim,“another” or“a further” may refer to at least a second or more.
[0065] As used herein in the specification and claims, the term“about” is used to indicate that a value includes the inherent variation of error for the device, the method being employed to determine the value, or the variation that exists among the study subjects. In some embodiments, the term“about” refers to ±5% of the listed value.
[0066] Other objects, features and advantages of the present disclosure will become apparent from the following detailed description. The detailed description and the specific examples, while indicating certain embodiments, are given by way of illustration, since various changes and modifications within the spirit and scope will become apparent from this detailed description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0067] The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure. The disclosure may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
[0068] FIG. 1: Correct identification of phosphoserine residues on synthetic CTD heptad peptide by fluorosequencing. (Top) Phosphoserine is present at the 2nd position. (Bottom) Phosphoserine is present at the 5th position. Representative raw imaging data are shown for two individual peptide molecules from each experiment. For each individual molecule, the images are organized as a horizontal strip of consecutive TIRF micrographs (each corresponding to a square of 3 x 3 microns) centered on the peptide molecule. Each image represents one successive observation of emitted fluorescent light from that molecule after a round of Edman chemistry. A sharp reduction in fluorescence follows the Edman cycle in which the ammo acid with the attached fluorescent dye was removed, thus revealing the amino acid sequence position of the phosphorylated residue m the original peptide. The heatmap denotes the frequency histogram, tallying the counts of individual peptide molecules having lost fluorescence after every Edman degradation cycle over the background counts. The phosphorylated serine residue in the 2nd position (top) and 5th position (bottom) have significantly higher counts of fluorescent loss at the 2nd and 5th position, respectively, when analyzed by the fluorosequencing method.
[0069] FIG. 2 shows fluorosequencing position counts between two biological samples. Proteins from two different HEK-293T samples were digested, labeled, and sequenced on the fluorosequencing platform. Read counts were observed to be highly correlated between these biological replicates (Pearson coefficient 0.9582). Data is counts and plotted on a loglO scale
DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS
[0070] In some aspects, the present disclosure provides methods of typing, identifying, quantifying, or locating a post translational modification (PTM) in a peptide or protein. These methods may be used to determine the type, location, quantity, or position of a PTM such as phosphorylation, glycosylation, or alkylation in a peptide or protein. These methods may be used in conjunction with a fluorosequencing method such as those which include labeling of the post translational modification with a labeling moiety such as a fluorophore. These methods may further include the removal of one or more amino acid residues from the peptide or protein. In some aspects, these methods may be used to determine the progression or status of a disease or disorder in a patient.
I. Peptide Sequencing Methods
[0071] There exist many methods of identifying the sequence of a peptide including fluorosequencing, mass spectroscopy, identifying the peptide sequence from the nucleic acid sequence, and Edman degradation. Fluorosequencing has been found to provide single molecule resolution for the sequencing of proteins of interest (Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461,034; U.S. Patent Application Serial No. 15/510,962). One of the hallmarks of fluorosequencing is introduction of a fluorophore or other label into specific amino acid residues of the peptide sequence. This can involve the introduction of one or more amino acid residues with a unique labeling moiety. In some embodiments, one, two, three, four, five, or more different amino acids residues are labeled with a labeling moiety. The labeling moiety that may be used include fluorophores, chromophores, or a quencher. Each of these amino acid residues may include cysteine, lysine, glutamic acid, aspartic acid, tryptophan, tyrosine, serine, threonine, arginine, histidine, methionine, asparagine, and glutamine. Each of these amino acid residues may be labeled with a different labeling moiety. In some embodiments, multiple amino acid residues may be labeled with the same labeling moiety such as aspartic acid and glutamic acid or asparagine and glutamine. While this technique may be used with labeling moieties such as those described above, it is also contemplated that other labeling moiety may be used in fluorosequencing-like methods such as synthetic oligonucleotides or peptide-nucleic acid may be used. In particular, the labeling moiety used in the instant applications may be suitable to withstand the conditions of removing one or more of the amino acid residues. Some non-limiting examples of potential labeling moieties that may be used in the instant methods include those which emit a fluorescence signal in the red to infrared spectra such as
an Alexa Fluor® dye, an Atto dye, a rhodamine dye, or other similar dyes. Examples of each of these dyes which were capable of withstanding the conditions of removing the amino acid residues include Alexa Fluor® 405, Rhodamine B, tetramethyl rhodamine, Alexa Fluor 555, Atto647N, and (5)6-napthofluorescein. In other aspects, it is contemplated that the labeling moiety may be a fluorescent peptide or protein or a quantum dot.
[0072] Alternatively, synthetic oligonucleotides or oligonucleotide derivatives may be used as the labeling moiety for the peptides. For example, thiolated oligonucleotides may be coupled to peptides using the presented methods. Commonly available thiol modifications are 5' thiol modifications, 3' thiol modifications, and dithiol modifications and each of these modifications may be used to modify the peptide. Following oligonucleotide coupling to the peptides as above, the peptides may be subjected to Edman degradation (Edman el al, 1950) and the oligonucleotides may be used to determine the presence of a specific amino acid residue in the remaining peptide sequence. In other embodiments, the labeling moiety may be a peptide-nucleic acid. The peptide-nucleic acid may be attached to the peptide sequence on specific amino acid residues.
[0073] One element of fluorosequencing is the removal of the labeled peptides through such techniques such as Edman degradation and subsequent visualization to detect a reduction in fluorescence, indicating a specific amino acid has been cleaved. Removal of each amino acid residue is carried out through a variety of different techniques including Edman degradation and proteolytic cleavage. In some embodiments, the techniques include using Edman degradation to remove the terminal amino acid residue. In other embodiments, the techniques involve using an enzyme to remove the terminal amino acid residue. These terminal amino acid residues may be removed from either the C terminus or the N terminus of the peptide chain. In situations in which Edman degradation is used, the amino acid residue at the N terminus of the peptide chain is removed.
[0074] In some aspects, the methods of sequencing or imaging the peptide sequence may comprise immobilizing the peptide on a surface. The peptide may be immobilized using an cysteine residue, the N terminus, or the C terminus. In some embodiments, the peptide is immobilized by reacting the cysteine residue with the surface. In some embodiments, the present disclosure contemplates immobilizing the peptides on a surface such as a surface that is optically transparent across the visible spectra, the infrared spectra, or a combination thereof possesses a refractive index between 1.3 and 1.6, is between 10 to 50 nm thick, is
chemically resistant to organic solvents as well as strong acid such as trifluoroacetic acid, or any combination thereof. A large range of substrates (like fluoropolymers (Teflon-AF (Dupont), Cytop® (Asahi Glass, Japan)), aromatic polymers (polyxylenes (Parylene, Kisco, Calif.), polystyrene, polymethmethylacrytate) and metal surfaces (Gold coating)), coating schemes (spin-coating, dip-coating, electron beam deposition for metals, thermal vapor deposition and plasma enhanced chemical vapor deposition) and functionalization methodologies (polyallylamine grafting, use of ammonia gas in PECVD, doping of long chain end-functionalized fluorous alkanes etc) may be used in the methods described herein as a useful surface. A 20 nm thick, optically transparent fluoropolymer surface made of Cytop® may be used in the methods described herein. The surfaces used herein may be further derivatized with a variety of fluoroalkanes that will sequester peptides for sequencing and modified targets for selection. Alternatively, an aminosilane modified surfaces may be used in the methods described herein. In other embodiments, the methods described herein may comprise immobilizing the peptides on the surface of beads, resins, gels, quartz particles, glass beads, or combinations thereof. In some non-limiting examples, the methods contemplate using peptides that have been immobilized on the surface of Tentagel® beads, Tentagel® resins, or other similar beads or resins. The surface used herein may be coated with a polymer, such as polyethylene glycol. In other embodiments, the surface is amine functionalized. In other embodiments, the surface is thiol functionalized.
[0075] Each of these sequencing techniques involves imaging the peptide sequence to determine the presence of one or more labeling moiety on the peptide sequence. In some embodiments, these images are taken after each removal of an amino acid residue and used to determine the location of the specific amino acid in the peptide sequence. In some embodiments, the methods can result in the elucidation of the location of the specific amino acid in the peptide sequence. These methods may be used to determine the locations of specific amino acid residues in the peptide sequence or these results may be used to determine the entire list of amino acid residues in the peptide sequence. The methods may involve determining the location of one or more amino acid residues in the peptide sequence and comparing these locations to specific peptide sequences and determining the entire list of amino acid residues in the peptide sequence.
[0076] In some aspects, the methods may comprise labeling one or more additional amino acid residues which do not contain a post translational modification. These amino
acids may be labeled with a labeling moiety which is different from the label used to label the amino acid residue containing the post translational modification. If more than one position on the peptide is labeled, it is contemplated that the amino acids are labeled in the following order: cysteine, lysine, N terminus, C terminus, amino acids with carboxylic acid groups on the side chain, tryptophan, or any combination thereof. It is contemplated that one or more of these particular amino acids may be labeled or all of these amino acid residues may be labeled with different labels.
[0077] In some aspects, the imaging methods used in the sequencing techniques may involve a variety of different methods such as fluorimetry and fluorescence microscopy. The fluorescent methods may employ such fluorescent techniques such as fluorescence polarization, Forster resonance energy transfer (FRET), or time-resolved fluorescence. In some embodiments, fluorescence microscopy may be used to determine the presence of one or more fluorophores in the single molecule quantity. Such imaging methods may be used to determine the presence or absence of a label on a specific peptide sequence. After repeated cycles of removing an amino acid residue and imaging the peptide sequence, the position of the labeled amino acid residue can be determined in the peptide.
II. Post Translational Modifications
[0078] In some aspects, the present methods comprise labeling and determining the presence and position, location, quantity, type of a post translational modification of a peptide sequence, or any combination thereof. Post translational modifications are used to refer to a covalent modification of a protein or peptide through enzymatic or non-enzymatic modification of the protein or peptide. As used herein, the post translational modification includes both natural as well as non-natural modifications. Post translational modifications may be used to describe a variety of different types of covalent modifications including a modification to the side chain of an amino acid or cleaving of peptide (or amide) bonds, or as a result of oxidative stress. Often post translational modifications are attached to the side chain of an amino acid. These side chains of amino acids which contain a nucleophilic side chain are often the site of a post translational modification. The side chains of amino acids, which may be modified, include nucleophilic sites such as the hydroxyl groups of amino acids serine, threonine, and tyrosine, the amine group of amino acids lysine, arginine, and histidine, the thiol group of cysteine, and the carboxylic acid group of aspartate and glutamine.
[0079] Some non-limiting examples of post translational modifications include addition of a hydrophobic group such as alkylation which may be used to introduce one or more alkyl such as methyl groups, acylation which may be used to introduce one or more acyl group such as acetylation, formylation, or acylation with a fatty acid, or prenylation which introduces a isoprenoid group. Other post translational modifications may include the introduction of a cofactor or translation factors such as a flavin moiety, a heme moiety, lipoylation, or diphthamide formation. Other post translation modification may comprise the introduction of another protein such as SUMOylation, which attaches a SUMO protein, or ubiquitination, which attaches the protein ubiquitin.
[0080] Post translational modifications may further comprise the introduction of a chemical group to an existing amino acid residue. Some non-limiting examples of chemical groups which can be used to modify an amino acid residue include acylation, alkylation, amide bond formulation, carboxylation, glycosylation, hydroxylation, iodination, phosphorylation, nitrosylation, sulfmylation, sulfenylation, sulfation, or succinylation. In some embodiments, the present methods may be used to determine the presence of one or more of these post translational modifications. In some embodiments, the post translational modification is an alkylation specifically a methylation to introduce a mono, di or trimethylamine group to the side chain of the lysine residue. In other embodiments, the post translational modification is the phosphorylation of a hydroxyl group on tyrosine, threonine, or serine residue especially a threonine or a serine residue. In still another embodiment, the post translational modification is a glycosylation of a nitrogen or oxygen atom in the side chain of an amino acid.
[0081] The peptides or proteins with a post translational modification described herein may be obtained from a biological sample. These biological samples may be obtained from an animal or plant source. One potential animal source is a mammal source such as a sample obtained from a human. The human source may be obtained from a baby, an adolescent, or an adult human. These biological samples may include cell-free samples. A cell-free sample may be a sample which is free of cells, substantially free of cells or essentially free of cells. A cell-free biological sample may include a protein(s), peptide(s), amino acid(s), a nucleic acid molecule(s) (e.g., ribonucleic acid molecule or deoxyribonucleic acid molecule), or any combination thereof. While a sample may be denoted as cell-free, the sample may contain a small number of cells or cell debris while still being considered cell-
free. For example, these samples may include less than or equal to about 50 cells or fewer per milliliter of sample, 45 cells per milliliter, 40 cells per milliliter, 35 cells per milliliter, 30 cells per milliliter, 25 cells per milliliter, 20 cells per milliliter, 15 cells per milliliter, 10 cells per milliliter, 5 cells per milliliter, 1 cell per milliliter, or less. In some embodiments, these samples may include greater than or equal to about 1 cell per milliliter, 5 cells per milliliter, 10 cells per milliliter, 15 cells per milliliter, 20 cells per milliliter, 25 cells per milliliter, 30 cells per milliliter, 35 cells per milliliter, 40 cells per milliliter, 45 cells per milliliter, 45 cells per milliliter, 50 cells per milliliter, or more. Such cell-free samples may include blood (e.g., whole blood), serum, plasma, saliva, urine, or mucous, for example.
III. Definitions
[0082] As used herein, the term“amino acid” in general refers to organic compounds that contain at least one amino group,— NFh which may be present in its ionized form,— NH3 +, and one carboxyl group,— COOH, which may be present in its ionized form,— COO . where the carboxylic acids are deprotonated at neutral pH, having the basic formula of NH2CHRCOOH. An amino acid and thus a peptide has an N (amino)-terminal residue region and a C (carboxy)-terminal residue region. Types of amino acids include at least 20 that are considered“natural” as they comprise the majority of biological proteins in mammals and include amino acid such as lysine, cysteine, tyrosine, threonine, etc. Amino acids may also be grouped based upon their side chains such as those with a carboxylic acid groups (at neutral pH), including aspartic acid or aspartate (Asp; D) and glutamic acid or glutamate (Glu; E); and basic amino acids (at neutral pH), including lysine (Lys; L), arginine (Arg; N), and histidine (His; H).
[0083] As used herein, the term“terminal” is referred to as singular terminus and plural termini.
[0084] As used herein, the term“side chains” or“R” refers to unique structures attached to the alpha carbon (attaching the amine and carboxylic acid groups of the amino acid) that render uniqueness to each type of amino acid. R groups have a variety of shapes, sizes, charges, and reactivities, such as charged polar side chains, either positively or negatively charged, such as lysine (+), arginine (+), histidine (+), aspartate (-) and glutamate (-), amino acids can also be basic, such as lysine, or acidic, such as glutamic acid; uncharged polar side chains have hydroxyl, amide, or thiol groups, such as cysteine having a chemically reactive side chain, i.e. a thiol group that can form bonds with another cysteine, serine (Ser)
and threonine (Thr), that have hydroxylic R side chains of different sizes; asparagine (Asn), glutamine (Gln), and tyrosine (Tyr); Non-polar hydrophobic amino acid side chains include the amino acid glycine; alanine, valine, leucine, and isoleucine having aliphatic hydrocarbon side chains ranging in size from a methyl group for alanine to isomeric butyl groups for leucine and isoleucine; methionine (Met) has a thiol ether side chain, proline (Pro) has a cyclic pyrrolidine side group. Phenylalanine (with its phenyl moiety) (Phe) and typtophan (Trp) (with its indole group) contain aromatic side groups, which are characterized by bulk as well as nonpolarity.
[0085] Amino acids can also be referred to by a name or 3-letter code or 1 -letter code, for example, Cysteine; Cys; C, Lysine; Lys; K, Tryptophan; Trp; W, respectively.
[0086] Amino acids may be classified as nutritionally essential or nonessential, with the caveat that nonessential vs. essential may vary from organism to organism or vary during different developmental stages. Nonessential or conditional amino acids for a particular organism is one that is synthesized adequately in the body, typically in a pathway using enzymes encoded by several genes, as substrates allow for protein synthesis. Essential amino acids are amino acids that the organism is not unable to produce or not able to produce enough naturally, via de novo pathways, for example lysine in humans. Humans obtain essential amino acids through their diet, including synthetic supplements, meat, plants and other organisms.
[0087] “Unnatural” amino acids are those not naturally encoded or found in the genetic code nor produced via de novo pathways in mammals and plants. They can be synthesized by adding side chains not normally found or rarely found on amino acids in nature.
[0088] As used herein, b amino acids, which have their amino group bonded to the b carbon rather than the a carbon as in the 20 standard biological amino acids, are unnatural amino acids. A common naturally occurring b amino acid is b-alanine.
[0089] As used herein, the term the terms“amino acid sequence”,“peptide”,“peptide sequence”,“polypeptide”, and“polypeptide sequence” are used interchangeably herein to refer to at least two amino acids or amino acid analogs that are covalently linked by a peptide (amide) bond or an analog of a peptide bond. The term peptide includes oligomers and polymers of amino acids or amino acid analogs. The term peptide also includes molecules
that are commonly referred to as peptides, which generally contain from about two (2) to about twenty (20) amino acids. The term peptide also includes molecules that are commonly referred to as polypeptides, which generally contain from about twenty (20) to about fifty amino acids (50). The term peptide also includes molecules that are commonly referred to as proteins, which generally contain from about fifty (50) to about three thousand (3000) amino acids. The amino acids of the peptide may be /.-amino acids or //-amino acids. A peptide, polypeptide or protein may be synthetic, recombinant or naturally occurring. A synthetic peptide is a peptide that is produced by artificially in vitro.
[0090] As used herein, the term“subset” refers to the A-terminal amino acid residue of an individual peptide molecule. A“subset” of individual peptide molecules with an N- terminal lysine residue is distinguished from a“subset” of individual peptide molecules with an A-terminal residue that is not lysine.
[0091] As used herein the term“substituted” may refer to a compound in which one or more hydrogen atoms on the parent molecule has been replaced with another group such that the group does not substantially alter the essential function for which the compound. More specifically, the term“substituted” means that the referenced group may be substituted with one or more additional group(s) individually and independently selected from alkyl, cycloalkyl, aryl, heteroaryl, heterocycloalkyl, -OH, alkoxy, aryloxy, alkylthio, arylthio, alkylsulfoxide, arylsulfoxide, alkylsulfone, arylsulfone, -CN, alkyne, Ci-C6alkylalkyne, halo, acyl, acyloxy, -CO2H, -C02-alkyl, nitro, haloalkyl, fluoroalkyl, and amino, including mono- and di-substituted amino groups (e.g. -NH2, -NHR, -N(R)2), and the protected derivatives thereof. By way of example, a substituent may be LSRS, wherein each Ls is independently selected from a bond, -0-, -C(=0)-, -S-, -S(=0)-, -S(=0)2-, -NH-, -NHC(O)-, - C(0)NH-, S(=0)2NH-, -NHS(=0)2, -OC(0)NH-, -NHC(0)0-, -(Ci-Cealkyl)-, or -(C2- Cealkenyl)-; and each Rs is independently selected from among H, (Ci-C6alkyl), (C3- C8cycloalkyl), aryl, heteroaryl, heterocycloalkyl, and Ci-C6heteroalkyl. The protecting groups that may form the protective derivatives of the above substituents are found in sources such as Greene and Wuts, above. A non-limiting list of possible chemical groups includes -OH, -F, -Cl, -Br, -I, -NH2, -NO2, -CO2H, -CO2CH3, -CO2CH2CH3, -CN, -SH, -OCH3, -OCH2CH3, -C(0)CH3, -NHCH3, -NHCH2CH3, -N(CH3)2, -C(0)NH2, -C(0)NHCH3, -C(0)N(CH3)2, -OC(0)CH3, -NHC(0)CH3, -S(0)2OH, or -S(0)2NH2.
[0092] As used herein, the term“fluorescence” refers to the emission of visible light by a substance that has absorbed light of a different wavelength. In some embodiments, fluorescence provides a non-destructive way of tracking, analyzing, or a combination of tracking and analyzing biological molecules based on the fluorescent emission at a specific wavelength. Proteins (including antibodies), peptides, nucleic acid, oligonucleotides (including single stranded and double stranded primers) may be“labeled” with a variety of extrinsic fluorescent molecules referred to as fluorophores.
[0093] As used herein, sequencing of peptides“at the single molecule level” refers to amino acid sequence information obtained from individual (i.e. single) peptide molecules in a mixture of diverse peptide molecules. The present disclosure may not be limited to methods where the amino acid sequence information obtained from an individual peptide molecule is the complete or contiguous amino acid sequence of an individual peptide molecule. In some embodiment, it is sufficient that partial amino acid sequence information is obtained, allowing for identification of the peptide or protein. Partial amino acid sequence information, including for example the pattern of a specific amino acid residue (i.e. lysine) within individual peptide molecules, may be sufficient to uniquely identify an individual peptide molecule. For example, a pattern of amino acids such as X-X-X-Lys-X-X-X-X-Lys-X-Lys, which indicates the distribution of lysine molecules within an individual peptide molecule, may be searched against a specific proteome of a given organism to identify the individual peptide molecule. It is not intended that sequencing of peptides at the single molecule level be limited to identifying the pattern of lysine residues in an individual peptide molecule; sequence information for any amino acid residue (including multiple amino acid residues) may be used to identify individual peptide molecules in a mixture of diverse peptide molecules.
[0094] As used herein,“single molecule resolution” refers to the ability to acquire data (including, for example, amino acid sequence information) from individual peptide molecules in a mixture of diverse peptide molecules. In one non-limiting example, the mixture of diverse peptide molecules may be immobilized on a solid surface (including, for example, a glass slide, or a glass slide whose surface has been chemically modified). In one embodiment, this may include the ability to simultaneously record the fluorescent intensity of multiple individual (i.e. single) peptide molecules distributed across the glass surface. There are numerous optical devices that can be applied in this manner. For example, a conventional
microscope equipped with total internal reflection illumination and an intensified charge- couple device (CCD) detector is available (see Braslaysky el al, 2003). Imaging with a high sensitivity CCD camera allows the instrument to simultaneously record the fluorescent intensity of multiple individual (i.e. single) peptide molecules distributed across a surface. In one embodiment, image collection may be performed using an image splitter that directs light through two band pass filters (one suitable for each fluorescent molecule) to be recorded as two side-by-side images on the CCD surface. Using a motorized microscope stage with automated focus control to image multiple stage positions in the flow cell may allow millions of individual single peptides (or more) to be sequenced in one experiment.
[0095] Attribution probability mass function— for a given fluorosequence, the posterior probability mass function of its source proteins, i.e. the set of probabilities P(pi/fi) of each source protein pi, given an observed fluorosequence fi.
III. Examples
[0096] The following examples are included to demonstrate certain embodiments of the disclosure. The techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the disclosure. However, in light of the present disclosure, many changes can be made in the specific embodiments which are disclosed to still obtain a like or similar result without departing from the spirit and scope of the disclosure.
EXAMPLE 1 - MAPPING THE POSITIONS OF POST-TRANSLATIONAL PHOSPHORYLATION ON PROTEINS AT SINGLE MOLECULE SENSITIVITY.
Materials and methods
[0097] Labeling protocol for phosphorylation peptide synthesis and purification - All peptides were synthesized with standard Fmoc chemistry using an automated solid-phase peptide synthesizer (Liberty Blue microwave peptide synthesizer; CEM Corporation). The standard Fmoc-amino acid building blocks and the Fmoc-O-benzylphosphoserine (Cat #: 03734) were purchased from Chemlmpex Inc (IL, USA). The peptides were cleaved and de- protected using acid cleavage cocktail, comprising TFA:water:triisopropylsilane (9.5:0.25:0.25 v:v:v mixture). After removal of TFA by drying with nitrogen, the peptide was precipitated with cold ether and centrifuged for 10 mins at 8000 ref. The pellet was resuspended in acetonitrile/water (1 : 1 v:v mixture) and purified by high-performance liquid
chromatography (Shimadzu Inc.) with an Agilent® Zorbax® column (4.6 c 250 mm) operating at 10 mL/min flow rate with a gradient of 5-95% methanol (0.1% formic acid) over 90 minutes. The fraction containing the peptide was collected, and the volume reduced using a rotary evaporator before lyophilization.
[0098] Synthesis of Dye-thiol reagent - 3 mg of Atto 647N-NHS (Cat#: AD647N35; Atto-tec) was mixed with 150 pL basic cysteamine solution (5.1 mg cysteamine and 7.5 pL DIPEA in 1500 pL dry DMF). The mixture was incubated for 3 h and the Atto647N-S-S- Atto647N product was confirmed by mass spectrometry (Scheme 1). The product was aliquoted into glass vials, each containing 200 pg of the reagent. Single dye-thiol reagent Atto647N-SH was prepared by reacting the Atto647N-S-S-Atto647N reagent with 1 mM tris(2-carboxyethyl)phosphine (TCEP) and incubating it for 1 h at 60 °C.
[0099] Labeling phosphate groups with dye-thiol reagent - Phosphorylated peptide was solubilized in 100 pL mixture of acetonitrile and water (1 : 1 v:v). To this solution, 46 pL of saturated barium hydroxide and 4 pL of 4M sodium hydroxide was added and incubated for 3h at room temperature. 100 pL of DMF, 100 pL of water and 1.4 mg of TCEP was then added to the peptide solution. The entire mixture was transferred to the 200 pg of the dye- thiol reagent and incubated overnight. The TCEP addition to break the disulfide linkage in the dye-thiol reagent can be performed prior to the addition of the dye-thiol reagent to the mixture. The entire contents of the reaction was then diluted to 2 mL with acetonitrile/water mixture (1 : 1 v:v), and HPLC separated (as above). The fluorescent fractions, monitored at 640 nm absorbance by the diode-array detector on HPLC, were then collected, as they correspond to the phosphorylated peptide. Two signature peaks present at retention time of 54 and 55 mins, and corresponds to the unreacted dye-thiol reagent, were not collected. Following HPLC purification, labeled phosphorylated peptide was lyophilized. The N- termini of the peptides were protected by tert-Butyloxy carbonyl (“Boc”) protecting group by solubilizing the labeled peptide in DMF and incubating the mixture with te/7-Butyl N- succinimidyl carbonate overnight. The solution was diluted and aliquoted into 200 pg or 2 mM.
[00100] Detection of labeled peptides - Labeled peptides were detected as in Swaminathan et al, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461,034; U.S. Patent Application Serial No. 15/150,962 with minor modification. These minor modifications are: (a) The peptides were immobilized on the solid substrate via the
peptide’s carboxyl-terminal to an amine functionalized glass slides (b) Prior to the experimental cycle, the“Boc” group protecting the amine termini of the peptide was de- protected by incubating the immobilized peptides with 90% Trifluoroacetic acid for 5 h at 40 °C. (c) 1 mM of Trolox (6-Hydroxy-2,5,7,8-tetramethylchroman-2-carboxylic acid) dissolved in methanol was used as the imaging buffer.
Additional Labeling Strategies for Pan Phosphorylation Labeling
[00101] The phosphate group present on any modified amino acids (Serine, Threonine, Tyrosine, Histidine) can be labeled by the EDC/Imidazole reaction mechanism (shown in Scheme 1). The reaction has been described for oligonucleotides and can also be used for labeling pyrophosphates on amino acids as well and has been adapted from Wang el al. , 1993. The phosphorylated peptide is reacted with 0.1 M imidazole, 0.1 M EDC and 0.25 M of donor amine (fluorophore) in pH 7.5 buffer such as PBS buffer (e.g., <10 mM). The reaction is kept at 50 °C for 20 minutes. The labeled peptide is subsequently purified and sequenced by single molecule sequencing method. Scheme 1: Pan Modification of Phosphorylated Amino Acid Residues
Phosphoryiated
Results and Discussion
[00102] Beta elimination and Michael addition of a fluorophore via thiol conjugation has been described to fluorescently label phosphorylated peptides (Stevens el al, 2005; U.S. Patent No. 7,476,656). However, a suitable thiol dye reagent for use in fluorosequencing, such as the Atto647N-thiol dye reagent, which contains both a sequencing suitable dye and an appropriate functional group handle, is not readily accessible. Therefore, Atto647N-S-S-Atto647N was synthesized by reaction of Atto647N-NHS with cysteamine
(Scheme 2). This reaction was carried out in non-reducing and anhydrous conditions, as the presence of water can hydrolyze the NHS dye and lead to significant reduction in the reaction yield.
Scheme 2: Preparation of Atto647N-S-S-Atto647N
[00103] To verify and optimize the labelling and fluorosequencing procedure, three phosphorylated variants of a heptad peptide were synthesized: YpSPTSPS, YSPTpSPS,
and YpSPTpSPS, where pS is a phosphoserine. These heptads were then labeled by beta elimination followed by Michael addition, to fluorescently and covalently label phosphorylated serine residues with the Atto647N-thiol dye (see Scheme 3).
Scheme 3: Preparation of Labeled Phosphorylated Serine Residue
R=Atto647N
[00104] The labeled heptads were then purified by HPLC and immobilized on an aminosilane glass surface for sequencing by fluorosequencing as described in Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461034; U.S. Patent Application Serial No. 15/150,962; each incorporated herein by reference. As described, the fluorosequencing for a uniform population of peptides can be best described by a frequency histogram. By imaging and aligning individual peptide molecules following an Edman degradation cycle, the counts of the peptide molecules that have lost their fluorescence after the Edman cycle can be obtained. Then, by tallying the counts of peptides which lost fluorescence as a function of the Edman cycle, a frequency histogram can be obtained. By subtracting the background counts, which occur due to photobleaching and dye-losses, the counts for the significant loss events can be represented (FIG. 1). As is evident from FIG. 1, there are reductions in peptide fluorescence after the 2nd Edman cycle, corresponding to the phosphoserine in the 2nd position of the peptide, and after
the 5th Edman cycle, corresponding to the phosphoserine at the 5th position. These results indicate that thiol conjugation of a fluorescent label, and subsequent additional fluorosequencing cycles, can be used map the positions of post-translational phosphorylation modifications on proteins.
[00105] An example of the method used for identifying phosphorylated residues of proteins extracted from cells is described herein. Human Embryonic kidney 293 transgenic (HEK-293T) cells were cultured and lysed using a modified RIPA buffer. Proteins were quantified and isolated from the cell lysate prior to labeling. Proteins were then denatured, and digested with the protease trypsin at a 1:50 ratio of trypsin enzyme to protein. Following digestion, a 10 kDa filter was used to filter out peptides. All phosphorylated serines and threonines in solution were then labeled using the following techniques. Phosphorylated residues were converted to the beta-eliminated variants using Ba(OH)2. A Michael addition reaction was then used to couple the fluorophore Atto 647N with a thiol modification to the beta-eliminated resides. Fluorescently labeled peptides were then purified and lyophilized.
[00106] Purified peptide samples were coupled onto an amine functionalized slide surface and sequenced on the fluorosequencing platform. Counts of fluorescent drops across all amino acid positions were taken for the sequenced sample. This experiment was repeated with a different biological sample of the same cell type (HEK-293T) which was prepared and sequenced in an identical manner, serving as a source of biological replicate. These samples were sequenced and the counts of fluorescent drops across all amino acid positions were obtained. The counts from the first biological sample and the second biological sample were then plotted against each other to make the plot shown in FIG. 2. Consistent patterns denote the multiple phosphorylated residues on proteins obtained from the cell and can serve as a profile of a cell’s phosphorylation status. The quantitative nature of the results spanning four orders of magnitude suggests the use for quantitative phosphoproteomics.
EXAMPLE 2 - MAPPING THE POSITIONS OF POST-TRANSLATIONAL GLYCOSYLATION ON PROTEINS AT SINGLE MOLECULE SENSITIVITY.
Materials and methods
[00107] Synthesis of 1,3-dithiol modified fluorophore Lipoic acid was reacted with te/7-butyl (2-aminoethyl)carbamate using AOV'-dicYclohexylcarbodiimide (Scheme 4). The Boc protecting group was then removed by dissolving the sample in trifluoroacetic acid (TFA) and precipitating with diethyl ether. The product of this reaction, 5-[l,2]dithiolan-3-yl- pentanoic acid (2-amino-ethyl)-amide was then purified by HPLC (as above).
Scheme 4: Preparation of Flourophore Containing Dithiol
[00108] The 5-[l,2]dithiolan-3-yl-pentanoic acid (2-amino-ethyl)-amide product was then coupled with NHS activated tetramethylrhodamine (TMR) by dissolving 9.5 mg of 5-[l,2]dithiolan-3-yl-pentanoic acid (2-amino-ethyl)-amide with 10 mg of the
NHS-TMR dissolved in 400 pL of an 8 mM solution of DIPEA in dimethylformamide and shaking overnight (Scheme 3). The product of this reaction was purified by HPLC (as above
this l,2-dithiolane product then had the dithiolane group reduced to 1, 3-dithiol using tris(2- carboxyethyl)phosphine (TCEP) in order to form the reactive moiety for coupling to aldehydes (Scheme 3).
[00109] Conversion of 1,2-diols in sugars to aldehydes - /V-acetyl-D- glucosamine will be treated with sodium periodate (Scheme 5) and the cleavage of the 1,2- diols will be verified with LCMS and NMR. Glycosylated peptides will be treated identically, to cleave the l,2-diol groups and prepare the glycosylated peptides for fluorophore binding.
Scheme 5: Conversion of 1,2-diols into dialdehydes
Results and discussion
[00110] Fluorosequencing allows for low abundance variations of protein/peptide molecules to be identified and is described in Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461034; U.S. Patent Application Serial No. 15/150,962. This method relies on specific labeling of amino acids with fluorophores to determine its position in the peptide chain. This method can be similarly extended to identify the positions of modified amino acids by use of sugar specific fluorophores.
[00111] The concept for labeling glyocosylated amino acids is a two-step process. The first step oxidizes the alcohol groups of sugar moieties to aldehydes. The second step then reacts the dithiol reagent with the aldehyde group of the sugar molecule. It has been shown that l,3-dithiane does not degrade when exposed to sequencing conditions, thus the inventors identified ways to modify fluorophores to have a 1, 3-dithiol tether to label glycosylated amino acids.
[00112] Preparation of 1,3-dithiol tethered fluorophore - Lipoic acid was determined to be an excellent candidate for the coupling chemistry as it has a protected 1,2- dithiolane at one terminus, and a carboxylic acid on the other. The lipoic acid and NHS activated tetramethylrhodamine (TMR) were reacted according to Scheme 4, in order to
generate a 1, 3-dithiol modified fluorophore. This 1, 3-dithiol modified fluorophore (Scheme 4, compound 10) is ready to react with glycosylated peptides to form the Edman stable 1,3- dithiane. It is important to note that this method may be used to link any NHS activated fluorophore, such as Atto657N or others, to a 1, 3-dithiol tether.
[00113] Conversion of 1,2-diols in sugars to aldehydes - To confirm the viability of using sodium periodate to oxidatively cleave l,2-diols to aldehydes while preserving the rest of the sugar structure, A-acetyl- -glucosamine was selected. A-acetyl- - glucosamine will be treated with sodium periodate (Scheme 5) and the cleavage of the 1,2- diols will be verified with LCMS and NMR. Interestingly, the l,2-diol on the ring of A- Acetyl-D-glucosamine will produce two aldehydes covalently bound to each other (Scheme 5). This increases the opportunity to attach the fluorophore to the oxidized species, and may potentially lead to two fluorophores being attached at the same position of the peptide, thus increasing the brightness in scope and potentially aiding in the fluorosequencing of gly copeptides.
[00114] Fluorosequencing determination of glycosylated amino acids - It is thought that this scheme of oxidatively cleaving the l,2-diols may then be applied to glycoproteins and glycopeptides to provide a substrate for fluorophore binding. Following fluorophore binding, these bound glycoproteins or glycopeptides can be sequenced by fluorosequencing. Fluorosequencing may be performed as above, in order to determine the location of the labeled glycosylated residue(s). This labelling and sequencing scheme is invariant to the type of glycosidic linkages, and provides a de novo method for determining the positions of the glycosylated residues on known protein or peptides.
EXAMPLE 3 - MAPPING THE POSITIONS OF POST-TRANSLATIONAL LYSINE
TRIMETHYLATION AT SINGLE MOLECULE SENSITIVITY.
Materials and Methods
[00115] Synthesis of Dye-thiol reagent As prepared for detection of post- translational phosphorylation, 3 mg of Atto 647N-NHS (Cat#: AD647N35; Atto-tec) was mixed with 150 pL basic cysteamine solution (5.1 mg cysteamine and 7.5 pL DIPEA in 1500 pL dry DMF). The mixture was incubated for 3 h and the Atto647N-S-S-Atto647N product was confirmed by mass spectrometry (FIG. 1). The product was aliquoted into glass vials, each containing 200 pg of the reagent. Single dye-thiol reagent Atto647N-SH was prepared
by reacting the Atto647N-S-S-Atto647N reagent with 1 mM tris(2-carboxyethyl)phosphine (TCEP) and incubating it for 1 h at 60 °C.
[00116] Hofmann elimination and reaction of peptides with fluorophore - Adapting the techniques used in the Hofmann elimination reaction, and from Brown et al, 1997, the peptides will be treated with heat and silver oxide or DIPEA in order to generate an alkene at trimethylated lysine residues (Scheme 6). These alkene containing peptides can then be reacted with a thiol-linked fluorophore such as Atto647N-SH as described above to generate peptides labeled with a fluorophore at sites of lysine trimethylation.
Scheme 6: Labeling of Trimethylated Amino Acid Residues
Peptide R = Fluorophore
Expected Results
[00117] Fluorosequencing has been shown to precisely map the positions of fluorescently labeled amino acid residues on peptides at a sensitivity of a single molecule, and may be useful for the identification of lysine trimethylation as described in Swaminathan, 2010; U.S. Patent No. 9,625,469; U.S. Patent Application Serial No. 15/461034; U.S. Patent
Application Serial No. 15/150,962. The specific attachment of a fluorophore to the trimethylated lysine residues would extend the fluorosequencing technology to map the trimethylation marks on the histone proteins, thereby aiding in the identification of the histone code. [00118] Hofmann elimination chemistry may be used to modify the trimethylated lysine residue to a reactive alkene group, which would allow for efficient labeling with a fluorophore containing a thiol group as described above. The labeled peptides may then be sequenced by the fluorosequencing method to obtain the positions of the trimethylated lysines at single molecule resolution.
EXAMPLE 4 - MAPPING THE POSITIONS OF POST-TRANSLATIONAL NITROSYLATION AT SINGLE MOLECULE SENSITIVITY.
[00119] Nitric oxide (NO) is a cell-signaling molecule that is synthesized by a family of enzymes known as nitric oxide synthetases. NO can react with metalloproteins or covalently modify tyrosine and cysteine residues through oxidation or production of reactive nitrogen species. Nitrosylation is this category of post-translational modification that produce a covalent addition of L'-nitrosylation on cysteines or nitration on tyrosine residues (See Scheme 7). Detecting and quantifying the modification have implications for better understanding of the signaling processes during stress or inflammation and developing diagnostics (Abello et al, 2009). The use of peptide mass-spectrometry for identifying the sites of nitrosylation is challenging due to - (a) unstable nature of the nitro groups and (b) the extremely low abundant modification (estimated 1 in 106 tyrosine residues) (Zhan et al. , 2015). Thus, single molecule fluorosequencing method would provide the ideal solution to detecting and quantifying low levels of nitrosylation modifications on tyrosines or cysteines. Scheme 7: Formation of Nitrosylated Amino Acids
Cysteine- S-Nitrosyiation
Tyrosine - Nitration
(8)
[00120] Similar to the principles used for quantifying sites of other post- translational modifications by fluorosequencing, the labeling reactions specifically targeting the nitrosyl modifications has been developed. The strategies for targeting the two different types of nitrosyl modifications are described below.
A. Cysteine - -nitrosylation
[00121] Bioorthogonal labeling of SNO modification has been demonstrated by organophosphine based reactions (Devarie-Baez el al, 2013) with a one-step disulfide formation. Using the same reaction principle, a one-step reaction of covalently attaching a fluorophore (reagent 2B) to the L'-nitrosylated cysteine residue proposed in Scheme 7. The class of reagent comprises the organophosphine group with terminal handles (alkyne, azides) or fluorophore reagent. A two-step reaction, first with a non-fluorescing reagent followed by a fluorophore reaction to the terminal handle would produce S-nitrosyl specific fluorophore conjugate addition. A general overview of the techniques involved in modifying these amino acids are:
1. Protein/peptide isolation: Proteins are harvested from the cells using protocols common in molecular biology (Lee, 2017) and digested into peptides by common proteases, such as trypsin or GluC. In some scenarios it is feasible to fix cells by treating it with cold methanol (-20 °C) or other methods of cell fixation. Following fixation, the cells may be directly reacted with the reagent to label surface accessible PTM.
2. Blocking free thiols: In order to carry out the L'-nitrosylation labeling reaction, the free thiols present on cysteine should be blocked. Two common reagents used in the procedure are iodoacetamide and A-methy 1 mal ei mi de. 2-20 mM of the reagent is used at pH 7.5 buffer in order to block thiols on the peptides.
3. Labeling the SNO group: Up to 3 mM of reagent (with or without fluorophore) is incubated with the peptides or fixed cells for from about 30 mins to about 2 hours at room temperature. The excess reagent is separated by rinsing/HPLC separation or other methods such as dialysis.
4. Fluorosequencing: Fluorosequencing is performed on the fluorescently labeled peptides.
Scheme 8: Labeling of Nitrosylated Tyrosine
Schematic of the techniques for labeling 3-nitrotyrosine residue in peptides or proteins with fluorophore. The (1) nitrated tyrosine (shown in this example as the /V-terminal residue) is reacted with NHS-acetate that acetylates all the free amines present on the peptide (2). Addition of Heme/DTT under boiling conditions converts the nitro group into an amine moiety (3). This amine group reacts with fluorophore - succinimidyl ester to covalently label the 3-nitrotyrosine residue (4). The fluorescently labeled peptide can now be subjected to fluorosequencing for analysis.
[00122] This method can thus localize the residues of modification and quantify the stoichiometry of PTM labeling of the cysteine residue. Other variants of ligation of fluorophore with the intermediate phosphine adduct can be performed such as dehydroalanine formation as indicated in literature (Devari e-Baez el al, 2013).
B. Tyrosine nitration:
[00123] The common chemical derivatization strategy for nitrotyrosine, used in mass-spectrometry proteomics is a two-step process. The first step is the reduction of the nitro group to the amino group followed by covalently labeling the amino group with a specialized reagent. Prior to this step, the other amino groups on the peptides/proteins are
blocked, typically by acetylation (Abello et ctl, 2010; Devarie-Baez et ctl, 2013). This strategy (See Scheme 8) can be directly adapted for labeling the nitrotyrosine group with a distinct fluorophore for fluorosequencing. A method for labeling the nitrotyrosine for fluorosequencing application is described as follows: 1. Protein/peptide isolation: The isolated proteins and peptides are solubilized in sodium phosphate buffer (pH 7.5). The digested proteins or peptides can be lyophilized prior to analysis. The approximate concentration of the peptide is 10 mM.
2. Acetylation of amines: All the free amines and other nucleophiles are acetylated by incubating 190 pL of the nitrated peptide with NHS-Acetate (final concentration of 25 mM) for 2 h at room temperature. The //-acetylations were reversed and excess reagent hydrolyzed by boiling the reaction for 15 minutes.
3. Reduction of nitrotyrosine to aminotyrosine: DTT (final concentration: 20 mM) and Hemin (25 pM) was added to the sample and incubated for 15 minutes in a boiling water bath. 4. Fluorescent labeling: Atto-NHS or other fluorophore-NHS (2 mM) was added to the solution and incubated for 2 h at room temperature. Excess dyes were removed by HPLC or other separation method prior to fluorosequencing.
Scheme 9: Labeling of Nitrosylated Cysteine
Schematic of the one-pot reaction for selective labeling of L'-nitrosylated cysteine. (A) After alkylating the free thiols, the use of an organophosphine reagent yields a disulfide linkage. (B) A generic example of a reagent with a fluorophore connected to the phosphine group is provided.
[00124] The one-pot process described in the above section is uniquely suited for localizing and quantifying the nitrotyrosine positions on peptides and proteins.
EXAMPLE 5 - MAPPING THE POSITIONS OF POST-TRANSLATIONAL
CITRULLINATION AT SINGLE MOLECULE SENSITIVITY.
[00125] Citrullination is a post-translational modification caused by enzyme Protein Arginine deiminase (PAD) where the arginine side chain is converted to citrulline (process called deimination). The conversion leads to a change in the mass by lDa, the loss of the positive charge and two potential hydrogen bond donors. The modification has a major effect on protein structure and stability and is implicated in autoimmune disorders, neurodegenerative diseases and in tumor biology (Gy orgy et al, 2006). The small mass change overlaps with the isotopic distribution of unmodified Arginine residues in peptide mass-spectrometry, making its identification challenging. Similar to the other questions in PTM, developing an assay for localizing and quantifying the low abundant citrullinated residue is important.
[00126] A chemoselective strategy for targeting citrullinated residue has been demonstrated. A phenylglyoxal reagent reacts with arginine (under basic) and citrulline (under acidic conditions) forming a five membered ring. Although under acidic conditions, the reagent additionally binds to homocitrulline and cysteine, the thiohemiacetal ring formed with cysteine is hydrolysed in neutral pH. A method has been described for fluorescently labeling citrullinated residues with rhodamine using the phenylglyoxal reagent (Bicker el al, 2012). This procedure would be adapted for fluorosequencing as follows (See Scheme 10):
1. Protein/peptide isolation: The isolated proteins are digested or the peptide is isolated according to standard well optimized procedures. About 50 mM of citrullinated peptides is lyophilized or solubilized in 50 mM HEPES buffer (pH 7.5)
2. Thiol group on cysteines are capped using iodoacetamide or fluorescent dyes, which prevents the cross-reactivity of the citrulline specific reagent. 2 mM iodoacetamide alkylates the thiol groups in the protein digest.
3. The citrulline containing peptide was incubated with 5 mM phenylglyoxal reagent and 20% Trichloroacetic acid (pH <l) for 3 hours at 37 °C.
4. The phenylglyoxal reagent can be directly coupled with a fluorophore or contain a handle (click handle) for subsequent reaction with a fluorophore.
5. The excess reagent is purified from the labeled citrullinated peptide for
fluorosequencing.
Scheme 10: Labeling of Citrullination Modified Amino Acids
Selective labeling of citrullinated residue by Rhodamine-Phenylglyoxal reagent. (A) Reaction conditions for labeling of citrullinated residue. (B) Rhodamine - phenylglyoxal reagent used for fluorescently labeling citrullinated residues for fluorosequencing.
EXAMPLE 6 - MAPPING THE POSITIONS OF POST-TRANSLATIONAL
SULFENYLATION AT SINGLE MOLECULE SENSITIVITY.
[00127] Sulfenic acid is one of a specific oxidative modification of cysteine residue which is formed upon reaction of the thiol side chain with mild oxidizing environment. The modification is a readout of early stages of reactive oxygen species formation, the intermediate step for formation of disulfide bond formation and also involved in redox signaling (Poole et al, 2004). The unstable nature of the bond under commonly used ionization conditions in mass spectrometers makes localizing and quantifying the modification extremely challenging. However, the reactive nature of the group enables chemical coupling and enrichment of the modified peptides (Poole et al. , 2007; Reddie et al. , 2008) feasible. The principle is the selective reaction of the sulfenic acid with dimedone (5,5- dimethyl-l,3-cyclohexanedione) which has been linked to several fluorescent reagents (See
Scheme 11). Additionally, a biotin labeled reagent may be used (Millipore; Cat # NS1226- 1MG).
Scheme 11: Labeling of Sulfenic Acid Modified Amino Acids
Reaction illustrating the selective labeling of sulfenic acid with l,3-cyclohexanedione reagent derivative. (A) High yielding reaction was demonstrated by using dimedone (5,5-dimethyl- l,3-cyclohexanedione). (B) An example of Rhodamine-derivative for labeling sulfenic acid modification feasible for fluorosequencing
[00128] Below is a reaction method for labeling sulfenic acid on peptides with derivatized rhodamine for fluorosequencing:
1. Protein/peptide isolation: The proteins were digested or the peptides were isolated using common standardized procedures. About 1-10 pmol peptides were lyophibzed or solubilized in phosphate buffer (pH 7; 25 mM) and 1 mM EDTA.
2. Labeling of sulfenic acid: The fluorescent reagent was added to a concentration of 5 mM and incubated for 2 h at 37 °C. The reagent can be two halves - one with an azide handle and the second with a fluorophore that specifically reacts with the linker.
3. The excess reagents and fluorophores are purified away before fluorosequencing.
[00129] There are a number of other labeling reactions involving different reagents and reaction mechanisms that have also been demonstrated (Gupta and Carroll, 2014).
EXAMPLE 7 - MEASUREMENT OF POST-TRANSLATIONAL MODIFICATION
AS A BIOMARKER.
[00130] As described above, the precise sites of post-translational modifications, such as phosphorylation state, affects the function of proteins and may serve as a reliable indicator of disease state. One such molecule, troponin, is a diagnostic biomarker for cardiac dysregulation (Wijnker et al, 2014). However, the site-specific nature of the phosphorylation is an important diagnostic and therapeutic marker for understanding and treating heart failures (Zhang et al. , 2012). Depending on the phosphorylation state and sites on the troponin molecule, the diagnosis may range from exercise to a disease state as severe as cardiac myopathy.
[00131] The methods presented above can be easily adopted to assess the phosphorylation state of a number of potential phosphorylation related biomarkers. The first step would be to perform a standard antibody pulldown for the protein of interest, i.e. troponin. Then the enriched protein may be digested into shorter peptides using a protease, such as GluC or trypsin, producing peptides of a specific length. The phosphorylation sites can then be labelled on the peptide molecules as described in Example 1. This would allow for the exact locations of the post-translational modifications to be identified and quantified by fluorosequencing, offering significant advantages over current diagnostic tests such as semi-quantitative antibody assays like those used to measure the levels of troponin or phosphorylated troponin in a sample. This methodology may also be applied to assessing the methylation or glycosylation of any protein as well, providing new biomarkers for diseases which are characterized by post-translational modifications of the proteins.
F F F
[00132] All of the methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods have been described in terms of certain embodiments, variations may be applied to the methods and in the techniques or in the sequence of techniques of the method(s) described herein without departing from the concept, spirit and scope of the disclosure. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications are deemed to be within the spirit, scope and concept of the disclosure as defined by the appended claims.
REFERENCES
The following references, to the extent that they provide procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference.
Abello et al, Talanta Analytical Proteomics, 80: 1503-1512, 2010.
Abello et al, J. Proteome Res., 8:3222-3238, 2009.
Aebersold et al., Nat Chem Biol., 14: 206-214, 2018.
Ardito et al. ntJMolMed.,; 40: 271-280, 2017. doi: l0.3892/ijmm.20l7.3036
Bicker et al. , J. Am. Chem. Soc., 134: 17015-17018, 2012.
Braslaysky et al, Proc. Natl. Acad. Sci., USA, l00(7):3960-4, 2003.
Brown et al, J. Am. Chem. Soc., 119(14): 3288-3295, 1997.
Czemik et al, Regulatory Protein Modification, Humana Press, pp. 219-250, 1997.
Devarie-Baez et al. , Methods San Diego Calif, 62: 171-176, 2013.
Du and Huang, Yi chuan = Hered., 29: 387-92, 2007.
Frese et al., J Proteome Res. 12: 1520-5, 2013.
Garcia et al, Nat Methods., 4: 487-489, 2007.
Gupta and Carroll, Acta BBA - Gen. Subj., Current Methods to Study Reactive Oxygen Species - Pros and Cons, 1840, 847-875, 2014.
Gyorgy et al., Int. J. Biochem. Cell Biol., 38: 1662-1677, 2006.
Huang and Chang, Prostate Cancer - From Bench to Bedside, Ch.8, 2011.
Korff et al, Heart, 92: 987-93, 2006.
Lee, Endocrinol. Metab., 32: 18-22, 2017.
Mondragon-Rodriguez et al, Neuropathol Appl Neurobiol.,. 40(2): 121-35, 2014.
Onder et al, Expert Rev Proteomics, 12: 499-517, 2015.
Pool Q et al., Annu. Rev. Pharmacol. Toxicol., 44:325-347, 2004.
Pool Q et al, Bioconjug. Chem., 18:2004-2017, 2007.
Reddie et aI., MoI. Biosyst., 4:521-531, 2008.
Solari et al., Mol Biosyst., 11: 1487-93, 2015.
Stevens et al, Rapid Commun Mass Spectrom., 19: 2157-2162; 2005.
Stowell et al, Annu Rev Pathol Mech Dis. 10:473-510, 2015.
Swaminathan R, Biology S. Jagannath Swaminathan. Education. doi: l0. l002/rcm.3l79, 2010
U.S. Patent Application Serial No. 15/510,962.
U.S. Patent Application Serial No. 15/461,034.
U.S. Patent No. 7,476,656.
U.S. Patent No. 9,625,469.
von Hofmann, Ann der Chemie undPharm., 78:253-286, 1851. Wagner and Carpenter, Nat Rev Mol Cell Biol., 13: 115-126, 2012. Wijnker et al., Neth Heart J., 22: 463-9, 2014.
Zhan et al., Mass Spectrom. Rev., 34:423-448, 2015.
Zhang et al., Circulation, 126: 1828-1837, 2012.
Claims
1. A method of identifying a post translational modification on an amino acid residue of a peptide or protein, the method comprising:
(A) treating the peptide or protein with a labeling reagent under conditions such that the labeling reagent interacts with the post translational modification on the amino acid residue of the peptide or protein, to covalently couple the labeling reagent or derivative thereof to the amino acid residue and yield a labeled peptide or protein; and
(B) sequencing the labeled peptide or protein.
2. The method of claim 1, wherein the post translational modification on the amino acid residue is phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation.
3. The method of either claim 1 or claim 2, wherein the labeling reagent is a fluorophore, oligonucleotide, or peptide-nucleic acid.
4. The method of claim 3, wherein the labeling reagent is a fluorophore.
5. The method according to any one of claims 1-4, wherein treating the peptide or protein with the labeling reagent comprises:
(0 reacting the peptide or protein under conditions such that the post translational modification on the peptide or protein is converted to a reactive group to form a reactive peptide or protein;
(//) reacting the labeling reagent with the reactive peptide or protein to form the labeled peptide or protein.
6. The method of claim 5, wherein the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with a base.
7. The method of claim 5, wherein the reactive peptide or protein is formed by treating the peptide or protein comprising a phosphorylation post translational modification with an activating agent and a base.
8. The method of claim 5, wherein the reactive peptide or protein is formed by treating the peptide or protein comprising a trimethyl post translational modification with silver oxide (Ag20).
9. The method of claim 5, wherein the reactive peptide or protein is formed by treating the peptide or protein comprising a glycosylation post translational modification with an oxidizing agent.
10. The method of claim 5, wherein the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with a reducing agent.
11. The method of claim 5, wherein the reactive peptide or protein is formed by treating the peptide or protein comprising a nitrosylation post translational modification with phosphine.
12. The method according to any one of claims 1-5, wherein contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a phosphine.
13. The method according to any one of claims 1-4, wherein contacting the peptide or protein with the labeling reagent comprises reacting the peptide or protein comprising a post translational modification with a glyoxal group.
14. The method according to any one of claims 1-13, wherein the sequencing comprises a fluorosequencing method.
15. The method according to any one of claims 1-14, wherein the sequencing is at a single molecular level.
16. The method of either claim 14 or claim 15, wherein the fluorosequencing method comprises labeling at least one amino acid of the peptide or protein which does not contain a post translational modification with a second labeling reagent.
17. The method according to any one of claims 1-16, wherein the peptide or protein is bound to a solid support.
18. The method according to any one of claims 1-17, wherein the fluorosequencing method further comprises removing at least one amino acid residue of the peptide or protein.
19. The method of claim 18, wherein the fluorosequencing method comprises sequentially removing amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed.
20. The method of either claim 18 or claim 19, wherein the amino acid residues are removed by Edman degradation.
21. The method of either claim 18 or claim 19, where the amino acid residue is removed by treating the TV-terminal amino acid residue with a thiourea and an acid, microwave irradiation, or heat.
22. The method of either claim 18 or claim 19, wherein the amino acid residues are removed by an enzyme.
23. The method according to any one of claims 1-22, wherein the peptide or protein is obtained from a biological sample.
24. The method of claim 23, wherein the biological sample is a cell-free biological sample.
25. The method according to any one of claims 1-24, wherein a covalent bond between the post translational modification on the amino acid residue of the peptide or protein and the labeling reagent is formed.
26. The method according to any one of claims 1-24, wherein the labeling reagent or derivative thereof is directly covalently bonded to the amino acid residue.
27. The method according to any one of claims 1-24, wherein the labeling reagent or derivative thereof is covalently coupled to the amino acid residue through an intermediary molecule.
28. A method of determining the status of a disease or disorder in a subject, the method comprising:
(A) detecting a change in a type, identity, quantity, or position of a post translational modification or a plurality of post translational modifications on a protein or peptide using the methods according to any one of claims 1-27; and
(B) determining the status of the disease or disorder in the subject according to at least said change.
29. The method of claim 28, further comprising obtaining a biological sample from the subject.
30. The method of either claim 28 or claim 29, wherein the biological sample is a cell- free biological sample.
31. The method according to any one of claims 28-30, wherein the method conveys the presence of one or more post translational modifications.
32. The method according to any one of claims 28-31, wherein the method conveys the absence of one or more post translational modifications.
33. The method according to any one of claims 28-32, wherein the subject is a mammal.
34. The method according to any one of claims 28-33, wherein the method further comprises enriching the protein before determining the type, identity, quantity, or position of the post translational modifications.
35. The method according to any one of claims 28-34, wherein the protein is immobilized on a solid support.
36. A method for determining the status of a disease or disorder in a subject, the method comprising: detecting a change in a type, identity, quantity, or position of the post translational modifications on the protein or peptide using the methods according to any one of claims 1-27 related to the disease or disorder.
37. The assay method of claim 36, further comprising obtaining a biological sample from the subject.
38. A modified peptide or protein comprising a peptide or protein comprising one or more post translational modifications, wherein at least one post translational modification of said peptide or protein comprising one or more post translational modifications is altered with at least a first labeling moiety, thereby forming a labeled peptide or protein comprising one or more post translational modifications.
39. The modified peptide or protein of claim 38, wherein at least the first labeling moiety is a fluorophore.
40. The modified peptide or protein of either claim 38 or claim 39, wherein said at least one post translational modification is selected from the group consisting of phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, trimethylation, or any combination thereof.
41. The modified peptide or protein of claim 40, wherein each post translational modification selected from the group consisting of phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation is altered by a distinct labeling moiety.
42. The modified peptide or protein according to any one of claims 38-41, wherein the peptide or protein is immobilized adjacent to a solid support.
43. A method of sequencing a peptide or protein comprising:
(A) obtaining a cell-free biological sample and separating the peptide or protein from the cell-free biological sample;
(B) labeling the peptide or protein under conditions sufficient to interact with at least one amino acid residue of the peptide or protein associated with a post translational modification with a first labeling moiety to form at least one labeled amino acid residue of the peptide or protein;
(C) subjecting the peptide or protein to conditions sufficient to remove one or more individual amino acid residues from the peptide or protein; and
(D) detecting at least one signal from the at least one labeled amino acid residue, thereby identifying the sequence of the peptide or protein.
44. The method of claim 43, wherein the post translational modification on the amino acid residue is phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation.
45. The method of either claim 43 or claim 44, wherein the first labeling moiety is a fluorophore, oligonucleotide, or peptide-nucleic acid.
46. The method according to any one of claims 43-45, wherein labeling the peptide or protein with the first labeling moiety comprises:
(0 treating the peptide or protein under conditions such that the post translational modification on the peptide or protein is converted to a reactive group to form a reactive peptide or protein;
(//) treating the first labeling moiety with the reactive peptide or protein to form a labeled peptide or protein.
47. The method according to any one of claims 43-46, wherein the sequencing comprises a fluorosequencing method.
48. The method according to any one of claims 43-47, wherein the sequencing is at a single molecular level.
49. The method of either claim 47 or claim 48, wherein the fluorosequencing method comprises labeling at least one amino acid residue of the peptide or protein with a second labeling reagent.
50. The method according to any one of claims 43-49, wherein the peptide or protein is bound to a solid support.
51. The method according to any one of claims 43-50, wherein the fluorosequencing method further comprises removing at least one amino acid residue of the peptide or protein.
52. The method of claim 51, wherein the fluorosequencing method comprises sequentially removing each amino acid residues of the peptide or protein until a labeled amino acid comprising a modified post translational modification is removed.
53. A method for polypeptide sequence identification, comprising:
(A) obtaining a first polypeptide from a cell-free biological sample of a subject;
(B) using said first polypeptide to generate a second polypeptide immobilized to a support, wherein said second polypeptide comprises labeled amino acids;
(C) subjecting said second polypeptide to conditions sufficient to remove amino acids from said polypeptide; and
(D) during or subsequent to removal of said amino acids from said polypeptide, detecting signals from at least a subset of said labeled amino acids, thereby
identifying a sequence of said second polypeptide to determine a sequence of said first polypeptide from said cell-free biological sample.
54. A method for processing or analyzing a protein or peptide containing or suspected of containing at least one post-translational modification, comprising:
(A) sequencing said protein or peptide, and
(B) identifying said at least one post-translational modification in at least one amino acid subunit of said protein or peptide, or derivative thereof.
55. The method of claim 54, wherein said sequencing comprises subjecting said protein or peptide to degradation conditions to sequentially remove amino acid sub-units from said protein or peptide, and detecting at least a subset of said amino acid sub-units.
56. The method of claim 54, wherein said protein or peptide is obtained from a sample and processed to label said at least one post-translational modification.
57. The method of claim 56, wherein said sample is a cell-free sample.
58. The method of claim 54, wherein said sequencing comprises labeling said at least one post-translational modification of said protein or peptide with a label, and detecting said label to thereby identify said at least one post-translational modification on said protein or peptide.
59. A method for processing or analyzing a protein or peptide, comprising subjecting said protein or peptide to conditions sufficient to specifically label different post- translational modifications of said protein or peptide, and detecting labels corresponding to said different post-translational modifications of said protein or peptide to thereby detect said different post-translational modifications of said protein or peptide.
60. The method of claim 59, wherein said different post-translational modifications comprise phosphorylation, glycosylation, nitrosylation, citrullination, sulfenylation, or trimethylation.
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP19840848.6A EP3827093A4 (en) | 2018-07-23 | 2019-07-23 | IDENTIFICATION BY SINGLE MOLECULE SEQUENCING OF POST-TRANSLATIONAL MODIFICATIONS ON PROTEINS |
CN201980048949.XA CN112469832A (en) | 2018-07-23 | 2019-07-23 | Single molecule sequencing identification of post-translational modifications on proteins |
JP2021503788A JP2021530549A (en) | 2018-07-23 | 2019-07-23 | Identification of post-translational modifications in proteins by single molecule sequencing |
US17/155,298 US20210215706A1 (en) | 2018-07-23 | 2021-01-22 | Single molecule sequencing identification of post-translational modifications on proteins |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862702318P | 2018-07-23 | 2018-07-23 | |
US62/702,318 | 2018-07-23 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/155,298 Continuation US20210215706A1 (en) | 2018-07-23 | 2021-01-22 | Single molecule sequencing identification of post-translational modifications on proteins |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020023488A1 true WO2020023488A1 (en) | 2020-01-30 |
Family
ID=69182408
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2019/042998 WO2020023488A1 (en) | 2018-07-23 | 2019-07-23 | Single molecule sequencing identification of post-translational modifications on proteins |
Country Status (5)
Country | Link |
---|---|
US (1) | US20210215706A1 (en) |
EP (1) | EP3827093A4 (en) |
JP (1) | JP2021530549A (en) |
CN (1) | CN112469832A (en) |
WO (1) | WO2020023488A1 (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021086901A1 (en) * | 2019-10-28 | 2021-05-06 | Quantum-Si Incorporated | Methods of preparing an enriched sample for polypeptide sequencing |
US11105812B2 (en) | 2011-06-23 | 2021-08-31 | Board Of Regents, The University Of Texas System | Identifying peptides at the single molecule level |
US11162952B2 (en) | 2014-09-15 | 2021-11-02 | Board Of Regents, The University Of Texas System | Single molecule peptide sequencing |
US11435358B2 (en) | 2011-06-23 | 2022-09-06 | Board Of Regents, The University Of Texas System | Single molecule peptide sequencing |
US11959920B2 (en) | 2018-11-15 | 2024-04-16 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12065466B2 (en) | 2020-05-20 | 2024-08-20 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12196760B2 (en) | 2018-07-12 | 2025-01-14 | Board Of Regents, The University Of Texas System | Molecular neighborhood detection by oligonucleotides |
US12360114B2 (en) | 2019-12-10 | 2025-07-15 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116270639A (en) * | 2023-03-31 | 2023-06-23 | 中山大学 | New application of N- (2- (5- (1, 2-dithiolane-3-yl) pentanamido) ethyl) nicotinamide |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040053356A1 (en) * | 2001-12-28 | 2004-03-18 | Mds Proteomics Inc. | Enzyme/chemical reactor based protein processing method for proteomics analysis by mass spectrometry |
WO2010044892A1 (en) * | 2008-10-17 | 2010-04-22 | President And Fellows Of Harvard College | Diagnostic method based on large scale identification of post-translational modification of proteins |
US20140024124A1 (en) * | 2011-02-16 | 2014-01-23 | National University Corporation Hokkaido University | Labeling agent for analyzing post-translational modifications of serine and threonine |
US20150087526A1 (en) * | 2012-01-24 | 2015-03-26 | The Regents Of The University Of Colorado, A Body Corporate | Peptide identification and sequencing by single-molecule detection of peptides undergoing degradation |
WO2017219027A1 (en) * | 2016-06-17 | 2017-12-21 | The Broad Institute Inc. | Type vi crispr orthologs and systems |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7476656B2 (en) * | 2004-11-09 | 2009-01-13 | University Of Florida Research Foundation, Inc. | Fluorescent affinity tag to enhance phosphoprotein detection and characterization |
CN104789536A (en) * | 2005-10-12 | 2015-07-22 | 斯克利普斯研究院 | Selective posttranslational modification of phage-displayed polypeptides |
-
2019
- 2019-07-23 CN CN201980048949.XA patent/CN112469832A/en active Pending
- 2019-07-23 WO PCT/US2019/042998 patent/WO2020023488A1/en unknown
- 2019-07-23 JP JP2021503788A patent/JP2021530549A/en active Pending
- 2019-07-23 EP EP19840848.6A patent/EP3827093A4/en active Pending
-
2021
- 2021-01-22 US US17/155,298 patent/US20210215706A1/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040053356A1 (en) * | 2001-12-28 | 2004-03-18 | Mds Proteomics Inc. | Enzyme/chemical reactor based protein processing method for proteomics analysis by mass spectrometry |
WO2010044892A1 (en) * | 2008-10-17 | 2010-04-22 | President And Fellows Of Harvard College | Diagnostic method based on large scale identification of post-translational modification of proteins |
US20140024124A1 (en) * | 2011-02-16 | 2014-01-23 | National University Corporation Hokkaido University | Labeling agent for analyzing post-translational modifications of serine and threonine |
US20150087526A1 (en) * | 2012-01-24 | 2015-03-26 | The Regents Of The University Of Colorado, A Body Corporate | Peptide identification and sequencing by single-molecule detection of peptides undergoing degradation |
WO2017219027A1 (en) * | 2016-06-17 | 2017-12-21 | The Broad Institute Inc. | Type vi crispr orthologs and systems |
Non-Patent Citations (2)
Title |
---|
DOLL FRANZISKA, BUNTZ ANNETTE, SPÄTE ANNE-KATRIN, SCHART VERENA F., TIMPER ALEXANDER, SCHRIMPF WALDEMAR, HAUCK CHRISTOF R., ZUMBUS: "Visualization of Protein-Specific Glycosylation inside Living Cells", ANGEWANDTE CHEMIE, vol. 55, no. 6, 12 January 2016 (2016-01-12), pages 2262 - 2266, XP055779670, ISSN: 1433-7851, DOI: 10.1002/anie.201503183 * |
See also references of EP3827093A4 * |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11105812B2 (en) | 2011-06-23 | 2021-08-31 | Board Of Regents, The University Of Texas System | Identifying peptides at the single molecule level |
US11435358B2 (en) | 2011-06-23 | 2022-09-06 | Board Of Regents, The University Of Texas System | Single molecule peptide sequencing |
US11162952B2 (en) | 2014-09-15 | 2021-11-02 | Board Of Regents, The University Of Texas System | Single molecule peptide sequencing |
US12196760B2 (en) | 2018-07-12 | 2025-01-14 | Board Of Regents, The University Of Texas System | Molecular neighborhood detection by oligonucleotides |
US11959920B2 (en) | 2018-11-15 | 2024-04-16 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12000835B2 (en) | 2018-11-15 | 2024-06-04 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12055548B2 (en) | 2018-11-15 | 2024-08-06 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12174196B2 (en) | 2018-11-15 | 2024-12-24 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12259391B2 (en) | 2018-11-15 | 2025-03-25 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
WO2021086901A1 (en) * | 2019-10-28 | 2021-05-06 | Quantum-Si Incorporated | Methods of preparing an enriched sample for polypeptide sequencing |
US12360114B2 (en) | 2019-12-10 | 2025-07-15 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
US12065466B2 (en) | 2020-05-20 | 2024-08-20 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
Also Published As
Publication number | Publication date |
---|---|
EP3827093A1 (en) | 2021-06-02 |
JP2021530549A (en) | 2021-11-11 |
US20210215706A1 (en) | 2021-07-15 |
EP3827093A4 (en) | 2022-10-05 |
CN112469832A (en) | 2021-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210215706A1 (en) | Single molecule sequencing identification of post-translational modifications on proteins | |
US20240302380A1 (en) | Single molecule peptide sequencing | |
US20210356473A1 (en) | Solid-phase n-terminal peptide capture and release | |
US11435358B2 (en) | Single molecule peptide sequencing | |
US20240002925A1 (en) | Methods, systems and kits for polypeptide processing and analysis | |
US20230076975A1 (en) | Peptide and protein c-terminus labeling | |
US20240287106A1 (en) | Method for Producing Fluorescent Probe Library Using Solid-Phase Extraction and Method of Measuring Enzyme Activity Using Same | |
US20240201198A1 (en) | Compositions, methods, and utility of conjugated biomolecule barcodes | |
US20070128729A1 (en) | Method for the identification and relative quantification of proteins based on the selective isolation of RRnK peptides for the simplification of complex mixtures of proteins | |
US20200158737A1 (en) | Methods of measuring ubiquitin-like modifications | |
US20240426831A1 (en) | Structural profiling of native proteins using fluorosequencing, a single molecule protein sequencing technology | |
US7476656B2 (en) | Fluorescent affinity tag to enhance phosphoprotein detection and characterization | |
HK40047061A (en) | Single molecule sequencing identification of post-translational modifications on proteins | |
US20250035639A1 (en) | Protein quantification, tracking, and identification via peptide barcodes | |
WO2024076928A1 (en) | Fluorophore-polymer conjugates and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19840848 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2021503788 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2019840848 Country of ref document: EP Effective date: 20210223 |