WO2018047197A1 - Site-selective peptides and protein labelling and synthesis of homogenous protein conjugates - Google Patents

Site-selective peptides and protein labelling and synthesis of homogenous protein conjugates Download PDF

Info

Publication number
WO2018047197A1
WO2018047197A1 PCT/IN2017/050362 IN2017050362W WO2018047197A1 WO 2018047197 A1 WO2018047197 A1 WO 2018047197A1 IN 2017050362 W IN2017050362 W IN 2017050362W WO 2018047197 A1 WO2018047197 A1 WO 2018047197A1
Authority
WO
WIPO (PCT)
Prior art keywords
protein
site
alkyl
proteins
aryl
Prior art date
Application number
PCT/IN2017/050362
Other languages
French (fr)
Inventor
Vishal Rai
Maheshwerreddy CHILAMARI
Landa PURUSHOTTAM
Original Assignee
Indian Institute Of Science Education And Research Bhopal
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Indian Institute Of Science Education And Research Bhopal filed Critical Indian Institute Of Science Education And Research Bhopal
Publication of WO2018047197A1 publication Critical patent/WO2018047197A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K1/00General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
    • C07K1/107General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K1/00General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
    • C07K1/13Labelling of peptides

Definitions

  • the invention is in the field of biotechnology and concerns protein labelling.
  • Epsilon -Nth of lysine ( ⁇ ⁇ - ⁇ 2 ) is one of the most abundant naturally occurring reactive functional groups on the surface of proteins, among various other nucleophilic residues.
  • Site- directed mutagenesis enabled the introduction of no n- natural amino acids into the protein backbone, and this has further enabled access to certain functional groups with unique reactivity. This specific non-proteinogenic residue is utilized for the attachment of a tag in subsequent bio-orthogonal transformation.
  • proximity driven approach through ligand directed labeling is the only method. Chemical approaches were not in focus for site- selective protein labeling as it would involve addressing several challenging questions related to the selectivity.
  • the pre -requisite of the labeling method involves the development of a chemical transformation that would be effective in near neutral aqueous buffer at ambient temperature.
  • a functional group such as ⁇ ⁇ - ⁇ 2 would have a stark confrontation with the other nucleophilic functionalities (chemoselectivity) and multiple ⁇ ⁇ - ⁇ 2 residues (site- selectivity).
  • the focal challenge revolves around the tenet that difference in nucleophilicity of multiple copies of an amino acid residue is inadequate to impart significant reactivity to a particular site.
  • the technology still revolves around the most commonly adapted protocol that involves pre-engineered proteins with unnatural amino acids amenable to bio-orthogonal chemistry or enzyme-tag pairs. This limits the access of methodology to the majority of native proteins and antibodies.
  • an affinity probe or fluorophore has to be attached to a native protein, it will result in a heterogeneously labeled mixture of proteins.
  • ADCs antibody drug conjugates
  • the two approved drugs in this segment for directed therapeutics are synthesized using chemical approach. Both the drugs are available as heterogeneous mixtures.
  • Another approach is the proximity driven approach and is driven by the specificity of ligand-receptor interactions. This technique has been applied for the site-specific modification of endogenous proteins. This chemo-enzymatic approach also has a limited set of applications.
  • there is a need for an effective tool that would allow site-selective protein labeling of native (un-engineered) protein at single-site which can offer further flexibility to the user for the introduction of the desired tag in a late-stage modification.
  • the object of the invention is for a of method site-selective labeling of native (un-engineered) peptides and protein at single-site ( ⁇ ⁇ - ⁇ 2 ) which can offer further flexibility for installation of the desired tag in late-stage modification.
  • Another object of the invention is to synthesize labeled native proteins, labeled antibody-Fab fragment, and labeled monoclonal antibody, and for further synthesis of homogenous protein conjugates.
  • Figure 1 depicts the reagents used in the method of the invention.
  • Figure 2 depicts reaction for the formylation and single site labelling of peptides.
  • Figure 3 depicts reaction for the formylation and single site labelling of proteins.
  • Figure 4 depicts reaction 1 and 2 for the formylation and interception of formylation for the site specific single site labelling of proteins.
  • Figure 5 depicts alkynylation of native proteins.
  • Figure 6 depicts aminophosphonation of native proteins.
  • Figure 7 depicts tagging of a protein through (A) aldehyde component, and (B) late-stage oxime formation.
  • Figure 8 depicts aminophosphonation of antibody-Fab fragment and antibody for installation of drug molecule.
  • the invention is for a method of site-selective single site labelling of peptides and proteins at single-site ( ⁇ ⁇ - ⁇ 2) which can offer further flexibility for installation of the desired tag in late-stage modification.
  • the invention is for the synthesis of labeled native proteins, labeled antibody-Fab fragment, and labeled monoclonal antibody and for further synthesis of homogenous protein conjugates.
  • the functional groups introduced are the aldehyde, maleimide, and can be expanded to various bio-orthogonal functionalities. These functional groups enable the late- stage modification of the proteins.
  • the reagents involved in the introduction of the functional groups involve reagents such as aldehyde, alkyne, phosphite, and the like.
  • Labels can be introduced by using them as one of the components in reaction.
  • Labels introduced by late stage modification are affinity tags, biophysical probes such as fluorophore tags, and drug molecules.
  • the fluorophore tags is selected from 11a, lib, and 11c. It can be extended to hydroxylamine derivative of any fluorophore.
  • the drug molecules is selected from 12a and 12b. It can be extended to hydroxylamine derivative of any drug molecules.
  • the reaction time and temperature are with reference to the physiological conditions and can be varied depending on the molecules to be conjugated.
  • the method of the introduction of the functional groups for single-site protein labeling are selected from (I) Formylation of protein (II) Interception of formylation (III) Alkynylation of protein (TV) Aminophosphonation of protein.
  • transformation results in ⁇ ⁇ -formylation of lysine residue which is affected with the aldehyde as a formylating pre-reagent and which deliver high levels of efficiency with both peptides and native proteins in physiological conditions.
  • the method involves two sequential steps wherein the reversible reaction of the a-amine with aldehyde renders it unreactive towards further reaction with formate generated from an aldehyde or any electrophile to enable selective formylation of ⁇ ⁇ - ⁇ 2 .
  • the ⁇ ⁇ -formylation of lysine residue is intercepted by another electrophile, acylating reagent, to provide a site for late-stage modification.
  • the interception of formylation can be performed by any electrophile delivered to the protein with a regulated rate of addition.
  • the acylating reagents used for the introduction of the bio orthogonal functionalities is selected from 4a, 4b, 4c, 4d and 5.
  • chemoselective and site-selective ⁇ -amine modification is performed with a multicomponent transformation that operates under physiological conditions in the presence of protein, aldehyde, acetylene, and Cu-ligand complex.
  • the propargylamine is introduced at the labeled site is primed for late-stage bio- orthogonal modification.
  • chemoselective and site-selective ⁇ -amine modification is performed with a multicomponent transformation that operates under physiological conditions in the presence of protein, aldehyde, and triethylphosphite or diethylphosphite. Transformation is performed without the requirement of the metal catalyst.
  • the aldehyde component derivatives can introduce tags directly or through a late-stage bio- orthogonal modification. Reagents used for technology I (-Formylation): la-g, 2a-c, 3a-n.
  • Reagents used for technology II (-Interception of formylation): la-g, 2a-c, 3a -n, 4a-d, 5.
  • Reagents used for technology III (-Alkynylation of protein): 3a, 6, 7a-g, 8a-b.
  • R is independently selected from H; alkyl; cycloalkyl; aryl and Ri, R 2 , R3 and R4 are independently selected from H; hydroxyl; -B(OR * )(O R ** ) wherein R * and R ** are independently selected from H; alkyl; lower alkyl; cycloalkyl; aryl; heteroaryl; alkenyl; heterocycle; halides; nitro; - C(0)OR * wherein R * is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR ** R *** , wherein R ** and R *** are independently selected from H, alkyl; cycloalkyl and aryl; -CH2C(0)R a , wherein R a is selected from -OH, lower alkyl, cycloalkyl; aryl,
  • R5 , R6, and R7 are independently selected from H; nitro; cyano; halides; alkyl; cycloalkyl; aryl and C(0)OR * wherein R * is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR ** R *** , wherein R ** and R *** are independently selected from H, alkyl; cycloalkyl and aryl.
  • Rs and R9 are independently selected from H; halides; alkyl; cycloalkyl and aryl.
  • Rio is selected from H; nitro; cyano; halides; alkyl; cycloalkyl; aryl and C(0)OR * wherein R * is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR ** R *** , wherein R ** and R *** are independently selected from H, alkyl; cycloalkyl and aryl.
  • Rn, R 12 , R 1 3 and R14 are independently selected from H; alkyl; cycloalkyl; aryl and -SO3R wherein R is selected from H; Na.
  • R15 and R1 ⁇ 2 are independently selected from H; alkyl; cycloalkyl; aryl and C(0)OR * wherein R * is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR ** R *** , wherein R ** and R *** are independently selected from H, alkyl; cycloalkyl and aryl.
  • R17 and Ri8 are independently selected from H; halides; alkyl; cycloalkyl and aryl.
  • R19, R20, and R21 are independently selected from H; alkyl; aryl and C(0)OR * wherein R * is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR ** R *** , wherein R ** and R *** are independently selected from H, alkyl; cycloalkyl and aryl.
  • suitable substituent is meant to include independently H; hydroxyl; cyano; alkyl, such as lower alkyl, such as methyl, ethyl, propyl, n-butyl, t-butyl, hexyl and the like; alkoxy, such as lower alkoxy such as methoxy, ethoxy, and the like; aryloxy, such as phenoxy and the like; vinyl; alkenyl, such as hexenyl and the like; alkynyl; formyl; haloalkyl, such as lower haloalkyl which includes CF3, CCI3 and the like; halide; aryl, such as phenyl and napthyl; heteroaryl, such as thienyl and furanyl and the like; amide such as C(0)NR**R***, , where R** and R*** are independently selected from lower alkyl, aryl or benzyl, and the like; acyl, such as C(0)NR**R
  • lower alkyl as used herein either alone or in combination with another substituent means acyclic, straight or branched chain alkyl substituent containing from one to six carbons and includes for example, methyl, ethyl, 1 -methylethyl, 1 -methylpropyl, 2-methylpropyl, and the like.
  • lower alkoxy as used herein includes methoxy, ethoxy, i-butoxy.
  • alkyl encompasses lower alkyl, and also includes alkyl groups having more than six carbon atoms, such as, for example, acyclic, straight or branched chain alkyl substituents having seven to ten carbon atoms.
  • aryl as used herein, either alone or in combination with another substituent, means an aromatic monocyclic system or an aromatic polycyclic system.
  • aryl includes a phenyl or a napthyl ring, and may also include larger aromatic polycyclic systems, such as fluorescent (eg. anthracene) or radioactive labels and their derivatives.
  • heteroaryl as used herein, either alone or in combination with another substituent means a 5, 6, or 7-membered unsaturated heterocycle containing from one to 4 heteroatoms selected from nitrogen, oxygen, and sulphur and which form an aromatic system.
  • heteroaryl also includes a polycyclic aromatic system comprising a 5, 6, or 7-membered unsaturated heterocycle containing from one to 4 heteroatoms selected from nitrogen, oxygen, and sulphur.
  • cycloalkyl as used herein, either alone or in combination with another substituent, means a cycloalkyl substituent that includes for example, but is not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl and cycloheptyl.
  • cycloalkyl-alkyl- that means an alkyl radical to which a cycloalkyl radical is directly linked; and includes, but is not limited to, cyclopropylmethyl, cyclobutylmethyl, cyclopentylmethyl,
  • aryl-alkyl- means an alkyl radical, to which an aryl is bonded.
  • aryl-alkyl- include, but are not limited to, benzyl (phenylmethyl), 1 -phenylethyl, 2-phenylethyl and phenylpropyl.
  • heterocycle either alone or in combination with another radical, means a monovalent radical derived by removal of a hydrogen from a three- to seven-membered saturated or unsaturated (including aromatic) heterocycle containing from one to four heteroatoms selected from nitrogen, oxygen and sulfur.
  • heterocycles include, but are not limited to, pyrrolidine, tetrahydrofuran, thiazolidine, pyrrole, thiophene, hydantoin, diazepine, imidazole, isoxazole, thiazole, tetrazole, piperidine, piperazine, homopiperidine, homopiperazine, 1 ,4-dioxane, 4-morpholine, 4-thiomorpholine, pyridine, pyridine-N-oxide or pyrimidine, and the like.
  • alkenyl as used herein, either alone or in combination with another radical, is intended to mean an unsaturated, acyclic straight chain radical containing two or more carbon atoms, at least two of which are bonded to each other by a double bond.
  • examples of such radicals include, but are not limited to, ethenyl (vinyl), 1 -propenyl, 2-propenyl, and 1 -butenyl.
  • alkynyl as used herein is intended to mean an unsaturated, acyclic straight chain radical containing two or more carbon atoms, at least two of which are bonded to each other by a triple bond.
  • examples of such radicals include, but are not limited to, ethynyl, 1 -propynyl, 2-propynyl, and 1 -butynyl.
  • alkoxy as used herein, either alone or in combination with another radical, means the radical -0-(Ci- n )alkyl wherein alkyl is as defined above containing 1 or more carbon atoms, and includes for example mefhoxy, ethoxy, propoxy, 1 -mefhylethoxy, butoxy and 1,1-dimethylethoxy.
  • n 1 to 6
  • aryloxy as used herein alone or in combination with another radical means -O-aryl, wherein aryl is defined as noted above.
  • R5, R 6 , and R7 can be sourced independently from any alkyl or cyclic alkyl functionalities that would not perturb the nucleophilicity of the phosphite.
  • R 8 can be sourced from any alkyl or cyclic alkyl functionalities that would not perturb the nucleophilicity of hydroxylamine.
  • affinity tags for example, affinity tags, fluorescence tags, drugs, and toxins.
  • antibody-Fab fragment and a monoclonal antibody are synthesized with the functional groups by the site-selective labelling.
  • Antibody drug conjugates were synthesized by using the doxorubicin derivative which acts as a fluorescent label and a drug.
  • the progress of the transformation was followed by sampling aliquots at various time intervals by MALDI-ToF-MS using sinapic acid as matrix. After 24-48 h, the reaction mixture was further diluted with water (0.4 ml) and unreacted aldehyde was extracted using ethyl acetate/hexane (8:2, 5x1 ml).
  • reaction mixture was subjected to dialysis and the sample was concentrated by lyophilization before subjecting it to the digestion, peptide mapping and sequencing by MS-MS (Fig 1 and 2).
  • Table 1 % conversion and yield of the peptide/protein by the method
  • the reaction was stopped in another 15 minutes by extraction of unreacted electrophile (acylating reagent) and aldehyde using ethyl acetate/he xane (8:2, 5x1 ml).
  • the reaction mixture was subjected to dialysis and the sample was concentrated by lyophilization.
  • Mono-labeled protein was dissolved in phosphate buffer (95 ⁇ , 0.1 M, pH 7.0) and treated with 4-methyl-7-mercaptocoumarin (5, 3.65 mM, 365 nmole, from freshly prepared solution) in DMSO (5 ⁇ ) to give the fluorescent labeled protein. Protein labeling was monitored by using MALDI-ToF.
  • reaction mixture was treated with hydroxylamine hydrochloride (250 ⁇ g, 3.65 ⁇ mol ) in water (5 ⁇ ) for 6 h to remove the excess formaldehyde.
  • the reaction mixture was subjected to dialysis [EDTA (0.05 M)-phosphate buffer (pH 7.8, 0.1 M)] and the sample was concentrated by lyophilization before subjecting it to digestion, peptide mapping and sequencing by MS-MS. The percent conversion and yield resulting out of the reaction is provided in Table 2.
  • the progress of reaction was monitored by MALDI-ToF MS using sinapic acid as matrix and ESI-MS.
  • the excess aldehyde in the reaction mixture was quenched with hydroxylamine hydrochloride (250 ⁇ g, 3.65 ⁇ ) in water (5 ⁇ ) for 2 h.
  • the protein solution was desalted by using the concentrators (3 kDa MWCO) and then the sample was concentrated by lyophilization before subjecting it to digestion, peptide mapping and sequencing by MS-MS.
  • the percent conversion and yield resulting out of the reaction is provided in Table 3.
  • the invention eliminates the requirement of pre-protein engineering and un-natural amino acids for labelling of proteins. Distinguishing a unique Lysine residue as a reactivity hotspot and open the doors for the single- site derivatization of proteins.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Biochemistry (AREA)
  • Analytical Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The invention pertains to the method for the site selective modification of peptides and proteins comprising of the activation of the single site (Nɛ-NH2) of peptides and proteins and the method comprising one of formylation; interception of formylation; alkynylation or amino phosphonation of peptides and proteins.

Description

SITE-SELECTIVE PEPTIDES AND PROTEIN LABELLING AND SYNTHESIS OF HOMOGENOUS PROTEIN CONJUGATES
FIELD OF INVENTION:
The invention is in the field of biotechnology and concerns protein labelling.
BACKGROUND OF THE INVENTION:
The Epsilon -Nth of lysine (Νε-ΝΗ2) is one of the most abundant naturally occurring reactive functional groups on the surface of proteins, among various other nucleophilic residues. Site- directed mutagenesis enabled the introduction of no n- natural amino acids into the protein backbone, and this has further enabled access to certain functional groups with unique reactivity. This specific non-proteinogenic residue is utilized for the attachment of a tag in subsequent bio-orthogonal transformation. For site-selective protein backbone modification (other than N-terminus) with endogenous/native proteins, proximity driven approach through ligand directed labeling is the only method. Chemical approaches were not in focus for site- selective protein labeling as it would involve addressing several challenging questions related to the selectivity. The pre -requisite of the labeling method involves the development of a chemical transformation that would be effective in near neutral aqueous buffer at ambient temperature. Besides, a functional group such as Νε-ΝΗ2 would have a stark confrontation with the other nucleophilic functionalities (chemoselectivity) and multiple Νε-ΝΗ2 residues (site- selectivity). The focal challenge revolves around the tenet that difference in nucleophilicity of multiple copies of an amino acid residue is inadequate to impart significant reactivity to a particular site. Thus, the technology still revolves around the most commonly adapted protocol that involves pre-engineered proteins with unnatural amino acids amenable to bio-orthogonal chemistry or enzyme-tag pairs. This limits the access of methodology to the majority of native proteins and antibodies. For example, if an affinity probe or fluorophore has to be attached to a native protein, it will result in a heterogeneously labeled mixture of proteins. Such limitation with the available methodology is also evident in the case of antibody drug conjugates (ADCs). The two approved drugs in this segment for directed therapeutics are synthesized using chemical approach. Both the drugs are available as heterogeneous mixtures. Another approach is the proximity driven approach and is driven by the specificity of ligand-receptor interactions. This technique has been applied for the site-specific modification of endogenous proteins. This chemo-enzymatic approach also has a limited set of applications. Hence, there is a need for an effective tool that would allow site-selective protein labeling of native (un-engineered) protein at single-site which can offer further flexibility to the user for the introduction of the desired tag in a late-stage modification.
OBJECT OF THE INVENTION:
The object of the invention is for a of method site-selective labeling of native (un-engineered) peptides and protein at single-site (Νε-ΝΗ2) which can offer further flexibility for installation of the desired tag in late-stage modification.
Another object of the invention is to synthesize labeled native proteins, labeled antibody-Fab fragment, and labeled monoclonal antibody, and for further synthesis of homogenous protein conjugates. FIGURES AND DRAWINGS:
Figure 1 depicts the reagents used in the method of the invention.
Figure 2 depicts reaction for the formylation and single site labelling of peptides. Figure 3 depicts reaction for the formylation and single site labelling of proteins. Figure 4 depicts reaction 1 and 2 for the formylation and interception of formylation for the site specific single site labelling of proteins.
Figure 5 depicts alkynylation of native proteins.
Figure 6 depicts aminophosphonation of native proteins. Figure 7 depicts tagging of a protein through (A) aldehyde component, and (B) late-stage oxime formation.
Figure 8 depicts aminophosphonation of antibody-Fab fragment and antibody for installation of drug molecule. DETAILED DESCRIPTION OF THE INVENTION:
Accordingly, the invention is for a method of site-selective single site labelling of peptides and proteins at single-site (Νε-ΝΗ2) which can offer further flexibility for installation of the desired tag in late-stage modification.
In addition, the invention is for the synthesis of labeled native proteins, labeled antibody-Fab fragment, and labeled monoclonal antibody and for further synthesis of homogenous protein conjugates.
Accordingly, chemical methods for the effective modification of single site labelling of proteins is described. In one embodiment, the functional groups introduced are the aldehyde, maleimide, and can be expanded to various bio-orthogonal functionalities. These functional groups enable the late- stage modification of the proteins.
The reagents involved in the introduction of the functional groups involve reagents such as aldehyde, alkyne, phosphite, and the like.
Labels can be introduced by using them as one of the components in reaction.
Labels introduced by late stage modification are affinity tags, biophysical probes such as fluorophore tags, and drug molecules.
The fluorophore tags is selected from 11a, lib, and 11c. It can be extended to hydroxylamine derivative of any fluorophore.
The drug molecules is selected from 12a and 12b. It can be extended to hydroxylamine derivative of any drug molecules.
The reaction time and temperature are with reference to the physiological conditions and can be varied depending on the molecules to be conjugated. The method of the introduction of the functional groups for single-site protein labeling are selected from (I) Formylation of protein (II) Interception of formylation (III) Alkynylation of protein (TV) Aminophosphonation of protein.
Further, by the method of the invention transformation results in Νε -formylation of lysine residue which is affected with the aldehyde as a formylating pre-reagent and which deliver high levels of efficiency with both peptides and native proteins in physiological conditions. Additionally, the method involves two sequential steps wherein the reversible reaction of the a-amine with aldehyde renders it unreactive towards further reaction with formate generated from an aldehyde or any electrophile to enable selective formylation of Νε-ΝΗ2.
For the synthesis of proteins with the functional group with a bio-orthogonal site for late-stage modification of proteins with the desired tag, the Νε -formylation of lysine residue is intercepted by another electrophile, acylating reagent, to provide a site for late-stage modification. The interception of formylation can be performed by any electrophile delivered to the protein with a regulated rate of addition. The acylating reagents used for the introduction of the bio orthogonal functionalities is selected from 4a, 4b, 4c, 4d and 5.
In addition, for a single Νε-ΝΗ2 modification, chemoselective and site-selective ε-amine modification is performed with a multicomponent transformation that operates under physiological conditions in the presence of protein, aldehyde, acetylene, and Cu-ligand complex. The propargylamine is introduced at the labeled site is primed for late-stage bio- orthogonal modification.
Further, for a single Νε-ΝΗ2 modification, chemoselective and site-selective ε-amine modification is performed with a multicomponent transformation that operates under physiological conditions in the presence of protein, aldehyde, and triethylphosphite or diethylphosphite. Transformation is performed without the requirement of the metal catalyst. The aldehyde component derivatives can introduce tags directly or through a late-stage bio- orthogonal modification.
Figure imgf000007_0001
Reagents used for technology I (-Formylation): la-g, 2a-c, 3a-n.
Reagents used for technology II (-Interception of formylation): la-g, 2a-c, 3a -n, 4a-d, 5.
Reagents used for technology III (-Alkynylation of protein): 3a, 6, 7a-g, 8a-b.
Reagents used for technology IV(-Aminophosphonation of protein.): la-g, 2a-c, 3a-n, 9, 10, lla-c.
Aldehyde reagents for the technologies I, II, III and IV: n = 1 -10, X = NH, O, S. R is independently selected from H; alkyl; cycloalkyl; aryl and Ri, R2, R3 and R4 are independently selected from H; hydroxyl; -B(OR*)(O R**) wherein R* and R** are independently selected from H; alkyl; lower alkyl; cycloalkyl; aryl; heteroaryl; alkenyl; heterocycle; halides; nitro; - C(0)OR* wherein R* is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR**R***, wherein R** and R*** are independently selected from H, alkyl; cycloalkyl and aryl; -CH2C(0)Ra, wherein Ra is selected from -OH, lower alkyl, cycloalkyl; aryl, -lower alkyl-aryl, -cycloalkyl- aryl; or -NRbRc, where Rb and Rc are independently selected from H, lower alkyl, cycloalkyl; aryl or -lower alkyl-aryl; -C(0)Rd, wherein Rd is selected from lower alkyl, cycloalkyl; aryl or -lower alkyl-aryl; or -lower aikyl-ORe, wherein Re is a suitable protecting group or OH group. R5 , R6, and R7 are independently selected from H; nitro; cyano; halides; alkyl; cycloalkyl; aryl and C(0)OR* wherein R* is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR**R***, wherein R** and R*** are independently selected from H, alkyl; cycloalkyl and aryl. Rs and R9 are independently selected from H; halides; alkyl; cycloalkyl and aryl. Rio is selected from H; nitro; cyano; halides; alkyl; cycloalkyl; aryl and C(0)OR* wherein R* is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR**R***, wherein R** and R*** are independently selected from H, alkyl; cycloalkyl and aryl. Rn, R12, R13 and R14 are independently selected from H; alkyl; cycloalkyl; aryl and -SO3R wherein R is selected from H; Na. R15 and R½ are independently selected from H; alkyl; cycloalkyl; aryl and C(0)OR* wherein R* is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR**R***, wherein R** and R*** are independently selected from H, alkyl; cycloalkyl and aryl. R17 and Ri8 are independently selected from H; halides; alkyl; cycloalkyl and aryl. R19, R20, and R21 are independently selected from H; alkyl; aryl and C(0)OR* wherein R* is selected from H, alkyl; cycloalkyl and aryl; -C(0)NR**R***, wherein R** and R*** are independently selected from H, alkyl; cycloalkyl and aryl.
The term "suitable substituent" is meant to include independently H; hydroxyl; cyano; alkyl, such as lower alkyl, such as methyl, ethyl, propyl, n-butyl, t-butyl, hexyl and the like; alkoxy, such as lower alkoxy such as methoxy, ethoxy, and the like; aryloxy, such as phenoxy and the like; vinyl; alkenyl, such as hexenyl and the like; alkynyl; formyl; haloalkyl, such as lower haloalkyl which includes CF3, CCI3 and the like; halide; aryl, such as phenyl and napthyl; heteroaryl, such as thienyl and furanyl and the like; amide such as C(0)NR**R***, , where R** and R*** are independently selected from lower alkyl, aryl or benzyl, and the like; acyl, such as C(0)-C6Hs, and the like; ester such as -C(0)OCH3 the like; ethers and thioethers, such as O-Bn and the like; thioalkoxy; phosphino; and -NRbRc, where Rb and Rc are independently selected from lower alkyl, aryl or benzyl, and the like. It is to be understood that a suitable substituent as used in the context of the present invention is meant to denote a substituent that does not interfere with the formation of the desired product by the processes of the present invention.
As used in the context of the present invention, the term "lower alkyl" as used herein either alone or in combination with another substituent means acyclic, straight or branched chain alkyl substituent containing from one to six carbons and includes for example, methyl, ethyl, 1 -methylethyl, 1 -methylpropyl, 2-methylpropyl, and the like. A similar use of the term is to be understood for "lower alkoxy", "lower thioalkyl", "lower alkenyl" and the like in respect of the number of carbon atoms. For example, "lower alkoxy" as used herein includes methoxy, ethoxy, i-butoxy.
The term "alkyl" encompasses lower alkyl, and also includes alkyl groups having more than six carbon atoms, such as, for example, acyclic, straight or branched chain alkyl substituents having seven to ten carbon atoms.
The term "aryl" as used herein, either alone or in combination with another substituent, means an aromatic monocyclic system or an aromatic polycyclic system. For example, the term "aryl" includes a phenyl or a napthyl ring, and may also include larger aromatic polycyclic systems, such as fluorescent (eg. anthracene) or radioactive labels and their derivatives.
The term "heteroaryl" as used herein, either alone or in combination with another substituent means a 5, 6, or 7-membered unsaturated heterocycle containing from one to 4 heteroatoms selected from nitrogen, oxygen, and sulphur and which form an aromatic system. The term "heteroaryl" also includes a polycyclic aromatic system comprising a 5, 6, or 7-membered unsaturated heterocycle containing from one to 4 heteroatoms selected from nitrogen, oxygen, and sulphur.
The term "cycloalkyl" as used herein, either alone or in combination with another substituent, means a cycloalkyl substituent that includes for example, but is not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl and cycloheptyl. The term also involves "cycloalkyl-alkyl-" that means an alkyl radical to which a cycloalkyl radical is directly linked; and includes, but is not limited to, cyclopropylmethyl, cyclobutylmethyl, cyclopentylmethyl,
1 - cyclopentylethyl, 2-cyclopentylethyl, cyclohexylmethyl, 1 -cyclohexylethyl and
2- cyclohexylethyl. A similar use of the "alkyl" or "lower alkyl" terms is to be understood for aryl-alkyl-, aryl-lower alkyl- (eg. benzyl), -lower alkyl-alkenyl (eg. ally 1) , heteroaryl-alkyl-, and the like as used herein. For example, the term "aryl-alkyl-" means an alkyl radical, to which an aryl is bonded. Examples of aryl-alkyl- include, but are not limited to, benzyl (phenylmethyl), 1 -phenylethyl, 2-phenylethyl and phenylpropyl. As used herein, the term "heterocycle", either alone or in combination with another radical, means a monovalent radical derived by removal of a hydrogen from a three- to seven-membered saturated or unsaturated (including aromatic) heterocycle containing from one to four heteroatoms selected from nitrogen, oxygen and sulfur. Examples of such heterocycles include, but are not limited to, pyrrolidine, tetrahydrofuran, thiazolidine, pyrrole, thiophene, hydantoin, diazepine, imidazole, isoxazole, thiazole, tetrazole, piperidine, piperazine, homopiperidine, homopiperazine, 1 ,4-dioxane, 4-morpholine, 4-thiomorpholine, pyridine, pyridine-N-oxide or pyrimidine, and the like.
The term "alkenyl", as used herein, either alone or in combination with another radical, is intended to mean an unsaturated, acyclic straight chain radical containing two or more carbon atoms, at least two of which are bonded to each other by a double bond. Examples of such radicals include, but are not limited to, ethenyl (vinyl), 1 -propenyl, 2-propenyl, and 1 -butenyl.
The term "alkynyl", as used herein is intended to mean an unsaturated, acyclic straight chain radical containing two or more carbon atoms, at least two of which are bonded to each other by a triple bond. Examples of such radicals include, but are not limited to, ethynyl, 1 -propynyl, 2-propynyl, and 1 -butynyl.
The term "alkoxy" as used herein, either alone or in combination with another radical, means the radical -0-(Ci-n)alkyl wherein alkyl is as defined above containing 1 or more carbon atoms, and includes for example mefhoxy, ethoxy, propoxy, 1 -mefhylethoxy, butoxy and 1,1-dimethylethoxy. Where n is 1 to 6, the term "lower alkoxy" applies, as noted above, whereas the term "alkoxy" encompasses "lower alkoxy" as well as alkoxy groups where n is greater than 6 (for example, n = 7 to 10). The term "aryloxy" as used herein alone or in combination with another radical means -O-aryl, wherein aryl is defined as noted above. R5, R6, and R7 can be sourced independently from any alkyl or cyclic alkyl functionalities that would not perturb the nucleophilicity of the phosphite.
R8 can be sourced from any alkyl or cyclic alkyl functionalities that would not perturb the nucleophilicity of hydroxylamine. For example, affinity tags, fluorescence tags, drugs, and toxins.
Further, several native proteins, antibody-Fab fragment, and a monoclonal antibody are synthesized with the functional groups by the site-selective labelling. Antibody drug conjugates were synthesized by using the doxorubicin derivative which acts as a fluorescent label and a drug.
EXAMPLES:
A more clear and concise functioning of the invention is provided in the examples and drawings, which are provided for better explanation and not intended to limit the disclosure of the invention.
EXAMPLE 1 :
(a) Formylation of protein
In a 2 ml Eppendorf tube, protein (14, 7.3 nmol) in phosphate buffer (95 μΐ, 0.1 M, pH 7.0) was taken. Aldehyde, ethyl-4-(4-formylphenoxy)butanoate (31, Ri, R2, R3, R4 = H, X = O and n = 2, 4.38 μmol ) in DMSO (5 μΐ) was taken from a freshly prepared stock solution and added to the protein followed by vortexing (350 rpm) at 25 °C or 37 °C. The overall concentration of protein (14a, 14c-14g) and aldehyde (31, R1, R2, R3, R4 = H, X = O and n = 2) was 73 μΜ and 43.8 mM respectively. The progress of the transformation was followed by sampling aliquots at various time intervals by MALDI-ToF-MS using sinapic acid as matrix. After 24-48 h, the reaction mixture was further diluted with water (0.4 ml) and unreacted aldehyde was extracted using ethyl acetate/hexane (8:2, 5x1 ml). The reaction mixture was subjected to dialysis and the sample was concentrated by lyophilization before subjecting it to the digestion, peptide mapping and sequencing by MS-MS (Fig 1 and 2). In case of Melittin (14b), concentration of aldehyde (31, Ri, R2, R3, R4 = H, X = O and n = 2, 4.38 μmol , 21.9 mM, 2.19 μmol ) was reduced to control the selectivity. All the other steps were similar to other proteins. In case of RNase A (14a) (7.3 nmol), 4-(decyloxy) benzaldehyde (3k, Ri, R2, R3, R4 = H, X = O, R = C10H21, 4.38 μmol ) was used as formylating pre-reagent. The percent conversion and yield resulting out of the reaction is provided in Table 1.
Table 1 : % conversion and yield of the peptide/protein by the method
Figure imgf000013_0001
Figure imgf000014_0001
EXAMPLE 2:
(b) Interception of formylation
In a 2 ml Eppendorf tube, RNase A (14) (0 (7.3 nmol) in phosphate buffer (95 μΐ, 0.1 M, pH 7.0) was taken. Aldehyde, ethyl 4-(4-formylphenoxy) butanoate (31, Ri, R2, R3, R4 = H, X = O and n = 2, 4.38 μmol ) in DMSO (5 μΐ) was taken from a freshly prepared stock solution and added to the protein followed by vortexing (350 rpm) at 25 °C. After 3 h, 2,3,5,6- tetrafiuorophenyl 3-(2,5-dioxo-2,5-dihydro-lH-pyrrol-l-yl) propanoate (4a, spacer la where n = 2, acylating reagent, 109.5 nmol) in DMSO (3 μΐ) from freshly prepared stock solution was added to the reaction mixture gradually over a period of 75 minutes. The concentration of the protein, aldehyde and acylating reagent (4a) in the reaction mixture was 73 μΜ, 43.8 mM and 1.09 mM respectively. The reaction was stopped in another 15 minutes by extraction of unreacted electrophile (acylating reagent) and aldehyde using ethyl acetate/he xane (8:2, 5x1 ml). The reaction mixture was subjected to dialysis and the sample was concentrated by lyophilization. Mono-labeled protein was dissolved in phosphate buffer (95 μΐ, 0.1 M, pH 7.0) and treated with 4-methyl-7-mercaptocoumarin (5, 3.65 mM, 365 nmole, from freshly prepared solution) in DMSO (5 μΐ) to give the fluorescent labeled protein. Protein labeling was monitored by using MALDI-ToF. After the late-stage modification, unreacted 4-methyl-7- mercaptocoumarin (5) was extracted using ethyl acetate/hexane (8:2, 5x1 ml). The reaction mixture was subjected to dialysis and the sample was concentrated by lyophilization, before subjecting it to digestion and peptide mapping. EXAMPLE 3:
(c) Alkynyhtion of protein (procedure 1 )
In a 1.5 ml Eppendorf tube, protein (14, 7.3 nmol) in phosphate buffer (90 μΐ, pH 7.8, 0.1 M) was taken. Formaldehyde (3a, R = H, 60 nl, 0.73 μmol ) in DMSO (3 μΐ) was taken from freshly prepared stock solution. To the reaction mixture, 1 , 10-phenanthroline (7g, 522 μg, 2.92 μmol ) in DMSO (2 μΐ), phenylacetylene (6, Ri, R2, R3, R4 = H, 78 nl, 0.73 μmol ) in DMSO (2.5 μΐ) and copper(I) iodide (8b, 120 μg, 0.73 μmol) in DMSO (2.5 μΐ) were added. After addition of all the reagents, a heterogeneous reaction mixture is formed that turns into a clear solution after 3-4 h. The reaction mixture was further vortexed for 72 h. Alkynylation of RNase A was followed by MALDI-ToF MS using sinapic acid as matrix. After 72 h, the reaction mixture was treated with hydroxylamine hydrochloride (250 μg, 3.65 μmol. ) in water (5 μΐ) for 6 h to remove the excess formaldehyde. The reaction mixture was subjected to dialysis [EDTA (0.05 M)-phosphate buffer (pH 7.8, 0.1 M)] and the sample was concentrated by lyophilization before subjecting it to digestion, peptide mapping, and sequencing by MS-MS.
(c) Alkynyhtion of protein (procedure 2 )
In a 0.5 ml Eppendorf tube, 1, 10-phenanthroline (7g, 522 μg, 2.92 μmol ) in DMSO (2 μΐ), phenylacetylene (6, Ri, R2, R3, R4 = H, 78 nl, 0.73 μmol ) in DMSO (2.5 μΐ) and copper(I) iodide (8b, 120 μg, 0.73 μmol ) in DMSO (2.5 μΐ) were mixed together and allowed to vortex for 1 h. In a separate 1.5 ml Eppendorf tube, protein (14, 7.3 nmol) in phosphate buffer (90 μΐ, pH 7.8, 0.1 M) and formaldehyde (3a, R = H, 60 nl, 0.73 μmol ) in DMSO (3 μΐ) were mixed. To the later mixture, pre-generated copper alkylidine complex was added. The reaction mixture is heterogeneous initially and turns into a clear solution after 3-4 h. The reaction mixture was further vortexed for 72 h. Alkynylation of protein was followed by MALDI-ToF MS using sinapic acid as matrix. After 72 h, the reaction mixture was treated with hydroxylamine hydrochloride (250 μg, 3.65 μmol ) in water (5 μΐ) for 6 h to remove the excess formaldehyde. The reaction mixture was subjected to dialysis [EDTA (0.05 M)-phosphate buffer (pH 7.8, 0.1 M)] and the sample was concentrated by lyophilization before subjecting it to digestion, peptide mapping and sequencing by MS-MS. The percent conversion and yield resulting out of the reaction is provided in Table 2.
Table 2: % conversion of the protein by the method
Figure imgf000016_0001
EXAMPLE 4:
(d) Aminophosphonation of protein
In a 1.5 ml Eppendorf tube, a protein (11, 7.3 nmol) in phosphate buffer (90 μΐ, pH 7.8, 0.1 M) was taken. Aldehyde (3b, R1 = Ph, 2.1 μιηοΐ) in DMSO (5 μΐ) and triethylphosphite (6, Rs, Re, R7 = Etthyl, 2.1 μιηοΐ) or diethylphosphite (7, R5, R6 = Etthyl, 2.1 μιηοΐ) in DMSO (5 μΐ) were added from freshly prepared stock solutions. The reaction mixture was vortexed (350 rpm) in incubate shaker at 25 °C. The progress of reaction was monitored by MALDI-ToF MS using sinapic acid as matrix and ESI-MS. The excess aldehyde in the reaction mixture was quenched with hydroxylamine hydrochloride (250 μg, 3.65 μιηοΐ) in water (5 μΐ) for 2 h. The protein solution was desalted by using the concentrators (3 kDa MWCO) and then the sample was concentrated by lyophilization before subjecting it to digestion, peptide mapping and sequencing by MS-MS. The percent conversion and yield resulting out of the reaction is provided in Table 3.
Table 3: % conversion of the proteins by the method
Figure imgf000017_0001
ADVANTAGES:
The invention eliminates the requirement of pre-protein engineering and un-natural amino acids for labelling of proteins. Distinguishing a unique Lysine residue as a reactivity hotspot and open the doors for the single- site derivatization of proteins.
Application in probing protein interactions, ligand discovery, disease diagnosis, high- throughput screening, and regulating properties of therapeutic proteins. The later can be achieved by conjugating drugs, polymer chains, glycosylation, chromophores, and biohybrid materials to the proteins.

Claims

The Claims:
1. A method for the site selective modification of peptides and proteins comprising of the activation of the single site (Νε-ΝΗ2) of peptides and proteins wherein the method comprising one of formylation; interception of formylation; alkynylation or amino phosphonation of peptides and proteins.
2. The method for the site selective modification of peptides and proteins as claimed in claim 1 , wherein reagents for formylation is selected from la-g, 2a-c, 3a-n.
Figure imgf000018_0001
3. The method for the site selective modification of peptides and proteins as claimed in claim 1, wherein reagents for interception formylation is selected from la-g, 2a-c, 3a-n, 4a -d, 5.
Figure imgf000019_0001
5. The method for the site selective modification of peptides and proteins as claimed in claim 1, wherein reagents for aminophosphonation is selected from la-g, 2a-c, 3a-n, 9, 10, lla-c, 12a-b.
Figure imgf000020_0001
6. The peptides and protein conjugates obtained by site selective modification of peptides and proteins comprising of the activation of the single site (Νε-ΝΗ2) by the methods as claimed in claim 1.
PCT/IN2017/050362 2016-09-07 2017-08-29 Site-selective peptides and protein labelling and synthesis of homogenous protein conjugates WO2018047197A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN201621030484 2016-09-07
IN201621030484 2016-09-07

Publications (1)

Publication Number Publication Date
WO2018047197A1 true WO2018047197A1 (en) 2018-03-15

Family

ID=61562717

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2017/050362 WO2018047197A1 (en) 2016-09-07 2017-08-29 Site-selective peptides and protein labelling and synthesis of homogenous protein conjugates

Country Status (1)

Country Link
WO (1) WO2018047197A1 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009150865A1 (en) * 2008-06-13 2009-12-17 国立大学法人東京大学 Method for production of modified protein and modified protein produced by the method, and protein modification kit

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009150865A1 (en) * 2008-06-13 2009-12-17 国立大学法人東京大学 Method for production of modified protein and modified protein produced by the method, and protein modification kit

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BASLE E ET AL.: "PROTEIN CHEMICAL MODIFICATION ON ENDOGENOUS AMINO ACIDS", CHEMISTRY & BIOLOGY, vol. 17, no. 3, 26 March 2010 (2010-03-26), pages 213 - 227, XP026984435 *
CHILAMARI ET AL.: "SITE SELECTIVE LABELING OF NATIVE PROTEINS BY A MULTICOMPONENT APPROACH", CHEMISTRY-A EUROPEAN JOURNAL, vol. 23, no. 16, 2017, pages 3819 - 3823, XP055488760 *
FEENEY ET AL.: "Chemical Modification of Proteins: An Overview", ADVANCES IN CHEMISTRY, 1982, pages 3 - 55 *
PURUSHOTTAM, L. ET AL.: "CHEMOSELECTIVE AND SITE-SELECTIVE PEPTIDE AND NATIVE PROTEIN MODIFICATION ENABLED BY ALDEHYDE AUTO-OXIDATION", CHEMICAL COMMUNICATIONS, vol. 53, no. 5, 2017, pages 959 - 962, XP055488758 *

Similar Documents

Publication Publication Date Title
Burke et al. Exploring chemoselective S-to-N acyl transfer reactions in synthesis and chemical biology
Aussedat et al. Total synthesis of the α-subunit of human glycoprotein hormones: toward fully synthetic homogeneous human follicle-stimulating hormone
Tang et al. Concanavalin A‐immobilized magnetic nanoparticles for selective enrichment of glycoproteins and application to glycoproteomics in hepatocelluar carcinoma cell line
Abbas et al. Allenamides as orthogonal handles for selective modification of cysteine in peptides and proteins
AU2006308854A1 (en) Cell-free synthesis of membrane bound polypeptides
CN108341781B (en) Method for analyzing related enzymes in biosynthetic pathway of plant secondary metabolites
Bouchenna et al. Total Chemical Synthesis of All SUMO-2/3 Dimer Combinations
Hayashi et al. Simultaneous and traceless ligation of peptide fragments on DNA scaffold
Deuss et al. Parallel synthesis and splicing redirection activity of cell-penetrating peptide conjugate libraries of a PNA cargo
KR20110043859A (en) Ambivalence peptides promoting production of target mirna and a method of regulating the production of target mirna
Kitoun et al. Phosphine-mediated bioconjugation of the 3′-end of RNA
Chang et al. A photo-cleavable biotin affinity tag for the facile release of a photo-crosslinked carbohydrate-binding protein
Richardson et al. Exploring neoglycoprotein assembly through native chemical ligation using neoglycopeptide thioesters prepared via N→ S acyl transfer
Dardashti et al. Selenolysine: a new tool for traceless isopeptide bond formation
Röckendorf et al. Glucuronic acid derivatives as branching units for the synthesis of glycopeptide mimetics
Popov et al. Ligations from tyrosine isopeptides via 12-to 19-membered cyclic transition states
WO2018047197A1 (en) Site-selective peptides and protein labelling and synthesis of homogenous protein conjugates
Tolbert et al. Conjugation of glycopeptide thioesters to expressed protein fragments: semisynthesis of glycosylated interleukin-2
Baumruck et al. Native chemical ligation of highly hydrophobic peptides in ionic liquid-containing media
Sarkar et al. Site-specific fluorescent labeling of the cysteine-rich toxin, DkTx, for TRPV1 ion channel imaging and membrane binding studies
CN101903047A (en) The selective radiolabeling of biomolecule
Xia et al. Enzymatic Generation of Thioaldehyde Motifs by Flavin-Dependent Cysteine Decarboxylases for Peptide Bioconjugation and Macrocyclization
CN112898172B (en) Synthesis method of amphiphilic functional group compound capable of being enzymolyzed by carboxypeptidase
Maurel et al. Toward high yield synthesis of peptide–oligonucleotide chimera through a disulfide bridge: A simplified method for oligonucleotide activation
US11149058B2 (en) Multi-functional chemical agents, and the method for protein modification

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17848304

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17848304

Country of ref document: EP

Kind code of ref document: A1